Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2025 Jul 4.
Published in final edited form as: Curr Biol. 2023 Jun 7;33(12):2504–2514.e3. doi: 10.1016/j.cub.2023.05.046

The origin and evolution of sex chromosomes, revealed by sequencing of the Silene latifolia female genome

Jingjing Yue 1,#, Marc Krasovec 2,3,#, Yusuke Kazama 4, Xingtan Zhang 1,5, Wangyang Xie 1, Shencheng Zhang 5, Xiuming Xu 6, Baolin Kan 1, Ray Ming 1,*, Dmitry A Filatov 2,*
PMCID: PMC7617815  EMSID: EMS206174  PMID: 37290443

Summary

White campion (Silene latifolia, Caryophyllaceae) was the first vascular plant where sex chromosomes were discovered. This species is a classic model for studies on plant sex chromosomes due to presence of large clearly distinguishable X and Y chromosomes that originated de novo about 11 million years ago (MYA), but lack of genomic resources for this relatively large genome (~2.8 Gb) remains a significant hurdle. Here we report S. latifolia female genome assembly integrated with sex-specific genetic maps of this species, focusing on sex chromosomes and their evolution. The analysis reveals highly heterogeneous recombination landscape with strong reduction in recombination rate in the central parts of all chromosomes. Recombination on the X chromosome in female meiosis primarily occurs at the very ends and over 85% of the X-chromosome length is located in a massive (~330 Mb) gene-poor rarely recombining pericentromeric region (Xpr). The results indicate that the non-recombining region on the Y chromosome (NRY) initially evolved in a relatively small (~15Mb) actively recombining region at the end of the q-arm, possibly as a result of inversion on the nascent X chromosome. The NRY expanded about 6 MYA via linkage between the Xpr and the sex-determining region, which may have been caused by expanding pericentromeric recombination suppression on the X chromosome. These findings shed light on the origin of sex chromosomes in S. latifolia and yield genomic resources to assist ongoing and future investigations on sex chromosome evolution.

Keywords: sex chromosome evolution, recombination suppression, genome sequence, Silene latifolia

Introduction

Sex chromosomes are a peculiar part of the genome that evolved independently many times in different groups of organisms 1,2. In vascular plants they were first discovered exactly a century ago by Kathleen Blackburn 3 who reported that Silene latifolia (then known as Lychnis alba) has “an XY pair of chromosomes in the male with a corresponding XX in the female”. Despite independent origins of sex chromosomes in different organisms, their properties are quite similar, indicating the generality of evolutionary forces and processes driving their evolution. Non-recombining sex-specific Y(or W)-chromosomes are typically genetically degenerate, while X(or Z)-chromosomes that actively recombine in the homogametic sex, contain hundreds to thousands of functional genes 4. The contrasting properties of the Y(or W)- and X(or Z)-chromosomes are striking, given they typically evolve from a pair of autosomes 5 and initially have the same gene composition 6. This represents a good illustration of how the differences in recombination landscape can drive evolution of major changes in chromosome structure, density of genes and repetitive DNA 710.

Evolution of a non-recombining sex-specific region is a key step in sex chromosome evolution, yet how and why recombination cessation on nascent sex chromosomes evolves remains poorly understood 1115. Species with very small sex-determining regions (e.g. in fish 16,17 and plants 1822) demonstrate that recombination suppression around the sex-determining gene(s) is not obligatory, raising questions why large non-recombining regions repeatedly evolved and expanded in size on sex chromosomes of many species independently 11,14,15,2327. Species where large non-recombining regions around sex-determining genes evolved recently, such as S. latifolia 28 that is the focus of this study, allow us to study what causes evolution of recombination suppression on sex chromosomes 15.

Studies in S. latifolia have contributed significantly to our understanding of sex chromosome evolution 2931. Early work in this species 32 has inspired the development of ideas about how separate sexes and sex chromosomes originate 33. More recent work brought important clues on Y chromosome degeneration in plants 28,3436, sex-biased expression 37, dosage compensation 36,3840 and specific sex linked gene evolution 4145. The S. latifolia sex-chromosomes evolved de novo during or after the transition to separate sexes about 11 million years ago 28, that is relatively recent compared to mammals 46 or birds 47, where the sex chromosomes are at least 10 times older. The de novo evolution of X and Y chromosomes in S. latifolia offers a rare opportunity to study the origination of sex chromosomes once a species evolves separate sexes 5. Relatively recent origin of sex chromosomes in this species enables the analysis of evolutionary dynamics of sex chromosomes at the early stages of their evolution.

Fragmented assemblies of the S. latifolia female 36 and male 28 genomes remain a significant limitation in the work with this species. This fragmentation does not allow one to establish the order and location of genomic contigs and the locations of the recombining and non-recombining regions on the sex chromosomes and autosomes. Relatively large size of the genome (haploid ~2.8 Gb 48,49) and abundance of repetitive sequences has, so far, precluded the construction of better genome assemblies. In this study, we took advantage of multiple long read sequencing datasets to generate a new female assembly close to chromosome scale and integrate it with genetic maps. This revealed highly uneven recombination landscape throughout S. latifolia genome, with peripheral recombination in all chromosomes. Extensive pericentromeric recombination suppression (PRS) in females is particularly pronounced on the X chromosome, where it has likely contributed to evolution of sex chromosomes by facilitating or even causing the expansion of the NRY. Furthermore, the PRS on the X chromosome significantly affects the patterns of genetic diversity and divergence in the sex-linked genes that have previously been attributed to Y chromosome degeneration.

Results

Genome sequencing, assembly and annotation

We used PacBio HiFi sequencing (104.9 Gb, 40× sequence coverage; Table S1) to create a draft assembly of the S. latifolia female genome. The initial assembly was 2.64 Gb long (contig N50=23.1 Mb; Table S2) with 98.0% genome completeness as assessed by BUSCO (Table S3). This assembly was then integrated with Hi-C data using ALLHiC pipeline 50. This yielded 12 pseudo-chromosomes with a total size of 2.45 Gb and an anchor rate of 93.0% (Figure 1A; Table S2). The number of assembled pseudo-chromosomes corresponds to the 12 chromosomes in the S. latifolia genome. The accuracy of Hi-C based pseudo-chromosome construction was evaluated by chromatin contact matrix, which showed a well-organized interaction contact pattern along the diagonals within each pseudo-chromosome (Figure 1B, Table S4). The quality of the female S. latifolia assembly was further verified by RNA-seq and genomic short Illumina reads from 28,36. The results showed that 100% of transcripts were aligned with 99.9% of single base accuracy for the genome (Table S5) and 99.1% of the genomic Illumina reads mapped to the assembly, covering 94.0% of the genome assembly (Table S6). All these results indicated the high quality of the assembled S. latifolia female genome.

Figure 1. The genome assembly of S. latifolia female genome.

Figure 1

A) Rings indicate twelve chromosomes (I), gene density (II), TE density (III), GC density (IV), copia LTR retrotransposons density (V), gypsy LTR retrotransposons density (VI). The dotted lines show the boundaries between PAR, Xpr and qXdr on the X chromosome. B) Hi-C chromatin interactions at 1 Mb resolution for twelve chromosomes. See also Tables S1-S9.

The assembled genome was integrated with the previously published RNA-seq-based sex-specific maps 51, which revealed a good correspondence between the genome sequence and the maps (Figures 2A and S1). In particular, 4745 and 4598 markers of the female- and male-specific maps 51, respectively, were located in the genome, of which 812 and 809 markers were X-linked. This analysis revealed that X-linked markers mapped to the scaffold “Chr12”, indicating its correspondence to the X chromosome. Furthermore, the sex-linked genes SlX1, SlX3, SlX4, SlCyp and SlssX previously located on the X chromosome with fluorescent in situ hybridisation 52, are all located on the scaffold “Chr12” (Figure 2A), confirming that this scaffold represents the sequence of the X chromosome.

Figure 2. Genetic versus physical position for the X chromosome (A) and synonymous divergence between X- and Y-linked gametologs in the Xpr and qXdr (B).

Figure 2

The names of genes on panel A show the locations of sex-linked genes genetically mapped in the previous studies 35,43,45. The genetic map of the PAR in panel A is longer in the male map due to obligatory sex chromosome paring in the PAR in male meiosis. See also Figure S1.

We annotated 37,796 protein-coding genes in the S. latifolia female genome with 96.9% annotation completeness assessed by BUSCO (Tables 1 and S7). The average gene density on the X chromosome was significantly lower than that on the autosomes, except Chr4 and Chr6 (Table 1). We annotated 93 microRNAs (miRNAs) in S. latifolia female genome (Table S2). Annotation of repetitive sequences revealed that they represent 82.8% (2.18 Gb) of the genome assembly (Table S8). LTR retrotransposons (LTRs) were the most abundant transposable elements (TEs) representing 71.72% of the S. latifolia female genome (Figure1A, Table S8). Within the LTRs, Ty3/gypsy elements comprise the most abundant group, composing 35.9% of the genome, followed by Ty1/copia (12.8%) (Table S8). TEs were significantly more abundant on the X chromosome compared to autosomes, except Chr4 and Chr6 (Table 1), comprising 87.1% of the X chromosome, with 13.3% represented by Ty1/copia elements, and 37.3% by Ty3/gypsy elements (Figure 1A). The TEs comprise significantly higher proportion of the X chromosome compared to autosomes, except Chr4 and Chr6 (Table 1).

Table 1. The chromosome lengths and the densities of genes and TEs in S.latifolia female genome.

Chromosome Contigs Length (bp) Genes/Mb P-value* TE%** P-value*
ChrX 110 370,597,487 12.1 87.1
Chr1 188 193,566,110 18.3 7.30E-10 82.8 2.20E-39
Chr2 38 196,073,315 18.4 4.70E-06 83.1 8.90E-26
Chr3 34 165,661,342 18.9 4.60E-11 82.9 4.10E-34
Chr4 52 223,726,948 11.9 0.087 87.3 0.0029
Chr5 51 165,362,148 18.4 8.00E-12 82.7 7.20E-47
Chr6 27 179,322,760 11.2 0.037 87.2 3.00E-05
Chr7 57 193,230,414 14.1 0.16 85.1 0.00017
Chr8 119 172,294,220 15.8 5.20E-06 83.8 5.20E-31
Chr9 55 203,056,505 17.3 0.074 83.5 1.10E-09
Chr10 34 192,831,310 17.6 0.034 83.4 9.10E-14
Chr11 48 196,798,467 14.6 0.02 85.1 3.70E-11
*

Significance of difference from the X chromosome (ChrX) was tested using the two-sided Mann-Whitney-Wilcoxon test with P adjust Homl correction. 1 Mb window and 0.2 Mb slide steps were used to calculate the TE abundance and gene density.

**

TE% is the % of DNA sequence represented by transposable elements

Recombination suppression on the X chromosome

The comparison of the physical and genetic distances along the genomic scaffolds revealed reduced recombination in the central part of all chromosomes, indicating considerable pericentromeric recombination suppression in both sexes (Figure S1). In particular, the female meiosis recombination in the distal regions of the X chromosome is ~2 cM/Mb on average (Figure 2A), while the central region, comprising at least 85% of the X chromosome length, is rarely recombining (~0 cM/Mb). This is consistent with the view that species with large chromosomes tend to have highly peripheral recombination, with large pericentromeric regions lacking recombination 53,54.

One of the distal actively recombining regions in the X chromosome scaffold contains the genes previously identified 36,51 to be located in the pseudoautosomal region (PAR). As S. latifolia PAR is located in the p-arm of the X chromosome 5557, we will refer to this region as the p-arm X distal region (pXdr or PAR; the left side on Figure 2), while the actively recombining region on the other side of the X chromosome will be referred to as q-arm X distal region (qXdr; the right side on Figure 2). The rarely recombining region between the pXdr and qXdr will be referred to as the X pericentromeric region (Xpr).

The PAR boundary (as defined in 58) is located close to the boundary between pXdr and Xpr. According to the previously published sex-specific genetic map 51, female recombination rate at the X-linked genes adjacent to the PAR boundary (the ‘right’ and ‘mid’ regions in 58) is zero, suggesting that they belong to the rarely recombining Xpr region. On the other hand, the pseudoautosomal genes adjacent to the PAR boundary (the ‘left’ region in 58) appear to recombine more actively, suggesting that they are located in the pXdr. Thus, it is likely that the PAR boundary coincides with the pXdr/Xpr boundary, though genetic mapping in several independent crosses is needed to confirm this result.

The genes previously used to define ‘evolutionary strata’ on S. latifolia sex chromosomes 36,42,43,51,5860 – the regions where cessation of X:Y recombination occurred first (stratum 1: SlX4, SlX7, E766, E713, E758, E750, E378 35,42,43,60) and then more recently (stratum2: SlssX, Slap3X, E799, E817, E819 35,43,61) are located in the qXdr and the Xpr, respectively (Figure 2A). This indicates that initial cessation of recombination between the proto-X and proto-Y chromosomes around 11 MYA28 occurred in an actively recombining qXdr that is relatively small (~15 Mb long), while expansion of the NRY about 6 MYA involved inclusion of the massive (~330Mb) Xpr into the non-recombining male-specific region. Higher X:Y synonymous divergence in qXdr compared to Xpr (Figure 2B and Table 2), reported in the next section, is consistent with this conclusion.

Table 2. Average substitution rates (± standard errors) in the Xpr, qXdr and PAR genes.

Non-dioecious S. uniflora was used as an outgroup (OG).

PAR Xpr qXdr
#genes 178 760 198
#codons 211,704 839,597 254,442
dS X:OG 0.124±0.0068 0.120±0.0042 0.123±0.0070
dS X:Y 0.063±0.0018 0.091±0.0042
dN/dS X 0.315±0.0357 0.267±0.0461
dN/dS Y 0.591 ±0.1237*** 0.408±0.0487***
dN/dS X:OG 0.221±0.0170 0.285±0.0123 0.233±0.0186
***

Paired t-tests, P<0.001, revealing significantly higher dN/dS for Y-linked compared to X-linked genes in both Xpr and qXdr.

The actively recombining regions at the ends of the X chromosome are gene-rich, while the Xpr is gene poor (Figure 1A, Table S9), with the average gene density of 17.8 and 6.6 genes per megabase, respectively. The distribution of TEs showed the opposite pattern, with TEs significantly more abundant in Xpr compared to the ends of the X chromosome (Figure 1A, Table S9). Similar TE depletion and gene enrichment is evident at the ends of most other chromosomes (Figure 1A).

Substitution rates in X- and Y-linked genes

To reconstruct Y-linked gemetologs for the X-linked genes we employed previously described approach, which uses male-specific Y-linked SNPs to identify the sequence reads corresponding to the Y chromosome and assemble Y-consensus from these reads 28,34,36. The accuracy of the Y-reconstruction was confirmed in comparisons with the sequences of Y-linked genes previously obtained by manual Sanger sequencing (e.g. SlY1, SlY4, DD44Y, SlssY 6164). The resulting Y-linked genes were aligned with X-linked gametologs as well as with their homologs (identified by blast) from a non-dioecious outgroup Silene uniflora 65.

Average synonymous divergence between the X- and Y-linked gametologs (dS X:Y) was 6.3% (±0.18%) and 9.1% (±0.42%) in the Xpr and the qXdr, respectively (Table 2 and Figure 2B). Consistent with relaxation of selection on the Y chromosome, the non-synonymous to synonymous substitution rate ratios (dN/dS) for the Y-linked genes were significantly higher compared to their X-linked gametologs in both Xpr and qXdr regions (0.591 vs 0.315 and 0.408 vs 0.267, respectively; Table 2).

The analysis of divergence between the genes in S. latifolia female genome and their homologs in outgroup S. uniflora (Table 2) revealed that dN/dS ratios of the X-linked genes in the qXdr are not significantly different from those in the pseudoautosomal genes (0.233±0.0186 and 0.221±0.0170, respectively; t-test, NS), while the genes in the Xpr have significantly higher dN/dS ratios (0.285±0.0123; t-test, P<0.001). This is consistent with reduced efficacy of selection in the rarely recombining Xpr on the X chromosome compared to the recombining PAR and qXdr that actively recombines in females.

Gene expression

Y-linked copies were significantly less actively expressed compared to their X-linked gametologs (paired t-test, P<0.0.001), reflecting degeneration of Y-linked genes at the level of gene expression. Overall (X+Y or X+X) expression is female-biased, while this is not the case in the PAR or autosomal genes (Figure 3). The presence of the female bias indicates that the dosage compensation system (if any is present in S. latifolia 34,36,38,39,66) is only partial and it does not adequately compensate for reduced expression of Y-linked genes. However, it is possible that the genes with reduced X+Y (compared to X+X) are not dosage sensitive and thus do not need to be compensated. Female bias is significantly stronger in the qXdr compared to the Xpr (t-test, P<0.001; Figure 3), indicating that the extent of Y-degeneration in gene expression is stronger for the Y-linked gametologs of qXdr than Xpr genes, possibly because the qXdr is the oldest part of the sex chromosome. Alternatively, weaker female bias in the Xpr compared to the qXdr is due to differences in female recombination rates in these regions. As Xpr is rarely recombining, the difference in recombination rate (and hence, efficacy of purifying selection keeping genes functional) between the X- and Y-linked genes is smaller compared to that for genes in qXdr that is actively recombining in females.

Figure 3. Female-biased expression of the genes in the qXdr and Xpr regions on the sex chromosomes.

Figure 3

All comparisons between the groups, except the Aut:PAR comparison, are significant (t-tests, P<0.001).

Discussion

Lack of reference genome sequence for S. latifolia significantly limited the previous work devoted to evolution of sex chromosomes in this interesting system. Here we reported the assembly of the female genome and its integration with genetic maps 36,51, which shed light on structure and evolution of sex chromosomes in this species. It revealed that recombination in the S. latifolia genome primarily occurs in relatively small regions at the ends of the chromosomes (Figure S1), which is particularly pronounced on the X chromosome (Figure 2A) and may have played a role in sex chromosome evolution, as discussed below. Suppression of recombination in the region around the sex-determining gene(s) is a key step in NRY formation and sex chromosome evolution. Our results help to understand how such recombination suppression evolves, stressing the importance of the pre-existing recombination landscape on nascent sex chromosomes. They also illustrate that the real sequence of events leading to sex chromosome evolution may deviate significantly from the classic ‘evolutionary strata’ scenario 67 and the models of NRY expansion discussed in the literature 14,24,26,27,68.

Evolution of recombination suppression on sex chromosomes

The size of the qXdr is only about 15 Mb, indicating that recombination suppression between the sex chromosomes in S. latifolia males has initially evolved in a relatively small region, not dissimilar to other dioecious plants, such as papaya 18, persimmon 19, kiwifruit 20, asparagus 21, ginkgo 22 etc. However, unlike papaya 69, kiwifruit 70 and Rumex 71 where the NRY evolved in a rarely recombining pericentromeric region, initial recombination suppression in S. latifolia has evolved in the actively recombining region (qXdr), where the GSFX – the X-linked gametolog of the putative sex-determining gene GSFY 72, is located (Figure 2A). This is consistent with the comparison of genetic maps in S. latifolia and Silene vulgaris that revealed the presence of active recombination in this region in both species (green shading in Figure 4). It is interesting that the order of genes in this region is inverted between the two species (Figure 4C-D), suggesting that the initial recombination suppression on the proto-Y chromosome in this region was caused by an inversion on the proto-X chromosome. However, without the data from other Silene species it is difficult to test whether this inversion arose in S. latifolia or S. vulgaris lineages.

Figure 4. The comparison of the physical (A) and genetic (B-D) maps of the X chromosome in S. latifolia (A-C) and its homologous chromosome in non-dioecious S. vulgaris (D).

Figure 4

Panel B shows the genetic positions from 36 for a subset of X-linked genes from this study that correspond to genes previously mapped in 35,43. Panels C and D show S. latifolia and S. vulgaris genetic maps from 35. The locations of the same gene in different maps are linked with dotted lines. Green and grey shading show the genes and regions corresponding to the qXdr and Xpr, respectively. Note that qXdr is long (i.e. actively recombining) in the genetic maps of both S. latifolia and S. vulgaris, while Xpr is long only in S. vulgaris and the genetic distance in this physically large region is nearly zero in S. latifolia, indicating evolution of suppressed recombination on the X chromosome in this region.

Partial genetic maps are available for Silene otites and Silene pseudotites 73 that independently evolved separate sexes and homomorphic ZW and XY sex chromosomes corresponding to S. latifolia linkage groups LG3 and LG6, respectively. It is interesting that in both cases the sex-determining region corresponds to the central rarely recombining parts of homologous S. latifolia autosomes, which suggests that pre-existing pericentromeric recombination suppression facilitated evolution of sex chromosomes in S. otites and S. pseudotites, as it was also reported for Rumex 71. Unfortunately, the partial maps available for these Silene species focus on LG1, LG3 and LG6, which does not allow us to test whether the inversion in the S. latifolia stratum 1 of the X chromosome (Figure 4 C,D) occurred in S. latifolia lineage.

If confirmed that recombination between S. latifolia proto-X and Y chromosomes is suppressed by the X-linked inversion, this would contrast with the common assumption in the models that such inversions are Y- rather than X-linked 24,26,68. Many of the models developed with Y-linked inversions in mind do not work for X-linked inversions. For example, the deleterious mutations sheltering model 24 requires the inversion to be linked to a permanently heterozygous locus, such as the Y chromosome. However, some of the existing models may still work for X-linked inversions. For example, the crux of the recently proposed early emergence of dosage compensation model 26 is that evolving expression modifiers prevent reversion to X:Y recombination in the region where X:Y recombination stopped due to fixation of an inversion. While that model was developed with Y-linked inversions in mind, it may still work if X:Y recombination is suppressed by an X-linked inversion.

The comparison of the genetic map lengths in the Xpr (Figure 4B,C) with the homologous region in S. vulgaris (Figure 4D) reveals that recombination suppression is specific to the S. latifolia Xpr and thus may have evolved during the evolution of sex chromosomes in this species. It is interesting to speculate that the expansion of S. latifolia NRY to include Xpr may have been driven or facilitated by the evolution of pericentromeric recombination suppression in the central region of the X chromosome. For example, the NRY expansion could have occurred when the pericentromeric recombination suppression on the X chromosome became so extensive that it reached the sex-determining region. Comparative analysis of genetic and physical distances in a few other Silene species will be necessary to test this conjecture.

Tip-biased distribution of recombination on S. latifolia chromosomes

Recombination suppression in central regions of the chromosomes is not universal in plants, with a lot of variation across species 54, but species with large chromosomes (>100 Mb) tend to have peripheral distribution of recombination 53,54. The concentration of crossovers at the ends of the chromosomes may be explained by mechanistic and/or adaptation-related causes. For example, the mechanistic ‘telomere-initiation’ model postulates that crossovers tend to occur at the ends of chromosomes because the recombination machinery starts at the telomeres and proceeds inward 53,74. Alternatively, distal distribution of recombination may be selected for to ensure sufficient recombination in gene-dense regions at the ends of large chromosomes 54. Regardless of whether the cause is mechanistic or adaptation-driven, the peripheral distribution of recombination may be fuelled by the tendency of ‘junk DNA’, such as transposable elements, to accumulate in gene-poor rarely recombining central chromosome regions 7 that hence expand over time, limiting the actively recombining gene-rich regions to the ends of large chromosomes. This ‘expanding junkyard’ model helps to explain why large chromosomes tend to have peripheral recombination, while on the smaller chromosomes this tendency is weaker, if present at all 54.

Given the association between the extent of peripheral recombination and the chromosome size 54, it is interesting to speculate that the expansion of S. latifolia genome may have contributed to evolution of recombination suppression between the X and Y chromosomes. The size of the S. latifolia genome (~2.8 Gb) is nearly three-fold larger compared to non-dioecious outgroups such as S. vulgaris (1.13 Gb 48) or S. conica (0.9 Gb 49). Many species in genus Dianthus that is closely related to Silene, have even smaller genomes (e.g. Dianthus deltoides genome ~0.45 Gb 75), making it likely that the large size of S. latifolia genome is a derived state. As the chromosome number in diploid Silene is mostly conserved (N=12), tripling of the genome size in the S. latifolia lineage (possibly due to accumulation of TE 76) must have tripled the size of the chromosomes, which could have exaggerated peripheral recombination, particularly so on the X chromosome that is the second largest (after the Y) chromosome in S. latifolia genome.

Strongly peripheral recombination may have facilitated evolution of recombination suppression between the X and Y chromosome. This conjecture is in line with the growing body of evidence that genome-wide variation in recombination rate, such as reduced recombination in the heterogametic sex (heterochiasmy) 7779, or pericentromeric recombination suppression 71 can play a significant role in sex chromosome evolution by facilitating recombination suppression between the X and Y chromosomes. While heterochiasmy is unlikely to be important in S. latifolia 51, the expansion of pericentromeric recombination suppression may have contributed significantly to evolution of sex chromosomes in this species.

The comparisons of genetic distances for autosomal genes in S. latifolia and its relatives would allow us to test whether the expansion of pericentromeric recombination suppression is X-specific, or it occurred on all chromosomes. Unfortunately, with the previous efforts focusing on sex chromosomes, little data is available for autosomes. The genetic distances between 14 autosomal genes in LG1 of S. latifolia and S. otites are similar in both species (Figure 1d in 73), indicating little change in recombination landscape on that autosome over evolutionary time separating these species. Similarly, little difference in recombination distances between three autosomal genes frpericentromeric recombination suppressionom LG9 (E534, E526 and E157) was reported for S. latifolia and S. vulgaris 43. The similarity of genetic distances for analysed autosomal genes in S. latifolia, S. otites and S. vulgaris suggest that suppression of recombination in the central region of S. latifolia X chromosome is probably specific to that chromosome. However, genetic mapping (ideally complemented with physical location) for more genes in several Silene species is necessary to reveal the evolution of recombination suppression.

Extensive pericentromeric recombination suppression on the X chromosome

The rarely recombining Xpr is a large gene poor region spanning ~330Mb in the central part of the S. latifolia X chromosome (Figure 2). The lack of recombination in this region explains the clustering of genes near the PAR boundary in the genetic maps published previously 36,43. Furthermore, this explains the apparent contradiction between the cytogenetic evidence that placed SlX1, SlX3, SlCyp and DD44X at the very end of the q-arm on the X chromosome 52, while the genetic mapping was consistently placing these genes closer to the middle of the X chromosome map 36,43. Our results reveal that these genes are located in the distal region on the q-arm of the X chromosome, which is consistent with the cytogenetic evidence 52, while their placement closer to the middle of the genetic maps was caused by the collapse of the physically massive, but rarely recombining Xpr in the genetic maps.

Genes in the qXdr and the Xpr had evolved in very different recombination landscapes before the cessation of recombination between the X and Y chromosomes in these regions. Indeed, the average dN/dS ratio for the X-linked genes in Xpr is almost two-fold higher compared to the qXdr (Table 2), indicating less effective purifying selection in the former compared to the latter. Partial relaxation of purifying selection in S. latifolia X-linked genes was previously reported 28, but it was interpreted in the context of lower effective population size of the X-linked compared to autosomal genes rather than reduced recombination on much of the S. latifolia X chromosome.

Given the pre-existing recombination suppression in the Xpr, the reduction in the efficacy of selection caused by complete cessation of recombination in the NRY may have been much weaker for the genes in the Xpr compared to the qXdr. This implies that cessation of recombination due to inclusion in the NRY may have had little impact on the already rarely recombining genes in the Xpr. However, given significantly higher dN/dS for Y-linked compared to X-linked genes in the Xpr (0.591±0.1237 and 0.315±0.0357, respectively; Table 2), inclusion of this region in the NRY resulted in further reduction in the efficacy of selection in Y-linked genes compared to their X-linked gametologs in the Xpr.

Furthermore, the female-bias in gene expression is significantly weaker for the genes in the Xpr compared to the qXdr (Figure 3). While these findings could be interpreted as a result of more recent inclusion of the Xpr in the NRY compared to the qXdr, they are also consistent with the idea that genetic degeneration, caused by complete recombination suppression, affected the qXdr to a greater degree compared to the Xpr that was already rarely recombining. Weak Y-degeneration in the Xpr may, at least partly, account for the previously reported slower rate of Y-degeneration of the S. latifolia Y chromosome compared to other studied species 28,34,59. However, the slow Y-degeneration was reported for both evolutionary strata in S. latifolia 28, which cannot be explained by pre-existing recombination suppression as genes in the qXdr undergo frequent recombination on the X chromosome.

Conclusions

The analyses presented above allow us to reconstruct the likely scenario of sex chromosome evolution in Silene and assess its implications for our understanding of evolutionary processes on nascent sex chromosomes. The initial recombination suppression, resulting in formation of NRY, occurred around 11 million years ago 28 at the actively recombining end of the proto-sex-chromosomes, corresponding to the qXdr of the S. latifolia X chromosome. This event may have been caused by an inversion on the proto-X chromosome, given the order of genes in the qXdr is inverted compared to homologous region in S. vulgaris (Figure 4C,D). A few million years later this was followed by NRY expansion via inclusion of the massive Xpr into the male-specific region, possibly driven by evolution of strongly peripheral recombination on the X chromosome. The Xpr region continues to recombine in females, albeit at a very low rate. Recombination suppression in the Xpr may have pre-dated or evolved in concert with NRY expansion. Given the homologous region in S. vulgaris is recombining normally (Figure 4D), the latter possibility appears plausible. Either way, reduced recombination in Xpr may have facilitated NRY expansion.

This scenario of sex chromosome evolution in S. latifolia deviates from the standard ‘evolutionary strata’ model of step-wise NRY expansion 67 when an actively recombining part of the pseudoautosomal region becomes Y-linked and stops recombining, while its homologous region continues to actively recombine on the X chromosome. Consistent with this model, the NRY in S. latifolia evolved in an actively recombining region (qXdr), but contrary to that model, the NRY expansion resulted in sex-linkage of a region that may have already been rarely recombining (Xpr), meaning that both X- and Y-linked genes in this region evolve under reduced efficacy of selection. Failing to take into account such pre-existing recombination landscape may lead to misinterpretation in evolutionary genetic analyses of sequence polymorphism and divergence on young sex chromosomes in general.

The scenario described above is compatible with the classic ‘two genes’ model of dioecy and sex chromosome evolution 33, with inversion in qXdr preventing recombination between two sex-determining genes, SPF and GSF. The X-linked homolog of the already isolated sex-determining gene (GSF 72), along with SlWUS1 that was likely involved in evolution of gynoecium suppression 72 are indeed located in qXdr (Figure 2A), though the SPF gene(s) remain to be identified. Our results indicate that X chromosome may play an active role in NRY evolution, with X- (rather than Y-) linked inversions contributing to evolution of recombination suppression between the X and Y chromosomes, which remains to be considered by models of sex chromosome evolution.

Star Methods

Resource Availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Dmitry A. Filatov (dmitry.filatov@biology.ox.ac.uk)

Materials availability

This study did not generate new unique reagents.

Data and code availability

The S. latifolia female genome assembly and annotation have been deposited to Genome Warehouse (GWH) database in BIG data Center (https://ngdc.cncb.ac.cn/gwh/) under accession number GWHCBIJ00000000 and BioProject accession PRJCA014197. The PacBio and Hi-C data have been uploaded to NCBI Sequence Read Archive (SRA) database as Bioproject PRJNA952727.

Experimental Model and Subject Details

Plant material

S. latifolia seedlings for the highly inbred K-line 80 were grown in the greenhouse of the Centre for Genomics and Biotechnology at Fujian Agriculture and Forestry University (FAFU) at 22°C under long-day photoperiod (16h of artificial light).

Method Details

Genome sequencing and assembly

PacBio library construction and sequencing of the female genome

DNA for sequencing was extracted from young leaves of female S. latifolia plants. Genomic DNA was sheared and size-selection was carried out using BluePippin system. SMRTbell™ libraries were constructed according to the protocol from PacBio. Subsequently, 104.9 Gb of sequence data were generated with the PacBio Sequel IIe System. PacBio high-quality HiFi reads was generated from subreads by CCS software (v6.4, https://github.com/PacificBiosciences/ccs). HiFi reads were assembled using hifiasm software 81 with default parameters, which yielded a 2.64 Gb long contig-level genome assembly (Table S2).

Hi-C library construction and sequencing

For Hi-C scaffolding, young leaves of S. latifolia female plants from the inbred K-line 80 were used to construct the Hi-C libraries by ANOROAD company according to previously published methods 82. The Illumina HiSeq X Ten platform was used to sequence the chimeric fragments representing the original cross-linked fragments after constructing the paired-end sequencing libraries. Hi-C reads were uniquely mapped to the contig assemblies and reads within 500 bp regions of HindIII restriction sites were retained for further analysis. A total of 134.8 Gb raw sequencing Hi-C reads were used to assemble the chromosome-level genome using ALLHiC (v0.9.8, https://github.com/tangerzhang/ALLHiC). Hi-C sequencing data were assessed using HiC-Pro 83 and the results showed a high proportion of validated reads (59.2%) (Table S4).

Chromosome assembly by integration with Hi-C scaffolding and genetic map

The chromosome level genome assembly was integrated with the previously published RNA-seq-based high-density genetic maps 36,51, but mostly the sex-specific maps from the recent study 51 were used in the analyses as they contained nearly three times more genes than the older sex-average map 36. We aligned the sequences of the markers to the S. latifolia female contig genome using BLASTN 84, retaining BLAST hits with >97% identity longer than 100 bp. As the maps were constructed with transcriptome sequencing 36,51, the markers represent actual expressed genes rather than often repetitive non-coding regions, which facilitated finding of these markers in the genome sequence. Due to presence of introns in the genomic sequence of the genetically mapped genes, each gene usually had several adjacent blast hits, corresponding to different exons. The lowest position of the blast hit for each gene was regarded as the genomic position of the particular marker. The correspondence of the positions of markers in the genetic map and genome sequence was checked manually. The contigs were divided into different groups according to the markers of RNA-seq based map, and the group information was combined with the ALLHiC pipeline 50. Finally, the contigs were linked into 12 pseudo-chromosomes in the S. latifolia female genome.

The integration of the chromosome level assembly with the genetic map revealed minor inconsistencies, which were corrected as follows. The Hi-C reads were aligned to the contigs using the Chromap software (v0.2.4) 85. Subsequently the run-assembly-visualizer.sh script from 3D-DNA pipeline (v180922) 86 was used to obtain input files for Juicebox Assembly Tools (v2.20.00) 87. Then the Juicebox Assembly Tools was used to get the corrected chromosomes.

Assessment of accuracy and completeness of the chromosome assembly

The chromatin contact matrix was used to assess the accuracy of chromosome assemblies based on Hi-C by using HiC Explorer (v3.7.2) 88. The accuracy and completeness of the assembly was assessed by BUSCO 89, and mapping of RNA-seq and genomic Illumina paired end reads. The single base accuracy and the genome coverage rate by the short reads indicated the accuracy of our S. latifolia genome assembly (Table S5 and S6).

X-chromosome identification

The X chromosome was identified by the presence of X-linked genes from the genetic maps 36,51 as well as the X-linked genes identified previously, such as SlX1 62, DD44X 64, SlX3 60, SlX4 63, SlssX 61, SlCypX 42 and the ‘E-genes’ from 35,43 (E777, E757, E780, E799, E330 etc).

Genome annotation with RNA-seq data

RNA-seq data from the previous study 36 were used for S. latifolia female genome annotation. Protein-coding genes were annotated based on a previously published approach 90. Briefly, RNA-seq transcripts together with homologous proteins were inputted into GETA pipeline (v.1.0, https://github.com/chenlianfu/geta). All parameters were set to the original defaults and the false-positive gene models were filtered by set Pfam database path.

Genome filtering and correction

To improve the gene annotation of S. latifolia female genome, firstly, the perl script GetaFilter.pl (https://github.com/LengFeng00/biotree.club.git) was used to filter genes that did not meet the standard. The genes were required to meet at least one of the following conditions: FPKM>3, blastp hits with >30% identity and >100 alignment length, containing at least one Pfam domain and Augustus_transcript Support_percentage > 50. Secondly, the IGV-GSAman software (https://gitee.com/CJchen/IGV-sRNA) was used to manually correct and filter out the genes that included TE sequences in coding regions. Finally, the completeness of genome annotation was evaluated by BUSCO.

miRNA annotation

For miRNA annotation, the plant miRNAs downloaded from the publicly available database miRBase (most recent access 5 July 2018, http://www.mirbase.org/) were aligned to S. latifolia female genome using bowtie 91 as described previously 90. The mapping results were filtered with PERL script filter_alignments.pl from the miRDP1.3 package 92. The miRDeep-P program 92, a plant-seecific scoring system explicitly anticipating miRNAs for plants, was used to trim, filter and identification of new miRNAs.

Transposable element (TE) annotation

The EDTA v1.9.6 de novo annotation tool 93 was used to annotate LTR, terminal inverted repeat, and Helitron elements. For EDTA, the following parameters were used in addition to defaults: --step all, --species others, --sensitive 0, --anno 1, and –threads 4. Sequences with multiple paralogs were mapped back to the genome and manually extended to determine the full-length boundary of each TE. A total of 15,560 full-length, representative Copia and Gypsy copies were successfully annotated. The significance of difference in TE abundance between autosomes and the X chromosome was tested with R package ggpubr (version 0.4.0). Significance was tested using the two-sided Mann-Whitney-Wilcoxon test with multiple comparison. P.adjust homl method was used to adjust the P values.

Reconstruction of Y-linked genes

To reconstruct Y-linked gametologs for the X-linked genes we followed segregation-based approach described previously 28,34,36. In particular, using previously published sequence data from genetic crosses, we called single nucleotide polymorphisms (SNPs) for parents and progeny, including 10 F1 progeny with genome sequence data available 28 and 50 F2 progeny with transcriptome data available 36. Trimmed RNAseq reads were mapped against the reference genome with BWA mem v0.7.17 94 and sorted with Samtools v1.7 95. Then, SNP calling was done with Samtools mpileup (options: -d 1000 -q 20 -Q 20) and sites filtered with bcftools filter 1.7.

The analysis of SNP segregation in these genetic crosses allowed us to identify Y-linked SNPs that are always inherited from father to sons and never to daughters. The sequence reads containing the Y-SNPs along with their paired reads were separated and assembled into contigs. For this Y-reconstruction procedure we focused on coding regions, while (often repetitive) non-coding regions were excluded from analysis. As described previously 28,34,36, this approach allows accurate reconstruction of Y-linked homologs of X-linked genes in S. latifolia. The accuracy of the Y-reconstruction was confirmed using the sequences of previously published Y-linked genes (SlY1 62, DD44Y 64, SlY3 60, SlY4 63, SlssY 61 and SlY7 42).

Analysis of gene expression

For gene expression analysis we used previously published RNA-seq data from 34,36,96. The analysis of gene expression was conducted with RSEM package 97 with default options.

Phylogenetic analyses

To analyse substitution rates in sex-linked genes we created three-sequence alignments including X- and Y-linked gametologs as well as a sequence of homologous gene from an outgroup species. As an outgroup we used non-dioecious Silene uniflora for which a fragmented genome assembly is already available 65. We used CDS sequences of X-linked genes to blast-search the S. uniflora genomes. The best-matching homologs were aligned with the X- and Y-linked genes using muscle 98. These three-sequence alignments were used for analysis of synonymous and non-synonymous substitution rates in X- and Y-linked genes with codeml from PAML package 99. Substitution rates were estimated using branch model 100 allowing for separate substitution rate for each branch of the phylogeny.

Quantification and Statistical Analysis

The Wilcoxon tests were performed using R ggpubr package (version 0.4.0) and the P values were adjusted with p.adjust homl correction in R. Paired t-tests were done in Excel.

Key Resource Table.

REAGENT or
RESOURCE
SOURCE IDENTIFIER
Deposited Data
Silene latifolia female This study GWH: PRJCA014197
genome assembly Silene latifolia PacBio This study SRA: PRJNA952727
sequences
Silene latifolia Hi-C data
This study SRA: PRJNA952727
Experimental Models: Organisms/Strains
Silene latifolia plants Kazama et al. 80 Inbred K-line
Software and Algorithms
CCS v6.4 NA https://github.com/PacificBiosciences/ccs
Hifiasm Cheng et al.81 https://github.com/chhylp123/hifiasm
ALLHiC V0.9.13 Zhang et al.50 https://github.com/tangerzhang/ALLHiC/wiki
HiC-Pro Servant et al.83 https://github.com/nservant/HiC-Pro
BLASTN Altschul et al.84 https://blast.ncbi.nlm.nih.gov/Blast.cgi
Chromap V0.2.4 Zhang et al.85 https://github.com/haowenz/chromap
3D-DNA V180922 Dudchenko et al.86 https://github.com/aidenlab/3d-dna
Juicebox Assembly Tools Durand et al.87 https://github.com/aidenlab/Juicebox
HiC Explorer v3.7.2 Wolff et al.88 https://hicexplorer.readthedocs.io/en/latest/
BUSCO Simao et al.89 https://busco.ezlab.org/
GETA v1.0 NA https://github.com/chenlianfu/geta
GetaFilter.pl NA https://github.com/LengFeng00/biotree.club.git
IGV-GSAman NA https://gitee.com/CJchen/IGV-sRNA
Bowtie Langmead et al.91 https://bowtie-bio.sourceforge.net/manual.shtml
miRDeep-P Yang et al.92 https://github.com/rajewsky-lab/mirdeep2
EDTA v1.9.6 Ou et al.93 https://github.com/topics/edta
BWA-MEM Li and Durbin.94 https://github.com/lh3/bwa
Samtools v1.7 Li et al.95 https://github.com/samtools/samtools
RSEM Li et al.97 https://github.com/deweylab/RSEM
PAML Yang et al.99 http://abacus.gene.ucl.ac.uk/software/paml.html
branch model Yang et al.100 https://github.com/Bumblebee-Project/Bumblebee/wiki/Branching-Model

Supplementary Material

Supplementary Materials

Acknowledgments

This work was supported by BBSRC grant BB/P009808/1 to DAF, startup fund from Fujian Agriculture and Forestry University grant to RM and JSPS KAKENHI Grants JP21KK0128 and JP22H05071 to YK.

Footnotes

Author contributions

DAF and RM conceived the study, DAF, MK, YK and JY generated the sequence data, JY, XZ, SZ, WX, XX and BK conducted genome assembly and annotation, DAF integrated the assembly with the genetic map. MK, JY and DAF analysed the data, DAF and JY wrote the paper. All authors contributed to editing the manuscript.

Declaration of interest

The authors declare no competing interests.

References

  • 1.Bull JJ. Evolution of sex determining mechanisms. Benjamin/Cummings Pub. Co., Advanced Book Program; 1983. [Google Scholar]
  • 2.Westergaard M. The mechanism of sex determination in dioecious flowering plants. Adv Genet. 1958;9:217–281. doi: 10.1016/s0065-2660(08)60163-7. [DOI] [PubMed] [Google Scholar]
  • 3.Blackburn KB. Sex chromosomes in plants. Nature. 1923;112:687–688. doi: 10.1038/112687c0. [DOI] [Google Scholar]
  • 4.Charlesworth D. In: Evolutionary Genomics and Proteomics. Pagel M, Pomiankowski A, editors. Sinauer Associates; 2008. Sex chromosome origins and evolution; pp. 207–240. [Google Scholar]
  • 5.Filatov DA. Evolutionary history of Silene latifolia sex chromosomes revealed by genetic mapping of four genes. Genetics. 2005;170:975–979. doi: 10.1534/genetics.104.037069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wright AE, Dean R, Zimmer F, Mank JE. How to make a sex chromosome. Nat Commun. 2016;7:12087. doi: 10.1038/ncomms12087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kent TV, Uzunovic J, Wright SI. Coevolution between transposable elements and recombination. Philos Trans R Soc Lond B Biol Sci. 2017;372 doi: 10.1098/rstb.2016.0458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Gaut BS, Wright SI, Rizzon C, Dvorak J, Anderson LK. Recombination: an underappreciated factor in the evolution of plant genomes. Nat Rev Genet. 2007;8:77–84. doi: 10.1038/nrg1970. [DOI] [PubMed] [Google Scholar]
  • 9.Charlesworth B, Charlesworth D. The degeneration of Y chromosomes. Philos Trans R Soc Lond B Biol Sci. 2000;355:1563–1572. doi: 10.1098/rstb.2000.0717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Charlesworth D. The timing of genetic degeneration of sex chromosomes. Philos Trans R Soc Lond B Biol Sci. 2021;376:20200093. doi: 10.1098/rstb.2020.0093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bergero R, Charlesworth D. The evolution of restricted recombination in sex chromosomes. Trends Ecol Evol. 2009;24:94–102. doi: 10.1016/j.tree.2008.09.010. [DOI] [PubMed] [Google Scholar]
  • 12.Charlesworth D. Evolution of recombination rates between sex chromosomes. Philos Trans R Soc Lond B Biol Sci. 2017;372 doi: 10.1098/rstb.2016.0456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Charlesworth D. When and how do sex-linked regions become sex chromosomes? Evolution. 2021;75:569–581. doi: 10.1111/evo.14196. [DOI] [PubMed] [Google Scholar]
  • 14.Olito C, Abbott JK. The evolution of suppressed recombination between sex chromosomes and the lengths of evolutionary strata. Evolution. 2023;77:1077–1090. doi: 10.1093/evolut/qpad023. [DOI] [PubMed] [Google Scholar]
  • 15.Charlesworth D. Why and how do Y chromosome stop recombining? J Evol Biol. 2023 doi: 10.1111/jeb.14137. [DOI] [PubMed] [Google Scholar]
  • 16.Koyama T, Nakamoto M, Morishima K, Yamashita R, Yamashita T, Sasaki K, Kuruma Y, Mizuno N, Suzuki M, Okada Y, et al. A SNP in a steroidogenic enzyme is associated with phenotypic sex in seriola fishes. Curr Biol. 2019;29:1901–1909.:e1908. doi: 10.1016/j.cub.2019.04.069. [DOI] [PubMed] [Google Scholar]
  • 17.Kuhl H, Guiguen Y, Hohne C, Kreuz E, Du K, Klopp C, Lopez-Roques C, Yebra-Pimentel ES, Ciorpac M, Gessner J, et al. A 180 Myr-old female-specific genome region in sturgeon reveals the oldest known vertebrate sex determining system with undifferentiated sex chromosomes. Philos Trans R Soc Lond B Biol Sci. 2021;376:20200089. doi: 10.1098/rstb.2020.0089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Liu Z, Moore PH, Ma H, Ackerman CM, Ragiba M, Yu Q, Pearl HM, Kim MS, Charlton JW, Stiles JI, et al. A primitive Y chromosome in papaya marks incipient sex chromosome evolution. Nature. 2004;427:348–352. doi: 10.1038/nature02228. [DOI] [PubMed] [Google Scholar]
  • 19.Akagi T, Henry IM, Tao R, Comai L. A Y-chromosome-encoded small RNA acts as a sex determinant in persimmons. Science. 2014;346:646–650. doi: 10.1126/science.1257225. [DOI] [PubMed] [Google Scholar]
  • 20.Akagi T, Pilkington SM, Varkonyi-Gasic E, Henry IM, Sugano SS, Sonoda M, Firl A, McNeilage MA, Douglas MJ, Wang T, et al. Two Y-chromosome-encoded genes determine sex in kiwifruit. Nat Plants. 2019;5:801–809. doi: 10.1038/s41477-019-0489-6. [DOI] [PubMed] [Google Scholar]
  • 21.Harkess A, Huang K, van der Hulst R, Tissen B, Caplan JL, Koppula A, Batish M, Meyers BC, Leebens-Mack J. Sex determination by two Y-linked genes in garden asparagus. Plant Cell. 2020 doi: 10.1105/tpc.19.00859. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Gong W, Filatov DA. Evolution of the sex-determining region in Ginkgo biloba. Philos Trans R Soc Lond B Biol Sci. 2022;377:20210229. doi: 10.1098/rstb.2021.0229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Rice WR. The accumulation of sexually antagonistic genes as a selective agent promoting the evolution of reduced recombination between primitive sex chromosomes. Evolution. 1987;41:911–914. doi: 10.1111/j.1558-5646.1987.tb05864.x. [DOI] [PubMed] [Google Scholar]
  • 24.Jay P, Tezenas E, Veber A, Giraud T. Sheltering of deleterious mutations explains the stepwise extension of recombination suppression on sex chromosomes and other supergenes. PLoS Biol. 2022;20:e3001698. doi: 10.1371/journal.pbio.3001698. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 25.Cavoto E, Neuenschwander S, Goudet J, Perrin N. Sex-antagonistic genes, XY recombination and feminized Y chromosomes. J Evol Biol. 2018;31:416–427. doi: 10.1111/jeb.13235. [DOI] [PubMed] [Google Scholar]
  • 26.Lenormand T, Roze D. Y recombination arrest and degeneration in the absence of sexual dimorphism. Science. 2022;375:663–666. doi: 10.1126/science.abj1813. [DOI] [PubMed] [Google Scholar]
  • 27.Jeffries DL, Gerchen JF, Scharmann M, Pannell JR. A neutral model for the loss of recombination on sex chromosomes. Philos Trans R Soc Lond B Biol Sci. 2021;376:20200096. doi: 10.1098/rstb.2020.0096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Krasovec M, Chester M, Ridout K, Filatov DA. The mutation rate and the age of the sex chromosomes in Silene latifolia. Current Biology. 2018;28:1832–1838. doi: 10.1016/j.cub.2018.04.069. [DOI] [PubMed] [Google Scholar]
  • 29.Charlesworth D. Plant contributions to our understanding of sex chromosome evolution. New Phytol. 2015;208:52–65. doi: 10.1111/nph.13497. [DOI] [PubMed] [Google Scholar]
  • 30.Charlesworth D. Plant sex chromosomes. Annu Rev Plant Biol. 2016;67:397–420. doi: 10.1146/annurev-arplant-043015-111911. [DOI] [PubMed] [Google Scholar]
  • 31.Kejnovsky E, Vyskot B. Silene latifolia: the classical model to study heteromorphic sex chromosomes. Cytogenet Genome Res. 2010;129:250–262. doi: 10.1159/000314285. [DOI] [PubMed] [Google Scholar]
  • 32.Westergaard M. Aberrant Y chromosomes and sex expression in Melandrium album. Hereditas. 1946;32:419–443. doi: 10.1111/j.1601-5223.1946.tb02784.x. [DOI] [PubMed] [Google Scholar]
  • 33.Charlesworth B, Charlesworth D. Model for evolution of dioecy and gynodioecy. American Naturalist. 1978;112:975–997. doi: 10.1086/283342. [DOI] [Google Scholar]
  • 34.Chibalina MV, Filatov DA. Plant Y chromosome degeneration is retarded by haploid purifying selection. Curr Biol. 2011;21:1475–1479. doi: 10.1016/j.cub.2011.07.045. [DOI] [PubMed] [Google Scholar]
  • 35.Bergero R, Qiu S, Charlesworth D. Gene loss from a plant sex chromosome system. Curr Biol. 2015;25:1234–1240. doi: 10.1016/j.cub.2015.03.015. [DOI] [PubMed] [Google Scholar]
  • 36.Papadopulos AS, Chester M, Ridout K, Filatov DA. Rapid Y degeneration and dosage compensation in plant sex chromosomes. Proc Natl Acad Sci U S A. 2015;112:13021–13026. doi: 10.1073/pnas.1508454112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Zemp N, Tavares R, Muyle A, Charlesworth D, Marais GA, Widmer A. Evolution of sex-biased gene expression in a dioecious plant. Nat Plants. 2016;2:16168. doi: 10.1038/nplants.2016.168. [DOI] [PubMed] [Google Scholar]
  • 38.Krasovec M, Kazama Y, Ishii K, Abe T, Filatov DA. Immediate dosage compensation is triggered by the deletion of Y-linked genes in Silene latifolia. Curr Biol. 2019;29:2214–2221. doi: 10.1016/j.cub.2019.05.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Muyle A, Zemp N, Deschamps C, Mousset S, Widmer A, Marais GA. Rapid de novo evolution of X chromosome dosage compensation in Silene latifolia, a plant with young sex chromosomes. PLoS Biol. 2012;10:e1001308. doi: 10.1371/journal.pbio.1001308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Muyle A, Shearn R, Marais GA. The evolution of sex chromosomes and dosage compensation in plants. Genome Biol Evol. 2017;9:627–645. doi: 10.1093/gbe/evw282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Krasovec M, Nevado B, Filatov DA. A comparison of selective pressures in plant X-linked and autosomal genes. Genes (Basel) 2018;9:234. doi: 10.3390/genes9050234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Bergero R, Forrest A, Kamau E, Charlesworth D. Evolutionary strata on the X chromosomes of the dioecious plant Silene latifolia: evidence from new sex-linked genes. Genetics. 2007;175:1945–1954. doi: 10.1534/genetics.106.070110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bergero R, Qiu S, Forrest A, Borthwick H, Charlesworth D. Expansion of the pseudo-autosomal region and ongoing recombination suppression in the Silene latifolia sex chromosomes. Genetics. 2013;194:673–686. doi: 10.1534/genetics.113.150755. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Qiu S, Bergero R, Forrest A, Kaiser VB, Charlesworth D. Nucleotide diversity in Silene latifolia autosomal and sex-linked genes. Proc Biol Sci. 2010;277:3283–3290. doi: 10.1098/rspb.2010.0606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Qiu S, Bergero R, Guirao-Rico S, Campos JL, Cezard T, Gharbi K, Charlesworth D. RAD mapping reveals an evolving, polymorphic and fuzzy boundary of a plant pseudoautosomal region. Mol Ecol. 2016;25:414–430. doi: 10.1111/mec.13297. [DOI] [PubMed] [Google Scholar]
  • 46.Bellott DW, Hughes JF, Skaletsky H, Brown LG, Pyntikova T, Cho TJ, Koutseva N, Zaghlul S, Graves T, Rock S, et al. Mammalian Y chromosomes retain widely expressed dosage-sensitive regulators. Nature. 2014;508:494–499. doi: 10.1038/nature13206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Zhou Q, Zhang J, Bachtrog D, An N, Huang Q, Jarvis ED, Gilbert MTP, Zhang G. Complex evolutionary trajectories of sex chromosomes across bird taxa. Science. 2014;346:1332. doi: 10.1126/science.1246338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Siroky J, Lysak MA, Dolezel J, Kejnovsky E, Vyskot B. Heterogeneity of rDNA distribution and genome size in Silene spp. Chromosome Res. 2001;9:387–393. doi: 10.1023/a:1016783501674. [DOI] [PubMed] [Google Scholar]
  • 49.Williams AM, Itgen MW, Broz AK, Carter OG, Sloan DB. Long-read transcriptome and other genomic resources for the angiosperm Silene noctiflora. G3 (Bethesda) 2021;11 doi: 10.1093/g3journal/jkab189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Zhang X, Zhang S, Zhao Q, Ming R, Tang H. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nat Plants. 2019;5:833–845. doi: 10.1038/s41477-019-0487-8. [DOI] [PubMed] [Google Scholar]
  • 51.Filatov DA. Heterochiasmy and sex chromosome evolution in Silene. Genes. 2023;14:543. doi: 10.3390/genes14030543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Howell EC, Armstrong SJ, Filatov DA. Dynamic gene order on the Silene latifolia Y chromosome. Chromosoma. 2011;120:287–296. doi: 10.1007/s00412-011-0311-3. [DOI] [PubMed] [Google Scholar]
  • 53.Haenel Q, Laurentino TG, Roesti M, Berner D. Meta-analysis of chromosome-scale crossover rate variation in eukaryotes and its significance to evolutionary genomics. Mol Ecol. 2018;27:2477–2497. doi: 10.1111/mec.14699. [DOI] [PubMed] [Google Scholar]
  • 54.Brazier T, Glemin S. Diversity and determinants of recombination landscapes in flowering plants. PLoS Genet. 2022;18:e1010141. doi: 10.1371/journal.pgen.1010141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Armstrong SJ, Filatov DA. A cytogenetic view of sex chromosome evolution in plants. Cytogenet Genome Res. 2008;120:241–246. doi: 10.1159/000121073. [DOI] [PubMed] [Google Scholar]
  • 56.Lengerova M, Moore RC, Grant SR, Vyskot B. The sex chromosomes of Silene latifolia revisited and revised. Genetics. 2003;165:935–938. doi: 10.1093/genetics/165.2.935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kazama Y, Sugiyama R, Suto Y, Uchida W, Kawano S. The clustering of four subfamilies of satellite DNA at individual chromosome ends in Silene latifolia. Genome. 2006;49:520–530. doi: 10.1139/g05-130. [DOI] [PubMed] [Google Scholar]
  • 58.Filatov DA. Recent expansion of the non-recombining sex-linked region on Silene latifolia sex chromosomes. J Evol Biol. 2022;35:1696–1708. doi: 10.1111/jeb.14063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Bergero R, Charlesworth D. Preservation of the Y transcriptome in a 10-million-year-old plant sex chromosome system. Curr Biol. 2011;21:1470–1474. doi: 10.1016/j.cub.2011.07.032. [DOI] [PubMed] [Google Scholar]
  • 60.Nicolas M, Marais G, Hykelova V, Janousek B, Laporte V, Vyskot B, Mouchiroud D, Negrutiu I, Charlesworth D, Moneger F. A gradual process of recombination restriction in the evolutionary history of the sex chromosomes in dioecious plants. PLoS Biol. 2005;3:e4. doi: 10.1371/journal.pbio.0030004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Filatov DA. Substitution rates in a new Silene latifolia sex-linked gene, SlssX/Y. Mol Biol Evol. 2005;22:402–408. doi: 10.1093/molbev/msi003. [DOI] [PubMed] [Google Scholar]
  • 62.Delichere C, Veuskens J, Hernould M, Barbacar N, Mouras A, Negrutiu I, Moneger F. SlY1, the first active gene cloned from a plant Y chromosome, encodes a WD-repeat protein. Embo J. 1999;18:4169–4179. doi: 10.1093/emboj/18.15.4169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Atanassov I, Delichere C, Filatov DA, Charlesworth D, Negrutiu I, Moneger F. Analysis and evolution of two functional Y-linked loci in a plant sex chromosome system. Mol Biol Evol. 2001;18:2162–2168. doi: 10.1093/oxfordjournals.molbev.a003762. [DOI] [PubMed] [Google Scholar]
  • 64.Moore RC, Kozyreva O, Lebel-Hardenack S, Siroky J, Hobza R, Vyskot B, Grant SR. Genetic and functional analysis of DD44, a sex-linked gene from the dioecious plant Silene latifolia, provides clues to early events in sex chromosome evolution. Genetics. 2003;163:321–334. doi: 10.1093/genetics/163.1.321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Papadopulos AST, Helmstetter AJ, Osborne OG, Comeault AA, Wood DP, Straw EA, Mason L, Fay MF, Parker J, Dunning LT, et al. Rapid parallel adaptation to anthropogenic heavy metal pollution. Mol Biol Evol. 2021;38:3724–3736. doi: 10.1093/molbev/msab141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Muyle A, Zemp N, Fruchard C, Cegan R, Vrana J, Deschamps C, Tavares R, Hobza R, Picard F, Widmer A, Marais GAB. Genomic imprinting mediates dosage compensation in a young plant XY system. Nat Plants. 2018;4:677–680. doi: 10.1038/s41477-018-0221-y. [DOI] [PubMed] [Google Scholar]
  • 67.Lahn BT, Page DC. Four evolutionary strata on the human X chromosome. Science. 1999;286:964–967. doi: 10.1126/science.286.5441.964. [DOI] [PubMed] [Google Scholar]
  • 68.Ironside JE. No amicable divorce? Challenging the notion that sexual antagonism drives sex chromosome evolution. Bioessays. 2010;32:718–726. doi: 10.1002/bies.200900124. [DOI] [PubMed] [Google Scholar]
  • 69.Iovene M, Yu Q, Ming R, Jiang J. Evidence for emergence of sex-determining gene(s) in a centromeric region in Vasconcellea parviflora. Genetics. 2015;199:413–421. doi: 10.1534/genetics.114.173021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Pilkington SM, Tahir J, Hilario E, Gardiner SE, Chagne D, Catanach A, McCallum J, Jesson L, Fraser LG, McNeilage MA, et al. Genetic and cytological analyses reveal the recombination landscape of a partially differentiated plant sex chromosome in kiwifruit. BMC Plant Biol. 2019;19:172. doi: 10.1186/s12870-019-1766-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Rifkin JL, Beaudry FEG, Humphries Z, Choudhury BI, Barrett SCH, Wright SI. Widespread recombination suppression facilitates plant sex chromosome evolution. Mol Biol Evol. 2021;38:1018–1030. doi: 10.1093/molbev/msaa271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Kazama Y, Kitoh M, Kobayashi T, Ishii K, Krasovec M, Yasui Y, Abe T, Kawano S, Filatov DA. A CLAVATA3-like gene acts as a gynoecium suppression function in white campion. Mol Biol Evol. 2022;39 doi: 10.1093/molbev/msac195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Martin H, Carpentier F, Gallina S, Gode C, Schmitt E, Muyle A, Marais GAB, Touzet P. Evolution of young sex chromosomes in two dioecious sister plant species with distinct sex determination systems. Genome Biol Evol. 2019;11:350–361. doi: 10.1093/gbe/evz001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Otto SP, Payseur BA. Crossover interference: shedding light on the evolution of recombination. Annu Rev Genet. 2019;53:19–44. doi: 10.1146/annurev-genet-040119-093957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Siljak-Yakovlev S, Muratović E, Bogunić F, Bašić N, Šolić ME, Boškailo A, Bourge M, Valentin N, Stevanović V, Garnatje T, et al. Genome size of Balkan flora: a database (GeSDaBaF) and C-values for 51 taxa of which 46 are novel. Plant Systematics and Evolution. 2020;306 doi: 10.1007/s00606-020-01670-y. [DOI] [Google Scholar]
  • 76.Filatov DA, Howell EC, Groutides C, Armstrong SJ. Recent spread of a retrotransposon in the Silene latifolia genome, apart from the Y chromosome. Genetics. 2009;181:811–817. doi: 10.1534/genetics.108.099267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Bergero R, Gardner J, Bader B, Yong L, Charlesworth D. Exaggerated heterochiasmy in a fish with sex-linked male coloration polymorphisms. PNAS. 2019;116:6924–6931. doi: 10.1073/pnas.1818486116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Rifkin JL, Hnatovska S, Yuan M, Sacchi BM, Choudhury BI, Gong Y, Rastas P, Barrett SCH, Wright SI. Recombination landscape dimorphism and sex chromosome evolution in the dioecious plant Rumex hastatulus. Philos Trans R Soc Lond B Biol Sci. 2022;377:20210226. doi: 10.1098/rstb.2021.0226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Sardell JM, Kirkpatrick M. Sex differences in the recombination landscape. Am Nat. 2020;195:361–379. doi: 10.1086/704943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Kazama Y, Sugiyama R, Matsunaga S, Shibata F, Uchida W, Hizume M, Kawano S. Organization of the KpnI family of chromosomal distal-end satellite DNAs in Silene latifolia. J Plant Res. 2003;116:317–326. doi: 10.1007/s10265-003-0106-6. [DOI] [PubMed] [Google Scholar]
  • 81.Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021;18:170–175. doi: 10.1038/s41592-020-01056-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Xie T, Zheng JF, Liu S, Peng C, Zhou YM, Yang QY, Zhang HY. De novo plant genome assembly based on chromatin interactions: a case study of Arabidopsis thaliana. Mol Plant. 2015;8:489–492. doi: 10.1016/j.molp.2014.12.015. [DOI] [PubMed] [Google Scholar]
  • 83.Servant N, Varoquaux N, Lajoie BR, Viara E, Chen CJ, Vert JP, Heard E, Dekker J, Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16:259. doi: 10.1186/s13059-015-0831-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 85.Zhang H, Song L, Wang X, Cheng H, Wang C, Meyer CA, Liu T, Tang M, Aluru S, Yue F, et al. Fast alignment and preprocessing of chromatin profiles with Chromap. Nat Commun. 2021;12:6566. doi: 10.1038/s41467-021-26865-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, Shamim MS, Machol I, Lander ES, Aiden AP, Aiden EL. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356:92–95. doi: 10.1126/science.aal3327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Durand NC, Robinson JT, Shamim MS, Machol I, Mesirov JP, Lander ES, Aiden EL. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 2016;3:99–101. doi: 10.1016/j.cels.2015.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Wolff J, Bhardwaj V, Nothjunge S, Richard G, Renschler G, Gilsbach R, Manke T, Backofen R, Ramirez F, Gruning BA. Galaxy HiCExplorer: a web server for reproducible Hi-C data analysis, quality control and visualization. Nucleic Acids Res. 2018;46:W11–W16. doi: 10.1093/nar/gky504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Simao FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
  • 90.Yue J, VanBuren R, Liu J, Fang J, Zhang X, Liao Z, Wai CM, Xu X, Chen S, Zhang S, et al. SunUp and Sunset genomes revealed impact of particle bombardment mediated transformation and domestication history in papaya. Nat Genet. 2022;54:715–724. doi: 10.1038/s41588-022-01068-1. [DOI] [PubMed] [Google Scholar]
  • 91.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Yang X, Li L. miRDeep-P: a computational tool for analyzing the microRNA transcriptome in plants. Bioinformatics. 2011;27:2614–2615. doi: 10.1093/bioinformatics/btr430. [DOI] [PubMed] [Google Scholar]
  • 93.Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 2019;20:275. doi: 10.1186/s13059-019-1905-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26:589–595. doi: 10.1093/bioinformatics/btp698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Subgroup GPDP The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Hu XS, Filatov DA. The large-X effect in plants: increased species divergence and reduced gene flow on the Silene X-chromosome. Molecular Ecology. 2016;25:2609–2619. doi: 10.1111/mec.13427. [DOI] [PubMed] [Google Scholar]
  • 97.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Edgar RC. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004;5:113. doi: 10.1186/1471-2105-5-113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
  • 100.Yang Z, Nielsen R. Synonymous and nonsynonymous rate variation in nuclear genes of mammals. J Mol Evol. 1998;46:409–418. doi: 10.1007/pl00006320. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials

Data Availability Statement

The S. latifolia female genome assembly and annotation have been deposited to Genome Warehouse (GWH) database in BIG data Center (https://ngdc.cncb.ac.cn/gwh/) under accession number GWHCBIJ00000000 and BioProject accession PRJCA014197. The PacBio and Hi-C data have been uploaded to NCBI Sequence Read Archive (SRA) database as Bioproject PRJNA952727.

RESOURCES