Skip to main content
BMC Biology logoLink to BMC Biology
. 2022 Jun 9;20:133. doi: 10.1186/s12915-022-01338-8

A gene deriving from the ancestral sex chromosomes was lost from the X and retained on the Y chromosome in eutherian mammals

Jennifer F Hughes 1,✉,#, Helen Skaletsky 1,2,#, Peter K Nicholls 1,3,#, Alexis Drake 1, Tatyana Pyntikova 1, Ting-Jan Cho 1, Daniel W Bellott 1, David C Page 1,2,4
PMCID: PMC9178871  PMID: 35676717

Abstract

Background

The mammalian X and Y chromosomes originated from a pair of ordinary autosomes. Over the past ~180 million years, the X and Y have become highly differentiated and now only recombine with each other within a short pseudoautosomal region. While the X chromosome broadly preserved its gene content, the Y chromosome lost ~92% of the genes it once shared with the X chromosome. PRSSLY is a Y-linked gene identified in only a few mammalian species that was thought to be acquired, not ancestral. However, PRSSLYs presence in widely divergent species—bull and mouse—led us to further investigate its evolutionary history.

Results

We discovered that PRSSLY is broadly conserved across eutherians and has ancient origins. PRSSLY homologs are found in syntenic regions on the X chromosome in marsupials and on autosomes in more distant animals, including lizards, indicating that PRSSLY was present on the ancestral autosomes but was lost from the X and retained on the Y in eutherian mammals. We found that across eutheria, PRSSLY’s expression is testis-specific, and, in mouse, it is most robustly expressed in post-meiotic germ cells. The closest paralog to PRSSLY is the autosomal gene PRSS55, which is expressed exclusively in testes, involved in sperm differentiation and migration, and essential for male fertility in mice. Outside of eutheria, in species where PRSSLY orthologs are not Y-linked, we find expression in a broader range of somatic tissues, suggesting that PRSSLY has adopted a germ-cell-specific function in eutherians. Finally, we generated Prssly mutant mice and found that they are fully fertile but produce offspring with a modest female-biased sex ratio compared to controls.

Conclusions

PRSSLY appears to be the first example of a gene that derives from the mammalian ancestral sex chromosomes that was lost from the X and retained on the Y. Although the function of PRSSLY remains to be determined, it may influence the sex ratio by promoting the survival or propagation of Y-bearing sperm.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12915-022-01338-8.

Keywords: Y chromosome, Sex chromosomes, Sex ratio, Trypsin-like serine protease

Background

The sex chromosomes of all marsupials and placental mammals share a common origin—they evolved from a pair of autosomes during the past ~180 million years, likely after the divergence of monotremes [1, 2]. After the proto-Y chromosome acquired the male-determining gene SRY, the sex chromosomes began to differentiate, mediated through recombination suppression, which occurred in a series of discrete steps and spread across the length of the chromosomes [3]. The X and Y followed drastically different evolutionary trajectories because the X undergoes regular meiotic recombination in females, while the Y has no recombination partner over the majority of its length [4]. Consequently, the Y lost ~92% of its ancestral genes through repeated deletion and pseudogenization events. Despite this apparent evolutionary freefall, the gene content of the Y chromosome among eutherian mammals is highly conserved, suggesting that the remaining genes perform essential functions and are under strong selective pressure [5].

We identified a widespread yet uncharacterized mammalian Y-linked gene—PRSSLY (protease, serine-like Y) that appears to be the first example of a gene that survived on the Y chromosome but was lost from the X chromosome. PRSSLY escaped notice until recently because it is not present on the first three Y chromosomes sequenced to completion—human, chimpanzee, and rhesus macaque [68]. PRSSLY was first discovered on the mouse [9] and dog [10] Y chromosomes (initially named DYNG in dog), and homologs were subsequently found on the Y chromosomes of bull [11] and pig [12]. PRSSLY appears to encode a massive protein (average size: 2212 amino acid residues; largest: 4591 residues in deer; mammalian genome average: ~400 residues). Despite its size, PRSSLY contains only one identifiable domain—trypsin-like serine proteasemaking it part of the large PRSS gene family, which consists of 27 autosomal family members in human. PRSSLY’s expression pattern in eutherian mammals is testis-specific, suggesting a role in sperm development. Given its distinctive species distribution (conserved in diverse mammals, but lost in some lineages), gene structure, and expression pattern, we explored the evolution and function of PRSSLY.

Results

PRSSLY is widely distributed across mammals and has a unique gene structure

First, we assessed the conservation of PRSSLY in mammals by conducting a comprehensive survey of mammalian genomes for the gene’s presence or absence. The number of complete Y chromosomes available for analysis is limited, so we searched available male genomic DNA and testis RNA-seq datasets. In total, we examined datasets from 47 mammals and found that PRSSLY is present in species representing every major mammalian lineage, but it has been lost or pseudogenized repeatedly in multiple lineages: primates, felines, naked mole rat, horse, dolphin, and opossum (Fig. 1, Additional file 1). For 24 species, there is evidence of Y linkage because (i) PRSSLY sequence was found in confirmed Y chromosome sequence or (ii) PRSSLY was present in male whole-genome-shotgun sequence or RNA-seq datasets but missing from available female whole-genome-shotgun sequence (Additional file 1). The human, chimpanzee, and rhesus Y chromosomes contain loci with homology to PRSSLY, but these loci are pseudogenes. We found evidence that these pseudogenes are transcribed at low levels (via analysis of publicly available RNA-seq datasets), but they have severely truncated open reading frames (Additional file 2: Fig. S1). Y-linkage in mammals is not universal, however. PRSSLY has apparently translocated to an autosome at least three times in the rodent lineage (rat, mole vole, and naked mole rat) (Fig. 1). Phylogenetic analysis confirms that the autosomal copies in these species cluster with other PRSSLY homologs as expected, indicating that they are recently translocated (Additional file 2: Fig. S2). PRSSLY homologs are X-linked in marsupials and autosomal in monotremes (Additional file 2: Fig. S3, Additional file 1). Beyond mammals, we found PRSSLY homologs in species as divergent as lizards, newts, and caecilians (Fig. 1, Additional file 2: Fig. S4), so it likely arose in the tetrapod ancestor. However, this gene appears to have been lost in several major tetrapod lineages, including archelosauria (birds, crocodilians, and turtles), snakes, and frogs (Additional file 3).

Fig. 1.

Fig. 1

Species distribution of PRSSLY homologs. At left, tree diagram shows evolutionary relationships between species. Line length is proportional to time, with scale shown at bottom. Red asterisks indicate loss or pseudogenization of PRSSLY in a given lineage. At right, status of PRSSLY and chromosomal location (if known) are indicated

The unusual gene structure of PRSSLY differs vastly between species (Fig. 2). The region encoding the conserved trypsin-like serine protease domain is contained within four to nine exons spanning ~1750 bp on average. However, in many species (e.g., mouse lemur, tree shrew, ferret, and deer), the entire open reading frame (ORF) spans >10 kb, with most of the ORF residing in a single exon (Fig. 2). These massive exons dwarf typical exons, which are ~300 bp on average in the human genome, and they rival the longest known coding exon, which is ~21 kb (in the gene MUC16) [13]. These long ORFs have no identifiable domains, only show homology between closely related species (Additional file 2: Fig. S5), and are less conserved than the trypsin-like serine protease domain. Using sequences from 27 animals, including mammals, reptiles, and amphibians, we calculated non-synonymous to synonymous substitution rate ratios (Ka/Ks) across the length of PRSSLY and PRSSLY homologs. We found that Ka/Ks is close to one within the upstream ORF and that Ka/Ks is much lower (from 0.001 to 0.31) within the conserved domain, indicating a slower rate of evolution within this domain (Additional file 2: Fig. S6, Additional files 4, 5). Furthermore, the non-synonymous substitution rate is consistently higher within the upstream ORF compared to the conserved domain (Additional file 2: Fig. S6).

Fig. 2.

Fig. 2

PRSSLY gene structure. Exons are indicated by boxes and are drawn to scale; introns are indicated by lines and are not drawn to scale. Conserved trypsin-like serine protease domains are shaded blue. Purple and orange lines indicate putative translation start sites and stop sites, respectively

PRSSLY is testis-specific in eutherians but more broadly expressed in other animals

Next, we investigated the expression pattern of PRSSLY and its homologs across species. We examined expression in species where RNA-seq datasets from multiple tissues, including testis, were publicly available, and where the transcriptome was well annotated. In eutherian mammals, where Y linkage is nearly universal, PRSSLY is testis-specific (Fig. 3). Humans have 27 autosomal members of the PRSS gene family; about half of the family members share this testis-specific expression pattern, including PRSSLY’s closest relative PRSS55 (Additional file 2: Fig. S7). We were able to refine the expression pattern of PRSSLY in mouse and bull. Using a germ-cell depleted mouse model [14, 15], we found that PRSSLY is expressed exclusively in adult germ cells (Additional file 2: Fig. S8). In bull, we analyzed previously published RNA-seq datasets generated from purified germ cells (pachytene spermatocytes and round spermatids) [16] and were able to detect transcription of bull PRSSLY in these samples (Additional file 2: Fig. S8), providing evidence that it is transcribed in male germ cells.

Fig. 3.

Fig. 3

RNA-seq analysis of PRSSLY across tissues in eight eutherian mammals. Expression levels for PRSSLY were estimated in transcript per million (TPM) units. TPM values are plotted on a log10 scale. For some tissues, multiple biological replicates were analyzed for each tissue; means with standard errors are plotted. Details and source data can be found in Additional file 6

Next, we examined the timing of PRSSLY expression in finer detail. For mouse, we examined publicly available single-cell RNA-seq datasets generated from adult whole testis [17]. Our analysis confirmed that PRSSLY is expressed only in germ cells and is absent in six somatic cell types included in this dataset (Fig. 4A). In germ cells, PRSSLY is barely detectable in pre-meiotic and meiotic cells (spermatogonia and spermatocytes, respectively), but is highly expressed in round spermatids—the developmental stage immediately following meiosis (Fig. 4A). PRSSLY expression is greatly reduced during the next stage of post-meiotic development (elongating spermatids).

Fig. 4.

Fig. 4

RNA-seq analysis of PRSSLY across development in mouse, rat, and rabbit. A For single-cell RNA-seq analysis in mouse, expression levels for PRSSLY are shown as reads per million mapped reads (RPM). At right, representative spermatogenic cells are shown (created with BioRender.com). B For bulk RNA-seq in mouse, rat, and rabbit across developmental timepoints, expression levels for PRSSLY were estimated in transcript per million (TPM) units. TPM values are plotted on a log10 scale. For some timepoints, multiple biological replicates were analyzed for each tissue; means with standard errors are plotted. Details and source data can be found in Additional file 6

We also examined publicly available testis bulk RNA-seq datasets spanning developmental timepoints from embryo to adult [18]. Such datasets were available for mouse, rat, and rabbit, and in all species the onset of PRSSLY expression correlates with the onset of meiosis (Fig. 4B). In rat, where PRSSLY was translocated to an autosome, we looked at available time course RNA-seq data from a variety of female tissues: ovary, brain, heart, kidney, and liver. PRSSLY expression was detected, at very low levels, in ovary, but was absent in all somatic tissues (Fig. 4B; Additional file 6). As in testis, the onset of PRSSLY expression in the rat ovary correlates with the onset of meiosis, but the functional relevance of this ovarian expression is unknown. The factor that activates PRSSLY at the onset of meiosis may be expressed in both males and females and conserved between mouse and rat. Since the entire PRSSLY gene, including introns, was translocated from the Y chromosome to chr14, the promoter was also likely translocated, accounting for the conserved expression pattern. Unfortunately, the sequence upstream of PRSSLY is too short to allow for comparison.

Outside of eutherians, PRSSLY homologs, which are located on the X chromosome or autosomes, are more broadly expressed (Fig. 5). We examined publicly available RNA-seq data for two marsupials, two monotremes, and two lizards where multiple tissue types, including testis and ovary, were available. Non-Y-linked PRSSLY homologs are expressed in both males and females in both gonadal and somatic tissues. In most species, especially lizards, PRSSLY expression is highest in testis.

Fig. 5.

Fig. 5

RNA-seq analysis of PRSSLY homologs (located on X chromosome and autosomes) across tissues in marsupials, monotremes, and lizards. Expression levels for PRSSLY homologs were estimated in transcript per million (TPM) units. TPM values are plotted on a log10 scale. For some tissues, multiple biological replicates were analyzed for each tissue; means with standard errors are plotted. Details and source data can be found in Additional file 6

Unique evolutionary history of PRSSLY

The chromosomal location and expression pattern of PRSSLY have evolved over time. The most parsimonious explanation for the gene’s evolutionary trajectory is supported by synteny analysis (Fig. 6). We propose that the gene originated in the tetrapod ancestor on the autosome pair that eventually became the proto-X and Y chromosomes in mammals [19]. In the ancestor of placental mammals, the X and Y chromosomes were expanded through an autosomal transposition event. PRSSLX/Y was located within stratum 2, which is part of the ancestral, conserved region [3]. After the placental-marsupial split, PRSSLX/Y was lost from the Y chromosome but retained on the X in marsupials, and lost from the X chromosome but retained on the Y in eutherian mammals. The Y-linked version in eutherians then became restricted in its expression pattern, perhaps acquiring a novel function in spermatogenesis. This evolutionary trajectory is highly unusual. While ~92% of the 636 genes once shared between the X and Y chromosomes have been lost from the eutherian Y chromosome and retained on the X chromosome [20], PRSSLY is the first and only example of an ancestral X-Y pair gene lost from the X chromosome and retained on the Y chromosome.

Fig. 6.

Fig. 6

Syntenic relationships in anole lizard, wallaby, and human. Gene positions (blue boxes) in vicinity of autosomal PRSSLYL in anole lizard and X-linked PRSSLX in wallaby, as well as syntenic region on human X chromosome, which is missing a PRSSLY homolog, are shown. Gene positions based on genome assemblies for anole lizard chr1 (Broad AnoCar2.0/anoCar2) and human X chromosome (GRCh38/hg38). The genome assembly for wallaby does not provide sufficient X coverage, but an X-chromosome-derived BAC sequence is available (accession number CU234131) that contains PRSSLX and upstream genes. Dashed lines connect homologous genes

Sex ratio of Prssly-knockout offspring is skewed towards females

We explored Prssly’s function by generating likely loss-of-function CRISPR mutations in mice. We designed guide RNAs to target exons 6 and 8, which are part of the conserved trypsin-like serine protease domain (Fig. 7A). We obtained four founder males with various frame-disrupting mutations: (i) a 407-bp deletion between exons 6 and 8, creating a premature stop (Δ407); (ii) a 289-bp retroviral insertion into exon 6, creating a premature stop (ins289); (iii) a 14-bp deletion in exon 6, creating a premature stop (Δ14); and (iv) a 47-bp deletion, including the first 20 bp of exon 8, likely disrupting splicing (Δ47) (Additional file 2: Fig. S9). The mutations were introduced near the 3’ terminus of the gene (Additional file 2: Fig. S9), so we cannot rule out the possibility that Prssly’s function is partly preserved in these mutants.

Fig. 7.

Fig. 7

Sex ratio distortion in PRSSLY mutant offspring. A Structure of mouse Prssly gene. Exons are indicated by boxes, and introns by lines; both drawn to scale. Conserved trypsin-like serine protease domains are shaded blue. Arrows indicate positions of CRISPR guide RNAs. B and C Total number of male and female offspring in all four mutant lines vs. controls (B) and number of male and female offspring in each mutant line (C). Two-sided chi-square tests comparing offspring sex numbers in mutants (observed) vs. controls (expected) were performed; p=0.019 for all mutants vs. controls (B) and p=0.043 for Δ47 (C). P values for Δ407, Δ14, and ins289 were not significant

Given Prssly’s testis-specific expression pattern in mouse, dog, and bull [911], we anticipated that these mutations might affect spermatogenesis. We found that Prssly mutants had testis weights that were within the normal range, but were significantly less than those of controls (Additional file 2: Fig. S10). However, males carrying any of the four alleles were fertile and had normal testis histology (Additional file 2: Fig. S11). A recent study, which also generated and characterized a Prssly-mutant mouse via CRISPR (targeting exon 5), confirmed these results: mutants were fertile with normal testis size and sperm morphology [21].

We continued breeding the mutant lines, and a clear phenotype gradually emerged: the sex ratio of the offspring of the Prssly mutant males was skewed towards females. We generated 95 litters and a total of 601 offspring (Additional file 7). Among these 601 offspring of Prssly mutant males, 47.4% were male, which is significantly lower than the 52.2% males we observed among 255 offspring of control males (Fig. 7B). If we consider each of the four mutant lines separately, the strength of the sex-ratio skewing varies (Fig. 7C). We observe no effect in the Δ14 mutant, which may indicate that PRSSLY is at least partially functional in this line or we have an inadequate number of offspring to detect sex ratio skewing.

We also found that the sizes of the weaned litters produced by Prssly mutant mice were significantly smaller (~1.5 fewer offspring per litter) than litters produced by control mice (Additional file 2: Fig. S12), so the sex-ratio skewing could be due to a sex difference in embryonic lethality or early postnatal survival rate. However, our breeding experiments were not designed to track variation in litter size (e.g. offspring were not counted immediately after birth) so we cannot conclude that there is a Prssly-related effect. Moreover, when each mutant is considered separately, the magnitude of the litter size difference (Additional file 2: Fig. S12) does not correlate with the magnitude of the sex-ratio skewing (Fig. 7C), suggesting that the two observations are unrelated.

Discussion

We characterized a novel testis-specific Y-linked gene—PRSSLY—that is widespread in eutherian mammals and has ancient origins, dating back at least ~350 million years. PRSSLY is the first known example of a gene that survived on the mammalian Y chromosome but was lost from the X chromosome. The mammalian X and Y chromosomes originated from a pair of autosomes [2, 22]. Over the past ~180 million years, the X and Y chromosomes followed divergent evolutionary paths, with the Y losing ~92% of the genes it once shared with the X, while the ancestral gene content of the X remained essentially unchanged [20]. It is thus highly unusual that PRSSLY, which was clearly present on the ancestral autosome that gave rise to the mammalian X and Y, was lost specifically from the X chromosome in eutherian mammals. The Y copy was subsequently lost in several distinct eutherian lineages, but PRSSLY has survived for tens of millions of years in most lineages. In marsupials, the opposite, and more common, pattern appears, with the X copy being retained and the Y copy being lost. However, not all marsupial lineages have retained the X copy, which parallels the lineage-specific loss of the Y copy in eutherian mammals.

We probed Prssly’s function in mice and found that Prssly mutants are fertile, yet produce more female offspring than expected. In mice, Sly – Slx/Slx1 are sex-chromosome genes that have been found to influence the sex ratio through intragenomic conflict in post-meiotic germ cells. Unlike Prssly, Sly – Slx/Slx1 are not conserved outside of the Mus lineage. Sly – Slx/Slx1 are also highly amplified on the sex chromosomes, with ~120 copies of Sly on the mouse Y long arm and ~40 copies of Slx/Slx1 on the mouse X chromosome. Mice with a deletion encompassing two-thirds of the Y long-arm produce excess females (38% male) [23]. ShRNA-knockdowns of Slx/Slx1 in males results in offspring sex ratio skewing towards males (60% males) [24]. A separate study showed that targeted deletion and duplication of the Slx/Slx1 gene family skewed sex ratios towards males and females, respectively [25]. Sly and Slx/Slxl1 deficiencies result in sperm head/spermatid elongation defects and sperm release defects, respectively [23, 26]. Double knock-down of Sly and Slx/Slx1 rescues both the sperm defects and the skewed sex ratio [24]. We found no connection between Prssly and Sly/Slx/Slx1 when we examined testis single-cell RNA-seq data [17] for evidence of correlated gene expression, so these systems appear to operate independently.

Although we do not yet know the mechanism by which PRSSLY affects the sex ratio in mice, PRSSLY likely operates directly in the male germline at or after the onset of meiosis based on its expression pattern. The function of PRSSLY’s closest relative—PRSS55—may also provide some clues. PRSS55 is essential for male mouse fertility, playing a role in sperm motility and sperm–egg binding [27] as well as structural differentiation and energy metabolism [28]. Although PRSSLY is not required for fertility, it may act in a similar post-meiotic fashion to ensure the propagation of Y-bearing sperm. A full characterization of sperm morphology and sperm count in Prssly mutants will help elucidate this mechanism.

Conclusions

This study uncovers a widespread mammalian Y-linked gene—PRSSLY— that appears to have survived on the Y chromosome but was lost from the X in eutherians, defying the trend set by >600 genes that followed the opposite evolutionary path during X-Y differentiation. In mice, Prssly is expressed strictly in post-meiotic male germ cells and appears to influence the sex ratio, perhaps by promoting the propagation of Y-bearing sperm. Whether PRSSLY plays a similar role in other species remains to be determined. If so, this discovery could open the door to the possibility of manipulating sex ratios in livestock, which would be of great interest, both biologically and commercially.

Methods

Identification of PRSSLY homologs

Using NCBI Blast suite with default parameters, we performed TBLASTN (protein sequence against translated nucleotide database) searches of NCBI’s non-redundant nucleotide database using PRSSLY sequences from bull and mouse as query sequences. Once more divergent PRSSLY sequences were identified (i.e., wallaby, lizard, and caecilian), we repeated the TBLASTN searches with the newly identified sequences as queries. To search for PRSSLY in species without available male genomic sequence, we scanned NCBI’s Sequence Read Archive database for available testis RNA-seq datasets and performed mapping analyses using PRSSLY sequence from the most closely related species (Additional file 1). To confirm that PRSSLY homologs were missing in certain species, we searched genomic assemblies using NCBI Blast suite with default parameters, using PRSSLY homolog in most closely related species as the query sequence. For species with or without closely related PRSSLY homologs we used BLASTN or TBLASTN, respectively. When genomic assemblies were not available, we searched short read datasets (RNA-seq or WGS) using the following pipeline: Fastq files were reformatted to fasta files; BLAST database was created using the makeblastdb function (version 2.10.1+); resulting database was searched with blastn (version 2.10.1+) (Additional file 3). We determined that PRSSLY is single-copy in all species with high-quality reference assemblies. For species without such assemblies, we searched for evidence of multiple PRSSLY copies using the following strategies but found none. First, we found no polymorphisms in PRSSLY RNA-seq reads. Second, we found no increased coverage of PRSSLY in raw genomic reads.

Alignments, phylogenetic, and dot plot analyses

Nucleotide sequence alignment of conserved regions of PRSSLY homologs was performed using PRANK (version 121002) with default parameters [29]. Phylogenetic tree using nucleotide alignment was generated using PhyML (version 3.3) with default parameters [30]. Amino acid sequence alignments were performed using Clustalw (version 2.1) with default parameters [31]. Phylogenetic trees of PRSS gene family using amino acid alignment were generated using maximum likelihood in PHYLIP (version 3.66) with Jones-Taylor-Thornton model. For Ka-Ks analysis, separate alignments were generated (using Clustalw) for the conserved trypsin-like serine protease domain and the upstream ORF region. Ka/Ks calculations were performed with KaKs_Calculator (version 2.0) using codon alignments [32]. Alignment lengths of upstream ORFs were determined using FASTA (version 36.3) [33]. Dotplots were generated in MacVector (version 17.0.10) using default parameters.

RNA-seq analysis

For each species, RNA-seq datasets were downloaded from NCBI’s Sequence Read Archive database, and transcriptomes were downloaded from Ensembl (transcriptome versions given in Additional file 6). For bulk analyses, RNA-seq reads were mapped to their respective transcriptomes using Salmon version 1.6.0 with the mapping validation option enabled [34]. For single-cell analysis, reads were mapped using Bowtie version 1.2.2 [35], and cell types were assigned as previously published [17].

Generation of CRISPR mutations and mouse husbandry

The Prssly mutant mice were generated via a CRISPR/Cas9-mediated strategy on the C57BL/6J background. We designed two gRNAs, one targeting the end of exon 6 and the other targeting the start of exon 8, with the goal of producing a cut at both sites, and ideally, a deletion of the genomic DNA between these two sites. Experimental and control animals were backcrossed to C57BL/6J for an additional two generations or more. Deletions and insertions in founders and offspring were confirmed by PCR amplification and Sanger sequencing. Male offspring with edits to Prssly were subsequently backcrossed to C57BL/6J for two or more additional generations. The integrity of the long arm was confirmed by 18 PCR assays spanning the mouse Y (Additional file 8). The line used for controls was derived from founder littermates that did not contain CRISPR edits or mutations. Thus, controls and mutants shared the same paternal Y chromosome lineage. Litters were counted and sexed at day 5 and again prior to weaning. We genotyped males and females for the presence of the Y chromosome and found perfect correlation with observed phenotypic sex. To minimize variability between controls and mutants, all mice were maintained in the same room, were handled by the same staff, received cage changes on the same day, and received the same diet. Data collection for controls and mutants was performed in parallel. All experiments conformed to principles and guidelines approved by the Committee on Animal Care at the Massachusetts Institute of Technology.

Supplementary Information

12915_2022_1338_MOESM1_ESM.xlsx (4MB, xlsx)

Additional file 1. Table of species distribution of PRSSLY homologs, including accession numbers for genomic and expression datasets used for identification of PRSSLY.

12915_2022_1338_MOESM2_ESM.docx (4MB, docx)

Additional file 2: Fig. S1. Structure and RNA-seq analysis of human, chimpanzee, and rhesus PRSSLY pseudogenes. Fig. S2. Phylogenetic analysis of PRSSLY nucleotide sequences. Fig. S3. Confirmation of X-linkage of PRSSLY in marsupials. Fig. S4. Phylogenetic analyses of PRSS family amino acid sequences. Fig. S5. Sequence conservation across PRSSLY gene sequences. Fig. S6. Analysis of synonymous (Ks) and non-synonymous (Ka) substitution rates across PRSSLY. Fig. S7. Expression of human PRSS homologs across tissues. Fig. S8. Gene expression analysis of PRSSLY in purified male germ cells and germ-cell-depleted testis. Fig. S9. Four CRISPR-induced mutations in mouse PRSSLY. Fig. S10. Testis weights of control and Prssly mutant mice. Fig. S11. Testis histology of control and Prssly mutant mice. Fig. S12. Litter sizes in Prssly mutants and controls.

12915_2022_1338_MOESM3_ESM.xlsx (23.9KB, xlsx)

Additional file 3. Table detailing evidence that PRSSLY homologs are missing in species.

12915_2022_1338_MOESM4_ESM.xlsx (33.5KB, xlsx)

Additional file 4. Ka-Ks analysis of PRSSLY and homologs.

12915_2022_1338_MOESM5_ESM.docx (58.4KB, docx)

Additional file 5. Sequences used for Ka-Ks analysis of PRSSLY and homologs.

12915_2022_1338_MOESM6_ESM.xlsx (43.3KB, xlsx)

Additional file 6. RNA-seq analyses including accession numbers and descriptions of RNA-seq datasets, transcriptomes used for mapping, and mapping results.

12915_2022_1338_MOESM7_ESM.xlsx (17.2KB, xlsx)

Additional file 7. Breeding of PRSSLY mutant mice.

12915_2022_1338_MOESM8_ESM.xlsx (13.1KB, xlsx)

Additional file 8. PCR assays across mouse Y long arm in PRSSLY mutant mice.

12915_2022_1338_MOESM9_ESM.docx (109.3KB, docx)

Additional file 9. Sequences and GenBank accession numbers for PRSSLY sequences.

Acknowledgements

We thank S. Markoulaki and the Genetically Engineered Models core facility at Whitehead Institute for CRISPR/Cas9 injections. We thank M. Mikedis for assistance with imaging of mouse testis sections. We thank J. Mueller for comments on the manuscript.

Abbreviations

ORF

Open reading frame

Ka

Non-synonymous substitution rate

Ks

Synonymous substitution rate

Authors’ contributions

JFH., HS, PKN, and DCP designed the study. JFH, HS, and DWB performed computational analyses. PKN and AD performed mouse breeding studies. TJC, TP, and DWB performed sequencing studies. JFH and DCP wrote the paper. The authors read and approved the final manuscript.

Funding

The group of D.C.P. is supported by the Howard Hughes Medical Institute, the Whitehead Institute, and philanthropic gifts from Brit and Alexander d’Arbeloff, Authur W. and Carol Tobin Brill, and Charles Ellis.

Availability of data and materials

Nucleotide sequences for PRSSLY and non-Y-linked PRSSLY homologs that were assembled from genomic and/or RNA-seq data are available in the Third Party Annotation Section of the DDBJ/ENA/GenBank databases under the accession numbers TPA: BK059441-BK059443, BK059500-BK59524, and OK484381-OK484382 (see Additional File 9 for all sequences).

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Jennifer F. Hughes, Helen Skaletsky, and Peter K. Nicholls contributed equally to this work.

Contributor Information

Jennifer F. Hughes, Email: jhughes@wi.mit.edu

Helen Skaletsky, Email: skaletsky@wi.mit.edu.

Peter K. Nicholls, Email: P.Nicholls@bradford.ac.uk

Alexis Drake, Email: drake@wi.mit.edu.

Tatyana Pyntikova, Email: pyntikova@wi.mit.edu.

Ting-Jan Cho, Email: cho@wi.mit.edu.

Daniel W. Bellott, Email: bellott@wi.mit.edu

David C. Page, Email: dcpage@wi.mit.edu

References

  • 1.Livernois AM, Graves JA, Waters PD. The origin and evolution of vertebrate sex chromosomes and dosage compensation. Heredity. 2012;108:50–58. doi: 10.1038/hdy.2011.106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ohno S. Sex Chromosomes and Sex-linked Genes. Berlin: Springer-Verlag; 1967. [Google Scholar]
  • 3.Lahn BT, Page DC. Four evolutionary strata on the human X chromosome. Science. 1999;286:964–967. doi: 10.1126/science.286.5441.964. [DOI] [PubMed] [Google Scholar]
  • 4.Charlesworth B. The evolution of sex chromosomes. Science. 1991;251:1030–1033. doi: 10.1126/science.1998119. [DOI] [PubMed] [Google Scholar]
  • 5.Bellott DW, Hughes JF, Skaletsky H, Brown LG, Pyntikova T, Cho TJ, Koutseva N, Zaghlul S, Graves T, Rock S, Kremitzki C, Fulton RS, Dugan S, Ding Y, Morton D, Khan Z, Lewis L, Buhay C, Wang Q, Watt J, Holder M, Lee S, Nazareth L, Alfoldi J, Rozen S, Muzny DM, Warren WC, Gibbs RA, Wilson RK, Page DC. Mammalian Y chromosomes retain widely expressed dosage-sensitive regulators. Nature. 2014;508:494–499. doi: 10.1038/nature13206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Skaletsky H, Kuroda-Kawaguchi T, Minx PJ, Cordum HS, Hillier L, Brown LG, Repping S, Pyntikova T, Ali J, Bieri T, Chinwalla A, Delehaunty A, Delehaunty K, Du H, Fewell G, Fulton L, Fulton R, Graves T, Hou SF, Latrielle P, Leonard S, Mardis E, Maupin R, McPherson J, Miner T, Nash W, Nguyen C, Ozersky P, Pepin K, Rock S, Rohlfing T, Scott K, Schultz B, Strong C, Tin-Wollam A, Yang SP, Waterston RH, Wilson RK, Rozen S, Page DC. The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature. 2003;423:825–837. doi: 10.1038/nature01722. [DOI] [PubMed] [Google Scholar]
  • 7.Hughes JF, Skaletsky H, Brown LG, Pyntikova T, Graves T, Fulton RS, Dugan S, Ding Y, Buhay CJ, Kremitzki C, Wang Q, Shen H, Holder M, Villasana D, Nazareth LV, Cree A, Courtney L, Veizer J, Kotkiewicz H, Cho TJ, Koutseva N, Rozen S, Muzny DM, Warren WC, Gibbs RA, Wilson RK, Page DC. Strict evolutionary conservation followed rapid gene loss on human and rhesus Y chromosomes. Nature. 2012;483:82–86. doi: 10.1038/nature10843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Hughes JF, Skaletsky H, Pyntikova T, Graves TA, van Daalen SK, Minx PJ, Fulton RS, McGrath SD, Locke DP, Friedman C, Trask BJ, Mardis ER, Warren WC, Repping S, Rozen S, Wilson RK, Page DC. Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature. 2010;463:536–539. doi: 10.1038/nature08700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Soh YQ, Alfoldi J, Pyntikova T, Brown LG, Graves T, Minx PJ, Fulton RS, Kremitzki C, Koutseva N, Mueller JL, Rozen S, Hughes JF, Owens E, Womack JE, Murphy WJ, Cao Q, de Jong P, Warren WC, Wilson RK, Skaletsky H, Page DC. Sequencing the mouse Y chromosome reveals convergent gene acquisition and amplification on both sex chromosomes. Cell. 2014;159:800–813. doi: 10.1016/j.cell.2014.09.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li G, Davis BW, Raudsepp T, Pearks Wilkerson AJ, Mason VC, Ferguson-Smith M, O'Brien PC, Waters PD, Murphy WJ. Comparative analysis of mammalian Y chromosomes illuminates ancestral structure and lineage-specific evolution. Genome Res. 2013;23:1486–1495. doi: 10.1101/gr.154286.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hughes JF, Skaletsky H, Pyntikova T, Koutseva N, Raudsepp T, Brown LG, Bellott DW, Cho TJ, Dugan-Rocha S, Khan Z, Kremitzki C, Fronick C, Graves-Lindsay TA, Fulton L, Warren WC, Wilson RK, Owens E, Womack JE, Murphy WJ, Muzny DM, Worley KC, Chowdhary BP, Gibbs RA, Page DC. Sequence analysis in Bos taurus reveals pervasiveness of X-Y arms races in mammalian lineages. Genome Res. 2020;30:1716–1726. doi: 10.1101/gr.269902.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Skinner BM, Sargent CA, Churcher C, Hunt T, Herrero J, Loveland JE, Dunn M, Louzada S, Fu B, Chow W, Gilbert J, Austin-Guest S, Beal K, Carvalho-Silva D, Cheng W, Gordon D, Grafham D, Hardy M, Harley J, Hauser H, Howden P, Howe K, Lachani K, Ellis PJ, Kelly D, Kerry G, Kerwin J, Ng BL, Threadgold G, Wileman T, Wood JM, Yang F, Harrow J, Affara NA, Tyler-Smith C. The pig X and Y Chromosomes: structure, sequence, and evolution. Genome Res. 2016;26:130–139. doi: 10.1101/gr.188839.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Piovesan A, Antonaros F, Vitale L, Strippoli P, Pelleri MC, Caracausi M. Human protein-coding genes and gene feature statistics in 2019. BMC Res Notes. 2019;12:315. doi: 10.1186/s13104-019-4343-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Mueller JL, Skaletsky H, Brown LG, Zaghlul S, Rock S, Graves T, Auger K, Warren WC, Wilson RK, Page DC. Independent specialization of the human and mouse X chromosomes for the male germ line. Nat Genet. 2013;45:1083–1087. doi: 10.1038/ng.2705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Soh YQ, Junker JP, Gill ME, Mueller JL, van Oudenaarden A, Page DC. A gene regulatory program for meiotic prophase in the fetal ovary. PLoS Genet. 2015;11:e1005531. doi: 10.1371/journal.pgen.1005531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lesch BJ, Silber SJ, McCarrey JR, Page DC. Parallel evolution of male germline epigenetic poising and somatic development in animals. Nat Genet. 2016;48:888–894. doi: 10.1038/ng.3591. [DOI] [PubMed] [Google Scholar]
  • 17.Green CD, Ma Q, Manske GL, Shami AN, Zheng X, Marini S, Moritz L, Sultan C, Gurczynski SJ, Moore BB, Tallquist MD, Li JZ, Hammoud SS. A comprehensive roadmap of murine spermatogenesis defined by single-cell RNA-seq. Dev Cell. 2018;46:651–667. doi: 10.1016/j.devcel.2018.07.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cardoso-Moreira M, Halbert J, Valloton D, Velten B, Chen C, Shao Y, Liechti A, Ascencao K, Rummel C, Ovchinnikova S, Mazin PV, Xenarios I, Harshman K, Mort M, Cooper DN, Sandi C, Soares MJ, Ferreira PG, Afonso S, Carneiro M, Turner JMA, VandeBerg JL, Fallahshahroudi A, Jensen P, Behr R, Lisgo S, Lindsay S, Khaitovich P, Huber W, Baker J, Anders S, Zhang YE, Kaessmann H. Gene expression across mammalian organ development. Nature. 2019;571:505–509. doi: 10.1038/s41586-019-1338-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bull JJ. Evolution of Sex Determining Mechanisms. 1st ed. Wake DB, Slatkin MW, editors. Benjamin/Cummings: Menlo Park; 1983. [Google Scholar]
  • 20.Bellott DW, Page DC. Dosage-sensitive functions in embryonic development drove the survival of genes on sex-specific chromosomes in snakes, birds, and mammals. Genome Res. 2021;31:198–210. doi: 10.1101/gr.268516.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Holmlund H, Yamauchi Y, Durango G, Fujii W, Ward MA. Two acquired mouse Y chromosome-linked genes, Prssly and Teyorf1, are dispensable for male fertility. Biol Reprod. 2022; epub April 29. [DOI] [PMC free article] [PubMed]
  • 22.Ohno S, Becak W, Becak ML. X-autosome ratio and the behavior pattern of individual X-chromosomes in placental mammals. Chromosoma. 1964;15:14–30. doi: 10.1007/BF00326912. [DOI] [PubMed] [Google Scholar]
  • 23.Conway SJ, Mahadevaiah SK, Darling SM, Capel B, Rattigan AM, Burgoyne PS. Y353/B: a candidate multiple-copy spermiogenesis gene on the mouse Y chromosome. Mamm Genome. 1994;5:203–210. doi: 10.1007/BF00360546. [DOI] [PubMed] [Google Scholar]
  • 24.Cocquet J, Ellis PJ, Mahadevaiah SK, Affara NA, Vaiman D, Burgoyne PS. A genetic basis for a postmeiotic X versus Y chromosome intragenomic conflict in the mouse. PLoS Genet. 2012;8:e1002900. doi: 10.1371/journal.pgen.1002900. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kruger AN, Brogley MA, Huizinga JL, Kidd JM, de Rooij DG, Hu YC, Mueller JL. A neofunctionalized X-linked ampliconic gene family is essential for male fertility and equal sex ratio in mice. Curr Biol. 2019;29:3699–706 e5. doi: 10.1016/j.cub.2019.08.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Cocquet J, Ellis PJ, Yamauchi Y, Riel JM, Karacs TP, Rattigan A, Ojarikre OA, Affara NA, Ward MA, Burgoyne PS. Deficiency in the multicopy Sycp3-like X-linked genes Slx and Slxl1 causes major defects in spermatid differentiation. Mol Biol Cell. 2010;21:3497–3505. doi: 10.1091/mbc.e10-07-0601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Shang X, Shen C, Liu J, Tang L, Zhang H, Wang Y, Wu W, Chi J, Zhuang H, Fei J, Wang Z. Serine protease PRSS55 is crucial for male mouse fertility via affecting sperm migration and sperm-egg binding. Cell Mol Life Sci. 2018;75:4371–4384. doi: 10.1007/s00018-018-2878-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zhu F, Li W, Zhou X, Chen X, Zheng M, Cui Y, Liu X, Guo X, Zhu H. PRSS55 plays an important role in the structural differentiation and energy metabolism of sperm and is required for male fertility in mice. J Cell Mol Med. 2021;25:2040–2051. doi: 10.1111/jcmm.16116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Loytynoja A. Phylogeny-aware alignment with PRANK. Methods Mol Biol. 2014;1079:155–170. doi: 10.1007/978-1-62703-646-7_10. [DOI] [PubMed] [Google Scholar]
  • 30.Guindon S, Delsuc F, Dufayard JF, Gascuel O. Estimating maximum likelihood phylogenies with PhyML. Methods Mol Biol. 2009;537:113–137. doi: 10.1007/978-1-59745-251-9_6. [DOI] [PubMed] [Google Scholar]
  • 31.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wang D, Zhang Y, Zhang Z, Zhu J, Yu J. KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies. Genomics Proteomics Bioinformatics. 2010;8:77–80. doi: 10.1016/S1672-0229(10)60008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Pearson WR, Lipman DJ. Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988;85:2444–2448. doi: 10.1073/pnas.85.8.2444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14:417–419. doi: 10.1038/nmeth.4197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12915_2022_1338_MOESM1_ESM.xlsx (4MB, xlsx)

Additional file 1. Table of species distribution of PRSSLY homologs, including accession numbers for genomic and expression datasets used for identification of PRSSLY.

12915_2022_1338_MOESM2_ESM.docx (4MB, docx)

Additional file 2: Fig. S1. Structure and RNA-seq analysis of human, chimpanzee, and rhesus PRSSLY pseudogenes. Fig. S2. Phylogenetic analysis of PRSSLY nucleotide sequences. Fig. S3. Confirmation of X-linkage of PRSSLY in marsupials. Fig. S4. Phylogenetic analyses of PRSS family amino acid sequences. Fig. S5. Sequence conservation across PRSSLY gene sequences. Fig. S6. Analysis of synonymous (Ks) and non-synonymous (Ka) substitution rates across PRSSLY. Fig. S7. Expression of human PRSS homologs across tissues. Fig. S8. Gene expression analysis of PRSSLY in purified male germ cells and germ-cell-depleted testis. Fig. S9. Four CRISPR-induced mutations in mouse PRSSLY. Fig. S10. Testis weights of control and Prssly mutant mice. Fig. S11. Testis histology of control and Prssly mutant mice. Fig. S12. Litter sizes in Prssly mutants and controls.

12915_2022_1338_MOESM3_ESM.xlsx (23.9KB, xlsx)

Additional file 3. Table detailing evidence that PRSSLY homologs are missing in species.

12915_2022_1338_MOESM4_ESM.xlsx (33.5KB, xlsx)

Additional file 4. Ka-Ks analysis of PRSSLY and homologs.

12915_2022_1338_MOESM5_ESM.docx (58.4KB, docx)

Additional file 5. Sequences used for Ka-Ks analysis of PRSSLY and homologs.

12915_2022_1338_MOESM6_ESM.xlsx (43.3KB, xlsx)

Additional file 6. RNA-seq analyses including accession numbers and descriptions of RNA-seq datasets, transcriptomes used for mapping, and mapping results.

12915_2022_1338_MOESM7_ESM.xlsx (17.2KB, xlsx)

Additional file 7. Breeding of PRSSLY mutant mice.

12915_2022_1338_MOESM8_ESM.xlsx (13.1KB, xlsx)

Additional file 8. PCR assays across mouse Y long arm in PRSSLY mutant mice.

12915_2022_1338_MOESM9_ESM.docx (109.3KB, docx)

Additional file 9. Sequences and GenBank accession numbers for PRSSLY sequences.

Data Availability Statement

Nucleotide sequences for PRSSLY and non-Y-linked PRSSLY homologs that were assembled from genomic and/or RNA-seq data are available in the Third Party Annotation Section of the DDBJ/ENA/GenBank databases under the accession numbers TPA: BK059441-BK059443, BK059500-BK59524, and OK484381-OK484382 (see Additional File 9 for all sequences).


Articles from BMC Biology are provided here courtesy of BMC

RESOURCES