Abstract
Male germ cells share a common origin across animal species, therefore they likely retain a conserved genetic program that defines their cellular identity. However, the unique evolutionary dynamics of male germ cells coupled with their widespread leaky transcription pose significant obstacles to the identification of the core spermatogenic program. Through network analysis of the spermatocyte transcriptome of vertebrate and invertebrate species, we describe the conserved evolutionary origin of metazoan male germ cells at the molecular level. We estimate the average functional requirement of a metazoan male germ cell to correspond to the expression of approximately 10,000 protein-coding genes, a third of which defines a genetic scaffold of deeply conserved genes that has been retained throughout evolution. Such scaffold contains a set of 79 functional associations between 104 gene expression regulators that represent a core component of the conserved genetic program of metazoan spermatogenesis. By genetically interfering with the acquisition and maintenance of male germ cell identity, we uncover 161 previously unknown spermatogenesis genes and three new potential genetic causes of human infertility. These findings emphasize the importance of evolutionary history on human reproductive disease and establish a cross-species analytical pipeline that can be repurposed to other cell types and pathologies.
Research organism: D. melanogaster, Human, Mouse
eLife digest
Sperm are one of the most remarkable cells in nature, safely housing genetic information while also often moving through foreign environments in search of an egg to fertilize. Central for sexual reproduction, sperm cells of all shapes and sizes are found in animals, plants and even some species of fungi. You may be familiar with the streamlined structure of human sperm, for example, with its round head and flexible tail; but the sperm cells of fruit flies are about 300 times longer, and those found in mice have a hook-shaped head. Relatedly, the genes involved in the creation of reproductive cells often show rapid evolution, with their sequences quickly diverging between species. Due to the complexity of the network of genetic interactions taking place during sperm development, it has so far been difficult to fully isolate the ‘core program’ that governs sperm assembly and allows these cells to acquire their distinct identity. Whether this program could be conserved and shared across the tree of life, in particular, remains unclear.
In response, Brattig-Correia, Almeida, Wyrwoll et al. first conducted analyses that allowed them to pinpoint the genes that were ‘switched on’ during the formation of human, mouse and fruit fly sperm. Assessing the ‘age’ of these genes showed that a large proportion had emerged early during evolution. Shared across the three species, these deeply conserved genes were shown to play a fundamental role in sperm cells acquiring and maintaining their identity.
Further genetic experiments were conducted in fruit flies to refine these findings, highlighting a set of 161 previously unknown genes essential for sperm formation. By combining these results with genetic data from men unable to have children, Brattig-Correia, Almeida, Wyrwoll et al. were able to identify three new genes that could play a role in human infertility.
This work emphasizes how our understanding of human reproductive development can benefit from examining this process in other species, and its evolutionary history. In particular, the knowledge gained from these comparative approaches could ultimately help develop better genetic tests and treatments for human infertility.
Introduction
Understanding what defines the uniqueness of a given cell type out of the 843 predicted cellular fates in the human body is a complex and fascinating problem (Han et al., 2020). Through Conrad Waddington’s foundational work, we have come to appreciate that developmental trajectories ultimately dictate cell type identity via the establishment of specific transcriptional programs (Hermann et al., 2018). The fact that transcriptomes tend to cluster by tissue type rather than by species (Merkin et al., 2012) clearly indicates that gene expression identity can be maintained across many million years of evolutionary divergence. This echoes the modular nature of eukaryotic biological processes, whose intervening macromolecular complexes are typically built by the addition of younger components to a core block of ancient subunits (Wan et al., 2015).
The emergence of germ cells is considered one of the first cell type specializations in metazoan history (Arendt, 2008). Since the capability to undergo both sexual reproduction and gametogenesis were already present in the unicellular ancestor of all Metazoa (Sebé-Pedrós et al., 2017), the split between germ line and soma presumably provided early multicellular organisms increased robustness against mutations while minimizing genetic conflict between different cell lineages (Extavour, 2007). Yet, reproductive protein genes are some of the most rapidly evolving across the tree of life (Swanson and Vacquier, 2002), leading to substantial variation in male reproductive tissues even between closely related species (Ramm et al., 2014; Fitzpatrick et al., 2022). At the molecular level, this variation can be mainly traced back to rapid evolutionary changes in gene expression in the testis (Cardoso-Moreira et al., 2019) – likely facilitated by widespread transcriptional leakage in male germ cells (Soumillon et al., 2013) – and to the preferential emergence of new genes in this organ (Kaessmann, 2010), where they often acquire functions in spermatogenesis (Kondo et al., 2017). Although the rapid divergence of genes directly involved in reproduction has traditionally been interpreted from an adaptationist stance, relaxed selection and drift have been recently proposed to account for this pattern (Dapper and Wade, 2020). Central to this debate is the often-overlooked contribution of old genes for germ cell development and function.
We have recently observed, in a wide range of plant species, a substantial contribution of old genes to the pollen transcriptome, suggestive of an ancient transcriptional program common to plant male gametes (Julca et al., 2021). Such observation finds parallel in the concept of a metazoan germ line multipotency program (Fierro-Constaín et al., 2017), and is supported by transcriptional similarities between equivalent spermatogenic cell types across mammalian species (Lau et al., 2020; Shami et al., 2020; Murat et al., 2023). This led us to explore the possibility that, despite the rapid evolution of reproductive protein genes, the functional basis of metazoan male germ cell identity relies on an old, evolutionarily conserved genetic program that can provide relevant insight into spermatogenesis and human infertility.
To test this hypothesis, we devised an interdisciplinary research platform based on four combined approaches (Figure 1a). First, we determined, through comparison of genome sequences, the age of the gene expression program of male germ cells from three evolutionarily distant metazoan species: humans (Homo sapiens), mice (Mus musculus), and fruit flies (Drosophila melanogaster). Then, we used network science to infer the significance of deeply conserved genes within the context of the complex male germ line transcriptome. Subsequently, through developmental biology (in vivo RNAi in fruit fly testes), we defined the role of a key subset of the conserved germ cell transcriptome in male reproductive fitness. Finally, we merged this information with clinical genetics to identify new potential causes of human infertility. Overall, we show that deeply conserved genes play a prominent role in male germ cell regulation and that the disruption of this ancient genetic program leads to human reproductive disease.
Results
A more conservative estimate of the complexity of the male germ cell transcriptome
Male germ cell development is divided into three conserved stages (Navarro-Costa et al., 2020). The first is the pre-meiotic stage and corresponds to the mitotic expansion of committed precursors (spermatogonia). Meiosis defines the second stage, with the newly differentiated spermatocytes undergoing reductive division. In the third stage, the post-meiotic cells (spermatids) embark on a cytodifferentiation program that culminates in the formation of mature male gametes.
To understand to what extent male germ cell transcription quantitatively differs from that of somatic lineages, we collected previously published high-quality RNA-Seq datasets from pre-meiotic, meiotic, and post-meiotic germ cell populations, and compared them with representative somatic cell types of the primary embryonic layers: neurons (ectoderm), muscle (mesoderm), and enterocytes (endoderm; Figure 1b and Supplementary file 1). We centered our analysis on three evolutionarily distant gonochoric species with excellent genome annotations – humans, mice, and fruit flies (the last two also being well-established animal models for the functional study of human spermatogenesis) – and observed that, as previously reported (Soumillon et al., 2013; Xia et al., 2020), male germ cells have a generally more diversified transcriptome than their somatic counterparts (as measured by the percentage of the entire genome that each cell type expresses). Yet, this increased complexity was mainly restricted to extremely permissive minimum average expression cut-offs ranging from >0.01 to >0.5 transcripts per million (TPM), suggestive of a significant degree of transcriptional noise or other previously hypothesized translation-independent roles for lowly expressed transcripts (Liu and Zhang, 2020). Indeed, the use of a more suitable threshold for protein-coding transcripts (TPM >1; Thul et al., 2017) revealed that, on average, metazoan male germ cells tend to express approximately 10,000 genes (9,468±498–12,195±1144, depending on the species), corresponding to half of the entire protein-coding genome (48.0–67.9%; Figure 1b and Supplementary file 1). In both humans and mice, this range is comparable to that observed in somatic cell types such as neurons and enterocytes (53.4–53.7% and 37.3–45.6% of the genome, respectively). Only in fruit flies, a species with a smaller genome (13,947 protein-coding genes vs. 22,802 and 22,287 in humans and mice, respectively), the diversity of the male germ cell transcriptome diverged substantially from that found in representative somatic tissues (with an average of 67.9 ± 3.6% of the genome expressed in germ cells vs. 33.3 ± 12.5% in the soma). It should be noted that this percentage may be higher in fruit flies due to the presence of transcripts of somatic origin in the germ cell dataset (see Supplementary file 1). Collectively, these observations suggest that the average functional requirement of a metazoan male germ cell corresponds to the expression of approximately 10,000 protein-coding genes. Such a number brings a significant but largely overlooked functional constraint to the general notion that almost all protein-coding genes are transcribed in the male germ line.
The male germ cell transcriptome has an old evolutionary origin
We next assessed the contribution of deeply conserved genes to the male germ cell transcriptome. For this we used phylostratigraphy, a technique that determines the evolutionary age of groups of homologous genes (defined as orthogroups in our analysis, including both orthologs and paralogs) by mapping their last common ancestor in a species tree (Domazet-Loso et al., 2007). We assembled our tree based on the proteomes of 25 phylogenetically representative eukaryotic species, and assigned each orthogroup to the oldest phylogenetic node (phylostratum) it could be traced back to (Figure 1c and Figure 1—figure supplement 1). This way, phylostrata ranked 1–5 were the oldest and contained orthogroups common to all Metazoa, while phylostratum 16 contained the youngest, species-specific orthogroups. A total of 113,757 orthogroups were identified in the 25 representative eukaryotes, 85.5% of which (97,270) were species-specific when adding all phylostrata. By using TPM >1 as minimum expression cut-off (to minimize the effects of widespread, low-level transcription in the male germ line), we observed that 48.0–67.9% of the entire protein-coding genome was expressed in male germ cells, depending on the species. Despite the rapid divergence typically associated with reproduction-related genes, we found that the majority (65.2 to 70.3%) of genes expressed in male germ cells mapped to the oldest-ranking phylostrata containing orthogroups common to all Metazoa, henceforth referred to as deeply conserved genes (Figure 1d). Indeed, in all three tested species, this percentage was not significantly different from that recorded in the transcriptome of the somatic cell types (also at a threshold of TPM >1). The fraction of deeply conserved genes that were ubiquitously expressed (proxy to their involvement in cellular housekeeping processes) varied from 49.7 to 63.4% (Figure 1—figure supplement 2a, b). This strongly suggests that almost half of all deeply conserved genes expressed by male germ cells are involved in more specific roles than just the maintenance of basal cellular functions.
By summing the products of the age of all expressed genes and their expression levels at a given developmental stage - a metric known as the transcriptome age index - TAI (Domazet-Lošo and Tautz, 2010), we determined the transcriptome age of the different male germ cell stages (Figure 1e). As recently reported in mammalian spermatogenesis (Murat et al., 2023), meiotic and pre-meiotic cells across all three tested species had lower TAIs than post-meiotic cells, indicative of older transcriptomes. This trend was less obvious in the fruit fly as in this species post-meiotic transcription is largely residual (Lim et al., 2012). Collectively, we observed that, both in vertebrates and invertebrates, the male germ cell transcriptome has an old evolutionary origin that is tempered by the increased expression of younger genes at later developmental stages. Our data, indicative of a core spermatogenic program spanning more than 600 million years of evolution, significantly expand the breadth of previous observations in the mammalian lineage (Lau et al., 2020; Shami et al., 2020; Murat et al., 2023), while emphasizing the potential relevance of using conserved expression between distant animal species to gain insight into the molecular basis of human infertility.
Deeply conserved genes are central components of the male germ cell transcriptome
We next addressed the possible significance of the abundant expression of deeply conserved genes in the male germ line. For this, we focused on meiosis, a fundamental process conserved through eukaryotes. Besides the existence of an ancient meiotic toolkit required for recombination and reduction division across sexually reproducing species (Schurko and Logsdon, 2008), the mitosis-to-meiosis transition in the male germ line is also home to a transcriptional burst associated with the acquisition of male germ cell identity (Maezawa et al., 2020). The transcripts produced in this burst are mainly required for post-meiotic development, with their translation being shifted towards the later spermatogenic stages (Wang et al., 2020). Accordingly, when analyzing the expression pattern of all human and mouse genes annotated to the ‘spermatid differentiation’ Gene Ontology (GO) term (GO:0048515), we observed that only a small fraction of these was expressed primarily/exclusively after meiosis (5.7% [13 out of 228] and 8.7% [25 out of 289] in humans and mice, respectively). Similar to previous reports in fruit flies (Lim et al., 2012; Maezawa et al., 2020; Jan et al., 2017), most of the mammalian spermatid differentiation genes were found to be expressed already at the meiotic stage, with 82.9% (189 out of 228, in humans) and 77.9% (225 out of 289, in mice) being expressed in spermatocytes or in spermatocytes and spermatogonia. These results show that the meiotic transcriptome provides a particularly suitable entry-point into the genetic basis of the core spermatogenic program. Hence, we assembled protein-protein interaction (PPI) networks based on the transcriptome data of human, mouse, and fruit fly spermatocytes, where nodes represent all expressed genes and edge weights indicate the probability of the connected genes contributing to a specific function according to the STRING database (Szklarczyk et al., 2019; Figure 1f). The structure of these networks reflects the multiple genetic interdependencies responsible for cellular function, as illustrated by the characteristic clustering of functionally related genes into topologically defined modules (Choobdar et al., 2019). Network edges were filtered to remove low confidence interactions (those with a combined confidence score <0.5, Figure 1—figure supplement 3a–c), and, as before, only genes expressed at TPM >1 were included to minimize the confounding effects of widespread low-level transcription. The resulting networks contained between 7961 and 11,322 genes, depending on the species, and an average of 301,036±66,416 edges.
Consistent with phylostratigraphy, the spermatocyte PPI networks included a substantial fraction of genes conserved across all Metazoa (45.1 to 58.1% of all genes; Figure 1g). As a starting point for our analyses, we set out to determine to what extent the previously reported increased connectivity of conserved genes in PPI networks (Wuchty et al., 2003; Brown and Jurisica, 2007) is maintained in a cell type characterized by rapid rates of evolution. For comparison purposes, we assembled, as before, PPI networks based on the transcriptome data of human, mouse, and fruit fly enterocytes (a somatic cell type with a comparably young transcriptome; Figure 1—figure supplement 2b). These somatic cell networks, albeit smaller due to lower transcriptome diversity in enterocytes (Figure 1b), were comparable in size to those of the spermatocytes, containing between 3,310 and 9,837 genes, depending on the species, and an average of 203,296±117,545 edges (Figure 1—figure supplement 3a–d). Similarly to what we observed in enterocytes, deeply conserved genes in the spermatocyte PPI networks were significantly more connected than their non-conserved counterparts (higher degree centrality), and their interactors were themselves more connected than those of non-conserved genes (higher page rank, Kolmogorov-Smirnov test for the two analyses; Figure 1h and Figure 1—figure supplement 4). Although the connectivity properties of the spermatocyte conserved genes were lower than those expressed in enterocytes, they were still salient enough for machine learning classifiers to predict reliably if a gene was conserved just based on spermatocyte network features. More specifically, using a Random Forest classifier, the AUC (area under the receiver operating characteristic curve) score was 0.74, 0.75, and 0.82 in the human, mouse, and fruit fly spermatocyte datasets, respectively (Figure 1i). Classification performance was equally high when using a different supervised learning algorithm (linear Support Vector Machine) and an alternative performance metric (precision and recall; Figure 1—figure supplement 5).
To address a possible ascertainment bias associated with more available information on conserved genes in the STRING database, we tested to what extent degree centrality and page rank were affected by network rewiring. In this approach, a variable percentage (20 to 100%) of all edges is randomly shuffled across the network, thus diluting any latent biases in the datasets. We observed that both network properties remained higher in conserved genes even when 80% of all edges were rewired in the spermatocyte PPI networks (Figure 1—figure supplement 6). Such result suggests that the increased connectivity of conserved genes is mostly driven by their intrinsic properties, rather than by differences stemming from the amount of source data available. Based on these analyses, we conclude that despite the testis being a rapidly evolving organ and a preferential birthplace for new genes, deeply conserved genes remain central components in the male germ cell transcriptome of evolutionarily distant species. Accordingly, we decided to explore this increased connectivity as a means of gaining insight into the conserved molecular basis of metazoan spermatogenesis.
A core component of the ancient genetic program of spermatogenesis
We have previously shown that it is possible to simplify the complexity of biological networks by removing redundancy and only retaining the most relevant interactions that sustain the dynamics of the system (Gates et al., 2021). To do so, one can extract from a network its distance backbone, i.e., the small subset of metric edges that is sufficient to compute all shortest paths in the original network (Simas et al., 2021). This technique retains the minimum number of required edges to preserve all network nodes, as well as all of the natural hierarchy of connections and community structures (Brattig Correia et al., 2023). In the context of biological networks, such distance backbone uncovers the subgraph of gene interactions more likely to convey the key functional characteristics of the gene expression program (Ren et al., 2018).
By extending orthology to the backbone, we have developed a new type of multilayer network: the orthoBackbone. It expands on the concept of interologs (Walhout et al., 2000; Sun and Kim, 2011), as it consists of only the edges that connect the same pair of orthologs in the network backbone of different species (Figure 2a). Since this technique excludes all backbone edges (and corresponding nodes) that do not involve the same orthologs across multiple species, it ultimately identifies a subset of conserved genes whose functional interactions at the protein level are part of the distance backbone across evolution. In this regard, the orthoBackbone offers access to what can be considered core features of the genetic program of a given cellular state. Importantly, the use of fruit flies (an animal without male crossing over and recombination) in this inter-species approach significantly reduces the contribution of the ancient meiotic toolkit to the orthoBackbone, thus emphasizing functional interactions associated with spermatogenesis but not with meiosis per se.
Figure 2. Functional analysis of the conserved genetic program of male germ cells.
(a) The orthoBackbone methodology. First, the most relevant associations are determined by defining the individual metric backbones (based on shortest paths) of protein-protein interaction (PPI) networks from different species. Of the backbone edges (in green), those connecting the same orthologous genes across the different species are selected as part of the evolutionarily conserved orthoBackbone (in red, with asterisks). In the case of a one-to-many conserved edge relationship, inclusion depends on at least one of the multiple edges being part of the backbone. Letters depict different genes, B’ and B’’ are paralogs, and numbers indicate distances between genes. (b) The orthoBackbone represents less than 3% of all functional interactions (edges) in the spermatocyte PPI networks. (c) The orthoBackbone connects >70% of all conserved genes expressed in spermatocytes. Gene conservation (across Metazoa) was defined based on eggNOG orthogroups. (d) orthoBackbone genes are preferentially involved in gene expression regulation compared with other equally conserved genes. Charts represent the top 5 terms of an unfiltered gene ontology (GO) enrichment analysis for biological processes of the human male germ cell orthoBackbone. False discovery rate ≤0.05; see Figure 2—figure supplement 1 for the expanded GO analyses. (e) The male germ cell orthoBackbone reveals a core set of 79 functional interactions between 104 gene expression regulators of spermatogenesis. Solid dots indicate genes with testis specific/enriched expression. Post-transc. reg.: Post-transcriptional regulation; RNA mod.: RNA modification. (f) Conserved mitosis-to-meiosis transcriptional burst genes were defined based on their upregulation at mammalian meiotic entry and/or downregulation at meiotic exit. In both cases, genes also had to be expressed in insect spermatogenesis. Green lines link orthologs (920 in fruit flies, 797 in humans and 850 in mice) based on eggNOG orthogroups. Expression level in normalized absolute log(FPKM +1). (g) An in vivo RNAi screen in fruit fly testes uncovers the functional requirement of 250 conserved transcriptional burst genes (27.2%) for male reproductive fitness. Silencing of the 920 genes was induced at the mitosis-to-meiosis transition using the bam-GAL4 driver. Color-code for the recorded testicular phenotype as in “h”. Results reflect a total of four independent experiments. Threshold for impaired reproductive fitness (red horizontal line) corresponds to a 75% fertility rate (>2 standard deviations of the mean observed in negative controls). (h) Conserved transcriptional burst genes are required for diverse spermatogenic processes. Testicular phenotypes of the 250 hits were defined by phase-contrast microscopy and assigned to five classes based on the earliest manifestation of the phenotype. (i) Transcriptional burst genes reveal 161 new, evolutionarily conserved regulators of spermatogenesis (64.4% of all hits, homologous to 179 and 187 in humans and mice, respectively). Phenotype novelty was defined by lack of previously published evidence of a role in male fertility/spermatogenesis in humans, mice or fruit flies. (j) All data acquired in this screen are freely available in the form of an open-access gene browser (Meiotic Navigator).
Figure 2—figure supplement 1. Top 10 terms of an unfiltered gene ontology (GO) enrichment analysis of the human male germ cell orthoBackbone.
We observed that the orthoBackbone drastically reduced the size of the spermatocyte PPI networks, retaining only 1.7 to 2.7% of all edges, depending on the species (Figure 2b). This almost residual fraction of edges nevertheless contained 70% of all deeply conserved genes in the network (Figure 2c and Supplementary file 2), further illustrating that strong functional relationships between deeply conserved genes are also maintained in the male germ line. GO term enrichment analysis for biological processes revealed that orthoBackbone genes were preferentially involved in gene expression and protein regulation, in contrast with the other (not retained) conserved genes that were mainly associated with cell signaling pathways (Figure 2d and Figure 2—figure supplement 1). To better understand to what extent the orthoBackbone can provide insight into cell-type specific processes, we determined the degree of similarity between the spermatocyte and enterocyte orthoBackbones. We noted a modest overlap between both subnetworks, with just 16.2% to 18.0% of common edges depending on the species (against background expectations of 33.6 to 61.5%, based on Jaccard similarity index scores between the PPI networks of both cell types; Figure 1—figure supplement 3d).
Using GO annotations, we further selected spermatocyte orthoBackbone interactions involving gene expression regulators previously linked to spermatogenesis. By doing so, we obtained a subset of 79 functional interactions between 104 human genes (out of the total of 3,596 in the orthoBackbone; Figure 2e). These were ascribed to a wide gamut of regulatory processes and involved noteworthy spermatogenesis genes such as RFX2 (transcription), CDYL (chromatin remodeling) and BOLL (translation), among others (Wu et al., 2016; Liu et al., 2017; Shah et al., 2010). Of the 104 genes, 34 were testis-specific/enriched while 60 had low tissue specificity, consistent with previous observations that cell identity relies on the integration of cell type-specific genes with more broadly expressed regulators (Kotliar et al., 2019). Based on its conservation in the orthoBackbone of evolutionarily distant species and on the significant spermatogenic role of the intervening genes, we posit that this subnetwork of 79 functional interactions involving 104 genes likely represents a core component of the conserved genetic program of spermatogenesis. This number fits well with previous estimates, based on expression trajectories of 1:1 orthologous genes, of an ancestral program of amniote spermatogenesis constituted by 389 genes (Murat et al., 2023).
Male germ cell identity has a broad functional basis
We next set out to determine the functional consequences of disrupting the spermatocyte orthoBackbone. Since the latter represents a still sizeable interaction subnetwork consisting of more than 3,000 genes, we decided to select candidate genes likely to affect the acquisition and maintenance of male germ cell identity. For this, we again took into account how the mitosis-to-meiosis transcriptional burst activates the expression of spermatogenesis genes – a process that has been likened to that of cellular reprogramming (Guo et al., 2018; Alavattam et al., 2019). We reasoned that genes that are upregulated at meiotic entry and/or downregulated at meiotic exit (thus part of the transcriptional burst) likely represent instructive elements for the male germ cell state, hence preferential routes to tamper with this cellular identity.
Through differential gene expression analysis, we identified, in human and mouse spermatocytes, all expressed genes that shared a similar upregulation at meiotic entry and/or downregulation at meiotic exit. Of these 970 mammalian differentially expressed genes (DEGs), we selected as candidates for functional assessment only those that were also expressed in fruit fly germ cells (Figure 2f). We did not take into account differential expression in the latter species, since the largely residual levels of post-meiotic transcription in fruit fly spermatogenesis thwarts direct comparisons with the mammalian system (Lim et al., 2012). The resulting 920 fruit fly genes (homologous to 797 and 850 in humans and mice, respectively) were silenced specifically at the mitosis-to-meiosis transition by Drosophila in vivo RNAi, using the well-established bam-GAL4 driver (White-Cooper, 2012). Of these, a total of 250 (27.2%) were essential for male fertility, as their silencing resulted, upon mating with wildtype females, in egg hatching rates below the cut-off of 75% (>2 standard deviations of the mean observed in negative controls: 91.6 ± 8.5%; Figure 2g). We observed that 76.0% of the hits (190 out of 250) were part of the orthoBackbone, compared with 60.0% (402 out of 670) in genes without any obvious effect on male reproductive fitness upon silencing, with this difference being statistically significant (two-sided Fisher’s exact test p<0.0001). Cytological analysis of all 250 genes by testicular phase-contrast microscopy in the RNAi males revealed diverse origins for the infertility phenotype, with the earliest manifestation of cellular defects ranging from the pre-meiotic to the mature gamete stage (Figure 2h). By exploring publicly available information (see Materials and methods), we determined that 161 (64.4%) of all hits had never been previously associated with male reproduction in any species (Figure 2i). Accordingly, these 161 new deeply conserved spermatogenesis genes (homologous to 179 and 187 in humans and mice, respectively) represent both a significant advance in our understanding of the genetic basis of male germ cell development, and a valuable resource to explore from a precision medicine perspective. To facilitate open access to this information, we made all data available in the form of a user-friendly gene browser (https://pages.igc.pt/meionav; Figure 2j). Overall, by merging our results with previously published data we conclude that at least 43.5% of the mitosis-to-meiosis transcriptional burst (our 250 hits +150 previously reported male fertility genes that gave a negative result in our assay) is required for a diverse range of germ cell functions. This illustrates the pervasive influence of the mitosis-to-meiosis transition on multiple facets of the spermatogenic program and argues for a broad functional basis underlying male germ cell identity.
An ancient regulator of human spermatogenesis
One of our 161 newly identified spermatogenesis genes – the Drosophila RING finger protein 113 (dRNF113), a spliceosomal component also known as mdlc (Carney et al., 2013) – emerged as a particularly interesting germ cell regulator from a clinical perspective. More specifically, by analyzing our in-house whole exome database containing sequencing data from 74 cases of human male meiotic arrest, we identified an infertile man harboring a homozygous loss of function (LoF) variant in a human paralog (RNF113B). This frameshift variant c.556_565del;p.(Thr186GlyfsTer119) leads to the abrogation of the protein’s two functional domains (a C3H1-type zinc finger and a RING finger) and to its truncation (Figure 3a and Figure 3—figure supplement 1a). The identified man (M1911) is part of a consanguineous family of Middle-Eastern ancestry and shared the homozygous state of the RNF113B LoF variant with his equally infertile brother, but not with his fertile heterozygous brother (Figure 3—figure supplement 2a–c and Materials and methods). We excluded the possibility that this genetic variant is an indicator of another, nearby variant that is homozygous in the affected individuals and represents the actual cause of infertility in both men (Materials and methods). Remarkably, the results from the testicular biopsy of M1911 revealed an equivalent meiotic arrest phenotype to that observed in dRNF113-silenced fruit flies. Indeed, spermatocytes were, by far, the predominant cell type in the male gonads of both species: they were the most advanced cell stage observed in 89.0% of all assessed human seminiferous tubules (vs. 9% in controls) and occupied an average of 64.1% of the entire testis area in fruit flies (vs. 12.3% in controls; Figure 3b–c and Figure 3—figure supplement 1b–c). Early (round) spermatids were practically absent in both species, with cellular debris accumulating in the post-meiotic region of the fruit fly gonad. The similar reproductive impairment phenotypes recorded in both species highlights the often-complex relationship between tissue specificity and functional significance. In insects, dRNF113 is a broadly expressed gene essential for organismal viability (Carney et al., 2013), whose particular spermatogenic function was uncovered by germ cell-specific silencing. Both roles were functionally uncoupled in primates via the existence of two RNF113 copies, with the correctly spliced form of RNF113B being mainly testis-specific (Szcześniak et al., 2011). The existence of an independent duplication event in the rodent lineage offered us the opportunity to further explore this concept. Compared with its human ortholog (RNF113B), the mouse duplication (Rnf113a2) is broadly expressed, similarly to Rnf113a1, human RNF113A and fruit fly dRNF113. Yet, the generation of a whole-body Rnf113a2 knockout (KO) mouse revealed a comparable phenotype to that of the human RNF113B LoF variant: severe spermatogenic impairment without any overt somatic defects. When compared to wildtype littermates, male homozygous Rnf113a2KO/KO mice were sterile and had visibly smaller testes that were largely devoid of germ cells (SCO: Sertoli cell-only phenotype) except for the rare presence of spermatogonia and spermatocytes (Figure 3d and Figure 3—figure supplement 1d–f).
Figure 3. Deeply conserved regulators of human spermatogenesis.
(a) Similar domain structure of the RNF113 proteins in humans (RNF113B), fruit flies (dRNF113), and mice (Rnf113a2). All contain a C3H1-type zinc finger and a RING finger domain. Numbers indicate amino acid residue position. (b) Human RNF113B is required for meiotic progression past the primary spermatocyte stage. Testicular histology of M1911 [RNF113B loss of function (LoF) variant] and of a control sample with normal spermatogenesis. See Figure 3—figure supplement 1b for phenotype quantification. Arrowheads: primary spermatocytes; arrows: spermatids. Scale bars: 100 μm (overview), 50 μm (insets), and 10 μm (meiotic region). (c) Silencing fruit fly dRNF113 results in meiotic arrest. Phase-contrast microscopy. See Figure 3—figure supplement 1c for phenotype quantification. Arrowheads: primary spermatocytes; asterisks: early (round) spermatids; arrows: late (elongating) spermatids; sv- seminal vesicle. Scale bars: 50 μm (whole testis) and 20 μm (meiotic region). (d) Mouse Rnf113a2 is essential for spermatogenesis. Testes of whole-body homozygous Rnf113a2 knockout mice (Rnf113a2KO/KO) are essentially devoid of germ cells, with the rare occurrence of meiotic and pre-meiotic stages. The testicular histology of a wildtype littermate control (Rnf113a2WT/WT) is presented for comparison (normal spermatogenesis). See Figure 3—figure supplement 1d–g for additional data. Arrowheads: primary spermatocytes; arrows: spermatids. Scale bars: 50 μm (overview) and 20 μm (intratubular region). (e) Human RNF113B is predominantly expressed at meiotic entry. Data analyzed from our recently published single cell RNA-Seq atlas of normal spermatogenesis (Di Persio et al., 2021). Usg.: undifferentiated spermatogonia; Dsg.: differentiated spermatogonia / preleptotene; Lp.: leptotene; Zg.: zygotene; Pc.: pachytene; Dp.: diplotene; M: meiotic divisions; Rsp.: round spermatids; Esp.: elongating spermatids. (f) The nuclear levels of the fruit fly dRNF113 protein increase at meiotic entry. Images are maximum projections of the entire nuclear volume. Spermatocytes correspond to late prophase I cells. Dotted lines delimit the nuclear envelope (as assessed by fluorescent wheat germ agglutinin). Scale bar: 5 μm. a.u- arbitrary units. ****p<0.0001 (unpaired t-test). (g) RNF113B is required for normal gene expression during human spermatogenesis. Differential gene expression (DGE) analysis of RNA-Seq data obtained from testicular biopsies of M1911 (RNF113B LoF variant, left and right testis) and of three controls with normal spermatogenesis. Down and upregulated genes in blue and red, respectively. orthoBackbone differentially expressed genes (DEGs) are outlined. FC: fold change. FDR: false discovery rate. Edge disruption corresponds to the number of orthoBackbone edges containing at least one DEG. (h) dRNF113 regulates gene expression in the fruit fly male gonad. Whole testes samples (in triplicate) in both experimental conditions. (i) Network of functional associations between orthoBackbone genes downregulated both in the RNF113B LoF and dRNF113 RNAi. Node size indicates the result of the page rank metric in the spermatocyte protein-protein interaction network (measure of the connectivity of interacting genes), and color specifies if the gene has a known role in spermatogenesis (in any species). Testicular phenotype of men affected by variants in HSPA2 and KPNA2 (red nodes) are depicted in “j”. Edge thickness indicates STRING functional association scores, and color specifies the main source of data for the associations. (j) LoF variants in the orthoBackbone genes HSPA2 and KPNA2 are potentially associated with human male infertility. Testicular histology of individuals M2190 and M2098 (HSPA2 and KPNA2 variants, respectively) reveals a complete loss of germ cells (Sertoli cell-only phenotype). Arrowheads: Sertoli cells. Scale bars: 100 μm (overview), 50 μm (insets), and 10 μm (intratubular region).
Figure 3—figure supplement 1. The RNF113 proteins are required for male germ cell development across vertebrate and invertebrate species.
Figure 3—figure supplement 2. The loss of function variant in RNF113B detected in individual M1911.
Figure 3—figure supplement 3. Silencing the spliceosome component Prp19 in the fruit fly testis.
Figure 3—figure supplement 4. The loss of RNF113B has a considerable impact on the conserved genetic program of spermatogenesis.
Figure 3—figure supplement 5. Association of HSPA2 loss-of-function variants and the fruit fly Hsc70-1 RNAi with male infertility.
Figure 3—figure supplement 6. Association of KPNA2 loss-of-function variants and the fruit fly dKPNA2 RNAi with male infertility.
Strengthening our interest in the RNF113 genes was their possible involvement with male germ cell identity. By analyzing our recently published single-cell RNA-Seq dataset of normal human spermatogenesis (Di Persio et al., 2021), we observed that RNF113B was predominantly expressed at the mitosis-to-meiosis transcriptional burst, peaking at the diplotene stage of prophase I (Figure 3e). This mirrored the protein localization pattern of fruit fly dRNF113, characterized by a substantial nuclear accumulation in primary spermatocytes (Figure 3f). Also in mice, Rnf113a2 was expressed at the transcriptional burst, with an earlier expression in undifferentiated spermatogonia being the likely cause of the slightly more severe histological phenotype of the Rnf113a2KO/KO mice (Figure 3—figure supplement 1g). Consistent with their role in gene expression, the analysis of M1911’s testicular transcriptome via RNA-Seq revealed the deregulation of 21.7% of all expressed genes (Figure 3g). Notably, the deregulation detected in M1911’s testicular transcriptome had a clear impact on the spermatocyte orthoBackbone, with 30.0% of its network edges being disrupted (i.e. containing at least one deregulated gene). A similar effect was patent in the dRNF113 RNAi, with 20.3% of the fruit fly testicular transcriptome being deregulated, resulting in the disruption of 26.3% of the orthoBackbone (Figure 3h). For comparison, the silencing of Prp19, another transcriptional burst gene and spliceosomal component that RNF113 proteins associate to Gatti da Silva et al., 2019, had a lower effect on the orthoBackbone with 15.2% of disrupted edges (significantly lower than in the dRNF113 RNAi; two-sided Fisher’s exact test P<0.0001), despite a similar meiotic arrest phenotype (Figure 3—figure supplement 3a-d). Based on the above, we conclude that the RNF113 genes have retained a key role in spermatogenesis for more than 600 million years of evolution. Furthermore, they provided us the means to interfere with a sizable fraction of the male germ cell orthoBackbone in humans and fruit flies – an advantage we next explored from a clinical perspective.
The orthoBackbone as an ancillary tool in clinical genetics
Male infertility is a complex human disease with a poorly defined genetic component. This contributes to a low causative diagnostic yield (typically just 30%), and to a paucity of clinically validated genes (currently just 104, in contrast with the more than 700 already associated with other clinical disorders such as intellectual disability [Houston et al., 2021; Vissers et al., 2016]). Since male infertility affects up to 12% of all men (Agarwal et al., 2015), addressing such knowledge gap is an issue of clear medical importance. To narrow the gap, we explored the possibility that spermatogenesis is particularly sensitive to disturbances in the male germ cell orthoBackbone. Thus, we harnessed the sizeable effect of RNF113B on this network as a means of identifying additional genetic causes of human infertility. Indeed, we noted that 38.0% (30 out of 79) of the functional interactions that define our previously identified core component of the conserved genetic program of spermatogenesis were disrupted in M1911 (i.e. these interactions contained at least one differentially expressed gene; Figure 3—figure supplement 4). In particular, we observed that 19.2% (20 out of 104) of this gene subset was found to be downregulated, compared with 10.0% (1685 out of 16,824) of downregulated genes in the testicular transcriptome of M1911, with this difference being statistically significant (two-sided Fisher’s exact test P=0.0046; Figure 3g). Therefore, we posited that the similar spermatogenic impairment recorded in the human RNF113B LoF variant and on the fruit fly dRNF113 RNAi ultimately reflected the downregulation, in both species, of a common set of orthoBackbone genes.
By defining the overlap between differentially expressed orthoBackbone genes in the testicular transcriptome of M1911 and of the dRNF113 RNAi, we identified 61 conserved human genes that were similarly downregulated in both species. These formed a connected network (based on STRING data) suggestive of their functional co-involvement in related biological processes (Figure 3i). Of the 61 genes, 27 had already been linked to male germ cell defects in different species: four were clinically validated male infertility genes (CDC14A, CFAP91, DNAI1 and DNAI2), and the other 23 had been previously reported in animal models (see Materials and methods). The fact that among the latter was BOLL, one of the oldest known metazoan gametogenesis genes (Shah et al., 2010), was a particularly noteworthy observation. We next tested if these 27 genes could be used to identify new genetic causes of human infertility. For this, we analyzed whole-exome sequencing data of 1021 azoospermic men from the MERGE cohort (Wyrwoll et al., 2020). Filtering these exomes for LoF variants in the 27 selected orthoBackbone genes revealed two new human male infertility candidate genes: HSPA2 and KPNA2.
HSPA2 – heat shock protein family A member 2 – is a molecular chaperone of the highly conserved 70 kDa heat shock protein (HSP70) gene family. HSP70 members are involved in cellular proteostasis from bacteria to human, with the mouse and fruit fly HSPA2 homologs (Hspa2 and Hsc70-1 to Hsc70-5, respectively) being required for meiotic progression past the primary spermatocyte stage (Eddy, 1999; Azuma et al., 2021). Indeed, we observed that the silencing of Hsc70-1 in the fruit fly testis, also with the bam-GAL4 driver, resulted in an equivalent meiotic arrest to that of dRNF113, strongly suggesting that the misexpression of this chaperone is a central element of the latter phenotype (Figure 3—figure supplement 5a–b). Two different heterozygous LoF variants were detected in our male infertility cohort, with this gene having a predicted autosomal-dominant inheritance (Figure 3—figure supplement 5c–f and Materials and methods). The first variant, detected in individual M1678, is the early stop-gain variant c.175C>T;p.(Gln59Ter) that truncates more than 90% of the protein, likely leading to nonsense-mediated decay. There is only one individual listed in the gnomAD database with this variant (minor allele frequency = 0.0004%). The second, identified in individual M2190, is the frameshift variant c.1806dup;p.(Glu603ArgfsTer81) which affects the distal end of the protein’s nucleotide-binding domain. This variant is not listed in the gnomAD database. Histopathological analysis of M2190’s testicular tissue revealed a SCO phenotype in 259 assessed seminiferous tubules, a likely aggravation of the extensive cell death accompanied by tubule vacuolization reported in Hspa2 mutant mice (Dix et al., 1996; Figure 3j).
KPNA2 – karyopherin subunit alpha 2 – is a nuclear importin that functions as an adapter protein in nucleocytoplasmic transport. Its mouse ortholog (Kpna2) is required for the nuclear accumulation of Hop2 (Ly-Huynh et al., 2011), a conserved regulator of meiotic progression from yeast to mammals (Leu et al., 1998; Petukhova et al., 2003). The main function of Hop2 is to repair meiotic DNA double-strand breaks (DSBs), with the corresponding mouse mutant being characterized by a primary spermatocyte arrest coupled to extensive cell death. Since male fruit flies dispense with meiotic DSBs (John et al., 2016), the inclusion of KPNA2 in the male germ cell orthoBackbone seems counter-intuitive. Yet, Drosophila importin alpha 2 (dKPNA2, also known as Pendulin) is required for post-meiotic development (Mason et al., 2002), with its silencing specifically at meiotic entry being associated with aborted spermiogenesis (Figure 3—figure supplement 6a–b). This post-meiotic function might also be present in its mammalian orthologs, as suggested by the nuclear localization of Kpna2 in elongating mouse spermatids (Ly-Huynh et al., 2011). We detected two heterozygous KPNA2 LoF variants in our cohort of infertile men, with this gene also having a predicted autosomal-dominant inheritance (Figure 3—figure supplement 6c–g and Materials and methods). The splice-site variant (c.667–2A>G;p.?) in individual M1645 is predicted to disrupt the correct splicing of intron 6 due to the loss of an acceptor site. Segregation analysis revealed the de novo occurrence of this variant, strongly supporting both its pathogenicity and our recent results, pointing to the important contribution of heterozygous de novo mutations to male infertility (Oud et al., 2022; Wyrwoll et al., 2023). The other KPNA2 LoF variant, in individual M2098, is the frameshift variant c.1158_1161del;p.(Ser387ArgfsTer14) which affects 27% of the protein, including its minor nuclear localization signal binding site. Both KPNA2 variants are not listed in the gnomAD database. The available testicular histopathology report for the latter individual lists a SCO phenotype as the cause of the azoospermia, again suggestive of the possible deterioration of an initially meiotic phenotype (Figure 3j).
Based on three in silico prediction tools (Materials and methods), both HSPA2 and KPNA2 can represent haploinsufficient genes and/or have autosomal dominant inheritance, with the evidence being stronger for KPNA2. Together with the identified de novo occurrence of one of its LoF variants, this makes KPNA2 a strong novel autosomal-dominant candidate gene while more data is still needed before drawing similar conclusions on HSPA2.
In summary, by experimentally disrupting the male germ cell orthoBackbone across evolutionarily distant species, we were able to uncover two new candidate genes for human infertility (HSPA2 and KPNA2) in addition to RNF113B, affecting a combined total of five individuals. This successful merger between basic and clinical research highlights the advantages of comparative biology as means of dampening inter-species differences in reproductive physiology, while providing a conceptual framework for a more efficient prioritization of clinically relevant genetic variants in human reproductive disease.
Discussion
In summary, we have identified a conserved male germ cell gene expression program, spanning more than 600 million years of evolution, that can provide clinical insight into the molecular basis of human infertility. We estimate the average functional requirement of a metazoan male germ cell to correspond to the expression of approximately 10,000 protein-coding genes, the majority of which are deeply conserved. Indeed, despite the testis being a rapidly evolving organ and frequent birthplace of genetic novelty, deeply conserved genes are central components of the male germ cell transcriptome, with younger genes being preferentially expressed in late spermatogenesis. We further pinpoint 79 functional interactions between 104 transcriptional regulators that, owing to their salient network properties across different species, likely define a core component of the ancient genetic program of spermatogenesis. By functionally interfering with 920 deeply conserved genes associated with male germ cell identity, we uncover 161 previously unknown spermatogenesis genes required for a broad range of germ cell functions, as well as three new potential genetic causes of human infertility.
We propose that the transcriptional identity of metazoan male germ cells is built around a relatively small network of deeply conserved gene interactions with an overarching functional impact. These observations may provide a unifying genetic basis for the deep conservation of fundamental germ cell biological processes such as sperm motility and gamete fusion (Speer et al., 2021; Moi et al., 2022). The existence of a shared genetic identity in metazoan spermatocytes can be regarded as a ramification of an ancestral multipotency program already present in germ line precursor cells (Fierro-Constaín et al., 2017). Indeed, even in species with divergent germ line segregation strategies, conserved functional interactions at the post-translational level are required for primordial germ cell specification (Perillo et al., 2022; Colonnetta et al., 2022). It should be noted that our conclusions were based on a small fraction of the diversity of all metazoan species, as a result of data availability limitations and our focus on well-characterized gonochoric animal models of human spermatogenic impairment. The latter led to the exclusion of established animal models such as C. elegans (hermaphrodites) and zebrafish (juvenile hermaphrodites) since it is largely unknown to what extent their male germ cell transcriptome may deviate from the gonochoric program (by retaining oogenesis-related characteristics, for example). The advent of more comprehensive gene expression datasets from less-studied organisms will surely provide a more encompassing context to cross-species comparisons of this nature.
The actual benefit of comparative biology for the identification of new genetic causes of human disease is often a contentious topic (Pound and Ritskes-Hoitinga, 2018). Central to this debate is the moderate success in translating animal data to the clinical setting, coupled with the fact that the vast majority of human genetic variants are not shared with other species (Benton et al., 2021). By focusing on the deep evolutionary past of human spermatogenesis, we have identified 79 functional interactions that likely represent a core component of the conserved genetic program of male germ cells, and 179 novel functionally validated candidate genes. Exploring to what extent these core functional interactions reflect direct protein-protein contacts is an important area for future research. In this regard, the recent development of robust in silico predictive systems, such as AlphaFold-Multimer, will greatly facilitate the implementation of such studies (Evans et al., 2021).
The translational application of our cross-species platform in human reproductive healthcare has so far uncovered three new genes associated with human male infertility (RNF113B, HSPA2, and KPNA2). The assumed autosomal dominant inheritance of HSPA2 and KPNA2 awaits further confirmation by additional data from independent cohorts and segregation analyses, but stands in line with other well-established male infertility genes such as DMRT1, NR5A1, and SYCP2 (Houston et al., 2021). Also, this information can potentially be harnessed in other clinically relevant contexts: the spermatocyte orthoBackbone can be further explored to pinpoint key pathways that may facilitate the reconstitution of meiotic entry in vitro, as well as to maximize the effectiveness of exome sequencing approaches in infertile men through the inclusion of the orthoBackbone genes in prioritization lists for disease-causing variants. Conversely, future iterations of our analytical pipeline can be specifically tailored for the identification of newly emerged genes that may have acquired key roles in reproductive processes, such as those involved in species-specific gamete recognition (Raj et al., 2017).
Taken together, our results emphasize the often-overlooked contribution of evolutionary history to human disease and illustrate how interdisciplinary research can significantly expand our knowledge of fundamental cellular processes. Accordingly, all the code required for repurposing our analytical pipeline to other cell types and pathologies is available as an open-access resource at https://github.com/rionbr/meionav (copy archived at Brattig Correia, 2024). These resources will likely contribute to a renewed appreciation of comparative biology in the medical field.
Materials and methods
Phylostratigraphic analysis
For humans, mice and fruit flies, published RNA-Seq data of different stages of spermatogenesis and of representative cell types of the three primary embryonic layers (Supplementary file 1) were downloaded from the Sequence Read Archive. Data were checked for quality control and preprocessed by trimming adaptor sequences (Trim galore!; v0.5.0; Babraham Bioinformatics). Gene expression was quantified as transcripts per million (TPM) using Salmon v0.14.1 (Patro et al., 2017). Transcript-level information was aggregated at the gene level, thus minimizing the potential issue of transcript diversity. Orthogroups (set of homologous genes derived from the last common ancestor) were defined based on the proteomes of 25 species representing key phylogenetic positions (Figure 1—figure supplement 1), and were assembled using OrthoFinder v2.4.0 (Emms and Kelly, 2019), with DIAMOND v0.9.24.125 as aligner under default settings (Buchfink et al., 2015). A species tree reflecting the current consensus of the eukaryotic phylogeny (Dunn et al., 2008; Fairclough et al., 2013; Laumer et al., 2019) served as the basis for the phylostratigraphic analysis (Figure 1—figure supplement 1). Each orthogroup was assigned to a phylostratum (node) by identifying the oldest clade found in the orthogroup (Domazet-Loso et al., 2007), using ETE v3.0 with the ‘get_common_ancestor’ option. Phylostrata were assigned a node number, ranging from 1 (oldest) to 16 (youngest, species-specific). Only genes with a minimum average TPM >1 were considered expressed. After mapping all expressed genes to the phylostratum containing their corresponding orthogroup, the distribution of the transcriptome allocated to phylostrata 1–5 (orthogroups common to all metazoa) was compared, in a pair-wise manner, between germ cells and soma using the Mann–Whitney U test as implemented in the SciPy python package (Virtanen et al., 2020). The transcriptome age index (TAI) of the germ cell samples was calculated by dividing the product of each gene’s TPM value and node number by the sum of all TPM values (Domazet-Lošo and Tautz, 2010). Higher TAI values represent younger transcriptomes.
Protein-protein interaction (PPI) network construction
To assemble the spermatocyte PPI networks based on transcriptome data, previously published RNA-Seq datasets of purified human and mouse spermatocytes and of the spermatocyte-enriched median region of the fruit fly testes were used to identify all expressed genes (see Supplementary file 1 for the list of all selected datasets). Salmon v0.14.1 was used to quantify gene expression levels, and genes were considered expressed based on a minimum average expression level of TPM >1. All potential interactions between expressed genes were retrieved from the STRING v11 database (Szklarczyk et al., 2019). For this, we used the combined confidence score, which corresponds to the estimated likelihood of a given association being true, given the underlying evidence. To remove low-confidence interactions, only PPIs with a STRING combined confidence score ≥0.5 were included (Figure 1—figure supplement 3a–c). A similar approach was employed to assemble enterocyte PPI networks. A multilayer network representation was employed to define orthologous relationships, with each layer corresponding to a particular species and nodes belonging to the same orthogroup being connected across layers. Orthologous connections were established using the EggNOG v5 database at the metazoan level (Huerta-Cepas et al., 2019) and a one-to-many relationship was employed when identifying orthologs between network layers. A variable percentage of all spermatocyte network edges was randomly shuffled to test for possible ascertainment bias in conserved genes (Figure 1—figure supplement 6).
Backbone and orthoBackbone computation
The initial step to define the male germ cell orthoBackbone was the extraction of the metric backbone of each species’ spermatocyte PPI network (i.e. each layer of the multilayer network). The metric backbone is the subgraph that is sufficient to compute all shortest paths in the network, thus removing edges that break the triangle inequality (and are therefore redundant in regards to the shortest paths). Importantly, the metric backbone retains all metric edges while preserving all network nodes, the natural hierarchy of connections and community structures (Simas et al., 2021; Brattig Correia et al., 2023). Considering that a set of the PPIs in each layer are evolutionarily conserved, we computed the orthoBackbone: a single shared subnetwork obtained by collapsing the metric backbone of all layers according to the orthologous relationships. More precisely, the orthoBackbone corresponds to a subgraph of the metric backbone of every layer, where each edge has an analogous edge connecting the same orthologous genes in all other species’ layers. For cases where there was not a one-to-one conserved edge relationship (i.e. an edge of one layer’s backbone mapped to several edges in other species’ layers due to the existence of paralogs), the inclusion criterion was for at least one of these homologous edges to be part of the backbone (Figure 2a). Note that because only edges between orthologous genes can be part of the orthoBackbone, nodes (including orthologous nodes) that are left with no edges are removed. Thus, the orthoBackbone typically has fewer nodes than its original network layer and it is not necessarily a metric backbone. Next, annotations retrieved from the Gene Ontology (GO) resource database (release 2020-09-10) were used to select all human orthoBackbone edges connecting at least one spermatogenesis-related gene linked to gene expression regulation. This led to the identification of the 79 functional interactions in spermatocytes that form a core component of the conserved genetic program of spermatogenesis (Figure 2e). To determine the impact of using three evolutionarily distant species to define the orthoBackbone, we calculated the size of this subgraph when calculated based on just two species. We observed that the orthoBackbone of a human and mouse spermatocyte contains 19,333 edges (5.5% of all functional interactions in the human spermatocyte network), and that of a human and fruit fly spermatocyte contains 9,285 edges (2.6% of all functional interactions).
Differential gene expression analysis of the mitosis-to-meiosis transcriptional burst
Reads were aligned to their respective genomes (GRCh38, GRCm38, and BDGP6.22) using HISAT2 v2.1.0 under default parameters (Kim et al., 2019). Uniquely mapped read counts were generated using FeatureCounts v1.5.0-p1 and ENSEMBL GTF annotations (Liao et al., 2014). Differential gene expression analysis was performed using a likelihood test (McCarthy et al., 2012), as implemented in the edgeR package (Robinson et al., 2010). Genes that were significantly upregulated at meiotic entry (spermatocyte vs. spermatogonia) and/or downregulated at meiotic exit (spermatid vs. spermatocyte) were selected for further analysis as part of the mitosis-to-meiosis transcriptional burst that activates the spermatogenic program. Three criteria were employed to define differentially expressed genes (DEGs): false discovery rate ≤0.05; abs[log2(fold change)]≥1; and average log2(normalized counts per million)≥1. Subsequently, EggNOG orthogroups defined at the metazoan level (Huerta-Cepas et al., 2019) were used to establish which genes had a similar up/downregulation behavior in both species. This list was further trimmed by only retaining those whose fruit fly orthologs were also expressed at average log2(normalized counts per million)>1 in the corresponding testes, leading to 920 fruit fly genes (homologous to 797 and 850 in humans and mice, respectively).
Drosophila in vivo RNAi
An in vivo UAS-GAL4 system was used to silence the 920 conserved transcriptional burst genes specifically at the mitosis-to-meiosis transition (Brand and Perrimon, 1993). Silencing was induced using the bam-GAL4 driver (kindly provided by Renate Renkawitz-Pohl, Philipps Universität Marburg, Germany), which promotes high levels of shRNA expression in late spermatogonia and early primary spermatocytes (White-Cooper, 2012). UAS-hairpin lines targeting the selected genes were purchased from the Bloomington Drosophila Stock Centre (BDSC) and the Vienna Drosophila Resource Centre (VDRC). Lines previously associated with a phenotype in the literature, regardless of the tissue, were preferentially chosen for this experiment. In the 10 cases where no lines were available, these were generated in-house following standard procedures (Perkins et al., 2015). Briefly, shRNA were designed using DSIR (Vert et al., 2006), with the corresponding sequences being cloned into the pWalium 20 vector. Constructs were injected into fruit flies carrying an attp site on the third chromosome (BDSC stock #24749) and transgene-carrying progeny were selected to establish the UAS-hairpin lines. A similar strategy was employed to generate a second independent RNAi reagent for dRNF113 (dRNF113 #2). The antisense sequences selected for each locus are listed in Supplementary file 3, and information on all tested lines is available at the Meiotic Navigator gene browser.
For assessing fertility, gene-silenced males were mated with wildtype Oregon-R virgin females (2 males:4 females) for 12 hours at 25 °C. Laid eggs were left to develop for 24 hr at 25 °C before the percentage of egg hatching was determined. This percentage (fertility rate) served as a measure of the male reproductive fitness associated with each tested gene. All fruit flies were 3–7 days post‐eclosion and fertility rates correspond to an average of four independent experiments, with a minimum of 25 eggs scored per replicate. Every batch of experiments included a negative control [RNAi against the mCherry fluorophore (BDSC stock #35785) – a sequence absent in the fruit fly genome], and a positive control [RNAi against Ribosomal protein L3 (BDSC stock #36596) – an essential unit of the ribosome]. A cut-off of <75% fertility rate was established to define impaired reproductive fitness based on the rate observed in the negative controls (>2 standard deviations of 91.6 ± 8.5%). For dRNF113, Hsc70-1 (VDRC stock #106510) and dKPNA2 (VDRC stock #34265) extended fertility tests were conducted, with ~100 eggs scored per replicate. To this end, egg-laying cages with apple juice agar plates as substrate were set up with a 1:2 male to female ratio. All Drosophila lines were maintained at 25 °C in polypropylene bottles containing enriched medium (cornmeal, molasses, yeast, beet syrup, and soy flour).
The novelty of the hits was determined based on the following exclusion criteria (applied to all 250 hits and their orthologs): i- clinically validated human infertility genes (Houston et al., 2021); ii- genes associated with a mouse male infertility phenotype (MP:0001925) in the Mouse Genome Informatics database (Baldarelli et al., 2021); and iii- fruit fly genes with an associated male germ cell or male reproductive phenotype in the FlyBase database (Larkin et al., 2021). For the latter two categories, information reflects data available in both systems as of April 2023.
Human and mouse testicular imaging
Testicular biopsies of the azoospermic individuals M1911 (RNF113B variant), M2098 (KPNA2 variant) and M2190 (HSPA2 variant) were collected at the Centre for Reproductive Medicine and Andrology (CeRA) in Münster, Germany or at the University Hospital Giessen, Germany. As a control for testicular imaging, tissue of individual M2951 (who underwent a vasectomy reversal at the CeRA) was used. All testicular tissues were fixed in Bouin’s solution and embedded in paraffin using an automatic ethanol and paraffin row (Bavimed Laborgeräte GmbH). Serial sections (5μm-thick) were stained with periodic acid-Schiff according to standard procedures. Testicular histopathology was performed as part of the routine clinical work up. For quantification, at least 100 testicular tubules per testis were screened for the most advanced germ cell type present (categories: spermatogonia, spermatocytes, round spermatids and elongating spermatids), or in their absence, for Sertoli cells or tubular shadows.
Mouse testes were dissected from 10 week-old animals and fixed in modified Davidson’s fluid for 48 hr at room temperature before being paraffin-embedded according to standard procedures (Meistrich and Hess, 2013). Serial sections (3μm-thick) were stained with haematoxylin-eosin. Slides were analyzed with the NanoZoomer-SQ Digital slide scanner (Hamamatsu Photonics).
Drosophila testicular imaging
Squash preparations of freshly dissected Drosophila testes were performed as previously described (Bonaccorsi et al., 2011) and examined using a phase contrast microscope (Nikon Eclipse E400). Phenotypes of all silenced genes associated with decreased reproductive fitness (250 in total) were assigned to one of five classes, based on the earliest stage in which the cytological defects were detected: 1- pre-meiotic (evidence of increased cell death and/or clear morphological defects in late spermatogonia); 2- meiotic (failure to progress successfully through meiosis, evidence of increased cell death, and/or clear morphological defects in spermatocytes); 3- post-meiotic (abnormal spermiogenesis characterized by sperm individualization defects and/or significant spermatid morphological defects); 4- gamete (decreased mature sperm numbers and/or motility compared with controls after qualitative assessment); 5- undetectable (no cytological defects). Images from at least 2 pairs of testes were acquired per genotype and were corrected for background illumination as previously described (Landini, 2020). These are all available in the Meiotic Navigator gene browser.
For the quantification of the meiotic area, phase-contrast images of different testicular regions were acquired at 40 x and stitched together, using the MosaicJ tool (Thévenaz and Unser, 2007) in the ImageJ software (v1.8, National Institutes of Health), to reconstruct a high-resolution image of the entire testis. On average, eight individual images were acquired to assemble each complete testis. The meiotic area corresponds to the ratio between the area (in pixels) occupied by primary spermatocytes and that of the entire gonad. A total of 15 different testes were quantified per genotype across 3 independent experiments, and groups were compared using unpaired t-tests.
For the dRNF113 immunofluorescence assay, a modified whole-mount protocol specifically designed for the analysis of Drosophila spermatocytes was employed (Raich et al., 2020). Briefly, testes were dissected in testis buffer (183 mM KCl, 47 mM NaCl, 10 mM Tris-HCl, 1 mM EDTA and 1 mM PMSF), transferred to a pre-fix solution containing 4% formaldehyde (PolySciences) in PBS, and then fixed for 20 min using a heptane-fixative mix at 3:1. The fixative consisted of 4% formaldehyde in a PBS +0.5% NP-40 (Merck) solution. After washing the fixative, samples were incubated for 1 hr in PBS supplemented with 0.3% Triton X-100 (Sigma-Aldrich), 1% (w/v) bovine serum albumin (BLIRT) and 1% (w/v) donkey serum (Thomas Fisher Scientific). Primary antibody incubation (anti-dRNF113 at 1:250; kindly provided by Chris Doe, University of Oregon, USA) was performed overnight at 4 °C in PBS supplemented with 1% (w/v) bovine serum albumin and 1% (w/v) donkey serum. After washing the primary antibody solution, samples were incubated for 1 hr with a goat anti-guinea pig secondary antibody (Alexa Fluor 488 conjugate; 1:1000; Invitrogen) and wheat germ agglutinin to label the nuclear envelope (Alexa Fluor 647 conjugate; 1:500; Invitrogen). Testes were mounted in Vectashield mounting medium with DAPI (Vector Laboratories) and the entire nuclear volume of individual cells was acquired as 1μm-thick slices using a 63 x oil immersion objective and 10 x digital zoom in a Leica SP5 confocal microscope. Slices were stacked into maximum intensity Z-projections and the relative intensity of the dRNF133 signal was measured as previously described (Prudêncio et al., 2018). A total of 30 late spermatogonia and 30 mature primary spermatocytes (late prophase I) were quantified across three independent experiments, and the two groups were compared using an unpaired t-test.
Generation of genetically modified mice
A mutant allele for the mouse Rnf113a2 gene was generated using the CRISPR/Cas9 system in the FVB/N background. gRNA was assembled by in vitro hybridization of an Alt-R scRNA targeting the sequence ATGATCCAGAAGACGAGTGG with the Alt-R tracrRNA (both from Integrated DNA Technologies, Inc). An active ribonucleoprotein (RNP) complex was obtained by incubation of the gRNA with the Cas9 protein. The RNP containing 1 μM of the hybridized gRNA and 100 ng/μl of Cas9 were microinjected into the pronucleus of fertilized oocytes and transferred into pseudopregnant females according to standard procedures (Hogan et al., 1994). The resulting pups were genotyped by PCR using primers Rnf113-F and Rnf113-R (Supplementary file 3). PCR products were run on a native 15% polyacrylamide gel in TBE buffer, from which 3 pups were identified with a PCR pattern consisting of a mix between a fragment matching the expected size for the wildtype allele and a smaller fragment. The smaller fragments were recovered, amplified, and sequenced, revealing deletions of 3, 9, or 14 nucleotides. Only the allele containing the 14-nucleotide deletion is expected to disrupt Rnf113a2 by generating a frameshift leading to an immediate stop codon after amino acid 64 of the 338 in the full-length. A mouse line was generated from this founder protein (Rnf113a2em1Mllo, here forth referred to as Rnf113a2KO). Homozygous Rnf113a2KO/KO mice were generated by intercrosses between heterozygous mice. Genotyping was performed by PCR using primers Rnf113-F and Rnf113-Chk-wt-R for the wildtype allele and Rnf113-F and Rnf113-Chk-mut-R for the mutant allele (Supplementary file 3). All animals were housed at the Mouse Facility of Instituto Gulbenkian de Ciência (MF-IGC). Wildtype FVB/N mice were generated in the production area of the MF-IGC. Transgenic animals were produced and kept in the barrier experimental area of the MF-IGC.
Characterization of mouse reproductive parameters
Upon sexual maturation, 8-week-old male F2 Rnf113a2KO/KO homozygous mice and corresponding wildtype littermates (Rnf113a2WT/WT) were caged individually and mated with two 8-week-old wildtype FVB/N females. Five males were tested for each genotype to meet the requirements for statistical validity. After the mating period, females with a confirmed copulatory plug were separated. To promote animal welfare and to reduce the number of animals used, we avoided the normal delivery of pups as a means of assessing fertility. For this, separated females were euthanized by cervical dislocation 10 days post copulation and male fertility was evaluated by the presence of developing embryos. Whenever present, the number of embryos was recorded to quantitate the number of progeny sired by the tested males. All males were considered tested after at least two females had confirmed copulatory plug. To complement the in vivo fertility tests, 10-week-old males were euthanized by cervical dislocation for testicular collection and to examine internal anatomy. Testes were weighted before being processed for histology (see ‘Human and mouse testicular imaging’). All experiments followed the Portuguese (Decreto-Lei n° 113/2013) and European (Directive 2010/63/EU) legislations, concerning housing, husbandry, and animal welfare.
RNA-Seq in human and Drosophila testes
Total RNA from two snap-frozen testicular tissue samples of the azoospermic individual M1911 (RNF113B variant) as well as from one sample each from three unrelated individuals with normal spermatogenesis (M1544 [obstructive azoospermia], M2224 [anorgasmia] and M2234 [previous vasectomy]; Siebert-Kuss et al., 2023), were extracted using the Direct-zol RNA Microprep kit (Zymo Research), following the manufacturer’s instructions. RNA quality was estimated by electrophoresis (Agilent Technologies), with all samples having a RNA integrity number (RIN) >4.5 (range: 4.5–5.6), except one replicate of M1911 (with RIN = 2.0). rRNA depletion was performed with the NEBNext rRNA Depletion Kit v2 (New England Biolabs), and total RNA libraries were prepared with the NEBNext Ultra II Directional RNA Library Prep Kit for Illumina (New England Biolabs), using 700 ng of RNA per sample. Paired-end sequencing with 150 bp per read was performed using the NextSeq2000 system (Illumina) at the University of Münster, Germany. An average of 44 million reads was generated per sample. The expression pattern of RNF113B was determined in a control testis by retrieving the corresponding data from our recently published single-cell RNA-Seq dataset of normal human spermatogenesis (Di Persio et al., 2021).
For fruit flies, RNA was extracted from 40 pairs of adult testes per sample per condition (3–7 days post‐eclosion) using the PureLink RNA Mini Kit (Thermo Fisher Scientific) following the manufacturer’s instructions. Three conditions were analyzed: dRNF113 RNAi (meiotic arrest), Prp19 RNAi (meiotic arrest) and the mCherry RNAi negative control (normal spermatogenesis), with three independent biological replicates per condition. Extracted RNA was treated with DNAse (Thermo Fisher Scientific), with a RNA quality number of 10 for all samples, as assessed by electrophoresis (Advanced Analytical Technologies). Total RNA libraries were prepared using the Zymo-Seq RiboFree Total RNA-Seq Library Kit (Zymo Research), using 1000 ng of RNA per sample. Paired-end sequencing with 150 bp per read was performed using DNBSEQ technology (BGI Group), with an average of 46 million reads per sample. Confirmation of knockdown efficiency via quantitative RT-PCR was performed for both the dRNF113 and Prp19 RNAi. For this, cDNA was synthetized using the Transcriptor First Strand cDNA Synthesis Kit (Roche), according to the manufacturer’s instructions. Reactions were performed in a QuantStudio 6 Real-Time PCR System (Thermo Fisher Scientific), using SYBR Green chemistry (Applied Biosystems). All reactions were set up in triplicates and Act57B and RpL32 were used as internal controls. Primers are listed in Supplementary file 3.
RNA-Seq reads were aligned against the human and fruit fly genomes (GRCh38 and BDGP6.32, respectively) using the STAR aligner (Dobin et al., 2013). Gene-level counts were obtained using feature-counts, taking into account the strandedness. Counts were normalized with the TMM method (Robinson and Oshlack, 2010), and differential gene expression analysis was performed using a quasi-likelihood F-test (Lun et al., 2016), as implemented in the edgeR package. DEGs were defined using the previously listed criteria (see ‘Differential gene expression analysis of the mitosis-to-meiosis transcriptional burst’). To determine to what extent the human patient sample with lower RIN could be affecting our results, we repeated this analysis correcting for sample degradation using DegNorm (Xiong et al., 2019) This revealed that 97.8% of all DEGs, as defined by our criteria, were also listed as such after taking into account sample degradation. This fits with a largely similar number of expressed genes (at TPM >1) detected in the control and in the experimental groups (average of 53.9% and 46.8% expressed protein-coding genes, respectively).
Whole exome sequencing and orthoBackbone analysis
Whole exome sequencing (WES) in the 1,021 azoospermic men included in the MERGE study cohort was performed as previously described (Wyrwoll et al., 2020). All men provided written informed consent, in agreement with local requirements. The study protocol was approved by the Münster Ethics Committees/Institutional Review Boards (Ref. No. Münster: 2010–578 f-S) in accordance with the Helsinki Declaration of 1975. Exome sequencing data were filtered for high-confidence loss-of-function (LoF) variants in the 27 orthoBackbone genes that were similarly downregulated in the testicular transcriptome of M1911 and of the dRNF113 RNAi, and for which there were available functional data supporting a role in spermatogenesis. These genes were: BAZ1A, BOLL, CDC14A, CFAP61, CFAP91, CNOT6, CNOT7, CUL3, DHX36, DNAH7, DNAH8, DNAI1, DNAI2, DRC7, EIF2A, HSPA2, KPNA2, LIPE, MTMR7, PPP1R2, PSMF1, SKP1, SRPK1, TOB1, TPGS2, TUBA4A, and UBE2K. From this list, all those previously associated with known causes of human infertility or with congenital syndromic conditions were excluded (CFAP61, CFAP91, DNAH7, DNAH8, DNAI1, DNAI2, and LIPE). The inheritance mode for each candidate gene was predicted using three sources: the DOMINO algorithm (Quinodoz et al., 2017), the gnomAD observed/expected (o/e) constraint score for LoF variants (Karczewski et al., 2020), and the recent dosage sensitivity map of the human genome (Collins et al., 2022). Filtering criteria were: stop-gain, frameshift, and splice site variants with a minor allele frequency (MAF) <0.01 in gnomAD, a maximum occurrence of 10 x in our in-house database and a read depth >10. Variants were only considered when detected in accordance with the predicted mode of inheritance.
Clinical genetics
The homozygous frameshift variant c.556_565del;p.(Thr186GlyfsTer119) in RNF113B in individual M1911 was identified by filtering WES data of 74 men with complete bilateral meiotic arrest for rare LoF variants. Two first degree cousins of M1911 were also infertile, but genetic analyses were not possible. It is likely that both are also affected by the homozygous RNF113B frameshift variant, as the parents of both men are consanguineous (therefore, family ascendants of M1911). The heterozygous LoF variants in the HSPA2 and KPNA2 genes (2 cases for each, for a total of four unrelated men) were identified by analyzing data from 1,021 azoospermic men from the MERGE cohort. For HSPA2, the heterozygous stop-gain variant c.175C>T;p.(Gln59Ter) was identified in individual M1678, and the heterozygous frameshift variant c.1806dup;p.(Glu603ArgfsTer81) in M2190. For KPNA2, the heterozygous splice acceptor variant c.667–2A>G;p.? was detected in individual M1645, and the heterozygous frameshift variant c.1158_1161del;p.(Ser387ArgfsTer14) in M2098.
The inheritance for HSPA2 and KPNA2 is predicted to be ‘very likely autosomal dominant’ by DOMINO. The gnomAD o/e scores are 0.32 (0.17–0.67) for HSPA2 and 0.21 (0.11–0.44) for KPNA2. The pHaplo scores are 0.350 for HSPA2 and 0.747 for KPNA2. The proposed ‘hard’/conservative thresholds for the ‘loss of function observed/expected upper bound fraction’ or ‘LOEUF’ score (the upper limit of the confidence interval) is <0.35 and for pHaplo >0.86. Thus, both LOEUF and pHaplo hint towards likely autosomal dominant inheritance, more so for KPNA2 than for HSPA2, but the formal cut-offs are not met in both cases.
We tested the possibility that the actual cause of infertility in M1911 and his infertile brother was due to another homozygous variant located in the vicinity of RNF113B. To this end, we queried all genes within a range of 1 million base pairs in both directions from RNF113B. The following 12 genes were analyzed: FARP1, IPO5, RAP2A, MBNL2, MIR3170, STK24, STK-24AS1, SLC15A1, DOCK9-AS1, DOCK9, LINC00456, and DOCK9-DT. Of these, only IPO5 is robustly expressed in the testis (data: GTEx Project, NIH). Variants in this gene have been associated with keratoconus, but not with male infertility. Additionally, no coding variants affecting any of these 12 genes (MAF <1%) were detected in M1911. These results render extremely unlikely the possibility that M1911’s infertility phenotype is caused by a co-segregating variant.
To rule out possible alternative monogenic causes for the infertility phenotype in all five selected individuals, the sequences of 230 genes previously identified with an at least limited level of evidence of being associated with male infertility (Houston et al., 2021), were screened in the exomes of M1911 (RNF113B variant), M1678 (HSPA2 variant), M2190 (HSPA2 variant), M1645 (KPNA2 variant), and M2098 (KPNA2 variant). Of these variants, only those affecting genes with a quantitative impact on spermatogenesis, that were consistent with the reported mode of inheritance of the respective gene, and occurring <10 x in our in-house database, were considered for further analysis. These criteria identified the heterozygous missense variant c.2377G>T;p.(Ala793Ser) in DNMT1 as a possible alternative cause for male infertility in M1911, a possibility that was disproven by subsequent familial studies. Briefly, DNA from M1911’s mother, a fertile brother and an azoospermic brother were available for segregation analysis, which was performed by Sanger sequencing. The azoospermic brother of M1911 was also homozygous for the frameshift variant in RNF113B, while the mother and the fertile brother were heterozygous for the variant. Of note, the missense variant c.2377G>T p.(Ala793Ser) in DNMT1 was also identified in M1911’s fertile brother in a similarly heterozygous state (primers are listed in Supplementary file 3). The latter result strongly suggests that the DNMT1 variant has no pathogenic effect on male fertility. Segregation analysis was also performed in M1654’s parents, revealing that the splice-site variant c.667–2A>G;p.? was not present either in his father or in his mother (i.e. indicating that it was a de novo variant). Paternity was confirmed by short-tandem-repeat analysis. No chromosomal aberrations or Y-chromosomal microdeletions in the AZF regions were identified in any of the selected individuals.
Clinical data of individuals M1678 and M2190 (HSPA2 variants)
The testicular phenotype of M1678 is unknown, as this individual did not undergo a testicular biopsy. However, FSH levels of 26.7 U/L (normal range 1–7 U/L) and a bi-testicular volume of 10 mL (normal value >24 mL) are clearly indicative of non-obstructive azoospermia. M2190 underwent a testicular biopsy at the Urology and Andrology department of the University Hospital of Giessen, Germany. He was diagnosed with complete bilateral SCO and sperm retrieval was unsuccessful.
Clinical data of individuals M1645 and M2098 (KPNA2 variants)
Individual M1645 underwent a testicular biopsy in a different clinical unit, with the corresponding pathology report stating complete bilateral SCO without successful sperm retrieval. Histology data was not available for this man. M2098 was also diagnosed with complete bilateral SCO and sperm retrieval was equally unsuccessful.
Acknowledgements
We gratefully acknowledge all men that gave their informed consent to be included in the Male Reproductive Genomics (MERGE) study cohort. The authors wish to thank Gabriel Martins and José Marques from the Imaging Unit of Instituto Gulbenkian de Ciência (IGC, Portugal) for assistance in acquiring and processing microscopy data. We acknowledge the IGC’s Histology Facility and Genomics Unit, respectively, for the analysis of mouse testicular tissue and preparation of the fruit fly RNA-Seq libraries, Nicole Terwort (University of Münster) for preparing the RNA extractions from human testes, and the Core Facility Genomics of the Medical Faculty of the University of Münster for conducting library preparation and RNA sequencing in samples from individual M1911. Renate Renkawitz-Pohl (Philipps-Universität Marburg, Germany) and Chris Doe (University of Oregon, USA) kindly provided us the bam-GAL4 line and the anti-dRNF113 antibody, respectively. We thank Élio Sucena, Raquel Oliveira and Mónica Bettencourt-Dias (all from the IGC), Patrícia Beldade (Faculty of Sciences, University of Lisbon), and Peter Ellis (University of Kent, UK) for insightful discussions.
Funding Statement
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Contributor Information
Jörg D Becker, Email: jbecker@itqb.unl.pt.
Paulo Navarro-Costa, Email: navarro-costa@medicina.ulisboa.pt.
Wei Yan, The Lundquist Institute, United States.
Wei Yan, The Lundquist Institute, United States.
Funding Information
This paper was supported by the following grants:
Fundação para a Ciência e a Tecnologia PTDC/MEC-AND/30221/2017 to Paulo Navarro-Costa.
Fundação para a Ciência e a Tecnologia EXPL/MEC-AND/0676/2021 to Paulo Navarro-Costa.
Fundação para a Ciência e a Tecnologia CEECIND/03345/2018 to Jörg D Becker.
Fundação para a Ciência e a Tecnologia PD/BD/114362/2016 to Chandra Shekhar Misra.
National Institutes of Health 1R01LM012832 to Luis M Rocha.
National Science Foundation Research Traineeship grant 1735095 to Luis M Rocha.
Deutsche Forschungsgemeinschaft 329621271 to Frank Tüttelmann.
Ministry of Education, Singapore MOE2018-T2-2-053 to Irene Julca, Marek Mutwil.
Additional information
Competing interests
No competing interests declared.
Author contributions
Resources, Software, Formal analysis, Investigation, Visualization, Methodology, Conceptualization.
Formal analysis, Investigation, Methodology.
Formal analysis, Investigation, Methodology.
Formal analysis, Investigation, Methodology, Conceptualization.
Formal analysis, Investigation, Visualization, Methodology.
Data curation, Investigation, Methodology, Conceptualization.
Investigation.
Investigation.
Investigation.
Formal analysis, Investigation.
Investigation.
Formal analysis, Investigation, Methodology.
Formal analysis, Investigation, Methodology.
Formal analysis, Investigation, Methodology.
Investigation.
Validation, Investigation, Methodology.
Validation.
Validation, Investigation, Methodology, Funding acquisition, Writing – review and editing, Conceptualization.
Conceptualization, Validation, Investigation, Methodology, Funding acquisition, Writing – review and editing.
Conceptualization, Validation, Investigation, Methodology, Funding acquisition, Writing – review and editing.
Conceptualization, Supervision, Investigation, Methodology, Funding acquisition, Writing – review and editing.
Conceptualization, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.
Ethics
All men provided written informed consent, in agreement with local requirements. The study protocol was approved by the Münster Ethics Committees/Institutional Review Boards (Ref. No. Münster: 2010-578-f-S) in accordance with the Helsinki Declaration of 1975.
All mouse experiments followed the Portuguese (Decreto-Lei no 113/2013) and European (Directive 2010/63/EU) legislations, concerning housing, husbandry, and animal welfare.
Additional files
Data availability
An interactive online application compiling experimental results on 920 evolutionarily conserved spermatocyte genes and associated microscopy images (Meiotic Navigator) is available at https://pages.igc.pt/meionav. Computational data and custom R and Python scripts used in the analysis are available from https://github.com/rionbr/meionav (copy archived at Brattig Correia, 2024). All testicular RNA-Seq data generated in this study (Supplementary file 1) were deposited in the European Genome-Phenome Archive (human data) and in the Sequence Read Archive (fruit fly data).
The following datasets were generated:
Correia B. 2022. Control RNAi Replicate1. NCBI Sequence Read Archive. SRX12573744
Correia B. 2022. Control RNAi Replicate2. NCBI Sequence Read Archive. SRX12573745
Correia B. 2022. Control RNAi Replicate3. NCBI Sequence Read Archive. SRX12573746
Correia B. 2022. dRNF113 RNAi Replicate1. NCBI Sequence Read Archive. SRX12573747
Correia B. 2022. dRNF113 RNAi Replicate2. NCBI Sequence Read Archive. SRX12573748
Correia B. 2022. dRNF113 RNAi Replicate3. NCBI Sequence Read Archive. SRX12573749
Correia B. 2022. Prp19 RNAi Replicate1. NCBI Sequence Read Archive. SRX12573750
Correia B. 2022. Prp19 RNAi Replicate2. NCBI Sequence Read Archive. SRX12573751
Correia B. 2022. Prp19 RNAi Replicate3. NCBI Sequence Read Archive. SRX12573752
Correia B. 2022. Total RNA sequencing in M1911. European Genome-Phenome Archive. EGAD00001008639
Correia B. 2022. Stage-specific gene and transcript dynamics in human male germ cells. European Genome-Phenome Archive. EGAD00001008652
The following previously published datasets were used:
Jan SZ, Volmmer TL, Jongejan A, Roling MD, Silber SJ, de Rooji DG, Hamer G, Repping S, van Pelt AMM. 2016. Homo sapiens male germ cell transcriptome. NCBI BioProject. PRJNA310976
Soumillon M, Necsulea A, Weier M, Brawand D, Zhang X, Gu H, Barthès P, Kokkinaki M, Nef S, Gnirke A, Dym M, de Massy B, Mikkelsen TS, Kaessmann H. 2013. Cellular source and mechanisms of high transcriptome complexity in the mammalian testis (mouse) NCBI BioProject. PRJNA187158
Vedelek V, Bodai L, Grézal G, Kovács B, Boros LM, Laurinyecz B, Sinka R. 2018. Analysis of Drosophila melanogaster testis transcriptome. NCBI BioProject. PRJNA496287
Pharmaceuticals Regeneron 2019. RNA Sequencing Of Intestinal Mucosa in Celiac patients. NCBI BioProject. PRJNA528755
Kazakevych J, Sayols S, Messner B, Krienke C, Soshnikova N. 2016. Dynamic changes in chromatin states during specification and differentiation of adult intestinal stem cells (mouse) NCBI BioProject. PRJNA352951
Dutta D, Dobson AJ, Houtz PL, Gläßer C, Revah J, Korzelius J, Patel PH, Edgar B, Buchon N. 2014. Transcriptome profiling of Drosophila intestinal cells. NCBI BioProject. PRJNA260852
Xu X, Stoyanova EI, Lemiesz A, Xing J, Mash DC, Heintz N. 2017. Species and Cell-Type Properties of Classically Defined Human and Rodent Neurons and Glia (human) NCBI BioProject. PRJNA395921
Lindholm ME, Huss M, Solnestam BW, Kjellqvist S, Lundeberg J, Sundberg CJ. 2014. The human skeletal muscle transcriptome - sex differences, alternative splicing and tissue homogeneity assessed with RNA sequencing. NCBI BioProject. PRJNA252429
Lindholm ME, Huss M, Solnestam BW, Kjellqvist S, Lundeberg J, Sundberg CJ. 2017. PRJNA395915; Species and Cell-Type Properties of Classically Defined Human and Rodent Neurons and Glia (mouse) NCBI BioProject. PRJNA395915
Terry EE, Zhang X, Hoffmann C, Hughes LD, Lewis SA, Li J, Wallace MJ, Riley LA, Douglas CM, Gutierrez-Monreal MA, Lahens NF, Gong MC, Andrade F, Esser KA, Hughes ME. 2017. Transcriptional Profiling Reveals Extraordinary Diversity Among Skeletal Muscle Tissues (mouse) NCBI BioProject. PRJNA391968
Crocker A, Guan XJ, Murphy CT. 2015. Transcriptome Analysis of Drosophila Mushroom Body Neurons by Cell Type Reveals Memory-Related Changes in Gene Expression. NCBI BioProject. PRJNA302146
DeAguero AA, Castillo L, Oas ST, Kiani K, Bryantsev AL, Cripps RM. 2018. Regulation of fiber-specific actin expression by the Drosophila SRF ortholog Blistered. NCBI BioProject. PRJNA485776
References
- Agarwal A, Mulgund A, Hamada A, Chyatte MR. A unique view on male infertility around the globe. Reproductive Biology and Endocrinology. 2015;13:37. doi: 10.1186/s12958-015-0032-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alavattam KG, Maezawa S, Sakashita A, Khoury H, Barski A, Kaplan N, Namekawa SH. Attenuated chromatin compartmentalization in meiosis and its maturation in sperm development. Nature Structural & Molecular Biology. 2019;26:175–184. doi: 10.1038/s41594-019-0189-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arendt D. The evolution of cell types in animals: emerging principles from molecular studies. Nature Reviews. Genetics. 2008;9:868–882. doi: 10.1038/nrg2416. [DOI] [PubMed] [Google Scholar]
- Azuma M, Ogata T, Yamazoe K, Tanaka Y, Inoue YH. Heat shock cognate 70 genes contribute to Drosophila spermatocyte growth progression possibly through the insulin signaling pathway. Development, Growth & Differentiation. 2021;63:231–248. doi: 10.1111/dgd.12734. [DOI] [PubMed] [Google Scholar]
- Baldarelli RM, Smith CM, Finger JH, Hayamizu TF, McCright IJ, Xu J, Shaw DR, Beal JS, Blodgett O, Campbell J, Corbani LE, Frost PJ, Giannatto SC, Miers DB, Kadin JA, Richardson JE, Ringwald M. The mouse gene expression database (GXD): 2021 update. Nucleic Acids Research. 2021;49:D924–D931. doi: 10.1093/nar/gkaa914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benton ML, Abraham A, LaBella AL, Abbot P, Rokas A, Capra JA. The influence of evolutionary history on human health and disease. Nature Reviews. Genetics. 2021;22:269–283. doi: 10.1038/s41576-020-00305-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonaccorsi S, Giansanti MG, Cenci G, Gatti M. Preparation of live testis squashes in Drosophila. Cold Spring Harbor Protocols. 2011;2011:rot5577. doi: 10.1101/pdb.prot5577. [DOI] [PubMed] [Google Scholar]
- Brand AH, Perrimon N. Targeted gene expression as a means of altering cell fates and generating dominant phenotypes. Development. 1993;118:401–415. doi: 10.1242/dev.118.2.401. [DOI] [PubMed] [Google Scholar]
- Brattig Correia R, Barrat A, Rocha LM. Contact networks have small metric backbones that maintain community structure and are primary transmission subgraphs. PLOS Computational Biology. 2023;19:e1010854. doi: 10.1371/journal.pcbi.1010854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brattig Correia R. Meionav. swh:1:rev:3e0cca6bea206023ea3a3b322c5bfdd2081e9842Software Heritage. 2024 https://archive.softwareheritage.org/swh:1:dir:1bacb3d9127b42d19fc3a7efdab5454ec441043e;origin=https://github.com/rionbr/meionav;visit=swh:1:snp:3926262a4adc916051d837ace870bbe95635401c;anchor=swh:1:rev:3e0cca6bea206023ea3a3b322c5bfdd2081e9842
- Brown KR, Jurisica I. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biology. 2007;8:R95. doi: 10.1186/gb-2007-8-5-r95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nature Methods. 2015;12:59–60. doi: 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
- Cardoso-Moreira M, Halbert J, Valloton D, Velten B, Chen C, Shao Y, Liechti A, Ascenção K, Rummel C, Ovchinnikova S, Mazin PV, Xenarios I, Harshman K, Mort M, Cooper DN, Sandi C, Soares MJ, Ferreira PG, Afonso S, Carneiro M, Turner JMA, VandeBerg JL, Fallahshahroudi A, Jensen P, Behr R, Lisgo S, Lindsay S, Khaitovich P, Huber W, Baker J, Anders S, Zhang YE, Kaessmann H. Gene expression across mammalian organ development. Nature. 2019;571:505–509. doi: 10.1038/s41586-019-1338-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carney TD, Struck AJ, Doe CQ. midlife crisis encodes a conserved zinc-finger protein required to maintain neuronal differentiation in Drosophila. Development. 2013;140:4155–4164. doi: 10.1242/dev.093781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choobdar S, Ahsen ME, Crawford J. The DREAM Module Identification Challenge Consortium et al. Assessment of network module identification across complex diseases. Nature Methods. 2019;16:843–852. doi: 10.1038/s41592-019-0509-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Collins RL, Glessner JT, Porcu E, Lepamets M, Brandon R, Lauricella C, Han L, Morley T, Niestroj L-M, Ulirsch J, Everett S, Howrigan DP, Boone PM, Fu J, Karczewski KJ, Kellaris G, Lowther C, Lucente D, Mohajeri K, Nõukas M, Nuttle X, Samocha KE, Trinh M, Ullah F, Võsa U, Epi25 Consortium. Estonian Biobank Research Team. Hurles ME, Aradhya S, Davis EE, Finucane H, Gusella JF, Janze A, Katsanis N, Matyakhina L, Neale BM, Sanders D, Warren S, Hodge JC, Lal D, Ruderfer DM, Meck J, Mägi R, Esko T, Reymond A, Kutalik Z, Hakonarson H, Sunyaev S, Brand H, Talkowski ME. A cross-disorder dosage sensitivity map of the human genome. Cell. 2022;185:3041–3055. doi: 10.1016/j.cell.2022.06.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colonnetta MM, Goyal Y, Johnson HE, Syal S, Schedl P, Deshpande G. Preformation and epigenesis converge to specify primordial germ cell fate in the early Drosophila embryo. PLOS Genetics. 2022;18:e1010002. doi: 10.1371/journal.pgen.1010002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dapper AL, Wade MJ. Relaxed selection and the rapid evolution of reproductive genes. Trends in Genetics. 2020;36:640–649. doi: 10.1016/j.tig.2020.06.014. [DOI] [PubMed] [Google Scholar]
- Di Persio S, Tekath T, Siebert-Kuss LM, Cremers J-F, Wistuba J, Li X, Meyer Zu Hörste G, Drexler HCA, Wyrwoll MJ, Tüttelmann F, Dugas M, Kliesch S, Schlatt S, Laurentino S, Neuhaus N. Single-cell RNA-seq unravels alterations of the human spermatogonial stem cell compartment in patients with impaired spermatogenesis. Cell Reports. Medicine. 2021;2:100395. doi: 10.1016/j.xcrm.2021.100395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dix DJ, Allen JW, Collins BW, Mori C, Nakamura N, Poorman-Allen P, Goulding EH, Eddy EM. Targeted gene disruption of Hsp70-2 results in failed meiosis, germ cell apoptosis, and male infertility. PNAS. 1996;93:3264–3268. doi: 10.1073/pnas.93.8.3264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Domazet-Loso T, Brajković J, Tautz D. A phylostratigraphy approach to uncover the genomic history of major adaptations in metazoan lineages. Trends in Genetics. 2007;23:533–539. doi: 10.1016/j.tig.2007.08.014. [DOI] [PubMed] [Google Scholar]
- Domazet-Lošo T, Tautz D. A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns. Nature. 2010;468:815–818. doi: 10.1038/nature09632. [DOI] [PubMed] [Google Scholar]
- Dunn CW, Hejnol A, Matus DQ, Pang K, Browne WE, Smith SA, Seaver E, Rouse GW, Obst M, Edgecombe GD, Sørensen MV, Haddock SHD, Schmidt-Rhaesa A, Okusu A, Kristensen RM, Wheeler WC, Martindale MQ, Giribet G. Broad phylogenomic sampling improves resolution of the animal tree of life. Nature. 2008;452:745–749. doi: 10.1038/nature06614. [DOI] [PubMed] [Google Scholar]
- Eddy EM. Role of heat shock protein HSP70-2 in spermatogenesis. Reviews of Reproduction. 1999;4:23–30. doi: 10.1530/ror.0.0040023. [DOI] [PubMed] [Google Scholar]
- Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biology. 2019;20:238. doi: 10.1186/s13059-019-1832-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evans R, O’Neill M, Pritzel A, Antropova N, Senior A, Green T, Žídek A, Bates R, Blackwell S, Yim J, Ronneberger O, Bodenstein S, Zielinski M, Bridgland A, Potapenko A, Cowie A, Tunyasuvunakool K, Jain R, Clancy E, Kohli P, Jumper J, Hassabis D. Protein complex Prediction with AlphaFold-Multimer. bioRxiv. 2021 doi: 10.1101/2021.10.04.463034. [DOI]
- Extavour CGM. Evolution of the bilaterian germ line: lineage origin and modulation of specification mechanisms. Integrative and Comparative Biology. 2007;47:770–785. doi: 10.1093/icb/icm027. [DOI] [PubMed] [Google Scholar]
- Fairclough SR, Chen Z, Kramer E, Zeng Q, Young S, Robertson HM, Begovic E, Richter DJ, Russ C, Westbrook MJ, Manning G, Lang BF, Haas B, Nusbaum C, King N. Premetazoan genome evolution and the regulation of cell differentiation in the choanoflagellate Salpingoeca rosetta. Genome Biology. 2013;14:R15. doi: 10.1186/gb-2013-14-2-r15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fierro-Constaín L, Schenkelaars Q, Gazave E, Haguenauer A, Rocher C, Ereskovsky A, Borchiellini C, Renard E. The conservation of the germline multipotency program, from sponges to vertebrates: a stepping stone to understanding the somatic and germline origins. Genome Biology and Evolution. 2017;9:474–488. doi: 10.1093/gbe/evw289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitzpatrick JL, Kahrl AF, Snook RR. SpermTree, a species-level database of sperm morphology spanning the animal tree of life. Scientific Data. 2022;9:30. doi: 10.1038/s41597-022-01131-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gates AJ, Brattig Correia R, Wang X, Rocha LM. The effective graph reveals redundancy, canalization, and control pathways in biochemical regulation and signaling. PNAS. 2021;118:e2022598118. doi: 10.1073/pnas.2022598118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gatti da Silva GH, Jurica MS, Chagas da Cunha JP, Oliveira CC, Coltri PP. Human RNF113A participates of pre‐mRNA splicing in vitro. Journal of Cellular Biochemistry. 2019;120:8764–8774. doi: 10.1002/jcb.28163. [DOI] [PubMed] [Google Scholar]
- Guo J, Grow EJ, Mlcochova H, Maher GJ, Lindskog C, Nie X, Guo Y, Takei Y, Yun J, Cai L, Kim R, Carrell DT, Goriely A, Hotaling JM, Cairns BR. The adult human testis transcriptional cell atlas. Cell Research. 2018;28:1141–1157. doi: 10.1038/s41422-018-0099-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han X, Zhou Z, Fei L, Sun H, Wang R, Chen Y, Chen H, Wang J, Tang H, Ge W, Zhou Y, Ye F, Jiang M, Wu J, Xiao Y, Jia X, Zhang T, Ma X, Zhang Q, Bai X, Lai S, Yu C, Zhu L, Lin R, Gao Y, Wang M, Wu Y, Zhang J, Zhan R, Zhu S, Hu H, Wang C, Chen M, Huang H, Liang T, Chen J, Wang W, Zhang D, Guo G. Construction of a human cell landscape at single-cell level. Nature. 2020;581:303–309. doi: 10.1038/s41586-020-2157-4. [DOI] [PubMed] [Google Scholar]
- Hermann BP, Cheng K, Singh A, Roa-De La Cruz L, Mutoji KN, Chen I-C, Gildersleeve H, Lehle JD, Mayo M, Westernströer B, Law NC, Oatley MJ, Velte EK, Niedenberger BA, Fritze D, Silber S, Geyer CB, Oatley JM, McCarrey JR. The mammalian spermatogenesis single-cell transcriptome, from spermatogonial stem cells to spermatids. Cell Reports. 2018;25:1650–1667. doi: 10.1016/j.celrep.2018.10.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hogan B, Beddington R, Constantini F, Lacy E. Manipulating the mouse embryo: a laboratory manual, second edition. Cold Spring Harbor Laboratory Press; 1994. [Google Scholar]
- Houston BJ, Riera-Escamilla A, Wyrwoll MJ, Salas-Huetos A, Xavier MJ, Nagirnaja L, Friedrich C, Conrad DF, Aston KI, Krausz C, Tüttelmann F, O’Bryan MK, Veltman JA, Oud MS. A systematic review of the validated monogenic causes of human male infertility: 2020 update and A discussion of emerging gene-disease relationships. Human Reproduction Update. 2021;28:15–29. doi: 10.1093/humupd/dmab030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, Mende DR, Letunic I, Rattei T, Jensen LJ, von Mering C, Bork P. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Research. 2019;47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jan SZ, Vormer TL, Jongejan A, Röling MD, Silber SJ, de Rooij DG, Hamer G, Repping S, van Pelt AMM. Unraveling transcriptome dynamics in human spermatogenesis. Development. 2017;144:3659–3673. doi: 10.1242/dev.152413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- John A, Vinayan K, Varghese J. Achiasmy: male fruit flies are not ready to mix. Frontiers in Cell and Developmental Biology. 2016;4:75. doi: 10.3389/fcell.2016.00075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Julca I, Ferrari C, Flores-Tornero M, Proost S, Lindner A-C, Hackenberg D, Steinbachová L, Michaelidis C, Gomes Pereira S, Misra CS, Kawashima T, Borg M, Berger F, Goldberg J, Johnson M, Honys D, Twell D, Sprunck S, Dresselhaus T, Becker JD, Mutwil M. Comparative transcriptomic analysis reveals conserved programmes underpinning organogenesis and reproduction in land plants. Nature Plants. 2021;7:1143–1159. doi: 10.1038/s41477-021-00958-2. [DOI] [PubMed] [Google Scholar]
- Kaessmann H. Origins, evolution, and phenotypic impact of new genes. Genome Research. 2010;20:1313–1326. doi: 10.1101/gr.101386.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, Singer-Berk M, England EM, Seaby EG, Kosmicki JA, Walters RK, Tashman K, Farjoun Y, Banks E, Poterba T, Wang A, Seed C, Whiffin N, Chong JX, Samocha KE, Pierce-Hoffman E, Zappala Z, O’Donnell-Luria AH, Minikel EV, Weisburd B, Lek M, Ware JS, Vittal C, Armean IM, Bergelson L, Cibulskis K, Connolly KM, Covarrubias M, Donnelly S, Ferriera S, Gabriel S, Gentry J, Gupta N, Jeandet T, Kaplan D, Llanwarne C, Munshi R, Novod S, Petrillo N, Roazen D, Ruano-Rubio V, Saltzman A, Schleicher M, Soto J, Tibbetts K, Tolonen C, Wade G, Talkowski ME, Genome Aggregation Database Consortium. Neale BM, Daly MJ, MacArthur DG. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–443. doi: 10.1038/s41586-020-2308-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nature Biotechnology. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kondo S, Vedanayagam J, Mohammed J, Eizadshenass S, Kan L, Pang N, Aradhya R, Siepel A, Steinhauer J, Lai EC. New genes often acquire male-specific functions but rarely become essential in Drosophila. Genes & Development. 2017;31:1841–1846. doi: 10.1101/gad.303131.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kotliar D, Veres A, Nagy MA, Tabrizi S, Hodis E, Melton DA, Sabeti PC. Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq. eLife. 2019;8:e43803. doi: 10.7554/eLife.43803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Landini G. Background illumination correction – Novel context-based segmentation algorithms for intelligent microscopy. 2020. [October 2, 2021]. https://blog.bham.ac.uk/intellimic/g-landini-software/background-illumination-correction/
- Larkin A, Marygold SJ, Antonazzo G, Attrill H, Dos Santos G, Garapati PV, Goodman JL, Gramates LS, Millburn G, Strelets VB, Tabone CJ, Thurmond J, FlyBase Consortium FlyBase: updates to the Drosophila melanogaster knowledge base. Nucleic Acids Research. 2021;49:D899–D907. doi: 10.1093/nar/gkaa1026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lau X, Munusamy P, Ng MJ, Sangrithi M. Single-cell RNA sequencing of the cynomolgus macaque testis reveals conserved transcriptional profiles during mammalian spermatogenesis. Developmental Cell. 2020;54:548–566. doi: 10.1016/j.devcel.2020.07.018. [DOI] [PubMed] [Google Scholar]
- Laumer CE, Fernández R, Lemer S, Combosch D, Kocot KM, Riesgo A, Andrade SCS, Sterrer W, Sørensen MV, Giribet G. Revisiting metazoan phylogeny with genomic sampling of all phyla. Proceedings of the Royal Society B. 2019;286:20190831. doi: 10.1098/rspb.2019.0831. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leu JY, Chua PR, Roeder GS. The meiosis-specific Hop2 protein of S. cerevisiae ensures synapsis between homologous chromosomes. Cell. 1998;94:375–386. doi: 10.1016/s0092-8674(00)81480-4. [DOI] [PubMed] [Google Scholar]
- Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- Lim C, Tarayrah L, Chen X. Transcriptional regulation during Drosophila spermatogenesis. Spermatogenesis. 2012;2:158–166. doi: 10.4161/spmg.21775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu S, Yu H, Liu Y, Liu X, Zhang Y, Bu C, Yuan S, Chen Z, Xie G, Li W, Xu B, Yang J, He L, Jin T, Xiong Y, Sun L, Liu X, Han C, Cheng Z, Liang J, Shang Y. Chromodomain protein CDYL Acts as a crotonyl-coa hydratase to regulate histone crotonylation and spermatogenesis. Molecular Cell. 2017;67:853–866. doi: 10.1016/j.molcel.2017.07.011. [DOI] [PubMed] [Google Scholar]
- Liu H, Zhang J. Higher germline mutagenesis of genes with stronger testis expressions refutes the transcriptional scanning hypothesis. Molecular Biology and Evolution. 2020;37:3225–3231. doi: 10.1093/molbev/msaa168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lun ATL, Chen Y, Smyth GK. It’s DE-licious: a recipe for differential expression analyses of rna-seq experiments using quasi-likelihood methods in edgeR. Statistical Genomics. 2016;1418:391–416. doi: 10.1007/978-1-4939-3578-9_19. [DOI] [PubMed] [Google Scholar]
- Ly-Huynh JD, Lieu KG, Major AT, Whiley PAF, Holt JE, Loveland KL, Jans DA. Importin alpha2-interacting proteins with nuclear roles during mammalian spermatogenesis. Biology of Reproduction. 2011;85:1191–1202. doi: 10.1095/biolreprod.111.091686. [DOI] [PubMed] [Google Scholar]
- Maezawa S, Sakashita A, Yukawa M, Chen X, Takahashi K, Alavattam KG, Nakata I, Weirauch MT, Barski A, Namekawa SH. Super-enhancer switching drives a burst in gene expression at the mitosis-to-meiosis transition. Nature Structural & Molecular Biology. 2020;27:978–988. doi: 10.1038/s41594-020-0488-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mason DA, Fleming RJ, Goldfarb DS. Drosophila melanogaster Importin α1 and α3 can replace importin α2 during spermatogenesis but not oogenesis. Genetics. 2002;161:157–170. doi: 10.1093/genetics/161.1.157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40:4288–4297. doi: 10.1093/nar/gks042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meistrich ML, Hess RA. In: Spermatogenesis: methods and protocols. Carrell DT, Aston KI, editors. Humana Press; 2013. Assessment of spermatogenesis through staging of seminiferous tubules; pp. 299–307. [DOI] [PubMed] [Google Scholar]
- Merkin J, Russell C, Chen P, Burge CB. Evolutionary dynamics of gene and isoform regulation in Mammalian tissues. Science. 2012;338:1593–1599. doi: 10.1126/science.1228186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moi D, Nishio S, Li X, Valansi C, Langleib M, Brukman NG, Flyak K, Dessimoz C, de Sanctis D, Tunyasuvunakool K, Jumper J, Graña M, Romero H, Aguilar PS, Jovine L, Podbilewicz B. Discovery of archaeal fusexins homologous to eukaryotic HAP2/GCS1 gamete fusion proteins. Nature Communications. 2022;13:3880. doi: 10.1038/s41467-022-31564-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murat F, Mbengue N, Winge SB, Trefzer T, Leushkin E, Sepp M, Cardoso-Moreira M, Schmidt J, Schneider C, Mößinger K, Brüning T, Lamanna F, Belles MR, Conrad C, Kondova I, Bontrop R, Behr R, Khaitovich P, Pääbo S, Marques-Bonet T, Grützner F, Almstrup K, Schierup MH, Kaessmann H. The molecular evolution of spermatogenesis across mammals. Nature. 2023;613:308–316. doi: 10.1038/s41586-022-05547-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Navarro-Costa PA, Molaro A, Misra CS, Meiklejohn CD, Ellis PJ. Sex and suicide: the curious case of toll-like receptors. PLOS Biology. 2020;18:e3000663. doi: 10.1371/journal.pbio.3000663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oud MS, Smits RM, Smith HE, Mastrorosa FK, Holt GS, Houston BJ, de Vries PF, Alobaidi BKS, Batty LE, Ismail H, Greenwood J, Sheth H, Mikulasova A, Astuti GDN, Gilissen C, McEleny K, Turner H, Coxhead J, Cockell S, Braat DDM, Fleischer K, D’Hauwers KWM, Schaafsma E, Genetics of Male Infertility Initiative (GEMINI) consortium. Nagirnaja L, Conrad DF, Friedrich C, Kliesch S, Aston KI, Riera-Escamilla A, Krausz C, Gonzaga-Jauregui C, Santibanez-Koref M, Elliott DJ, Vissers LELM, Tüttelmann F, O’Bryan MK, Ramos L, Xavier MJ, van der Heijden GW, Veltman JA. A de novo paradigm for male infertility. Nature Communications. 2022;13:154. doi: 10.1038/s41467-021-27132-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nature Methods. 2017;14:417–419. doi: 10.1038/nmeth.4197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perillo M, Swartz SZ, Wessel GM. A conserved node in the regulation of Vasa between an induced and an inherited program of primordial germ cell specification. Developmental Biology. 2022;482:28–33. doi: 10.1016/j.ydbio.2021.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perkins LA, Holderbaum L, Tao R, Hu Y, Sopko R, McCall K, Yang-Zhou D, Flockhart I, Binari R, Shim H-S, Miller A, Housden A, Foos M, Randkelv S, Kelley C, Namgyal P, Villalta C, Liu L-P, Jiang X, Huan-Huan Q, Wang X, Fujiyama A, Toyoda A, Ayers K, Blum A, Czech B, Neumuller R, Yan D, Cavallaro A, Hibbard K, Hall D, Cooley L, Hannon GJ, Lehmann R, Parks A, Mohr SE, Ueda R, Kondo S, Ni J-Q, Perrimon N. The transgenic RNAi project at harvard medical school: resources and validation. Genetics. 2015;201:843–852. doi: 10.1534/genetics.115.180208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Petukhova GV, Romanienko PJ, Camerini-Otero RD. The Hop2 protein has a direct role in promoting interhomolog interactions during mouse meiosis. Developmental Cell. 2003;5:927–936. doi: 10.1016/s1534-5807(03)00369-1. [DOI] [PubMed] [Google Scholar]
- Pound P, Ritskes-Hoitinga M. Is it possible to overcome issues of external validity in preclinical animal research? Why most animal models are bound to fail. Journal of Translational Medicine. 2018;16:304. doi: 10.1186/s12967-018-1678-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prudêncio P, Guilgur LG, Sobral J, Becker JD, Martinho RG, Navarro-Costa P. The Trithorax group protein dMLL3/4 instructs the assembly of the zygotic genome at fertilization. EMBO Reports. 2018;19:e45728. doi: 10.15252/embr.201845728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quinodoz M, Royer-Bertrand B, Cisarova K, Di Gioia SA, Superti-Furga A, Rivolta C. DOMINO: using machine learning to predict genes associated with dominant disorders. American Journal of Human Genetics. 2017;101:623–629. doi: 10.1016/j.ajhg.2017.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raich N, Contremoulins V, Karess RE. Immunostaining of whole-mount Drosophila Testes for 3D confocal analysis of large spermatocytes. Journal of Visualized Experiments. 2020;01:61061. doi: 10.3791/61061. [DOI] [PubMed] [Google Scholar]
- Raj I, Sadat Al Hosseini H, Dioguardi E, Nishimura K, Han L, Villa A, de Sanctis D, Jovine L. Structural basis of egg coat-sperm recognition at fertilization. Cell. 2017;169:1315–1326. doi: 10.1016/j.cell.2017.05.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramm SA, Schärer L, Ehmcke J, Wistuba J. Sperm competition and the evolution of spermatogenesis. Molecular Human Reproduction. 2014;20:1169–1179. doi: 10.1093/molehr/gau070. [DOI] [PubMed] [Google Scholar]
- Ren Y, Ay A, Kahveci T. Shortest path counting in probabilistic biological networks. BMC Bioinformatics. 2018;19:465. doi: 10.1186/s12859-018-2480-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson MD, McCarthy DJ, Smyth GK. edgeR : a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biology. 2010;11:R25. doi: 10.1186/gb-2010-11-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schurko AM, Logsdon JM. Using a meiosis detection toolkit to investigate ancient asexual “scandals” and the evolution of sex. BioEssays. 2008;30:579–589. doi: 10.1002/bies.20764. [DOI] [PubMed] [Google Scholar]
- Sebé-Pedrós A, Degnan BM, Ruiz-Trillo I. The origin of Metazoa: a unicellular perspective. Nature Reviews. Genetics. 2017;18:498–512. doi: 10.1038/nrg.2017.21. [DOI] [PubMed] [Google Scholar]
- Shah C, Vangompel MJW, Naeem V, Chen Y, Lee T, Angeloni N, Wang Y, Xu EY. Widespread presence of human BOULE homologs among animals and conservation of their ancient reproductive function. PLOS Genetics. 2010;6:e1001022. doi: 10.1371/journal.pgen.1001022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shami AN, Zheng X, Munyoki SK, Ma Q, Manske GL, Green CD, Sukhwani M, Orwig KE, Li JZ, Hammoud SS. Single-cell rna sequencing of human, macaque, and mouse testes uncovers conserved and divergent features of mammalian spermatogenesis. Developmental Cell. 2020;54:529–547. doi: 10.1016/j.devcel.2020.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siebert-Kuss LM, Krenz H, Tekath T, Wöste M, Di Persio S, Terwort N, Wyrwoll MJ, Cremers J-F, Wistuba J, Dugas M, Kliesch S, Schlatt S, Tüttelmann F, Gromoll J, Neuhaus N, Laurentino S. Transcriptome analyses in infertile men reveal germ cell-specific expression and splicing patterns. Life Science Alliance. 2023;6:e202201633. doi: 10.26508/lsa.202201633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simas T, Correia RB, Rocha LM. The distance backbone of complex networks. Journal of Complex Networks. 2021;9:cnab021. doi: 10.1093/comnet/cnab021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soumillon M, Necsulea A, Weier M, Brawand D, Zhang X, Gu H, Barthès P, Kokkinaki M, Nef S, Gnirke A, Dym M, de Massy B, Mikkelsen TS, Kaessmann H. Cellular source and mechanisms of high transcriptome complexity in the mammalian testis. Cell Reports. 2013;3:2179–2190. doi: 10.1016/j.celrep.2013.05.031. [DOI] [PubMed] [Google Scholar]
- Speer KF, Allen-Waller L, Novikov DR, Barott KL. Molecular mechanisms of sperm motility are conserved in an early-branching metazoan. PNAS. 2021;118:e2109993118. doi: 10.1073/pnas.2109993118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun MGF, Kim PM. Evolution of biological interaction networks: from models to real data. Genome Biology. 2011;12:235. doi: 10.1186/gb-2011-12-12-235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swanson WJ, Vacquier VD. The rapid evolution of reproductive proteins. Nature Reviews. Genetics. 2002;3:137–144. doi: 10.1038/nrg733. [DOI] [PubMed] [Google Scholar]
- Szcześniak MW, Ciomborowska J, Nowak W, Rogozin IB, Makałowska I. Primate and rodent specific intron gains and the origin of retrogenes with splice variants. Molecular Biology and Evolution. 2011;28:33–37. doi: 10.1093/molbev/msq260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, Jensen LJ, von Mering C. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research. 2019;47:D607–D613. doi: 10.1093/nar/gky1131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thévenaz P, Unser M. User-friendly semiautomated assembly of accurate image mosaics in microscopy. Microscopy Research and Technique. 2007;70:135–146. doi: 10.1002/jemt.20393. [DOI] [PubMed] [Google Scholar]
- Thul PJ, Åkesson L, Wiking M, Mahdessian D, Geladaki A, Ait Blal H, Alm T, Asplund A, Björk L, Breckels LM, Bäckström A, Danielsson F, Fagerberg L, Fall J, Gatto L, Gnann C, Hober S, Hjelmare M, Johansson F, Lee S, Lindskog C, Mulder J, Mulvey CM, Nilsson P, Oksvold P, Rockberg J, Schutten R, Schwenk JM, Sivertsson Å, Sjöstedt E, Skogs M, Stadler C, Sullivan DP, Tegel H, Winsnes C, Zhang C, Zwahlen M, Mardinoglu A, Pontén F, von Feilitzen K, Lilley KS, Uhlén M, Lundberg E. A subcellular map of the human proteome. Science. 2017;356:eaal3321. doi: 10.1126/science.aal3321. [DOI] [PubMed] [Google Scholar]
- Vert JP, Foveau N, Lajaunie C, Vandenbrouck Y. An accurate and interpretable model for siRNA efficacy prediction. BMC Bioinformatics. 2006;7:520. doi: 10.1186/1471-2105-7-520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, van der Walt SJ, Brett M, Wilson J, Millman KJ, Mayorov N, Nelson ARJ, Jones E, Kern R, Larson E, Carey CJ, Polat İ, Feng Y, Moore EW, VanderPlas J, Laxalde D, Perktold J, Cimrman R, Henriksen I, Quintero EA, Harris CR, Archibald AM, Ribeiro AH, Pedregosa F, van Mulbregt P, SciPy 1.0 Contributors SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vissers LELM, Gilissen C, Veltman JA. Genetic studies in intellectual disability and related disorders. Nature Reviews. Genetics. 2016;17:9–18. doi: 10.1038/nrg3999. [DOI] [PubMed] [Google Scholar]
- Walhout AJ, Sordella R, Lu X, Hartley JL, Temple GF, Brasch MA, Thierry-Mieg N, Vidal M. Protein interaction mapping in C. elegans using proteins involved in vulval development. Science. 2000;287:116–122. doi: 10.1126/science.287.5450.116. [DOI] [PubMed] [Google Scholar]
- Wan C, Borgeson B, Phanse S, Tu F, Drew K, Clark G, Xiong X, Kagan O, Kwan J, Bezginov A, Chessman K, Pal S, Cromar G, Papoulas O, Ni Z, Boutz DR, Stoilova S, Havugimana PC, Guo X, Malty RH, Sarov M, Greenblatt J, Babu M, Derry WB, Tillier ER, Wallingford JB, Parkinson J, Marcotte EM, Emili A. Panorama of ancient metazoan macromolecular complexes. Nature. 2015;525:339–344. doi: 10.1038/nature14877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z-Y, Leushkin E, Liechti A, Ovchinnikova S, Mößinger K, Brüning T, Rummel C, Grützner F, Cardoso-Moreira M, Janich P, Gatfield D, Diagouraga B, de Massy B, Gill ME, Peters AHFM, Anders S, Kaessmann H. Transcriptome and translatome co-evolution in mammals. Nature. 2020;588:642–647. doi: 10.1038/s41586-020-2899-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- White-Cooper H. Tissue, cell type and stage-specific ectopic gene expression and RNAi induction in the Drosophila testis. Spermatogenesis. 2012;2:11–22. doi: 10.4161/spmg.19088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu Y, Hu X, Li Z, Wang M, Li S, Wang X, Lin X, Liao S, Zhang Z, Feng X, Wang S, Cui X, Wang Y, Gao F, Hess RA, Han C. Transcription factor RFX2 is a key regulator of mouse spermiogenesis. Scientific Reports. 2016;6:20435. doi: 10.1038/srep20435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wuchty S, Oltvai ZN, Barabási AL. Evolutionary conservation of motif constituents in the yeast protein interaction network. Nature Genetics. 2003;35:176–179. doi: 10.1038/ng1242. [DOI] [PubMed] [Google Scholar]
- Wyrwoll MJ, Temel ŞG, Nagirnaja L, Oud MS, Lopes AM, van der Heijden GW, Heald JS, Rotte N, Wistuba J, Wöste M, Ledig S, Krenz H, Smits RM, Carvalho F, Gonçalves J, Fietz D, Türkgenç B, Ergören MC, Çetinkaya M, Başar M, Kahraman S, McEleny K, Xavier MJ, Turner H, Pilatz A, Röpke A, Dugas M, Kliesch S, Neuhaus N, GEMINI Consortium. Aston KI, Conrad DF, Veltman JA, Friedrich C, Tüttelmann F. Bi-allelic Mutations in M1Ap are a frequent cause of meiotic arrest and severely impaired spermatogenesis leading to male infertility. American Journal of Human Genetics. 2020;107:342–351. doi: 10.1016/j.ajhg.2020.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wyrwoll MJ, Köckerling N, Vockel M, Dicke A-K, Rotte N, Pohl E, Emich J, Wöste M, Ruckert C, Wabschke R, Seggewiss J, Ledig S, Tewes A-C, Stratis Y, Cremers JF, Wistuba J, Krallmann C, Kliesch S, Röpke A, Stallmeyer B, Friedrich C, Tüttelmann F. Genetic architecture of azoospermia-time to advance the standard of care. European Urology. 2023;83:452–462. doi: 10.1016/j.eururo.2022.05.011. [DOI] [PubMed] [Google Scholar]
- Xia B, Yan Y, Baron M, Wagner F, Barkley D, Chiodin M, Kim SY, Keefe DL, Alukal JP, Boeke JD, Yanai I. Widespread transcriptional scanning in the testis modulates gene evolution rates. Cell. 2020;180:248–262. doi: 10.1016/j.cell.2019.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiong B, Yang Y, Fineis FR, Wang JP. DegNorm: normalization of generalized transcript degradation improves accuracy in RNA-seq analysis. Genome Biology. 2019;20:75. doi: 10.1186/s13059-019-1682-7. [DOI] [PMC free article] [PubMed] [Google Scholar]