Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2023 Mar 1;19(3):e1010656. doi: 10.1371/journal.pgen.1010656

The RNA binding protein DND1 is elevated in a subpopulation of pro-spermatogonia and targets chromatin modifiers and translational machinery during late gestation

Victor A Ruthig 1,2,*, Talia Hatkevich 2, Josiah Hardy 2, Matthew B Friedersdorf 3, Chloé Mayère 4,5, Serge Nef 4,5, Jack D Keene 3, Blanche Capel 2,*
Editor: Erez Raz6
PMCID: PMC10010562  PMID: 36857387

Abstract

DND1 is essential to maintain germ cell identity. Loss of Dnd1 function results in germ cell differentiation to teratomas in some inbred strains of mice or to somatic fates in zebrafish. Using our knock-in mouse line in which a functional fusion protein between DND1 and GFP is expressed from the endogenous locus (Dnd1GFP), we distinguished two male germ cell (MGC) populations during late gestation cell cycle arrest (G0), consistent with recent reports of heterogeneity among MGCs. Most MGCs express lower levels of DND1-GFP (DND1-GFP-lo), but some MGCs express elevated levels of DND1-GFP (DND1-GFP-hi). A RNA-seq time course confirmed high Dnd1 transcript levels in DND1-GFP-hi cells along with 5-10-fold higher levels for multiple epigenetic regulators. Using antibodies against DND1-GFP for RNA immunoprecipitation (RIP)-sequencing, we identified multiple epigenetic and translational regulators that are binding targets of DND1 during G0 including DNA methyltransferases (Dnmts), histone deacetylases (Hdacs), Tudor domain proteins (Tdrds), actin dependent regulators (Smarcs), and a group of ribosomal and Golgi proteins. These data suggest that in DND1-GFP-hi cells, DND1 hosts coordinating mRNA regulons that consist of functionally related and localized groups of epigenetic enzymes and translational components.

Author summary

DND1, a RNA-binding protein, has emerged as a regulator of germ cell identity in vertebrates. In the absence of Dnd1, germ cells are lost or tend to differentiate into teratomas or somatic cell types in mice and zebrafish, pointing to an essential role of this post-transcriptional regulator in maintaining the unique identity of germ cells. Using a mouse line carrying a GFP-tag on endogenous Dnd1 (DND1GFP), we discovered distinct populations of male germ cells expressing high and low levels of DND1 during late gestation. RNA sequencing identified differences between these populations. DND-GFP-hi cells initially express elevated levels of transcripts encoding pluripotency genes which they later down-regulate. Transcripts encoding translational machinery and epigenetic regulators are higher in DND1-GFP-hi cells throughout gestation and frequently map as RNA-binding targets from DND1 RNA regulons as discovered by RNA immunoprecipitation (RIP)-sequencing. MGCs expressing high levels of DND1 are much less likely to express a phagocytic signal on their cell surface and are more likely to express transcripts for genes associated with the transition to pro-spermatogonial fate.

Introduction

Dead end 1 (Dnd1) is a RNA binding protein (RBP) that is essential for the maintenance of the germ cell population in multiple vertebrate species [1, 2]. For example, in many inbred mouse strains, most germ cells carrying the Dnd1Ter/Ter mutation undergo apoptosis soon after specification [3, 4]. On some genetic backgrounds, Dnd1Ter/Ter male germ cells (MGCs) that escape apoptosis often differentiate into teratomas with many embryonic cell types represented [3, 5]. This is broadly consistent with findings in zebrafish, where some dnd1 mutant germ cells die, while others undergo differentiation to somatic lineages during migration to the gonad [6]. These studies suggest that DND1 plays a critical role in maintaining germ cell identity by promoting survival and suppressing differentiation to somatic fate.

Suppression of somatic fate occurs through epigenetic mechanisms in C. elegans [7, 8] and is believed to occur through similar mechanisms in mammals. During migration in mammals, germ cell DNA is depleted of methylation [9, 10] through a combination of active oxidation via the Tet enzymes and passive loss of 5mC through rapid proliferation [11]. At embryonic day (E)8.5, the stable repressive mark H3K9me2 is eliminated and H3K27me3 is acquired at bivalent developmental loci [12, 13]. The capacity to generate embryonic germ cell lines is high at E12.5, suggesting that many embryonic differentiation pathways are “open” at this stage.

During the last third of fetal life until a few days after birth (E14.5 to about postnatal day (P)2), MGCs enter a long period of cell cycle arrest in G0 [1416]. During this period MGCs undergo extensive epigenetic reprogramming that correlates with a reduction in their potential to give rise to teratomas [5] or to embryonic germ cell lines [17]. By E16.5, when Dnmt3a and Dnmt3l are transcribed, re-methylation begins [1820]. By birth, DNA is hypermethylated [19, 20]. ATAC-seq and ChIP-seq for 6 histone marks documented chromatin accessibility changes between E13.5-P1, mainly associated with the acquisition of repressive histone marks including H3K27me3 and H3K9me2/3 [21] often around transposable elements [20]. It is not known exactly when these changes occur or whether they are uniform across the population, but several lines of evidence suggest that DND1 is involved.

First, a prior study showed that DND1 promotes cell cycle arrest in G0 by mediating transcript stabilization of the methyltransferase Ezh2 which deposits the repressive mark H3K27me3 at the genetic locus of the activating cell cyclin Ccnd1 [22]. In our previous study of the Dnd1Ter/Ter mutants, we found that transcripts for multiple epigenetic modifiers were down-regulated relative to levels in wild type MGCs [23]. However, it was not clear whether this was a direct or indirect effect of DND1 loss. By conducting digestion optimized RNA immunoprecipitation (DO-RIP)-seq against DND1 in HEK293 cells, we identified the DND1 binding motif, DAUBAW ([A/G/U]AU [C/G/U]A[A/U]), and many in vitro DND1 targets [23]. Our binding motif and in vitro target data complimented a similar study on DND1 [2]. Although some targets in HEK293 cells were chromatin modifiers, many pluripotency and epigenetic regulators significantly altered in Dnd1Ter/Ter mutants are not expressed in HEK293 cells [23].

Functionally, DND1 has been shown to stabilize some target transcripts by blocking microRNA-mediated destruction [24], and destabilize other target transcripts by recruitment to processing (P) bodies as part of the CNOT (carbon catabolite repressor 4-negative on TATA) complex [1, 2]. Target transcript degradation via the CNOT complex relies on protein-protein interactions between DND1 and a second RNA binding protein NANOS2, which can modify DND1 target selection [1, 25]. Recent structural analysis has shown that motif recognition prompts a two-step RNA binding mechanism resulting in oppositional binding by the two tandem RNA recognition motifs (RRM) of DND1 in a clamp fashion, while the double-stranded recognition motif (DSRM) remains exposed, putatively to be bound by a protein partner such as NANOS2 [26].

Absence of an available antibody against DND1 interfered with analysis of its role in MGC fate commitment in vivo. To solve this problem, we used CRISPR/Cas9 editing to establish a knock-in mouse line expressing a Dnd1GFP fusion allele from the endogenous locus. In both male and female fetal development, levels of DND1GFP expression varied among germ cells [27], consistent with heterogeneity in the germ cell population reported by multiple groups [2830]. At E14.5, E16.5, and E18.5 we detected DND1GFP-hi (DND1-GFP-hi) and DND1GFP-lo (DND1-GFP-lo) expressing populations. We show that these populations have distinct transcriptomes with high levels of phosphatidylserine on the surface of DND1-GFP-lo cells, suggesting reduced viability. Using the GFP-tag on DND1 to perform RNA immunoprecipitation (RIP) assays at E14.5, E16.5, and E18.5, we show that DND1 binds multiple RNAs encoding cell cycle genes, epigenetic regulators and genes associated with the Golgi and vesicle transport, all of which are present at higher levels in DND1-GFP-hi cells. These data suggest that high levels of DND1 play a direct role in promoting viability and reprogramming of MGCs during cell cycle arrest by binding to and regulating the translation of transcripts that encode critical chromatin modifiers and translational machinery.

Results

During late gestation subpopulations of MGCs express high or low levels of Dnd1

We previously used CRISPR/Cas9 to target the endogenous allele of Dnd1 and insert eGfp fused in frame to the 5’ end of the Dnd1 gene (Dnd1GFP). This allele revealed heterogeneity in male germ cells (MGCs) during stages preceding mitotic arrest (E11.5-E14.5) [27], consistent with results from other labs [2830]. We used DND1-GFP to isolate MGCs at later fetal stages. Using fluorescence activated cell sorting (FACS) we isolated DND1-GFP-positive cells at E14.5, E16.5, E18.5, and P3. To our surprise FACS revealed distinct DND1-GFP-lo and DND1-GFP-hi populations at E14.5, E16.5 and E18.5 (Fig 1A and 1B). At E14.5, the DND1-GFP-hi population accounts for <1% of MGCs. Their numbers peak at E16.5, when DND1-GFP-hi cells represent 12% of total MGCs, and decline again at E18.5, when the DND1-GFP-hi population accounts for only 5% of MGCs (S1 Table). FACS analysis of MGCs from Oct4-GFP testes at E18.5 did not reveal high and low subpopulations (Fig 1B). As MGCs remain in G0 cell cycle arrest throughout these stages, differences in DND1 levels cannot be related to cell cycle differences.

Fig 1. Fetal male germ cells express low or high levels of DND1-GFP and Dnd1.

Fig 1

A: FACS histogram showing one DND1-GFP-negative, and two DND1-GFP-positive populations, representing DND1-GFP-lo and DND1-GFP-hi cells at E18.5. B: FACS quantification for the number of DND1-GFP-lo and DND1-GFP-hi germ cells from E14.5, E16.5, E18.5, P3 DND1-GFP testes (n = 3,4,4,3 respectively). FACS analysis of germ cells from E18.5 Oct4-GFP testes (n = 4) did not show GFP low and high subpopulations. C: Whole mount confocal imaging of unstained freshly dissected E16.5 and E18.5 DND1-GFP testes showing heterogenous endogenous DND1-GFP fluorescence. Scale bar = 20 μm. D: scRNA-seq UMAP for XX and XY germ cell populations from E9.0-E16.5 [32] and violin plot showing DND1 expression levels in clusters 0–10. Red box surrounds E16.5 male germ cell populations represented by clusters 9 and 10. Corresponding violin plots show that clusters 9 and 10 contain cells expressing low and high levels of Dnd1 respectively. E: re-analysis of E16.5 male germ cell scRNA-seq from [31]. UMAP colored to show low to high levels of Dnd1 expression (grey to blue to pink). Violin plot showing levels of Dnd1 expression in clusters 0, 1, 2, 3, and 4, with a level of 2.0 denoted by red broken line. Quantification of total number of Dnd1-expressing cells with expression ≥2.0 = 711 (of 3798 total DND1+ cells).

To discount the possibility that high and low levels of DND1 were an artifact of cell dissociation and isolation via FACS, we investigated whether MGCs expressing high and low levels of DND1 were visible within live testis cords. We mounted E16.5 and E18.5 Dnd1GFP freshly dissected testes on a slide and used confocal microscopy to qualitatively explore levels of endogenous DND1-GFP. DND1-GFP-lo and DND1-GFP-hi cells were evident in all images captured from live samples (Figs 1C and S1) and fixed samples (S2A Fig). To quantitatively assess endogenous DND1-GFP levels, we quantified the endogenous DND1-GFP in DND1-GFP-hi (S2B Fig) and DND1-GFP-lo (S2C Fig) cells in fixed testes, normalizing to DNA (S2D Fig). DND1-GFP-lo cells appear as a continuum. However, in quantitative and FACS analysis, a distinct DND1-GFP-hi population emerged. In both live and fixed testis cord samples, we found that DND1-GFP-hi cells were sometimes found in clusters and were sometimes dispersed in cords. To rule out the possibility that high/low expression was an artifact of the knock-in, we investigated 2 single cell datasets from E16.5 gonads [31, 32]. Both datasets defined at least one subpopulation of DND1-GFP-hi cells based on native expression from the locus (Fig 1D and 1E). To further ensure that the DND1-GFP-hi population is not due to an over-expression of the modified Dnd1GFP allele, we isolated DND1-GFP-lo and DND1-GFP-hi populations from Dnd1GFP/+ heterozygotes and performed RT-qPCR for expression from each allele. Similar to our previous analyses, we see a higher mean expression of Dnd1 relative to Gapdh in the DND1-GFP-hi cells, as compared to the DND1-GFP-lo cells (S2E Fig, p-value ≤0.01). However, when comparing mean expression of eGfp relative to Dnd1, we found no significant difference in DND1-GFP-hi vs. DND1-GFP-lo cells (S2F Fig, p-value >0.05).

Although DND1-GFP-lo and DND1-GFP-hi cells share 62–84% of their transcriptomes, DND1-GFP-hi cells express higher levels of Dnd1, pluripotency markers and components of the OxPhos pathway

To determine whether the transcriptome differed between cells expressing high and low levels of DND1, we isolated 4 independent replicates of paired DND1-GFP-lo and DND1-GFP-hi cells by FACS at each of two timepoints (E16.5 and E18.5, S1 Table) and performed RNA-seq. Transcriptome data within each set of 4 samples (E16.5 DND1-GFP-lo, E16.5 DND1-GFP-hi, E18.5 DND1-GFP-lo, or E18.5 DND1-GFP-hi) was highly correlated with all replicate samples from that set, scoring a Pearson correlation coefficient of ≥0.88. Overall, among all high and low samples the lowest level of correlation was 0.76, indicating similarities among all these germ cell populations (S3 Fig). Global comparison of the hiloRNA-seq datasets at E16.5 revealed that out of the expression of 11,937 total transcripts, 7,271 (61%) transcripts are shared between DND1-GFP-lo and DND1-GFP-hi cells. At E18.5, the populations are more similar transcriptionally: out of 12,208 total transcripts, 10,198 (84%) are shared (Fig 2A). Shared genes include transcriptional regulators, metabolite interconversion enzymes, and protein modifiers and modulators.

Fig 2. Although the germ cell transcriptome is mostly shared between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5, the subset of significant differentially expressed genes indicate functional differences between DND1-GFP-lo and DND1-GFP-hi cells.

Fig 2

A: Global shared (pink) versus DND1-GFP-lo (blue) or DND1-GFP-hi (yellow; red) specific expression at E16.5 and E18.5 from hiloRNA-seq analysis. B: Euler plots of E16.5 DND1-GFP-lo (blue), E16.5 DND1-GFP-hi (yellow), E18.5 DND1-GFP-lo (dark blue), E18.5 DND1-GFP-hi (red) show global shared and significant differentially expressed genes. C: Expression of Dnd1 declined between E16.5 and E18.5 but was consistently higher in Dnd1-GFP-hi cells. Other markers associated with the germline stem cell population behaved in a similar manner. D: Transcripts of genes associated with the oxidative phosphorylation pathway were elevated in DND1-GFP-hi cells. E: Examples of intercellular signaling genes that were expressed higher in Dnd1-GFP-lo cells. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5: not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

However, despite the high level of similarity, there were significant differences in the transcriptomes of DND1-GFP-lo and DND1-GFP-hi cells at both E16.5 and E18.5. A large group of differentially expressed genes (DEGs) were detected (expression TPM ≥5.0 in lo or hi cells, L2FC ≥1.0 or <-1.0 for hi versus lo, p-value ≤0.05) in DND1-GFP-lo or DND1-GFP-hi cells at E16.5 and E18.5, with a higher number of differentially expressed genes in DND1-GFP-hi cells (Fig 2B and S2 Table). At E16.5 DND1-GFP-lo cells have 1,032 over-expressed genes, while DND1-GFP-hi cells have 3,634 over-expressed genes. At E18.5, only 154 genes were significantly elevated in DND1-GFP-lo cells compared to 1,856 genes significantly elevated in DND1-GFP-hi cells (Fig 2B). Only 35 differentially expressed genes were shared between DND1-GFP-lo cells at E16.5 and E18.5. Whereas in DND1-GFP-hi cells, 1,709 differentially expressed genes were common between E16.5 and E18.5 (Fig 2B), showing a strong conservation of the differentially expressed genes in DND1-GFP-hi cells over time.

As a first analysis, we investigated whether Dnd1 and other markers of germline stem cells are expressed at different levels in DND1-GFP-lo and DND1-GFP-hi cells. DND1-GFP-hi cells showed approximately 7-8-fold higher levels of Dnd1 transcripts, confirming expectations, although levels of Dnd1 expression decline between E16.5 and E18.5 (Fig 2C). At E16.5, DND1-GFP-hi cells still had higher levels of transcripts of other markers associated with the germline stem cell population, including Nanos2, Dazl, Ddx4, Dppa3, Tfap2c, and Sox2, all of which sharply declined between E16.5 and E18.5. Of these genes, only levels of Dnd1, Dazl and Ddx4 remain elevated at E18.5 (Fig 2C). Gene ontology (GO) analysis on differentially expressed genes (DEGs) identified differences in energy metabolism between the DND1-GFP-lo and DND1-GFP-hi populations. Most striking of these metabolic differences was the increase of oxidative phosphorylation in DND1-GFP-hi cells (Fig 2D). DND1-GFP-lo cells expressed higher levels of transcripts of genes associated with signaling, such as Bmp5 and Wnt4 (Fig 2E). In support of this DEG data, re-analysis of the E16.5 MGC scRNA-seq data [31] showed that many of the same genes related to epigenetic and pluripotency regulation (Brdt, Dnmt3a, Mael, Piwil4) had elevated expression in the same clusters as DND1-hi cells (clusters 0,1; S4 Fig), whereas other DEGs that were more highly expressed in our DND1-GFP-lo cells (Kdm6b, Lefty2) had elevated expression in the same cluster as DND1-lo cells (cluster 2; S4 Fig). Some epigenetic and pluripotency regulation genes highly expressed in DND1-GFP-hi cells (Dazl, Dppa5a) did not follow this pattern in the scRNA-seq data (S4 Fig).

DND1-GFP-lo and DND1-GFP-hi cells show differences in presence of phosphatidylserine on their cell surface

Several characteristics have been reported to distinguish populations of MGCs at E14.5 [30, 32]. We used immunofluorescent staining and confocal imaging to qualitatively correlate expression of a few of these markers with high or low expression of DND1-GFP at E14.5 (S5A–S5C Fig) and E18.5 (S5D–S5F Fig). For example, expression of the Line1 translation product, ORF1p, was associated with MGCs on a male differentiation trajectory [30]. However, ORF1p did not correlate with high or low levels of DND1-GFP (S5A and S5D Fig). Based on the metabolic differences we detected at the transcriptome level, we tested whether DND1-GFP-lo and DND1-GFP-hi cells showed higher levels of mitochondria using an antibody against TOMM20, but we found no clear differences between populations (S5B and S5E Fig), and levels of cPARP did not consistently correlate with high or low levels of DND1 at E14.5 or E18.5 (S5C and S5F Fig).

Gene ontology (GO) analysis of significantly differentially expressed genes that were over-expressed in either DND1-GFP-lo or DND1-GFP-hi cells identified apoptosis and cell death related genes in both cell populations (S3 Table). However, while DND1-GFP-hi cells favored genes related to the “Intrinsic Apoptotic Signaling Pathway” (S3 Table), DND1-GFP-lo cells favored genes related to the “Extrinsic Apoptotic Signaling Pathway” (S3 Table). To explore differences in cell death mechanisms between the two populations, we examined if DND1-GFP-lo cells had an increase in phosphatidylserine on the outer plasma membrane. Phosphatidylserine was shown to mark spermatogonia for phagocytosis by Sertoli cells in the adult testis [3335]. To detect this membrane marker, we incubated dissociated E16.5 testes cells with a AF647-fluorescent conjugated antibody against Annexin V, which has high affinity for phosphatidylserine, and performed FACS analysis for Annexin V and DND1-GFP. DND1-GFP-lo cells, which dramatically over-express Anxa5 only at E16.5, exhibit a significantly higher (p<0.0001) geometric mean of Annexin V fluorescence (Fig 3A and 3B). By comparing DND1-GFP-lo and DND1-GFP-hi cells to Annexin V signal (Fig 3C), we found that 70.0% of DND1-GFP-lo cells were Annexin V-positive, whereas only 3.5% of DND1-GFP-hi cells were Annexin V-positive (Fig 3D, p = 0.0014). Moreover, DND1-GFP-lo cells represented a mean of 99.25% of all Annexin V positive cells and just 37% of all Annexin V-negative cells (Fig 3D, p = 0.0008).

Fig 3. DND1-GFP-lo cells express more phosphatidylserine on their outer membrane, as seen through Annexin V fluorescence.

Fig 3

A: Representative profiles of Annexin V fluorescence in E16.5 DND1-GFP-hi (yellow) and E16.5 DND1-GFP-lo (sky blue) populations. B: Geometric mean of Annexin V fluorescence in DND1-GFP-lo and DND1-GFP-hi cells. C: Representative fluorescence profile of DND1-GFP and Annexin V. Q1 (quadrant 1) = DND1-GFP-lo cells + Annexin V-positive. Q2 = DND1-GFP-hi cells + Annexin V-positive. Q3 = DND1-GFP-hi cells + Annexin V-negative. Q4 = DND1-GFP-lo cells + Annexin V-negative. D: Percent of DND1-GFP-lo cells and DND1-GFP-hi cells that are Annexin V-positive. E: Percent of total Annexin V-positive and negative cells that are DND1-GFP-lo cells. A-E: Technical replicates = 4, biological replicates > 2 fetal testes. P value: ****<0.0001, *** = 0.0008, ** = 0.0014.

Global analysis revealed that DND1-GFP-hi cells express higher levels of chaperones, chromatin/ histone regulators, and genes associated with translation

To determine whether transcripts that differed between DND1-GFP-lo and DND1-GFP-hi cells belonged to specific categories or protein classes, significantly over-expressed gene lists were generated by stage and DND1 expression level (up in: E16.5 DND1-GFP-lo, E16.5 DND1-GFP-hi, E18.5 DND1-GFP-lo, or E18.5 DND1-GFP-hi). Over-expression gene lists were submitted to PANTHER for grouping by protein class [36, 37]. This approach resolved signatures that were strikingly different between DND1-GFP-lo and DND1-GFP-hi cells at both time points (Fig 4A). PANTHER revealed that DND1-GFP-lo cells showed higher levels of genes associated with defense/immunity (Skint7, Skint8, Hsh2d, Cle18a), intercellular signaling (Bmp5, Tgfa, Wnt4, Fgf10), transmembrane signal receptors (Amhr2, Igflr1, Ddr2, and Prir) and transporters (Rhag, Catsper3, Kcnmb4, Slc’s) at both timepoints (Figs 4A, 2E and S6). In contrast, transcripts encoding chaperones, chromatin binding/regulation, nucleic acid metabolism, and translational proteins were elevated in DND1-GFP-hi cells at both timepoints (Figs 4A, S7 and 5A–5C).

Fig 4. Expression of some protein class categories are significantly different in DND1-GFP-hi vs DND1-GFP-lo cells and consistent between E16.5 and E18.5.

Fig 4

A: PANTHER analysis of significant differentially expressed genes in E16.5 and E18.5 DND1-GFP-lo cells, and E16.5 and E18.5 DND1-GFP-hi cells. Results displayed as protein class categories by dataset and graphed with the number of genes in each category corresponding to the size of the circle. Protein classes distinguishing DND1-GFP-lo cells at both stages include “Defense and Immunity”, “Transmembrane Signal Receptor” and “Intercellular Signal Molecule” (noted with black vertical line). Protein classes distinguishing DND1-GFP-hi cells at both stages include “Chaperone”, “Chromatin Binding/Regulation”, “Nucleic Acid Metabolism”, and “Translational” (noted with black vertical line). B: Heat map of chromatin / epigenetic regulators showing: L2FC in DND1-GFP-hi vs DND1-GFP-lo cells from hiloRNA-seq (E16.5, E18.5).

Fig 5. In DND1-GFP-hi cells, most transcripts for proteins involved in RNA binding, ribosomal subunits, and Golgi vesicle transport are either shared between E16.5 and E18.5, or unique to E16.5.

Fig 5

Venn diagrams showing temporal distribution (E16.5 (yellow), E18.5 (red)) of genes in three GO biological processes significantly over-represented in DND1-GFP-hi cells compared to DND1-GFP-lo cells: (A) RNA binding protein genes, (B) ribosomal subunit genes, and (C) Golgi vesicle transport genes. Box-and-whisker plots show expression for example genes in each category. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5: not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

In most cases, differences between DND1-GFP-lo and DND1-GFP-hi cells were more pronounced at E16.5 with expression levels converging at E18.5. We identified 757 transcripts significantly converged between DND1-GFP-lo and DND1-GFP-hi cells: 419 transcripts decreasing in DND1-GFP-hi cells and 338 transcripts increasing in DND1-GFP-hi cells to approach E18.5 DND1-GFP-lo levels. Based on GO analysis in g:Profiler decreasing transcripts were primarily associated with metabolic processes, whereas increasing transcripts were primarily associated with cell migration and signaling.

Despite the convergence of some transcripts, many epigenetic regulators maintain higher expression levels (TPM≥5.0, log2-fold change (L2FC)≥1.0, p-val<0.05) in DND1-GFP-hi cells throughout G0. This group included two members of the BET-BRD (bromo and extra terminal domain) family, two Dnmt (DNA methyltransferase) members, three Hdac (histone de-acetylase) members, four repressive members of the Kdm family (lysine demethylase), four members of the Smarc (matrix-associated, actin dependent chromatin regulators) family, and five components of the SAGA complex (Trrap, Kat2a, Taf5l, Taf6l, Taf9), a highly conserved histone deubiquitinase and histone acetyltransferase complex. Genes involved in chromosome stability were also elevated in DND1-GFP-hi cells (Fig 4B). For example, four members of the Tudor family involved in transposon repression (Fig 4B) and Piwil1, Piwil2, and Piwil4, which mediate the piRNA pathway, showed higher expression in DND1-GFP-hi cells (Fig 5A).

In addition, genes encoding proteins involved in RNA metabolism and translation were expressed at higher levels in DND1-GFP-hi cells. The majority expressed at E18.5 were also expressed at E16.5 (Figs 4A and 5A–5C). We focused on transcripts that encoded genes classified as RNA binding proteins, ribosomal subunit genes, and Golgi vesicle transport that were highly significant at both E16.5 and E18.5. RNA binding proteins included Mael, Piwil4, Rbfox2, and an exportin (Xpo5) (Fig 5A). Many genes encoding ribosomal subunits (including Mrp and Rp family members) were over-expressed in DND1-GFP-hi cells (Fig 5B), as well as a large group of transcripts encoding proteins involved in Golgi vesicle transport (including Exoc3, Pfdn1, Sys1, and Tmed9) (Fig 5C).

RNA immunoprecipitation sequencing (RIP-seq) identified mRNA targets of DND1 at E14.5, E16.5, and E18.5

One goal of producing the Dnd1GFP knock-in line was to use the tagged protein for RNA immunoprecipitation sequencing (RIP-seq) analysis from germ cells in vivo. Previous results from analysis of a DND1 tagged allele in HEK293 cells, or in vitro with ESCs or derived germline stem cell lines identified targets of DND1 [2, 5, 22, 23, 38, 39]. However, these experiments missed in vivo targets expressed specifically in MGCs during the last third of fetal development and could not address the dynamics of the in vivo system. We recently used RIP-RT-qPCR in a proof-of-principle experiment to assay for a group of predicted targets of DND1 in vivo [27]. To extend this preliminary analysis, we isolated MGCs from 3 stages of development, E14.5 (the beginning of G0), E16.5 (a midpoint in G0), and E18.5 (just prior to birth), and performed RNA immunoprecipitation using an antibody against GFP [4042]. After performing initial quality control checks and analysis (see Materials and Methods; S8 and S9A Figs) we used differential analysis between paired RNA input and immunoprecipitation (IP) samples to determine significant enrichment/non-enrichment of transcript binding targets of DND1 (S9B Fig and S4 Table). Quantification and comparison of the targets at each timepoint produced an Euler plot [43] showing that 17% of targets are conserved at all timepoints (E14.5, E16.5, E18.5) with only 13% unique to E14.5, 16% unique to E16.5, 48% unique to E18.5, and a limited group of targets (≤5%) shared by only two stages (Fig 6A). The full lists of targets at each timepoint (S2 Table) were submitted for gene ontology analysis in g:Profiler [44]. Out of the list of 363 biological processes collectively ascribed to target lists at each time point (S5 Table), 109 categories were present at all three timepoints (S5 Table).

Fig 6. RIP-seq reveals conservation of many DND1 targets at all 3 stages.

Fig 6

A: Euler plot of temporal distribution (E14.5 (pink), E16.5 (green), E18. (orange)) of significantly enriched transcript targets of DND1. B: Bubble plot from g:Profiler analysis of significantly enriched transcript targets of DND1 showing change in number of targets (Y-axis) for prominent GO biological processes related to cell cycle and metabolism (upper), and cell movement and ECM-cytoskeleton organization (lower) the -log10 P-value of each GO biological process is denoted by bubble size with developmental stages progressing along the X-axis. C: Bar graphs showing target enrichment as a function of L2FC at E14.5, E16.5, E18.5. RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****). D: number of significantly enriched transcript targets at E14.5, E16.5, E18.5 for GO biological processes identified in hiloRNA-seq analysis (Fig 4A): translation, RNA binding, ribosomal subunit, Golgi vesicle transport. E: Heatmap of significantly enriched transcript targets at E14.5, E16.5, E18.5 for the GO biological process cell cycle.

Significant terms (adjusted p-value ≤0.01) were analyzed in a bubble plot depicting the number of DND1 targets (Y-axis) that were part of a given GO term at each stage (X-axis) with the significance (-log10 adjusted p-value) represented by the size of the bubble. This analysis demonstrated that the number of target transcripts in a GO category often both increased over time and increased in significance score as a category (Fig 6B). Thematically many of these GO terms were related to regulation of cell differentiation through control of cell cycle or metabolism (Fig 6B, upper plot), and cell movement or intra/extra-cellular structure (Fig 6B, lower plot). Transcript targets in each of these three categories include Bmp1 & Pard3 (regulation of cell differentiation through cell cycle control), Nox4 & Atp2b4 (metabolism), and Lmna & Mmp2 (cell movement or intra/extra-cellular structure) (Fig 6C). Many transcripts encoding translational proteins, RNA binding proteins, ribosomal subunits, and Golgi vesicle transport were expressed at higher levels in DND1-GFP-hi cells (Fig 5A–5C). To investigate whether these same GO categories were over-represented among DND1 targets, the GO-derived gene lists were used in a SQL database to identify genes that were also targets of DND1. Transcripts for many genes associated with translation, RNA binding, and Golgi vesicle transport were targets of DND1, especially at E16.5 and E18.5. However, despite the abundance of ribosomal subunit genes differentially expressed between DND1-GFP-lo and DND1-GFP-hi cells, very few ribosomal subunit transcripts were targets of DND1 (Fig 6D). LaminA (Lmna) was one of a small number of transcripts that showed the most significant association with DND1 at E14.5 (Fig 6C). Transcripts associated with the cell cycle tended to show highest association with DND1 at E16.5, whereas transcripts associated with metabolic process, intracellular organization, and cell motility showed an increasing association with DND1 from E14.5 to E18.5 and were highly represented among the 367 targets that were only significant at E18.5.

Cell cycle regulation is critical in fetal male germ cell development. The resultant g:Profiler category of cell cycle contained a list of target genes sometimes with oppositional roles in the cell cycle (Fig 6E). This included genes involved with positive regulation of cell cycle (Atrx, Numa1, Pold1), negative regulation of cell cycle (Cenpf, Ctnnb1, Knl1), and the DNA damage checkpoint (Cul4a, Ddb1, Topbp1). Transcript targets in these three cell-cycle sub-categories did not present a global pattern of enrichment in any category. There were genes that were targeted by DND1 only later in the time course (Ctnnb1, Pam, Topbp1), and others that peaked as targets at E16.5 (Eif4g1, Atrx, Trrap). The three strongest targets in this cell cycle sub-analysis (Mid1, Mycbp2, Trrap) were most significantly associated with DND1 at E16.5 (Fig 6E). Mid1 (Midline 1), a member of the TRIM (tripartite motif) family, is an E3 uibiquitin ligase that regulates microtubule anchoring as part of cytokinesis [45]. Mycbp2 (MYC Binding Protein 2) and Trrap (Transformation/Transcription Domain Associated Protein) are both MYC binding proteins that are linked to E2 activity and transcriptional activation [4654].

Many epigenetic regulators that are differentially expressed dependent on the level of DND1 are also targets of DND1

Of the 869 DND1 transcript targets identified in our analysis, 164 were found in E16.5 DND1-GFP-hi cells (18.87%), 81 in E18.5 DND1-GFP-hi cells (9.32%), 81 in E16.5 DND1-GFP-lo cells (9.32%), and 6 in E18.5 DND1-GFP-lo cells (0.01%). A major common class between hiloRNA-seq and RIP-seq were genes related to epigenetic regulation, which we previously found to be significantly disrupted in germ cells carrying the Ter mutation (Dnd1Ter/Ter) [23]. We explored the overlap between epigenetic regulators that were differentially expressed in the hiloRNA-seq dataset (usually over-expressed in DND1-GFP-hi cells) (Fig 4B) and enriched as in vivo targets of DND1 at E14.5, E16.4 and E18.5 (L2FC>0.5, p-val<0.05) (S10A Fig). Brd3 and Brdt are both over-expressed in DND1-GFP-hi cells, but only Brd3 maps as a target. The DNA methyltransferases, Dnmt3a and Dnmt3l, are higher in DND1-GFP-hi cells, and both enrich as targets of DND1 at one or more stages (Figs 4B, S10A and S10B). Hdac2 and Hdac6 are over-expressed in DND1-GFP-hi cells, but only Hdac4, Hdac5, and Hdac6 enrich as targets of DND1. Several H3K4 demethylases Kdm1a, Kdm1b, Kdm3a, Kdm5b, and Kdm6b enrich as targets of DND1, but only the first four of these, which are associated with repression, are higher in DND1-GFP-hi cells (Figs 4B, S10A and S10B). Although four of the SWI/SNF-related matrix-associated, actin dependent regulators (Smarcs) are higher in DND1-GFP-hi cells, only two of these (Smarc1 and Smarcd2) enrich as targets of DND1 (S10A and S10B Fig). Four members of the Tdrd family of transposon regulators are higher in DND1-GFP-hi cells, and all four of these (Tdrd1, Tdrd5, Tdrd7, Tdrd9) are targets of DND1. Multiple members of the highly conserved histone deubiquitinase and histone acetyltransferese SAGA complex (Trrap, Kat2a, Taf5l, Taf6l, and Taf9) are higher in DND1-GFP-hi cells, but only one central co-factor (Trrap) is a consistent target of DND1. As a further step, we expanded our SQL database to encompass our previously published Dnd1Ter-seq dataset [23]. Nearly all of these chromatin regulators are genes that fail to be up-regulated in DND1Ter/Ter mutant germ cells by E14.5 (S10A Fig). Only one histone deacetylase (Hdac10) is significantly upregulated in DND1Ter/Ter mutant germ cells and, notably, Hdac10 is not a target of DND1 and not significantly differentially expressed in E16.5 DND1 cells (S10A Fig). Similar three-way analysis was conducted to identify DND1 transcript targets, differentially expressed between DND1-GFP-hi and DND1-GFP-lo cells that were dysregulated in Dnd1Ter/Ter mutant germ cells related to extracellular matrix organization and the cell cycle. Unlike epigenetic regulation, three-way analysis for extracellular matrix organization and cell cycle genes had less consistent regulatory patterns across all three datasets (S2 Table).

Two recent publications have reported groups of genes associated with the transition of pro-spermatogonia to spermatogonia identity [16, 55]. Part of this transitional process was reported to be characterized by expression of genes related to cell migration, a process that was strongly represented in the gene ontology analysis of our RIP-seq data. We repeated our two-way analysis with the family of collagens, many of which were strongly enriched as DND1 targets (S11A Fig), to identify any patterns in our hiloRNA-seq dataset (S11B Fig). Notably there was very limited significant differential expression of these collagens between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 or E18.5. Five out of six of the genes reported by the Wilkinson group to be associated with the intermediate pro-spermatogonia identity [55] showed higher expression in DND1-GFP-hi cells, and all six enriched as binding targets of DND1 (S11C Fig). Of the genes identified by the Zhao group [16] (Zxdc, Tie1, Myog, Mef2c and Krt18), Zxdc, Myog and Krt18 are expressed at higher levels in E16.5 and E18.5 DND1-GFP-hi cells, while Tie1 and Mef2c are expressed similarly between DND1-GFP-lo and hi cells. Notably, only Zxdc enriched as a target and only at E16.5 and E18.5 (S11D Fig).

Three Y-linked genes (Eif2s3y, Uty, Smcy) were significantly differentially expressed between DND1-GFP-lo and DND1-GFP-hi cells at E16.5. The Y-linked translation initiation factor, Eif2s3y, which is known to be important in spermatogonial proliferation and differentiation [56, 57], was elevated in DND1-GFP-hi cells at E16.5, while both Y-linked epigenetic regulators, Uty and Smcy, were elevated in DND1-GFP-lo cells at E16.5 and E18.5. We investigated the expression of the X-linked homologs (Eif2s3x, Utx, Smcx) in DND1-GFP-hi and DND1-GFP-lo cells and if any of these sex chromosome genes enriched as targets. None of the X-linked genes (Eif2s3x, Utx, Smcx) were significantly differentially expressed between DND1-GFP-lo and DND1-GFP-hi cells. All the epigenetic regulators enriched as targets at least at one stage, while Uty, Smcx, and Smcy enriched as targets at all stages (Utx, at E14.5 only) (S12 Fig).

Discussion

We show that there are two populations of MGCs expressing low and high levels of DND1 during G0 arrest in late fetal life. These two populations were first discovered by FACS analysis of MGCs expressing the DND1 allele tagged with GFP at the endogenous locus (Dnd1GFP). These findings were supported by live imaging, quantitative microscopy, and analysis of single cell sequencing datasets from 2 independent labs, which also reveal MGC populations expressing high and low levels of native Dnd1 [31, 32]. DND1-GFP-lo and DND1-GFP-hi populations do not correlate with any markers of MGC heterogeneity previously reported [30]. However, our analyses revealed that the transcriptomes of DND1-GFP-lo cells are highly correlated between E16.5 and E18.5, and significantly different from DND1-GFP-hi cells (which are also highly correlated between both stages). The finding that higher levels of phosphatidylserine on the surface of E16.5 MGCs (as determined by Annexin V detection) is correlated with low levels of DND1 suggests that DND1-GFP-hi cells may be favored to survive and give rise to the pro-spermatogonial population. Prominent among genes that show higher expression in DND1-GFP-hi cells, and also enrich as targets of DND1, is a group of genes encoding epigenetic regulators and translational machinery. Some of these novel direct targets have been previously shown to be dysregulated in the presence of mutated Dnd1 (Dnd1Ter/Ter) [23], while other targets identified by us (Tdrd7, Rock2, Mylk, Anxa5) have previously been reported as targets of DND in zebrafish [24, 38]. These results suggest that DND1 coordinates RNA regulons of related epigenetic factors that control reprogramming of MGCs during cell cycle arrest.

Defects in RNA metabolism are likely to underlie teratoma formation in the presence of the Dnd1Ter/Ter mutation. The eukaryotic translation initiation factor, Eif2s2, has been shown to interact genetically with DND1. Although full deficiency for Eif2s2 was embryonic lethal, partial deficiency protected against teratomas by restricting germ cell proliferation and differentiation [58, 59]. Several EIFs enrich as targets of DND1 (Eif3a, Eif4g3, Eif3c). Other EIFs were significantly differentially expressed between DND1-GFP-lo and DND1-GFP-hi cells (Eif3c, Eif3f, Eif2s2, Eif2s3y). Eif3f has been shown to interact with DND1 in the context of Xenopus germline development [60]. Eif2s3y has been shown to be important in spermatogonial differentiation and proliferation after birth [56, 57]. There is also Increasing evidence suggesting that ribosomal subunits have specificity [61]. It will be very interesting to determine whether certain of the ribosomal subunits expressed at high levels in DND1-GFP-hi cells have specific translational roles in MGCs. Overall, the interaction of DND1 with these and other components of eukaryotic translational machinery needs further study.

Consistent with the idea that DND1-GFP-hi cells are a select population of MGCs, at E16.5, they express much higher levels of genes associated with the germline stem cell population including Dazl, Ddx4, Nanos2, Dppa3, Tfap2c, Pou5f1, and Sox2, although transcripts for all these genes decline by E18.5. DND1-GFP-hi cells show higher expression of genes associated with oxidative phosphorylation, and also express elevated levels of other RNA binding proteins relevant to male fate specification such as Dazl and Nanos2. DAZL is associated with the transition to MGC differentiation [62] and NANOS2 is well documented to work in complex with DND1 to target transcripts of pluripotency genes and activating cyclins for degradation [1, 25, 63]. Dazl has also been shown to be targeted by NANOS2 as part of negative regulation and commitment to MGC fate [64].

DND1-GFP-hi cells express higher levels of epigenetic regulators. It is well established that MGCs undergo extensive epigenetic reprogramming during G0 arrest [1821]. Among the genes elevated in DND1-GFP-hi cells were four members of the Tdrd family of transposon regulators, several DNA methyltransferases, four repressive chromatin regulators of the Kdm family, four members of the SWI/SNF-related Smarc family, and multiple members of the histone deubiquitinase and histone acetyltransferese SAGA complex. Many of these gene transcripts are also targets of DND1, suggesting that DND1 regulates their availability during cell cycle arrest. The regulation of multiple epigenetic enzymes by DND1 suggests that one reason that Dnd1Ter/Ter mutant germ cells tend to give rise to teratomas is a failure to epigenetically silence somatic gene expression.

MGCs undergo hypertranscription from the stage when they begin to populate the gonad until at least E15.5 [65]. Hypertranscription in MGCs is regulated by MYC proteins that trigger release of paused RNA Pol II [65]. Interestingly, the Pol II complex is activated by the addition of BRD4 [66, 67], which is expressed at higher levels in DND1-GFP-hi cells and maps as a target of DND1 at E16.5 and E18.5. DND1 could also function upstream of MYC proteins and other transcription factors. Trrap, a highly conserved and well documented regulator of MYC and other transcription factors, is a target of DND1 that peaks at E16.5 (Figs 6E, S10A and 10B) [4653]. It is possible that hypertranscription itself is under partial DND1 control.

The large number of RBPs present in MGCs may be essential for the post-transcriptional regulation of the many transcripts that accumulate in MGCs as a result of hypertranscription [65]. A novel finding in this study was the large group of transcripts encoding proteins associated with translation that are up-regulated in DND1-GFP-hi cells and are also bound by the DND1 protein. DND1 binds many transcripts related to vesicle fusion and Golgi function, and many of these transcripts are also up-regulated in DND1-GFP-hi cells. These data suggest that DND1 not only regulates transcript stability but may also control when or where transcripts are translated, acting as a hub for post-transcriptional regulation in the germline as suggested by Gross-Thebing and Raz [68]. It is unclear whether or how transcripts bound by DND1 and sequestered in various germ granules are transferred to the ribosome for translation.

Results from this study raise many questions. First, what is responsible for low or high levels of DND1? Since this study began, we have seen variations in the proportion of DND1-GFP-lo and DND1-GFP-hi cells based on FACS analysis. Are DND1 levels in MGCs regulated by surrounding somatic cells or are levels of DND1 intrinsically regulated? TFAP2c (which is highly upregulated in DND1-GFP-hi cells) was reported to regulate DND1 levels in induced primordial germ cells (PGCs) [69]. TFAP2c responds to α-ketogluterate levels, which suggests a metabolic connection. However, we found no correlation with levels of mitochondria, which we used as a surrogate for energy differences in DND1-GFP-lo and DND1-GFP-hi cells.

Second, acting in its role as a RNA-binding protein, DND1 is reported to stabilize transcripts in some instances [5, 24] but has also been shown to destabilize transcripts through cooperation with NANOS2 as part of the CNOT complex to bring transcripts to the processing (P) bodies for degradation [1, 2, 70]. The diverse group of DND1 targets reported here, including positive regulators of the cell cycle, which we predict are sequestered and degraded by DND1 during G0, as well as epigenetic enzymes, which we predict are protected by DND1, is consistent with more than one role for DND1. It is possible that DND1 acts to degrade transcripts at one stage, but to protect transcripts at another stage. However, the failure to find a consistent pattern in transcripts bound by DND1 at early or late stages suggests that it can perform either function at all stages. It is unclear whether this depends on characteristics of the target, its cellular localization, and/or other interacting proteins such as NANOS2 [1, 2, 70]. Cross examination of our RIP-seq and hilo-RNA-seq datasets revealed a limited number of DND1 targets that were significantly up-regulated or down-regulated from E16.5 to E18.5 specifically in DND1-GFP-lo or DND1-GFP-hi cells. Certain of these transcripts could be stabilized or destabilized in a DND1-GFP-lo or DND1-GFP-hi specific manner. However up-regulation could also be the product of hypertranscription occurring during this period. Future molecular follow up is needed to untangle expression changes due to transcriptional vs. post-transcriptional regulation.

Third, we have not yet determined whether populations of DND1-GFP-lo and DND1-GFP-hi cells are episodic or clonal. While DND1-GFP-hi cells are sometimes found in clusters within cords (as might be expected of clones), sometimes they are highly dispersed. In theory, episodic levels of DND1 could be compatible with sequestration of transcripts in high cells, followed by release for translation in low cells. Information about whether translational differences exist between the two populations is the next goal.

The biological significance of DND1-GFP-hi and DND1-GFP-lo cells is not yet entirely clear. A large proportion of MGCs are lost by the time of birth [30, 71, 72]. Consistent with this, both DND1-GFP-hi and DND1-GFP-lo cells express genes related to cell death and apoptosis. Although mechanism(s) regulating MGC apoptosis or protection from apoptosis during G0 are largely unexplored, our data suggest a preference for intrinsic apoptosis in G0 DND1-GFP-hi cells and extrinsic apoptosis in G0 DND1-GFP-lo cells. This is consistent with our findings of elevated phosphatidylserine in 16.5 DND1-GFP-lo cells, presumably, to mark cells for phagocytosis. Overall, these data suggest that MGCs expressing low levels of DND1 are more likely to be eliminated through an extrinsic cell death pathway that is independent of canonical apoptotic machinery, such as cPARP. Such germ cell death (GCD) mechanisms have been described in the Drosophila testes [73, 74].

One attractive possibility is that DND1-GFP-hi cells acquire the epigenetic modifications that facilitate differentiation as pro-spermatogonia, consistent with the accumulation of transcripts for epigenetic regulators in DND1-GFP-hi cells. In support of the idea that DND1-GFP-hi cells are a leading population of pro-spermatogonia, at E16.5 and E18.5, some markers recently reported to define a novel intermediate pro-spermatogonia identity are expressed at higher levels in DND1-GFP-hi cells [16, 55], but the pattern is inconsistent. Future work will aim to resolve these questions.

Materials and methods

Ethics statement

All mice were housed in accordance with National Institutes of Health (NIH) guidelines, and experiments were conducted with the approval of the Duke University Medical Center (DUMC) Institutional Animal Care and Use Committee (IACUC # A126-17-05).

Colony maintenance and timed matings

The Dnd1EGFP knock-in allele was produced in the Duke Transgenic Core by CRISPR/Cas9 pronuclear injection into B6SJLF1/J (JAX stock# 100012) embryos as described [27]. Knock-in founders were bred with CD1 mice (Charles River Labs strain code 022). Progeny heterozygous for the knock-in were intercrossed to produce homozygous offspring, which were intercrossed to maintain a homozygous hybrid colony (Dnd1EGFP/EGFP, hereafter written simply as Dnd1GFP). Approximately every six generations, homozygous males were outbred with CD1 females. New heterozygous progeny were selected then intercrossed to restore a homozygous hybrid strain colony. For timed matings, homozygotes were intercrossed, females were checked for plugs, and staged as day E0.5 if positive.

Fresh unfixed and unstained whole mount imaging

Fetuses from timed matings were dissected at E16.5 and E18.5 to collect testes. After excision, testes were washed in PBS followed by mounting on a slide (Thermo Fisher Scientific, cat# 12-550-15) in polyvinyl alcohol (PVA) with a coverslip (VWR, cat# 48366–227). Mounted samples were directly imaged using a SP8 Leica DM6000CS upright confocal microscope with affiliated Leica software (LAS AF 3). All tile stitching was done with Leica software, remaining image processing was done with FIJI (version 2.3.0/1.53f) [75] and Adobe Photoshop (version 23.1.0).

Whole mount immunofluorescence on fixed samples

Whole mount immunofluorescence was performed for stages E14.5 and E18.5 as described [27, 76, 77]. Briefly, for E14.5 and E18.5, embryonic gonads were dissected from embryos and fixed in 4% paraformaldehyde in PBS at 4°C overnight on a rocking platform. All samples were washed in PBS for 30–60 minutes, rocked for 15 minutes at room temperature at each step of a dehydration gradient (25%, 50%, 75% and 100% methanol in PBS), then stored in 100% fresh methanol at -20°C. Prior to immunofluorescent staining, all samples were incubated at room temperature while rocking for 15 minutes at each step of the rehydration gradient (75%, 50% and 25% methanol in PBS), and washed twice in PBS for 15 minutes each. Samples were incubated rocking at room temperature in a permeabilizing solution (2% Triton X-100 in PBS) for 1 hour followed by incubation in blocking solution for 1 hour, then transferred to 4°C in blocking solution (1% Triton X-100, 3% BSA, and 10% Horse Serum in PBS) with primary antibodies overnight. The following day, samples were washed by rocking in wash solution (1% Triton X-100 in PBS) three times at room temperature for 30 minutes each, then incubated at 4°C in blocking solution with secondary antibodies and Hoechst (Invitrogen, cat# H3570) overnight. All antibodies are listed in Table 1. On day 3, samples were washed three times in wash solution rocking at room temperature for 30 minutes each, mounted in PVA (aka mowiol) mounting medium, and cover-slipped for imaging on a SP8 Leica DM6000CS upright confocal microscope with affiliated Leica software (LAS AF 3). All tile stitching was done with Leica software, remaining image processing was done with FIJI (version 2.3.0/1.53f) [75] and Adobe Photoshop (version 23.1.0).

Table 1. Antibodies.

Against Raised In Concentration Vendor Catalog # Reference
GFP Chicken 1:5000 Abcam ab13970 [27]
GCNA Rat 1:1000 Abcam ab82527 [78]
ORF1p Rabbit 1:100 Abcam ab216324 [30]
TOMM20 Rabbit 1:1000 Abcam ab186734 [79]
cPARP Rabbit 1:100 Cell Signaling Technology 9544 [30]
Against Raised In Concentration Vendor Catalog # Conjugate
Chicken Donkey 1:1000 Jackson Immuno Research 703-545-155 AF488
Rat Donkey 1:1000 Jackson Immuno Research 712-165-153 Cy3
Rabbit Donkey 1:1000 Jackson Immuno Research 711-605-152 AF647
Annexin V N/A 1drop/200μL Invitrogen R37175 AF647

Quantification of DND1-GFP levels

The method for quantifying DND1-GFP-hi and DND1-GFP-lo cells (S2 Fig) was adapted from a previous publication [80]. Testes were fixed as previously described (refer to “Whole Mount Immunofluorescence on fixed samples” section). In fixed whole mount testes, endogenous GFP signal and Hoechst (Invitrogen, cat# H3570) staining were used for quantification in each experiment. A total of 50 individual MGCs over two biological replicates were randomly selected, and using the GFP signal, were individually cropped along the X and Y planes with FIJI [75]. Using the GFP signal and the Hoechst signal as visual guides, a projected z-stack was created for each MGC using the sum of intensities for the z-slices containing only the selected MGC. After manually creating a mask to outline the selected cell, DND1 fluorescent intensity for each MGC was obtained using GFP fluorescence mean intensity. Next, to account for imaging heterogeneity throughout the tissue, DND1-GFP fluorescent intensities were normalized to the average of the mean intensity of Hoechst staining, which is assumed to be equal across all MGCs. Because DND1-GFP-hi cells represent ~12% of the population in E16.5 MGCs and 50 MGCs were analyzed, we determined that the cluster of 5 cells (12% of 50 cells is approximately 5 cells) with the highest expression were among the DND1-GFP-hi population.

scRNA-seq re-analysis

Demultiplexed fastq files of E16.5 male germ cell scRNA-seq data originally published in Law et al. 2019 (GSE124904) were downloaded to the Duke Computing Cluster (DCC) from the Sequence Read Archive (SRA) using the SRA toolkit (n = 3: SRR8427519, SRR8427520, SRR8427521) [31, 81]. SRA-based names for I1, R1, and R2 fastq files were changed to be compatible with Cell Ranger [82]. The Cell Ranger Count pipeline (read alignment, UMI counting) was performed on each biological replicate referenced to the mm10 mouse genome (GRCm38, v3.0.0). The Cell Ranger Aggregation pipeline was performed on the molecule_h5 files for each biological replicate to generate a single integrated E16.5 dataset for downstream analysis. Quality control and clustering was performed on the aggregated data using Seurat [83]. Stability indexing with Clustree [84] was used to choose a resolution of 0.1. The resultant clustered data was used for analysis and visualization in Seurat and exported to Loupe Browser (10x Genomics) for exploratory visualization.

RNA Immunoprecipitation (RIP) and RNA Extraction

RNA immunoprecipitation (RIP) experiments were performed as previously described with adjustments to account for the very low input from fetal testes [27, 41, 42]. Briefly, fetal testes at either E14.5, E16.5, or E18.5 were dissected away from the mesonephros and collected. Pooled testes (6–16 testes total for a given biological replicate) were washed in DEPC PBS and incubated at 37°C in fresh clean Gibco TrypLE (Thermo Fisher Scientific, catalog #12563029) for 7 or 10 minutes depending on stage (E14.5, or E16.5 and E18.5 respectively). TrypLE was inactivated with ice cold DEPC PBS and aspirated off testes. Testes were then resuspended in ice cold DEPC PBS and pipetted vigorously to dissociate cells. Cells were pipetted through a 0.32μm cell strainer into a fresh Eppendorf tube and pelleted by 4°C centrifugation at 1,000 x g for 10 minutes. The cell pellet was resuspended in 20μL complete Poly Lysis Buffer (PLB) [41], incubated on ice for 5 minutes and stored at -80°C for at least 24 hours. The PLB cell suspension was then thawed on ice and cellular debris was pelleted by 4°C centrifugation at 21,000 x g for 10 minutes. After centrifugation 3μL of supernatant was collected for RNA extraction as input sample. The remaining 17μL of supernatant was used for immunoprecipitation via incubation at 4°C on a tube rotator in NT2 Buffer [41] with NT2 Buffer washed GFP-Trap magnetic agarose beads (ChromoTek, catalog #gtma-20) for 2–4 hours. After immunoprecipitation, beads were pelleted, washed in NT2 buffer and pelleted again for RNA extraction as immunoprecipitation (IP) sample. RNA from input and IP samples was extracted using the RNeasy Micro Kit per the manufacturers protocol (Qiagen, catalog #74004). Extracted RNA from input and IP samples at E14.5, E16.5, and E18.5 was used in RNA-sequencing.

RIP-seq analysis

After RNA immunoprecipitation and RNA extraction, paired input and immunoprecipitation (IP) samples were submitted for sequencing. Paired biological replicates per time point were sent for quality control (E14.5, n = 5 biological replicate pairs, E16.5, n = 4 biological replicate pairs, and E18.5, n = 5 biological replicate pairs; total of 28 samples). RNA quality control was measured on a Qubit (Thermo Fisher Scientific). All samples passed quality control and the 3 biological replicate pairs per time point with the lowest contaminant score and the most RNA were selected for library preparation. Sample libraries were prepared with SMART-Seq v4 Ultra Low Input RNA Kit (Takara Clontech Kit Cat# 63488). Libraries were sequenced on a NoveSeq 6000 (Illumina) as 50bp paired-end reads (~46 M reads/sample input, ~50 M reads/sample IP). Reads were mapped to the mm10 (GRCm38) mouse genome using Salmon with default settings (~78% mapping input, ~57% mapping IP) [85]. Mapped reads were annotated using the Ensemble Mus musculus reference (version 79). Read abundance from Salmon (TPM) values were used in post-annotation quality checks: correlation matrix, principal component analysis (PCA), scatter plots. Paired input and IP samples were then evaluated for transcript enrichment in IP vs input using DeSeq2 [86]. The resulting data table was used for refining target transcript gene lists in SQL [87, 88].

Fluorescence Associated Cell Sorting (FACS)

Sorting was performed at the Duke Flow Cytometry Shared Resource core on a B-C Astrios Sorter. Initially, E18.5 DND1GFP testis samples compared with ovaries from female siblings used as negative controls, defined a gating protocol to distinguish between GFP negative, GFP positive-low, and GFP positive-high gonadal cells. The established gating protocol was used for all sorting experiments (E14.5, E16.5, E18.5, P3). Sorting was performed as described previously [27]. Briefly, stage E16.5 and E18.5 testes were collected as described in the RIP-RNA Extraction method section. However, after TrypLE treatment testes were dissociated in 3% BSA in DEPC PBS. After dissociation the cell suspension was passed through a 0.32μm cell strainer. Collected cells were then sorted on presence and absence of GFP (GFP positive and GFP negative) and level of GFP (GFP positive-low, and GFP positive-high). Cells were sorted directly into RLT buffer from RNeasy Micro Kit (Qiagen, catalog #74004). RNA extraction was performed as described in the RIP-RNA extraction method section.

hiloRNA-seq analysis

Sorts were performed as described in the Fluorescence Associated Cell Sorting (FACS) section. After FACS and RNA extraction, paired DND1-GFP-lo and DND1-GFP-hi samples were submitted for sequencing. Four independently collected paired biological replicates per time point were sent for quality control (4 DND1-GFP-hi and 4 DND1-GFP-lo paired samples, for both E16.5 and E18.5, 16 samples total). RNA quality control was measured on a TapeStation (Agilent Technologies) using High Sensitivity RNA ScreenTape (Agilent Technologies). All samples passed quality control. Sample libraries were prepared with SMART-Seq v4 Ultra Low Input RNA Kit (Takara Clontech Kit Cat# 63488). Libraries were sequenced on a NoveSeq 6000 (Illumina) as 50bp paired-end reads (~54 M reads/sample DND1-GFP-hi, ~50 M reads/sample DND1-GFP-lo). Reads were mapped to the mm10 (GRCm38) mouse genome using Salmon with default settings (~46% mapped DND1-GFP-hi, ~28% DND1-GFP-lo) [85]. Mapped reads were annotated using the Ensemble Mus musculus reference (version 79). Read abundance from Salmon (TPM) values were used in post-annotation quality checks: correlation matrix, principal component analysis (PCA). Paired DND1-GFP-lo and DND1-GFP-hi samples were then evaluated for significantly differentially expressed genes using DeSeq2 [86]. The resulting data table was used for refining gene lists in SQL [87, 88].

Super-series (hiloRNA-seq, RIP-seq, Dnd1Ter-seq) analysis

The resulting data tables contained expression (TMP) and differential gene expression (hi vs lo; E16.5 vs E18.5; L2FC, p-value) for hiloRNA-seq, and expression (TMP) and transcript enrichment (input vs IP; L2FC, p-value) for RIP-seq. The data tables were queried in SQL using expression, differential expression, and significance cutoffs (TMP>5, TMP>10, L2FC>1, L2FC<-1, p-value ≤0.05) to produce lists of noteworthy genes. The resultant gene lists were cross referenced for gene ontology and pathway analysis in g:Profiler [44] and PANTHER [89, 90]. As a secondary approach, full gene lists for specific biological processes were downloaded from the Gene Ontology Consortium [91, 92]. These full gene lists were cross-referenced with our data tables using expression, differential expression, and significance cutoffs filters to identify genes related to a specific biological process that were also significant in one or both data sets. As a final step we also cross-referenced full gene lists from the Gene Ontology Consortium in a three-way analysis that simultaneously queried hilo-RNA-seq, RIP-seq, and our previously published Dnd1Ter-seq data sets (GSE132719) [23].

Data processing, statistical analysis, and Visualization

All software used in the analysis is listed in Table 2. Computing heavy data processing was exclusively performed with SLURM on the Duke Computing Cluster and the Scientific Computing Unit at Weill Cornell Medicine. Computing light data processing was exclusively performed in RStudio on a personal computer [93]. Statistical analysis was performed using the included algorithms that come standard with Seurat (scRNA-seq), and DeSeq2 (RNA-seq) [83, 86]. Initial data visualization utilized included packages. Advanced downstream visualization was either done with EulerR [94, 95] or in GraphPad Prism (version 9.3, La Jolla, CA, USA). Data are presented as the mean ± standard error of the mean. Differences were considered significant when the p-value was ≤ 0.05 (*), ≤ 0.01 (**), ≤ 0.001(***), or ≤ 0.0001 (****). GO identification numbers for GO terms used to derive gene lists are reported in Table 3. Full data tables for hiloRNA-seq and RIP-seq data were also exported together into excel where a rudimentary lookup function accessibly queries the data on a per gene basis and plots basic visualizations of the resultant data (S4 Table). For most final data figures the Ito-Okabe Color Universal Design (CUD) color palette was used for visual inclusivity [96, 97].

Table 2. Software.

Program Version Reference
Biobase 2.52.0 [98]
BiocGenerics 0.38.0 [98]
10x Genomics Cell Ranger 3.1.0 [82]
clustree 0.4.4 [84]
corrplot 0.90 [99]
DBI 1.1.1 [100]
dbplyr 2.1.1 [101]
DelayedArray 0.18.0 [102]
DESeq2 1.32.0 [86]
dplyr 1.0.7 [103]
eulerr 6.1.1 [94, 95]
ggplot2 3.3.5 [104]
gprofiler2 0.2.1 [44]
janitor 2.1.0 [105]
knitr 2.0 [106]
mixtools 1.2.0 [107]
openxlsx 4.2.4 [108]
patchwork 1.1.1 [109]
plyr 1.8.6 [110]
R 4.1.1 [111]
RColorBrewer 1.1.2 [112]
readr 2.0.2 [113]
RSQLite 2.2.8 [87]
Rstudio 2021.09.1 [93]
Salmon 1.3.0 [85]
Seurat 4.0.5 [83]
SRAtoolKit 2.9.6 [81]
stats 4.1.1 [111]
stringr 1.4.0 [114]
tidyverse 1.3.1 [115]
tximport 1.20.0 [116]
umap 0.2.7.0 [117]
viridisLite 0.4.0 [118]
10x Genomics Loupe Browser 4.1.0 10x Genomics, proprietary

Table 3. GO Biological Process Gene Lists.

GO-BP Name GO:#
Histone Modification GO:0016570
Regulation Chromatin Organization GO:1902275
Glycolysis GO:0061621
Oxidative Phosphorylation GO:0006119
Translation GO:0006412
RNA binding GO:0003723
Ribosomal Subunit GO:0044391
DNA Damage Checkpoint Signaling GO:0000077
Cell Cycle GO:0007049
Extracellular Matrix Organization GO:0030198
Regulation of Histone Ubiquitination GO:0033182
Golgi Vesicle Transport GO:0048193
Cytoskeleton Organization GO:0007010
Chromosome Organization GO:0051276
DNA Conformation Change GO:0071103
Protein Modification Process GO:0036211
Cell Differentiation GO:0030154
Intrinsic Apoptotic Signaling Pathway GO:0097193
Extrinsic Apoptotic Signaling Pathway GO:0097191

Annexin V Analyses of phospholipid phosphatidylserine

Fetal E16.5 DND1GFP testes were collected as described in the RIP-RNA Extraction method section. However, after TrypLE treatment, testes were dissociated in 200μL of 1x Annexin-binding buffer (Annexin-binding buffer, 5x concentrate, Invitrogen, #V13246). For a GFP-negative control, a E16.5 DND1GFP fetal limb was dissociated using the same method. One drop of Annexin V, Alexa Fluor 647 Ready Flow Reagent (Invitrogen, #R37175) was placed in each sample and incubated in the dark for 15 minutes. Following incubation, the cell suspensions were passed through a 0.32μm cell strainer and were analyzed on a SH800 Cell Sorter by Sony Biotechnology with a 100-μm microfluidic sorting chip and using Fluorescent Laser 2 (FL2) and Fluorescent Laser 4 (FL4) for detection of DND1-GFP and Annexin-647/APC-A, respectively.

To determine the geometric mean of Annexin-647/APC-A fluorescence for DND1-GFP-hi and DND1-GFP-lo populations, we first calculated laser compensation using FlowJ v10.8.1 Software (BD Life Sciences). Next, we gated to exclude dead cells (“Live” population, S13A Fig) by using the FSC-A and SSC-A parameters. Taking this population, we identified the DND1-GFP-lo and DND1-GFP-hi populations using the CompFL2A::EGFP-A (x-axis) and cell count parameters (y-axis). GFP fluorescent ranges of 104−104.6 and 104.7–105.1 defined the DND1-GFP-lo and -hi populations, respectively. The geometric mean of Annexin V-647 fluorescence was determined for each population.

To determine the percent of cells that are Annexin V-647 positive/negative in each population, we first plotted EGFP fluorescence by the 647/APC-A fluorescence (x and y axes, respectively, S13B Fig), identifying 647/APC-A positive populations (fluorescence of >103.6). After creating this 647/APC-A positive threshold and isolating only the DND1-GFP-hi and DND1-GFP-lo cells, quadrants were created to identify the cells that were DND1-GFP-lo and Annexin V positive (quadrant 1), DND1-GFP-hi and Annexin V positive (quadrant 2), DND1-GFP-hi and Annexin V negative (quadrant 3), and DND1-GFP-lo and Annexin V negative (quadrant 4). Cell counts within these quadrants were then used to for subsequent calculations.

Reverse-transcription quantitative PCR (RT-qPCR) of Dnd1GFP/+ DND1-GFP-lo and DND1-GFP-hi populations

Fetal E16.5 Dnd1GFP/+ testes were collected from a CD1 female mated with a Dnd1GFP/GFP male and were dissociated into a single cell suspension as described in the RIP-RNA Extraction method section. The cell suspensions were passed through a 0.32μm cell strainer DND1-GFP-lo and DND1-GFP-hi and cells were collected on a SH800 Cell Sorter by Sony Biotechnology with a 100-μm microfluidic sorting chip using Fluorescent Laser 2 (FL2) for detection of DND1-GFP. Similar to Annexin V analyses, GFP fluorescent ranges of 104−104.6 and 104.7–105.1 defined the DND1-GFP-lo and -hi populations, respectively. Cells were pelleted by centrifugation at 500 x g for 10 minutes in 4° C, and RNA was extracted using RNeasy Micro Kit per the manufacturer’s protocol (Qiagen, catalog #74004). RNA was reverse transcribed into cDNA using the Verso cDNA synthesis kit (Thermo Fisher Scientific, catalog #AB4100A). 1μL of cDNA was used in three separate 10μL PCR reactions, as previously described [27], with primers for DND1, eGFP, and GAPDH, respectively (see below for primer sequences). The PCR program is as follows: initial denaturation for 3 minutes at 95°C followed by 35 cycles of 30 s at 95°C, 30 s at 58°C, and 30 s at 72°C, with a final extension of 5 min at 72°C. RT-PCR products were run on 2.5% agarose gel for 1 hour at 90 V. RT-qPCR products were visualized using a Bio-Rad Imager and Quantity One software (Bio-Rad Laboratories). To determine mean intensities, product bands were subjected to densitometry analysis using ImageJ [75]. For each sample, 4 technical replicates were performed, and biological replicates of ≥4 fetal testes were used. Primers have been previously published [27], and sequences are listed in Table 4.

Table 4. RT-qPCR Primers.

Gene Direction Sequence Reference
eGfp Forward CTT CTT CAA GTC CGC CAT GCC [27]
eGfp Reverse GGT GTT CTG CTG GTA GTG GTC [27]
Gapdh Forward GGT GAA GGT CGG TGT GAA CG [5, 27]
Gapdh Reverse CTC GCT CCT GGA AGA TGG TG [5, 27]
Dnd1 Forward GCC CTG GTA GAA GGT CAG TCA C [27]
Dnd1 Reverse GCC CTG TTC CTA AAC ACT TGG TC [27]

Supporting information

S1 Table. Quantitative data from DND1-GFP Fluorescence Associated Cell Sorting (FACS).

(XLSX)

S2 Table. Complete lists for all significantly differentially expressed genes, and GO biological process gene list filtering for hiloRNA-seq and RIP-seq.

(XLSX)

S3 Table. Complete list of apoptosis and cell death terms, with gene Ensembl IDs from Biological Process Gene Ontology analysis in g:Profiler of E16.5 DND1-GFP-hi and DND1-GFP-lo cells.

(XLSX)

S4 Table. Combined DND1-seq (hiloRNA-seq, RIP-seq) dataset with graphic output.

(XLSX)

S5 Table. Complete RIP-seq g:Profiler results.

(XLSX)

S1 Fig. Fetal male germ cells visibly express low and high levels of DND1-GFP.

Whole mount confocal imaging of unstained freshly dissected A: E16.5 and B: E18.5 DND1-GFP testes showing heterogenous endogenous DND1-GFP fluorescence. Insets show cord region (white dotted box, a’ and b’) and rete testis region (yellow dotted box, a” and b”). All scale bars are 200μm. Scale for A and B in B, scale for a’, a”, b’, and b” in b”.

(TIF)

S2 Fig. Quantification of DND1-GFP in E16.5 MGCs.

A: View of fixed MGCs in a testis cord with DND1-GFP-hi cell marked by an arrow and a DND1-GFP-lo cell marked by a wedge. Scale bar = 50μm. B, C: Representative images showing a quantified DND1-GFP-hi cell (B) and a quantified DND1-GFP-lo cell (C) from an E16.5 testis. b’, c’: DND1-GFP (endogenous signal). b”, c”: DNA (Hoechst stain). Scale bar = 10μm. D: Quantification of DND1-GFP fluorescence in 50 randomly selected DND1-GFP MGCs from multiple testes cords in 2 separate mice. Dashed line demarcates the relative fluorescence boundary between DND1-GFP-lo cells and DND1-GFP-hi cells (below and above the line, respectively). E: Mean intensity of Dnd1 expression relative to Gapdh expression in DND1-GFP-lo (blue) and DND1-GFP-hi (yellow) cells from Dnd1GFP/+ heterozygotes. F: Mean intensity of eGfp expression relative to Dnd1 expression in DND1-GFP-lo and DND1-GFP-hi cells from Dnd1GFP/+ heterozygotes. E-F: Technical replicates = 4, biological replicates ≥ 4 fetal testes. P value: ** < 0.01, n.s. = not significant.

(TIF)

S3 Fig. Pearson correlation matrix of transcriptomes from DND1-GFP-lo and DND1-GFP-hi samples at E16.5 and E18.5.

The matrix shows that DND1-GFP-lo and DND1-GFP-hi cells are most similar to biological replicate samples, but all samples of male germ cells share high similarity.

(TIF)

S4 Fig. Genes expressed at higher or lower levels in DND1-GFP-hi cells often show a similar correlation in E16.5 sc-RNA-seq data from Law et al.

Re-analysis of E16.5 male germ cell scRNA-seq from [31]. Cluster 0 and Cluster 1 have higher levels of Dnd1 and Cluster 2 has lower levels of Dnd1 (also Fig 1E). Violin plots showing levels of other germ cell genes differentially expressed between DND1-GFP-lo and DND1-GFP-lo cells in scRNA-seq of E16.5 male germ cell from [31].

(TIF)

S5 Fig. Markers previously reported to be associated with the apoptotic or male germ cell determination pathway do not consistently correlate with low or high DND1-GFP.

Imaging from whole mount confocal microscopy on E14.5 DND1-GFP testes (A-C) and E18.5 DND1-GFP testes (D-F) for: ORF1p (A & D), TOMM20 (B & E), and CPARP (C & F) (red). All sections stained for DND1-GFP (GFP, green), germ cell nuclear antigen (GCNA, blue), and DNA (Hoechst, white). Blue triangles mark DND1-GFP-lo and red triangles mark DND1-GFP-hi germ cells. Scales in C and F are 50μm and apply to all images.

(TIF)

S6 Fig. Examples of genes from PANTHER protein classes “Transcriptional Regulation”, “Defense/Immunity”, “Transporters”, and “Transmembrane Signal Receptors” that were significantly differentially expressed in DND1-GFP-lo compared to DND1-GFP-hi cells at E16.5 and/or E18.5.

hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5: not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

(TIF)

S7 Fig. Chromatin modifiers are disproportionately over-expressed in DND1-GFP-hi cells with genes either shared between DND1-GFP-hi cells at E16.5 and E18.5 or exclusive to DND1-GFP-hi cells E16.5.

Euler plot of significant differentially expressed genes that are part of the GO biological process category “regulation of chromatin modification”. There are no genes differentially expressed in both E16.5 and E18.5 DND1-GFP-lo cells. Chart shows top 10 genes for each region of the Euler plot (for full list see S2 Table).

(TIF)

S8 Fig. Input biological replicates are most similar to each other across all time points, while immunoprecipitation (IP) biological replicates are most similar to each other within a time point.

Pearson correlation matrix of input and IP biological replicates at E14.5, E16.5, E18.5 used in RIP-seq with breakouts showing isolated correlations for input and IP biological replicates at each time point.

(TIF)

S9 Fig. All paired input and immunoprecipitation (IP) biological replicates at all time points show a subset of transcripts that graph as targets.

A: Scatter plots of log2(TPM) for paired input and immunoprecipitation (IP) biological replicates at E14.5, E16.5, E18.5. Guideline with slope of 1 is plotted to help delineate genes in quadrant I that are above the line, indicative of being a transcript target of DND1. B: Gene expression values (TPM) for input and IP biological replicates at E14.5, E16.5, E18.5 for highly expressed housekeeping genes that did not enrich as targets (Gapdh, Acta2), and a highly expressed gene that did enrich as a target (Lamb1).

(TIF)

S10 Fig. Many epigenetic regulators that are differentially expressed dependent on the level of DND1 are both targets of DND1 and fail to up-regulate in Dnd1Ter/Ter mutant germ cells.

A: Heat map of epigenetic regulators showing: L2FC in DND1-GFP-hi vs DND1-GFP-lo cells from hiloRNA-seq (E16.5, E18.5); enrichment of transcript as a DND1 target as a function of L2FC in RIP-seq (E14.5, E16.5, E18.5); and L2FC in Dnd1Ter/Ter mutant vs wild type germ cells from Dnd1Ter-seq (E14.5) from [23]. B: Expanded gene-level hiloRNA-seq and RIP-seq data for representatives from each epigenetic regulator family in Figs 4B and S10A. With the exception of Hdac4, transcript levels for all genes shown are higher in DND1-GFP-hi cells, and all genes except Brdt map as significant targets of DND1 at one or more stages. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5 and RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

(TIF)

S11 Fig. Collagens and genes associated with the novel intermediate pro-spermatogonia identity [16, 55] are DND1 targets, but only some are significantly differentially regulated in DND1-GFP-lo and DND1-GFP-hi cells.

A: Heat map of collagens showing L2FC in DND1-GFP-lo vs DND1-GFP-hi cells from hiloRNA-seq (E16.5, E18.5). B: Enrichment of transcript as DND1 targets as a function of L2FC in RIP-seq (E14.5, E16.5, E18.5). C, D: Expanded gene-level hiloRNA-seq and RIP-seq data for representative genes associated with (C) intermediate pro-spermatogonia identity as reported in [55], and (D) pro-spermatogonia to spermatogonial stem cell transition as reported in [16]. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5 and RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

(TIF)

S12 Fig. Y-linked and X-linked genes are differentially regulated in DND1-GFP-lo and DND1-GFP-hi cells and some enrich as DND1 targets.

Gene-level hiloRNA-seq and RIP-seq data for Y-linked (Eif2s3y, Uty, Smcy) and X-linked (Eif2s3x, Utx, Smcx) genes. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5 and RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

(TIF)

S13 Fig. Workflow of DND1-GFP and Annexin V FACS analysis.

A: Live cells were gated using FSC-A and SSC-A parameters, and DND1-GFP-lo cells and DND1-GFP-hi cells were identified with GFP fluorescence and cell count parameters. DND1-GFP-lo and DND1-GFP-hi populations were plotted by 647/APC-A (Annexin) fluorescence. B: Annexin V-positive threshold was determined by examining live cells for GFP and 647/APC-A fluorescence. Using this threshold, the DND1-GFP positive cells were divided into DND1-GFP-hi and DND1-GFP-lo, creating four quadrants.

(TIF)

Acknowledgments

We would like to thank the teams in the: Duke Light Microscopy Core Facility (NIH Shared Instrumentation grant 1S10RR027867-01), especially Dr. Lisa Cameron and Dr. Benjamin Carlson; Duke Cancer Institute Flow Cytometry Shared Resource, especially Dr. Mike Cook and Lynn Martinek; and Duke Center for Genomic and Computational Biology’s Sequencing and Genomic Technologies Shared Resource, especially Dr. Nicolas Devos. We are also grateful to past and present members of the Capel Lab, especially Jordan Batchvarov. The computations in this paper were run on HPC resources supported by the Duke Compute Cluster at Duke University and the Scientific Computing Unit at Weill Cornell Medicine.

Data Availability

The raw hiloRNA-seq and RIP-seq data is available on GEO (GSE197610) as a super-series. The complete script for all data analysis has also been deposited in GEO as a singular simple text file in Rmarkdown syntax along with all the supporting, intermediate, and Rstudio data files used in the analysis. These scripts are complete but contain redactions to protect the cybersecurity of the Duke Computing Cluster and the Scientific Computing Unit at Weill Cornell Medicine, and therefore will need adjustments to function on other computing clusters and with Rstudio sessions. The unprocessed matrix counts and processed Loupe File from the aggregated E16.5 scRNA-seq re-analysis have also been deposited on GEO (GSE197610) as part of the same super-series and are linked to the raw scRNA-seq data published in Law et al. 2020 which is also on GEO (GSE124904) https://doi.org/10.1038/s41467-019-10596-0. The Dnd1Ter-seq dataset published in Ruthig et al. 2019 was deposited with that publication on GEO (GSE132719) https://doi.org/10.1242/dev.175950.

Funding Statement

This work was supported by: the National Institutes of Health National Institute of General Medical Sciences (NIH-NIGMS) [F32GM129956] as salary support to VAR; a Duke Josiah Charles Trent Memorial Foundation grant to VAR; a Duke Cancer Institute (DCI) Pilot Grant to BC; the National Institutes of Health Eunice Kennedy Shriver National Institute of Child Health and Human Development (NIH-NICHD) [R37HD039963] to BC; and funding from the Duke Summer Research Opportunity Program (SROP) to JH. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Suzuki A, Niimi Y, Shinmyozu K, Zhou Z, Kiso M, Saga Y. Dead end1 is an essential partner of NANOS2 for selective binding of target RNAs in male germ cell development. EMBO Rep. 2016;17(1):37–46. doi: 10.15252/embr.201540828 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yamaji M, Jishage M, Meyer C, Suryawanshi H, Der E, Yamaji M, et al. DND1 maintains germline stem cells via recruitment of the CCR4-NOT complex to target mRNAs. Nature. 2017;543(7646):568–72. doi: 10.1038/nature21690 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cook MS, Coveney D, Batchvarov I, Nadeau JH, Capel B. BAX-mediated cell death affects early germ cell loss and incidence of testicular teratomas in Dnd1(Ter/Ter) mice. Developmental biology. 2009;328(2):377–83. doi: 10.1016/j.ydbio.2009.01.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sakurai T, Iguchi T, Moriwaki K, Noguchi M. The ter mutation first causes primordial germ cell deficiency in ter/ter mouse embryos at 8 days of gestation. Development, Growth & Differentiation. 1995;37(3):293–302. [DOI] [PubMed] [Google Scholar]
  • 5.Cook MS, Munger SC, Nadeau JH, Capel B. Regulation of male germ cell cycle arrest and differentiation by DND1 is modulated by genetic background. Development. 2011;138(1):23–32. doi: 10.1242/dev.057000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Gross-Thebing T, Yigit S, Pfeiffer J, Reichman-Fried M, Bandemer J, Ruckert C, et al. The Vertebrate Protein Dead End Maintains Primordial Germ Cell Fate by Inhibiting Somatic Differentiation. Developmental cell. 2017;43(6):704–15 e5. doi: 10.1016/j.devcel.2017.11.019 [DOI] [PubMed] [Google Scholar]
  • 7.Updike DL, Knutson AK, Egelhofer TA, Campbell AC, Strome S. Germ-granule components prevent somatic development in the C. elegans germline. Current biology: CB. 2014;24(9):970–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ciosk R, DePalma M, Priess JR. Translational regulators maintain totipotency in the Caenorhabditis elegans germline. Science. 2006;311(5762):851–3. doi: 10.1126/science.1122491 [DOI] [PubMed] [Google Scholar]
  • 9.Hajkova P, Erhardt S, Lane N, Haaf T, El-Maarri O, Reik W, et al. Epigenetic reprogramming in primordial germ cells. Mech Dev. 2002;117:15. [DOI] [PubMed] [Google Scholar]
  • 10.Maatouk DM, Kellam LD, Mann MR, Lei H, Li E, Bartolomei MS, et al. DNA methylation is a primary mechanism for silencing postmigratory primordial germ cell genes in both germ cell and somatic cell lineages. Development. 2006;133(17):3411–8. doi: 10.1242/dev.02500 [DOI] [PubMed] [Google Scholar]
  • 11.Yamaguchi S, Hong K, Liu R, Inoue A, Shen L, Zhang K, et al. Dynamics of 5-methylcytosine and 5-hydroxymethylcytosine during germ cell reprogramming. Cell Res. 2013;23(3):329–39. doi: 10.1038/cr.2013.22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Lesch BJ, Page DC. Poised chromatin in the mammalian germ line. Development. 2014;141(19):3619–26. doi: 10.1242/dev.113027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Seki Y, Hayashi K, Itoh K, Mizugaki M, Saitou M, Matsui Y. Extensive and orderly reprogramming of genome-wide chromatin modifications associated with specification and early development of germ cells in mice. Dev Biol. 2005;278:440–58. doi: 10.1016/j.ydbio.2004.11.025 [DOI] [PubMed] [Google Scholar]
  • 14.Western PS, Miles DC, van den Bergen JA, Burton M, Sinclair AH. Dynamic regulation of mitotic arrest in fetal male germ cells. Stem cells. 2008;26(2):339–47. doi: 10.1634/stemcells.2007-0622 [DOI] [PubMed] [Google Scholar]
  • 15.Shimada R, Koike H, Hirano T, Kato Y, Saga Y. NANOS2 suppresses the cell cycle by repressing mTORC1 activators in embryonic male germ cells. iScience. 2021;24(8):102890. doi: 10.1016/j.isci.2021.102890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zhao J, Lu P, Wan C, Huang Y, Cui M, Yang X, et al. Cell-fate transition and determination analysis of mouse male germ cells throughout development. Nature communications. 2021;12(1):6839. doi: 10.1038/s41467-021-27172-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nagano R, Tabata S, Nakanishi Y, Ohsako S, Kurohmaru M, Hayashi Y. Reproliferation and relocation of mouse male germ cells (gonocytes) during prespermatogenesis. The Anatomical record. 2000;258(2):210–20. doi: [DOI] [PubMed] [Google Scholar]
  • 18.Seisenberger S, Andrews S, Krueger F, Arand J, Walter J, Santos F, et al. The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells. Mol Cell. 2012;48(6):849–62. doi: 10.1016/j.molcel.2012.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kubo N, Toh H, Shirane K, Shirakawa T, Kobayashi H, Sato T, et al. DNA methylation and gene expression dynamics during spermatogonial stem cell differentiation in the early postnatal mouse testis. BMC Genomics. 2015;16:624. doi: 10.1186/s12864-015-1833-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pastor WA, Stroud H, Nee K, Liu W, Pezic D, Manakov S, et al. MORC1 represses transposable elements in the mouse male germline. Nat Commun. 2014;5:5795. doi: 10.1038/ncomms6795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kawabata Y, Kamio A, Jincho Y, Sakashita A, Takashima T, Kobayashi H, et al. Sex-specific histone modifications in mouse fetal and neonatal germ cells. Epigenomics. 2019;11(5):543–61. doi: 10.2217/epi-2018-0193 [DOI] [PubMed] [Google Scholar]
  • 22.Gu W, Mochizuki K, Otsuka K, Hamada R, Takehara A, Matsui Y. Dnd1-mediated epigenetic control of teratoma formation in mouse. Biol Open. 2018;7(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ruthig VA, Friedersdorf MB, Garness JA, Munger SC, Bunce C, Keene JD, et al. The RNA-binding protein DND1 acts sequentially as a negative regulator of pluripotency and a positive regulator of epigenetic modifiers required for germ cell reprogramming. Development. 2019;146(19). doi: 10.1242/dev.175950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kedde M, Strasser MJ, Boldajipour B, Oude Vrielink JA, Slanchev K, le Sage C, et al. RNA-binding protein Dnd1 inhibits microRNA access to target mRNA. Cell. 2007;131(7):1273–86. doi: 10.1016/j.cell.2007.11.034 [DOI] [PubMed] [Google Scholar]
  • 25.Hirano T, Wright D, Suzuki A, Saga Y. A cooperative mechanism of target RNA selection via germ-cell-specific RNA-binding proteins NANOS2 and DND1. Cell Rep. 2022;39(11):110894. doi: 10.1016/j.celrep.2022.110894 [DOI] [PubMed] [Google Scholar]
  • 26.Duszczyk MM, Wischnewski H, Kazeeva T, Arora R, Loughlin FE, von Schroetter C, et al. The solution structure of Dead End bound to AU-rich RNA reveals an unusual mode of tandem RRM-RNA recognition required for mRNA regulation. Nature communications. 2022;13(1):5892. doi: 10.1038/s41467-022-33552-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ruthig VA, Yokonishi T, Friedersdorf MB, Batchvarova S, Hardy J, Garness JA, et al. A transgenic DND1GFP fusion allele reports in vivo expression and RNA-binding targets in undifferentiated mouse germ cellsdagger. Biol Reprod. 2021;104(4):861–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hermann BP, Mutoji KN, Velte EK, Ko D, Oatley JM, Geyer CB, et al. Transcriptional and translational heterogeneity among neonatal mouse spermatogonia. Biol Reprod. 2015;92(2):54. doi: 10.1095/biolreprod.114.125757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sakashita A, Kawabata Y, Jincho Y, Tajima S, Kumamoto S, Kobayashi H, et al. Sex Specification and Heterogeneity of Primordial Germ Cells in Mice. PLoS One. 2015;10(12):e0144836. doi: 10.1371/journal.pone.0144836 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Nguyen DH, Soygur B, Peng SP, Malki S, Hu G, Laird DJ. Apoptosis in the fetal testis eliminates developmentally defective germ cell clones. Nat Cell Biol. 2020;22(12):1423–35. doi: 10.1038/s41556-020-00603-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Law NC, Oatley MJ, Oatley JM. Developmental kinetics and transcriptome dynamics of stem cell specification in the spermatogenic lineage. Nature communications. 2019;10(1):2787. doi: 10.1038/s41467-019-10596-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mayere C, Neirijnck Y, Sararols P, Rands CM, Stevant I, Kuhne F, et al. Single-cell transcriptomics reveal temporal dynamics of critical regulators of germ cell fate during mouse sex determination. FASEB J. 2021;35(4):e21452. doi: 10.1096/fj.202002420R [DOI] [PubMed] [Google Scholar]
  • 33.Maeda Y, Shiratsuchi A, Namiki M, Nakanishi Y. Inhibition of sperm production in mice by annexin V microinjected into seminiferous tubules: possible etiology of phagocytic clearance of apoptotic spermatogenic cells and male infertility. Cell Death Differ. 2002;9(7):742–9. doi: 10.1038/sj.cdd.4401046 [DOI] [PubMed] [Google Scholar]
  • 34.Shiratsuchi A, Umeda M, Ohba Y, Nakanishi Y. Recognition of phosphatidylserine on the surface of apoptotic spermatogenic cells and subsequent phagocytosis by Sertoli cells of the rat. The Journal of biological chemistry. 1997;272(4):2354–8. doi: 10.1074/jbc.272.4.2354 [DOI] [PubMed] [Google Scholar]
  • 35.Elliott MR, Zheng S, Park D, Woodson RI, Reardon MA, Juncadella IJ, et al. Unexpected requirement for ELMO1 in clearance of apoptotic germ cells in vivo. Nature. 2010;467(7313):333–7. doi: 10.1038/nature09356 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Thomas PD, Ebert D, Muruganujan A, Mushayahama T, Albou LP, Mi H. PANTHER: Making genome-scale phylogenetics accessible to all. Protein Sci. 2022;31(1):8–22. doi: 10.1002/pro.4218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mi H, Muruganujan A, Huang X, Ebert D, Mills C, Guo X, et al. Protocol Update for large-scale genome and gene function analysis with the PANTHER classification system (v.14.0). Nat Protoc. 2019;14(3):703–21. doi: 10.1038/s41596-019-0128-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Goudarzi M, Banisch TU, Mobin MB, Maghelli N, Tarbashevich K, Strate I, et al. Identification and regulation of a molecular module for bleb-based cell motility. Developmental cell. 2012;23(1):210–8. doi: 10.1016/j.devcel.2012.05.007 [DOI] [PubMed] [Google Scholar]
  • 39.Zhu R, Iacovino M, Mahen E, Kyba M, Matin A. Transcripts that associate with the RNA binding protein, DEAD-END (DND1), in embryonic stem (ES) cells. BMC Mol Biol. 2011;12:37. doi: 10.1186/1471-2199-12-37 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Nicholson CO, Friedersdorf M, Keene JD. Quantifying RNA binding sites transcriptome-wide using DO-RIP-seq. RNA. 2017;23(1):32–46. doi: 10.1261/rna.058115.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nicholson CO, Friedersdorf MB, Bisogno LS, Keene JD. DO-RIP-seq to quantify RNA binding sites transcriptome-wide. Methods. 2017;118–119:16–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Keene JD, Komisarow JM, Friedersdorf MB. RIP-Chip: the isolation and identification of mRNAs, microRNAs and protein components of ribonucleoprotein complexes from cell extracts. Nat Protoc. 2006;1(1):302–7. doi: 10.1038/nprot.2006.47 [DOI] [PubMed] [Google Scholar]
  • 43.Larsson J. eulerr: Area-Proportional Euler and Venn Diagrams with Ellipses. 2020. [Google Scholar]
  • 44.Raudvere U, Kolberg L, Kuzmin I, Arak T, Adler P, Peterson H, et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic acids research. 2019;47(W1):W191–W8. doi: 10.1093/nar/gkz369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zanchetta ME, Meroni G. Emerging Roles of the TRIM E3 Ubiquitin Ligases MID1 and MID2 in Cytokinesis. Front Physiol. 2019;10:274. doi: 10.3389/fphys.2019.00274 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.McMahon SB, Van Buskirk HA, Dugan KA, Copeland TD, Cole MD. The novel ATM-related protein TRRAP is an essential cofactor for the c-Myc and E2F oncoproteins. Cell. 1998;94(3):363–74. doi: 10.1016/s0092-8674(00)81479-8 [DOI] [PubMed] [Google Scholar]
  • 47.Frank SR, Schroeder M, Fernandez P, Taubert S, Amati B. Binding of c-Myc to chromatin mediates mitogen-induced acetylation of histone H4 and gene activation. Genes Dev. 2001;15(16):2069–82. doi: 10.1101/gad.906601 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bouchard C, Dittrich O, Kiermaier A, Dohmann K, Menkel A, Eilers M, et al. Regulation of cyclin D2 gene expression by the Myc/Max/Mad network: Myc-dependent TRRAP recruitment and histone acetylation at the cyclin D2 promoter. Genes Dev. 2001;15(16):2042–7. doi: 10.1101/gad.907901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.DeRan M, Pulvino M, Greene E, Su C, Zhao J. Transcriptional activation of histone genes requires NPAT-dependent recruitment of TRRAP-Tip60 complex to histone promoters during the G1/S phase transition. Mol Cell Biol. 2008;28(1):435–47. doi: 10.1128/MCB.00607-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Liu X, Tesfai J, Evrard YA, Dent SY, Martinez E. c-Myc transformation domain recruits the human STAGA complex and requires TRRAP and GCN5 acetylase activity for transcription activation. The Journal of biological chemistry. 2003;278(22):20405–12. doi: 10.1074/jbc.M211795200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.McMahon SB, Wood MA, Cole MD. The essential cofactor TRRAP recruits the histone acetyltransferase hGCN5 to c-Myc. Mol Cell Biol. 2000;20(2):556–62. doi: 10.1128/MCB.20.2.556-562.2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sawan C, Hernandez-Vargas H, Murr R, Lopez F, Vaissiere T, Ghantous AY, et al. Histone acetyltransferase cofactor Trrap maintains self-renewal and restricts differentiation of embryonic stem cells. Stem cells. 2013;31(5):979–91. doi: 10.1002/stem.1341 [DOI] [PubMed] [Google Scholar]
  • 53.Tapias A, Zhou ZW, Shi Y, Chong Z, Wang P, Groth M, et al. Trrap-dependent histone acetylation specifically regulates cell-cycle gene transcription to control neural progenitor fate decisions. Cell Stem Cell. 2014;14(5):632–43. doi: 10.1016/j.stem.2014.04.001 [DOI] [PubMed] [Google Scholar]
  • 54.Pao KC, Wood NT, Knebel A, Rafie K, Stanley M, Mabbitt PD, et al. Activity-based E3 ligase profiling uncovers an E3 ligase with esterification activity. Nature. 2018;556(7701):381–5. doi: 10.1038/s41586-018-0026-1 [DOI] [PubMed] [Google Scholar]
  • 55.Tan K, Song HW, Wilkinson MF. Single-cell RNAseq analysis of testicular germ and somatic cell development during the perinatal period. Development. 2020;147(3). doi: 10.1242/dev.183251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Yamauchi Y, Riel JM, Ruthig VA, Ortega EA, Mitchell MJ, Ward MA. Two genes substitute for the mouse Y chromosome for spermatogenesis and reproduction. Science. 2016;351(6272):514–6. doi: 10.1126/science.aad1795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Yamauchi Y, Riel JM, Stoytcheva Z, Ward MA. Two Y genes can replace the entire Y chromosome for assisted reproduction in the mouse. Science. 2014;343(6166):69–72. doi: 10.1126/science.1242544 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Heaney JD, Michelson MV, Youngren KK, Lam MY, Nadeau JH. Deletion of eIF2beta suppresses testicular cancer incidence and causes recessive lethality in agouti-yellow mice. Hum Mol Genet. 2009;18(8):1395–404. doi: 10.1093/hmg/ddp045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Lam MY, Heaney JD, Youngren KK, Kawasoe JH, Nadeau JH. Trans-generational epistasis between Dnd1Ter and other modifier genes controls susceptibility to testicular germ cell tumors. Hum Mol Genet. 2007;16(18):2233–40. doi: 10.1093/hmg/ddm175 [DOI] [PubMed] [Google Scholar]
  • 60.Aguero T, Jin Z, Chorghade S, Kalsotra A, King ML, Yang J. Maternal Dead-end 1 promotes translation of nanos1 by binding the eIF3 complex. Development. 2017;144(20):3755–65. doi: 10.1242/dev.152611 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Shi Z, Fujii K, Kovary KM, Genuth NR, Rost HL, Teruel MN, et al. Heterogeneous Ribosomes Preferentially Translate Distinct Subpools of mRNAs Genome-wide. Mol Cell. 2017;67(1):71–83 e7. doi: 10.1016/j.molcel.2017.05.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Nicholls PK, Schorle H, Naqvi S, Hu YC, Fan Y, Carmell MA, et al. Mammalian germ cells are determined after PGC colonization of the nascent gonad. Proc Natl Acad Sci U S A. 2019;116(51):25677–87. doi: 10.1073/pnas.1910733116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Niimi Y, Imai A, Nishimura H, Yui K, Kikuchi A, Koike H, et al. Essential role of mouse Dead end1 in the maintenance of spermatogonia. Developmental biology. 2019;445(1):103–12. doi: 10.1016/j.ydbio.2018.11.003 [DOI] [PubMed] [Google Scholar]
  • 64.Kato Y, Katsuki T, Kokubo H, Masuda A, Saga Y. Dazl is a target RNA suppressed by mammalian NANOS2 in sexually differentiating male germ cells. Nature communications. 2016;7:11272. doi: 10.1038/ncomms11272 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Percharde M, Wong P, Ramalho-Santos M. Global Hypertranscription in the Mouse Embryonic Germline. Cell Rep. 2017;19(10):1987–96. doi: 10.1016/j.celrep.2017.05.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Percharde M, Bulut-Karslioglu A, Ramalho-Santos M. Hypertranscription in Development, Stem Cells, and Regeneration. Developmental cell. 2017;40(1):9–21. doi: 10.1016/j.devcel.2016.11.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Liu W, Ma Q, Wong K, Li W, Ohgi K, Zhang J, et al. Brd4 and JMJD6-associated anti-pause enhancers in regulation of transcriptional pause release. Cell. 2013;155(7):1581–95. doi: 10.1016/j.cell.2013.10.056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Gross-Thebing T, Raz E. Dead end and Detour: The function of the RNA-binding protein Dnd in posttranscriptional regulation in the germline. Curr Top Dev Biol. 2020;140:181–208. doi: 10.1016/bs.ctdb.2019.12.003 [DOI] [PubMed] [Google Scholar]
  • 69.Murakami K, Gunesdogan U, Zylicz JJ, Tang WWC, Sengupta R, Kobayashi T, et al. NANOG alone induces germ cells in primed epiblast in vitro by activation of enhancers. Nature. 2016;529(7586):403–7. doi: 10.1038/nature16480 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Wright D, Kiso M, Saga Y. Genetic and structural analysis of the in vivo functional redundancy between murine NANOS2 and NANOS3. Development. 2021;148(1). doi: 10.1242/dev.191916 [DOI] [PubMed] [Google Scholar]
  • 71.Coucouvanis EC, Sherwood SW, Carswell-Crumpton C, Spack EG, Jones PP. Evidence that the mechanism of prenatal germ cell death in the mouse is apoptosis. Exp Cell Res. 1993;209(2):238–47. doi: 10.1006/excr.1993.1307 [DOI] [PubMed] [Google Scholar]
  • 72.Wang RA, Nakane PK, Koji T. Autonomous cell death of mouse male germ cells during fetal and postnatal period. Biol Reprod. 1998;58(5):1250–6. doi: 10.1095/biolreprod58.5.1250 [DOI] [PubMed] [Google Scholar]
  • 73.Yacobi-Sharon K, Namdar Y, Arama E. Alternative germ cell death pathway in Drosophila involves HtrA2/Omi, lysosomes, and a caspase-9 counterpart. Developmental cell. 2013;25(1):29–42. doi: 10.1016/j.devcel.2013.02.002 [DOI] [PubMed] [Google Scholar]
  • 74.Zohar-Fux M, Ben-Hamo-Arad A, Arad T, Volin M, Shklyar B, Hakim-Mishnaevski K, et al. The phagocytic cyst cells in Drosophila testis eliminate germ cell progenitors via phagoptosis. Sci Adv. 2022;8(24):eabm4937. doi: 10.1126/sciadv.abm4937 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, et al. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9(7):676–82. doi: 10.1038/nmeth.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Lin YT, Barske L, DeFalco T, Capel B. Numb regulates somatic cell lineage commitment during early gonadogenesis in mice. Development. 2017;144(9):1607–18. doi: 10.1242/dev.149203 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Defalco T, Saraswathula A, Briot A, Iruela-Arispe ML, Capel B. Testosterone levels influence mouse fetal Leydig cell progenitors through notch signaling. Biol Reprod. 2013;88(4):91. doi: 10.1095/biolreprod.112.106138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Serra ND, Velte EK, Niedenberger BA, Kirsanov O, Geyer CB. Cell-autonomous requirement for mammalian target of rapamycin (Mtor) in spermatogonial proliferation and differentiation in the mousedagger. Biol Reprod. 2017;96(4):816–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Endow SA, Miller SE, Ly PT. Mitochondria-enriched protrusions are associated with brain and intestinal stem cells in Drosophila. Commun Biol. 2019;2:427. doi: 10.1038/s42003-019-0671-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Hatkevich T, Boudreau V, Rubin T, Maddox PS, Huynh JR, Sekelsky J. Centromeric SMC1 promotes centromere clustering and stabilizes meiotic homolog pairing. PLoS genetics. 2019;15(10):e1008412. doi: 10.1371/journal.pgen.1008412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Leinonen R, Sugawara H, Shumway M, International Nucleotide Sequence Database C. The sequence read archive. Nucleic acids research. 2011;39(Database issue):D19–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Zheng GX, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, et al. Massively parallel digital transcriptional profiling of single cells. Nature communications. 2017;8:14049. doi: 10.1038/ncomms14049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Hao Y, Hao S, Andersen-Nissen E, Iii WMM, Zheng S, Butler A, et al. Integrated analysis of multimodal single-cell data. Cell. 2021. doi: 10.1016/j.cell.2021.04.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Zappia L, Oshlack A. Clustering trees: a visualization for evaluating clusterings at multiple resolutions. GigaScience. 2018;7(7). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14(4):417–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Müller K, Wickham H, James DA, Falcon S. RSQLite: ’SQLite’ Interface for R. 2021. [Google Scholar]
  • 88.Baxendale P, Codd EF. A Relational Model of Data for Large Shared Data Banks. Communications of the ACM. 1970;13(6):377. [Google Scholar]
  • 89.Thomas PD, Kejariwal A, Guo N, Mi H, Campbell MJ, Muruganujan A, et al. Applications for protein sequence-function evolution data: mRNA/protein expression analysis and coding SNP scoring tools. Nucleic acids research. 2006;34(Web Server issue):W645–50. doi: 10.1093/nar/gkl229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Mi H, Ebert D, Muruganujan A, Mills C, Albou LP, Mushayamaha T, et al. PANTHER version 16: a revised family classification, tree-based classification tool, enhancer regions and extensive API. Nucleic acids research. 2021;49(D1):D394–D403. doi: 10.1093/nar/gkaa1106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature genetics. 2000;25(1):25–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic acids research. 2021;49(D1):D325–D34. doi: 10.1093/nar/gkaa1113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.RStudio Team. RStudio: Intregrated Development for R. Boston, MA: RStudio, PBC; 2020. [Google Scholar]
  • 94.Larsson J. eulerr: Area-Proportional Euler and Venn Diagrams with Ellipses. 2021. [Google Scholar]
  • 95.Larsson J, Gustafsson P, editors. A Case Study in Fitting Area-Proportional Euler Diagrams with Ellipses Using eulerr. Proceedings of International Workshop on Set Visualization and Reasoning; 2018. 2018/4//; Edinburgh, United Kingdom: CEUR Workshop Proceedings. [Google Scholar]
  • 96.Ross JA. Colour-blindness: how to alienate a grant reviewer. Nature. 2007;445(7128):593–. doi: 10.1038/445593c [DOI] [PubMed] [Google Scholar]
  • 97.Ichihara YG, Okabe M, Iga K, Tanaka Y, Musha K, Ito K. Color Universal Design—The selection of four easily distinguishahle colors for all color vision types. Proc Spie. 2008;6807. [Google Scholar]
  • 98.Huber W, Carey J V, Gentleman R, et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nature Methods. 2015;12(2):115–21. doi: 10.1038/nmeth.3252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Wei T, Simko V. R package ’corrplot’: Visualization of a Correlation Matrix. 2021. [Google Scholar]
  • 100.Databases RSIGo, Wickham H, Müller K. DBI: R Database Interface. 2021. [Google Scholar]
  • 101.Wickham H, Girlich M, Ruiz E. dbplyr: A ’dplyr’ Back End for Databases. 2021. [Google Scholar]
  • 102.Pagès H, with contributions from Peter H, Lun A. DelayedArray: A unified framework for working transparently with on-disk and in-memory array-like datasets. 2021. [Google Scholar]
  • 103.Wickham H, François R, Henry L, Müller K. dplyr: A Grammar of Data Manipulation. 2021. [Google Scholar]
  • 104.Wickham H. ggplot2: Elegant Graphics for Data Analysis: Springer-Verlag New York; 2016. [Google Scholar]
  • 105.Firke S. janitor: Simple Tools for Examining and Cleaning Dirty Data. 2021. [Google Scholar]
  • 106.Xie Y. Dynamic Documents with R and knitr. 2nd ed. Raton Boca, Florida: Chapman and Hall/CRC; 2015. [Google Scholar]
  • 107.Benaglia T, Chauveau D, Hunter DR, Young D. mixtools: An R Package for Analyzing Finite Mixture Models. Journal of Statistical Software. 2009;32(6):1–29. [Google Scholar]
  • 108.Schauberger P, Walker A. openxlsx: Read, Write and Edit xlsx Files. 2021. [Google Scholar]
  • 109.Pedersen TL. patchwork: The Composer of Plots. 2020. [Google Scholar]
  • 110.Wickham H. The Split-Apply-Combine Strategy for Data Analysis. Journal of Statistical Software. 2011;40(1):1–29. [Google Scholar]
  • 111.R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2021. [Google Scholar]
  • 112.Neuwirth E. RColorBrewer: ColorBrewer Palettes. 2014. [Google Scholar]
  • 113.Wickham H, Hester J. readr: Read Rectangular Text Data. 2021. [Google Scholar]
  • 114.Wickham H. stringr: Simple, Consistent Wrappers for Common String Operations. 2019. [Google Scholar]
  • 115.Wickham H, Averick M, Bryan J, Chang W, McGowan LDA, François R, et al. Welcome to the tidyverse. Journal of Open Source Software. 2019;4(43):1686–. [Google Scholar]
  • 116.Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Research. 2015;4(1521). doi: 10.12688/f1000research.7563.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Konopka T. umap: Uniform Manifold Approximation and Projection. 2020. [Google Scholar]
  • 118.Garnier Simon, Ross Noam, Rudis Robert, et al. viridis—Colorblind-Friendly Color Maps for R. 2021. [Google Scholar]

Decision Letter 0

John M Greally, Erez Raz

19 Sep 2022

Dear Dr Ruthig,

Thank you very much for submitting your Research Article entitled 'The RNA Binding Protein DND1 Is Elevated in a Subpopulation of Pro-Spermatogonia and Targets Chromatin Modifiers and Translational Machinery during Late Gestation' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Erez Raz

Guest Editor

PLOS Genetics

John Greally

Section Editor

PLOS Genetics

Dear Victor,

I am sending the referees comments on your paper.

Everyone agreed the work is of interest for readers of the Journal and that the data will serve as an important resource.

Two points were considered more important -

1. Provide clear criteria for defining cells as Hi or Lo when microscopy data is concerned (when co-staining with other markers). Was there a threshold above which cells were defined as "Hi"? Was there a control for the signal level at the same imaging plane?

2. A clear suggestion for a "take-home message" should be included. i.e., what is the functional significance of the Hi and Lo Dnd1-expressing populations for gamete formation?

We hope you find the comments of the referees useful !

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Overall

In this very well written manuscript, Ruthig et al identify populations of embryonic germ cells expressing either high or low levels of a Dnd1-GFP reporter knock-in allele. Using this allele, the authors identify fundamental transcriptional differences between these populations and direct RNA targets of the DND1 RNA binding protein. Several regulators of the epigenome and translational machinery are expressed at higher levels in Dnd1-hi cells and are shown to be direct targets of Dnd1. Overall, the experiments shown are very well thought out, performed appropriately, and data correctly interpreted. There are a couple of critical issues to address. First, there is an early emphasis in the abstract and introduction on the role of Dnd1 in teratoma formation in mice based on experiments using the Dnd1-ter allele (with little mention of germ cell deficiency in all background or that teratomas are specific to one background). Although these hi and low cells might tell us something about teratoma formation experiments testing a functional connection are not performed and a teratoma susceptible background is not used. Teratomas are barely mentioned in the discussion. It is suggested that this early emphasis on teratomas be reduced or additional experiments be performed (likely outside the scope of the paper and not being requested). Second, and this addresses overall impact, it is still a bit unclear to me what the role of high Dnd1 during the time course of germ cell development is and where the Dnd1-hi population of cells sits in this process (from PGC to committed pro-spermatogonia). Is this a dynamic population, and representing the nexus at which differentiation is starting/completing? What is the critical nature of this event/stage of a germ cell? Why are pluripotency genes (e.g. Sox2) and critical male germ cell differentiation genes (e.g. Nanos2) expressed higher in these cells compared to Dnd1-low? I think the authors attempt to address this in the discussion but it does need some clarification as a hypothesis. Showing functional impact of high DND1 on germ cells (e.g. DNMT3L or SOX2 protein is detected more frequently or is higher in this population; cells are proliferating/not proliferating) would add significance if achievable.

Major comments

Abstract, introduction, and discussion: There is a lot of early emphasis on loss of Dnd1 and teratomas in mice. Couple of issues with this. First, the experiments conducted are not on a teratoma susceptible background and it isnt mentioned anywhere that teratomas happen on a specific inbred background. Second, there is little said in the discussion tying the data to teratoma development. Third, the one mention of teratomas in the discussion is that de-regulation of epigenetic regulators in the absence of Dnd1 could lead to acquisition of somatic fate and teratomas. How does this work given that the first step in teratoma formation in mice is establishment of pluripotent embryonal carcinoma cells, in which somatic fate has yet to be established? I’d suggest de-emphasizing teratomas in the introduction and focusing on germ cell development/differentiation. Otherwise, data from a teratoma susceptible background – which is outside the scope of the paper (and data I am not asking for) – should be included.

Introduction, line 62-64 – again, teratoma forming capacity of E12.5 germ cells is limited to one strain. So differentiation pathways are “open” in this strain (using this data). Better data to show the general “openness” of these pathways in germ cells in general is the capacity to generate embryonic germ cell lines from early germ cells and that capacity being lost as sex-specific differentiation initiates.

Introduction - Maybe discuss the (putative) molecular functions of Dnd1 up front. Doesn’t have to be a lot of detail (can leave for the discussion to integrate the findings of the paper into its known role), but something should be mentioned in the intro.

Introduction – what experimental evidence is there that mouse germ cells “trans-differentiate” to form teratomas? I think that this is a point of contention in the field – are germ cells lineage specified prior to the time tumors form. Do they “escape” germ cell identity or fail to acquire/lock in identity. I’d suggest not using this term, provide specific evidence that they “trans-differentiate”, or indicate that the precise mechanism of germ cell transformation/trans-differentiation/de-dedifferentiation is not known.

Abstract, introduction, and discussion: The potential critical role of translational control/rates in teratoma formation from germ cells has been previously described (Heaney et al, Hum Mol Genetics, 2009). This work describes deficiency for Eif2s2 as reducing teratoma incidence in mice. Important considering that the authors address DND1 regulation of translation regulators/factors and specifically mention Eif2s3x and y). So, if teratomas are going to be part of the theme of the manuscript, it would be a good idea to reference this paper. These data are not in agreement with the concept of the role of Dnd1 in regulation of translation factors and teratomas. Dnd1-hi cells have higher expression of translation regulators (assuming higher rates of translation?) but should also maybe be more protected from forming teratomas? Dnd1 deficient cells should then maybe have less expression of these translation regulators but Dnd1-ter germ cells form embryonal carcinoma cells at a high rate in 129 mice.

Results: Data in Fig S1 is very interesting. Is the GFP weaker in areas closest to the rete testis? Is there some biological significance to this? Or is this an imaging artifact?

Results: Fig 1 - Does expression from the endogenous (unmodified) Dnd1 allele match the reporter? I’d assume that one could collect the hi and low cells and look at RNA expression in hets? Probably a better control to assess “transgene” artifact than going to independent RNA-seq studies. You have the hi and low populations separated here. Should be able to confirm native allele RNA expression. But, looks like dnd1 expression in the sorted populations might be embedded in the RNA-seq data (Fig 3C)? Might want to find a way of connecting these.

Results: Fig 2 – Using qualitative fluorescent intensity of the GFP reporter to look at whether hi and low GFP cells associate with these markers or not is probably not a great approach. I see the value of these data, but the RNa-seq data are much more convincing. I suggest moving to supporting info. There isnt much gained from these data.

Results: Fig 3D – not too sure what is being shown here. Is this a combination of different genes? If yes, this needs to be explained a bit better in the results (and which genes are involved).

Results: Line 200 – 202 “Despite the convergence of some transcripts, many epigenetic regulators maintain higher expression levels (TPM≥5.0, log2-fold change (L2FC)≥1.0, p-val<0.05) in DND1-GFP-hi cells during G0.” Can it really be said that these cells are in G0. I don’t think the proliferative status (or stage of cell cycle arrest – pH3 vs Ki67 staining) of the hi and low cells were assessed here. Is this known?

Results and discussion: After reading the discussion, I was still left with the questions – what is the fundamental difference between Ddn1-hi and low? Other than differences in gene expression – how are they functionally different. Proliferating/not proliferating? Still expressing pluripotency factors (protein)/not retain pluripotent characteristics? Initiated NANOS2 or DNMT3 (a or l) protein expression or not? For example, Dnmt3a and Dnmt3l RNA are expressed higher in DND1-hi cells. How does this translate to protein? Are the proteins differentially present/absent in the DND1-low and hi populations? Maybe can do this by imaging? Are the proteins expressed differently? Understand that this is much harder as westerns would be terribly difficult with cell numbers available.

Results: Most of the direct targets of DND1 appear to be expressed higher in DND1-hi cells. Are there examples of transcripts that are direct targets of DND1 and are lower in DND1-hi cells? Gets to an overall.

Discussion: Line 341 – 344. Nanos2 is not a marker of pluripotency. In fact, having some genes such as Sox2 (a marker of pluripotency) and a male germ regulator of sex-specific differentiation (Nanos2) both increased in expression in Dnd1-hi cells is counterintuitive. This should be addressed in the discussion. Other genes that are indicative of male germ cell differentiation (Tdrd’s, methyltransferases) are also higher. So why would Sox2, Tfap2c be higher? Are these cells that are being actively transitioned from a more “PGC-like” state to a more “differentiated” state so we are seeing this in the transcriptional profiles? Maybe I missed it, but is Sox2 a target of Dnd1? If not, would support that these Dnd1-hi cells are being actively pushed to differentiate by high Dnd1?

Minor comments

Overall, the figures are put together very well. Recommend that the text sizing be adjusted for some panels (e.g. 3C, 7B) as they are very hard to read without zooming in A LOT.

Results: Probably best to refer to the allele as a knock-in, not a transgene which usually is used when describing a randomly integrated DNA construct.

Reviewer #2: The manuscript by Ruthig et al., describes molecular phenotyping analysis of a subpopulation of male germ cells in mouse, which has been identified by transgenic insertion of GFP into DND1 gene and detection of a selectively high DND1 expressing cell population. The discovery of this cell population by the transgene is validated by endogenous DND1 expression analysis in single cell transcriptome and further analysed by differential gene expression analysis. This analysis suggest striking heterogeneity and associated differences in gene expression profiles between the two cell populations. Differentially expressed genes suggest DND1-associated regulation of cell cycle, cellular metabolism, protein translation and chromatin state regulator genes. Of note, several of these functions were already suggested before in the literature. Subsequently, the authors identify RNAs bound by DND1 using RIP and suggest that the two datasets indicate roles for DND1 in regulating subsets of RNAs and affect translation of target genes. The authors main claim is that the different DND1 levels indicate male germ cell heterogeneity, and their results pinpoint features of the heterogenous cell populatjons using their transcriptomic and RIP analyses. The manuscript is primarily descriptive but may represent potentially important, biologically relevant description of germ cell heterogeneity and may contribute to better understanding of the role of DND1 in germ cell fate if the heterogeneity detected is better linked to more comprehensive cell biological/histological characterisation of the novel cell population and if the transcriptome and RIP results are more robustly correlated.

There are several questions arising from the current version of the ms, which need addressing (particularly Q1-3).

1. Does the heterogeneity detected by the transgenic insertion of GFP marker faithfully represent true endogenous (non-transgenic) heterogeneity? While single cell data suggest that high expressing DND1 cells indeed exist, are those cells in the single cell data showing similar transcript dynamic of candidate targets and DEGs seen in the FACS isolated cells?

2. What is the cellular and functional relevance of the molecular heterogeneity detected in this ms? This question is important for assessing the meaning of the detected transcriptomic differences, given that the authors do not see correlation between the high DND1 expression and known variable features of germ cells such as differentiating or apoptotic germ cells. Without providing some validation to the observed DEG and RNA targeting phenomena the study will not represent sufficient advance. A complementing histological characterisation of the high DND1 cells will also offer the benefits of validation of the predicted molecular (e.g. chromatin remodelling) and cellular (e.g. cell cycle) functions highlighted by the authors. Such characterisation/validation may include high DND1 cell group-specific detection of changes in cell cycle markers, detection of expected global changes in repressive chromatin states (e.g. comparative immunohistochemistry of DNA and histone methylation markers in high and low DND1 cells), analysis of global translation regulation and oxidative phosphorylation.

3. A related question to that above, is whether the authors see differential subcellular localisation patterns of DND1 and or cell distributions at different stages of germ cell development between DND1 high and low cells? Is there any change in cellular behaviour that may indicate the distinct fates between these cell groups? Figure 3 indicates some clustering of DND1 high cells. Is this stereotypical for tissue position or frequency of clustering?

4. It is difficult to reconcile or appreciate the partial overlap between DEG and RIP results. The composition of the subsets of genes that are differentially expressed in high DND1 cells that appear to be direct targets could be explained by the composite effect of DND1 dependent stabilisation/degradation of RNAs and the indirect effect of DND1-dependent translation regulation associated RNA degradation. One would expect tighter correlation between temporal expression dynamic (DEG) and temporal degree of DND1 targeting (RIP) at the different stages of germ cell development tested upon direct effect. Can such temporal analysis of correlation between the expression changes and RIP results in E16.5-E18.5 be carried out?

5. Not essential but certainly relevant question and relatively easy to check whether the DND1 targets match those which are dependent on DND1 homologs in other organisms (e.g. zebrafish).

Minor points:

1. The data in Figure 4 is hard to follow due to the multiple colours which are difficult to assign using the colour code in the legends. Can the authors revise this figure to link the DEG categories shown in stack bars across stages and show the GO terms next to the stacks?

2. Font size in Fig. 7B should be increased.

3. Protein is misspelt in Figure 4 legend.

4. There is no need to abbreviate the word Figure in Figure titles (or indeed throughout the ms).

Reviewer #3: Ruthig et al report on two populations of male germ cells characterized by different levels of Dead end expression. The difference between the two population is detected by the expression level of a fusion between DND1 and GFP that is inserted within the endogenous locus. The cell population expressing higher level of DND1-GFP show higher level of epigenetic regulators RNAs, and the protein binds RNAs of such regulators.

As DND plays central roles in the development of germ cells, the lists of possible DND1 in vivo targets are important resource for future functional studies. The results are thus clearly of interest for the readership of the Journal.

Addressing/explaining better some of the issues listed below would improve the manuscript.

Introduction

1. “During the last third of fetal life until a few days after birth (E14.5 to about P2) (7-9), male germ cells (MGCs) enter a long period of cell cycle arrest, during which they undergo extensive reprogramming that reduces their potential to give rise to teratomas and drives cells towards the pro-spermatogonial (PSG) fate.” – has it been critically shown that “extensive reprogramming” at these stages indeed reduces teratoma formation? If so, the source should be cited. In certain contexts, reprogramming is thought to promote teratoma formation, so would be good if the authors define “reprogramming” in their case and align them with previous literature (e.g. https://www.nature.com/articles/nature12586 ).

2. “Dead end 1 (Dnd1), an RNA binding protein (RBP), underlies the classic mouse mutation “Ter”, named for the high incidence of teratomas in male Dnd1Ter/Ter mutants” – mention that this holds true for a specific genetic background only.

3. “loss or mutation of Dnd1 leads to a severe loss of embryonic germ cells soon after their specification (17, 18), and in zebrafish, transformation of germ cells to somatic lineages” – not clear what “severe loss” means. Would “loss“ be sufficient. In addition, the way it is phrased, it sounds as if the apparent loss of cells in the mouse is not (also) a result of cell transformation. Is cell death the only way of losing the germ cells in Ter mice?

4. “Although some of these chromatin modifiers mapped as mRNA targets of DND1 in HEK293 cells, many were not expressed in this cell line” – This sentence is clear for those who read reference 21. I would add 2 sentences, explaining the results presented in references 20 and 21 such that this part of the introduction is a “stand alone” paragraph.

Results

5. “At E14.5, the DND1-GFP-hi population accounts for <1% of MGCs. Their numbers peak at E16.5, when DND1-GFP-hi cells represent 12% of total MGCs, and decline again at E18.5, when the DND1-GFP-hi population accounts for only 5% of MGCs. “. In table S2, the numbers and percentages do not add up properly (are not always 100%, the total MGCs is less/more than hi + lo). Is there an explanation for that?

6. What stage is presented in Fig 1A?

7. In S1 E16, what should one consider as hi and lo ? there are regions that appear to be devoid of signal, some lo and some hi. The definition, of hi and lo should be made clear here. As presented, it seems like there are domains of hi on either side of the cords and that they are larger than what one would have expect from a 12% population size.

8. Define GCNA in Fig 2. What does the White “DNA” in 2A-C stand for? Why not show the level of these specific markers derived from the RNA seq done at E16?

9. The authors should try to exclude the option that the hi and lo are not a result of small differences in the rate of development among cells at E16.5-E18.5, with all the cells becoming similar to each other at P3 (perhaps hi). It is not obvious how can one compare the lo in P3 with that in E18.5. If it is based on absolute GFP level it could signify more uniform distribution of cell states at the E14.5 and P3 -> It would help if the authors present arguments that make this option (that the small hi population is not simply composed of “faster”, or “slower” cells) less likely.

10. Conducting analysis similar to that presented in Fig 2 for 1-2 differentially-expressed markers derived from Fig 3 (antibody as in 2, or employing a quantitative in situ hybridization method) would increase the confidence in the results presented in both figures – i.e., show the correlations shown in 3 following the experimental scheme of Figure 2.

11. What justifies presenting the hi and lo of different stages within the same graphs in Figure 6? Were the counts normalized to the expression level of a house-keeping gene for example?

12. The ideal control for the RIP would have been a GFP-tagged mutated Dnd1, but such a knockin is obviously not available. A control of RIP using another GFP fusion could have helped here. Perhaps I missed it, but what would increase the confidence in the RIP results and show specificity would be a demonstration of lack of very good (not just a few examples provided) correlation between the normalized expression level of transcripts (not rRNAs which are not polyadenylated) and the RIP results. The authors should just present the results more clearly to exclude this option.

13.

“ Nearly all of these chromatin regulators are genes that fail to be up-regulated in DND1Ter/Ter mutant germ cells by E14.5 “. Here the authors focus on RNAs encoding for proteins that are more “interesting” in the context of this work. How do other genes behave in this comparison?

Discussion

14. Is there anything special in germ cells at the stages studied that could account for the high representation of vesicle transport encoding RNAs?

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Decision Letter 1

John M Greally, Erez Raz

6 Feb 2023

Dear Dr Ruthig,

We are pleased to inform you that your manuscript entitled "The RNA Binding Protein DND1 Is Elevated in a Subpopulation of Pro-Spermatogonia and Targets Chromatin Modifiers and Translational Machinery During Late Gestation" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Erez Raz

Guest Editor

PLOS Genetics

John Greally

Section Editor

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

Dear Victor, Dear Blanche,

As you can see in the comments, everyone agreed that this version of the manuscript should be published in the PLOS Genetics.

One of the referees had a few very minor additional points, which I am sure should not be a problem to address.

Congratulations and apologies for the long time it took.

Erez

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this revised manuscript, Ruthig et al identify populations of embryonic germ cells expressing either high or low levels of a Dnd1-GFP reporter knock-in allele. Using this allele, the authors identify fundamental transcriptional differences between these populations and direct RNA targets of the DND1 RNA binding protein. Several regulators of the epigenome and translational machinery are expressed at higher levels in Dnd1-hi cells and are shown to be direct targets of Dnd1.

I commend the authors for their very careful consideration of reviewer comments and the nice revisions they have made to this manuscript. In particular the authors have addressed 2 critical concerns raised out of the reviews. First, the authors used DAPI staining to "normalize" the GFP expression levels in their microscopy data, which has addressed concerns about identifying GFP-hi and GFP-low cells in their images. Second, the authors have added an essential experiment that shows Annexin V staining, a marker of apoptosis, is convincingly more associated with GFL-low cells. This along with the established literature that there is loss of a large proportion of male germ cells around birth, provides interesting biological significance to the 2 populations and the genes that they express. I was also pleased to see additional work performed to verify that endogenous Dnd1 expression correlates with GFP expression levels.

I do not have any major concerns left to be addressed.

A few minor comments:

1. I would add a reference to this statement on line 504 "A large proportion of MGCs are lost by the time of birth." Probably well accepted within the mammalian germ cell community, but outside of that, it is not necessarily common knowledge.

2. Line 103 – still a use of "transgenic" to describe the allele. Knock-in would be better.

3. Typo on Line 248 - "express much higher levels of genes associated with associated with the germline...."

Reviewer #2: This substantially revised ms has addressed all of my queries raised. The replies are satisfactory and the new results, particularly Figure 3 demonstrating striking differences in Annexin V expression suggest phenotypic consequences to distinct DND1 levels in the identified germ cell populations and improved the ms. The new data in Suppl. Fig. 4 indicates correlation between the single cell and bulk transcriptomes making the observations more robust.

Reviewer #3: The authors addressed all the issues I raised, and provided good reasons for not doing so in some cases.

I support publishing this paper without any additional changes.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-22-00785R1

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

John M Greally, Erez Raz

23 Feb 2023

PGENETICS-D-22-00785R1

The RNA Binding Protein DND1 is Elevated in a Subpopulation of Pro-Spermatogonia and Targets Chromatin Modifiers and Translational Machinery During Late Gestation

Dear Dr Ruthig,

We are pleased to inform you that your manuscript entitled "The RNA Binding Protein DND1 is Elevated in a Subpopulation of Pro-Spermatogonia and Targets Chromatin Modifiers and Translational Machinery During Late Gestation" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Anita Estes

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Quantitative data from DND1-GFP Fluorescence Associated Cell Sorting (FACS).

    (XLSX)

    S2 Table. Complete lists for all significantly differentially expressed genes, and GO biological process gene list filtering for hiloRNA-seq and RIP-seq.

    (XLSX)

    S3 Table. Complete list of apoptosis and cell death terms, with gene Ensembl IDs from Biological Process Gene Ontology analysis in g:Profiler of E16.5 DND1-GFP-hi and DND1-GFP-lo cells.

    (XLSX)

    S4 Table. Combined DND1-seq (hiloRNA-seq, RIP-seq) dataset with graphic output.

    (XLSX)

    S5 Table. Complete RIP-seq g:Profiler results.

    (XLSX)

    S1 Fig. Fetal male germ cells visibly express low and high levels of DND1-GFP.

    Whole mount confocal imaging of unstained freshly dissected A: E16.5 and B: E18.5 DND1-GFP testes showing heterogenous endogenous DND1-GFP fluorescence. Insets show cord region (white dotted box, a’ and b’) and rete testis region (yellow dotted box, a” and b”). All scale bars are 200μm. Scale for A and B in B, scale for a’, a”, b’, and b” in b”.

    (TIF)

    S2 Fig. Quantification of DND1-GFP in E16.5 MGCs.

    A: View of fixed MGCs in a testis cord with DND1-GFP-hi cell marked by an arrow and a DND1-GFP-lo cell marked by a wedge. Scale bar = 50μm. B, C: Representative images showing a quantified DND1-GFP-hi cell (B) and a quantified DND1-GFP-lo cell (C) from an E16.5 testis. b’, c’: DND1-GFP (endogenous signal). b”, c”: DNA (Hoechst stain). Scale bar = 10μm. D: Quantification of DND1-GFP fluorescence in 50 randomly selected DND1-GFP MGCs from multiple testes cords in 2 separate mice. Dashed line demarcates the relative fluorescence boundary between DND1-GFP-lo cells and DND1-GFP-hi cells (below and above the line, respectively). E: Mean intensity of Dnd1 expression relative to Gapdh expression in DND1-GFP-lo (blue) and DND1-GFP-hi (yellow) cells from Dnd1GFP/+ heterozygotes. F: Mean intensity of eGfp expression relative to Dnd1 expression in DND1-GFP-lo and DND1-GFP-hi cells from Dnd1GFP/+ heterozygotes. E-F: Technical replicates = 4, biological replicates ≥ 4 fetal testes. P value: ** < 0.01, n.s. = not significant.

    (TIF)

    S3 Fig. Pearson correlation matrix of transcriptomes from DND1-GFP-lo and DND1-GFP-hi samples at E16.5 and E18.5.

    The matrix shows that DND1-GFP-lo and DND1-GFP-hi cells are most similar to biological replicate samples, but all samples of male germ cells share high similarity.

    (TIF)

    S4 Fig. Genes expressed at higher or lower levels in DND1-GFP-hi cells often show a similar correlation in E16.5 sc-RNA-seq data from Law et al.

    Re-analysis of E16.5 male germ cell scRNA-seq from [31]. Cluster 0 and Cluster 1 have higher levels of Dnd1 and Cluster 2 has lower levels of Dnd1 (also Fig 1E). Violin plots showing levels of other germ cell genes differentially expressed between DND1-GFP-lo and DND1-GFP-lo cells in scRNA-seq of E16.5 male germ cell from [31].

    (TIF)

    S5 Fig. Markers previously reported to be associated with the apoptotic or male germ cell determination pathway do not consistently correlate with low or high DND1-GFP.

    Imaging from whole mount confocal microscopy on E14.5 DND1-GFP testes (A-C) and E18.5 DND1-GFP testes (D-F) for: ORF1p (A & D), TOMM20 (B & E), and CPARP (C & F) (red). All sections stained for DND1-GFP (GFP, green), germ cell nuclear antigen (GCNA, blue), and DNA (Hoechst, white). Blue triangles mark DND1-GFP-lo and red triangles mark DND1-GFP-hi germ cells. Scales in C and F are 50μm and apply to all images.

    (TIF)

    S6 Fig. Examples of genes from PANTHER protein classes “Transcriptional Regulation”, “Defense/Immunity”, “Transporters”, and “Transmembrane Signal Receptors” that were significantly differentially expressed in DND1-GFP-lo compared to DND1-GFP-hi cells at E16.5 and/or E18.5.

    hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5: not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

    (TIF)

    S7 Fig. Chromatin modifiers are disproportionately over-expressed in DND1-GFP-hi cells with genes either shared between DND1-GFP-hi cells at E16.5 and E18.5 or exclusive to DND1-GFP-hi cells E16.5.

    Euler plot of significant differentially expressed genes that are part of the GO biological process category “regulation of chromatin modification”. There are no genes differentially expressed in both E16.5 and E18.5 DND1-GFP-lo cells. Chart shows top 10 genes for each region of the Euler plot (for full list see S2 Table).

    (TIF)

    S8 Fig. Input biological replicates are most similar to each other across all time points, while immunoprecipitation (IP) biological replicates are most similar to each other within a time point.

    Pearson correlation matrix of input and IP biological replicates at E14.5, E16.5, E18.5 used in RIP-seq with breakouts showing isolated correlations for input and IP biological replicates at each time point.

    (TIF)

    S9 Fig. All paired input and immunoprecipitation (IP) biological replicates at all time points show a subset of transcripts that graph as targets.

    A: Scatter plots of log2(TPM) for paired input and immunoprecipitation (IP) biological replicates at E14.5, E16.5, E18.5. Guideline with slope of 1 is plotted to help delineate genes in quadrant I that are above the line, indicative of being a transcript target of DND1. B: Gene expression values (TPM) for input and IP biological replicates at E14.5, E16.5, E18.5 for highly expressed housekeeping genes that did not enrich as targets (Gapdh, Acta2), and a highly expressed gene that did enrich as a target (Lamb1).

    (TIF)

    S10 Fig. Many epigenetic regulators that are differentially expressed dependent on the level of DND1 are both targets of DND1 and fail to up-regulate in Dnd1Ter/Ter mutant germ cells.

    A: Heat map of epigenetic regulators showing: L2FC in DND1-GFP-hi vs DND1-GFP-lo cells from hiloRNA-seq (E16.5, E18.5); enrichment of transcript as a DND1 target as a function of L2FC in RIP-seq (E14.5, E16.5, E18.5); and L2FC in Dnd1Ter/Ter mutant vs wild type germ cells from Dnd1Ter-seq (E14.5) from [23]. B: Expanded gene-level hiloRNA-seq and RIP-seq data for representatives from each epigenetic regulator family in Figs 4B and S10A. With the exception of Hdac4, transcript levels for all genes shown are higher in DND1-GFP-hi cells, and all genes except Brdt map as significant targets of DND1 at one or more stages. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5 and RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

    (TIF)

    S11 Fig. Collagens and genes associated with the novel intermediate pro-spermatogonia identity [16, 55] are DND1 targets, but only some are significantly differentially regulated in DND1-GFP-lo and DND1-GFP-hi cells.

    A: Heat map of collagens showing L2FC in DND1-GFP-lo vs DND1-GFP-hi cells from hiloRNA-seq (E16.5, E18.5). B: Enrichment of transcript as DND1 targets as a function of L2FC in RIP-seq (E14.5, E16.5, E18.5). C, D: Expanded gene-level hiloRNA-seq and RIP-seq data for representative genes associated with (C) intermediate pro-spermatogonia identity as reported in [55], and (D) pro-spermatogonia to spermatogonial stem cell transition as reported in [16]. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5 and RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

    (TIF)

    S12 Fig. Y-linked and X-linked genes are differentially regulated in DND1-GFP-lo and DND1-GFP-hi cells and some enrich as DND1 targets.

    Gene-level hiloRNA-seq and RIP-seq data for Y-linked (Eif2s3y, Uty, Smcy) and X-linked (Eif2s3x, Utx, Smcx) genes. hiloRNA-seq expression P-value (DESeq2) between DND1-GFP-lo and DND1-GFP-hi cells at E16.5 and E18.5 and RIP-seq enrichment P-value (DESeq2): not significant (ns), <0.05 (*), <0.01 (**), <0.001 (***), <0.0001 (****).

    (TIF)

    S13 Fig. Workflow of DND1-GFP and Annexin V FACS analysis.

    A: Live cells were gated using FSC-A and SSC-A parameters, and DND1-GFP-lo cells and DND1-GFP-hi cells were identified with GFP fluorescence and cell count parameters. DND1-GFP-lo and DND1-GFP-hi populations were plotted by 647/APC-A (Annexin) fluorescence. B: Annexin V-positive threshold was determined by examining live cells for GFP and 647/APC-A fluorescence. Using this threshold, the DND1-GFP positive cells were divided into DND1-GFP-hi and DND1-GFP-lo, creating four quadrants.

    (TIF)

    Attachment

    Submitted filename: Ruthig_RTR.docx

    Data Availability Statement

    The raw hiloRNA-seq and RIP-seq data is available on GEO (GSE197610) as a super-series. The complete script for all data analysis has also been deposited in GEO as a singular simple text file in Rmarkdown syntax along with all the supporting, intermediate, and Rstudio data files used in the analysis. These scripts are complete but contain redactions to protect the cybersecurity of the Duke Computing Cluster and the Scientific Computing Unit at Weill Cornell Medicine, and therefore will need adjustments to function on other computing clusters and with Rstudio sessions. The unprocessed matrix counts and processed Loupe File from the aggregated E16.5 scRNA-seq re-analysis have also been deposited on GEO (GSE197610) as part of the same super-series and are linked to the raw scRNA-seq data published in Law et al. 2020 which is also on GEO (GSE124904) https://doi.org/10.1038/s41467-019-10596-0. The Dnd1Ter-seq dataset published in Ruthig et al. 2019 was deposited with that publication on GEO (GSE132719) https://doi.org/10.1242/dev.175950.


    Articles from PLOS Genetics are provided here courtesy of PLOS

    RESOURCES