Abstract
Germline specification in mammals occurs through an inductive process whereby competent cells in the post-implantation epiblast differentiate into primordial germ cells (PGC). The intrinsic factors that endow epiblast cells with the competence to respond to germline inductive signals remain unknown. Single-cell RNA sequencing across multiple stages of an in vitro PGC-like cells (PGCLC) differentiation system shows that PGCLC genes initially expressed in the naïve pluripotent stage become homogeneously dismantled in germline competent epiblast like-cells (EpiLC). In contrast, the decommissioning of enhancers associated with these germline genes is incomplete. Namely, a subset of these enhancers partly retain H3K4me1, accumulate less heterochromatic marks and remain accessible and responsive to transcriptional activators. Subsequently, as in vitro germline competence is lost, these enhancers get further decommissioned and lose their responsiveness to transcriptional activators. Importantly, using H3K4me1-deficient cells, we show that the loss of this histone modification reduces the germline competence of EpiLC and decreases PGCLC differentiation efficiency. Our work suggests that, although H3K4me1 might not be essential for enhancer function, it can facilitate the (re)activation of enhancers and the establishment of gene expression programs during specific developmental transitions.
Subject terms: Epigenetic memory, Germline development, Epigenomics, Gene expression profiling, Gene regulation
While inductive signals controlling germline specification are well characterized, the intrinsic factors that allow epiblast cells to respond to such signals remain largely unknown. Here the authors use in vitro differentiated primordial germ cells to show that partial retention of histone H3K4 monomethylation within relevant enhancers is important for germline competence and specification.
Introduction
Competence can be defined as the ability of a cell to differentiate towards a specific cell fate in response to intrinsic and extrinsic signals1. While the extracellular signals involved in the induction of multiple cell fates have been described2, the intrinsic factors that determine the cellular competence to respond to those signals remain elusive. One major example illustrating the dynamic and transient nature of competence occurs early during mammalian embryogenesis, as the primordial germ cells (PGC), the precursors of the gametes, become specified. In mice, following implantation and the exit of naïve pluripotency (E4.5–E5.5), PGC are induced from the epiblast around E6.0–E6.25 at the proximo-posterior end of the mouse embryo3. The induction of PGC occurs in response to signals emanating from the extraembryonic tissues surrounding the epiblast: BMP4 from the extraembryonic ectoderm and WNT3 from the visceral endoderm4. Furthermore, regardless of their position within the embryo, formative epiblast cells (~E5.5–6.25)5–7 are germline competent when exposed to appropriate signals, but this ability gets lost as the epiblast progresses towards a primed pluripotency stage (>E6.5)4. However, the intrinsic factors that confer germline competence on the formative but not the primed epiblast cells remain obscure8,9, partly due to the limited cell numbers that can be obtained in vivo from mouse peri-implantation embryos (E4.5–E6.5). These limitations were mitigated by a robust in vitro differentiation system whereby mouse embryonic stem cells (ESC) grown under 2i conditions (naïve pluripotency) can be sequentially differentiated into EpiLC and PGCLC that resemble the formative epiblast and E9.5 PGC, respectively10. This system facilitated the mechanistic and genomic characterization of transcription factors (TFs)11–13 and epigenetic reprogramming events14–16 previously shown to be involved in PGC specification in vivo17,18.
Using the PGCLC in vitro differentiation system, two transcription factors, FOXD3 and OTX2, were found to promote the transition from naïve to formative pluripotency by coordinating the silencing of naïve genes and the activation of early post-implantation epiblast markers9,19,20. Subsequently, FOXD3 and OTX2 restrict the differentiation of EpiLC into PGCLC and, thus, the silencing of these TFs is required for in vitro germline specification9,20. The regulatory function of FOXD3 during these developmental transitions involves binding to and silencing of enhancers shared between naïve pluripotent cells and PGCLC9. Interestingly, during the transition from 2i ESC to EpiLC, FOXD3-bound enhancers lose TF and co-activator binding as well as H3K27ac but partly retain H3K4me1. This suggests that these enhancers do not become fully decommissioned, but transiently display a chromatin state similar, but not identical, to that of primed enhancers21,22. Enhancer priming typically involves binding of pioneer TFs and pre-marking by H3K4me1 that can precede and facilitate subsequent enhancer activation (i.e. marking by H3K27ac, recruitment of RNA Pol II, production of eRNAs)23–26. Interestingly, in differentiated macrophages, enhancers activated upon stimulation rapidly lose H3K27ac and TF binding, while retaining H3K4me1 for considerably longer. It was proposed that H3K4me1 persistence could facilitate a faster and stronger enhancer induction upon restimulation27. It is currently unknown whether, during development, H3K4me1 persistence once enhancers become decommissioned can similarly facilitate their eventual re-activation21.
H3K4me1 is catalyzed by the SET domains of the histone methyltransferases MLL3 (KMT2C) and MLL4 (KMT2D), which are part of the COMPASS Complex28,29. The knockout (KO) of Mll3/4 or Mll4 alone impairs enhancer activation and results in differentiation defects in various lineages30–37. However, the importance of MLL3/4 for enhancer function might be independent of H3K4me1 deposition, since Mll3/4 catalytic mutant ESC (Mll3/4 dCD) in which H3K4me1 was globally lost, only displayed minor defects in enhancer activation (i.e. H3K27ac, RNA Pol II and eRNA levels) in comparison to Mll3/4 KO ESC38. Similarly, work in Drosophila melanogaster showed that, while the KO of Trr, the homolog of Mll3/4 in flies, was embryonic lethal, an amino acid substitution in the SET domain of Trr that globally reduced H3K4me1 did not impair development or viability39. On the other hand, subsequent work with Mll3/4 dCD ESC showed that the recruitment of chromatin remodelers40 and the establishment of long-range chromatin interactions32 required H3K4me1. Hence, the functional relevance of H3K4me1 for enhancer function is still under debate41.
Here we perform an extensive transcriptional and epigenetic characterization of the main stages of PGCLC differentiation to gain insights into the molecular basis of germline competence. Comparisons between germline competent EpiLC and non-competent EpiSC reveal that a notable fraction of PGCLC enhancers, which tend to be already active in the naïve stage (i.e. 2i ESC), partly retain H3K4me1 and remain accessible and responsive to TFs in EpiLC in comparison to EpiSC. Most importantly, the persistence of H3K4me1 within PGCLC enhancers seems to contribute to in vitro germline competence, as the absence of this histone mark reduces PGCLC differentiation efficiency.
Results
Characterization of the PGCLC in vitro differentiation system by single-cell RNA-seq
To overcome the scarcity and transient nature of PGCs in vivo, we used an in vitro PGCLC differentiation system10 whereby mouse ESC are differentiated into EpiLC from which PGCLC can be obtained within heterogeneous embryoid bodies (EB). In contrast, Epiblast stem cells (EpiSC), resembling the post-implantation gastrulating epiblast, cannot be efficiently differentiated into PGCLC and, thus, display limited germline competence (Fig. 1a)7,10. To better characterize this in vitro system, we performed single-cell RNA sequencing (scRNA-seq) across multiple stages of PGCLC differentiation (the scRNA-seq data can be easily explored with the cloupe file available through GEO: GSE155088). t-SNE analysis of the resulting single-cell transcriptomes (Supplementary Data 1) showed that cells tend to cluster within their corresponding differentiation stage (Fig. 1b). However, Day 2 and Day 4 EB showed cellular heterogeneity and formed distinct subclusters (Fig. 1b, c). One of these subclusters was identified as PGCLC based on the high expression of major PGC markers (e.g. Prdm14, Prdm1, Tfap2c, Dppa3), while the additional subpopulations within d4 EB were annotated based on the expression of cell identity markers identified by single-cell transcriptional profiling of E8.25 mouse embryos42 (Fig. 1c, Supplementary Fig. 1a, b). Remarkably, these subclusters were similar to the extraembryonic tissues (i.e. extraembryonic ectoderm, extraembryonic mesoderm, and endothelium) that surround PGCs in the proximo-posterior end of the mouse embryo following germline specification in vivo. Furthermore, differential expression analysis between the PGCLC cluster and the remaining cells of the d2 and d4 EBs (see Methods) led to the identification of a set of 389 PGCLC genes (Supplementary Data 2), which included the PGC markers mentioned above as well as major naïve pluripotency regulators (e.g. Nanog, Esrrb) (Fig. 1d, e). In agreement with previous reports10, we found that several PGCLC genes were more highly expressed in ESC, progressively silenced in EpiLC, and finally reactivated in PGCLC (Fig. 1d, e, Supplementary Fig. 1c). Despite the similar expression profile of ESC and PGCLC, major ESC (e.g. Klf4, Zfp57, Tbx3) and PGC (e.g Prdm1, Dppa3, Dnd1) markers were specifically expressed in the ESC and PGCLC clusters, respectively (Fig. 1e, Supplementary Fig. 1d, Supplementary Data 2). Furthermore, differential expression analysis between PGCLC and in vitro pluripotent cell types (i.e. ESC, d2 EpiLC, and EpiSC) showed that ~25% of the PGCLC genes (100/389) displayed a PGCLC specific expression pattern (Supplementary Fig. 1e). Therefore, despite their similarities, 2i ESC and PGCLC are transcriptionally distinct. Lastly, we also defined gene sets specific for the three investigated in vitro pluripotent cell types (i.e. ESC, d2 EpiLC, and EpiSC) by performing differential gene expression analysis between each cell type and the other two (Supplementary Data 2).
Germline competent EpiLC are transcriptionally homogeneous
Previous work indicates that the acquisition of germline competence in day 2 (d2) EpiLC entails the complete dismantling of the naïve gene expression program5,6, which is then partly re-activated during PGCLC induction. In agreement with this, all d2 EpiLC clustered together and neither a distinct subpopulation indicative of a retained naïve pluripotency expression program nor signs of precocious germline induction could be identified (Fig. 1b, f; Supplementary Fig. 1f). Congruently, the cell-to-cell variability in gene expression levels, defined as transcriptional noise, was significantly lower in d2 EpiLC than in the preceding or subsequent cellular stages (Fig. 1g). This is in agreement with the transcriptional homogeneity of the E5.5 epiblast in vivo43, which bears the lowest transcriptional noise during mouse peri-implantation development44. Lastly, we analyzed bulk RNA-seq data generated in Otx2−/−20 and Prdm14−/−14 EpiLC, which despite displaying increased and decreased germline competence, respectively, showed normal expression of PGCLC genes (Supplementary Fig. 1g). Therefore, in agreement with the previous work6, our scRNA-seq analysis indicates that in vitro germline competence entails a transcriptionally homogeneous stage in which the gene expression program shared between naïve pluripotency and PGCLC is silenced.
Identification of PGCLC active enhancers
Many PGCLC genes, especially those active in ESC, are silenced in EpiLC and EpiSC (Fig. 1d, e, Supplementary Fig. 1c), yet only EpiLC display high germline competence. Taking previous observations into account9,45,46, we hypothesized that enhancers involved in the induction of PGCLC genes might display epigenetic differences between EpiLC and EpiSC that could explain their distinct germline competence. To test this hypothesis, we first identified distal H3K27ac peaks in d6-sorted PGCLC using publically available data15. In agreement with our previous observations9, a large fraction of the d6 PGCLC H3K27ac peaks were initially active in ESC, lost H3K27ac in EpiLC, and became progressively reactivated in d2 and d6 PGCLC (Supplementary Fig. 2a). Since most of the d6 PGCLC H3K27ac peaks were also active in ESC, we then used Capture Hi-C data generated in ESC47 to systematically associate these distal peaks to their putative target genes. Finally, we defined PGCLC enhancers as those distal d6 PGCLC H3K27ac peaks that could be physically linked to our PGCLC gene set (Supplementary Data 2). This resulted in 415 PGCLC enhancers linked to 216 of the 389 PGCLC genes (Fig. 2a). Furthermore, to compare epigenetic changes between different enhancer groups, EpiLC and EpiSC enhancers were defined using similar criteria (Methods; Supplementary Data 3).
To validate our PGCLC enhancer calling strategy, we first selected three representative enhancers linked to Esrrb, Klf5, and Lrrc31/Lrrc34, respectively (Fig. 2b, Supplementary Fig. 2b). These three enhancers were initially active in ESC, got silenced in EpiLC, and finally became re-activated in PGCLC. Each enhancer was individually deleted in ESC using CRISPR/Cas9 technology (Supplementary Fig. 2c). The deletion of the enhancers associated with Lrrc31/Lrrc34 and Klf5 significantly reduced the expression of the corresponding target genes in ESC and d4 EB (Supplementary Fig. 2d, e). The Esrrb enhancer deletion had a moderate effect in ESC and severely diminished Esrrb expression in d4 EB (Fig. 2c). The moderate effect in ESC could be explained by the presence of additional ESC-specific enhancers (Fig. 2b) that might control Esrrb expression in ESC and compensate for the enhancer deletion48. Similarly, since some PGCLC genes are associated with multiple and potentially redundant regulatory elements in PGCLC (i.e. 415 enhancers linked to 216 PGCLC genes; 1.9 enhancers/gene), we individually deleted three different enhancers (i.e. E1, E2, and E3) (Fig. 2d; Supplementary Fig. 2c) previously described as components of a Prdm14 super-enhancer49. In agreement with previous work, the E2 deletion strongly reduced Prdm14 expression in ESC, while the deletion of E3 or E1 had considerably smaller effects (Fig. 2e)49. Upon PGCLC differentiation, the regulatory importance of each enhancer changed and E1 clearly contributed to Prdm14 expression, especially during early PGCLC induction (Fig. 2e). These results suggest that, rather than being components of an ESC super-enhancer, the E1–E3 elements differentially contribute to Prdm14 expression in either ESC (i.e. E2) or early (d2 EBs) PGCLC (i.e. E1). Furthermore, in agreement with the role of Prdm14 as a PGC master regulator50, we found that the individual E1–E3 deletions significantly impaired PGCLC differentiation (Fig. 2f, Supplementary Fig. 2f). However, for the E2 and E3 enhancer deletions, we can not exclude that the impaired PGCC differentiation might be caused, at least partially, by the initial decrease in Prdm14 expression in ESC, which might compromise naïve pluripotency by increasing the levels of de novo DNA methyltransferases and, thus, of CpG methylation51–53. Altogether, the previous deletions support the relevance of the identified PGCLC enhancers and suggest that some of them (e.g. Esrrb and Prdm14 E1 enhancers) are particularly relevant during PGCLC induction, while others might be important in both ESC and PGCLC.
Partial decommissioning of PGCLC enhancers in EpiLC
To explore whether PGCLC enhancers display any chromatin differences between EpiLC and EpiSC, we generated an extensive set of ChIP-seq and ATAC-seq data sets in ESC, EpiLC, and EpiSC (Fig. 3a; Supplementary Fig. 3a). Overall, the most pronounced changes within PGCLC enhancers were observed for H3K4me1, which was higher in EpiLC, and H3K9me3 and mCpG, which were increased in EpiSC (Fig. 3a, b). These differences were not observed for other enhancer groups (e.g. EpiSC enhancers), suggesting that they were not due to technical reasons or global epigenomic differences between EpiLC and EpiSC (Fig. 3a). Moreover, when analyzing the transcription start sites (TSS) of the PGCLC genes we found that, although H3K4me1 was higher in EpiLC than in EpiSC, its overall levels were low compared to PGCLC enhancers (Fig. 3a). Similarly, constitutive heterochromatin marks (e.g. H3K9me3, mCpG) around TSS increased in EpiSC, but their levels were lower than within PGCLC enhancers. Other chromatin features typically found at promoter regions (e.g. H3K4me2/3, high chromatin accessibility) were similar around PGCLC TSS in EpiLC and EpiSC (Fig. 3a). Therefore, subsequent analyses were focused on PGCLC enhancers rather than promoters.
To identify PGCLC enhancers showing consistent differences between EpiLC and EpiSC, we focused on H3K4me1, which is widely considered as an enhancer mark54. We generated two additional H3K4me1 ChIP-seq replicates in EpiLC and EpiSC (including data from a different ESC strain i.e. R1) and classified PGCLC enhancers in two groups: Group I: PGCLC enhancers showing an EpiLC/EpiSC H3K4me1 ratio higher than 1.2-fold in at least two replicates Group II: all other PGCLC enhancers showing EpiLC/EpiSC H3K4me1 ratios lower than 1.2-fold. Based on this classification, 71% of the PGCLC enhancers were assigned to Group I and, thus, showed higher H3K4me1 levels in EpiLC than in EpiSC in at least two replicates (71% Group I; 29% Group II) (Fig. 3c, d; Supplementary Fig. 3b, c). The Group I PGCLC enhancers also retained more H3K27ac in EpiLC, while the differences in H3K4me2 and chromatin accessibility, as measured by ATAC-seq, were rather moderate (Supplementary Fig. 3d). In contrast, the Group I enhancers displayed lower mCpG and H3K9me3 levels in EpiLC than in EpiSC, while these differences were considerably less obvious for the Group II enhancers (Fig. 3d, e, Supplementary Fig. 3e). Notably, previous work showed that germline genes and enhancers are repressed by the H3K9 methyltransferase EHMT2 during mouse post-implantation development45, suggesting that Group I enhancers get progressively silenced as ESC differentiate into EpiLC and EpiSC. Furthermore, H3K4me1 and mCpG were inversely correlated across Group I PGCLC enhancers, particularly in EpiLC (Fig. 3f). This negative correlation was also investigated by performing H3K4me1 ChIP in EpiLC followed by bisulfite sequencing of two representative Group I PGCLC enhancers (i.e. Esrrb and Lrrc31/Lrrc34 enhancers) (Fig. 3g). In comparison to the input genomic DNA, the H3K4me1-enriched DNA showed lower mCpG levels at the two analyzed enhancers (Fig. 3g), further supporting the antagonism of CpG methylation and H3K4 methylation55,56.
In summary, a subset of the PGCLC enhancers shows partial retention of H3K4me1 and lower levels of heterochromatic marks (i.e. mCpG, H3K9me3) in EpiLC compared to EpiSC (Fig. 3d, e). These enhancers tend to be initially active in ESC but lose H3K27ac and TF/coactivator binding (indirectly measured by ATAC-seq) already in EpiLC (Supplementary Fig. 3d). Therefore, our data suggest that a significant fraction of PGCLC enhancers are not fully decommissioned in EpiLC, which we hypothesize could facilitate their re-activation in PGCLC and, thus, contribute to germline competence.
Impaired decommissioning of PGCLC enhancers in Otx2 deficient cells
The deletion of Otx2 increases and prolongs germline competence in EpiLC20. Hence, to further explore the relationship between the chromatin features of PGCLC enhancers and germline competence, we analyzed Otx2−/− cells57. Firstly, we confirmed the extended germline competence of Otx2−/− cells, which can be robustly differentiated into PGCLC after keeping them for up to seven days in EpiSC culture conditions (Fig. 4a, Supplementary Fig. 4a). ChIP-seq experiments revealed that the increased germline competence of Otx2−/− cells was correlated with the retention of H3K4me1, H3K4me2, and H3K27ac in PGCLC enhancers, particularly within those displaying higher levels of H3K4me1 in WT EpiLC (i.e. Group I enhancers) (Fig. 4b, c, Supplementary Fig. 4b). Moreover, H3K4me1/2 levels were higher in Otx2−/− ESC than in WT ESC in some but not all of the investigated Group I enhancers (Supplementary Fig. 4c). Nevertheless, the correlation between germline competence and H3K4me1 levels within PGCLC enhancers was not perfect, since Otx2−/− d4 EpiSC displayed higher germline competence than WT EpiLC, yet slightly lower H3K4me1 levels within Group I enhancers (Fig. 4c). Therefore, in addition to H3K4me1, other chromatin features within PGCLC enhancers might also contribute to the extended germline competence of Otx2−/− cells. In agreement with this possibility, Group I enhancers showed slightly higher H3K4me2 in Otx2−/− d4 EpiSC than in WT EpiLC (Fig. 4c). Furthermore, genome-wide as well as detailed analysis of representative enhancers (i.e. Esrrb and Prdm1 enhancers) showed that the increased competence of Otx2−/− EpiLC and d4 EpiSC was also reflected in reduced CpG methylation levels within Group I PGCLC enhancers (Fig. 4d, Supplementary Fig. 4d). Overall, as the PGCLC genes get properly silenced in Otx2−/− EpiLC (Supplementary Fig. 1g), these results suggest that the extended germline competence of Otx2−/− cells could be linked to the impaired and delayed decommissioning of a subset of PGCLC enhancers.
Partially decommissioned PGCLC enhancers are accessible and responsive to transcriptional activators in EpiLC
To investigate whether the partial decommissioning of PGCLC enhancers in EpiLC compared to EpiSC could have any functional consequences, we evaluated the accessibility of these enhancers to major PGCLC transcriptional activators. To this end, we generated clonal ESC lines in which HA-tagged PRDM14 or NANOG could be overexpressed upon the addition of Doxycycline (Dox) (Supplementary Fig. 4e, f). In agreement with previous reports, the overexpression of either PRDM14 or NANOG upon differentiation of EpiLC into PGCLC yielded a high percentage of PGCLC in the absence of growth factors (Fig. 4e, Supplementary Fig. 4g)12,58. In contrast, the overexpression of PRDM14 or NANOG upon differentiation from EpiSC resulted in considerably less PGCLC (Fig. 4e, Supplementary Fig. 4g). To assess whether PGCLC enhancers were differentially accessible to these TFs in EpiLC and EpiSC, we performed ChIP-seq experiments after a short induction of HA-tagged PRDM14 or NANOG in both cell types. Importantly, PGCLC enhancers, especially those belonging to Group I, were considerably more bound by PRDM14-HA and NANOG-HA in EpiLC than in EpiSC (Fig. 4f). Furthermore, NANOG-HA binding to the PGCLC enhancers resulted in increased H3K27ac and H3K4me2 levels in EpiLC but not in EpiSC, particularly within Group I enhancers (Fig. 4g, h, Supplementary Fig. 4h). Altogether, these results indicate that PGCLC enhancers, especially those belonging to Group I, are both accessible and responsive to transcriptional activators in EpiLC but not in EpiSC.
Increased decommissioning of PGCLC enhancers in the absence of H3K4me1
Our data suggest that a significant fraction of PGCLC enhancers display permissive chromatin features in EpiLC that might contribute to their increased germline competence compared to EpiSC. These permissive features could be attributed, at least partly, to the persistence of H3K4me1 in EpiLC, as this histone mark could (i) protect PGCLC enhancers from mCpG and H3K9me2/356,59–63 and/or (ii) facilitate the recruitment of chromatin remodeling complexes40. To directly assess the importance of H3K4me1 for germline competence and PGCLC specification, we used mESC catalytically deficient for MLL3 and MLL4 (dCD and dCT ESC lines; Fig. 5a)38. Previous work showed that in dCD ESC, active enhancers show a moderate reduction in H3K27ac, while gene expression, eRNA levels, and the binding of MLL3/4 and their associated complexes are not affected, indicating that H3K4me1 is largely dispensable for the maintenance of enhancer activity (Supplementary Fig. 5a)38,39. Additional characterization of dCD and dCT cell lines revealed that, upon differentiation, the major reductions in H3K4me1 and H3K4me2 that PGCLC enhancers already displayed in ESC became further exacerbated in EpiLC and EpiSC (Fig. 5b, c, Supplementary Fig. 5b–d). Moreover, H3K27ac levels within PGCLC enhancers were also reduced in dCD/dCT cells compared to their WT counterparts, although such differences were not as pronounced as for H3K4me1 (Fig. 5b, Supplementary Fig. 5b, d). Similarly, other enhancer groups (i.e. EpiLC and EpiSC enhancers) also displayed strong H3K4me1/2 losses and milder H3K27ac reductions in dCD cells (Fig. 5c, Supplementary Fig. 5d, g). In contrast, the levels of the previous active histone modifications were rather similar around the TSS of PGCLC genes, with H3K4me1 even showing a slight increase in dCD cells (Supplementary Fig. 5d). In agreement with the protective role of H3K4me1/2 against CpG methylation56,60, mCpG levels within PGCLC enhancers were generally elevated in dCD ESC64, while in dCD EpiLC the increased methylation was evident among some Group I enhancers (Fig. 5d, Supplementary Fig. 5e, f). Next, to evaluate the potential functional consequences of the previous epigenetic changes, we generated RNA-seq data in WT and dCD ESC, EpiLC, and EpiSC. In agreement with previous work in ESC indicating that H3K4me1 is dispensable for enhancer function38,39, the genes associated with PGCLC, EpiLC, and EpiSC enhancers showed rather minor expression differences between WT and dCD cells in the three investigated cell types (Fig. 5e, Supplementary Fig. 5h, i, Supplementary Data 4).
Overall, our analyses indicate that the decommissioning of a subset of PGCLC enhancers gets exacerbated in dCD EpiLC compared to their WT counterparts, resulting in a chromatin state similar to the one observed in WT EpiSC (i.e. lower H3K4me1 and higher mCpG; Fig. 3). However, these epigenetic changes do not result in major gene expression changes in any of the investigated in vitro pluripotent cell types.
H3K4me1 is necessary for in vitro germline competence
To address whether the increased decommissioning of PGCLC enhancers observed in dCD/dCT EpiLC could compromise their germline competence, these MLL3/4 catalytic mutant cell lines were differentiated into PGCLC. Chiefly, both dCD and dCT cells showed a significant reduction in their PGCLC differentiation capacity (Fig. 6a, Supplementary Fig. 6a). In agreement with MLL3 and MLL4 being functionally redundant65, such PGCLC differentiation defects were not observed when using cells that were catalytic mutant for MLL4 but not MLL3 (i.e. 4CT cells) (Fig. 5a, Fig. 6a). Furthermore, the compromised PGCLC differentiation of the dCD cells was still observed at a later time point (day 6, Supplementary Fig. 6b), indicating that the observed defects are not simply explained by a delay in PGCLC specification.
To gain molecular insights into the compromised PGCLC differentiation capacity of dCD cells, we performed scRNA-seq analyses of WT (1416 cells) and dCD (1699 cells) d4 EBs (Supplementary Data 5). UMAP (Uniform Manifold Approximation and Projection for Dimension Reduction) analysis of the resulting single-cell transcriptomes confirmed the presence within d4 EBs of subclusters resembling the main cell populations (i.e. PGCLC, ExEctorderm-like, ExMesoderm-like) that characterize the proximo-posterior end of the mouse embryo following germline specification (Fig. 6b). In addition, we also identified additional subclusters (i.e. ExEndoderm/Gut-like, 2-cell-like; Fig. 6b) that were not observed in the previous scRNA-seq analyses (Fig. 1, Supplementary Fig. 1), probably due to the lower number of d4 EB cells sequenced in those initial experiments. Most importantly, the PGCLC cluster consisted mostly of WT cells (162 WT cells (11.4% of all cells in WT d4 EBs) vs 8 dCD cells (0.5% of all cells in dCD d4 EB); Fig. 6c, d), suggesting that the defective PGCLC differentiation of dCD cells might be even more pronounced than our experiments using FACS and cell surface markers indicate (Fig. 6a). Moreover, while the transcriptomes of some extraembryonic-like cell types were similar between WT and dCD cells (e.g. Ex-Endoderm/Gut-like, Endothelial-like), many dCD cells were located in clusters with poorly defined identity that did not show differential expression of specific markers (Fig. 6b, Supplementary Fig. 6c). These undefined clusters were transcriptionally more heterogeneous than the remaining clusters found within the d4 EBs (Supplementary Fig. 6d). Interestingly, we noticed that within some of the non-PGCLC clusters there were cells showing high expression of PGCLC genes (Supplementary Fig. 6e) and, thus, a cellular identity somehow similar to PGCLC. Therefore, to further explore the PGCLC differentiation defects of the dCD cells we used an alternative approach to identify all cells within the WT and dCD d4 EBs that express major PGC markers (i.e. Prdm1 or Dppa3) but not naïve pluripotency ones (i.e. Klf4). Although the overall abundance of these Prdm1orDppa3 + /Klf4- cells was similar in WT and dCD EBs, their distribution among the different cell clusters was quite different. Namely, the vast majority of WT Prdm1orDppa3 + /Klf4- cells were found within the PGCLC cluster (91%), while many dCD cells were part of the undefined subclusters (61%) (Fig. 6e), suggesting important transcriptional differences between WT and dCD Prdm1orDppa3 + /Klf4- cells (Fig. 6f). Congruently, the expression of the PGCLC genes associated with PGCLC enhancers (e.g. Tfap2c, Prdm14, Utf1, Esrrb) was significantly reduced in dCD Prdm1orDppa3 + /Klf4- cells in comparison to their WT counterparts, while smaller differences were observed for the PGCLC genes without associated enhancers (Fig. 6g, Supplementary Fig. 6f). Overall, our scRNA-seq analyses suggest that the induction of the PGCLC genes, particularly of those linked to PGCLC enhancers, is reduced but not fully abrogated in the dCD cells. Consequently, many dCD cells display a poorly defined identity in which the PGCLC expression program is only partially established.
Finally, to further assess the importance of H3K4me1 for PGCLC induction, we investigated whether the extended germline competence of Otx2−/− EpiLC could be also attributed, at least partly, to the retention of H3K4me1 and the impaired decommissioning of PGCLC enhancers (Fig. 4). To this end, we deleted Otx2 in dCD ESC (i.e. dCD Otx2−/−) as well as in their parental WT ESC (i.e. R1 Otx2−/−) (Supplementary Fig. 6g) and differentiated them into PGCLC. As expected, the deletion of Otx2 in the R1 ESC resulted in increased germline competence (Fig. 6h), although not as pronounced as in E14 ESC (Fig. 4a)20, which could be attributed to the variable germline competence observed among different ESC lines10. Most importantly, both dCD and dCD Otx2−/− EpiLC showed a strong and similar reduction in their PGCLC differentiation capacity (Fig. 6h). Furthermore, genome-wide and locus-specific analyses of mCpG levels revealed that Group I PGCLC enhancers were considerably more methylated in dCD Otx2−/− EpiLC than in Otx2−/− EpiLC (Fig. 6i, Supplementary Fig. 6h).
Altogether, our data strongly suggest that H3K4me1 is important for in vitro germline competence and proper PGCLC induction. Although we cannot rule out that gene expression and epigenetic changes in ESC and/or extraembryonic-like cell types66 might also contribute to the PGCLC differentiation defects observed in dCD/dCT cells, our data suggests that the persistence of H3K4me1 within the PGCLC enhancers might facilitate their reactivation during PGCLC induction.
PGCLC enhancers get heterogeneously decommissioned in formative epiblast cells in vivo
Overall, our work using the PGCLC in vitro differentiation system suggests that germline competence entails the persistence of permissive chromatin features within a significant fraction of PGCLC enhancers. To evaluate whether this is also true in vivo, we took advantage of several epigenomic datasets recently generated from mouse embryos67. Firstly, analysis of DNAse I data obtained in E9.5 and E10.5 PGC showed that our in vitro defined PGCLC enhancers are highly accessible in vivo (Fig. 7a). Next, to evaluate the dynamics of PGCLC enhancer decommissioning, we analyzed genome-wide CpG methylation data from mouse epiblasts68. In agreement with our in vitro observations, PGCLC enhancers showing incomplete decommissioning in EpiLC (i.e. Group I enhancers) displayed lower CpG methylation levels in germline competent E5.5 epiblast cells than in the E6.5 epiblast (Fig. 7b), in which germline competence is already reduced4.
Previous single-cell CpG methylation analyses revealed that, during mouse peri-implantation stages, the formative epiblast is particularly heterogeneous, especially within enhancers69. Therefore, we hypothesize that the lower CpG methylation levels observed for some PGCLC enhancers in the E5.5 epiblast in comparison to the E6.5 epiblast could be the result of the increased cell-to-cell variation. To evaluate this idea, we analyzed single-cell data in which DNA methylation and gene expression were measured for the same cells across different epiblast stages (i.e. E4.5, E5.5, and E6.5)43. Firstly, we measured mCpG heterogeneity by comparing the methylation status of individual CpGs within PGCLC enhancers70 (Fig. 7c). This analysis revealed that the formative epiblast (E5.5) displayed the highest variation in mCpG, while in the primed epiblast (E6.5) the PGCLC enhancers were more homogeneously methylated (Fig. 7d). Interestingly, when comparing different enhancer sets across epiblast stages, the highest epigenetic heterogeneity (~30 %) was observed for the Group I PGCLC enhancers in the E5.5 epiblast (Fig. 7e). As the CpG coverage for the Group I PGCLC enhancers was similar across epiblast stages, the previous differences in mCpG heterogeneity are unlikely to be caused by technical reasons (Supplementary Fig. 7a). Lastly, we compared mCpG levels within Group I PGCLC enhancers and the expression of their associated genes across the profiled epiblast cells (Fig. 7f). Similar to what we observed for EpiLC and EpiSC, the PGCLC genes associated with the Group I enhancers display low and similar expression in E5.5 and E6.5 epiblast cells despite the differences in CpG methylation.
Overall, the previous data support that PGCLC enhancers are also partially decommissioned in the formative epiblast and indicate that such partial decommissioning could indeed reflect epigenetic heterogeneity. Nevertheless, the relevance of the partial decommissioning and epigenetic heterogeneity of PGCLC enhancers for germline competence in vivo remains to be demonstrated.
Discussion
Direct evidences supporting the functional relevance of H3K4me1 are scarce, partly due to the difficulties in separating the enzymatic and non-enzymatic functions of histone methyltransferases71. This limitation was recently overcome by establishing Mll3/4 catalytic mutant ESC lines in which H3K4me1 is lost from enhancers38. Under self-renewing conditions, the loss of H3K4me1 partly reduced H3K27ac but did not affect transcription from either enhancers or gene promoters, suggesting that H3K4me1 is dispensable for the maintenance of enhancer activity38. Similarly, H3K4me1 was not required for de novo enhancer activation (i.e. without the prior presence of H3K4me1) upon somatic differentiation of ESC (Fig. 5)66. However, it is still unclear whether H3K4me1 is important for the activation of primed enhancers already pre-marked with this histone modification25,31. In the case of in vitro germline competence, here we report that a subset of PGCLC enhancers gets partly decommissioned in EpiLC and retains permissive chromatin features, including H3K4me1, already present in a preceding active state (i.e. in 2i ESC) (Fig. 7g). This resembles the so-called latent enhancers previously described in differentiated macrophages, in which, following an initial round of activation and silencing, the persistence of H3K4me1 was proposed to facilitate subsequent enhancer induction upon restimulation27. The mechanisms involved in the persistence of H3K4me1 and other permissive chromatin features are still unknown, although we can envision at least two non-mutually exclusive possibilities: (i) a passive mechanism whereby MLL3/4 binding to PGCLC enhancers is already lost in EpiLC, but H3K4me1 can still be transiently retained due to the slow dynamics of H3K4 demethylation72,73; (ii) an active maintenance mechanism similar to the one reported for enhancer priming24,25, whereby the binding of certain TFs might enable the persistent recruitment of MLL3/4 and the retention of H3K4me1 within PGCLC enhancers. Since PGCLC enhancers display low and similar ATAC-seq signals in EpiLC and EpiSC (Fig. 3), this would argue in favor of passive mechanisms rather than active retention of TFs and co-activators (e.g. MLL3/4) in EpiLC.
Drosophila melanogaster embryos expressing catalytic deficient Trr, the homolog of mammalian Mll3/4, develop normally, thus arguing against a major role for H3K4me1 in enhancer function39. In contrast, the loss of MLL3/4 catalytic activities in mouse embryos resulted in early embryonic lethality (~E8.5)66, suggesting that H3K4me1 is functionally relevant during mammalian embryogenesis. It is important to consider that, in comparison to mammalian cells, the D. melanogaster genome is largely devoid (<0.03%) of CpG methylation throughout its entire life cycle74. Previous studies have extensively documented the protective role of H3K4me1/2 against mCpG and heterochromatinization56,59–63. Congruently, we showed that H3K4me1 and mCpG levels within PGCLC enhancers are negatively correlated and that H3K4me1-deficient cells displayed increased mCpG levels within Group I PGCLC enhancers. Therefore, although the importance of mCpG for enhancer activity is still debated75–77, it is tempting to speculate that the lack of DNA methylation in flies renders them less sensitive to H3K4me1 loss. Nevertheless, Trr catalytic mutant fly embryos displayed aberrant phenotypes under stress conditions, suggesting that H3K4me1 might help to fine-tune enhancer activity under suboptimal conditions. Similarly, we found that the induction of PGCLC genes linked to enhancers was reduced in dCD cells, although the magnitude of the observed gene expression changes was moderate (Fig. 6g). Therefore, we propose that H3K4me1 might facilitate, rather than being essential for, enhancer (re)activation and the robust induction of developmental gene expression programs (Fig. 7g). This H3K4me1 facilitator role might involve not only protection from DNA methylation and heterochromatinization but also increased accessibility and responsiveness to transcriptional activators (Fig. 4)40 as well as physical proximity between genes and enhancers32. Future work in additional species and cellular transitions will establish the relevance and prevalence of H3K4me1 function within enhancers.
Finally, our work highlights the diverse mechanisms whereby the epigenetic status of enhancers can contribute to developmental competence. For instance, a subset of early brain enhancers displays a poised state in ESC (i.e. H3K4me1 + H3K27me3) that might facilitate their activation upon neural differentiation78, while in endodermal progenitors the binding of pioneer TFs and the consequent priming of relevant enhancers by H3K4me1 might facilitate their activation during subsequent differentiation24,25. Moreover, recent work based on chromatin accessibility or multi-omic profiling indicates that the commitment towards certain cell lineages involves the activation of already accessible enhancers, while in other cases this can occur through de novo enhancer activation43,79. Future work will elucidate the prevalence and regulatory mechanisms by which the chromatin state of enhancers can contribute to cellular competence and the robust deployment of developmental gene expression programs. In this regard, it would be important to evaluate whether, as reported here for PGCLC induction, the partial decommissioning of enhancers can be involved in their subsequent reactivation and, thus, in the induction of gene expression programs in other developmental contexts. Similar mechanisms might be also important in other physiological27 and pathological80,81 contexts in which a previously used but already dismantled gene expression program gets re-activated.
Methods
WT and transgenic ESC lines
Two different male mouse ESC lines (i.e. E14Tg2a and R1) were used as indicated for each particular experiment. The Mll3/4 dCD, Mll3/4 dCT, and Mll4 CT ESC lines were derived from WT R1 ESC as described in38. The E14 Otx2−/− ESC line57,82 and its parental E14Tg2a ESC were kindly provided by Christa Bücker. Other transgenic ESC lines generated in this study are more extensively described in the following sections and are derived from E14Tg2a ESC.
Cell culture and differentiation protocols
All ESC lines were cultured under serum+LIF conditions both for regular maintenance and CRISPR/Cas9 genome editing experiments. More specifically, ESC were cultured on gelatin-coated plates using Knock-out DMEM (Life Technologies) supplemented with 15% FBS (Life Technologies) and LIF. Before each PGCLC differentiation, ESC were adapted to 2i media (serum-free N2B27 medium supplemented with MEK inhibitor PD0325901 [0.4 μM, Miltenyi Biotec], GSK3β inhibitor CHIR99021 [3 μM, Amsbio], and LIF) for at least 4 days in gelatin-coated tissue culture plates83. All measurements described in the manuscript for ESC were performed under 2i conditions.
For EpiLC and PGCLC differentiation, the protocols from83 were followed. For the EpiLC differentiation, 6-well plates were coated with 15 µg ml−1 Fibronectin and 2 × 105 2i ESC were differentiated in N2B27 media supplemented with 20 ng ml−1 Activin A, 12 ng ml−1 bFGF, and 10 µl ml−1 KSR for two days (unless indicated otherwise for particular experiments). For the PGCLC differentiation, EpiLC or EpiSC were plated at a density of 2 × 104 cells ml−1 and cultured as embryoid bodies (EB) on Ultra-Low attachment multiwell plates (Corning® Costar®) in GK15 medium supplemented with growth factors (0.5 µg ml−1 BMP4, 0.1 µg ml−1 SCF, 1000 U ml−1 LIF and 50 ng ml−1 EGF, no BMP8a). Unless otherwise indicated, PGCLC were typically analyzed after four days.
EpiSC differentiation was performed according to84. 2i ESC were plated at a density of 2.5 × 105 cells ml−1 on plates coated with 15 µg ml−1 Fibronectin and differentiated into EpiSC by passaging them in N2B27 media supplemented with 20 ng ml−1 Activin A and 12 ng ml−1 bFGF for at least eight days or as indicated in the results section for particular experiments.
Enhancer and gene deletions with CRISPR/Cas9
Pairs of gRNAs flanking each of the selected enhancers/genes were designed with benchling (https://www.benchling.com). ESC were transfected with pairs of gRNA-Cas9 expressing vectors85 specific for each enhancer/gene using Lipofectamine 3000 (invitrogen). After transfection and puromycin selection, single cells were seeded into a 96 well plate to derive clonal cell lines. Then, clonal lines were genotyped by PCR and the presence of the intended deletions in each clonal line was verified by Sanger-sequencing. gRNA sequences and genomic coordinates of the different deletions are listed in Supplementary Data 6.
Generation of the Mll3/4 dCD, Mll3/4 dCT and Mll4 CT ESC lines
The Mll3/4 dCD ESC line was previously generated and described38. The Mll3/4 dCT and Mll4 CT ESC lines were also generated as described in38. Briefly, R1 ESC were co-transfected with a 200 bp single-stranded oligonucleotide donor template harboring the desired point mutations and a vector coexpressing Cas9 and gRNAs targeting Mll3 and Mll4. After 48 h, single GFP + cells were sorted into 96 well plates coated with fibronectin (5 mg ml−1). The resulting clonal ESC lines were genotyped by PCR amplification of a region spanning the cleavage site, followed by digestion with a restriction enzyme predicted to only cut the wild-type PCR products. Once candidate ESC lines were selected, the presence of the intended Mll3/4 mutations was confirmed by Sanger sequencing.
Ectopic and inducible expression of candidate transcription factors
Mouse Prdm14 and Nanog cDNAs were amplified and cloned into a Doxycycline (Dox)-inducible piggyBac vector86 enabling the ectopic expression of selected genes fused with an HA-tag at their C-terminus. The resulting piggyBac vectors were transfected together with a Super PiggyBac Transposase expressing vector (Systems Bioscience) and a Tet transactivator. After transfection, single cells were seeded into a 96 well plate to derive clonal ESC lines. The clonal ESC lines with the lowest expression of the transgenic genes in the absence of Dox were selected. To evaluate the effects of PRDM14 and NANOG ectopic expression on PGCLC specification, 1 µg ml−1 Dox was added once PGCLC differentiation was started. To investigate the effects of the ectopic expression of PRDM14 or NANOG in EpiLC and EpiSC, 1 µg ml−1 Dox was added 18 h before the cells were collected for downstream ChIP-seq experiments (i.e. EpiLC differentiation was initiated, after 30 h Dox was added and cells were collected after 48 h; EpiSC were maintained in differentiation media for at least eight passages and then treated with Dox for 18 h).
Quantification of PGCLC by flow cytometry
In general, PGCLC were quantified using antibody staining and flow cytometry83. Briefly, after four days of PGCLC differentiation, the resulting EB were dissociated and stained for 45 min with antibodies against CD61 (Biolegends, cat#104307; dilution 1:500) and CD15 (Thermo Fisher, cat#50-8813-41; dilution 1:200) conjugated with PE and Alexa Fluor 647, respectively. All PGCLC quantifications were performed using the FACS CantoII Cytometer (BD Bioscience) equipped with the BD FACSDiva Software. PGCLC sorting was performed on a FACS AriaIII cell sorter (BD Bioscience) and the data were analyzed with FlowJo (Version 10.6.1).
ATAC-seq
The ATAC-seq protocol was adapted from87 using 3.8 × 104 and 5.0 × 104 cells for the two replicates generated for each of the investigated cell types, respectively. Briefly, the cells were lysed in lysis buffer (10 mM Tris-HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl2, 0.1% IGEPAL CA-630) supplemented with freshly added protease inhibitor for 15 min. Following centrifugation, the pellet was resuspended in a Tn5 transposase reaction mix (Illumina) for 30 min at 37 °C. Following DNA purification with the MinElute PCR purification kit (Qiagen), libraries were prepared with the Nextera DNA library prep kit (Illumina).
ChIP-seq and ChIPmentation
Cells were cross-linked with 1% formaldehyde for 10 min, followed by quenching with 0.125 M glycine, harvesting, and washing in PBS containing protease inhibitors. The cells were sequentially lysed in three buffers (lysis buffer 1: 50 mM HEPES, 140 mM NaCl, 1 mM EDTA, 10% glycerol, 0.5% NP-40, 0.25% TX-100; lysis buffer 2: 10 mM Tris, 200 mM NaCl, 1 mM EDTA, 0.5 mM EGTA; lysis Buffer 3: 10 mM Tris, 100 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.1% Na-Deoxycholate, 0.5% N-lauroylsarcosine) with rotation for 10 min in between. Then, the chromatin was sonicated with the Epishear™ Probe Sonicator (Active Motif) with 20 s ON and 30 s OFF for 8 cycles. After centrifugation, the supernatant was divided into input and ChIP samples. The ChIP samples were incubated with specific antibodies (Supplementary Data 6) overnight, followed by immunoprecipitation with Dynabeads Protein G beads (invitrogen). Next, the beads were washed with RIPA buffer (50 mM Hepes, 500 mM LiCl, 1 mM EDTA, 1% NP-40, 0.7% Na-Deoxycholate) on a magnet, eluted (50 mM Tris, 10 mM EDTA, 1% SDS), and reverse cross-linked at 65 °C overnight in parallel with the input. Finally, DNAs were purified with the ChIP DNA Clean & Concentrator (Zymo Research) and analyzed by qPCR (Primers in Supplementary Data 6), or ChIP libraries were generated with the TruSeq kit (Illumina).
When ChIP-seq profiles were generated from low cell numbers (e.g. d4 EB obtained with the PGCLC differentiation protocol), the ChIPmentation protocol described by Schmidl et al. 201588 was used. Following immunoprecipitation with Dynabeads Protein G beads (invitrogen) in PCR tubes, samples were subject to Tagmentation (5 µl Tagmentation Buffer, 1 µl Tagmentation DNA Enzyme (Illumina), 19 µl Nuclease free water) in order to incorporate sequencing adapters. Lastly, DNAs were eluted from the beads and used for library preparation with the Nextera DNA library preparation kit (Illumina).
Locus-specific bisulfite sequencing
Genomic DNA was purified with phenol-chloroform and subjected to bisulfite conversion according to the EZ DNA Methylation-Direct™ Kit (Zymo Research). Then, primer pairs specific for each investigated enhancer were used for PCR amplification with the EpiTaq HS polymerase (TaKaRa). Finally, the resulting amplicons were gel-purified, subjected to blunt-end cloning, and analyzed by Sanger sequencing. The target sequence and the bisulfite primer for the selected PGCLC enhancers are listed in Supplementary Data 6.
ChIP-bisulfite sequencing
ChIP-bisulfite experiments were performed as described in89 with slight modifications. Firstly, the ChIP protocol described above was followed. After the final DNA purification, all the resulting H3K4me1 and H3K4me2 ChIP DNAs and 200 ng of the corresponding input DNA were subjected to bisulfite conversion according to the EZ DNA Methylation-Direct™ Kit (Zymo Research). Then, the subsequent amplification, purification, and sequencing steps were performed as described for the locus-specific bisulfite sequencing experiments.
Genome-wide bisulfite sequencing
Bisulfite sequencing libraries were prepared from column-purified DNA of d2 EpiLC using the PBAT method90. Briefly, for the bisulfite conversion the instructions of the EZ Methylation Direct MagPrep Kit (Zymo) were followed. After purification, bisulfite-converted DNAs were eluted from MagBeads directly into 39 µl of first-strand synthesis reaction master mix (1× Blue Buffer (Enzymatics), 0.4 mM dNTP mix (Roche), 0.4 µM 6NF preamp oligo (IDT)), heated to 65 °C for 3 min and cooled on ice. 50 U of klenow exo- (Enzymatics) was added and the mixture incubated on a thermocycler at 37 °C for 30 min after slowly ramping from 4 °C. Reactions were diluted to 100 µl and 20 U of exonuclease I (NEB) added and incubated at 37 °C before purification using a 0.8:1 ratio of AMPure XP beads. Purified products were resuspended in 50 µl of second strand mastermix (1× Blue Buffer (Enzymatics), 0.4 mM dNTP mix (Roche), 0.4 µM 6NR adapter 2 oligo (IDT) then heated to 98 °C for 2 min and cooled on ice. 50 U of Klenow exo- (Enzymatics) was added and the mixture was incubated on a thermocycler at 37 °C for 90 min after slowly ramping from 4 °C. Second strand products were purified using a 0.8:1 ratio of AMPure XP beads and resuspended in 50 µl of PCR master mix (1x KAPA HiFi Readymix, 0.2 µM PE1.0 primer, 0.2 µM iTAG index primer) and amplified with 12 cycles. The final libraries were purified using a 0.8:1 volumetric ratio of AMPure XP beads before pooling and sequencing. All libraries were prepared in parallel with the pre-PCR purification steps carried out using a Bravo Workstation pipetting robot (Agilent Technologies).
RNA isolation, cDNA synthesis and RT-qPCR
Total RNA from ESC, EpiLC, and EpiSC was extracted following the protocol of the innuPREP DNA/RNA mini kit (Analytik Jena), while for the RNA extraction from d2 EB, d4 EB and sorted PGCLC the ReliaPrep™ RNA Miniprep Systems (Promega) was used. cDNAs were generated using the ProtoScript II First Strand cDNA Synthesis Kit and Oligo(dT) primers (New England Biolabs). RT-qPCR were performed on the Light Cycler 480II (Roche) with the primers listed in Supplementary Data 6 using Eef1a1 and Hprt as housekeeping controls.
RNA-seq and scRNA-seq
Total RNA from WT and dCD EpiLC (2 replicates) and EpiSC (2 replicates) were purified as described above. Bulk RNA-seq were generated following the protocol of the TruSeq stranded kit (Illumina).
For scRNA-seq, EpiSC were dissociated with Accutase and all other cell types with TripleExpress. Cells were then centrifuged and resuspended in PBS containing 0.04% BSA. Next, cells were passed through a strainer, the cell concentration was determined and the scRNA-seq libraries were prepared using the Chromium Single-Cell Gene Expression (10× Genomics) according to the Single-Cell 3′ Reagents Kit (v2) protocol for the single-cell experiment of the PGCLC differentiation (Fig. 1) and the Single-Cell 3′ Reagents Kit (v3) protocol for the d4 EB from R1 WT and dCD cells (Fig. 6).
Western Blot
Nuclei were isolated by incubating cells with lysis buffer (20 mM Tris pH 7.6, 100 mM NaCl, 300 mM sucrose, 3 mM MgCl2) containing freshly added protease inhibitors for 10 min at 4 °C and then centrifuged for 10 min at 4 °C and 1680 g. The resulting pellets, containing the cell nuclei, were treated with a high salt buffer (20 mM Tris pH 8.0, 400 mM NaCl, 2 mM EDTA pH 8.0) and disrupted with a glass homogenizer on ice. After incubation on ice for 30 min and centrifugation (24000 × g for 20 min at 4 °C), supernatants were collected and protein concentration was estimated by a BCA-Assay. 20 µg of the resulting protein extracts were heated in Laemmli buffer at 95 °C for 5 min, loaded on 4–15% Mini-PROTEAN® TGX™ Precast Protein Gels (Bio-Rad) and transferred (190 mM glycine, 25 mM Tris, 20% Methanol, 0.1% SDS) to a PVDF membrane. After blocking with 5% milk, the primary antibody (Supplementary Data 6) was incubated overnight at 4 °C, and the secondary antibody (coupled to horseradish peroxidase (HRP)) for 1 h at RT with washes in between. Finally, proteins were visualized using the lumi-light plus western blotting substrate (Roche). Full scan images are provided in the Source Data file.
scRNA-seq data processing
The 10x Genomics scRNA-seq data generated in this study across different stages of PGCLC differentiation can be explored by opening the.cloupe file available through GEO [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE155088] with the Loupe Cell Browser (https://www.10xgenomics.com). UMIs were counted using NCBI:GCA_000001635.6 and cellranger-2.1.091. The resulting UMI values were aggregated into a single matrix with default normalization (“–normalize=mapped”) (Supplementary Data 1).
For the scRNA-seq data from E4.5–E6.5 mouse embryos43 the count matrix (GSE121650) was normalized to FPKM (edgeR, Ensembl gene annotation, v87). Then, the previously generated lineage assignments43 were used to solely select epiblast cells for further analysis.
scRNA-seq data analysis
The code used to define PGCLC genes is available through Github (https://github.com/ToreBle/Germline_competence). Briefly, monocle292 was used to evaluate the in vitro scRNA-seq data generated across the different PGCLC differentiation stages. Therefore, k-means clustering was performed on the t-SNE plots (with k = 3 for d2 EB and k = 4 for d4 EB). From the resulting clusters, those containing PGCLC were identified by the enrichment of previously defined core PGC genes from d4/d6 PGCLC and E9.5 PGCs93. To determine the cellular identity of the remaining clusters found within the EB, the expression of lineage-specific markers identified in E8.25 mouse embryos42 was used. To this end, all markers with a log2FoldChange >2.5 were considered. Each EB cluster was annotated as equivalent to the mouse embryonic tissue for which we observed the most significant enrichment in the expression of the corresponding marker genes.
The ESC, EpiLC, EpiSC, and PGCLC gene sets were defined by differential expression using Seurat94 and the negbinom option for differential expression testing. ESC, EpiLC, and EpiSC genes were determined by differential expression between the stage indicated vs the remaining two (e.g. ESC vs. d2 EpiLC and EpiSC) From this analysis only the genes upregulated in the particular stage (adjusted p-value < 0.005), a high expression in the stage of interest (more than 20%, pct.1 >0.2) and a low expression distribution in the others (less than 40%, pct.2 <0.4) were considered. Similar PGCLC genes were determined by differential expression between the d2 + d4 PGCLC clusters and the remaining clusters from d2 and d4 EB. Again, from this analysis only the genes upregulated in PGCLC (adjusted p-value < 0.005) and with high expression in PGCLC (expressed in 20% of the PGCLC) and a low expression distribution in the other analyzed cells (expressed in less than 40% of non-PGCLC EB cells) were considered. In the case of PGCLC genes this resulted in 389 PGCLC genes (Supplementary Data 2). To quantify the expression of PGCLC genes in different analyses, the UMI count matrix (of the in vitro stages) or FPKM-normalized data (of the in vivo stages) were used to calculate the mean expression of all PGCLC genes within a cell or the mean expression of each PGCLC gene across all cells of a stage.
For the RNA velocity analysis (Supplementary Fig. 1f), spliced and unspliced read counts were obtained with kallisto95 and bustools96, parsed into R-3.6.1 to create a Seurat object and t-SNE plot which was then overlaid by the RNA velocity calculations from velocyto.R97.
The estimation of transcriptional noise for the epiblast stages in vivo and in vitro was performed like in44. First, the 500 most variable genes for each stage were selected and pairwise compared by Spearman correlations. Then the Spearman correlation values were transformed into distance measurements () that were considered to represent transcriptional noise.
For the single-cell RNA-seq data set of the d4 EB from R1 WT and dCD cells, single cells with a high percentage of mitochondrial gene expression were discarded, by considering only single cells with at least 2000 expressed genes. The clusters (Fig. 6b) were identified in the combined dataset of R1 WT and dCD cells by shared nearest neighbor (SNN) modularity optimization-based clustering algorithm from Seurat with a resolution of 0.3. This resulted in 9 clusters that were classified based on the marker gene expression of E8.25 mouse embryos42. Those clusters that did not show any specific marker gene expression from this cluster (e.g. 2-cell-like) were identified by differentially expressed genes within the cluster. The Prdm1orDppa3 + /Klf4- cells were defined using the following gene expression values; Dppa3 >0.1 OR Prdm1 >1 AND Klf4 <1. The single-cell expression correlation values were determined by the Spearman correlation of the top 1000 most variable genes in each cluster and the correlation of their expression among the cells belonging to the same cluster.
ChIP-seq data processing
For ChIP-seq data processing, single-end reads were mapped to the mouse genome (mm10) using BWA98. After duplication removal with the MarkDuplicates function from the Picard tools (http://broadinstitute.github.io/picard/), reads within blacklisted regions (https://www.encodeproject.org/annotations/ENCSR636HFF/) were discarded and the aligned reads were normalized with deeptools-3.3.199 to 1x sequencing depth (as RPGC: Reads per genomic content). An overview of all the ChIP-seq experiments performed in this study is listed in Supplementary Data 6 and, briefly, it includes the following datasets:
H3K4me1 ChIP-seq experiments in ESC (n = 2), EpiLC (n = 4) and EpiSC (n = 2) performed in R1 and E14Tg2a cell lines.
H3K4me2 ChIP-seq experiments in ESC (n = 2), EpiLC (n = 3) and EpiSC (n = 3) performed in R1 and E14Tg2a cell lines.
H3K4me3 ChIP-seq experiments in ESC (n = 2), EpiLC (n = 2) and EpiSC (n = 2) performed in R1 and E14Tg2a cell lines.
H3K27ac ChIP-seq experiments in ESC (n = 2), EpiLC (n = 4) and EpiSC (n = 2) performed in R1 and E14Tg2a cell lines.
NANOG-HA ChIP-seq experiments in EpiSC were performed as two biological replicates in E14Tg2a.
Additional ChIP-seq experiments were performed as single replicates (Supplementary Data 6).
All the processed ChIP-seq data can be explored using the following UCSC browser sessions:
- Comparison of the pluripotent stages (related to Figs. 2, 3): http://genome-euro.ucsc.edu/s/Tore/Comparison%20of%20the%20pluripotent%20stages
- PRDM14/NANOG overexpression (related to Fig. 4): http://genome-euro.ucsc.edu/cgi-bin/hgTracks?hgS_doOtherUser=submit&hgS_otherUserName=Tore&hgS_otherUserSessionName=PRDM14%2FNANOG%20overexpression
- H3K4me1/2 deficient cells and Otx2-/- cells (related to Figs. 4, 5): http://genome-euro.ucsc.edu/cgi-bin/hgTracks?hgS_doOtherUser=submit&hgS_otherUserName=Tore&hgS_otherUserSessionName=H3K4me1%2F2%20deficiency%20and%20Otx2%2D%2F%2D.
ATAC-seq data processing
ATAC-seq experiments were performed as two biological replicates for each of the investigated cell types (i.e. ESC, EpiLC, EpiSC). For ATAC-seq data processing, paired-end reads were mapped to the mouse genome (mm10) using BWA98. Read duplicates and reads within blacklisted regions were discarded. Given the concordance of the ATAC-seq replicates (Spearman correlation of a 2 kb window for ESC: 0.86; EpiLC: 0.88; EpiSC: 0.86), BAM files for each stage were merged and converted into bigWig files by normalization to 1× sequencing depth with deeptools-3.3.199. The processed ATAC-seq data can be also found in the UCSC browser sessions mentioned above.
Enhancer definition
The code used to define enhancers is available through Github (https://github.com/ToreBle/Germline_competence).
PGCLC enhancers: The H3K27ac ChIP-seq data (SOLiD sequencing) from d2 and d6 sorted PGCLC15 were aligned with the default settings of novoalignCS (V1.06.09, Novocraft Technologies). For visualization, replicate BAM files for each stage were merged and normalized to 1 x sequencing depth. Briefly, H3K27ac peaks were called from both replicates of d6 PGCLC with MACS2100 using broad settings (--broad -m 5 50 --fix-bimodal --extsize 200) and q-values <1 × 10−3. Next, all peaks within a distance of 1 kb from both d6 PGCLC replicates were merged with bedtools (https://github.com/arq5x/bedtools2). The resulting regions were subtracted from blacklisted and promoter regions (± 2 kb of the Ensembl gene annotation, v86) with bedtools (Supplementary Fig. 2a). PGCLC enhancers were defined as the subset of the distal H3K27ac peaks identified in d6 PGCLC that could be physically linked to PGCLC genes according to capture HiC data generated in ESC by47 (Supplementary Data 1). Thereby, distal d6 PGCLC H3K27ac peaks and PGCLC genes were linked if the two anchors of a Capture HiC interaction occurred within 1 Kb of a d6 PGCLC H3K27ac peak and 3 Kb of the PGCLC gene TSS.
For the statistical comparisons of epigenetic signals within PGCLC enhancers in different cell types, we first determined the average signals of the ChIP-seq, ATAC-seq, or genome-wide CpG bisulfite sequencing within ± 1 kb of the PGCLC enhancers in d2 EpiLC and EpiSC using deeptools-3.3.199. Then, the effect size of paired Wilcoxon tests were calculated by dividing the z-statistics by the square roots of the sample sizes using rstatix and boots.ci for the approximation of the confidence intervals (confidence level: 0.95).
EpiLC/EpiSC enhancers: Briefly, H3K27ac peaks were called in both cell types separately with MACS2100 using broad settings (--broad -m 5 50 --fix-bimodal --extsize 200) and q-values < 1 × 10−3. The resulting regions were subtracted from blacklisted and promoter regions (± 2 kb of the Ensembl gene annotation, v86) using bedtools. EpiLC and EpiSC enhancers were obtained from these regions by selecting only H3K27ac peaks located within 500 kb of the gene sets defined by single-cell RNA-seq for each cell type. The enhancer assignment to the proximal genes was done with GREAT-4.0.4101 and only enhancers with a minimum distance of 3.5 kb to the nearest transcription start site (TSS) were considered.
CpG methylation analysis
The analysis of the local bisulfite sequencing experiments performed for selected PGCLC enhancers was evaluated with BISMA102.
Genome-wide DNA methylation data was analyzed with Bismark103. However, as the considered data sets were prepared with slightly different protocols, the preprocessing steps were adjusted accordingly: For the whole-genome bisulfite sequencing data from 2i ESC (GSE41923), d2 EpiLC, and EpiSC (GSE70355), the adapter trimming was performed with Trim Galore (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) using the default settings; for data sets generated by post bisulfite adapter tagging (pbat), either 9 (data from d2 EpiLC and PGCLC (DRA003471)) or 6 bp (genome-wide methylation data generated in this study) were removed. The DNA methylation data from E4.5–E6.5 epiblasts were generated by STEM-seq (GSE76505), and, in this case, adapter sequences were removed with cutadapt104 and Trim Galore, respectively. For all the previous samples, reads were mapped with Bismark-v0.16.1103 and bowtie2-2.2.9105, using the -pbat setting for the STEM-seq and pbat samples. For paired-end pbat samples the unmapped reads were remapped as single-end reads. Then, for each cell type all available datasets were combined to estimate the CpG methylation levels with the Bismark methylation extractor. Finally, only CpGs with a coverage of 3−100 reads were considered.
CpG methylation heterogeneity and scNMT-seq analysis
With the scNMT-seq method the transcriptome, methylome (CpG methylation), and chromatin accessibility (GpC methylation) are recorded from the same single-cell106. Among all the single cells analyzed in E4.5, E5.5, and E6.5 mouse embryos by43, we only considered those assigned to the epiblast by scRNA-seq (https://github.com/rargelaguet/scnmt_gastrulation). Then, epiblast cells were additionally filtered according to their methylome (GSE121690) and only cells with a CpG coverage >102 at PGCLC enhancers were considered. This resulted in 258 epiblast cells from E4.5, E5.5, and E6.5 with high-quality single-cell CpG methylation data.
The single-cell CpG methylation for each cell was stored in a bedGraph file. From each bedGraph file the genome-wide CpG methylation levels of individual cells were determined as the mean methylation of all covered CpGs in each single-cell. For the analysis of mCpG levels and CpG coverage within PGCLC enhancer Group I, the genome-wide bedGraph files were subtracted to obtain bedGraph files with the PGCLC enhancer regions (Supplementary Data 3) using bedtools (https://github.com/arq5x/bedtools2) from which the mean methylation of all covered CpGs was determined in each single-cell.
CpG methylation heterogeneity was estimated with the PDclust package70. The number of CpGs covered in each pair of cells resulted in approximately 150 CpGs for each pairwise comparison when PGCLC enhancers were considered. Then, the average of the absolute difference in the methylation values for all the CpGs covered for each pairwise comparison were computed as a dissimilarity matrix.
For the scRNA-seq data, the mean expression of all PGCLC genes (FPKM-normalized) linked to Group I PGCLC enhancers was determined per cell.
Bulk RNA-sequencing
Public available RNA-seq data generated across different time-points of EpiLC differentiation107, Otx2-/- EpiLC57, and Prdm14-/- EpiLC14 were mapped with STAR108 to the mouse reference genome (Ensembl gene annotation, v99), and reads within genes were counted with featureCounts109. The rlog normalization was generated with DESeq2110.
For the bulk RNA-seq data generated in R1 WT and dCD cells, we analyzed three replicates for R1 WT and the dCD ESC previously generated by Dorighi et al., 2017, two replicates for R1 WT and dCD d2 EpiLC, and two replicates for R1 WT and dCD EpiSC. The transcript abundance was determined with the bootstrap mode of kallisto (-b 100)95 and the differential expression analyses were performed with sleuth on the gene-level (Ensembl gene annotation, v96)95,111.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank the Rada-Iglesias lab members for insightful comments and critical reading of the manuscript, Antonio Simeone, and Christa Bücker for generously providing the Otx2-/- cell line. Library preparations and next-generation sequencing experiments were performed in the NGS Core Facility of the Cologne Center for Genomics (CCG). Computational analyses were performed on the Cologne High Efficient Operating Platform for Science (CHEOPS). Cell sorting and Flow cytometry experiments were performed in the FACS Facility of the Center for Molecular Medicine (CMMC) and the FACS & Imaging Core Facility at the Max Planck Institute for Biology of Ageing. Tore Bleckwehl was supported by a doctoral fellowship from the Studienstiftung des deutschen Volkes (Germany). Giuliano Crispatzu is supported by funding within the CRU329 (DFG 386793560). Kaitlin Schaaf was supported by a Research Internships in Science and Engineering (RISE) Scholarship of the Deutscher Akademischer Austauschdienst (DAAD). Work in the Rada-Iglesias laboratory was supported by CMMC intramural funding (Germany), the German Research Foundation (DFG) (Research Grant RA 2547/2-1), “Programa STAR-Santander Universidades, Campus Cantabria Internacional de la convocatoria CEI 2015 de Campus de Excelencia Internacional” (Spain), the Spanish Ministry of Science, Innovation, and Universities (Research Grant PGC2018-095301-B-I00) and the European Research Council (ERC CoG “PoisedLogic”; 862022). AB and MM were supported by the Spanish Ministry of Science, Innovation, and Universities (BFU2017-84914-P), and the CNIC is supported by the Instituto de Salud Carlos III (ISCIII), the Spanish Ministry of Science, Innovation, and Universities, and the Pro CNIC Foundation, and is a Severo Ochoa Center of Excellence (SEV-2015-0505).
Source data
Author contributions
Conceptualization, T.B., A.R.-I.; Experimental investigation, T.B., K.S., G.C., P.R., M.B., S.J.C, L.B., A.B., M.L., K.D; Data analysis: T.B., G.C.; Writing, Review & Editing, T.B., A.R.-I.; Resources, W.F.J. vI, M.M., J.W., W.R., A.R.-I.; Supervision and Funding Acquisition, A.R.-I.;
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
The data that support this study are available from the corresponding authors upon reasonable request. The NGS data generated in this study have been deposited in the GEO database under accession code GSE155089. The enhancer sets of this study are provided in Supplementary Data 3. Public data sets analyzed in this study include single-cell RNA-seq in E8.5 mouse embryo: “E-MTAB-6153”; bulk RNA sequencing: “EpiLC-GSE117896”, “Otx2-/--GSE56138”, “Prdm14-/--DRA003471”, and “Mll3/4dCD-GSE98063”; WGBS: “GSE70355”, WGBS in WT and dCD ESC: “GSE118314”; H3K27ac in PGCLC: “GSE60204”; Mll3/4 ChIP-seq in ESC: “GSE98063”; DNAse I in PGC: “GSE109767”; STEM-seq in E4.5–E6.5: “GSE76505”; single-cell RNA-seq and single-cell DNA methylation: “GSE121708”. Source data are provided with this paper.
Code availability
Tore Bleckwehl (2021). ToreBle/Germline_competence: Bleckwehl et al., 2021 (Version 1.0.0) [Computer software]. Zenodo. 10.5281/zenodo.5295059.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review informationNature Communications thanks Katsuhiko Hayashi and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Tore Bleckwehl, Email: tbleckwe@uni-koeln.de.
Álvaro Rada-Iglesias, Email: alvaro.rada@unican.es.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-26065-6.
References
- 1.Waddington, C. H. Organisers and Genes by C. H. Waddington. (1940).
- 2.Solini GE, Dong C, Saha M. Embryonic transplantation experiments: past, present, and future. Trends Dev. Biol. 2017;10:13–30. [PMC free article] [PubMed] [Google Scholar]
- 3.Ohinata Y, et al. Blimp1 is a critical determinant of the germ cell lineage in mice. Nature. 2005;436:207–213. doi: 10.1038/nature03813. [DOI] [PubMed] [Google Scholar]
- 4.Ohinata Y, et al. A signaling principle for the specification of the germ cell lineage in mice. Cell. 2009;137:571–584. doi: 10.1016/j.cell.2009.03.014. [DOI] [PubMed] [Google Scholar]
- 5.Kalkan T, et al. Tracking the embryonic stem cell transition from ground state pluripotency. Development. 2017;144:1221–1234. doi: 10.1242/dev.142711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mulas C, Kalkan T, Smith A. NODAL secures pluripotency upon embryonic stem cell progression from the ground state. Stem Cell Rep. 2017;9:77–91. doi: 10.1016/j.stemcr.2017.05.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Smith A. Formative pluripotency: the executive phase in a developmental continuum. Development. 2017;144:365–373. doi: 10.1242/dev.142679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bleckwehl T, Rada-Iglesias A. Transcriptional and epigenetic control of germline competence and specification. Curr. Opin. Cell Biol. 2019;61:1–8. doi: 10.1016/j.ceb.2019.05.006. [DOI] [PubMed] [Google Scholar]
- 9.Respuela P, et al. Foxd3 promotes exit from naive pluripotency through enhancer decommissioning and inhibits germline specification. Cell Stem Cell. 2016;18:118–133. doi: 10.1016/j.stem.2015.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hayashi K, Ohta H, Kurimoto K, Aramaki S, Saitou M. Reconstitution of the mouse germ cell specification pathway in culture by pluripotent stem cells. Cell. 2011;146:519–532. doi: 10.1016/j.cell.2011.06.052. [DOI] [PubMed] [Google Scholar]
- 11.Hackett JA, et al. Tracing the transitions from pluripotency to germ cell fate with CRISPR screening. Nat. Commun. 2018;9:4292. doi: 10.1038/s41467-018-06230-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Murakami K, et al. NANOG alone induces germ cells in primed epiblast in vitro by activation of enhancers. Nature. 2016;529:403–407. doi: 10.1038/nature16480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mitani T, et al. Principles for the regulation of multiple developmental pathways by a versatile transcriptional factor, BLIMP1. Nucleic Acids Res. 2017;45:12152–12169. doi: 10.1093/nar/gkx798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Shirane K, et al. Global landscape and regulatory principles of DNA methylation reprogramming for germ cell specification by mouse pluripotent stem cells. Dev. Cell. 2016;39:87–103. doi: 10.1016/j.devcel.2016.08.008. [DOI] [PubMed] [Google Scholar]
- 15.Kurimoto K, et al. Quantitative Dynamics of chromatin remodeling during germ cell specification from mouse embryonic stem cells. Cell Stem Cell. 2015;16:517–532. doi: 10.1016/j.stem.2015.03.002. [DOI] [PubMed] [Google Scholar]
- 16.von Meyenn F, et al. Comparative principles of DNA methylation reprogramming during human and mouse in vitro primordial germ cell specification. Dev. Cell. 2016;39:104–115. doi: 10.1016/j.devcel.2016.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Saitou M, Kagiwada S, Kurimoto K. Epigenetic reprogramming in mouse pre-implantation development and primordial germ cells. Development. 2012;139:15–31. doi: 10.1242/dev.050849. [DOI] [PubMed] [Google Scholar]
- 18.Sybirna A, Wong FCK, Surani MA. Genetic basis for primordial germ cells specification in mouse and human: Conserved and divergent roles of PRDM and SOX transcription factors. Curr. Top. Dev. Biol. 2019;135:35–89. doi: 10.1016/bs.ctdb.2019.04.004. [DOI] [PubMed] [Google Scholar]
- 19.Kalkan T, et al. Complementary activity of ETV5, RBPJ, and TCF3 drives formative transition from naive pluripotency. Cell Stem Cell. 2019;24:785–801.e7. doi: 10.1016/j.stem.2019.03.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhang J, et al. OTX2 restricts entry to the mouse germline. Nature. 2018;562:595–599. doi: 10.1038/s41586-018-0581-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Calo E, Wysocka J. Modification of enhancer chromatin: what, how, and why? Mol. Cell. 2013;49:825–837. doi: 10.1016/j.molcel.2013.01.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Creyghton MP, et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl Acad. Sci. USA. 2010;107:21931–21936. doi: 10.1073/pnas.1016071107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lara-Astiaso D, et al. Immunogenetics. Chromatin state dynamics during blood formation. Science. 2014;345:943–949. doi: 10.1126/science.1256271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lee K, et al. FOXA2 is required for enhancer priming during pancreatic differentiation. Cell Rep. 2019;28:382–393.e7. doi: 10.1016/j.celrep.2019.06.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wang A, et al. Epigenetic priming of enhancers predicts developmental competence of hESC-derived endodermal lineage intermediates. Cell Stem Cell. 2015;16:386–399. doi: 10.1016/j.stem.2015.02.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lai B, et al. MLL3/MLL4 are required for CBP/p300 binding on enhancers and super-enhancer formation in brown adipogenesis. Nucleic Acids Res. 2017;45:6388–6403. doi: 10.1093/nar/gkx234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ostuni R, et al. Latent enhancers activated by stimulation in differentiated cells. Cell. 2013;152:157–171. doi: 10.1016/j.cell.2012.12.018. [DOI] [PubMed] [Google Scholar]
- 28.Sze, C. C. & Shilatifard, A. MLL3/MLL4/COMPASS family on epigenetic regulation of enhancer function and cancer. Cold Spring Harb. Perspect. Med. 6, a026427 (2016). [DOI] [PMC free article] [PubMed]
- 29.Bochyńska, A., Lüscher-Firzlaff, J. & Lüscher, B. Modes of interaction of KMT2 histone H3 Lysine 4 Methyltransferase/COMPASS complexes with chromatin. Cells7, 17 (2018). [DOI] [PMC free article] [PubMed]
- 30.Lin-Shiao E, et al. KMT2D regulates p63 target enhancers to coordinate epithelial homeostasis. Genes Dev. 2018;32:181–193. doi: 10.1101/gad.306241.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wang C, et al. Enhancer priming by H3K4 methyltransferase MLL4 controls cell fate transition. Proc. Natl Acad. Sci. USA. 2016;113:11871–11876. doi: 10.1073/pnas.1606857113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yan J, et al. Histone H3 lysine 4 monomethylation modulates long-range chromatin interactions at enhancers. Cell Res. 2018;28:387. doi: 10.1038/cr.2018.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Jang Y, et al. H3.3K4M destabilizes enhancer H3K4 methyltransferases MLL3/MLL4 and impairs adipose tissue development. Nucleic Acids Res. 2019;47:607–620. doi: 10.1093/nar/gky982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Placek K, et al. MLL4 prepares the enhancer landscape for Foxp3 induction via chromatin looping. Nat. Immunol. 2017;18:1035–1045. doi: 10.1038/ni.3812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ang S-Y, et al. KMT2D regulates specific programs in heart development via histone H3 lysine 4 di-methylation. Development. 2016;143:810–821. doi: 10.1242/dev.132688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Ortega-Molina A, et al. The histone lysine methyltransferase KMT2D sustains a gene expression program that represses B cell lymphoma development. Nat. Med. 2015;21:1199–1208. doi: 10.1038/nm.3943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhang J, et al. Disruption of KMT2D perturbs germinal center B cell development and promotes lymphomagenesis. Nat. Med. 2015;21:1190–1198. doi: 10.1038/nm.3940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dorighi KM, et al. Mll3 and Mll4 facilitate enhancer RNA synthesis and transcription from promoters independently of H3K4 monomethylation. Mol. Cell. 2017;66:568–576.e4. doi: 10.1016/j.molcel.2017.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rickels R, et al. Histone H3K4 monomethylation catalyzed by Trr and mammalian COMPASS-like proteins at enhancers is dispensable for development and viability. Nat. Genet. 2017;49:1647–1653. doi: 10.1038/ng.3965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Local A, et al. Identification of H3K4me1-associated proteins at mammalian enhancers. Nat. Genet. 2018;50:73–82. doi: 10.1038/s41588-017-0015-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Rada-Iglesias A. Is H3K4me1 at enhancers correlative or causative? Nat. Genet. 2018;50:4–5. doi: 10.1038/s41588-017-0018-3. [DOI] [PubMed] [Google Scholar]
- 42.Ibarra-Soria X, et al. Defining murine organogenesis at single-cell resolution reveals a role for the leukotriene pathway in regulating blood progenitor formation. Nat. Cell Biol. 2018;20:127–134. doi: 10.1038/s41556-017-0013-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Argelaguet, R. et al. Multi-omics profiling of mouse gastrulation at single-cell resolution. Nature (2019) 10.1038/s41586-019-1825-8. [DOI] [PMC free article] [PubMed]
- 44.Mohammed H, et al. Single-cell landscape of transcriptional heterogeneity and cell fate decisions during mouse early gastrulation. Cell Rep. 2017;20:1215–1228. doi: 10.1016/j.celrep.2017.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zylicz, J. J. et al. Chromatin dynamics and the role of G9a in gene regulation and enhancer silencing during early mouse development. Elife4, e09571 (2015). [DOI] [PMC free article] [PubMed]
- 46.Tischler, J. et al. Metabolic regulation of pluripotency and germ cell fate through α‐ketoglutarate. The EMBO Journal vol. 38, e99518 (2019). [DOI] [PMC free article] [PubMed]
- 47.Atlasi Y, et al. Epigenetic modulation of a hardwired 3D chromatin landscape in two naive states of pluripotency. Nat. Cell Biol. 2019;21:568–578. doi: 10.1038/s41556-019-0310-9. [DOI] [PubMed] [Google Scholar]
- 48.Osterwalder M, et al. Enhancer redundancy provides phenotypic robustness in mammalian development. Nature. 2018;554:239–243. doi: 10.1038/nature25461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hnisz D, et al. Convergence of developmental and oncogenic signaling pathways at transcriptional super-enhancers. Mol. Cell. 2015;58:362–370. doi: 10.1016/j.molcel.2015.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Yamaji M, et al. Critical function of Prdm14 for the establishment of the germ cell lineage in mice. Nat. Genet. 2008;40:1016–1022. doi: 10.1038/ng.186. [DOI] [PubMed] [Google Scholar]
- 51.Yamaji M, et al. PRDM14 ensures naive pluripotency through dual regulation of signaling and epigenetic pathways in mouse embryonic stem cells. Cell Stem Cell. 2013;12:368–382. doi: 10.1016/j.stem.2012.12.012. [DOI] [PubMed] [Google Scholar]
- 52.Sim Y-J, et al. 2i maintains a naive ground state in ESCs through two distinct epigenetic mechanisms. Stem Cell Rep. 2017;8:1312–1328. doi: 10.1016/j.stemcr.2017.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Gretarsson KH, Hackett JA. Dppa2 and Dppa4 counteract de novo methylation to establish a permissive epigenome for development. Nat. Struct. Mol. Biol. 2020;27:706–716. doi: 10.1038/s41594-020-0445-1. [DOI] [PubMed] [Google Scholar]
- 54.Heintzman ND, et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat. Genet. 2007;39:311–318. doi: 10.1038/ng1966. [DOI] [PubMed] [Google Scholar]
- 55.Rose NR, Klose RJ. Understanding the relationship between DNA methylation and histone lysine methylation. Biochim. Biophys. Acta. 2014;1839:1362–1372. doi: 10.1016/j.bbagrm.2014.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Ooi SKT, et al. DNMT3L connects unmethylated lysine 4 of histone H3 to de novo methylation of DNA. Nature. 2007;448:714–717. doi: 10.1038/nature05987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Buecker C, et al. Reorganization of enhancer patterns in transition from naive to primed pluripotency. Cell Stem Cell. 2014;14:838–853. doi: 10.1016/j.stem.2014.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Magnúsdóttir E, et al. A tripartite transcription factor network regulates primordial germ cell specification in mice. Nat. Cell Biol. 2013;15:905–915. doi: 10.1038/ncb2798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Hughes AL, Kelley JR, Klose RJ. Understanding the interplay between CpG island-associated gene promoters and H3K4 methylation. Biochim. Biophys. Acta Gene Regul. Mech. 2020;1863:194567. doi: 10.1016/j.bbagrm.2020.194567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Guo X, et al. Structural insight into autoinhibition and histone H3-induced activation of DNMT3A. Nature. 2015;517:640–644. doi: 10.1038/nature13899. [DOI] [PubMed] [Google Scholar]
- 61.Zhang Y, et al. Chromatin methylation activity of Dnmt3a and Dnmt3a/3L is guided by interaction of the ADD domain with the histone H3 tail. Nucleic Acids Res. 2010;38:4246–4253. doi: 10.1093/nar/gkq147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Li B-Z, et al. Histone tails regulate DNA methylation by allosterically activating de novo methyltransferase. Cell Res. 2011;21:1172–1181. doi: 10.1038/cr.2011.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Nady N, et al. Recognition of multivalent histone states associated with heterochromatin by UHRF1 protein. J. Biol. Chem. 2011;286:24300–24311. doi: 10.1074/jbc.M111.234104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Skvortsova K, et al. Retention of paternal DNA methylome in the developing zebrafish germline. Nat. Commun. 2019;10:3054. doi: 10.1038/s41467-019-10895-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Hu D, et al. The MLL3/MLL4 branches of the COMPASS family function as major histone H3K4 monomethylases at enhancers. Mol. Cell. Biol. 2013;33:4745–4754. doi: 10.1128/MCB.01181-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Xie, G. et al. MLL3/MLL4 methyltransferase activities regulate embryonic stem cell differentiation independent of enhancer H3K4me1. 10.1101/2020.09.14.296558.
- 67.Li J, et al. Accurate annotation of accessible chromatin in mouse and human primordial germ cells. Cell Res. 2018;28:1077–1089. doi: 10.1038/s41422-018-0096-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Zhang Y, et al. Dynamic epigenomic landscapes during early lineage specification in mouse embryos. Nat. Genet. 2018;50:96–105. doi: 10.1038/s41588-017-0003-x. [DOI] [PubMed] [Google Scholar]
- 69.Rulands S, et al. Genome-scale oscillations in DNA methylation during exit from pluripotency. Cell Syst. 2018;7:63–76.e12. doi: 10.1016/j.cels.2018.06.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Hui T, et al. High-resolution single-Cell DNA methylation measurements reveal epigenetically distinct hematopoietic stem cell subpopulations. Stem Cell Rep. 2018;11:578–592. doi: 10.1016/j.stemcr.2018.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Aubert Y, Egolf S, Capell BC. The unexpected noncatalytic roles of histone modifiers in development and disease. Trends Genet. 2019;35:645–657. doi: 10.1016/j.tig.2019.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.AlAbdi L, et al. Oct4-mediated inhibition of Lsd1 activity promotes the active and primed state of pluripotency enhancers. Cell Rep. 2020;30:1478–1490.e6. doi: 10.1016/j.celrep.2019.11.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Maltby VE, et al. Histone H3K4 demethylation is negatively regulated by histone H3 acetylation in Saccharomyces cerevisiae. Proc. Natl Acad. Sci. USA. 2012;109:18505–18510. doi: 10.1073/pnas.1202070109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Deshmukh S, Ponnaluri VC, Dai N, Pradhan S, Deobagkar D. Levels of DNA cytosine methylation in the genome. PeerJ. 2018;6:e5119. doi: 10.7717/peerj.5119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Barnett KR, et al. ATAC-Me captures prolonged dna methylation of dynamic chromatin accessibility loci during cell fate transitions. Mol. Cell. 2020;77:1350–1364.e6. doi: 10.1016/j.molcel.2020.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Sapozhnikov, D. M. & Szyf, M. Unraveling the functional role of DNA methylation using targeted DNA demethylation by steric blockage of DNA methyltransferase with CRISPR/dCas9. 10.1101/2020.03.28.012518. [DOI] [PMC free article] [PubMed]
- 77.Ford, E. et al. Frequent lack of repressive capacity of promoter DNA methylation identified through genome-wide epigenomic manipulation. 10.1101/170506.
- 78.Cruz-Molina S, et al. PRC2 facilitates the regulatory topology required for poised enhancer function during pluripotent stem cell differentiation. Cell Stem Cell. 2017;20:689–705.e9. doi: 10.1016/j.stem.2017.02.004. [DOI] [PubMed] [Google Scholar]
- 79.Rauch A, et al. Author correction: osteogenesis depends on commissioning of a network of stem cell transcription factors that act as repressors of adipogenesis. Nat. Genet. 2019;51:766. doi: 10.1038/s41588-019-0400-4. [DOI] [PubMed] [Google Scholar]
- 80.Kaufman CK, et al. A zebrafish melanoma model reveals emergence of neural crest identity during melanoma initiation. Science. 2016;351:aad2197. doi: 10.1126/science.aad2197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Pomerantz MM, et al. Prostate cancer reactivates developmental epigenomic programs during metastatic progression. Nat. Genet. 2020;52:790–799. doi: 10.1038/s41588-020-0664-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Acampora D, Di Giovannantonio LG, Simeone A. Otx2 is an intrinsic determinant of the embryonic stem cell state and is required for transition to a stable epiblast stem cell condition. Development. 2013;140:43–55. doi: 10.1242/dev.085290. [DOI] [PubMed] [Google Scholar]
- 83.Hayashi K, Saitou M. Generation of eggs from mouse embryonic stem cells and induced pluripotent stem cells. Nat. Protoc. 2013;8:1513–1524. doi: 10.1038/nprot.2013.090. [DOI] [PubMed] [Google Scholar]
- 84.Guo G, et al. Klf4 reverts developmentally programmed restriction of ground state pluripotency. Development. 2009;136:1063–1069. doi: 10.1242/dev.030957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Cong L, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Calo E, et al. Tissue-selective effects of nucleolar stress and rDNA damage in developmental disorders. Nature. 2018;554:112–117. doi: 10.1038/nature25449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods. 2013;10:1213–1218. doi: 10.1038/nmeth.2688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Schmidl C, Rendeiro AF, Sheffield NC, Bock C. ChIPmentation: fast, robust, low-input ChIP-seq for histones and transcription factors. Nat. Methods. 2015;12:963–965. doi: 10.1038/nmeth.3542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Thomson JP, et al. CpG islands influence chromatin structure via the CpG-binding protein Cfp1. Nature. 2010;464:1082–1086. doi: 10.1038/nature08924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Clark SJ, et al. Genome-wide base-resolution mapping of DNA methylation in single cells using single-cell bisulfite sequencing (scBS-seq) Nat. Protoc. 2017;12:534–547. doi: 10.1038/nprot.2016.187. [DOI] [PubMed] [Google Scholar]
- 91.Zheng GXY, et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 2017;8:14049. doi: 10.1038/ncomms14049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Trapnell C, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 2014;32:381–386. doi: 10.1038/nbt.2859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Nakaki F, et al. Induction of mouse germ-cell fate by transcription factors in vitro. Nature. 2013;501:222–226. doi: 10.1038/nature12417. [DOI] [PubMed] [Google Scholar]
- 94.Stuart T, et al. Comprehensive Integration of Single- Cell Data. Cell. 2019;177:1888–1902.e21. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 2016;34:525–527. doi: 10.1038/nbt.3519. [DOI] [PubMed] [Google Scholar]
- 96.Melsted, P. et al. Modular and efficient pre-processing of single-cell RNA-seq. 10.1101/673285.
- 97.La Manno G, et al. RNA velocity of single cells. Nature. 2018;560:494–498. doi: 10.1038/s41586-018-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Ramírez F, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.McLean CY, et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 2010;28:495–501. doi: 10.1038/nbt.1630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Rohde C, Zhang Y, Reinhardt R, Jeltsch A. BISMA - Fast and accurate bisulfite sequencing data analysis of individual clones from unique and repetitive sequences. BMC Bioinforma. 2010;11:230. doi: 10.1186/1471-2105-11-230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-seq applications. Bioinformatics. 2011;27:1571–1572. doi: 10.1093/bioinformatics/btr167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
- 105.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Clark SJ, et al. scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells. Nat. Commun. 2018;9:781. doi: 10.1038/s41467-018-03149-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Yang P, et al. Multi-omic profiling reveals dynamics of the phased progression of pluripotency. Cell Syst. 2019;8:427–445.e10. doi: 10.1016/j.cels.2019.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Dobin A, Gingeras TR. Mapping RNA-seq reads with STAR. Curr. Protoc. Bioinforma. 2015;51:11.14.1–11.14.19. doi: 10.1002/0471250953.bi1114s51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- 110.Anders, S. & Huber, W. Differential expression analysis for sequence count data. Nat. Precedingshttps://www.nature.com/articles/npre.2010.4282.2 (2010). [DOI] [PMC free article] [PubMed]
- 111.Yi L, Pimentel H, Bray NL, Pachter L. Gene-level differential analysis at transcript-level resolution. Genome Biol. 2018;19:53. doi: 10.1186/s13059-018-1419-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support this study are available from the corresponding authors upon reasonable request. The NGS data generated in this study have been deposited in the GEO database under accession code GSE155089. The enhancer sets of this study are provided in Supplementary Data 3. Public data sets analyzed in this study include single-cell RNA-seq in E8.5 mouse embryo: “E-MTAB-6153”; bulk RNA sequencing: “EpiLC-GSE117896”, “Otx2-/--GSE56138”, “Prdm14-/--DRA003471”, and “Mll3/4dCD-GSE98063”; WGBS: “GSE70355”, WGBS in WT and dCD ESC: “GSE118314”; H3K27ac in PGCLC: “GSE60204”; Mll3/4 ChIP-seq in ESC: “GSE98063”; DNAse I in PGC: “GSE109767”; STEM-seq in E4.5–E6.5: “GSE76505”; single-cell RNA-seq and single-cell DNA methylation: “GSE121708”. Source data are provided with this paper.
Tore Bleckwehl (2021). ToreBle/Germline_competence: Bleckwehl et al., 2021 (Version 1.0.0) [Computer software]. Zenodo. 10.5281/zenodo.5295059.