Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2016 Sep 27;17(1):179–192. doi: 10.1016/j.celrep.2016.08.087

MERVL/Zscan4 Network Activation Results in Transient Genome-wide DNA Demethylation of mESCs

Mélanie A Eckersley-Maslin 1,, Valentine Svensson 2,3, Christel Krueger 1, Thomas M Stubbs 1, Pascal Giehr 4, Felix Krueger 5, Ricardo J Miragaia 2,3,6, Charalampos Kyriakopoulos 7, Rebecca V Berrens 1, Inês Milagre 1, Jörn Walter 4, Sarah A Teichmann 2,3, Wolf Reik 1,3,8,∗∗
PMCID: PMC5055476  PMID: 27681430

Summary

Mouse embryonic stem cells are dynamic and heterogeneous. For example, rare cells cycle through a state characterized by decondensed chromatin and expression of transcripts, including the Zscan4 cluster and MERVL endogenous retrovirus, which are usually restricted to preimplantation embryos. Here, we further characterize the dynamics and consequences of this transient cell state. Single-cell transcriptomics identified the earliest upregulated transcripts as cells enter the MERVL/Zscan4 state. The MERVL/Zscan4 transcriptional network was also upregulated during induced pluripotent stem cell reprogramming. Genome-wide DNA methylation and chromatin analyses revealed global DNA hypomethylation accompanying increased chromatin accessibility. This transient DNA demethylation was driven by a loss of DNA methyltransferase proteins in the cells and occurred genome-wide. While methylation levels were restored once cells exit this state, genomic imprints remained hypomethylated, demonstrating a potential global and enduring influence of endogenous retroviral activation on the epigenome.

Keywords: Zscan4, MERVL, endogenous retrovirus, embryonic stem cell, DNA methylation, imprint, preimplantation, reprogramming, chromatin

Graphical Abstract

graphic file with name fx1.jpg

Highlights

  • Single-cell transcriptomics reveals dynamics of MERVL/Zscan4 network activation

  • MERVL-LTR transcriptional network is expressed in iPSC reprogramming events

  • Translation block depletes Dnmt proteins, inducing transient global demethylation

  • Passage through the MERVL/Zscan4 state may cause irreversible imprint erasure


Mouse embryonic stem cells sporadically express preimplantation transcripts, including the MERVL endogenous retrovirus and Zscan4 cluster. Eckersley-Maslin et al. investigate the transcriptional dynamics in these cells and reveal transient genome-wide DNA demethylation accompanying chromatin decompaction. Following state exit, methylation levels are restored, except for genomic imprints, which remain lost.

Introduction

It is well known that mouse embryonic stem cells (ESCs) contain extensive epigenetic and transcriptional heterogeneities, yet the mechanistic details of how cells enter and exit these different states and their importance for stem cell potency and maintenance are only beginning to be understood (reviewed in De Los Angeles et al., 2015, Lee et al., 2014, Torres-Padilla and Chambers, 2014). Intriguingly, a rare group of ESCs express early-embryonic transcripts, including the Zscan4 cluster (Zalzman et al., 2010) and MERVL endogenous retrovirus (Macfarlan et al., 2012). However, the precise molecular dynamics of this process and any potential epigenetic consequences of the endogenous retroviral activation remain unknown.

Transposable elements comprise more than 50% of mammalian genomes and fall into two classes based on the presence or absence of a long-terminal repeat (LTR) (Friedli and Trono, 2015). One class of LTR-retrotransposon (also known as endogenous retroviruses [ERVs]) is the murine endogenous retrovirus with leucine tRNA primer, or MERVL. Expression of MERVL is normally restricted to the two-cell mouse preimplantation embryo but also occurs transiently in rare ESCs (Macfarlan et al., 2012). Similarly, the Zscan4 cluster of zinc-finger proteins, also normally restricted to the two-cell embryo (Falco et al., 2007), is expressed in a subset of ESCs (Zscan4+ cells), where it has been proposed to have a role in telomere elongation and genomic stability (Zalzman et al., 2010). While MERVL+ cells express Zscan4 transcripts and vice versa (Akiyama et al., 2015, Ishiuchi et al., 2015, Macfarlan et al., 2012), it is unclear if these markers label a single population or overlapping populations of cells with potentially different functions. Furthermore, the dynamics of transcriptional network activation and any epigenetic consequences in these cells remain unclear.

Globally, MERVL+ cells have increased levels of acetylated histones (Ishiuchi et al., 2015, Macfarlan et al., 2012) and an increase in chromatin mobility (Bošković et al., 2014, Ishiuchi et al., 2015). Similarly, Zscan4+ cells show an increase in H3K27ac, including at repetitive elements such as retrotransposons (Akiyama et al., 2015). Moreover, experimental perturbations that lead to global chromatin changes, including treatment with histone deacetylase (HDAC) inhibitors (Dan et al., 2015, Walter et al., 2016), knockdown of the chromatin assembly factor Caf-1 (Ishiuchi et al., 2015), the chromatin remodeler Chd5 (Hayashi et al., 2016), Hnrnpk (Thompson et al., 2015), or members of repressive chromatin complexes such as Kap-1, Kdm1a, G9a, Hp1, or Rybp (Hisada et al., 2012, Macfarlan et al., 2011, Macfarlan et al., 2012, Maksakova et al., 2013, Rowe et al., 2010), all result in an expansion of MERVL and/or Zscan4 expressing populations (reviewed in Ishiuchi and Torres-Padilla, 2013, Schlesinger and Goff, 2015). Thus, while a link between chromatin decompaction and MERVL/Zscan4 expression is apparent, a comprehensive genome-wide base-resolution analysis of the epigenetic landscape and its dynamics in this transient cell state is lacking.

In this study, we provide a detailed molecular understanding of the transcriptional dynamics and epigenetic characteristics of MERVL+/Zscan4+ cells. Through bulk and single-cell transcriptomics, we observe a coordinated upregulation of a MERVL-LTR-driven transcriptional network that is similarly activated during preimplantation development and induced pluripotent stem cell reprogramming. Interestingly, in MERVL+Zscan4+ cells, inhibition of translation leads to depletion of proteins, including DNA methyltransferases, which in turn results in genome-wide DNA hypomethylation through dilution of methylated cytosines upon DNA replication and cell division. Importantly, while methylation levels are restored following exit from the MERVL+Zscan4+ state, once genomic imprints are lost, they remain demethylated. In this way, ESCs cycling through the MERVL+Zscan4+ state may lead to irretrievable loss of imprints in ESC cultures.

Results

Single-Cell Transcriptomic Analysis of MERVL-LTR-Driven Network Dynamics

The MERVL endogenous retrovirus and other normally developmentally restricted transcripts, including the Zscan4 cluster, are transiently upregulated in a small subset of mouse ESCs (Akiyama et al., 2015, Ishiuchi et al., 2015, Macfarlan et al., 2012, Zalzman et al., 2010), yet the dynamics of their regulation, epigenetic consequences and functional significance in a broader biological context remain poorly understood. In order to explore the molecular regulation of these rare cells, a double reporter ESC line was constructed containing stable transgene integrations of MERVL and Zscan4c promoter-driven tdTomato and EGFP fluorescent reporters, respectively (Figure 1A). These reporters have been previously described to reflect expression patterns of their endogenous counterparts (Macfarlan et al., 2012, Zalzman et al., 2010). While ∼1%–2% of cells are labeled by both reporters (MERVL+Zscan4c+), we observed an additional Zscan4c::EGFP+-only population (Figure 1B). However, total RNA sequencing (RNA-seq) revealed the same set of transcripts was upregulated in both single- and double-positive populations (Figure 1C), indicating the reporters are largely interchangeable and mark the same set of ESCs, with the MERVL::tdTomato reporter showing a more restricted expression pattern.

Figure 1.

Figure 1

Single-Cell Transcriptomic Analysis of MERVL-LTR Driven Network Dynamics

(A) Representative single z-section through a ESC colony containing a cell co-expressing the Zscan4c::EGFP (top left) and MERVL::tdTomato (top right) reporters. DNA is visualized with DAPI (bottom left). Scale bar, 10 μm.

(B) Flow cytometry plot of the MERVL::tdTomato (x axis) Zscan4c::eGFP (y axis) dual-reporter ESC line. Representative gates used to sort the three populations (negative, Zscan4c+, and Zscan4c+MERVL+) are shown.

(C) Heat map showing relative expression levels of 172 differentially expressed genes in negative (left column), Zscan4+ (middle column) and Zscan4+MERVL+ (right column) sorted populations as determined by total RNA-sequencing (see Supplemental Experimental Procedures). Scale bar depicts log2 relative expression.

(D) Overview showing localization and frequency of differentially expressed genes (red bars) across the genome.

(E) Schematic overview over the Gm13691 containing cluster on chromosome 2 depicting the organization of coding sequence (CDS), MERVL, and LINE L1 elements. Two data tracks show wiggle plots of total RNA-seq reads for MERVL+Zscan4+ (dark blue) and negative-sorted (gray) cells.

(F) Quantification of total RNA-seq reads mapping to different repeat classes in MERVL+Zscan4c+ and negative-sorted cells, plotted as fold change over percentage of total reads. The yellow line indicates no change.

(G) Relative frequency plot showing distance between gene and nearest MERVL promoter (MT2_Mm) of all genes (black) and differentially expressed genes (red). p value < 0.0001 Mann-Whitney U test.

(H) Log10 TPM (transcripts per million) values of MERVL (x axis) and the Zscan4 cluster (y axis) of single cells sorted from negative (gray), Zscan4c+ (light blue) and Zscan4c+MERVL+ (dark blue) gates. The solid blue line represents the projected trajectory of the cells in this two-dimensional space, or “pseudotime’ (see Supplemental Experimental Procedures).

(I) Smoothed heatmap showing expression of 172 differentially expressed genes (rows) across sorted single cells (columns) ordered by MERVL expression (bottom scale bar). Median Spearman rank correlation was 0.6 between MERVL and differentially expressed genes and −0.08 between MERVL and all genes.

(J) Expression profiles of dynamic clusters of genes across pseudotime, denoting selected genes of interest.

See also Figure S1 and Tables S1, S2, and S3.

We next defined a set of 172 differentially expressed genes based on the total RNA-seq data (Tables S1 and S2), which was used in all subsequent analyses (see Supplemental Experimental Procedures). Interestingly, many of these genes had no known function and were organized in clusters of tandem repeats (Figures 1D and 1E), suggesting a coordinated and rapid regulation of homologous transcripts. Consistent with previous reports (Akiyama et al., 2015, Ishiuchi et al., 2015, Macfarlan et al., 2012), we observed specific upregulation of MERVL endogenous retroviral elements (Figures 1F and S1A) and found differentially expressed genes to be closer to the MERVL promoter (MT2_Mm) when compared to all genes (Figure 1G). Furthermore, we confirmed and extended by assay for transposase-accessible chromatin sequencing (ATAC-seq) analysis (Figures S1B and S1C) the altered nuclear organization recently described at a global level in MERVL+Zscan4+ cells (Akiyama et al., 2015, Ishiuchi et al., 2015, Macfarlan et al., 2012). The overall increase in chromatin accessibility across the genome was particularly pronounced at promoters of upregulated genes and MERVL elements (MT2_Mm and MERVL-int) (Figure S1C), consistent with the decondensed chromatin structure enabling transcriptional activation of the MERVL MT2_Mm promoter and linked protein-coding genes.

To further understand the dynamics of MERVL-LTR-driven gene activation, single-cell RNA-sequencing was performed on 319 cells sorted from the negative (75 cells), Zscan4c+ only (52 cells), and MERVL+Zscan4c+ (192 cells) populations (Figure 1B). The endogenous counterparts of the reporters were co-regulated across the single cells (Figure 1H), in that MERVL+ cells expressed the Zscan4 cluster and vice versa. There was a synchronous graded upregulation of the differentially expressed genes across the single cells (Figure 1I). Notably, cells from the Zscan4c+ only and MERVL+Zscan4c+ sorted fractions were intermingled, suggesting that the two separate populations seen by flow cytometry represented a difference in the kinetics and/or strength of the reporters and not true distinct populations. Importantly, our findings were independent of the reporters and sorting strategies used, as re-analysis of published unsorted ESC single-cell datasets (Buettner et al., 2015, Kolodziejczyk et al., 2015, Kumar et al., 2014) also revealed a similar coordinated and gradual upregulation of the MERVL-LTR-promoted transcriptional network (Figures S1D and S1E).

Next, the transcriptional dynamics of these transient cells was analyzed. By creating a “pseudotime” trajectory, cells were ordered as they activated the MERVL-LTR-driven transcriptional network (Figure 1H; see Supplemental Experimental Procedures for details). This permitted subsequent identification of genes that were dynamic over pseudotime. These genes largely overlapped with the differentially expressed genes identified above (Figure S1F) and fell into five clusters (Figure 1J; Table S3). Interestingly, the first cluster to be activated (cluster 1) includes MERVL, whereas clusters 2 and 3, which contain the Zscan4 genes, were activated soon after. Intriguingly, clusters 4 and 5 became expressed as clusters 1 and 2 began to decrease, suggesting they may include potential negative regulators of the MERVL-LTR-promoted transcriptional network. In summary, the unbiased inference of the ordering of these cells based on modeling of the single cell RNA-sequencing (scRNA-seq) data allowed us to make predictions about the transcriptional network dynamics of MERVL+Zscan4+ cell regulation.

MERVL-LTR-Driven Transcriptional Network Activated upon iPSC Reprogramming

The MERVL-driven transcriptional network that is upregulated in MERVL+Zscan4+ cells is similarly activated upon zygotic genome activation in mouse preimplantation embryos (Ishiuchi et al., 2015, Kigami et al., 2003, Macfarlan et al., 2012) (Figures 2A, S2A, and S2B; Table S4). This coincides with a developmental time window in which dynamic chromatin remodeling events accompany changes in cellular identity and potency. We therefore investigated whether the MERVL-LTR-driven transcriptional network is similarly upregulated during another cellular reprogramming event, specifically during induced pluripotent stem cell (iPSC) reprogramming. Strikingly, while there was no upregulation of the MERVL-LTR-driven transcriptional network at early stages of iPSC reprogramming, consistent with previous reports (Friedli et al., 2014), we observed a dramatic and transient upregulation of the MERVL-LTR-driven transcriptional network in the intermediate-late stages of iPSC reprogramming (Figure 2B; Table S4). In contrast, no upregulation of the MERVL-LTR driven transcriptional network was found in somatic tissues (Figure S2C; Table S4). Therefore, MERVL-LTR transcriptional network activation occurs not only in two-cell embryos in vivo but also in vitro in a subset of mouse ESCs and during iPSC reprogramming.

Figure 2.

Figure 2

MERVL-LTR-Driven Transcriptional Network Activated upon iPSC Reprogramming

(A and B) Heatmap showing relative probe-normalized expression levels of MERVL+ transcriptional network in early embryos (A) and induced pluripotent stem cell reprogramming (B). MEF, mouse embryonic fibroblasts. Refractory (SSEA1-/Thy1+) and poised (SSEA1+/Thy1) stages correspond to fluorescence-activated cell sorting (FACS)-sorted cells at day 6, where passage 8 (p8; corresponding to day 21), p12 (corresponding to day 29) iPSCs represent intermediate-late stages of reprogramming and p29 (corresponding to day 60) iPSCs are fully reprogrammed. Scale bar depicts log2 relative expression. Data are from Park et al. (2015) (A) and (I.M., unpublished data) (B).

See also Figure S2 and Table S4.

MERVL+Zscan4+ Cells Undergo Global DNA Demethylation

One commonality in the programs that transiently upregulate the MERVL-LTR-driven transcriptional network is that they coincide with global changes in DNA methylation landscapes. We therefore investigated the methylomes of MERVL+Zscan4+ cells. Mass spectrometry revealed a decrease in 5-methylcytosine (Figure 3A) and increase in 5-hydroxymethylcytosine (Figure 3B) in MERVL+Zscan4c+ sorted cells. This was confirmed by whole-genome bisulfite sequencing, which revealed a substantial decline in overall CpG methylation from 80% to 56% (Figures 3C and S3A).

Figure 3.

Figure 3

MERVL+Zscan4+ Cells Undergo Global DNA Demethylation

(A and B) Total levels of (A) 5-methylcytosine and (B) 5-hydroxymethylcytosine as a percentage of total cytosine in negative-sorted (gray) and MERVL+Zscan4c+ (M+Z+, blue) cells as determined by mass spectrometry. Bars represent mean + SD of three biological replicates. 5mC p value = 0.015, 5hmC p value = 0.034, two-tailed paired t test.

(C) Percentage of total CpG methylation determined by whole-genome bisulfite sequencing in negative-sorted (gray) and MERVL+Zscan4c+ (M+Z+, blue) cells. Bars represent mean + SD of three biological replicates. Difference is statistically significant (∗∗∗p = 0.000188, homoscedastic two-tailed t test).

(D) Scatterplot comparing percentage of methylated cytosines between negative-sorted (x axis) and MERVL+Zscan4c+ (y axis) cells. Methylated cytosines were counted for each rolling 50-CpG window genome-wide and are expressed as percent of total cytosines per window. R = 0.469.

(E) CpG methylation across part of chromosome 7 for negative-sorted (top track) and MERVL+Zscan4c+ (M+Z+) cells (bottom track). CpG methylation was quantified for each 50-CpG window and is shown as percentage of total CpGs per window. Color scale ranges from 0% methylated (blue) to 100% methylated (red). Top annotation track shows location of genes (gray) with the Zscan4 cluster highlighted in red; the middle annotation track shows location of CpG islands and bottom annotation track position of annotated MERVL elements.

(F) Bean plots showing distribution of methylation levels for different genome features between negative-sorted (gray) and MERVL+Zscan4c+ (blue) cells. Lines represent mean values.

(G) Methylation levels of different repeat classes between negative-sorted (gray) and MERVL+Zscan4c+ (blue) cells. Bars represent means + SD of three biological replicates.

See also Figure S3.

The decrease in CpG methylation occurs genome-wide and is not restricted to any individual genomic feature. There was an overall shift in the methylation levels of probes between MERVL+Zscan4c+ and negative-sorted cells (Figure 3D) that was evenly distributed across chromosomes (Figure 3E) and not specific to differentially expressed regions. Methylation levels were reduced across all genomic locations and features analyzed, including gene bodies, promoters, enhancer regions, and all repeat classes including MERVL elements (Figures 3F, 3G, and S3B). Intriguingly, genomic imprints also showed reduced methylation levels (Figures 3F and S3C). In summary, MERVL+Zscan4+ cells have a global loss of DNA methylation across all genomic contexts unlinked to the transcriptional changes occurring in the cell.

Acute DNA Demethylation Is Not Sufficient for MERVL Network Activation

Next, we investigated whether the global DNA demethylation observed was the cause of the MERVL network activation or a consequence of being in this state. First, global DNA demethylation was induced in ESCs using an inducible Dnmt1 knockout line (Sharif et al., 2016). Within 3 days of Dnmt1 loss, CpG methylation levels genome-wide (Figure 4A) and at MERVL elements (Figure 4B) were reduced to ∼30%, which is less than what is seen in MERVL+Zscan4+ cells. Despite this, there was no activation of the MERVL endogenous retrovirus (Figure 4C) or MERVL-LTR transcriptional network (Figure 4D), suggesting that acute DNA demethylation during this time window is not sufficient to enter the MERVL+Zscan4+ state. Similarly, there was no upregulation of the MERVL-LTR transcriptional network during or following the global DNA demethylation that is observed when switching ESCs from serum containing media to naive 2i conditions (Figure 4E) (Ficz et al., 2013, Habibi et al., 2013, Leitch et al., 2013, von Meyenn et al., 2016). Instead, the proportion of MERVL+Zscan4+ cells is reduced upon long-term naive 2i culture (Figures S4A and S4B) as previously reported (Macfarlan et al., 2012). Furthermore, the MERVL-LTR transcriptional network is not significantly upregulated in DNMT triple-knockout (TKO) cells (Domcke et al., 2015) that are devoid of DNA methylation (Figure S4C), again supporting the notion that MERVL/Zscan4 network activation is not a consequence of global DNA demethylation.

Figure 4.

Figure 4

Acute DNA Demethylation Is Not Sufficient for MERVL Network Activation

(A and B) Genome-wide (A) and MERVL (B) CpG methylation levels measured by bisulfite sequencing of wild-type (gray) and conditional Dnmt1 knockout ESCs induced for 1 day (dark red) or 3 days (pink).

(C) qRT-PCR analysis of MERVL expression in wild-type (gray) and conditional Dnmt1 knockout ESCs induced for 1 day (dark red) or 3 days (pink). Bars represent mean + SD of three biological replicates.

(D) Bean plots showing log2 expression levels of the MERVL-LTR-driven transcriptional network in wild-type (gray) and conditional Dnmt1 knockout ESCs induced for 1 day (dark red) or 3 days (pink).

(E) Bean plots showing log2 expression levels of the MERVL-LTR-driven transcriptional network in serum-treated (0 hr, dark green), and cells cultured in naive 2i conditions for 6 hr, 24 hr, 72 hr or long-term (LT; light green). Data were reanalyzed from von Meyenn et al. (2016). Average methylation levels are depicted above.

(F) Schematic showing release experiment in which MERVL+Zscan4c+ (M+Z+) cells are depleted from the steady-state population by flow cytometry sorting of the negative gate, followed by re-culturing. The “new” MERVL+Zscan4c+ (M+Z+) cells were subsequently collected by flow cytometry at the indicated time points for methylation analysis.

(G) Genome-wide methylation levels determined by PBAT analysis. Steady-state negative-sorted (light gray, first column) and MERVL+Zscan4c+ (M+Z+) sorted (dark blue, last column) cells are shown for reference. The other columns (light to dark blue) reflect cells that were initially sorted as negative, returned to culture for 6, 24, 48, or 72 hr, and then re-sorted as MERVL+Zscan4c+ (M+Z+) positive. Bars represent average of three biological replicates of 100 cells each ± SD, except the 6-hr time point (n = 1).

See also Figure S4.

Next, we followed the kinetics of DNA demethylation and MERVL-LTR-driven network activation to further understand the causal relationship between the two. If DNA demethylation followed MERVL+Zscan4+ state activation, the longer a cell remains in this state, the more demethylated it would become. Negative cells were isolated by flow cytometry and returned to culture, allowing the newly arising MERVL+Zscan4c+ cells to be identified (Figure 4F). By 72 hr, the initial negative sorted cells had re-established the steady-state proportion of MERVL+Zscan4c+ cells (Figure S4D). Importantly, during this time, there was a progressive loss of DNA methylation (Figure 4G). Therefore, global DNA demethylation occurs subsequently to MERVL-LTR transcriptional network activation and is not sufficient to induce the MERVL+Zscan4+ state.

Translation Block Leads to Depletion of Dnmt Enzymes

Next, the mechanism resulting in global DNA demethylation was investigated. mRNA levels of the DNA methylation machinery enzymes were mostly similar in the MERVL+Zscan4c+ cells compared to negative-sorted control cells (Figure 5A). Despite this, immunofluorescence staining of Zscan4c+ cells revealed a dramatic reduction in the protein levels of maintenance methyltransferase Dnmt1 and de novo methyltransferases Dnmt3a and Dnmt3b (Figures 5B and 5C), with some Zscan4c+ cells showing complete absence of protein.

Figure 5.

Figure 5

Translation Block Leads to Depletion of Dnmt Enzymes

(A) Expression levels of DNA methylation machinery transcripts between negative-sorted (gray) and MERVL+Zscan4c+ (M+Z+, dark blue) cells as determined by RNA-sequencing. Bars represent mean + SD of at least three biological replicates. p < 0.05, ∗∗∗p < 0.0001, homoscedastic two-tailed t test.

(B) Representative single z-slices of cells labeled by Zscan4c::EGFP reporter (green) immunostained for Dnmt1 (top row, red), Dnmt3a (middle row, red), or Dnmt3b (bottom row, red). Blue depicts DAPI staining. Zscan4c+ cells are outlined in white dotted lines. Scale bar represents 25 μm.

(C) Semiquantitative analysis of immunofluorescence staining of protein levels. Cells were scored as having high (dark), low (light), or no expression of Dnmt1, Dnmt3a, or Dnmt3b. Cells were subsequently scored using either the Zscan4c::EGFP or MERVL::tdTomato reporters as negative or MERVL+Zscan4c+ (M+Z+) cells. Differences are statistically significant (chi-square test).

(D) Expression levels of Sox2, Nanog and Oct4 pluripotency transcripts between negative sorted (gray) and MERVL+Zscan4c+ (dark blue) cells as determined by RNA-sequencing. Bars represent mean + SD of at least three biological replicates.

(E) Oct4 immunofluorescence staining of cells labeled by the MERVL::tdTomato reporter (middle panel, red). MERVL+ cells are outlined in white dotted lines. Scale bar represents 10 μm.

(F) SUnSET assay showing sites of active translation by a puromycin pulse and detected using a puromycin antibody (second panel, red) in cells labeled with the Zscan4c::EGFP reporter (first panel, green). DNA is visualized with DAPI (gray). Zscan4c+ cells are outlined in white dotted lines. Scale bar represents 10 μm.

(G) Semiquantitative analysis of SUnSET assay. Percentage of cells scored as having high (dark gray) or low (light gray) levels of puromycin immunofluorescence. Subsequently, cells were scored based on either the Zscan4c::EGFP or MERVL::tdTomato as negative cells or MERVL+Zscan4c+ cells.

See also Figure S5.

This uncoupling of the transcriptome and proteome is not limited to the Dnmt enzymes. Despite similar transcript levels (Figure 5D), immunofluorescence analysis revealed a complete absence of Oct4 and Sox2 and reduced Nanog protein levels in MERVL+ cells (Figures 5E and S5A), consistent with previous reports (Ishiuchi et al., 2015, Macfarlan et al., 2012). This suggests that a general rather than targeted method of protein depletion occurs in MERVL+Zscan4+ cells. Indeed, global repression of nascent protein synthesis in Zscan4+ cells has been previously reported and proposed to be driven by a cluster of genes on chromosome 2 (mm10 Chr12:87473449-88356013) with high sequence homology to eukaryotic initiation factor 1a (Eif1a) (Hung et al., 2013). We confirmed both an inhibition of active protein synthesis in MERVL+Zscan4c+ cells (Figures 5F and 5G) as well as upregulation of the Eif1A-like cluster (Figures S5B and S5C). Together, this supports the notion that loss of Dnmt and pluripotency proteins in MERVL+Zscan4+ cells is due to translation inhibition.

Loss of Dnmt Activity Is Sufficient for Global DNA Demethylation

Our results suggest that the global reduction in DNA methylation was due to impaired DNA methyltransferase activity resulting directly from the loss of the proteins in these cells. To analyze the relative contributions of different Dnmts, we performed hairpin bisulfite analysis which retains linkage of top and bottom strands of individual DNA duplexes. CpGs for LINE L1, MERVL, and major satellites were classified as fully methylated, fully unmethylated, or hemimethylated (in which the top strand was methylated and bottom unmethylated or vice versa). As expected, there was a decrease in fully methylated duplexes (Figure 6A), consistent with the global loss of DNA methylation. This corresponded to an increase in both unmethylated and hemimethylated molecules.

Figure 6.

Figure 6

Loss of Dnmt Activity Sufficient for Global DNA Demethylation

(A) Hairpin bisulfite analysis of LINE L1 (top), MERVL (middle), or major satellites (bottom) elements in two biological replicates of negative-sorted (first two columns) or MERVL+Zscan4c+ (last two columns) cells. Each row represents an individual read that was classified as fully methylated (orange), hemimethylated on top (light green) or bottom (dark green) strands, or fully unmethylated (blue).

(B) Predicted activity of maintenance (red) and de novo (blue) Dnmt enzymes for the three repeat classes. Color shades represent biological replicates; error bars represent the error of the model.

(C) Cell-cycle distribution of cells into G2/M (red), S (green) or G1 (purple) phases, determined by single-cell RNA-sequencing analysis as previously described (Scialdone et al., 2015).

(D) Single z-slice confocal image of a mitotic cell expressing the MERVL::tdTomato reporter (red), demonstrating the ability of MERVL+Zscan4c+ cells to undergo mitosis either during or following MERVL transcriptional activation. DAPI is shown in white. Scale bar represents 10 μm.

(E) Schematic showing methylation dynamics following replication with (left) and without (right) activity of Dnmt enzymes. Filled circles represent methylated CpGs, and empty circles represent unmethylated CpGs. Newly synthesized DNA strands are depicted in green.

See also Figure S6.

To predict the relative efficiencies of the maintenance and de novo methyltransferases, a hidden Markov model (Arand et al., 2012) was run on the hairpin bisulfite data (Figure S6A). This enabled the specific methylation efficiencies for the different Dnmts to be calculated based on the experimental data. For all regions, the model predicted a complete absence of any de novo activity (Dnmt3a/3b) and a reduced maintenance (Dnmt1) efficiency of 55%–76% compared to negative-sorted cells (Figure 6B). Importantly, MERVL+Zscan4c+ cells exist in all stages of the cell cycle (Figure 6C), albeit with a prolonged G2/M phase (Fujii et al., 2015, Storm et al., 2014), and undergo mitosis (Figure 6D), permitting a replication-dependent mechanism of demethylation. Therefore, the global DNA demethylation is likely a direct result of cell division with reduced maintenance and de novo DNA methylation (Figure 6E).

Epigenetic Consequences of Passing through the MERVL+Zscan4+ State

Lastly, the epigenetic and functional consequences of the MERVL+Zscan4+ state were investigated. Cells were isolated by flow cytometry using the Zscan4c and MERVL reporters and then placed back in culture (Figure 7A). By 72 hr, the cells had mostly exited the MERVL+Zscan4c+ state and returned to the negative population (Figures 7B and 7C), confirming that cells are able to cycle out of the state. Immunofluorescence staining revealed the newly emerging negative cells now expressed pluripotency and Dnmt proteins (Figure 7D), indicating that the translation block had been lifted and protein synthesis resumed. We next assessed whether the methylome of these cells was restored. MERVL+Zscan4c+ cells were sorted and re-plated and the newly emerging negative cell population isolated for bisulfite analysis. Within 3 days, methylation levels of the cells that had now left the MERVL+Zscan4c+ state had returned to steady-state levels (Figure 7E), confirming that the demethylation observed in these cells is a transient event and does not persist once the cells exit the state.

Figure 7.

Figure 7

Epigenetic Consequences of Passing through the MERVL+Zscan4+ State

(A) Schematic showing release experiment in which MERVL+Zscan4c+ cells were purified from the steady-state population by flow cytometry sorting of the Zscan4c+ MERVL+ gate, followed by re-culturing. At the indicated time points, cells were re-analyzed by flow cytometry and the Zscan4c/MERVL-negative population collected for immunofluorescence and/or methylation analysis.

(B) Flow cytometry profiles of sorted MERVL+Zscan4c+ (M+Z+) cells following re-culturing for 24 hr (green), 72 hr (blue), or 7 days (black).

(C) Quantification of the proportion of cells remaining in the MERVL+Zscan4+ state (MERVL+ and/or Zscan4c+) at various time points following re-culturing of the sorted cells.

(D) Immunofluorescence staining of sorted MERVL+Zscan4c+ cells that were re-cultured for 14 days. Top row shows Oct4 (first column), Dnmt1 (second column), Dnmt3a (third column), and Dnmt3b (last column). Bottom row depicts merge with DAPI in blue. Scale bar represents 25 μm.

(E) Genome-wide methylation levels determined by PBAT analysis. Steady-state negative-sorted (light gray) and MERVL+Zscan4c+ sorted (M+Z+, dark blue) cells are shown in the first two columns. The last two columns (dark gray) reflect cells that have exited the MERVL+Zscan4c+ state. To identify these, MERVL+Zscan4c+ cells were isolated by flow cytometry and returned to culture. After 3 or 7 days, cells were sorted again. Those lacking MERVL/Zscan4c reporter expression are defined as newly emerging negative cells. Bars represent the mean ± SD of three biological replicates of 100 cells each.

(F) Box plots showing distribution of methylation levels for whole-genome, gene bodies, and imprint DMRs. Steady-state negative-sorted (light gray) and MERVL+Zscan4c+ sorted (M+Z+, dark blue) cells are shown in the first two columns for each feature analyzed. The last two columns (dark gray) reflect cells that have exited the MERVL+Zscan4c+ state identified as in (E) above.

(G) Methylation analysis of Peg13, H19, and Gtl2 imprinted DMRs. Cells were sorted from either negative gates (first of bar pairs) or as MERVL+Zscan4c+ positive (second of bar pairs) and returned to culture for 14 days, and analysis was performed on the resulting bulk population. Imprint DMRs were amplified from bisulfite-treated DNA and sequenced giving >1,000-fold coverage. Reads were binned based on their total amount of methylation (0%–20% white, 21%–40% light blue, 41%–60% medium blue, 61%–80% dark blue, and 81%–100% black).

(H) Average methylation levels for each CpG within Gtl2 DMR for negative (gray) or MERVL+Zscan4c+ (M+Z+, blue) cells that had been initially sorted and then replated for 14 days.

(I) MERVL endogenous retrovirus activation induces genome-wide DNA demethylation. Embryonic stem cells cycle in and out of the MERVL+Zscan4+ state. Entry into the state is initiated by chromatin decompaction, which enables accessibility of transcription factors, resulting in activation of the MERVL-LTR-driven transcriptional network. Subsequently, inhibition of protein synthesis depletes DNMT protein levels in the cell, which results in genome-wide DNA demethylation. Upon exit from the MERVL+Zscan4+ state, the translation block is lifted through auto-inhibition, protein synthesis resumes, and DMNT protein levels return to normal. While genome-wide methylation levels are restored, genomic imprints remain unmethylated, potentially explaining how imprints are lost in long-term ESC cultures.

Finally, the functional consequences of passing through the MERVL+Zscan4+ state were assessed. In particular we investigated the methylation status of imprinted regions: clusters of genes expressed exclusively from either the maternal or paternal alleles and controlled through differential methylation of imprint control regions or differentially methylated regions (DMRs). Loss of imprinting is associated with severe developmental disorders and cancer progression, and thus, imprints must be carefully maintained (reviewed in Barlow and Bartolomei, 2014). Significantly, in MERVL+Zscan4c+ cells, the global DNA demethylation extended to the imprint DMRs (Figures 3F and 7F). Thus, we investigated whether this imprint loss endured in cells that had passed through the MERVL+Zscan4+ state by further analysis of the newly emerging negative cells (see above). Consistent with a restoration of DNA methylation, CpG methylation levels genome-wide and at gene bodies returned to steady-state negative levels by 7 days (Figure 7F). Strikingly, imprint DMRs remained hypomethylated in the cells that had passed through the MERVL+Zscan4c+ state (Figure 7F). To further confirm this, negative cells and MERVL+Zscan4c+ cells were sorted by flow cytometry and then returned to culture for 14 days, after which bulk populations were collected. Imprint DMRs were amplified from bisulfite-treated DNA and sequenced, yielding >1,000-fold coverage per region analyzed. As expected, we observed intermediate methylation levels in negative-sorted cells resulting from a mix of methylated and unmethylated alleles (Figure 7G). However, the DMRs examined of cells that had passed through the MERVL+Zscan4c+ state were hypomethylated compared to controls (Figures 7G and 7H), implying that once the imprints are lost, they are not restored.

In summary, through a combination of single-cell transcriptome and genome-wide epigenetic analyses, we provide significant advances towards our understanding of MERVL+Zscan4+ cells. We reveal a graded upregulation of a MERVL-promoter-driven transcriptional program as cells enter the MERVL+Zscan4+ state, which is similarly upregulated in the two-cell embryo and during iPSC reprogramming. Intriguingly, cells that upregulate the MERVL-LTR-promoted transcriptional network exhibit genome-wide DNA demethylation caused by a translation-block-induced depletion of Dnmt proteins in the cells. While DNA methylation is regained genome-wide once the cell exits the state, genomic imprints are not restored, providing a potential mechanism of how imprint erasure may occur in ESC cultures.

Discussion

We have provided insights into the dynamics and functional consequences of a MERVL-LTR-driven transcriptional network in mouse ESCs. Activation of this network coincides with dramatic chromatin remodeling and ultimately results in genome-wide DNA demethylation. The following model for MERVL network activation in ESCs is proposed (Figure 7I): Chromatin decompaction enables transcription factor accessibility to otherwise heterochromatic and inaccessible MERVL MT2_Mm promoters (Akiyama et al., 2015, Ishiuchi et al., 2015). This drives transcription of full-length MERVL elements and/or downstream sequences, including clusters of tightly packed repeated transcripts of mostly unknown function. Among these are the Eif1a-like family members that may act as dominant-negative inhibitors of translation, causing a depletion of proteins in the cell. This includes DNA methyltransferases, whose loss results in a reduced ability to establish and maintain DNA methylation following DNA replication and consequently global loss of DNA methylation genome-wide. Crucially, while genome-wide methylation levels are restored following exit from the MERVL+Zscan4+ state, genomic imprints are not. Not all systems of extensive DNA demethylation result in imprint erasure; while ESCs undergo demethylation in 2i conditions, imprints are usually protected (Ficz et al., 2013, Habibi et al., 2013). However, addition of vitamin C to 2i conditions has recently been shown to further demethylate the genome, including imprints (Walter et al., 2016).

Variable imprint loss frequently occurs in mouse ESC lines and ESC-derived fetuses (Dean et al., 1998, Deng et al., 2007, Humpherys et al., 2001, Sun et al., 2012). In our study, we demonstrate that the global DNA demethylation observed in MERVL+Zscan4+ cells extends to genomic imprints. While genome methylation levels are restored when cells exit the MERVL+Zscan4+ state, the failure to restore genomic imprints provides a potential mechanism of imprint loss in mouse ESCs. Consistently, mouse ESCs (which have the MERVL+Zscan4+ state) are more vulnerable to imprint erasure than epiblast stem cells (EpiSCs) (Sun et al., 2012). For genome-wide and imprint DNA demethylation to occur, a cell must undergo DNA replication while in the MERVL+Zscan4+ state. Given that ∼1% of ESCs are in the MERVL+Zscan4+ state at a given time, of which ∼20% are in S phase, the actual manifestation of imprint loss through this mechanism would be infrequent and sporadic, and not all ESC cultures will lose imprints in this way. Increasing the frequency of entering the MERVL+Zscan4+ state and/or clonal expansion would increase the risk of imprint erasure, although we cannot rule out the contribution of a stress response or that other mechanisms contribute towards imprint loss. This has important consequences if ESCs are to be studied functionally and/or used to generate genetically modified mice.

Our single-cell transcriptome analyses reveal the dynamics of MERVL network activation. By ordering the cells by pseudotime, we were able to identify the earliest transcripts upregulated as cells enter the MERVL+Zscan4+ state, which importantly included MERVL, supporting its role as a driver. Following MERVL activation, the remainder of the transcriptional network is activated in subsequent waves, including a final group of potential repressors that may act to enable the cell to exit the MERVL+Zscan4+ state.

In addition to ESCs, activation of the MERVL-LTR-driven transcriptional network occurs in preimplantation embryos at the time of zygotic genome activation (two-cell stage). This has led to the cells being called “2C-like” and claims of increased potency made (Ishiuchi et al., 2015, Macfarlan et al., 2012). Furthermore, the increased chromatin accessibility in MERVL+Zscan4+ cells is pronounced at MERVL elements, consistent with the open chromatin at active MERVLs in two-cell embryos (Wu et al., 2016). Importantly, we show that the network is activated in vitro during the early stages of iPSC reprogramming, suggesting that it may be a general feature of cell identity change, although it remains unknown if this network is upregulated in vivo at other developmental stages or species. Interestingly, forced overexpression of Zscan4 increases the efficiency of iPSC reprogramming and increases the expression of members of the MERVL-LTR-driven transcriptional network (Hirata et al., 2012).

Given the dynamic nature of MERVL+Zscan4+ cells, the extent and generality of DNA hypomethylation was striking. The demethylation occurred globally across all genomic features and importantly occurs as a consequence of retroviral network activation and not vice versa. Consistently, treatment of ESCs with 5-azacytidine, which induces DNA demethylation, does not lead to activation of MERVL-LTR-driven transcripts (Macfarlan et al., 2011). Our hairpin bisulfite analysis revealed that a reduced activity of the maintenance methyltransferase Dnmt1 and the absence of the de novo methyltransferases Dnmt3a and Dnmt3b could explain the observed demethylation. Indeed, there was a dramatic depletion in the Dnmt proteins in MERVL+Zscan4c+ cells, despite similar transcript levels. We confirmed a global reduction of translation in MERVL+Zscan4+ cells (Hung et al., 2013), which is likely the mechanism through which depletion of Dnmt and pluripotency proteins, including Oct4 (Ishiuchi et al., 2015, Macfarlan et al., 2012), occurs, although it is possible that additional pathways, including the altered cell cycle of MERVL+Zscan4+ cells, could also be involved. Interestingly, as the cluster of Eif1a-like transcripts that have been proposed to act in a dominant-negative manner to suppress protein synthesis (Hung et al., 2013) are among the earliest group of transcripts upregulated in MERVL+Zscan4c+ cells (cluster 2 in Figure 1E), transcripts expressed from later clusters may not be translated in MERVL+Zscan4+ cells.

In conclusion, we provide detailed molecular insights into the dynamics of an endogenous retroviral transcriptional network activated in rare mouse ESCs, preimplantation embryos and during iPSC reprogramming. Our studies also demonstrate the downstream effects of activating this network, including global DNA demethylation. While methylation levels are restored once cells exit the MERVL+Zscan4+ state, genomic imprints are not. In summary, our study provides a detailed molecular understanding of the transcriptional dynamics and epigenetic characteristics of MERVL+Zscan4+ cells and potential functional consequences of this dynamic transient ESC state.

Experimental Procedures

Cell Culture

E14 ESCs were cultured in DMEM containing 15% fetal calf serum and 103 U/mL leukemia inhibitory factor (LIF); for 2i/LIF experiments, cells were cultured in serum-free N2B27 supplemented with LIF, 1 μM PD0325901, and 3 μM CHIR99021 inhibitors. The Zscan4c::emerald plasmid (90636) was kindly provided by Minoru Ko (Zalzman et al., 2010), and the MERVL::tdTomato reporter (Macfarlan et al., 2012) was a gift from Samuel Pfaff (Addgene #40281). Reporter stable clonal lines were generated by transfection (Fugene6, Promega), drug selection, and subcloning. Flow cytometry analysis was performed using the BD LSR Fortessa and sorts performed on the BD Aria III or BD Influx High-Speed Cell Sorter.

RNA Isolation, qPCR, and Total RNA-Sequencing

RNA was isolated using QIAGEN RNA-DNA allprep columns or TriReagent (Sigma) and treated with DNaseI (Ambion). cDNA was generated using 0.5–1 μg RNA (Thermo RevertAid) and qRT-PCR performed using the Brilliant III SYBR mix (Agilent Technologies). Relative quantification was performed using the comparative CT method with normalization to CycloB1 levels. Primer sequences available upon request. Opposite strand-specific total RNA libraries (ribozero) were made using 200 ng to 1 μg DNase-treated RNA using the Sanger Institute Illumina bespoke pipeline. ∼6–8 × 106 paired-end 75-bp reads were generated per sample (at least three biological replicates each) using the Illumina HiSeq 2500 Rapid Run platform.

Whole-Genome Bisulfite Sequencing

Whole-genome bisulfite sequencing libraries were generated using a post-bisulfite adaptor tagging (PBAT) method as previously described (Peat et al., 2014) with ten cycles of amplification. Three biological replicates were generated per condition and libraries sequenced using Illumina HiSeq 2000.

For additional methods, including immunofluorescence, ATAC-seq, and full bioinformatics analysis and statistical methods, see the Supplemental Experimental Procedures.

Author Contributions

M.A.E.-M. and W.R. conceived and designed the study. M.A.E.-M. performed experiments, analyzed data, and wrote the paper. V.S. helped design and performed all analysis on single-cell RNA-seq experiments. C. Krueger performed repeat transcriptome analysis and edited the paper. T.M.S. helped design and perform low-cell-number PBAT experiments. P.G. performed hairpin bisulfite experiments. F.K. performed bioinformatics processing. C. Kyriakopoulos performed hairpin bisulfite modelling. R.J.M. prepared single-cell RNA-seq libraries. R.V.B. performed Dnmt1 knockout experiments. I.M. performed iPSC experiments. J.W., S.A.T., and W.R. supervised the study.

Acknowledgments

The authors thank all members of the W.R. laboratory for helpful discussions. We also thank Ferdinand von Meyenn and Mario Iurlaro for serum-2i transcriptome data, Simon Andrews for bioinformatics support, Simon Walker for imaging support, Nathalie Smerdon for high-throughput sequencing assistance, David Oxley for mass spectrometry, Rachael Walker for flow cytometry assistance, Ruslan Strogantsev for genomic imprinting discussions, Minoru Ko for the Zscan4c::EGFP plasmid, Samuel Pfaff for the MERVL::tdTomato plasmid, and Haruhiko Koseki for Dnmt1fl/fl ESCs. M.A.E.-M. is supported by an EMBO Fellowship (ALTF938-2014) and Marie Sklodowska-Curie Individual Fellowship. P.G. was supported by DFG grant WA1029 within the SFB1027. Research in W.R.’s lab is supported by the BBSRC (grant BB/K010867/1), Wellcome Trust (grant 095645/Z/11/Z), EU EpiGeneSys, and BLUEPRINT.

Published: September 27, 2016

Footnotes

Supplemental Information includes Supplemental Experimental Procedures, six figures, and four tables and can be found with this article online at http://dx.doi.org/10.1016/j.celrep.2016.08.087.

Contributor Information

Mélanie A. Eckersley-Maslin, Email: eckersleym@babraham.ac.uk.

Wolf Reik, Email: wolf.reik@babraham.ac.uk.

Accession Numbers

The accession numbers for the datasets reported in this paper are GEO: GSE75751, GEO: GSE85776, and ArrayExpress: E-MTAB-5058 (single-cell RNA-seq data).

Supplemental Information

Document S1. Supplemental Experimental Procedures and Figures S1–S6
mmc1.pdf (1.2MB, pdf)
Table S1. Expression Levels of Differentially Expressed Genes, Related to Figure 1

List of differentially expressed genes between Negative (MERVL−Zscan4−), Zscan4+ only (MERVL−Zscan4+) and MERVL+Zscan4+ sorted cells, along with their chromosome coordinates, ENSEMBL gene ID, Description and average log2 expression value for the replicates across the three conditions.

mmc2.xlsx (64.9KB, xlsx)
Table S2. Expression Levels of All Assessed Genes, Related to Figure 1

List of all genes, chromosome coordinates, ENSEMBL gene ID, Description and average log2 expression value across the Negative (MERVLZscan4−), Zscan4+ only (MERVL−Zscan4+) and MERVL+Zscan4+ sorted cells.

mmc3.xlsx (2.7MB, xlsx)
Table S3. List of Genes Dynamic over Pseudotime, Related to Figure 1

List of transcripts contained within the 5 clusters that are dynamic over pseudotime as determined by single-cell RNA sequencing analysis (see Supplemental Experimental Procedures).

mmc4.xlsx (156.4KB, xlsx)
Table S4. Expression Levels of Differentially Expressed Genes in Other Datasets, Related to Figure 2

Expression levels for the 172 MERVL-LTR driven genes in the different datasets analyzed. (A) Park et al. (B) Deng et al. (C) Xue et al. (D) Milagre et al. (E) Encode. Note that due to differences in library preparation and data analysis it is not appropriate to compare values across datasets.

mmc5.xlsx (85.8KB, xlsx)
Document S2. Article plus Supplemental Information
mmc6.pdf (5.6MB, pdf)

References

  1. Akiyama T., Xin L., Oda M., Sharov A.A., Amano M., Piao Y., Cadet J.S., Dudekula D.B., Qian Y., Wang W. Transient bursts of Zscan4 expression are accompanied by the rapid derepression of heterochromatin in mouse embryonic stem cells. DNA Res. 2015;22:307–318. doi: 10.1093/dnares/dsv013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Arand J., Spieler D., Karius T., Branco M.R., Meilinger D., Meissner A., Jenuwein T., Xu G., Leonhardt H., Wolf V., Walter J. In vivo control of CpG and non-CpG DNA methylation by DNA methyltransferases. PLoS Genet. 2012;8:e1002750. doi: 10.1371/journal.pgen.1002750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barlow D.P., Bartolomei M.S. Genomic imprinting in mammals. Cold Spring Harb. Perspect. Biol. 2014;6:a018382. doi: 10.1101/cshperspect.a018382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bošković A., Eid A., Pontabry J., Ishiuchi T., Spiegelhalter C., Raghu Ram E.V.S., Meshorer E., Torres-Padilla M.-E. Higher chromatin mobility supports totipotency and precedes pluripotency in vivo. Genes Dev. 2014;28:1042–1047. doi: 10.1101/gad.238881.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Buettner F., Natarajan K.N., Casale F.P., Proserpio V., Scialdone A., Theis F.J., Teichmann S.A., Marioni J.C., Stegle O. Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells. Nat. Biotechnol. 2015;33:155–160. doi: 10.1038/nbt.3102. [DOI] [PubMed] [Google Scholar]
  6. Dan J., Yang J., Liu Y., Xiao A., Liu L. Roles for histone acetylation in regulation of telomere elongation and two-cell state in mouse ES cells. J. Cell. Physiol. 2015;230:2337–2344. doi: 10.1002/jcp.24980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. De Los Angeles A., Ferrari F., Xi R., Fujiwara Y., Benvenisty N., Deng H., Hochedlinger K., Jaenisch R., Lee S., Leitch H.G. Hallmarks of pluripotency. Nature. 2015;525:469–478. doi: 10.1038/nature15515. [DOI] [PubMed] [Google Scholar]
  8. Dean W., Bowden L., Aitchison A., Klose J., Moore T., Meneses J.J., Reik W., Feil R. Altered imprinted gene methylation and expression in completely ES cell-derived mouse fetuses: association with aberrant phenotypes. Development. 1998;125:2273–2282. doi: 10.1242/dev.125.12.2273. [DOI] [PubMed] [Google Scholar]
  9. Deng T., Kuang Y., Zhang D., Wang L., Sun R., Xu G., Wang Z., Fei J. Disruption of imprinting and aberrant embryo development in completely inbred embryonic stem cell-derived mice. Dev. Growth Differ. 2007;49:603–610. doi: 10.1111/j.1440-169X.2007.00955.x. [DOI] [PubMed] [Google Scholar]
  10. Domcke S., Bardet A.F., Adrian Ginno P., Hartl D., Burger L., Schübeler D. Competition between DNA methylation and transcription factors determines binding of NRF1. Nature. 2015;528:575–579. doi: 10.1038/nature16462. [DOI] [PubMed] [Google Scholar]
  11. Falco G., Lee S.-L., Stanghellini I., Bassey U.C., Hamatani T., Ko M.S.H. Zscan4: a novel gene expressed exclusively in late 2-cell embryos and embryonic stem cells. Dev. Biol. 2007;307:539–550. doi: 10.1016/j.ydbio.2007.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ficz G., Hore T.A., Santos F., Lee H.J., Dean W., Arand J., Krueger F., Oxley D., Paul Y.-L., Walter J. FGF signaling inhibition in ESCs drives rapid genome-wide demethylation to the epigenetic ground state of pluripotency. Cell Stem Cell. 2013;13:351–359. doi: 10.1016/j.stem.2013.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Friedli M., Trono D. The developmental control of transposable elements and the evolution of higher species. Annu. Rev. Cell Dev. Biol. 2015;31:429–451. doi: 10.1146/annurev-cellbio-100814-125514. [DOI] [PubMed] [Google Scholar]
  14. Friedli M., Turelli P., Kapopoulou A., Rauwel B., Castro-Díaz N., Rowe H.M., Ecco G., Unzu C., Planet E., Lombardo A. Loss of transcriptional control over endogenous retroelements during reprogramming to pluripotency. Genome Res. 2014;24:1251–1259. doi: 10.1101/gr.172809.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fujii S., Nishikawa-Torikai S., Futatsugi Y., Toyooka Y., Yamane M., Ohtsuka S., Niwa H. Nr0b1 is a negative regulator of Zscan4c in mouse embryonic stem cells. Sci. Rep. 2015;5:9146. doi: 10.1038/srep09146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Habibi E., Brinkman A.B., Arand J., Kroeze L.I., Kerstens H.H.D., Matarese F., Lepikhov K., Gut M., Brun-Heath I., Hubner N.C. Whole-genome bisulfite sequencing of two distinct interconvertible DNA methylomes of mouse embryonic stem cells. Cell Stem Cell. 2013;13:360–369. doi: 10.1016/j.stem.2013.06.002. [DOI] [PubMed] [Google Scholar]
  17. Hayashi M., Maehara K., Harada A., Semba Y., Kudo K., Takahashi H., Oki S., Meno C., Ichiyanagi K., Akashi K., Ohkawa Y. Chd5 regulates MuERV-L/MERVL expression in mouse embryonic stem cells via H3K27me3 modification and histone H3.1/H3.2. J. Cell. Biochem. 2016;117:780–792. doi: 10.1002/jcb.25368. [DOI] [PubMed] [Google Scholar]
  18. Hirata T., Amano T., Nakatake Y., Amano M., Piao Y., Hoang H.G., Ko M.S.H. Zscan4 transiently reactivates early embryonic genes during the generation of induced pluripotent stem cells. Sci. Rep. 2012;2:208. doi: 10.1038/srep00208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hisada K., Sánchez C., Endo T.A., Endoh M., Román-Trufero M., Sharif J., Koseki H., Vidal M. RYBP represses endogenous retroviruses and preimplantation- and germ line-specific genes in mouse embryonic stem cells. Mol. Cell. Biol. 2012;32:1139–1149. doi: 10.1128/MCB.06441-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Humpherys D., Eggan K., Akutsu H., Hochedlinger K., Rideout W.M., 3rd, Biniszkiewicz D., Yanagimachi R., Jaenisch R. Epigenetic instability in ES cells and cloned mice. Science. 2001;293:95–97. doi: 10.1126/science.1061402. [DOI] [PubMed] [Google Scholar]
  21. Hung S.S.C., Wong R.C.B., Sharov A.A., Nakatake Y., Yu H., Ko M.S.H. Repression of global protein synthesis by Eif1a-like genes that are expressed specifically in the two-cell embryos and the transient Zscan4-positive state of embryonic stem cells. DNA Res. 2013;20:391–402. doi: 10.1093/dnares/dst018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ishiuchi T., Torres-Padilla M.-E. Towards an understanding of the regulatory mechanisms of totipotency. Curr. Opin. Genet. Dev. 2013;23:512–518. doi: 10.1016/j.gde.2013.06.006. [DOI] [PubMed] [Google Scholar]
  23. Ishiuchi T., Enriquez-Gasca R., Mizutani E., Bošković A., Ziegler-Birling C., Rodriguez-Terrones D., Wakayama T., Vaquerizas J.M., Torres-Padilla M.-E. Early embryonic-like cells are induced by downregulating replication-dependent chromatin assembly. Nat. Struct. Mol. Biol. 2015;22:662–671. doi: 10.1038/nsmb.3066. [DOI] [PubMed] [Google Scholar]
  24. Kigami D., Minami N., Takayama H., Imai H. MuERV-L is one of the earliest transcribed genes in mouse one-cell embryos. Biol. Reprod. 2003;68:651–654. doi: 10.1095/biolreprod.102.007906. [DOI] [PubMed] [Google Scholar]
  25. Kolodziejczyk A.A., Kim J.K., Tsang J.C.H., Ilicic T., Henriksson J., Natarajan K.N., Tuck A.C., Gao X., Bühler M., Liu P. Single cell RNA-sequencing of pluripotent states unlocks modular transcriptional variation. Cell Stem Cell. 2015;17:471–485. doi: 10.1016/j.stem.2015.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kumar R.M., Cahan P., Shalek A.K., Satija R., DaleyKeyser A.J., Li H., Zhang J., Pardee K., Gennert D., Trombetta J.J. Deconstructing transcriptional heterogeneity in pluripotent stem cells. Nature. 2014;516:56–61. doi: 10.1038/nature13920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lee H.J., Hore T.A., Reik W. Reprogramming the methylome: erasing memory and creating diversity. Cell Stem Cell. 2014;14:710–719. doi: 10.1016/j.stem.2014.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Leitch H.G., McEwen K.R., Turp A., Encheva V., Carroll T., Grabole N., Mansfield W., Nashun B., Knezovich J.G., Smith A. Naive pluripotency is associated with global DNA hypomethylation. Nat. Struct. Mol. Biol. 2013;20:311–316. doi: 10.1038/nsmb.2510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Macfarlan T.S., Gifford W.D., Agarwal S., Driscoll S., Lettieri K., Wang J., Andrews S.E., Franco L., Rosenfeld M.G., Ren B., Pfaff S.L. Endogenous retroviruses and neighboring genes are coordinately repressed by LSD1/KDM1A. Genes Dev. 2011;25:594–607. doi: 10.1101/gad.2008511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Macfarlan T.S., Gifford W.D., Driscoll S., Lettieri K., Rowe H.M., Bonanomi D., Firth A., Singer O., Trono D., Pfaff S.L. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature. 2012;487:57–63. doi: 10.1038/nature11244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Maksakova I.A., Thompson P.J., Goyal P., Jones S.J., Singh P.B., Karimi M.M., Lorincz M.C. Distinct roles of KAP1, HP1 and G9a/GLP in silencing of the two-cell-specific retrotransposon MERVL in mouse ES cells. Epigenetics Chromatin. 2013;6:15. doi: 10.1186/1756-8935-6-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Park S.-J., Shirahige K., Ohsugi M., Nakai K. DBTMEE: a database of transcriptome in mouse early embryos. Nucleic Acids Res. 2015;43:D771–D776. doi: 10.1093/nar/gku1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Peat J.R., Dean W., Clark S.J., Krueger F., Smallwood S.A., Ficz G., Kim J.K., Marioni J.C., Hore T.A., Reik W. Genome-wide bisulfite sequencing in zygotes identifies demethylation targets and maps the contribution of TET3 oxidation. Cell Rep. 2014;9:1990–2000. doi: 10.1016/j.celrep.2014.11.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rowe H.M., Jakobsson J., Mesnard D., Rougemont J., Reynard S., Aktas T., Maillard P.V., Layard-Liesching H., Verp S., Marquis J. KAP1 controls endogenous retroviruses in embryonic stem cells. Nature. 2010;463:237–240. doi: 10.1038/nature08674. [DOI] [PubMed] [Google Scholar]
  35. Schlesinger S., Goff S.P. Retroviral transcriptional regulation and embryonic stem cells: war and peace. Mol. Cell. Biol. 2015;35:770–777. doi: 10.1128/MCB.01293-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Scialdone A., Natarajan K.N., Saraiva L.R., Proserpio V., Teichmann S.A., Stegle O., Marioni J.C., Buettner F. Computational assignment of cell-cycle stage from single-cell transcriptome data. Methods. 2015;85:54–61. doi: 10.1016/j.ymeth.2015.06.021. [DOI] [PubMed] [Google Scholar]
  37. Sharif J., Endo T.A., Nakayama M., Karimi M.M., Shimada M., Katsuyama K., Goyal P., Brind'Amour J., Sun M.A., Sun Z. Activation of endogenous retroviruses in Dnmt1(-/-) ESCs involves disruption of SETDB1-mediated repression by NP95 binding to hemimethylated DNA. Cell Stem Cell. 2016;19:81–94. doi: 10.1016/j.stem.2016.03.013. [DOI] [PubMed] [Google Scholar]
  38. Storm M.P., Kumpfmueller B., Bone H.K., Buchholz M., Sanchez Ripoll Y., Chaudhuri J.B., Niwa H., Tosh D., Welham M.J. Zscan4 is regulated by PI3-kinase and DNA-damaging agents and directly interacts with the transcriptional repressors LSD1 and CtBP2 in mouse embryonic stem cells. PLoS ONE. 2014;9:e89821. doi: 10.1371/journal.pone.0089821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Sun B., Ito M., Mendjan S., Ito Y., Brons I.G.M., Murrell A., Vallier L., Ferguson-Smith A.C., Pedersen R.A. Status of genomic imprinting in epigenetically distinct pluripotent stem cells. Stem Cells. 2012;30:161–168. doi: 10.1002/stem.793. [DOI] [PubMed] [Google Scholar]
  40. Thompson P.J., Dulberg V., Moon K.-M., Foster L.J., Chen C., Karimi M.M., Lorincz M.C. hnRNP K coordinates transcriptional silencing by SETDB1 in embryonic stem cells. PLoS Genet. 2015;11:e1004933. doi: 10.1371/journal.pgen.1004933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Torres-Padilla M.E., Chambers I. Transcription factor heterogeneity in pluripotent stem cells: a stochastic advantage. Development. 2014;141:2173–2181. doi: 10.1242/dev.102624. [DOI] [PubMed] [Google Scholar]
  42. von Meyenn F., Iurlaro M., Habibi E., Liu N.Q., Salehzadeh-Yazdi A., Santos F., Petrini E., Milagre I., Yu M., Xie Z. Impairment of DNA methylation maintenance is the main cause of global demethylation in naive embryonic stem cells. Mol. Cell. 2016;62:848–861. doi: 10.1016/j.molcel.2016.04.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Walter M., Teissandier A., Pérez-Palacios R., Bourc’his D. An epigenetic switch ensures transposon repression upon dynamic loss of DNA methylation in embryonic stem cells. eLife. 2016;5:e11418. doi: 10.7554/eLife.11418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Wu J., Huang B., Chen H., Yin Q., Liu Y., Xiang Y., Zhang B., Liu B., Wang Q., Xia W. The landscape of accessible chromatin in mammalian preimplantation embryos. Nature. 2016;534:652–657. doi: 10.1038/nature18606. [DOI] [PubMed] [Google Scholar]
  45. Zalzman M., Falco G., Sharova L.V., Nishiyama A., Thomas M., Lee S.-L., Stagg C.A., Hoang H.G., Yang H.-T., Indig F.E. Zscan4 regulates telomere elongation and genomic stability in ES cells. Nature. 2010;464:858–863. doi: 10.1038/nature08882. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supplemental Experimental Procedures and Figures S1–S6
mmc1.pdf (1.2MB, pdf)
Table S1. Expression Levels of Differentially Expressed Genes, Related to Figure 1

List of differentially expressed genes between Negative (MERVL−Zscan4−), Zscan4+ only (MERVL−Zscan4+) and MERVL+Zscan4+ sorted cells, along with their chromosome coordinates, ENSEMBL gene ID, Description and average log2 expression value for the replicates across the three conditions.

mmc2.xlsx (64.9KB, xlsx)
Table S2. Expression Levels of All Assessed Genes, Related to Figure 1

List of all genes, chromosome coordinates, ENSEMBL gene ID, Description and average log2 expression value across the Negative (MERVLZscan4−), Zscan4+ only (MERVL−Zscan4+) and MERVL+Zscan4+ sorted cells.

mmc3.xlsx (2.7MB, xlsx)
Table S3. List of Genes Dynamic over Pseudotime, Related to Figure 1

List of transcripts contained within the 5 clusters that are dynamic over pseudotime as determined by single-cell RNA sequencing analysis (see Supplemental Experimental Procedures).

mmc4.xlsx (156.4KB, xlsx)
Table S4. Expression Levels of Differentially Expressed Genes in Other Datasets, Related to Figure 2

Expression levels for the 172 MERVL-LTR driven genes in the different datasets analyzed. (A) Park et al. (B) Deng et al. (C) Xue et al. (D) Milagre et al. (E) Encode. Note that due to differences in library preparation and data analysis it is not appropriate to compare values across datasets.

mmc5.xlsx (85.8KB, xlsx)
Document S2. Article plus Supplemental Information
mmc6.pdf (5.6MB, pdf)

RESOURCES