Skip to main content
eLife logoLink to eLife
. 2017 Mar 21;6:e22345. doi: 10.7554/eLife.22345

Stella modulates transcriptional and endogenous retrovirus programs during maternal-to-zygotic transition

Yun Huang 1,2,, Jong Kyoung Kim 3,4,, Dang Vinh Do 1,2, Caroline Lee 1,2, Christopher A Penfold 1,2, Jan J Zylicz 1,2, John C Marioni 3,5,6, Jamie A Hackett 1,2,7,*, M Azim Surani 1,2,*
Editor: Robb Krumlauf8
PMCID: PMC5404928  PMID: 28323615

Abstract

The maternal-to-zygotic transition (MZT) marks the period when the embryonic genome is activated and acquires control of development. Maternally inherited factors play a key role in this critical developmental process, which occurs at the 2-cell stage in mice. We investigated the function of the maternally inherited factor Stella (encoded by Dppa3) using single-cell/embryo approaches. We show that loss of maternal Stella results in widespread transcriptional mis-regulation and a partial failure of MZT. Strikingly, activation of endogenous retroviruses (ERVs) is significantly impaired in Stella maternal/zygotic knockout embryos, which in turn leads to a failure to upregulate chimeric transcripts. Amongst ERVs, MuERV-L activation is particularly affected by the absence of Stella, and direct in vivo knockdown of MuERV-L impacts the developmental potential of the embryo. We propose that Stella is involved in ensuring activation of ERVs, which themselves play a potentially key role during early development, either directly or through influencing embryonic gene expression.

DOI: http://dx.doi.org/10.7554/eLife.22345.001

Research Organism: Mouse

eLife digest

When a sperm cell fertilizes an egg cell, this creates a single-celled embryo called a zygote that will go on to divide repeatedly throughout development. The zygote inherits the contents of the egg including many important proteins that initially control how the embryo develops. In mice, the embryo takes over control of development once the zygote has divided to form a two-celled embryo. This transition of control destroys the maternally inherited proteins and selectively activates zygotic genes and some DNA sequences called transposable elements that evolved from virus DNA.

One of the proteins inherited from the egg cell is called Stella. Embryos that lack Stella die within the first few cell divisions, which suggests that the protein is needed during the earliest stages of development. However, it is not clear what Stella's role in the early embryo is.

Huang, Kim et al. decided to investigate how Stella might affect the genes that are switched on during development by using a technique called RNA-seq to study egg cells and early embryos from mice. Two-celled embryos that lacked Stella could not activate a number of genes that produce proteins that are critical for development. They also failed to activate a group of transposable elements called endogenous retroviruses. In particular, a lack of Stella significantly reduced the activity of an endogenous retrovirus called MuERV-L.

Further experiments showed that MuERV-L is needed for normal embryonic development, and so suggests that transposable elements play important roles in this process. Future studies will aim to explore these roles in more detail. It will also be important to identify the genes that Stella targets in embryos, and to investigate the roles that similar maternally inherited proteins play in early embryonic development.

DOI: http://dx.doi.org/10.7554/eLife.22345.002

Introduction

Maternally inherited factors in the zygote play a critical role during early development (Ancelin et al., 2016; Li et al., 2010; Wasson et al., 2016). The maternal-to-zygotic transition (MZT) marks the time of transfer of developmental control to the embryo following activation of the zygotic genome (Lee et al., 2014; Li et al., 2013). In mice, the major wave of zygotic genome activation (ZGA) occurs at the late 2-cell stage (Golbus et al., 1973). The earliest zygotically transcribed genes are preferentially enriched in essential processes such as transcription, RNA metabolism and ribosome biogenesis (Hamatani et al., 2004; Xue et al., 2013). Other events that characterise early pre-implantation development include extensive erasure of global DNA methylation and dynamic changes in histone modifications (Cantone and Fisher, 2013).

Previous studies have shown that Stella, encoded by the Dppa3 gene locus, is a maternally inherited factor that is required for normal pre-implantation development (Bortvin et al., 2004; Payer et al., 2003). Stella is a small basic protein with a putative SAP-like domain and splicing-like factor; it also contains nuclear localisation and export signal and has the potential to bind to DNA and RNA in-vitro (Nakamura et al., 2007; Payer et al., 2003) (Figure 1—figure supplement 1A). Expression of Stella is high in oocytes, continues through pre-implantation development, and subsequently occurs specifically in primordial germ cells. Stella is also expressed in naïve embryonic stem cells, but is downregulated upon exit from pluripotency (Hayashi et al., 2008). Lack of maternal inheritance of Stella results in developmental defects; these manifest from the 2-cell stage onwards, and result in only a small proportion of embryos developing to the blastocyst stage. Notably, a maternally inherited pool of Stella in zygotic Stella knockout embryos, derived from Stella heterozygous females, allows development to progress normally. This indicates Stella has a key role during the earliest developmental events (Payer et al., 2003).

Stella was suggested to protect maternal pronuclei (PN) from TET3 mediated active DNA demethylation (Nakamura et al., 2007, 2012) (Figure 1—figure supplement 1B). In the absence of Stella, zygotes display enrichment of 5hmC in both parental PN (Wossidlo et al., 2011), and an aberrant accumulation of γH2AX in maternal chromatin (Nakatani et al., 2015). In addition, Stella protects DNA methylation levels at selected imprinted genes and transposable elements (Nakamura et al., 2007). However, embryos depleted of maternal effect proteins known to regulate imprinted genes only exhibit developmental defects post-implantation (Denomme and Mann, 2013), implying that the 2-cell phenotype in Stella maternal/zygotic knockout (Stella M/Z KO) embryos is not primarily linked with imprint defects. Moreover, TET3 only partially contributes to DNA demethylation and its absence is compatible with embryonic development (Amouroux et al., 2016; Peat et al., 2014; Shen et al., 2014; Tsukada et al., 2015). Thus, what impairs pre-implantation embryonic development in the absence of maternal Stella remains unclear.

A significant number of transposable elements (TEs) are preferentially activated during early development and in a sub-population of mouse embryonic stem cells (Gifford et al., 2013; Macfarlan et al., 2012; Rowe and Trono, 2011). Notably, at the 2-cell (2C) stage in mouse development, there is selective upregulation of endogenous retrovirus (ERVs), which are a subset of TEs characterised by the presence of LTRs that mediate expression and retrotransposition (Kigami et al., 2003; Ribet et al., 2008). Growing evidence suggests that activation of some TEs has important biological functions during early development (Beraldi et al., 2006; Kigami et al., 2003). TEs can regulate tissue-specific gene expression or splicing through their exaptation as gene regulatory elements, and may also play a key role during speciation (Böhne et al., 2008; Gifford et al., 2013; Rebollo et al., 2012). TEs additionally drive expression of genes directly by acting as alternative promoters that generate chimeric transcripts, which include both TE and protein-coding sequences (Macfarlan et al., 2012; Peaston et al., 2004). Thus the expression of TEs may be functionally important during early embryo development and understanding the regulation of TEs themselves is therefore of great interest.

We adopted an unbiased approach to investigate the role of Stella during mouse maternal-to-zygotic transition, using single-cell/embryo RNA-seq analysis of mutant embryos (Deng et al., 2014; Xue et al., 2013; Yan et al., 2013). We find Stella M/Z KO 2-cell embryos fail to upregulate key zygotic genes involved in regulation of ribosome and RNA processing. Furthermore, the absence of Stella results in widespread misregulation of TEs and of chimeric transcripts that are derived from these TEs. In particular, Stella M/Z KO embryos exhibit a general failure to upregulate MuERV-L transcripts and protein at the 2-cell stage. Our perturbation data is consistent with MuERV-L playing a functionally important role during pre-implantation embryonic development, implying that MuERV-L is amongst the critical factors affected by Stella in early embryos.

Results

Stella M/Z KO embryos are transcriptionally distinct from wild type embryos

We used Stella knockout mice (Payer et al., 2003), to collect mutant oocytes and embryos for single-cell/embryo RNA-seq with modifications (Tang et al., 2010). Stella knockout (KO) oocytes and Stella maternal/zygotic knockout (Stella M/Z KO, KO) 1-cell and 2-cell embryos, lacking both maternal and zygotic Stella, were harvested. Strain matched wild type (WT) oocytes and embryos served as controls (Figure 1A; Figure 1—figure supplement 2A). Following normalisation, potentially confounding technical factors were controlled for using Removal of Unwanted Variation (RUV) (Risso et al., 2014) (Figure 1—figure supplement 2B, Figure 1—source data 1, Supplementary file 1).

Figure 1. Stella M/Z KO embryos are impaired in maternal-to-zygotic transition.

(A) A schematic illustration of the single-cell / embryo RNA-seq experimental setup. The total number of oocytes and embryos collected are indicated. The colour scheme represents the transition from maternal (green) to zygotic (red) transcripts. Maternal Stella is represented in green circle. (B) A score plot of the first three principal components for 66 cells using gene counts. The lower panels represent the score plots of the first two principal components using cells belonging to a specific developmental time point. The developmental time points are indicated by colour and the genotypes for Stella are indicated by shape. (C) A Venn diagram illustrating the overlap of differentially expressed genes (DEG) between WT and KO oocytes, 1-cell and 2-cell embryos (adjusted p-value<0.05) (Figure 1—source data 2). (D) The heatmap represents the log2 ratio of the number of upregulated to downregulated genes in KO compared to WT at oocytes, 1-cell and 2-cell stage, belonging to a given cluster of DBTMEE (Park et al., 2015). Fisher’s exact test performed and statistically significant p-values are stated. (E) A time-series clustering of gene expression dynamics across oocyte to 2-cell embryo (see Materials and methods). (Left) Heatmap shows a cluster of WT maternal transcripts (ED), which are differentially expressed in KO samples (adjusted p-value<0.05, Figure 1—source data 3). (Right) Heatmap shows a cluster of WT zygotically activated genes (ZAG) (EU), which are differentially expressed in KO samples (adjusted p-value<0.05, Figure 1—source data 4). Also see Figure 1—figure supplements 12 and Figure 1—source data 14.

DOI: http://dx.doi.org/10.7554/eLife.22345.003

Figure 1—source data 1. Gene counts for WT and KO oocyte, 1-cell and 2-cell embryos.
Gene counts for 66 samples from single cell / embryo RNA-seq experiments. Removal of Unwanted Variation analysis has been performed to control for potentially confounding technical factors.
DOI: 10.7554/eLife.22345.004
Figure 1—source data 2. List of differentially expressed genes between WT and KO samples at oocyte, 1-cell and 2-cell stage, from single cell/embryo RNA-seq analysis.
ENSEMBL ID and names of genes identified as significantly differentially expressed with adjusted P-value (padj) <0.05. Log2FoldChange=Log2(WT/KO). Stage indicates the developmental stage that is analysed.
DOI: 10.7554/eLife.22345.005
Figure 1—source data 3. List of maternal transcripts, WT class = ED, which are differentially expressed in KO samples.
‘E’ denotes equally expressed, ‘D’ denotes downregulated or ‘U’ denotes upregulated at a later time point. Adjusted p-value (padj) shows significant differential expression between WT and KO class.
DOI: 10.7554/eLife.22345.006
Figure 1—source data 4. List of zygotically activated genes (ZAG), WT class = EU, which are differentially expressed in KO samples.
‘E’ denotes equally expressed, ‘D’ denotes downregulated or ‘U’ denotes upregulated at a later time point. Adjusted p-value (padj) shows significant differential expression between WT and KO class.
DOI: 10.7554/eLife.22345.007

Figure 1.

Figure 1—figure supplement 1. Stella protein domains.

Figure 1—figure supplement 1.

(A) A schematic diagram illustrating the putative Stella protein domains: SAP-like domain, splicing-like factor, nuclear export and nuclear localization signals (Aravind and Koonin, 2000; Payer et al., 2003). (B) A schematic diagram illustrating the N-terminus of Stella (amino acids 1–75) is required for H3K9me2 binding in maternal pronuclei (PN), while the C-terminus of Stella (amino acids 76–150) is required to exclude TET3 from the maternal PN (Nakamura et al., 2012).
Figure 1—figure supplement 2. Quality control of single-cell / embryo RNA-seq.

Figure 1—figure supplement 2.

(A) A scatter plot of the number of reads mapped to exons against the proportion of reads mapped to mitochondrial genes. The dashed red line indicates threshold of a good quality sample if: (1) cells have a total number of reads mapped to exons greater than 500,000; (2) cells have less than 10% of the proportion of reads mapped to 37 genes on the mitochondria chromosome. All 66 samples are of good quality. (B) A score plot of the first two principle components for cells of each developmental stage. RUV = removal of unwanted variation (see Materials and methods, Figure 1—source data 1). Top panel shows samples clustering pre-RUV, and bottom panel shows samples clustering post-RUV. Each batch represents independent experiments in which samples are collected, and this is indicated by a different colour (Supplementary file 1).

Principal component (PC) analysis revealed that the first PC represents progression from oocyte to the 2-cell stage (Figure 1B). The largest separation is observed between oocyte/1-cell and 2-cell embryos, consistent with the degradation of maternal transcripts and activation of the zygotic genome at MZT. The second and third PCs capture the separation between WT and KO samples, suggesting distinct genome-wide gene expression changes between WT and KO embryos. Furthermore, although KO oocytes and 1-cell embryos exhibit some separation from their WT counterparts, the difference is more clearly pronounced at the 2-cell stage. Notably, Stella M/Z KO 1-cell and 2-cell embryos clustered separately from each other, suggesting that Stella M/Z KO defects at the 2-cell stage are not simply due to delayed developmental progression (Figure 1B). Differential gene expression analysis between WT and KO samples identified 5881 misregulated genes (adjusted p-value<0.05) across the developmental stages with 360 genes overlapping across all stages (Figure 1C, Figure 1—source data 2).

Stella M/Z KO embryos are impaired in maternal-to-zygotic transition

Next, we compared the differentially expressed genes (DEG) against a public database of early mouse embryonic transcriptomes (DBTMEE) (Park et al., 2015). This dataset categorises genes by specific expression pattern at any developmental stage. Strikingly, we found significant enrichment of maternal transcripts (maternal RNA, minor zygotic genome activation and 1-cell transient genes) and depletion of zygotic transcripts (major zygotic genome activation, 2-cell transient and mid pre-implantation activation) in the Stella M/Z KO 2-cell embryos compared to WT (Figure 1D). While Stella KO oocytes are transcriptionally distinct from WT (Figure 1B), Stella KO oocytes do not exhibit overt abnormalities (Bortvin et al., 2004; Payer et al., 2003). Furthermore, the apparent enrichment in maternal transcripts only manifests at the 2-cell stage, suggesting the observed transcriptional differences are not inherited from aberrant transcripts in the Stella KO oocyte but are likely linked to failed downregulation of these maternal transcripts.

To independently characterise differences in gene expression dynamics between WT and KO embryos during MZT, genes were clustered based on their pattern of expression in WT oocytes, 1-cell and 2-cell embryos. We identified a cluster of maternal transcripts (ED), defined as highly expressed in WT oocytes and 1-cell and with reduced expression at the 2-cell stage, that are differentially expressed in KO samples (n = 849, adjusted p-value<0.05) (Figure 1E, Figure 1—source data 3). Of these genes, 37.7% did not show the expected reduction in expression in Stella M/Z KO 2-cell embryos. Furthermore, we detected a group of zygotically activated genes (ZAG) (EU) with low expression in WT oocytes and 1-cell embryos and increased expression in 2-cell embryos, which are differentially expressed in KO samples (n = 698, adjusted p-value<0.05). Moreover, 39.5% of this group of genes exhibited significantly dampened 2-cell activation in the absence of maternal and zygotic Stella (Figure 1E, Figure 1—source data 4). The combined analysis suggests that Stella M/Z KO embryos exhibit partial impairment in maternal-to-zygotic transition.

Stella M/Z KO 2-cell embryos fail to upregulate essential ribosomal and RNA processing genes

Gene ontology analysis of genes more highly expressed in KO than WT 2-cell embryos reveals an enrichment of chromatin modifiers, and genes involved in microtubule based processes and response to DNA damage (Figure 2A, Figure 2—source data 1), consistent with a previous observation of abnormal γH2AX enrichment in maternal PN (Nakatani et al., 2015). In addition, downregulated genes in Stella M/Z KO 2-cell embryos are enriched for RNA processing and ribosome biogenesis, with an overall depletion of genes associated with ribosomes in the KEGG pathway and related gene ontology (Figure 2B, Figure 2—figure supplement 1). The findings from single-cell/embryo RNA-sequencing were validated by independent qRT-PCR analyses. Consistently, genes associated with chromatin modifiers (Rbbp7 and Kdm1b) and DNA damage (Fam175a and Brip1) were more highly expressed in KO embryos (Figure 2C), while those associated with ribosome biogenesis (Nop16, Emg1), RNA binding (Larp1b) and cell cycle regulator (Cdc25c) showed lower expression in KO 2-cell embryos (Figure 2D). To explore potential regulatory targets of Stella, a genome-wide co-expression network analysis was performed across the WT dataset. Overall, 910 genes showed correlated expression patterns with Dppa3 (Pearson correlation >|0.9|) (Figure 2—source data 2). Of these, 710 genes are positively correlated while 200 genes are negatively correlated with Dppa3. Genes with a positive expression correlation with Dppa3 are significantly enriched for RNA processing and RNA splicing (Figure 2E), including Snrpd1 and Snrpb2, both core components of the spliceosome (Figure 2F).

Figure 2. Stella is associated with the activation of essential zygotic genes.

(A) Revigo plots (Supek et al., 2011) of a selection of gene ontology (GO) terms enriched in 2-cell DEG (Figure 2—source data 1). The colour of the circle represents the log10 p-value. Semantic space clusters GO terms of similar functions together. (B) A boxplot of normalised counts of genes belong to the Ribosome KEGG pathway between WT and KO across the developmental stages. Two-sided Wilcoxon rank sum test performed between WT and KO and statistically significant p-value is stated. (C and D) shows boxplots of single-embryo qRT-PCR validation of RNA-seq in WT (white) and KO (light blue) 2-cell embryos. All genes are normalised to housekeeping genes, relative to one WT embryo and log2 transformed. (C) shows genes significantly upregulated and (D) downregulated in KO relative to WT 2-cell embryos (p<0.05 = *; p<0.01 = **, p<0.001 = *** and p<0.0001 = ****). (E) A table of the top 10 GO terms enriched in 710 genes whose expression levels are positively correlated with Dppa3 (Pearson’s correlation coefficient >0.9), identified from the genome-wide co-expression network analysis (Figure 2—source data 2). (F) Scatter plots of the expression of Snrpd1 and Snrpb2 against Dppa3. Gene expression was log2 transformed (counts per million + 1) and Pearson’s correlation analysis was performed. Also see Figure 2—figure supplement 1 and Figure 2—source data 12.

DOI: http://dx.doi.org/10.7554/eLife.22345.010

Figure 2—source data 1. List of gene ontology terms enriched in differentially expressed genes between WT and KO 2-cell embryos.
Gene ontology terms enriched for biological processes were calculated using DAVID software (version 6.7).
DOI: 10.7554/eLife.22345.011
Figure 2—source data 2. Genome-wide Dppa3 co-expression network analysis.
Table shows ensemble ID and official names of genes exhibiting significant expression correlation with Dppa3 (Pearson Correlation |r| > 0.9).
DOI: 10.7554/eLife.22345.012

Figure 2.

Figure 2—figure supplement 1. Ribosome associated genes are depleted in Stella M/Z KO 2-cell embryos.

Figure 2—figure supplement 1.

Comparison of normalised counts of genes belonging to the Ribosome associated gene ontology pathway between WT (white) and KO (blue) across the developmental stages. Two-sided Wilcoxon rank sum test was performed between WT and KO samples and statistically significant p-value stated.

Overall, our analysis suggests that the Stella M/Z KO 2-cell embryos do not effectively transition from maternal to zygotic control, as judged by the aberrant enrichment of maternal transcripts and depletion of zygotic transcripts. Genes regulating essential processes such as RNA processing are depleted in Stella M/Z KO 2-cell embryos and Dppa3 exhibits positive expression correlation with genes enriched in these processes. Notably, most genes however do undergo appropriate MZT changes in mutant embryos, arguing against a general developmental delay. Taken together, this supports a hypothesis whereby Stella plays an important role in upregulating the expression of a subset of genes that are essential during zygotic genome activation.

Expression of TEs are dysregulated in Stella KO oocytes and Stella M/Z KO embryos

Stella has been shown to affect the DNA methylation of particular TEs in zygotes (Nakamura et al., 2007) and primordial germ cells (Nakashima et al., 2013). Since many TEs are specifically activated during early development, we investigated their expression in WT and KO oocytes and embryos. We remapped the single-cell/embryo RNA sequencing data to obtain read counts for class I TEs (retrotransposons) (Figure 3—source data 12). To get an overview of the extent of TE activation, we calculated the fraction of transcripts mapped to TEs as a ratio of total mapped reads. We found a dramatic increase in the ratio of reads mapped to TEs, primarily of the LTR class, at the 2-cell stage compared to oocyte and 1-cell, suggesting widespread TE activation coincident with MZT (Figure 3A, Figure 3B). Further inspection revealed that activation of the LTR class is dominated by upregulation of ERVL and ERVL-MaLR families, while LINE and SINE classes are primarily accounted for by L1 and Alu families, respectively (Figure 3—figure supplement 1A). Moreover, principal component analysis showed developmental progression can be clearly captured by TE expression alone along PC1 (Figure 3C), similar to recent studies identifying stage-specific transcription initiation of ERV expression in early human embryos (Göke et al., 2015; Grow et al., 2015). In addition, WT and KO oocytes and embryos can be partially separated in individual PCAs along each of the developmental stages, signifying differences in expression of TEs between WT and KO samples.

Figure 3. TEs are mis-expressed in the absence of Stella.

(A) Top panel shows boxplots of the ratio of reads mapped to TEs to total reads mapped in WT and KO oocytes, 1-cell and 2-cell embryos. The number of reads mapped to TEs are based on uniquely and multi-mapped counts (Figure 3—source data 2). (B) A heatmap of the relative expression of LTR, LINE and SINE in oocyte (green), 1-cell (yellow) and 2-cell embryos (red). Blue indicates higher expression and white indicates lower expression. Samples are clustered by row. (C) A score plot of the first three principal components for 66 cells using uniquely mapped TE counts (Figure 3—source data 1). The lower panels represent the score plots of the first two principal components using cells belonging to a specific developmental time point. The developmental time points are indicated by colour and the genotypes of Stella are indicated by shape. (D) A bar chart of the odds ratio of TEs up and downregulated in KO 2-cell embryos compared to WT, intersected with ‘maternal TEs’ and ‘zygotic TEs’. For maternal TEs enriched in TEs upregulated in KO 2-cell, ***p=5.35 × 10−6; for zygotic TEs enriched in TEs downregulated in KO 2-cell, ***p=8.13 × 10−7. (E) Bar plots showing the relative expression of TE families (LTR-ERVL, LINE1-L1 and SINE-Alu) in WT (white) and KO (light blue) oocytes, 1-cell and 2-cell embryos, data analysed from single-cell / embryo RNA sequencing. Two-sided Wilcoxon rank sum test performed between WT and KO samples and statistically significant p-values stated. (F) Top is an illustration of the structure of the full-length MuERV-L element flanked by 5’ and 3’ LTRs. Bottom is boxplots of the relative expression (counts per million) of MuERV-L Int and MT2_Mm transcript in WT (white) and KO (light-blue) oocytes, 1-cell and 2-cell embryos (p<0.05 corresponds to * and p<0.01 corresponds to **). (G) Immunofluorescence staining against MuERV-L Gag antibody in 2-cell embryos. (Left) The top panel shows bright-field, middle panel is MuERV-L Gag (green) and bottom panel are merged images of MuERV-L and DAPI, which counterstains DNA. Representative projections of z-stacks are shown for WT and KO embryos. (Right) A boxplot of the z-stack quantifications of the relative fluorescence intensity of MuERV-L Gag protein between WT and KO 2-cell embryos. Also see Figure 3—figure supplements 13 and Figure 3—source data 12.

DOI: http://dx.doi.org/10.7554/eLife.22345.014

Figure 3—source data 1. Uniquely mapped transposable element (TE) counts.
TE counts for 66 samples from single cell / embryo RNA-seq experiments. Uniquely mapped reads only.
elife-22345-fig3-data1.xlsx (492.4KB, xlsx)
DOI: 10.7554/eLife.22345.015
Figure 3—source data 2. Uniquely and multi-mapped transposable element (TE) counts.
TE counts for 66 samples from single cell/embryo RNA-seq experiments. Uniquely and multi-mapped reads.
elife-22345-fig3-data2.xlsx (503.1KB, xlsx)
DOI: 10.7554/eLife.22345.016

Figure 3.

Figure 3—figure supplement 1. TEs are mis-expressed in the absence of Stella.

Figure 3—figure supplement 1.

(A) Heatmaps of the relative expression of individual families of LTR, LINE and SINE elements in oocyte (green), 1-cell (yellow) and 2-cell (red) embryos. Samples are clustered by row. Blue indicated relatively high expression while white indicates relatively low expression of TEs. (B) Bar charts of relative expression of LTR elements, LINE and SINE in WT (white) and KO (light blue) oocytes, 1-cell and 2-cell embryos. Two-sided Wilcoxon rank sum test performed between WT and KO samples and statistically significant p-values are stated. (C) Validation of MuERV-L Gag antibody. (Left) IF staining of an ESC reporter line for MuERV-L – 2C::tdTomato (Macfarlan et al., 2012), illustrating the MuERV-L Gag (GFP) staining overlaps well with tdTomato+ ESC. (Right) IF staining showing robust detection of MuERV-L Gag (GFP) in 2-cell embryos, while it is virtually undetectable in metaphase II (MII) oocytes. DNA is counterstained with DAPI.
Figure 3—figure supplement 2. Dppa3 does not affect MuERV-L expression in mESCs.

Figure 3—figure supplement 2.

(A) Left shows histogram plots of tdTomato expression in 2C::tdTomato mESCs 48 hr and 96 hr after addition of DOX to induce Dppa3 expression. Red represents tdTomato frequency with Dppa3 over-expression (OE), while blue represents empty vector transfection control. The horizontal bars depict the expression threshold set for dtTomato and the proportion of tdTomato+ cells in each condition is stated. Right shows a bar plot of Dppa3 expression 48 hr and 96 hr after addition of DOX. Dppa3 expression is normalized to a housekeeping gene (GAPDH) and relative to empty vector sample. (B) Left shows histogram plots of tdTomato expression in 2C::tdTomato mESCs 48 hr and 96 hr after siRNA transfection. Red represents tdTomato frequency with Dppa3 siRNA, while blue represents scramble siRNA control. The bar depicts the expression threshold set for dtTomato and the proportion of tdTomato+ cells in each condition is stated. Right shows a bar plot of Dppa3 expression 48 hr and 96 hr after siRNA transfection. Dppa3 expression is normalized to a housekeeping gene (GAPDH) and relative to scramble siRNA sample.
Figure 3—figure supplement 3. Quality control of TE reads mapping and normalization.

Figure 3—figure supplement 3.

(A) A bar plot of the proportion of multi-mapped TE reads (Figure 3—source data 2) which can be unambiguously assigned at the level of class (repClass), family (repFamily) and element (repName) based on the RepeatMasker annotation provided by UCSC table browser. (B) A heatmap of the expression of LTR, LINE and SINE in oocyte (green), 1-cell (yellow) and 2-cell (red) embryos based on between-sample normalization method (See Materials and methods). Samples are clustered by row.

Stella is associated with the expression of specific TE

We identified ‘maternal TEs’, defined as TEs expressed higher in WT oocytes than 2-cell embryos and ‘zygotic TEs’, as TEs expressed higher in WT 2-cell embryos than oocytes. To characterise the disparities between TE expression in WT and KO, we compared differentially expressed TEs at the 2-cell stage, with maternal and zygotic TEs (Figure 3D). Strikingly, TEs upregulated in Stella M/Z KO 2-cells are significantly enriched for maternal TEs while TEs downregulated in Stella M/Z KO 2-cells are associated with zygotic TEs. Furthermore, Stella M/Z KO 2-cell embryos display lower activation of LTR, specifically the endogenous retrovirus (LTR-ERVL) family, with small relative enrichment for LINE and SINE classes (Figure 3E, Figure 3—figure supplement 1B).

A particular element of interest is MuERV-L (Kigami et al., 2003; Ribet et al., 2008), which encodes a canonical retroviral Gag and Pol, flanked by 5’ and 3’ LTR (also known as MT2_Mm). We observed a selective upregulation of MuERV-L-Int and MT2_Mm transcripts at the 2-cell stage, but this was significantly reduced in Stella M/Z KO embryos (Figure 3F). Many MuERV-L elements have an open reading frame and we therefore tested protein levels using a MuERV-L Gag antibody. This antibody was validated by showing staining overlaps well with a MuERV-L reporter ESC line – 2C::tdTomato (Macfarlan et al., 2012), and is specific to 2-cell embryos whilst not detected in metaphase II oocytes (Figure 3—figure supplement 1C). Strikingly, immunofluorescence (IF) staining showed a significantly lower signal and loss of perinuclear localisation of MuERV-L Gag (Ribet et al., 2008) in Stella M/Z KO 2-cell embryos compared to WT, implying Stella influences both transcript and protein levels of MuERV-L (Figure 3G).

To assess whether Stella directly binds to specific transcripts / TEs we turned to ESC since obtaining sufficient 2-cell embryos for this analysis is intractable. Chromatin immunoprecipitation-seq (ChIP-seq) of HA-tagged Stella resulted in minimal peak calls (n = 56) (Supplementary file 2), implying Stella does not efficiently bind DNA (including MuERV-L) in ESC. Consistently, neither overexpression nor knockdown of Dppa3 in mESC had a marked effect on the expression of MuERV-L (Figure 3—figure supplement 2). These results suggest that Stella-mediated control of MuERV-L is likely specific to the unique chromatin/cellular context of early embryos. Notably Stella has the potential to bind both RNA and DNA and could thus affect TEs through multiple modes. Overall, our data suggests Stella plays a key role in regulating a subset of TEs specifically in 2-cell embryos, which includes promoting strong activation of MuERV-L elements.

Expression of a subset of TEs is positively correlated with their nearest gene

Next we explored the relationship between TE and protein-coding gene expression in WT and KO samples. First, we considered the intersection between differentially expressed genes and genes that are neighbours of misregulated TEs at the 2-cell stage, and discovered a highly significant overlap of 12% between the two populations (p=3.19 × 10−87) (Figure 4A). Notably, a greater number of TEs downregulated in KO 2-cell embryos are located within 10 kb of the transcriptional start site of zygotically activated genes (ZAG, n = 698) than expected by chance (p<10−4) (Figure 4B). Altogether, this suggests perturbations in TE expression may be linked to neighbouring mRNA expression levels in Stella M/Z KO embryos. To determine if this relationship is applicable genome-wide, we performed global expression correlation analysis between a TE and its nearest gene. To eliminate confounding factors, we excluded TEs that overlap directly with an exon. Notably, we detected 387 TEs whose expression levels were positively correlated with their nearest gene (Spearman’s correlation >0.7) while 224 TEs were negatively correlated (Spearman’s correlation <−0.7) (Figure 4—figure supplement 1A). This relationship was validated using single-embryo qRT-PCR at four genomic loci, which demonstrated a significant correlation between the expression of a TE and its downstream gene (Figure 4C). These loci were selected to represent a range of LTR class elements; RMER19A, ORR1A2-Int and MT2B are further categorised into ERVK, ERVL-MaLR and ERVL families respectively. Interestingly, 2 of the genes, Chka and Zfp54, have been implicated in regulation of early embryonic development (Albertsen et al., 2010; Wu et al., 2008). Thus, the expression levels of a subset of TEs are highly correlated with the expression of their nearest gene.

Figure 4. A positive correlation between the expression of a subset of TEs and their nearest gene.

(A) A Venn diagram illustrating a significant overlap between differentially expressed genes and genes within ±20 kb of differentially expressed TEs in WT v KO 2-cell embryos. (B) A histogram of the mean number of depleted TEs in Stella M/Z KO 2-cell embryos within ±10 kb of the TSS of a group of genes. The red line represents zygotically activated genes (ZAG, n = 698) that belong to WT class EU and differentially expressed in KO samples (adjusted p-value<0.05) (Figure 1—source data 4). The grey bars represent each of 10,000 random sets of 698 genes. (C) Top is schematic illustrations of the relationship between the TE and downstream gene, where grey box represents the TE and its orientation; white box represents the gene and its orientation. Bottom is scatter plots of qRT-PCR expression in 2-cell embryos. WT (white circle) n = 19 and KO (light blue triangle) n = 17. Spearman’s correlation analysis was performed and p<0.0001 corresponds to ****. (D) Characterisation of chimeric transcripts. (Middle) A schematic illustration of the alternative splicing event that occurred between the TE and the downstream gene. SD = splice donor site and SA = splice acceptor site. The endogenous starting codon (ATG) is indicated. (Left) The PCR product of the chimeric transcript where the forward primer originates from the TE and reverse primer originates from the gene. The PCR product is significantly shorter than the annotated distances between them, confirming a splicing event. (Right) Sanger sequencing chromatographs from the PCR product for each chimeric transcript, confirming the site of chimeric junction. * Indicates a different nucleotide to annotation. (E) Box plots of single-embryo qRT-PCR of the relative expression of chimeric transcripts in WT (white) and KO (light blue) 2-cell embryos. p<0.01 corresponds to ** and p<0.001 corresponds to ***. All expression data are normalised to three housekeeping genes and relative to 1 WT embryo. Also see Figure 4—figure supplement 1.

DOI: http://dx.doi.org/10.7554/eLife.22345.020

Figure 4.

Figure 4—figure supplement 1. A subset of TEs is positively correlated with its nearest genes.

Figure 4—figure supplement 1.

(A) A histogram of Spearman’s correlation of the expression between a gene and its nearest TE within a distance of 1Mbp. The red line indicates correlation >|0.7|. 387 TE / gene pair correlation >0.7 while 224 TE/ gene pair correlation <−0.7. (B) (Left) A bar chart of MuERV-L int transcript expression in individual WT (white) and KO (grey) 2-cell embryos. Red star indicates 3 KO embryos expressing the lowest MuERV-L transcript (KO2,8,1) while green circle highlights KO6, the KO embryo which expresses the highest MuERV-L levels. The right panel shows a heatmap of expression of genes identified to make a chimeric junction with MuERV-L (Macfarlan et al., 2012). Heatmap is normalised by row and clustered by column.

Whilst association does not equate to causality, there are a number of ways TEs have been implicated in the regulation of gene expression (Elbarbary et al., 2016; Thompson et al., 2016). One possibility is the establishment of alternative splicing events, which create a junction between an expressed TE and a downstream gene exon, thus leading to the formation of a chimeric transcript (Peaston et al., 2004). Indeed, MuERV-L has been shown to form such fusion transcripts with 307 genes at the 2-cell stage (Macfarlan et al., 2012). Here, we find Stella M/Z KO embryos with the lowest MuERV-L expression also express lower levels of genes that make a chimeric junction with MuERV-L elements (Figure 4—figure supplement 1B). In contrast, the KO embryo with the highest MuERV-L expression expresses higher levels of these genes. Importantly, we next identified and validated a number of chimeric transcripts that are expressed in 2-cell embryos by PCR and sequenced across the chimeric junction (Figure 4D). Single-embryo qRT-PCR shows most of these chimeric transcripts are expressed at lower levels in KO compared to WT 2-cell embryos (Figure 4E). These chimeric transcripts include Snrpc, which encodes for a component of the ribonucleoprotein required for the formation of the spliceosome, and the RNA binding protein, Rmb8a. Our data implies a link between activation of some TEs and the emergent gene expression network, which is perturbed in Stella M/Z KO embryos due to altered TE expression. This is, in part, mediated by TEs directly regulating gene expression through chimeric transcripts.

MuERV-L plays a functional role during pre-implantation development

We propose that the dysregulation in TE expression might directly contribute to the abnormal pre-implantation phenotype observed in Stella M/Z KO embryos. We focused on assessing the function of MuERV-L elements, as they represent one of the most highly activated TEs at the 2-cell stage (Kigami et al., 2003), as well as being amongst the most significantly downregulated in Stella M/Z KO embryos (Figure 3F,G). We considered that attenuated MuERV-L activation in Stella M/Z KO embryos could in turn disrupt activation of associated 2-cell transcripts driven by their LTRs (Figure 4—figure supplement 1B). Alternatively, MuERV-L mRNA/protein itself may also have a functional role during early embryonic development.

We detected 583 copies of the full-length MuERV-L element, containing the complete gag and pol genes, in the genome. This makes it experimentally challenging to completely eliminate MuERV-L activation in early embryos. Nevertheless, we utilised siRNA to knockdown the expression of MuERV-L elements in 1-cell embryos and monitored the effects on embryonic development (Figure 5A). We designed siRNA to target the consensus sequence of MuERV-L from the Dfam database (Hubley et al., 2016) (Figure 5B). Computational analysis reveals MuERV-L siRNA targets 80.5% of full-length MuERV-L elements with perfect match and 99.5% with 2 bp mismatches, while scramble siRNA has no targets with 2 bp mismatches. At 20 μM siRNA, we observed a slight decrease in MuERV-L transcripts (Figure 5C) but a significant reduction in MuERV-L Gag protein staining in 2-cell embryos (Figure 5D). These embryos exhibited a mild developmental delay at 4 to 8 cell stage progression on day 2, with fewer embryos reaching the blastocyst stage on day 4, and those that did were smaller and with fewer cells (Figure 5E). This suggests that a modest reduction in MuERV-L at the 2-cell stage has a notable effect on pre-implantation development.

Figure 5. MuERV-L plays a functional role during pre-implantation development.

(A) A schematic diagram illustrating the experimental setup. (B) Left is an illustration of the structure of the MuERV-L element (GenBank accession number: Y12713), depicting the exact site of siRNA target. Right is a table showing the number of full-length MuERV-L elements targeted by MuERV-L siRNA and the number of off-targets. (C) Box plots of single-embryo qRT-PCR of MuERV-L pol transcript and D) MuERV-L Gag protein (fluorescence) intensity in uninjected controls, 20 µM scramble siRNA or 20 µM MuERV-L siRNA injection 2-cell embryos. (Right) Representative z-stack projections of immunofluorescence (IF) staining against MuERV-L Gag in 2-cell embryos. Two-tail Wilcoxon Rank Sum Test was performed for both (C) and (D) and p<0.01 corresponds to *, not significant (n.s.) or are otherwise stated. (E) The effect of MuERV-L siRNA on developmental progression of pre-implantation embryos. (Left) Bar graphs illustrating the number of embryos observed at different stages on day 2–4 of in-vitro culture. Blue background indicates area of interest. Experiments were repeated twice and the circle dots represent the data points for each replicate. A total of n = 17, n = 58 and n = 53 embryos were analysed for uninjected, scramble and MuERV-L siRNA injections respectively. One-tailed Student’s T-test performed and there were no statistical difference between uninjected, scramble or MuERV-L siRNA injected groups. (Middle) Day five bright field images of late blastocysts injected with scramble or MuERV-L siRNA. (Right) A box plot of IF quantification of number of DAPI+ cells in the late blastocyst stage on day five injected with 20 µM of scramble (white) or MuERV-L (grey) siRNA. (F and G) are box plots of single-embryo qRT-PCR of MuERV-L pol transcript and MuERV-L Gag protein (fluorescence) intensity in uninjected control, 80 µM scramble siRNA and 80 µM MuERV-L siRNA injected 2-cell embryos. (Right) Representative z-stack projections of IF staining against MuERV-L Gag antibody in 2-cell embryos. Two-tail Wilcoxon Rank Sum Test was performed for both (F) and (G) and p<0.01 corresponds to *, not significant (n.s.) or are otherwise stated. (H) A bar graph plotting the number of embryos observed at each stage in day two in-vitro culture in uninjected embryos, 80 µM scramble (white) or MuERV-L (grey) siRNA injection. Experiments were repeated twice and the circle dots represent the data points for each replicate. A total of n = 17, n = 54 and n = 50 embryos were analysed for uninjected, scramble and MuERV-L siRNA respectively. One-tailed Student’s T-test performed. p<0.05 corresponds to * and p<0.01 corresponds to **.

DOI: http://dx.doi.org/10.7554/eLife.22345.022

Figure 5.

Figure 5—figure supplement 1. Chimeric transcript expression in MuERV-L knockdown embryos.

Figure 5—figure supplement 1.

A scatter dot plot of qRT-PCR analysis of chimeric transcripts in uninjected (circle, n = 4), 80 µM scramble siRNA (square, n = 12) and 80 µM MuERV-L siRNA (triangle, n = 8) treated 2-cell embryos. The median and interquartile range is indicated. All transcripts are normalised to housekeeping genes, relative to one uninjected embryo. Two-sided Wilcoxon rank sum test was performed and statistically significant results are stated.

There is nonetheless variability in MuERV-L expression at the 2-cell stage, as shown in other studies (Ancelin et al., 2016), and we therefore speculated that a higher concentration of siRNA might increase the repression efficiency of MuERV-L transcripts. Indeed, 80 μM MuERV-L siRNA was more effective (Figure 5F and G), and in turn resulted in >90% embryos arrested at the 2-cell stage (Figure 5H), albeit some indirect effects at higher concentrations of siRNA may contribute to the embryonic arrest. Notably, MuERV-L knockdown resulted in repression of some chimeric transcripts previously characterised in 2-cell embryos (Figure 4D, Figure 5—figure supplement 1). These data collectively suggest MuERV-L may have a functional role in embryonic development.

Discussion

The maternal-to-zygotic transition is a complex process that entails extensive global transcriptional and epigenetic changes. Maternally inherited factors such as Stella play a role in this transition in mouse oocytes but the difficulties of obtaining sufficient materials has hindered investigations on how they contribute to MZT. Here, we use single-cell/embryo approaches and find that deletion of maternal and zygotic Stella affects MZT. Furthermore, we observed a failure to activate endogenous retrovirus (LTR-ERVL), and specifically MuERV-L elements. In parallel there is an accumulation of maternal transcripts and a failure to upregulate many zygotic transcripts. These events appear to be partially coupled, with altered LTR promoter activity leading to changes in gene expression, for example through the modulation of chimeric transcripts. In addition, we reveal the biological relevance of TE expression as knockdown of MuERV-L at the 2-cell stage hinders developmental progression, which likely contributes to the phenotype observed following loss of Stella.

Maternal mRNA degradation and zygotic genome activation are critical aspects during MZT (Li et al., 2013). Maternal mRNA clearance plays both a permissive and instructive role (Tadros and Lipshitz, 2009), while ZGA also promotes the final degradation of maternal transcripts. Stella has the potential to function as an RNA binding protein, which might facilitate the degradation of maternal transcripts (Nakamura et al., 2007; Payer et al., 2003). In addition, Stella KO oocytes exhibit transcriptional differences compared to WT oocytes. This is consistent with a previous study showing Stella KO oocytes are defective in transcriptional repression (Liu et al., 2012), which may be needed for dosage regulation of critical maternally inherited transcripts. Stella has also however been demonstrated to be required beyond oogenesis (Nakamura et al., 2007). Maternally inherited factors likely influence the timing and specificity of gene expression during ZGA, which is essentially determined by interplay between the transcriptional machinery and histone modifications that influences chromatin accessibility (Lee et al., 2014).

The failure of zygotic transcript activation may be linked to the failure of zygotic TE activation. We detected extensive changes in global TE expression during MZT. This is accompanied by strong upregulation of the LTR-ERVL family in 2-cell embryos, which is consistent with a recent study showing that the chromatin surrounding ERVL is in a highly accessible state at the 2-cell stage (Wu et al., 2016). LTR-ERVL upregulation is significantly reduced in the absence of maternal Stella, suggesting that Stella may facilitate the activation of many ERVs. The expression changes are validated at the protein level for a specific LTR-ERVL element, MuERV-L. Importantly, the decrease in MuERV-L is unlikely to be a secondary effect of 2-cell embryonic arrest, as maternal Kdm1a (LSD1) mutant embryos, which result in pan 2-cell arrest maintain normal MuERV-L transcript expression (Ancelin et al., 2016). Notably, whilst many epigenetic mechanisms have been identified for the suppression of ERVs (Gifford et al., 2013), we have identified a factor associated with the activation of a subset of ERVs during mammalian MZT.

The question remains, how does Stella regulate TE expression? In primordial germ cells, Stella directly binds to LINE1 and IAP elements to facilitate DNA demethylation of these targets (Nakashima et al., 2013). Whilst experimentally challenging, it would be pertinent to determine potential DNA binding targets of Stella in 2-cell embryos. At the same time, function of Stella at the RNA level cannot be excluded, or it could act by altering the epigenetic status of cells at this time. For example, the H3K9 methyltransferase Setdb1 is aberrantly enriched in Stella M/Z KO 2-cell embryos, and this has been shown to suppress ERVs and chimeric transcripts expression (Eymery et al., 2016; Karimi et al., 2011; Kim et al., 2016). Since a human orthologue of Dppa3 is expressed in ESCs, germ cells and pre-implantation embryos (Bowles et al., 2003; Deng et al., 2014; Payer et al., 2003), it would be of interest to know whether DPPA3 plays a similar role during human pre-implantation development; for example, on the stage specific activation endogenous retroviruses (Göke et al., 2015; Grow et al., 2015).

TEs have evidently been co-opted for the regulation of mammalian development as exemplified by the domestication of the retroviral env gene, which is essential for placental development (Dupressoir et al., 2012). Here, we have shown that the expression of a subset of TEs are intimately linked to its nearest gene during early embryonic development. As we have shown in several cases, TEs could regulate gene expression through chimeric transcripts, which are misregulated in the absence of Stella. The impact of transcriptional inhibition of ERV programme and associated chimeric transcripts during MZT, and their subsequent effects on development merits further investigation. Intriguingly, we discovered that reducing MuERV-L mRNA/ protein levels at the 2-cell stage affects development. This could suggest that the presence of MuERV-L Gag protein has a functional role during early mouse development, similar to a recent report illustrating a role of endogenous LTR viral-like particles in human blastocyst development (Grow et al., 2015). It is also possible that MuERV-L siRNA KD may downregulate a critical subset of chimeric transcripts that include MuERV-L sequence or, could feedback to target transcriptional repression to MuERV-L loci through small RNA pathways (Castel and Martienssen, 2013). Disentangling these possibilities warrants further investigation.

In conclusion, we have revealed that Stella is involved in orchestrating the transcriptional changes and activation of endogenous retroviral (ERV) elements during maternal-to-zygotic transition. The appropriate expression of LTR-ERV driven zygotic genes and specific ERV elements (MuERV-L) in turn contribute towards normal early embryonic development in mice.

Materials and methods

Experimental methods

Collection of mouse oocytes and embryos

All husbandry and experiments involving mice were carried out according to the Home Office guidelines and the local ethics committee. Here, we refer to Stella as the protein encoded by the Dppa3 gene. Stella knockout (KO) mice were generated as previously described (Payer et al., 2003) (RRID:MGI:2683730) in a pure 129/SvEv strain. For RNA-seq, ovulated oocytes were collected from Stella KO or wild type (WT) females crossed with vasectomised male at 9am, embryonic day (E) 0.5. Stella KO females were crossed with Stella KO males to produce Stella maternal and zygotic knockout (Stella M/Z KO, KO) embryos, while Stella WT females were crossed with Stella WT males to obtain WT embryos. 1-cell and 2-cell embryos were collected at 4pm on E0.5 and 9am-3pm on E1.5 respectively. All oocytes and embryos were morphologically assessed to ensure only viable samples were collected.

Single embryo RNA sequencing and RT-qPCR

Oocytes were incubated with 0.3 mg/ml hyaluronidase (Sigma, St. Louis, MO) to remove the cumulus cells. Zona pellucida was removed from oocytes and embryos prior to lysis. RNA were extracted from single oocytes, 1-cell embryos or whole 2-cell embryos; and cDNA were amplified as described previously with modifications (Tang et al., 2010). Between 1:10,000–1:60,000 dilution of ERCC RNA Spike-In mix (ThermoFisher Scientific, Hemel Hempstead, UK) was added to lysis buffer to estimate efficiency of amplification (Jiang et al., 2011). 1:10 dilution of the cDNA was used for RT-qPCR. For relative expression, all genes / TE transcripts were normalised to three housekeeping genes (ARBP, PPIA and GAPDH). For primer sequences see Supplementary file 3. For RNA-seq, cDNA were fragmented to ~200 bp with Focused-ultrasonicator (Covaris, Woburn, MA) and adaptor ligated libraries were generated using NEBNext Ultra DNA library Prep Kit for illumina (New England Labs, Ipswich, MA). Single-end 50 bp sequencing was performed with HiSeq1500 (Illumina, San Diego, CA) to an average depth of 13.5 million reads per sample.

2-cell embryo immunofluorescence staining

2-cell embryos were fixed in 4% paraformaldehyde (PFA) in PBS for 15min at room temperature (RT). After permeabilisation with IF buffer (0.1% Trition, 1% BSA in PBS) for 30min at RT, embryos were incubated with primary antibody overnight at 4°C. After washing 3x with IF buffer, embryos were incubated with corresponding secondary antibody and 1 μg/ml of DAPI (4’,6’-diamidino-2-phenylindole) for DNA visualisation for 1 hr at RT. After washing, embryos were mounted in VECTASHIELD Mounting medium (Vector Laboratories, Peterborough, UK). For each experiment, WT and KO embryos were stained at the same time and processed identically using the same setting for confocal acquisition to allow comparison of relative fluorescence intensity. The following antibodies were used: rabbit anti-MuERV-L Gag (ER50102, Huangzhou HuaAn Biotechnology Co., LTD, China) (RRID:AB_2636876), 1:200; Alexa488 donkey anti-rabbit IgG (ab150073, Abcam, Cambridge,UK) (RRID:AB_2636877), 1:500.

siRNA injections

B6CBAF1/J (F1) female mice were superovulated by injection of 5 International Units (IU) of pregnant mare’s serum gonadotrophin (PMSG) (Sigma, St. Louis, MO), followed by 5 IU of human chorionic gonadotrophin (hCG) (Sigma, St. Louis, MO) after 48 hr and then mated with F1 male mice. Zygotes were harvested from oviducts 17–22 hr post hCG injection. Cumulus cells were removed by incubation with 0.3 mg/ml hyaluronidase in M2 medium (Sigma, St. Louis, MO). 20 µM or 80 µM of Stealth RNAi siRNA (ThermoFisher Scientific, Waltham, MA) were micro-injected into the cytoplasm of zygotes using a Femtoject 4i device (Ependorf, UK). The following are the siRNA sequences: scramble (sense: UUCCUCUCCACGCGCAGUACAUUUA) and MuERV-L (sense: GAAGAUAUGCCUUUCACCAGCUCUA). Injected embryos were cultured in M16 medium (Sigma, St. Louis, MO) in BD Falcon Organ Culture Dish at 5% CO2 and 37°C for a total of 5 days. The number of surviving 2-cell embryos 24 hr after micro-injection represents the total number of embryo analysed per experimental group. 20 µM or 80 µM siRNA injections were repeated twice independently.

Stella-HA chromatin immunoprecipitation

We tagged three copies of hemagglutinin (HA, 27 bp) to the C-terminus of Stella and used the high affinity HA antibody to pull down Stella bound chromatin. The Stella+HA construct was expressed in the absence of endogenous Stella by using Stella M/Z KO mESCs, with NANOG-HA and eGFP-HA as positive and negative controls respectively. The ChIP protocol was modified from a previously described Stella ChIP protocol (Nakamura et al., 2012). Briefly, cells were cross-linked with 1% paraformaldehyde (PFA) for 8 min at RT, quenched with 200 mM glycine for 5 min and washed twice with PBS. Samples were suspended in 2 ml/IP of radio immunoprecipitation assay (RIPA) buffer (50 mM Tris-HCl, pH8, 150 mM NaCl, 1 mM EDTA, 1% NP40, 0.5% deoxycholate and 0.1% SDS) with 1:20 dilution of proteinase inhibitor cocktail (Roche, Welwyn Garden City, UK) and sonicated to an average length of 200–1000 bp. Dynabeads Protein G (ThermoFisher Scientific, Hemel Hempstead, UK) were pre-incubated with anti-mouse HA antibody (MMS-101P-500, Covance Research Products Inc, Denver, CA) (RRID:AB_291261) at 7 µg/IP for 1 hr at 4°C. The antibody-beads complex was then incubated with chromatin solution overnight at 4°C. Beads were washed twice with RIPA buffer, twice with high-salt wash buffer (20 mM Tris-HCl, pH8; 500 mM NaCl, 1 mM EDTA, 1% NP-40, 0.5% deoxycholate and 0.1% SDS), twice with LiCl wash buffer (250 mM LiCl, 20 mM Tris-HCl, pH8, 1 mM EDTA, 1% NP-40 and 0.5% deoxycholate) for 10 min at 4°C and a final wash with Tris-EDTA. Chromatin were eluted by adding 100 µl of SDS-Elution buffer (50 mM Tris-HCl (pH 7.5), 10 mM EDTA, 1% SDS) and incubated at 68°C for 10 min. Reverse-crosslink was performed with 100 µl of proteinase K (1 mg/ml) for 2 hr at 42°C and 5 hr at 68°C. Finally, DNA were extracted with phenol purification and precipitated with NaCl into pellets. DNA pellets were re-suspended in 11 µl of H20. We detected enrichments of NANOG in known binding regions using ChIP-qPCR, but not for eGFP, supporting the efficacy of the ChIP (data not shown). Stella-HA ChIP sequencing libraries were prepared with NEBNext Ultra Library Prep Kit for Illumina (New England Labs, Ipswich, MA) with ~8 ng DNA input. Single-end 50 bp sequencing was performed with HiSeq2500 (Illumina, San Diego, CA). Stella -HA ChIP seq enriched peaks are shown in Supplementary file 2.

2C::tdTomato ESC culture and Dppa3 transfection

2C::tdTomato reporter (Addgene plasmid #40281) ESCs (Macfarlan et al., 2012) were cultured in GMEM (Life Technologies, Carlsbad, CA), 10% KnockOut Serum Replacement (ThermoFisher Scientific, Waltham, MA), 2 mM L-glutamine (Life Technologies), 0.1 mM MEM non-essential amino acids (Life Technologies), 100 U/ml penicillin and 100 µg/ml streptomycin (Life Technologies), 1 mM sodium pyruvate (Sigma), 0.05 mM 2-mercaptoethanol (Life technologies), LIF (1000 U/ml; ESGRO; Merck Milipore) with 2i (PD0325901 (1 µM; Stemgent, Cambridge, MA) and CHIR99021 (3 µM; Stemgent, Cambridge, MA). For Dppa3 over-expression, a Tet-ON3G inducible vector containing Dppa3 CDS was transfected into ESCs with Lipfectamine 2000 (ThermoFisher Scientific, Waltham, MA). 500 ng/ml of DOX was added on day 0. For Dppa3 knockdown experiment, ON-TARGETplus SMARTpool (Dharmacon, Lafayette, CO) which contain four siRNA targeting Dppa3 was transfected into ESCs using Lipofectamine RNAiMAX transfection reagent (ThermoFisher Scientific, Waltham, MA). The Dppa3 siRNA SMARTpool includes the following siRNA sequences: GGAUGAUACAGACGUCCUA; UAGAUAGGAUGCACAACGA; AGAAAGUCGACCCAAUGAA; GAGUAUGUACGUUCUAAUU. ON-TARGETplus Non-targeting (scramble) siRNA (Dharmacon) was transfected as negative control. Td-tomato expression was analysed with BD LSRFORTESSA (BD Biosciences, San Jose, CA).

Data processing and analysis

Mapping reads for gene-level counts and data processing

Single-end reads were mapped to the Mus musculus genome (GRCm38) using GSNAP (version 2014-07-21) with default options (Wu and Nacu, 2010). To detect splice junctions in reads, we extracted known splice sites from the GTF file of GRCm38 provided by Ensembl (release 79). Uniquely mapped reads were counted for each gene using htseq-count (Anders et al., 2015). We assessed quality of cells following the criteria previously described (Kolodziejczyk et al., 2015) (Figure 1—figure supplement 2A). To remove unwanted variation between batches, we applied a generalised linear model to the gene-level counts using the RUVs function of the RUVSeq package of R with k = 4 (Risso et al., 2014) (Figure 1—source data 1). To perform principal component analysis, the batch-adjusted gene counts were normalised by the size factor estimated by the DESeq2 package of R and lowly expressed genes whose mean normalised counts are below 100 were filtered out. The normalised counts were log-transformed and a pseudo count of 1 was added. The prcomp function of R was applied to the log-transformed gene counts by enabling the scaling option.

Differential expression analysis between WT and KO

From the batch-adjusted counts, we identified differentially expressed genes between WT and KO for each developmental stage using the DESeq function of the DESeq2 package of R with default options (Love et al., 2014). Genes that have an adjusted P-value less than a given FDR cutoff of 0.05 were considered as differentially expressed (Figure 1—source data 2).

Clustering of time-series data

To cluster the dynamic expression profile of genes in WT, we defined nine classes according to any differences between adjacent time points: EE, ED, EU, DE, DD, DU, UE, UD, UU, where ‘E’ denotes equally expressed, ‘D’ denotes downregulated or ‘U’ denotes upregulated at a later time point. A gene is considered to show significant differences between adjacent time points (‘D’ or ‘U’) if the adjusted P-value estimated from DESeq function of the DESeq2 package with a default option is less than 0.1. Otherwise, the difference is called as ‘E’. For example, ‘ED’ means (1) equally expressed between oocyte and 1-cell and (2) downregulated at 2-cell compared to 1-cell. Differential expression analysis was performed between WT and KO samples, adjusted p-value<0.05 is considered significantly different. WT class ED genes that are differentially expressed in KO samples are represented in the left heatmap (Figure 1E, Figure 1—source data 3). WT class EU genes that are differentially expressed in KO samples are represented in the right heatmap (Figure 1E, Figure 1—source data 4).

Gene ontology analysis

Gene ontology (GO) analyses were performed using DAVID 6.7 (https://david.ncifcrf.gov/home.jsp) for differentially expressed upregulated or downregulated genes in KO compared to WT 2-cell embryos. Adjusted p-values from BP_Fat were plotted with ReviGO (Supek et al., 2011) allowing for medium similarity, with a selection of GO terms displayed (Figure 2A, Figure 2—source data 1).

Genome-wide Dppa3 gene co-expression analysis

A Dppa3 co-expression network (Bassel et al., 2011) was constructed for genes expressed in WT oocyte, 1-cell, and 2-cell stages. The co-expression network can be represented as an adjacency matrix, C, with an edge being drawn between two genes i and j if the magnitude of the Pearson correlation coefficient between the log2 transformed CPM (log2 (CPM + 1)) was greater than a particular threshold value:

Ci,j={ρi,j,  |ρi,j|x,0,  |ρi,j|<x.

Mapping reads and normalisation for TE counts

To investigate the expression of class I TE (retrotransposons), we remapped the sequencing data to the same genome by randomly keeping only one genomic alignment of multi-mapped reads using GSNAP. From the GTF file of RepeatMasker provided by UCSC table browser (downloaded at 2015-08-25), uniquely mapped reads were counted for each genomic region of the TE (annotated as repName in RepeatMasker) using htseq-count (Figure 3—source data 1). We also generated multi-mapped counts by considering both uniquely and multi-mapped reads. To this end, the optional NH tags of all alignments in the SAM files were set to one and mapped reads were counted for each TE using htseq-count (Figure 3—source data 2). We show multi-mapped reads can be unambiguously assigned to a class (repClass in RepeatMasker) and family (repFamily in RepeatMasker) 98% of the time, however, only 54% of the time at the level of an element (repName in RepeatMasker) (Figure 3—figure supplement 3A). For this reason, all analysis at the level of retrotransposon class and family are based on multi-mapped counts, while analysis at the level of individual transposable elements are based only on uniquely mapped counts. To adjust for different sequencing depths, we normalised the TE counts by the total number of reads mapped to TEs and used the within-sample normalised values as our estimates of the expression levels of TEs (Figure 3B). The dynamic expression profiles of TEs during MZT were unchanged even when we normalised the TE counts by the size factor estimated by the DESeq2 package of R (between-sample normalisation method) (Figure 3—figure supplement 3B).

Stella ChIP-Seq analysis

We mapped single-end reads of ChIP-seq experiments to the Mus musculus genome (mm10) using Bowtie2 (version 2.2.7) with default options (--local) (Langmead and Salzberg, 2012). After including only uniquely mapped reads, peak calling was performed by HOMER (Heinz et al., 2010) (version 4.8.1) with default options (findPeaks –style factor), where the input sequencing run was used as a control. The peaks were associated with nearby genes (mm10) by using HOMER annotatePeaks.pl.

Correlation analysis between genes and TEs

Uniquely mapped BAM files were uploaded into SeqMonk (Version 0.32.0) (https://www.bioinformatics.babraham.ac.uk/projects/seqmonk/). The edgeR package (McCarthy et al., 2012) was used to identify differently expressed genes and TEs between WT and KO 2-cell embryos. We performed an intersection between differentially expressed genes and genes ±20 kb of differentially expressed TEs. A hypergeometric test was used to determine the significance of the overlap. For genome-wide analysis, we calculated Spearman’s rank correlation coefficient between a TE and its nearest neighbouring gene using the expression profile of 66 cells. We excluded all TEs overlapped with any exons including UTRs, allowing the max gap of 51 bp equal to the read length. The maximum distance between genes and TEs was set to 1Mbp.

Testing TE enrichments of zygotically activated genes (ZAGs)

To test whether TEs depleted in KO 2-cell embryos (adjusted P-value less than 0.05) are enriched in the neighbourhood of ZAGs, we counted the mean number of depleted TEs within ±10 kb of the transcription start site (TSS) of ZAGs (n = 698). ZAGs are defined as the WT class EU, which are differentially expressed in KO samples in the time-series cluster (Figure 1—source data 4). This process was then repeated 10,000 times for a randomly selected group of genes (n = 698). We excluded all depleted TEs overlapped with any exons. The observed mean number was then tested against a null model assuming no enrichment of depleted repeats in the neighbourhood of ZAGs by computing the empirical P-value of the observed mean number based on a null distribution of simulated mean numbers of depleted TEs (Figure 4B).

Chimeric transcript identification

We extracted split (split but mapped to the same chromosome) or translocated (mapped to different chromosomes) reads from the remapped SAM files of GSNAP with a novel splicing option (--novelsplicing=1 --distant-splice-penalty=0). Only one genomic alignment was kept for each multi-mapped read. From the CIGAR string of the extracted reads, we calculated the genomic positions of both ends of split or translocated reads. If one end of a read is overlapped with one of genes and the other end is overlapped with one of repeat elements unambiguously, we considered it as the read supporting chimeric transcripts between genes and TE. Chimeric transcripts were validated through PCR amplification across the predicted chimeric junction and the DNA sequence at the junction was validated with Sanger sequencing.

siRNA target analysis

To search full-length MuERV-L copies with the complete gag and pol genes, we aligned the sequences for MuERV-L gag and pol genes (GenBank accession number: Y12713) to the Mus musculus genome (GRCm38) using Bowtie2 (version 2.3.0) with options ‘-a -D 20 -R 3 -N 1 -L 20 –i S,1,0.50’. From the GTF file of RepeatMasker we used for TE counts, we extracted all MERVL-int elements overlapped with the alignments to the gag and pol genes (n = 583). To calculate the specificity of MuERV-L siRNA, we mapped the sequence of MuERV-L siRNA to the Mus musculus genome (GRCm38) using Bowtie2 (version 2.3.0) with options ‘-a -N 1 -L 10’. We stratified the SAM alignments into perfect match, one mismatch, and two mismatches by examining the ‘XM:i:<N>' field and counted the alignments overlapped with the full-length MERVL-int elements.

Confocal acquisition and image analysis

Full projections of images were taken every 0.5 μM (Figure 5E) or 1.5 μM (Figure 3G, Figure 3—figure supplement 1C, Figure 5D and G) along the z-axis with an inverted Leica TCS SP5 confocal microscope. For MuERV-L staining in 2-cell embryos, image analysis was carried out using Fiji (Schindelin et al., 2012). Measurements of fluorescent intensity were automated using a custom ImageJ macro to create a Huang thresholded 3D cell mask from the fluorescent signal and measure the mean intensity inside each embryo. For measurements of the number of DAPI+ cells in day five blastocysts (Figure 5E), we generated a Jython script. To allow reliable counting of clustered cells with widely varying amounts of labelling, a hierarchical k-means segmentation algorithm (Dufour et al., 2008) was implemented to generate a 3D mask, and the ImageJ 3D Object Counter plugin (Bolte and Cordelières, 2006) was used to determine how many separated objects were present by binary connexity analysis.

Acknowledgements

We thank Richard Butler for his support on the confocal imaging analysis, Charles Bradshaw for bioinformatic support, Todd S Macfarlan and Samuel L Pfaff for the 2C::tdTomato ESCs. We also thank members of the Surani lab for their critical input and helpful discussions on this project. The work was funded by a studentship to YH from the James Baird Fund, University of Cambridge, by the DGIST Start-up Fund of the Ministry of Science, ICT and Future Planning to JKK, by a core grant from EMBL and CRUK to JCM, by a Wellcome Trust Senior Investigator Award to MAS, and by a core grant from the Wellcome Trust and Cancer Research UK to the Gurdon Institute.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grants:

  • Wellcome 096738 to Yun Huang, Dang Vinh Do, Caroline Lee, Christopher A Penfold, Jan J Zylicz, Jamie A Hackett, M Azim Surani.

  • Wellcome 092096 to Yun Huang, Dang Vinh Do, Caroline Lee, Christopher A Penfold, Jan J Zylicz, Jamie A Hackett, M Azim Surani.

  • Cancer Research UK C6946/A14492 to Yun Huang, Dang Vinh Do, Caroline Lee, Christopher A Penfold, Jan J Zylicz, Jamie A Hackett, M Azim Surani.

  • James Baird Fund, University of Cambridge to Yun Huang.

  • European Molecular Biology Laboratory to Jong Kyoung Kim, John C Marioni, Jamie A Hackett.

  • Cancer Research UK to Jong Kyoung Kim, John C Marioni.

  • DGIST Start-up Fund of the Ministry of Science, ICT and Future Planning 2017010073 to Jong Kyoung Kim.

Additional information

Competing interests

The authors declare that no competing interests exist.

Author contributions

YH, Conceptualization, Resources, Data curation, Formal analysis, Validation, Investigation, Methodology, Writing—original draft, Writing—review and editing.

JKK, Resources, Data curation, Formal analysis, Investigation, Methodology, Writing—original draft, Writing—review and editing.

DVD, Data curation, Investigation, Methodology.

CL, Data curation, Investigation, Methodology.

CAP, Formal analysis, Investigation, Methodology.

JJZ, Writing—review and editing, Critical evaluation of the manuscript for important intellectual content.

JCM, Funding acquisition, Project administration, Writing—review and editing.

JAH, Conceptualization, Supervision, Writing—original draft, Writing—review and editing.

MAS, Conceptualization, Supervision, Funding acquisition, Writing—original draft, Project administration, Writing—review and editing.

Ethics

Animal experimentation: All husbandry and experiments involving mice were authorized by a UK Home Office Project License 80/2637 and carried out in a Home Office-designated facility.

Additional files

Supplementary file 1. Sample details for single-cell / embryo RNA sequencing.

DOI: http://dx.doi.org/10.7554/eLife.22345.024

elife-22345-supp1.xlsx (11.9KB, xlsx)
DOI: 10.7554/eLife.22345.024
Supplementary file 2. Stella ChIP-seq enriched peaks.

DOI: http://dx.doi.org/10.7554/eLife.22345.025

elife-22345-supp2.xlsx (14.3KB, xlsx)
DOI: 10.7554/eLife.22345.025
Supplementary file 3. A list of primer sequences used in this study.

DOI: http://dx.doi.org/10.7554/eLife.22345.026

elife-22345-supp3.xlsx (42.6KB, xlsx)
DOI: 10.7554/eLife.22345.026

Major datasets

The following dataset was generated:

Huang Y,Kim JK,Do DV,Lee C,Penfold CA,Zylicz JJ,Marioni JC,Hackett JA,Surani MA,2016,Transcription profiling of wild type and Stella knockout oocytes, wild type and Stella maternal/zygotic knockout embryos to study to the role of Stella in early mouse development,http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-5210/,Publicly available at the ArrayExpress (accession no: E-MTAB-5210)

The following previously published datasets were used:

Macfarlan TS,Gifford WD,Driscoll S,Lettieri K,Rowe HM,Bonanomi D,Firth A,Singer O,Trono D,Pfaff SL.,2012,2C::tomato ES cells, 2-cell embryos and wild type oocytes,https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE33923,Publicly available at the NCBI Gene Expression Omnibus (accession no: GSE33923)

References

  1. Albertsen M, Teperek M, Elholm G, Füchtbauer EM, Lykke-Hartmann K. Localization and differential expression of the Krüppel-associated box zinc finger proteins 1 and 54 in early mouse development. DNA and Cell Biology. 2010;29:589–601. doi: 10.1089/dna.2010.1040. [DOI] [PubMed] [Google Scholar]
  2. Amouroux R, Nashun B, Shirane K, Nakagawa S, Hill PW, D'Souza Z, Nakayama M, Matsuda M, Turp A, Ndjetehe E, Encheva V, Kudo NR, Koseki H, Sasaki H, Hajkova P. De novo DNA methylation drives 5hmc accumulation in mouse zygotes. Nature Cell Biology. 2016;18:225–233. doi: 10.1038/ncb3296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ancelin K, Syx L, Borensztein M, Ranisavljevic N, Vassilev I, Briseño-Roa L, Liu T, Metzger E, Servant N, Barillot E, Chen CJ, Schüle R, Heard E. Maternal LSD1/KDM1A is an essential regulator of chromatin and transcription landscapes during zygotic genome activation. eLife. 2016;5:e08851. doi: 10.7554/eLife.08851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anders S, Pyl PT, Huber W. HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Aravind L, Koonin EV. SAP - a putative DNA-binding motif involved in chromosomal organization. Trends in Biochemical Sciences. 2000;25:112–114. doi: 10.1016/S0968-0004(99)01537-6. [DOI] [PubMed] [Google Scholar]
  6. Bassel GW, Lan H, Glaab E, Gibbs DJ, Gerjets T, Krasnogor N, Bonner AJ, Holdsworth MJ, Provart NJ. Genome-wide network model capturing seed germination reveals coordinated regulation of plant cellular phase transitions. PNAS. 2011;108:9709–9714. doi: 10.1073/pnas.1100958108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Beraldi R, Pittoggi C, Sciamanna I, Mattei E, Spadafora C. Expression of LINE-1 retroposons is essential for murine preimplantation development. Molecular Reproduction and Development. 2006;73:279–287. doi: 10.1002/mrd.20423. [DOI] [PubMed] [Google Scholar]
  8. Bolte S, Cordelières FP. A guided tour into subcellular colocalization analysis in light microscopy. Journal of Microscopy. 2006;224:213–232. doi: 10.1111/j.1365-2818.2006.01706.x. [DOI] [PubMed] [Google Scholar]
  9. Bortvin A, Goodheart M, Liao M, Page DC. Dppa3 / Pgc7 / stella is a maternal factor and is not required for germ cell specification in mice. BMC Developmental Biology. 2004;4:2. doi: 10.1186/1471-213X-4-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bowles J, Teasdale RP, James K, Koopman P. Dppa3 is a marker of pluripotency and has a human homologue that is expressed in germ cell tumours. Cytogenetic and Genome Research. 2003;101:261–265. doi: 10.1159/000074346. [DOI] [PubMed] [Google Scholar]
  11. Böhne A, Brunet F, Galiana-Arnoux D, Schultheis C, Volff JN. Transposable elements as drivers of genomic and biological diversity in vertebrates. Chromosome Research. 2008;16:203–215. doi: 10.1007/s10577-007-1202-6. [DOI] [PubMed] [Google Scholar]
  12. Cantone I, Fisher AG. Epigenetic programming and reprogramming during development. Nature Structural & Molecular Biology. 2013;20:282–289. doi: 10.1038/nsmb.2489. [DOI] [PubMed] [Google Scholar]
  13. Castel SE, Martienssen RA. RNA interference in the nucleus: roles for small RNAs in transcription, epigenetics and beyond. Nature Reviews Genetics. 2013;14:100–112. doi: 10.1038/nrg3355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Deng Q, Ramsköld D, Reinius B, Sandberg R. Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells. Science. 2014;343:193–196. doi: 10.1126/science.1245316. [DOI] [PubMed] [Google Scholar]
  15. Denomme MM, Mann MR. Maternal control of genomic imprint maintenance. Reproductive BioMedicine Online. 2013;27:629–636. doi: 10.1016/j.rbmo.2013.06.004. [DOI] [PubMed] [Google Scholar]
  16. Dufour A, Meas-Yedid V, Grassart A, Olivo-Marin J. Automated quantification of cell endocytosis using active contours and wavelets. 19th International Conference on Pattern Recognition; 2008. [DOI] [Google Scholar]
  17. Dupressoir A, Lavialle C, Heidmann T. From ancestral infectious retroviruses to bona fide cellular genes: role of the captured syncytins in placentation. Placenta. 2012;33:663–671. doi: 10.1016/j.placenta.2012.05.005. [DOI] [PubMed] [Google Scholar]
  18. Elbarbary RA, Lucas BA, Maquat LE. Retrotransposons as regulators of gene expression. Science. 2016;351:aac7247. doi: 10.1126/science.aac7247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Eymery A, Liu Z, Ozonov EA, Stadler MB, Peters AH. The methyltransferase Setdb1 is essential for meiosis and mitosis in mouse oocytes and early embryos. Development. 2016;143:2767–2779. doi: 10.1242/dev.132746. [DOI] [PubMed] [Google Scholar]
  20. Gifford WD, Pfaff SL, Macfarlan TS. Transposable elements as genetic regulatory substrates in early development. Trends in Cell Biology. 2013;23:218–226. doi: 10.1016/j.tcb.2013.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Golbus MS, Calarco PG, Epstein CJ. The effects of inhibitors of RNA synthesis (alpha-amanitin and actinomycin D) on preimplantation mouse embryogenesis. The Journal of Experimental Zoology. 1973;186:207–216. doi: 10.1002/jez.1401860211. [DOI] [PubMed] [Google Scholar]
  22. Grow EJ, Flynn RA, Chavez SL, Bayless NL, Wossidlo M, Wesche DJ, Martin L, Ware CB, Blish CA, Chang HY, Pera RA, Wysocka J. Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature. 2015;522:221–225. doi: 10.1038/nature14308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Göke J, Lu X, Chan YS, Ng HH, Ly LH, Sachs F, Szczerbinska I. Dynamic transcription of distinct classes of endogenous retroviral elements marks specific populations of early human embryonic cells. Cell Stem Cell. 2015;16:135–141. doi: 10.1016/j.stem.2015.01.005. [DOI] [PubMed] [Google Scholar]
  24. Hamatani T, Carter MG, Sharov AA, Ko MS. Dynamics of global gene expression changes during mouse preimplantation development. Developmental Cell. 2004;6:117–131. doi: 10.1016/S1534-5807(03)00373-3. [DOI] [PubMed] [Google Scholar]
  25. Hayashi K, Lopes SM, Tang F, Surani MA, Susana M, Azim Surani M. Dynamic equilibrium and heterogeneity of mouse pluripotent stem cells with distinct functional and epigenetic states. Cell Stem Cell. 2008;3:391–401. doi: 10.1016/j.stem.2008.07.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Molecular Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hubley R, Finn RD, Clements J, Eddy SR, Jones TA, Bao W, Smit AF, Wheeler TJ. The dfam database of repetitive DNA families. Nucleic Acids Research. 2016;44:D81–D89. doi: 10.1093/nar/gkv1272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Jiang L, Schlesinger F, Davis CA, Zhang Y, Li R, Salit M, Gingeras TR, Oliver B. Synthetic spike-in standards for RNA-seq experiments. Genome Research. 2011;21:1543–1551. doi: 10.1101/gr.121095.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Karimi MM, Goyal P, Maksakova IA, Bilenky M, Leung D, Tang JX, Shinkai Y, Mager DL, Jones S, Hirst M, Lorincz MC. DNA methylation and SETDB1/H3K9me3 regulate predominantly distinct sets of genes, retroelements, and chimeric transcripts in mESCs. Cell Stem Cell. 2011;8:676–687. doi: 10.1016/j.stem.2011.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Kigami D, Minami N, Takayama H, Imai H. MuERV-L is one of the earliest transcribed genes in mouse one-cell embryos. Biology of Reproduction. 2003;68:651–654. doi: 10.1095/biolreprod.102.007906. [DOI] [PubMed] [Google Scholar]
  31. Kim J, Zhao H, Dan J, Kim S, Hardikar S, Hollowell D, Lin K, Lu Y, Takata Y, Shen J, Chen T. Maternal Setdb1 is required for meiotic progression and preimplantation development in mouse. PLoS Genetics. 2016;12:e1005970. doi: 10.1371/journal.pgen.1005970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kolodziejczyk AA, Kim JK, Tsang JC, Ilicic T, Henriksson J, Natarajan KN, Tuck AC, Gao X, Bühler M, Liu P, Marioni JC, Teichmann SA. Single cell RNA-Sequencing of pluripotent states unlocks modular transcriptional variation. Cell Stem Cell. 2015;17:471–485. doi: 10.1016/j.stem.2015.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nature Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lee MT, Bonneau AR, Giraldez AJ. Zygotic genome activation during the maternal-to-zygotic transition. Annual Review of Cell and Developmental Biology. 2014;30:581–613. doi: 10.1146/annurev-cellbio-100913-013027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Li L, Zheng P, Dean J. Maternal control of early mouse development. Development. 2010;137:859–870. doi: 10.1242/dev.039487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Li L, Lu X, Dean J. The maternal to zygotic transition in mammals. Molecular Aspects of Medicine. 2013;34:919–938. doi: 10.1016/j.mam.2013.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Liu YJ, Nakamura T, Nakano T. Essential role of DPPA3 for chromatin condensation in mouse oocytogenesis. Biology of Reproduction. 2012;86:40. doi: 10.1095/biolreprod.111.095018. [DOI] [PubMed] [Google Scholar]
  38. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Macfarlan TS, Gifford WD, Driscoll S, Lettieri K, Rowe HM, Bonanomi D, Firth A, Singer O, Trono D, Pfaff SL. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature. 2012;487:57–63. doi: 10.1038/nature11244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40:4288–4297. doi: 10.1093/nar/gks042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Nakamura T, Arai Y, Umehara H, Masuhara M, Kimura T, Taniguchi H, Sekimoto T, Ikawa M, Yoneda Y, Okabe M, Tanaka S, Shiota K, Nakano T. PGC7/Stella protects against DNA demethylation in early embryogenesis. Nature Cell Biology. 2007;9:64–71. doi: 10.1038/ncb1519. [DOI] [PubMed] [Google Scholar]
  42. Nakamura T, Liu YJ, Nakashima H, Umehara H, Inoue K, Matoba S, Tachibana M, Ogura A, Shinkai Y, Nakano T. PGC7 binds histone H3K9me2 to protect against conversion of 5mc to 5hmc in early embryos. Nature. 2012;486:415–419. doi: 10.1038/nature11093. [DOI] [PubMed] [Google Scholar]
  43. Nakashima H, Kimura T, Kaga Y, Nakatani T, Seki Y, Nakamura T, Nakano T. Effects of dppa3 on DNA methylation dynamics during primordial germ cell development in mice. Biology of Reproduction. 2013;88:125. doi: 10.1095/biolreprod.112.105932. [DOI] [PubMed] [Google Scholar]
  44. Nakatani T, Yamagata K, Kimura T, Oda M, Nakashima H, Hori M, Sekita Y, Arakawa T, Nakamura T, Nakano T. Stella preserves maternal chromosome integrity by inhibiting 5hmC-induced γh2ax accumulation. EMBO Reports. 2015;16:582–589. doi: 10.15252/embr.201439427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Park SJ, Shirahige K, Ohsugi M, Nakai K. DBTMEE: a database of transcriptome in mouse early embryos. Nucleic Acids Research. 2015;43:D771–D776. doi: 10.1093/nar/gku1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Payer B, Saitou M, Barton SC, Thresher R, Dixon JP, Zahn D, Colledge WH, Carlton MB, Nakano T, Surani MA. Stella is a maternal effect gene required for normal early development in mice. Current Biology. 2003;13:2110–2117. doi: 10.1016/j.cub.2003.11.026. [DOI] [PubMed] [Google Scholar]
  47. Peaston AE, Evsikov AV, Graber JH, de Vries WN, Holbrook AE, Solter D, Knowles BB. Retrotransposons regulate host genes in mouse oocytes and preimplantation embryos. Developmental Cell. 2004;7:597–606. doi: 10.1016/j.devcel.2004.09.004. [DOI] [PubMed] [Google Scholar]
  48. Peat JR, Dean W, Clark SJ, Krueger F, Smallwood SA, Ficz G, Kim JK, Marioni JC, Hore TA, Reik W. Genome-wide bisulfite sequencing in zygotes identifies demethylation targets and maps the contribution of TET3 oxidation. Cell Reports. 2014;9:1990–2000. doi: 10.1016/j.celrep.2014.11.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Rebollo R, Romanish MT, Mager DL. Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annual Review of Genetics. 2012;46:21–42. doi: 10.1146/annurev-genet-110711-155621. [DOI] [PubMed] [Google Scholar]
  50. Ribet D, Louvet-Vallée S, Harper F, de Parseval N, Dewannieux M, Heidmann O, Pierron G, Maro B, Heidmann T. Murine endogenous retrovirus MuERV-L is the progenitor of the "orphan" epsilon viruslike particles of the early mouse embryo. Journal of Virology. 2008;82:1622–1625. doi: 10.1128/JVI.02097-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Risso D, Ngai J, Speed TP, Dudoit S. Normalization of RNA-seq data using factor analysis of control genes or samples. Nature Biotechnology. 2014;32:896–902. doi: 10.1038/nbt.2931. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Rowe HM, Trono D. Dynamic control of endogenous retroviruses during development. Virology. 2011;411:273–287. doi: 10.1016/j.virol.2010.12.007. [DOI] [PubMed] [Google Scholar]
  53. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Shen L, Inoue A, He J, Liu Y, Lu F, Zhang Y. Tet3 and DNA replication mediate demethylation of both the maternal and paternal genomes in mouse zygotes. Cell Stem Cell. 2014;15:459–470. doi: 10.1016/j.stem.2014.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Supek F, Bošnjak M, Škunca N, Šmuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One. 2011;6:e21800. doi: 10.1371/journal.pone.0021800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Tadros W, Lipshitz HD. The maternal-to-zygotic transition: a play in two acts. Development. 2009;136:3033–3042. doi: 10.1242/dev.033183. [DOI] [PubMed] [Google Scholar]
  57. Tang F, Barbacioru C, Nordman E, Li B, Xu N, Bashkirov VI, Lao K, Surani MA, Azim Surani M. RNA-Seq analysis to capture the transcriptome landscape of a single cell. Nature Protocols. 2010;5:516–535. doi: 10.1038/nprot.2009.236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Thompson PJ, Macfarlan TS, Lorincz MC. Long terminal repeats: from parasitic elements to building blocks of the transcriptional regulatory repertoire. Molecular Cell. 2016;62:766–776. doi: 10.1016/j.molcel.2016.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Tsukada Y, Akiyama T, Nakayama KI. Maternal TET3 is dispensable for embryonic development but is required for neonatal growth. Scientific Reports. 2015;5:15876. doi: 10.1038/srep15876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wasson JA, Simon AK, Myrick DA, Wolf G, Driscoll S, Pfaff SL, Macfarlan TS, Katz DJ. Maternally provided LSD1/KDM1A enables the maternal-to-zygotic transition and prevents defects that manifest postnatally. eLife. 2016;5:e08848. doi: 10.7554/eLife.08848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wossidlo M, Nakamura T, Lepikhov K, Marques CJ, Zakhartchenko V, Boiani M, Arand J, Nakano T, Reik W, Walter J. 5-Hydroxymethylcytosine in the mammalian zygote is linked with epigenetic reprogramming. Nature Communications. 2011;2:241. doi: 10.1038/ncomms1240. [DOI] [PubMed] [Google Scholar]
  62. Wu G, Aoyama C, Young SG, Vance DE. Early embryonic lethality caused by disruption of the gene for choline kinase alpha, the first enzyme in phosphatidylcholine biosynthesis. The Journal of Biological Chemistry. 2008;283:1456–1462. doi: 10.1074/jbc.M708766200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Wu TD, Nacu S. Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics. 2010;26:873–881. doi: 10.1093/bioinformatics/btq057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Wu J, Huang B, Chen H, Yin Q, Liu Y, Xiang Y, Zhang B, Liu B, Wang Q, Xia W, Li W, Li Y, Ma J, Peng X, Zheng H, Ming J, Zhang W, Zhang J, Tian G, Xu F, Chang Z, Na J, Yang X, Xie W. The landscape of accessible chromatin in mammalian preimplantation embryos. Nature. 2016;534:652–657. doi: 10.1038/nature18606. [DOI] [PubMed] [Google Scholar]
  65. Xue Z, Huang K, Cai C, Cai L, Jiang CY, Feng Y, Liu Z, Zeng Q, Cheng L, Sun YE, Liu JY, Horvath S, Fan G. Genetic programs in human and mouse early embryos revealed by single-cell RNA sequencing. Nature. 2013;500:593–597. doi: 10.1038/nature12364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Yan L, Yang M, Guo H, Yang L, Wu J, Li R, Liu P, Lian Y, Zheng X, Yan J, Huang J, Li M, Wu X, Wen L, Lao K, Li R, Qiao J, Tang F. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nature Structural & Molecular Biology. 2013;20:1131–1139. doi: 10.1038/nsmb.2660. [DOI] [PubMed] [Google Scholar]
eLife. 2017 Mar 21;6:e22345. doi: 10.7554/eLife.22345.034

Decision letter

Editor: Robb Krumlauf1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "STELLA and MuERV-L activation are essential for early mouse development" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by Robb Krumlauf as the Reviewing Editor and Fiona Watt as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Didier Trono (Reviewer #1) and Todd S Macfarlan (Reviewer #2).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

This study presents an interesting set of findings. The strength of this paper is in the genomic analyses, primarily single cell RNA-sequencing approaches and data analysis. They convincingly show failure in MZT of maternal Stella mutants, which is a significant and important finding, although not entirely surprising, since embryogenesis has been previously shown to be impaired in these mutants. There are however two major weaknesses to the work. The first shortcoming is a failure to provide mechanistic insight on how Stella might affect MERVL transcription and regulates MZT genes. The second relates to the lack of clarity of the proposed connection between Stella, MERVL, 2-cell stage-specific genes and MZT. The loss-of-function experiments (via RNAi) of MERVL elements is not entirely convincing. The knockdown as performed should not affect the expression of MuERVL LTR-driven ZGA genes, which are also controlled by STELLA. This offers no mechanistic insight into how the expression of MERVL elements is important for developmental progression. Thus more mechanistic insights would greatly improve the manuscript. However, we realize that extra work showing the requirement of MuERV-L for development might take an extended period that would prevent a timely submission. The authors could change to this conclusion and instead focus on what is regulated by Stella. Hence, as a minimum, there is a need for an enhanced discussion with plausible mechanisms of both Stella and MERVLs role in early development. Furthermore there is a need for controls to address concerns about the interpretation of the MERVL siRNA experiment tests.

Major specific points:

1) Mechanism of Stella action. The authors don't attempt any experiment to address how Stella regulates MZT, other than looking at effects on transcription. No data is shown to demonstrate that these effects are direct. It is unlikely that Stella binds both to maternal mRNAs aiding in their destruction while simultaneously activation MERVL promoters like a transcription factor. As a starting point, it would be interesting if the authors could "rescue" the maternal mutants by injecting Stella cDNAs containing mutations in some of its putative domains to tease out domains that might be important for the observed MZT defects. Likewise, the Introduction would benefit from a description of the Stella protein and its known domains.

2) MERVL loss-of-function experiments. The authors use a siRNA against the MEVL GAG region, which will target some but not all full-length MERVL elements. The authors should provide a list of the genomic MERVL elements (or the percentage) that should be perfect targets, one mismatch, two mismatch, etc. to the siRNA. In other words, provide the reader with the fraction of full-length MERVL elements that should be affected by the siRNA. Additionally, the IF images, even if very carefully performed, can be easily misinterpreted since the embryos are imaged separately. Western blots with the same Gag antibody should also be included. This is achievable since Heidmann's lab reported westerns from 2C embryos over a decade ago.

3) In Figure 5C, it looks like the scramble siRNA reduces MERVL expression relative to uninjected embryos. Is this significant? In Figure 5E and 5H, the authors should include uninjected embryos for comparison and include statistics. There is a concern that the different phenotypes between scramble siRNA and MERVL siRNA are much less than the difference between uninjected and scramble siRNA. Thus, the reader should be provided this control. Figure 5E and H . Given the title, replicating this a few more times to get higher certainty in the effect of MuERV-L knockdown would be appropriate. If this work would be substantial, a change in title and claims about development potential needs to be considered.

4) There is no mechanistic insight into how knocking down MERVL transcripts (and Gag positive virus particles) affects development and the discussion on this topic is not adequate. In the Discussion, the authors state they found "that reducing MERVL activation at the 2C stage affects development", but this is misleading, since knocking down MERVL transcripts with an siRNA is not the same as preventing the expression of MERVL elements and the broad open chromatin domains this likely causes. This is testable and one might expect that MERVL siRNAs have little effect on the embryonic transcriptome or chromatin state surrounding MERVLs. A better experiment with this aim would be CRISPRi. It is nonetheless interesting that MERVL element mRNA/proteins seem to be important for development, but there is no experiment that addresses mechanism, or even discussion on a plausible mechanism. For example, might the viral particles themselves be important, similar to the roles proposed for HERVK particles in human embryos (Grow et al., 2015). Since this is a major claim of the paper, it either needs to be softened or it should be supported with more compelling data such as rescue experiments.

5) It would be interesting to test if the MERVL LOF phenotype is more severe if a portion of the M2t_mm LTR is targeted by an siRNA, since this could potentially target chimeric transcripts in addition to the full-length elements. Was this experiment attempted?

6) Figure 1 nicely demonstrates that a subset of MZT genes fails to become activated in Stella KO embryos. The GO analysis illustrated in Figure 2 further reveals that a lot of different pathways are affected, although none stands out as particularly responsible for the observed phenotype. Panels in 2C and 2D provide useful validations of the RNA seq analyses, yet do not add any specific information hence could be presented as Supplementary Data.

7) Figure 3 depicts the dysregulation of TEs expression in Stella KO embryos. How many cells are included for each stage in Figure 3E should be indicated. The authors should verify levels of MERVL expression by qPCR, using consensus primers spanning the sequence of this element. They further suggest that a zinc finger protein, ZFP143, might be involved in regulating MERVL, as its levels increase slightly upon passage from the 1-cell to the 2-cell stage, and are higher in the absence of Stella. The upper graph needs more information: how many cells assessed? Statistical significance? Normalization? Furthermore, there is no reference regarding the putative ZFP143-binding sequence, the location of which within MERVL (and other TEs observed to be upregulated in the absence of Stella) should be indicated. As well, to support their incrimination of ZFP153 in the control of MERVL, the authors could overexpress it or knock it down in mESC, a small fraction of which upregulates MERVL at any given time in tissue culture (McFarlan et al., 2012). As well, the effect of Stella and ZFP143 overexpression on MERVL LTR-driven reporters could be tested in this or other heterologous systems.

8) In line with the decreased upregulation of MERVL elements and their MT2_mm LTRs in the absence of Stella, Figure 4 reveals that the expression of genes driven from these transcriptional units is also reduced in this setting, suggesting that Stella regulates the transcriptional activity of the MERVL promoter. In Figure 5 the authors demonstrate that depletion by siRNA of MERVL transcripts from early embryos leads to a phenotype closely similar to that induced by the depletion of Stella. However, the siRNA used in these experiments targets the gag sequence of MERVL, hence should trigger the degradation of only a subset of MERVL RNAs, and not those where the LTR drives chimeric transcripts, and drives MZT genes, as those illustrated in Figure 4D. It might be interpreted as suggesting that the lethal phenotype induces by Stella depletion results from the lack of MERVL transcripts, not from the modulation of MZT genes. The authors should definitely address this paradox, for instance by i) quantifying the expression of MZT genes (as per Figure 4D) upon MERVL depletion, and ii) injecting MERVL transcripts in the zygote of Stella KO embryos to see whether they rescue lethality.

9) Figure 4C illustrates 4 examples of genes, the expression trend of which matches that of neighboring TEs. In only one case (ZFP54) is this TE a MERVL-related element, whereas in the other 3 they are other types of TEs. The nature of these TEs should be better described (LTRs? Internal sequences? Orientation?). Were chimeric transcripts detected in any of these cases? Were there other TEs in the intervening sequences (between TEs mentioned here and genes and their TSS -genes orientation should also be indicated).

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "STELLA modulates transcriptional and endogenous retrovirus programs during maternal-to-zygotic transition" for further consideration at eLife. Your revised article has been favorably evaluated by Fiona Watt (Senior editor), a Reviewing editor, and three reviewers.

The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below by reviewer 1. The major issues that need to be addressed relate to conclusions of MERVL depletion in early embryos reducing level of 2C specific genes and the relevance of Figure 5—figure supplement 5A.

Reviewer #1:

Most of the concerns we raised in our previous review have been satisfactorily addressed in the new version of the manuscript. However, the experiments aimed at exposing the importance and role of MERVL transcripts in MZT still raise questions, and need further clarification or dampening of the related claims.

In the manuscript and rebuttal letter, the authors state that MERVL depletion in early embryos leads to decreased expression of 2C-specific genes, but the data do not fully support this claim. Figure 5—figure supplement 1B shows qRT-PCR analysis for 4 2C-specific genes driven by the MT2_mm MERVL LTR in embryos injected with MERVL-targeting or scramble siRNA. According to the authors' model, they should be downregulated upon knocking down MERVL transcripts. Yet two (Tcstv1, Gm4340) are upregulated, one (Gpbp1l1) unchanged, and only one (Sp110) downregulated, similar to the targeted MERVL transcripts themselves. It would have been useful to examine the siRNA-sensitivity of some other of the 200 or so MZT genes, including ones not driven by a MERVL-associated LTR but either by the LTR of another transposable element (e.g. Eif1a-like genes) or their own promoter (e.g. Zscan4).

As well, the experiment described in Figure 5—figure supplement 5A (could not find where it is specifically described in the text) does not add much, since the downregulation from the injection of an siRNA targeting the MT2_mm part of chimeric transcripts is significant for only 2 out of 6 of tested here.

eLife. 2017 Mar 21;6:e22345. doi: 10.7554/eLife.22345.035

Author response


Major specific points:

1) Mechanism of Stella action. The authors don't attempt any experiment to address how Stella regulates MZT, other than looking at effects on transcription. No data is shown to demonstrate that these effects are direct. It is unlikely that Stella binds both to maternal mRNAs aiding in their destruction while simultaneously activation MERVL promoters like a transcription factor.

Our aim at the very outset was to understand the mechanism of Stella function, while accepting that many of the experiments (e.g. ChIP, Co-IP, CLIP etc) are very challenging in 2-cell embryos. We have therefore attempted to identify direct targets of Stella in mESCs, using a published protocol with modifications (Nakamura et al., 2012) (Author response image 1). We tagged hemagglutinin (HA) to the C-terminus of Stella and used the high affinity HA antibody to pull down Stella bound chromatin. The Stella+HA construct was over-expressed in the absence of endogenous Stella by using StellaM/Z KO ESCs, with NANOG-HA and eGFP-HA as positive and negative controls, respectively; expression was confirmed by western blot (Author response image 1A and 1B).

Author response image 1. Stella-HA ChIP.

Author response image 1.

(A) Western blot of Stella and HA. Stella with 3 copies of hemagglutinin (HA) tagged to its C-terminal is over-expressed in a Dox inducible system in StellaM/Z KO ESCs. Stella’s predicted size is ~ 21kDA, and appears as ~ 27kDA due to addition of 3xHA construct. Bottom panel shows H3 loading control. (B) Western blot for NANOG+HA and eGFP+HA. Both NANOG+HA and eGFP+HA constructs were expressed in a Dox-inducible system in StellaM/Z KO ESCs. The blot show NANOG+HA construct is detected at ~55kDA while the endogenous NANOG is detected at ~45kDA. eGFP+HA construct is detected at ~37kDA, as confirmed with eGFP antibody alone. Bottom panel shows H3 loading control. (C) Top panel shows ChIP-qPCR of the relative enrichment of NANOG-HA in NANOG known binding regions (Xist, proximal enhancer of Prdm14) and gene desert regions (GAPDH and Chr18). Bottom panel shows ChIP-qPCR of the relative enrichment of Stella-HA in previously published Stella binding regions (Mage-a2, Wfdc15a) (Nakamura et al., 2012) and gene desert regions. One-tailed Student’s T-test was performed, where * corresponds to p<0.05 or otherwise stated.

DOI: http://dx.doi.org/10.7554/eLife.22345.027

We detected enrichments of NANOG in known binding sites (Xist, proximal enhancers of Prdm14) (Gillich, Bao, and Surani 2013, Murakami et al., 2016), but not for eGFP, supporting the efficacy of the ChIP. However, there was little or no enrichment of Stella at its previously reported binding sites (Mage-a2, Wfdc15a1) (Nakamura et al., 2012), nor could we detect Stella binding to other regions in the genome (n=56) (subsection “Stella is associated with the expression of specific TE “, Supplementary file 2). In addition, neither over-expression nor knockdown of Dppa3 affected MuERV-L expression in mESCs (Figure 3—figure supplement 2). This indicates that Stella does not bind to zygotic genes / MuERV-L elements in mESCs, with the caveat that mESC might not accurately reflect the cellular/chromatin context of 2-cell embryos, where these experiments are intractable. Nonetheless, we felt the attempt was worthwhile partly because published reports indicated that ESCs cycle through a 2C-like state (Macfarlan et al., 2012).

As a starting point, it would be interesting if the authors could "rescue" the maternal mutants by injecting Stella cDNAs containing mutations in some of its putative domains to tease out domains that might be important for the observed MZT defects.

We considered this point previously by comparing maternal- and zygotic- only mutants. Briefly, maternal Stella persists until ~E1.5, followed by zygotic Stella expression at ~E2.5 embryos (Payer et al. 2003). Evidence suggests that maternal Stella is critical for development, but zygotic Stella is largely dispensable, implying that reintroduction of Stella in the early embryo cannot rescue the effects of maternal loss (Payer et al., 2003, Bortvin et al., 2004, Nakamura et al., 2007). This emphasizes the critical role of Stella at the earliest developmental stages. Thus, it is doubtful that attempts to conduct ‘rescue’ experiments would be effective since Stella protein is unlikely to accumulate to sufficient levels to have an effect. Note also that these experiments are very challenging since this would require a large number of mutant oocytes.

Likewise, the Introduction would benefit from a description of the Stella protein and its known domains.

We provide the information together with a schematic diagram in the Introduction (Figure 1—figure supplement 1).

2) MERVL loss-of-function experiments. The authors use a siRNA against the MEVL GAG region, which will target some but not all full-length MERVL elements. The authors should provide a list of the genomic MERVL elements (or the percentage) that should be perfect targets, one mismatch, two mismatch, etc. to the siRNA. In other words, provide the reader with the fraction of full-length MERVL elements that should be affected by the siRNA.

This is an important point that we had overlooked. We have now performed the additional bioinformatics analysis (Figure 5B) as suggested by the reviewers. We identified 583 full-length MuERV-L elements that fully encode the gag and pol proteins in the genome. MuERV-L siRNA targets 80.5% of full-length MuERV-L elements with perfect match and 99.5% of full-length MuERV-L elements with 2 mismatches or less. This is an informative result given the minimum number of predicted off-targets with 2 mismatches or less. Furthermore, scramble siRNA did not have any predicted targets with 2 mismatches or less.

Additionally, the IF images, even if very carefully performed, can be easily misinterpreted since the embryos are imaged separately. Western blots with the same Gag antibody should also be included. This is achievable since Heidmann's lab reported westerns from 2C embryos over a decade ago.

While we acknowledge this point, the western blots would require approximately 40 – 100 2-cell embryos per sample (Kang et al., 2014, Komatsu et al., 2014, Mu et al., 2011). Note also that the experiments require microinjections, with a ~50% survival rate. Finally our MuERV-L antibody is different from that used by Heidmann’s lab (Ribet et al., 2008), and we have no access to it. For these reasons, we chose IF to validate MuERV-L protein knockdown. The analysis was carried with care using our outstanding core imaging facility at the Gurdon Institute, with high technical expertise. Our facility has designed a custom ImageJ macro plugin to measure IF signals; we used this to asses MuERV-L (GFP) intensity in 2-cell embryos using the Fiji image software.

Accordingly, we took extra precautions ensuring that all the control and experimental samples were treated simultaneously and identically, and on the same day with identical microscope settings; representative images for each condition are shown in Figure 5D. The quantification of the GFP intensity is however meant to provide changes in relative levels in MuERV-L protein, and not precise levels of the protein. This procedure for IF images was used recently by our imaging core on another study on pre-implantation mouse embryos (Goolam et al.Cell, 2016).

3) In Figure 5C, it looks like the scramble siRNA reduces MERVL expression relative to uninjected embryos. Is this significant?

MuERV-L pol expression is not significantly reduced in scramble siRNA compared to uninjected embryos as judged by the two-sided Wilcoxson rank sum test and this is now labeled as n.s. (non significant) in Figure 5C and Figure 5D for clarity.

In Figure 5E and 5H, the authors should include uninjected embryos for comparison and include statistics. There is a concern that the different phenotypes between scramble siRNA and MERVL siRNA are much less than the difference between uninjected and scramble siRNA. Thus, the reader should be provided this control. Figure 5E and H .

The uninjected control embryos are now included in the Figure 5E and Figure 5H as suggested by the reviewers. At 20μM, the percentage of embryos reaching blastocyst by day 4 was similar between uninjected and scramble siRNA embryos (83% versus 89% respectively). However, only 60% of MuERV-L siRNA injected embryos reached the blastocyst stage, which narrowly below significance (p=0.065). There were no other statistically significant differences between uninjected, scramble or MuERV-L siRNA treated embryos in Figure 5E. Statistical significance is now stated for Figure 5H.

We agree that at 80μM concentration, siRNA itself may have a negative effect on embryo development compared to uninjected embryos. This might in part be due to some non-specific effects, which we acknowledged in the manuscript (subsection “MuERV-L plays a functional role during pre-implantation development “). On the other hand, there are 583 full-length copies of MuERV-L elements in the genome, which are highly upregulated at the 2-cell stage, comprising ~3% of the transcriprome. This likely requires a higher concentration of siRNA to achieve a significant knockdown compared to what is normally the case for a single copy gene. To accommodate the conflicting experimental drawbacks, we decided to present both 20μM and 80μM; they both show a similar effect, with the latter having a greater effect accepting the appropriate caveats.

Given the title, replicating this a few more times to get higher certainty in the effect of MuERV-L knockdown would be appropriate. If this work would be substantial, a change in title and claims about development potential needs to be considered.

We agree with the reviewers’ comments and appreciate the suggestion offered by the reviewers. Obtaining enough mutant embryos is very challenging besides the experiments themselves. We have therefore changed the title and moderated the claims in the manuscript. The revised title is below, which we hope will be acceptable.

“Stella modulates transcriptional and endogenous retrovirus programs during maternal-to-zygotic transition”

4) There is no mechanistic insight into how knocking down MERVL transcripts (and Gag positive virus particles) affects development and the discussion on this topic is not adequate. In the Discussion, the authors state they found "that reducing MERVL activation at the 2C stage affects development", but this is misleading, since knocking down MERVL transcripts with an siRNA is not the same as preventing the expression of MERVL elements and the broad open chromatin domains this likely causes. This is testable and one might expect that MERVL siRNAs have little effect on the embryonic transcriptome or chromatin state surrounding MERVLs. A better experiment with this aim would be CRISPRi. It is nonetheless interesting that MERVL element mRNA/proteins seem to be important for development, but there is no experiment that addresses mechanism, or even discussion on a plausible mechanism. For example, might the viral particles themselves be important, similar to the roles proposed for HERVK particles in human embryos (Grow et al., Nature). Since this is a major claim of the paper, it either needs to be softened or it should be supported with more compelling data such as rescue experiments.

As we pointed out earlier, we have moderated our claims based on the reviewers’ comments, with respect to the siRNA effects on MuERV-L transcript and protein. Also in response to points 4 and point 8, we have performed additional qRT-PCR analysis on 2-cell embryos, which revealed that MuERV-L knockdown does affect the expression of chimeric transcripts and 2-cell specific genes, which may contribute to the reduced embryonic potential of MuERV-L knockdown embryos. Importantly, this suggests siRNA targeting is one relevant approach to investigate the biological significance of gene expression derived from MuERV-L.

The use of CRISPRi is a potentially interesting and complementary approach to address whether LTR-driven zygotic gene expression is important during development. However, it will be difficult to generate a gRNA library capable of targeting most of the LTR promoters without significant indirect effects on protein-coding genes, as many solo or non-functional LTRs have integrated into 5’ and 3’UTRs, which would be targeted despite little or no influence on the expression of their host gene. Thus developing this CRISPRi system would require extensive optimization, including efficient delivery into the zygote.

A potential role of viral particles in mouse development seems less likely, because we previously showed that knockout of all interferon response genes (IFITM) family members has no detectable effect on mouse development (Lange et al., 2008). Furthermore, while HERVK and virus-like particles were indicated in human hypomethylated epiblast cells in blastocysts (Grow et al., 2015), we did not detect them in highly hypomethylated human primordial germ cells that were predicted in the paper by Grow (W.Tang et al., unpublished). We have included discussion of this and additional possible mechanisms by which MuERV-L mRNA / protein may function during pre-implantation development (Discussion section paragraph five).

5) It would be interesting to test if the MERVL LOF phenotype is more severe if a portion of the M2t_mm LTR is targeted by an siRNA, since this could potentially target chimeric transcripts in addition to the full-length elements. Was this experiment attempted?

We have preliminary data regarding this specific comment (Author response image 2). We designed siRNA against MT2_Mm and MT2 elements (MT2A, MT2B, MT2B1, MT2B2, MT2C, MT2C_Mm); these LTRs were also significantly depleted in Stella M/Z KO 2-cell embryos. The phenotype we observed for MT2_Mm knockdown was in fact similar to MuERV-L knockdown (Author response image 2A). Bioinformatics analysis predicted MT2_Mm siRNA targets 74.3% of MT2_Mm elements with 2 or less mismatches and a minimum number of off-targets (Author response image 2B). MT2 elements have very diverse sequences, with 37002 elements considered in the analysis, and only 7.85% of MT2 elements targeted by MT2 siRNA with 2 or less mismatches (Author response image 2D). Nevertheless, we observed an effect on embryonic development (Author response image 2C), which may have resulted from knockdown of a combination of MT2 elements and associated chimeric transcripts.

Author response image 2. Effects of MT2 element knockdown on pre-implantation embryonic development.

Author response image 2.

(A) A bar graph illustrating the number of uninjected embryos, embryos injected with 20μM scramble or 20μM MT2_Mm siRNAs, at different stages of development in day 2 – 4 of in-vitro culture, n=1. The percentage of embryos reaching the blastocyst stage is stated. (B) Bioinformatics analysis of the number of MT2_Mm elements and off-targets that are perfect matches or 1 or 2 mismatches for MT2_Mm siRNA. A total of 2667 MT2_Mm elements are included in the analysis. (C) A bar graph illustrating the number of uninjected embryos, embryos injected with 20 μM scramble or 20 μM MT2 siRNAs, at different stages of development in day 2 – 4 of in-vitro culture, n=1. The percentage of embryos reaching the blastocyst stage is stated. (D) Bioinformatics analysis of the number of MT2 elements (MT2A, MT2B, MT2B1, MT2B2, MT2C_Mm) and off-targets that are perfect matches or 1 or 2 mismatches for MT2 siRNA. A total of 37002 MT2 elements are included in the analysis.

DOI: http://dx.doi.org/10.7554/eLife.22345.028

6) Figure 1 nicely demonstrates that a subset of MZT genes fails to become activated in Stella KO embryos. The GO analysis illustrated in Figure 2 further reveals that a lot of different pathways are affected, although none stands out as particularly responsible for the observed phenotype. Panels in 2C and 2D provide useful validations of the RNA seq analyses, yet do not add any specific information hence could be presented as Supplementary Data.

There were significant dysregulations observed in Stella M/Z KO 2-cell embryos and the GO analysis served to highlight potentially significant functional pathways that contributed towards embryonic lethality in KO embryos. We believe the qRT-PCR data give examples from each of the pathways, provides validation of the RNA-seq from independent experiments (in an alternative visual format) and further emphasizes the potentially important biological processes misregulated in Stella M/Z KO 2-cell embryos. Thus we would prefer to retain it in the main figure if possible.

7) Figure 3 depicts the dysregulation of TEs expression in Stella KO embryos. How many cells are included for each stage in Figure 3E should be indicated.

The bar chart for Figure 3E were calculated from single cell / embryo RNA sequencing reads, which were mapped to TEs. We have now made this clear in the figure legend. Thus the number of embryos included for each stage is the same as the original RNA-sequencing study (WT oocyte n=12, KO oocyte n=12, WT 1-cell n=12, KO 1-cell n=12, WT 2-cell n=9, KO 2-cell n=9).

The authors should verify levels of MERVL expression by qPCR, using consensus primers spanning the sequence of this element.

To explore this we quantified MuERV-L expression in WT and KO 2-cell embryos processed for RNA-sequencing (Author response image 3). RNA-seq mapped reads showed significant reduction in MuERV-L-int expression in KO 2-cell embryos. This was not however observed in 2 sets of MuERV-L by qRT-PCR analysis (Author response image 3A). Initial evidence suggests this is because MuERV-L primers can amplify significant off target regions, which lead to over-estimation of MuERV-L expression in some embryos (Author response image 3B). For this reason we opted for IF validation, which showed robust reduction of MuERV-L Gag protein level in KO 2-cell embryos, and corroborated the unbiased RNA-seq, which showed a comparable reduction.

Author response image 3. MERVL RNA-sequencing versus qRT-PCR.

Author response image 3.

(A) The first dot plot shows MERVL-Int expression in WT and KO 2-cell embryos. MERVL-Int reads were mapped from single embryo RNA-sequencing and expressed as counts per million. The latter two dot plots shows MERVL expression from 2 qRT-PCR primers (MERVL-pol and MERVL1) in the same WT and KO 2-cell embryos. Samples are expressed as a ratio relative to WT embryo 1 (WT1). The median and interquartile range is indicated. WT n=9, KO n=9. Two-sided Wilcoxon rank sum test was performed and *** corresponds to p<0.001, n.s = not significant. (B) A plot of expression correlation between MERVL-Int from RNA-sequencing and MERVL-pol from qRT-PCR, Pearson correlation analysis performed.

DOI: http://dx.doi.org/10.7554/eLife.22345.029

They further suggest that a zinc finger protein, ZFP143, might be involved in regulating MERVL, as its levels increase slightly upon passage from the 1-cell to the 2-cell stage, and are higher in the absence of Stella. The upper graph needs more information: how many cells assessed? Statistical significance? Normalization? Furthermore, there is no reference regarding the putative ZFP143-binding sequence, the location of which within MERVL (and other TEs observed to be upregulated in the absence of Stella) should be indicated. As well, to support their incrimination of ZFP153 in the control of MERVL, the authors could overexpress it or knock it down in mESC, a small fraction of which upregulates MERVL at any given time in tissue culture (McFarlan et al., 2012). As well, the effect of Stella and ZFP143 overexpression on MERVL LTR-driven reporters could be tested in this or other heterologous systems.

Thank you for your comments and interest on ZFP143 as a potential candidate regulating MuERV-L elements. We illustrated ZFP143 as an example of a motif enriched in differentially expressed TEs between WT and KO samples, but this is not the main focus here. As we do not have sufficient experimental evidence or bioinformatics analysis to suggest ZFP143 directly regulates TEs / MuERV-L elements, we have removed this from the manuscript.

8) In line with the decreased upregulation of MERVL elements and their MT2_mm LTRs in the absence of Stella, Figure 4 reveals that the expression of genes driven from these transcriptional units is also reduced in this setting, suggesting that Stella regulates the transcriptional activity of the MERVL promoter. In Figure 5 the authors demonstrate that depletion by siRNA of MERVL transcripts from early embryos leads to a phenotype closely similar to that induced by the depletion of Stella. However, the siRNA used in these experiments targets the gag sequence of MERVL, hence should trigger the degradation of only a subset of MERVL RNAs, and not those where the LTR drives chimeric transcripts, and drives MZT genes, as those illustrated in Figure 4D. It might be interpreted as suggesting that the lethal phenotype induces by Stella depletion results from the lack of MERVL transcripts, not from the modulation of MZT genes. The authors should definitely address this paradox, for instance by i) quantifying the expression of MZT genes (as per Figure 4D) upon MERVL depletion, and ii) injecting MERVL transcripts in the zygote of Stella KO embryos to see whether they rescue lethality.

Thank you for pointing out the contradictory messages in the manuscript. We have attempted to address this as suggested by the reviewers (i), by quantifying transcript expression in uninjected, 80 μM scramble and 80 μM MuERV-L siRNA 2-cell embryos (Figure 5—figure supplement 1). Interestingly, we observed MuERV-L knockdown results in significantly reduced expression of 2/6 of chimeric transcripts (Rbm8a, GM12617), and in a 2-cell specific gene – Sp110. Furthermore, GM12617 is also significantly depleted in StellaM/Z KO 2-cell embryos (Figure 1—source data 2). This suggests that defects in transcript expression may partially contribute to the reduced developmental potential of MuERV-L knockdown embryos.

In response to the second experiment proposed (ii), while MuERV-L transcripts are significantly depleted in StellaM/Z KO 2-cell embryos, we have also identified 2560 misregulated genes. We believe the aberrant ERV programme (including MuERV-L) may have contributed towards gene misregulation, however, MuERV-L transcript alone will likely not be sufficient to rescue the embryonic lethality observed in StellaM/Z KO embryos.

9) Figure 4C illustrates 4 examples of genes, the expression trend of which matches that of neighboring TEs. In only one case (ZFP54) is this TE a MERVL-related element, whereas in the other 3 they are other types of TEs. The nature of these TEs should be better described (LTRs? Internal sequences? Orientation?). Were chimeric transcripts detected in any of these cases? Were there other TEs in the intervening sequences (between TEs mentioned here and genes and their TSS -genes orientation should also be indicated).

The nature of the 4 TE types are mentioned in the manuscript (subsection “Expression of a subset of TEs is positively correlated with their nearest gene “) and the orientations of the TEs and genes are illustrated in Figure 4C now. ZFP54-MT2B is a chimeric transcript (Macfarlan et al., 2012), but the other 3 pairs of TE/gene are not. These candidates were chosen based on a genome-wide expression correlation analysis between a TE and its nearest gene. While there may be other TEs in the intervening sequences between some pairs of TE/gene, these intervening TEs are not robustly expressed in 2-cell embryos.

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Reviewer #1:

Most of the concerns we raised in our previous review have been satisfactorily addressed in the new version of the manuscript. However, the experiments aimed at exposing the importance and role of MERVL transcripts in MZT still raise questions, and need further clarification or dampening of the related claims.

In the manuscript and rebuttal letter, the authors state that MERVL depletion in early embryos leads to decreased expression of 2C-specific genes, but the data do not fully support this claim. Figure 5—figure supplement 1B shows qRT-PCR analysis for 4 2C-specific genes driven by the MT2_mm MERVL LTR in embryos injected with MERVL-targeting or scramble siRNA. According to the authors' model, they should be downregulated upon knocking down MERVL transcripts. Yet two (Tcstv1, Gm4340) are upregulated, one (Gpbp1l1) unchanged, and only one (Sp110) downregulated, similar to the targeted MERVL transcripts themselves. It would have been useful to examine the siRNA-sensitivity of some other of the 200 or so MZT genes, including ones not driven by a MERVL-associated LTR but either by the LTR of another transposable element (e.g. Eif1a-like genes) or their own promoter (e.g. Zscan4).

As well, the experiment described in Figure 5—figure supplement 5A (could not find where it is specifically described in the text) does not add much, since the downregulation from the injection of an siRNA targeting the MT2_mm part of chimeric transcripts is significant for only 2 out of 6 of tested here.

Concerning MuERV-L KD gene expression analysis, we had attempted to characterize the effect on a small sub-set of 2C specific genes; this showed some variable effects. In light of this we have accepted the reviewer’s suggestion to dampen our claims. We have therefore decided to remove this section from the manuscript and Figure 5—figure supplement 1B.

Our analysis of chimeric transcripts however revealed that all analysed amplicons tended towards reduced expression in MuERV-L knockdown embryos, with 2 transcripts being significantly depleted (p<0.05) (Figure 5—figure supplement 1). We believe that it will be valuable for the readers to see these observations since it indicates that MuERV-L depletion could affect embryonic development via multiple mechanisms.

We have however removed all the conclusions based on MuERV-L KD chimeric transcript expression analysis and simply present the observation in the Results section. We suggest in the discussion that one possible consequence of MuERV-L knockdown is that it may affect expression of chimeric transcripts, but we state that this warrants further investigation.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 1—source data 1. Gene counts for WT and KO oocyte, 1-cell and 2-cell embryos.

    Gene counts for 66 samples from single cell / embryo RNA-seq experiments. Removal of Unwanted Variation analysis has been performed to control for potentially confounding technical factors.

    DOI: http://dx.doi.org/10.7554/eLife.22345.004

    DOI: 10.7554/eLife.22345.004
    Figure 1—source data 2. List of differentially expressed genes between WT and KO samples at oocyte, 1-cell and 2-cell stage, from single cell/embryo RNA-seq analysis.

    ENSEMBL ID and names of genes identified as significantly differentially expressed with adjusted P-value (padj) <0.05. Log2FoldChange=Log2(WT/KO). Stage indicates the developmental stage that is analysed.

    DOI: http://dx.doi.org/10.7554/eLife.22345.005

    DOI: 10.7554/eLife.22345.005
    Figure 1—source data 3. List of maternal transcripts, WT class = ED, which are differentially expressed in KO samples.

    ‘E’ denotes equally expressed, ‘D’ denotes downregulated or ‘U’ denotes upregulated at a later time point. Adjusted p-value (padj) shows significant differential expression between WT and KO class.

    DOI: http://dx.doi.org/10.7554/eLife.22345.006

    DOI: 10.7554/eLife.22345.006
    Figure 1—source data 4. List of zygotically activated genes (ZAG), WT class = EU, which are differentially expressed in KO samples.

    ‘E’ denotes equally expressed, ‘D’ denotes downregulated or ‘U’ denotes upregulated at a later time point. Adjusted p-value (padj) shows significant differential expression between WT and KO class.

    DOI: http://dx.doi.org/10.7554/eLife.22345.007

    DOI: 10.7554/eLife.22345.007
    Figure 2—source data 1. List of gene ontology terms enriched in differentially expressed genes between WT and KO 2-cell embryos.

    Gene ontology terms enriched for biological processes were calculated using DAVID software (version 6.7).

    DOI: http://dx.doi.org/10.7554/eLife.22345.011

    DOI: 10.7554/eLife.22345.011
    Figure 2—source data 2. Genome-wide Dppa3 co-expression network analysis.

    Table shows ensemble ID and official names of genes exhibiting significant expression correlation with Dppa3 (Pearson Correlation |r| > 0.9).

    DOI: http://dx.doi.org/10.7554/eLife.22345.012

    DOI: 10.7554/eLife.22345.012
    Figure 3—source data 1. Uniquely mapped transposable element (TE) counts.

    TE counts for 66 samples from single cell / embryo RNA-seq experiments. Uniquely mapped reads only.

    DOI: http://dx.doi.org/10.7554/eLife.22345.015

    elife-22345-fig3-data1.xlsx (492.4KB, xlsx)
    DOI: 10.7554/eLife.22345.015
    Figure 3—source data 2. Uniquely and multi-mapped transposable element (TE) counts.

    TE counts for 66 samples from single cell/embryo RNA-seq experiments. Uniquely and multi-mapped reads.

    DOI: http://dx.doi.org/10.7554/eLife.22345.016

    elife-22345-fig3-data2.xlsx (503.1KB, xlsx)
    DOI: 10.7554/eLife.22345.016
    Supplementary file 1. Sample details for single-cell / embryo RNA sequencing.

    DOI: http://dx.doi.org/10.7554/eLife.22345.024

    elife-22345-supp1.xlsx (11.9KB, xlsx)
    DOI: 10.7554/eLife.22345.024
    Supplementary file 2. Stella ChIP-seq enriched peaks.

    DOI: http://dx.doi.org/10.7554/eLife.22345.025

    elife-22345-supp2.xlsx (14.3KB, xlsx)
    DOI: 10.7554/eLife.22345.025
    Supplementary file 3. A list of primer sequences used in this study.

    DOI: http://dx.doi.org/10.7554/eLife.22345.026

    elife-22345-supp3.xlsx (42.6KB, xlsx)
    DOI: 10.7554/eLife.22345.026

    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES