Summary
Vertebrate eggs can induce the nuclear reprogramming of somatic cells to enable production of cloned animals. Nuclear reprogramming is relatively inefficient, and the development of the resultant embryos is frequently compromised, in part due to the inappropriate expression of genes previously active in the donor nucleus. Here, we identify H3K4 methylation as a major epigenetic roadblock that limits transcriptional reprogramming and efficient nuclear transfer (NT). Widespread expression of donor-cell-specific genes was observed in inappropriate cell types in NT embryos, limiting their developmental capacity. The expression of these genes in reprogrammed embryos arises from epigenetic memories of a previously active transcriptional state in donor cells that is characterized by high H3K4 methylation. Reducing H3K4 methylation had little effect on gene expression in donor cells, but it substantially improved transcriptional reprogramming and development of NT embryos. These results show that H3K4 methylation imposes a barrier to efficient nuclear reprogramming and suggest approaches for improving reprogramming strategies.
Keywords: nuclear transfer, reprogramming, epigenetic memory, H3K4me3, Kdm5b, endoderm, cell-fate stability
Graphical Abstract
Highlights
-
•
Nuclear transfer embryos retain the memory of a past state of active transcription (ON-memory)
-
•
ON-memory genes are enriched for H3K4 methylation in somatic donor nuclei
-
•
H3K4 demethylation improves transcriptional reprogramming
-
•
Removing H3K4 methylation enhances the development of nuclear transfer embryos
Hörmanseder et al. find that persistent memories of transcriptional activity in donor cell identity genes present a barrier to cell-fate reprogramming following nuclear transfer. They show that reducing H3K4 methylation in donor cells reduces transcriptional memory and improves the development of embryos derived by nuclear transfer.
Introduction
During development, cells lose their pluripotent status and acquire a stable cell identity, which only rarely, if ever, changes to another kind. Yet, somatic cells can be reprogrammed to another cell fate by nuclear transfer (NT) to eggs (Gurdon, 1960), by the expression of a combination of transcription factors (Takahashi and Yamanaka, 2006) or by cell-cell fusion (Blau et al., 1983). In these reprogramming procedures, the gene-expression pattern and epigenetic state characteristic of one differentiated cell identity is erased and the gene expression pattern specific to another cell type is established.
However, the efficiency of complete reprogramming via NT is low, as less than 10% of NT embryos generated from differentiated cells reach adulthood (Gurdon, 1960, Meissner and Jaenisch, 2006). This led to the hypothesis that differentiated cells acquire a resistance to reprogramming procedures, which during normal development, helps to stabilize their cell fate. Due to this resistance, eggs cannot fully reprogram the incoming somatic nuclei, so that embryos with aberrant gene expression patterns arise and normal embryonic development is not supported (Gao et al., 2003, Hirasawa et al., 2013, Ng and Gurdon, 2005). So far, it has been shown that a failure in reactivating genes, e.g., the pluripotency gene Oct4, during nuclear reprogramming is indicative of a poor developmental outcome of NT embryos (Boiani et al., 2002). Furthermore, epigenetic modifications inhibiting the re-activation of genes during the reprogramming procedure have been investigated and their removal has been utilized to improve reprogramming efficiency and to increase the viability of NT embryos (Blelloch et al., 2006, Chung et al., 2015, Enright et al., 2003, Kishigami et al., 2006, Liu et al., 2016, Matoba et al., 2014). However, the expression of donor cell-type-specific genes in the wrong cell type of NT embryos could also lead to a severe disruption of normal gene expression patterns resulting in developmental defects and embryonic lethality. Indeed, the existence of such an active transcription state memory has been suggested in NT and induced pluripotent stem cell (iPSC) experiments (Polo et al., 2010, Kim et al., 2011, Ng and Gurdon, 2005). Currently, however, the extent and functional importance of persistent donor-cell-type-specific gene expression in resistance to reprogramming is not known. Furthermore, the epigenetic mechanisms that confer memory of an active state of gene expression and that maintain the differentiated state of cells during nuclear reprogramming and embryonic development remain elusive.
Here we show that in Xenopus and human NT embryos, memory of an active transcriptional state (ON-memory) is a phenomenon as widespread as the memory of an inactive transcriptional state. ON-memory genes are associated with increased levels of the active histone mark H3K4me3 when compared to properly reprogrammed genes in Xenopus and human somatic donor cells. Importantly, while a reduction in H3K4 methylation levels has little effect on gene expression in the donor cells, it significantly improves transcriptional reprogramming and enhances the developmental potential of the resultant NT embryos in Xenopus. Our study thus identifies H3K4 methylation as a critical epigenetic barrier in NT-mediated reprogramming and implicates its role as stabilization mechanism of cell differentiation.
Results
Identification of Reprogramming Resistant ON-Memory Genes Expressed in the Wrong Cell Type of NT Embryos
The low success rate of current cloning strategies was suggested to be partly due to the persistence of a donor-cell-type-specific gene expression pattern in NT embryos, which could hinder the generation of new cell types (Firas et al., 2014, Liu et al., 2016, Matoba et al., 2014). As a first step to test this hypothesis, we evaluated the extent of memory gene expression in Xenopus NT embryos on a transcriptome-wide level.
For this purpose, the nucleus of a neurula-stage endoderm cell was transplanted to an enucleated egg to obtain NT embryos and as a control for normal gene expression, in vitro fertilized (IVF) embryos were generated (Figure 1A). Properly cleaved embryos were collected at the gastrula stage, a time point where ectoderm and endoderm identity is established and before any developmental defects can be observed in these NT embryos. Endoderm donor cells as well as ectoderm cells of single NT and IVF embryos were then subjected to RNA sequencing (RNA-seq) analysis in biological triplicate (Figures 1A and S1A–S1F; Tables S1 and S2). To test the extent of memory and reprogramming in the newly generated cell type, we addressed which transcripts differ between endoderm donor cells and ectoderm cells of IVF embryos. When the expression of these genes also differs between NT- and IVF- ectoderm cells, we consider them to be examples of donor cell memory (Figure S1A). If they are expressed at similar levels in NT and IVF, we consider them as reprogrammed (Figure S1B). Of all 24,215 identified transcripts (Figure 1B, in gray), a large number (17,587; Table S2) was differentially expressed between endoderm donor cells and ectoderm cells of IVF embryos. 13,083 of these genes were reprogrammed as they were expressed at similar levels in the ectoderm of NT and control IVF embryos (Table S2). In contrast, 4,504 genes were resistant to reprogramming as they were differentially expressed between ectoderm cells of NT and control IVF embryos (Figures 1B and 1C). This gene set included 1,534 ON-memory genes- these are genes that were expressed in donor endoderm cells and continued to be significantly (false discovery rate [FDR] ≤ 0.05) upregulated in NT ectoderm cells when compared to IVF ectoderm cells (Figures 1B and 1C, group 1). Another 1,346 of the same gene set are described as OFF-memory genes, because their transcripts were expressed at significantly (FDR ≤ 0.05) lower levels in ectoderm cells of NT embryos when compared to IVF controls (Figure 1B and 1C, group 4). The remaining 1,624 genes were either too much down- or upregulated in the ectoderm of NT embryos when compared to the IVF controls (Figure 1C, group 2 and group 3, respectively). We therefore see that a total of 2,880 ON-memory and OFF-memory genes are not reprogrammed by NT to eggs in Xenopus, and instead remember their donor cell expression pattern. Furthermore, this result suggests that NT embryos show endoderm ON-memory gene expression to the same extent as OFF-memory gene expression in the newly generated ectoderm cell type.
To obtain insight into biological processes associated with the inappropriately reprogrammed genes, we performed gene ontology analysis. This revealed that as a whole, reprogramming resistant genes are enriched for genes involved in development, transcriptional activity, and metabolic processes (Figure 1C). Importantly, we observed that the ON-memory gene set was enriched for genes implicated in endoderm development (Figure 1C, group 1). Furthermore, we found that master regulators of endoderm specification in Xenopus, such as sox17 and gata6, were among the ON-memory genes showing the highest upregulation in NT embryos when compared to IVF (Figure S1C). Hence, these results point toward retention of endoderm donor cell identity in the ectoderm cells of the NT embryos.
Next, we investigated the mechanism by which the ON-memory gene transcripts are accumulated in the ectoderm cells of the NT embryos. In Xenopus, there is no transcription for the first 12 cell cycles of embryonic development (Hörmanseder et al., 2013). Consistently, we did not observe gene expression of the ON-memory genes sox17β, gata6, and a2m (endodermin) at stage 7, prior to zygotic genome activation (ZGA; Figures S1G and S1H). This indicates that there was no carry-over of transcripts for these genes during NT, and that transcripts detected here were newly synthetized after ZGA. We therefore conclude that the memory of an active state of gene transcription of the donor nucleus was transmitted to its mitotic progeny during early embryonic cell divisions in the absence of the conditions that induced that state, and independently of ongoing gene transcription. It implies that the memory of the donor cell gene expression pattern observed in NT embryos is stabilized by epigenetic mechanisms.
ON-Memory Genes Are Enriched for H3K4me3 in Endoderm Donor Nuclei in Xenopus
We then investigated which epigenetic feature of the donor nuclei could account for the fact that ON-memory genes resist the reprogramming process and commence expression in the wrong cell type of NT embryos. Actively transcribed genes are characterized by the presence of methylated lysine 4 on histone H3 (H3K4me3) (Santos-Rosa et al., 2002). We hypothesized that accumulation of H3K4me3 following transcription of endoderm genes in donor cells could confer ON-memory gene-expression after NT.
Our transcriptome analysis identified reprogrammed-down genes that were active in endoderm donor cells (Figure 2A) but are reprogrammed and downregulated to IVF levels in the ectoderm of NT embryos (Figure 2B). ON-memory genes were initially expressed in endoderm donor cells at similar levels to reprogrammed-down genes (Figure 2A). However, they remained significantly upregulated in ectoderm cells of the NT embryo when compared to IVF (ON-memory; Figure 2B). Within the ON-memory group, a subset were especially resistant to reprogramming as they were more than 3-fold overexpressed (ON-memory(3FC); Figure 2B). Using these three sets of genes we tested if differences in H3K4me3 features could explain resistance to transcriptional reprogramming. We performed H3K4me3 chromatin immunoprecipitation sequencing (ChIP-seq) analysis on neurula-stage endoderm donor cells in biological duplicate (Figures 2C–2F and S2A–S2D). The intensity of the H3K4me3 ChIP-seq signal around the transcriptional start site (TSS) was significantly higher in ON-memory genes when compared to reprogrammed-down genes (Figures 2C and 2D). Previous studies suggested that broad H3K4me3 domains are linked to cell identity and transcriptional consistency (Benayoun et al., 2014), and since endoderm ON-memory genes are also enriched for endoderm lineage genes and maintain their transcriptionally active state even after the nuclear reprogramming procedure, we tested if ON-memory genes showed high H3K4me3 breadth. Indeed, when comparing the empirical cumulative distribution of H3K4me3 domain size spanning the TSS, ON-memory(3FC) genes showed significantly broader H3K4me3 domains than reprogrammed-down genes (Figure 2E). For example, the ON-memory genes gata6 and sox17β are marked by broader domains than the reprogrammed-down gene abhd4 (Figure 2F).
These results suggest that ON-memory genes are enriched for H3K4me3, as they show higher ChIP-seq intensity and broader domains of this mark when compared to reprogrammed-down genes in the endoderm donor cells. Increased H3K4me3 levels and breadth could act together as barrier to cell-fate changes and hence explain why the set of memory-ON genes are resisting the reprogramming process.
The Phenomenon of ON-Memory Is Conserved in Human NT Embryos, and ON-Memory Genes Are Enriched for H3K4me3 in Human Donor Cells
Our observations in Xenopus prompted us to investigate if ON-memory gene expression is conserved in human NT embryos and whether ON-memory genes, when compared to reprogrammed-down genes, are also enriched for H3K4me3 levels in the human donor cells. In previous studies (Chung et al., 2015, Matoba et al., 2014), epigenetic marks correlating with ON-memory genes were not addressed. Therefore, we obtained the published RNAseq datasets generated from pools of human eight-cell IVF embryos, as well as from eight-cell NT embryos using human dermal fibroblast (DFB-8) cell nuclei as donors (Chung et al., 2015).
Our transcriptome comparison between the donor DFB cells and IVF and NT embryos identified a set of 76 ON-memory genes that remained more than 5-fold upregulated in NT embryos when compared to IVF control (ON-memory(> 5FC); Figures 2G–2I). A set of 364 genes showed partial inactivation (ON-memory(2-5 FC)) (Figures 2G–2I) and a third set of 508 genes was efficiently downregulated in the NT-embryo when compared to IVF (reprogrammed-down; Figures 2I and 2J).
Therefore, our analysis of human NT embryo RNA-seq data suggests that the phenomenon of ON-memory gene expression is conserved in human NT embryos.
We then investigated the H3K4me3 ChIP-seq intensities of ON-memory and reprogrammed-down genes using publicly available H3K4me3 ChIP-seq datasets for NHDF cells, which are related to the DFB-8 cells used as donors to generate the NT embryos. In agreement with the results obtained for Xenopus, ON-memory genes showed increased H3K4me3 intensity around their TSS when compared to reprogrammed-down genes (Figure 2J). However, when comparing H3K4me3 domains breadth of ON-memory genes and reprogrammed-down genes, we could not observe a significant difference (Figure 2K).
These results suggest that also in human, high H3K4me3 levels at the TSS could confer ON-memory gene expression in NT embryos and hence act as barrier to nuclear reprogramming.
H3K4 Demethylation of Donor Nuclei Improves Transcriptional Reprogramming in Xenopus NT Embryos
Having established a correlation between ON-memory gene expression and H3K4me3 enrichment, we next asked whether this modification is responsible for resistance to reprogramming.
Hence, one-cell embryos were injected with mRNA encoding the H3K4-specific demethylase Kdm5bwt, or with Kdm5bci, the catalytic inactive version of the enzyme, and grown to neurula-stage (Figure 3A). Western blot analysis confirmed that Kdm5bwt-expressing embryos showed reduced H3K4me3 levels when compared to uninjected embryos (Figure 3B). Furthermore, H3K4me3 ChIP-RTqPCR analysis verified the reduction in H3K4me3 levels following Kdm5bwt treatment around the TSS and in the gene body of candidate ON-memory genes (Figure S2E). We then used Kdm5bwt- and Kdm5bci-expressing neurula-stage endoderm cells as donors to generate NT(Kdm5bwt) embryos and NT(Kdm5bci) embryos, respectively (Figure 3A). As controls, we in vitro fertilized embryos. We collected gastrula stage embryos and subjected them, as well as the endoderm donor cells, to RNA-seq analysis (Figure 3A; Tables S1 and S3).
First, we addressed the effect of H3K4 demethylation on gene expression in donor cells. Interestingly, we identified that only 102 out of 24,758 identified transcripts were differentially expressed between Kdm5bWT and Kdm5bci treated donor cells (Table S3). Therefore, changes in H3K4 methylation do not result in strong changes of gene expression levels, as reported previously (Clouaire et al., 2012). Second, we addressed if H3K4 methylation is important for stabilizing an active state of gene expression by evaluating if genes lose resistance to reprogramming as well as their ON-memory state following Kdm5bwt treatment of the donor cell. Transcriptome comparison of Kdm5bci expressing endoderm donor cells, the ectoderm of IVF embryos and NT(Kdm5bci) embryos identified 1,434 reprogramming resistant genes as they were differentially expressed between ectoderm cells of NT(Kdm5bci) and IVF embryos (Figure 3C). By comparison, in NT(Kdm5bwt) embryos, the number of reprogramming resistant genes was substantially reduced, as our analysis identified only 573 differentially expressed genes between the ectoderms of NT(Kdm5bwt) embryos and IVF embryos (Figure 3D). Importantly, Kdm5bwt treatment of the donor cells significantly reduced ON-memory gene expression from 231 ON-memory(3FC) genes in NT(Kdm5bci) embryos (Figure 3C) to 140 ON-memory(3FC) genes in NT(Kdm5bwt) embryos (Figure 3D). While the expression levels of ON-memory genes in the donor tissues was unaffected by the treatment with Kdm5bwt when compared to Kdm5bci, we observed a significant reduction of average ON-memory gene expression levels in NT(Kdm5bwt) embryos when compared to NT(Kdm5bci) embryos (Figures 3E and 3F). Hierarchical clustering (Figure 3G), as well as principal component analysis (PCA) (Figures S2F and S2G) of the transcriptome revealed that three out of seven NT(Kdm5bwt) embryos have a gene expression pattern in their ectoderm cells that is more similar to the one of IVF embryos than to the one of control NT(Kdm5bci) embryos.
This implicates that expression of Kdm5bwt in the donor cell reduces ON-memory gene expression and is able to improve the whole transcriptome of the resultant NT embryos.
Next, we analyzed the expression of selected candidate ON-memory genes (sox17β, gata6, foxA4, a2m, and darmin) during gastrulation of NT and IVF embryos via qRT-PCR. In donor endoderm cells, ON-memory gene expression was not affected by H3K4 demethylation (Figures S2H–S2L). We observed that ON-memory genes were upregulated in NT(Kdm5bci) ectoderm cells at all stages of gastrula embryos (Figures S2M–S2Q). While sox17β, gata6 and foxA4 showed a decrease in gene expression in NT(Kdm5bwt) ectoderm cells at all stages (Figures S2M–S2O), a2m and darmin were insensitive to Kdm5bwt treatment of the donor cell as they did not show a significant reduction in gene expression in the resultant NT(Kdm5bwt) ectoderm cells when compared to NT(Kdm5bci) ectoderm cells (Figures S2P–S2Q). We propose that this is due to an additional, unknown epigenetic barrier other than H3K4me3, as we could observe that H3K4me3 levels were reduced at the TSS and at the gene body of darmin to a similar extent as of the other, Kdm5bwt sensitive ON-memory genes (Figure S2E). These results corroborate that Kdm5bwt treatment can reduce ON-memory gene expression in the resultant NT embryos throughout gastrulation.
Finally, we confirmed that the observed reduction in ON-memory gene expression following Kdm5bwt expression is indeed due to the demethylation of H3K4. We reduced H3K4me3 levels in the donor cells by expressing a dominant-negative version of histone H3.3 (H3.3K4M) that binds and inhibits the SET domain of H3K4-specific methyltransferases (Lewis et al., 2013). Transcriptome comparison between donor cells, the ectoderm cells of IVF embryos and of the NT(H3.3wt) embryos or NT(H3.3 K4M) embryos corroborated the finding that a reduction of H3K4 methylation decreases ON-memory gene expression in NT embryos (Figure S3; Table S3).
Together, our data show that a H3K4 demethylation of donor nuclei not only improves the reprogramming of ON-memory genes but can also restore the global transcriptome of NT embryos.
H3K4 Demethylation in Donor Nuclei Improves the Development of NT Embryos
Finally, we investigated whether H3K4 demethylation in donor cells and the associated reduction in ON-memory gene expression in the resultant NT embryos are able to improve their survival.
We generated NT embryos from Kdm5bwt- and Kdm5bci-treated donor cells, and while they developed to properly cleaved early gastrula embryos at a similar rate (Figures 4A and 4B; Table S1), NT(Kdm5bwt) embryos showed fewer morphological abnormalities when compared to NT(Kdm5bci) embryos (Figures 4A and 4B; Table S1) as development proceeded. Notably, NT(Kdm5bwt) embryos reached the feeding tadpole stage and beyond at a significantly higher rate than NT(Kdm5bci) embryos (Figure 4C). The developmental potential of NT(Kdm5bci) embryos was consistent with previous results using uninjected endoderm cells as donors for NT (Gurdon, 1960), as 30% of cleaved NT(Kdm5bci) embryos reached the feeding tadpole stage (Figures 4B and 4C). Instead, the developmental outcome of NT(Kdm5bwt) embryos was comparable to the one of NT embryos generated from undifferentiated blastula cells (Gurdon, 1960), as 60% of all cleaved embryos reached the feeding tadpole stage (Figures 4B and 4C).
These results show that H3K4 methylation acts as a barrier to nuclear reprogramming, and that its removal significantly improves the developmental potential of NT embryos.
Discussion
Our study identifies H3K4 methylation as an epigenetic barrier to nuclear reprogramming and suggests it as a safeguarding mechanism for cellular identity. Challenging the stability of cell identity through NT reveals epigenetic mechanisms that inhibit the activation of genes supporting alternative cell fates (Chung et al., 2015, Liu et al., 2016, Matoba et al., 2014). Here, we describe an epigenetic layer that prevents the inactivation of genes during nuclear reprogramming and ensures the stable expression of genes characteristic of an established cell identity. We observe that H3K4 methylation imposes memory of an active transcriptional state and that its suppression results in improved transcriptional reprogramming and an enhancement of the developmental outcome of Xenopus NT embryos. Therefore, our study shows that interfering with an epigenetic barrier and the associated ON-memory can improve the generation of new cell types by reprogramming via NT.
By transplanting differentiated nuclei to eggs, we uncover a function of H3K4 methylation in epigenetic memory of cell fate that extends beyond ensuring transcriptional consistency and maintaining ongoing gene transcription (Benayoun et al., 2014). Indeed, reduction of H3K4 methylation by either demethylase or histone mutant expression has very little effect on endodermal cell transcription in fertilized embryos. However, the role of H3K4me3 in stabilizing a transcriptional program becomes most evident when the endoderm cell chromatin undergoes reprogramming by the egg: The early phase of frog embryogenesis encompasses a period of intense cell division in the absence of transcription. After ZGA, when cells are again permissive for transcription, H3K4me3 can induce endoderm ON-memory gene expression in ectoderm cells of cloned embryos. Hence we can differentiate in our experimental system the function of H3K4me3 in simply maintaining ongoing transcription, as observed in pluripotent cells (Muramoto et al., 2010), from its function as an epigenetic memory factor of somatic cell identity.
The stabilization of an active transcriptional state correlates with increased intensity and breadth of H3K4 methylation around these genes, as for example around the endoderm lineage genes sox17β and gata6. During DNA replication, the modified nucleosomes are locally redistributed between the two daughter strands (Probst et al., 2009) and broad H3K4me3 domains of ON-memory genes could ensure that the mark on these key lineage genes is faithfully propagated to the mitotic progeny, even when the chromatin is challenged by the egg’s reprogramming factors and in the absence of gene transcription. Our results underline the importance of H3K4me3 as a safeguarding mechanism of cell identity.
ON-memory gene expression is conserved in Xenopus and human NT embryos and correlates with increased H3K4me3 levels in the somatic donor cell nuclei. It is likely that also in mammals, H3K4 methylation imposes a barrier to nuclear reprogramming and its removal enhances the efficiency of cloning. Interestingly, recent studies in mouse NT embryos suggest that a failure in reactivating Kdm5b expression during reprogramming correlates with a poor developmental outcome (Liu et al., 2016), which would support that NT embryos with reduced Kdm5b levels might not be able to efficiently erase H3K4 methylation mediated ON-memory. However, studies also show that Kdm5b knock down in IVF embryos results in aberrant major ZGA (Dahl et al., 2016) and embryonic lethality. It is currently unknown if the developmental failure of mouse NT embryos is due to the normal requirement of Kdm5b for ZGA or if it is due to persistent ON-memory gene expression. In our work, we erase H3K4 methylation marks in the donor cells, and leave the H3K4me3 demethylation activities of the NT embryo unperturbed, and hence are able to show that ON-memory indeed acts as barrier to nuclear reprogramming. We propose that both, ON-memory gene expression due to persistent active marks and as well as a lack in Kdm5b activities, which are important for ZGA and development (Dahl et al., 2016), contribute to embryonic lethality in mouse NT embryos.
Once the essential epigenetic barriers conferring ON- and OFF-memory are identified, they can be targeted to improve reprogramming efficiencies and allow the generation of high quality stem cells suitable for cell replacement therapies.
STAR★Methods
Key Resources Table
REAGENT or RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Antibodies | ||
H3K4me3 | Abcam | ab8580; RRID: AB_306649 |
HA | Sigma | H9658; RRID: AB_260092 |
H4 | Abcam | ab31830; RRID: AB_1209246 |
Chemicals, Peptides, and Recombinant Proteins | ||
Magnetic beads conjugated with secondary antibody | Invitrogen | 11204D |
Critical Commercial Assays | ||
TruSeq RNA library prep kit | Illumina | RS-122-2001 |
TruSeq DNA kit | Illumina | FC-121-2001 |
Deposited Data | ||
Raw and analyzed data | This paper | GEO: GSM733650 |
Experimental Models: Organisms/Strains | ||
Xenopus laevis wild type, mature females | Nasco | LM00535MX |
Xenopus laevis wild type, mature males | Nasco | LM00715MX |
Recombinant DNA | ||
pCS2+ Kdm5b aa1-770- NLS-6HA | This paper | accession number NM_152895 |
pCS2+ Kdm5b H499A aa1-770- NLS-6HA | This paper | accession number NM_152895 |
pCS2+ H3.3-6HA | This paper | accession number NM_001098432 |
pCS2+ H3.3K4M-6HA | This paper | accession number NM_001098432 |
Sequence-Based Reagents | ||
RTqPCR primers (Table S4) | Sigma | NA |
Software and Algorithms | ||
Sickle | https://github.com/najoshi/sickle | https://github.com/najoshi/sickle |
cutadapt 1.0 | Martin, 2011 | https://pypi.python.org/pypi/cutadapt/1.0 |
TopHat 2.0.6 | Trapnell et al., 2009 | https://ccb.jhu.edu/software/tophat/index.shtml |
BWA (version 0.6.2) | Li and Durbin, 2009 | https://sourceforge.net/projects/bio-bwa/files/ |
samtools 0.1.8 | Li et al., 2009 | |
edgeR | Robinson et al., 2010 | https://bioconductor.org/packages/release/bioc/html/edgeR.html |
bedtools (version 2.25.0) | Quinlan and Hall, 2010 | https://github.com/arq5x/bedtools2/releases |
R version 3.2.4 | https://www.R-project.org/ | https://www.R-project.org/ |
gplots package version 3.0.1 | https://CRAN.R-project.org/package=gplots | https://CRAN.R-project.org/package=gplots |
Contact for Reagent and Resource Sharing
Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Eva Hörmanseder e.hoermanseder@gurdon.cam.ac.uk
Experimental Model and Subject Details
Mature Xenopus laevis males and females were obtained from Nasco (901 Janesville Avenue, PO Box 901, Fort Atkinson, WI 53538-0901; https://www.enasco.com/xenopus). Our work with Xenopus laevis is covered under the Home Office Project License PPL 70/8591 and frog husbandry and all experiments were performed according to the relevant regulatory standards. Animals were maintained in a recirculating fresh water system (Marine Biotech) at a density of one adult/3l, with 10% water change per day and temperatures ranging from 16°C to 20°C. Water was sequentially filtered with mechanical pad sump filter, nitrifying bacteria filter, mechanical canister filter, carbon filter, and UV sterilized. Water quality parameters were as follow: conductivity 1500us; temperature 17-22°C; PH 6-8. Photoperiod was set to 12h ON/12h OFF. Frogs are fed twice per week with Royal Horizon 4.5mm pellets (skretting, https://www.skrettingfishfeeds.co.uk/). Unconsumed food was removed 10 min after the start of feeding. All material used for this work involves killing of testis-donating frogs by an overdose of anesthetic. Females are injected with hormones (50 units pregnant mare serum gonadotropin, 3 days in advance of egg laying, and 500 units human chorionic gonadotropin, 1 day in advance of egg laying) in the dorsal lymph sack to induce natural ovulation and egg laying in 1xMMR (100mM NaCl, 2mM KCl, 1mM MgSO4, 2mM CaCl2, 0.1mM EDTA, 5mM HEPES (pH 7.8). After egg laying, frogs underwent a health check by a veterinarian and were given a resting period of at least 3 months before re-use. These procedures were of minimal invasiveness and did not cause stress or suffering to the animal. The researchers and the staff of the Gurdon Institute animal husbandry facility are trained in these experiments, and veterinarians monitor the health status of the animals.
Method Details
mRNA production
Mouse Kdm5b (accession number NM_152895, aa1-770) and its catalytic inactive (ci) mutant (H499A; aa1-770), both with an C-terminal NLS-tag, as well as Xenopus H3.3 (accession number NM_001098432) and its dominant-negative mutant (K4M) constructs were sub-cloned into pCS2+ plasmid with 6 C-terminal HA-tags using the gateway cloning system (Thermo Fisher Scientific). mRNA was synthetized in vitro using MEGAscript SP6 Kit (Ambion, AM1330M) following the manufacturer’s instructions.
mRNA injection into one-cell embryos
Eggs were in vitro fertilized, dejellied using 2% Cystein solution in 0.1xMMR, pH 7.8, washed 3 times with 0.1x MMR and transferred into 0.5x MMR for injections. For Kdm5b wild-type and catalytic inactive mutant, 13.6 ng of mRNA was used per injection. For H3.3WT and for H3.3K4M 0.2ng and 1.25ng mRNA, respectively, was used per injection to obtain equal expression levels of the proteins in embryos. Embryos were cultured at 23°C and collected at neurula stage 18 (Kdm5b experiments) or stage 21 (H3.3 experiments) (Nieuwkoop and Faber, 1994) to prepare endoderm donor cells for nuclear transfer.
Donor cell preparation
Endoderm cells were isolated from endoderm tissues of the respective neurula stage embryos (stage 18 for Kdm5b experiments; stage 21 for experiments shown in Figures 1 and 2 or the H3.3 experiments, please also see above) and frozen on dry ice for further analysis or dissociated in calcium- and magnesium-free modified Barth saline (1xMBS; 88mM NaCl, 1mM KCl, 10mM HEPES, 2.5mM NaHCO3, pH to 7.4.) with 1 mM EDTA and 0.1% BSA in a petri dish covered with 1% agarose in H2O and used immediately for nuclear transplantation.
Nuclear transfer and embryo culture
The procedure was carried out as described previously (Gurdon et al., 1958). In brief, dissociated endoderm cells were mildly disrupted by pipetting them up and down gently in a glass micropipette. Nuclear transplantation was performed by injection of a whole permeabilized cell into an egg enucleated for 30 s with a UV mineralite lamp and dejellied by a 5 s Hanovia lamp treatment. Nuclear transfer was performed within the next minute. The nuclear transplant embryos were placed into 1x MBS 0.1% BSA. At the 4-cell stage, the medium was exchanged to 0.1xMBS. As control, embryos were in vitro fertilized and the embryos were cultured at 16°C in 0.1x MBS until they reached stage 7 or stage 11. For all our analyses, completely cleaved NT embryos were selected at stage 7, stage 10 or 11 that were morphologically indistinguishable from IVF embryos at the same stage. For experiments analyzing gene expression at stage 11, NT and IVF embryos with the same blastopore size and cell size were selected to ensure that they are all at the same developmental stage, and the animal cap cells (ectoderm) were isolated and frozen on dry ice for further analysis. To score the developmental outcome, embryos were cultured in 0.1xMBS at 16°C until neurula stage and then at room temperature until they reached the desired developmental stages, which were determined according to the developmental table of Nieuwkoop and Faber (Nieuwkoop and Faber, 1994) and counted.
RNA extraction
Embryonic tissues were selected and isolated as described above, dissected as indicated and frozen at −80°C. RNA extractions were performed using QIAGEN RNeasy Mini kit (QIAGEN, 74106) according to the manufacturer’s protocol including the DNase step. RNA was eluted in 40ul of DEPC H2O.
cDNA sequencing library
RNA quality and quantity was analyzed on a RNA screen tape (Agilent) using RNA sample buffer (Agilent) on a Agilent 2200 tape station. Per sample, 500 ng RNA was used to generate a cDNA sequencing libraries using a Illumina TrueSeq kit (RS-122-2001), according to the manufacturer’s protocol using 12 PCR amplification cycles.
cDNA synthesis and RTqPCR analysis
cDNA synthesis was performed from the isolated RNAs using oligo dT(15) primers. RTqPCRs were performed with 5 μL cDNA and gene specific primers at 50 nM (Primer sequences are listed in Table S4) using a SybrGreen detection system (Sigma, S9194) and ABI 7300 machine (Applied Biosystems) using standard ABI cycling conditions (two-step PCR cycle: 94°C for 15 s and 60°C for 60 s). Reactions were performed in a total volume of 25 μl.
Western Blotting
Expression of mRNAs in embryos as well as the reduction of H3K4me3 levels was confirmed by western blot analyses. Briefly, embryonic tissues were homogenized in 50 μl buffer E1 (50mM HEPES-KOH pH 7.5, 140mM NaCl, 1mM EDTA pH 8.0, 10% Glycerol, 0.5% Igepal CA-630, 0.25% Triton X-100, 1mM DTT, complete protease inhibitors (Roche)) and then the chromatin was collected by centrifugation at 1600 g for 5 min. The supernatant was kept, the chromatin pellet was washed 3 times with 0.5ml Buffer E1 and then solubilized in 50 μL Emilie’s Buffer (500mM Tris pH 6.8, 500mM NaCl, 1% NP40, 0.1% SDS, 1% β-Mercaptoethanol). Laemmli sample buffer was added to the lysate and chromatin fractions, samples were separated on a 15% polyacrylamide gel and transferred to Hybond-P membranes (Amersham Bioscience). Antibodies against H3K4me3 (Abcam ab8580), HA (Sigma, H9658) or histone H4 (Abcam ab31830) were used for western blotting according to standard protocols and suppliers recommendations.
Chromatin Immunoprecipitation (ChiP)
Chromatin Immunoprecipitation (ChIP) were performed as described previously (Gentsch and Smith, 2014) with the following modifications. Neurula (stage 18) embryos were generated by in vitro fertilization. For each ChIP experiment, 50 embryos were dissected in 1xMBS to obtain the endoderm tissue. Samples were fixed in 2 mL of 1% Formaldehyde in 0.1x MMR for 25 min at room temperature, followed by 4 washes with 1 mL 0.1xMMR and equilibration in 500 μL HEG solution (50mM HEPES-KOH pH 7.5, 1 mM EDTA, 20% Glycerol) at 4°C, then excess buffer was removed and samples were frozen at −80°C. To extract chromatin, the samples were homogenized in 2 mL buffer E1 (50mM HEPES-KOH pH 7.5, 140mM NaCl, 1mM EDTA pH 8.0, 10% Glycerol, 0.5% Igepal CA-630, 0.25% Triton X-100, 1mM DTT, complete protease inhibitors (Roche)). Chromatin was collected by centrifugation for 2 min at 3500 rmp, 4°C and then washed two times with 2 mL E1, three times with 2ml buffer E2 (10mM Tris pH 8.0, 200mM NaCl, 1mM EDTA, 0.5mM EGTA, complete protease inhibitors (Roche)) and three times with 500 μL Buffer E3 (10mM Tris pH 8.0, 200mM NaCl, 1mM EDTA, 0.5mM EGTA, 0.1% Na-deoxycholate, 0.5% N-lauroylsarcosine, complete protease inhibitors (Roche)). Chromatin was fragmented by sonication for 20 cycles (30 s on and 30 s off) using a Bioruptor (Diagenode) at 4°C. The samples were centrifuged at 15min, 4°C at full speed, the supernatant was collected and Triton X-100 was added to 1%. 25 μL of the solution were put aside to serve as Input for later analysis. Before ChIP, primary anti H3K4me3 (Abcam ab8580, 0.5 μg per 50 embryos) antibodies were bound to PBS washed magnetic beads conjugated with secondary antibody (Invitrogen 11204D, 25 μL per 50 embryos) in 500 μL 1xPBS 0.1% BSA overnight at 4°C on a rotating wheel. Beads were washed 3 times with 1xPBS 0.1% BSA, added to the fragmented chromatin solution and incubated overnight at 4°C on a rotating wheel. Beads were then washed 6 times with RIPA buffer (50mM HEPES-KOH pH 7.5, 500mM LiCl, 1mM EDTA, 1% Igepal CA-630, 0.7% Na-deoxycholate, complete protease inhibitors (Roche)) and twice with TEN buffer (10mM Tris pH 8.0, 1mM EDTA, 150mM NaCl, complete protease inhibitors (Roche)) for each 10 min. For crosslink reversal, the beads were resuspended in 150 μL Stop buffer (40mM Tris pH 8.0, 10mM EDTA, 1% SDS) and 125 μL Stop buffer was added to the input fraction. The samples were supplemented with Proteinase K (0.3 μg/μl), NaCl (250 mM) and incubated at 65°C overnight. RNase A (DNase free) was added to a final concentration of 200 μg/μl and DNA was Phenol/Chloroform extracted. 150 μg/μl Glycogen was added and DNA was recovered by Ethanol precipitation. The pellet was resuspended in 30 μL H2O.
ChIP-seq library preparation
Half of a ChIP reaction (15 μl, see above) were subjected for ChIP-seq library preparation with the TruSeq DNA kit (Illumina, FC-121-2001). Two independent biological replicates were generated for each H3K4me3 ChIP experiments.
ChIP-RTqPCR
The ChIP reaction (see above) was diluted 1:40 and 5 μL were used for subsequent RTqPCR analysis using primer pairs described in Table S4 at 50nM together with a SybrGreen detection system (Sigma, S9194) and ABI 7300 machine (Applied Biosystems) using standard ABI cycling conditions (two-step PCR cycle: 94°C for 15 s and 60°C for 60 s). Reactions are performed in a total volume of 25 μl.
Experimental design
In all experiments analyzing gene expression, one sample was taken per embryo (i.e., one sample corresponds to one individual embryo). For the analysis of ON-memory gene-expression, 3 independent experiments (here defined as 3 biological replicates) were performed. In experiment#1, 4 IVF-, 3 NT-ectoderm samples and 1 endoderm donor sample, in experiment#2, 4 IVF-, 4 NT-ectoderm samples and 1 endoderm donor sample, in experiment#3, 3 IVF-, 5 NT-ectoderm samples and 1 endoderm donor sample were generated (see Table S1). To address the effect of Kdm5b, 2 independent experiments (here defined as two biological replicates) were performed. In experiment#4, 4 NT(Kdm5bwt)- and 4 NT(Kdm5bci) - ectoderm samples and 2 NT(Kdm5bwt)- and 2 NT(Kdm5bci)- endoderm donor cell samples and in experiment#5, 3 NT(Kdm5bwt)- and 4 NT(Kdm5bci) - ectoderm samples and 2 NT(Kdm5bwt)- and 2 NT(Kdm5bci)- endoderm donor cell samples were taken (see Table S1). For the H3.3K4M analysis, one experiment#6 (one biological replicate) was performed with 4 IVF ectoderm samples, 4 NT(H3.3wt)- and 4 NT(H3.3K4M)- ectoderm samples and 2 NT(H3.3wt)- and 2 NT(H3.3K4M) - endoderm donor cell samples.
Randomization and Blinding: When the donor cells were treated, the order of nuclear transfer of control- and treated donor nuclei was alternated; The experimenter was unaware of the treatment of the donor cell while performing the nuclear transfer. When embryos were selected for analysis, healthy looking embryos (i.e., morphologically indistinguishable from IVF-embryos) with the same blastopore size were selected. A sample size was chosen that allowed the significant identification of differentially expressed genes (see quantification and statistical data analysis section below), and that also considered the loss of a third of the samples due to inefficient RNA extraction or a failure in library generation. Samples were excluded that showed poor RNA quality (RIN below 7), quantity (below 500ng) or that did not result in a product after performing the library preparation protocol. Furthermore, after sequencing, the raw reads were clustered using WardD, and out of 6 experiments, 3 experiments contained outliers: exp#1: two NT-samples; exp#3: one IVF-sample and exp#5: one NT(Kdm5bwt)- sample. These 4 samples were excluded from further DE gene expression analysis.
For ChIP experiments, 2 independent experiments (here referred to as 2 biological replicates) were performed. Per ChIP experiment, the endoderm tissues of 50 Kdm5bwt-, 50 Kdm5bci- expressing embryos, as well as 50 uninjected embryos were pooled.
For the quantification of the developmental outcome, 6 independent experiments (n) were performed, and the total number of gastrulae (N) was determined for each condition: 69 NT(Kdm5bci)- and 56 NT(Kdm5bci)- and 152 IVF- gastrula embryos. Gastrula embryos were selected, that were morphologically indistinguishable from IVF embryos at the same stage. Here, the biological replicates refer to the number of gastrula embryos quantified. Their development was followed, and surviving embryos were counted for each condition at the indicated stages. Randomization and Blinding: Per experiment, each condition was performed and the order was alternated; The experimenter was unaware of the treatment of the donor cell while performing the nuclear transfer and the quantification of the survival.
Quantification and Statistical Analysis
Xenopus transcriptome and sequencing data
We used the Xenopus laevis annotation that was generated for (Teperek et al., 2016). RNA-seq and ChIP-seq libraries were sequenced on an Illumina HiSeq 2000 instrument in single read mode at 36 base length. Fastq files were filtered for low quality reads (< Q20) using sickle and low quality bases were trimmed from the ends of the reads (< Q20). Adapters were removed using cutadapt 1.0 (Martin, 2011). RNA-seq data were mapped against the Xenopus laevis genome using TopHat 2.0.6 (Trapnell et al., 2009) - Xenopus laevis genome (JGI version 6.1) was used for all analyses in this paper, and can be downloaded here: ftp://ftp.xenbase.org/pub/Genomics/JGI/Xenla6.1/.). ChIP-seq data mapped against X. laevis (version 6.1) with BWA (version 0.6.2) (Li and Durbin, 2009). Duplicate reads in ChIP-seq were then filtered out with samtools 0.1.8 (Li et al., 2009). After this step, our input dataset contained more than 18 M uniquely mapped reads, and our IP samples more than 10 M uniquely mapped reads.
Xenopus differential gene expression
For the expression profiling, read counts were generated for each of the transcripts. RPKMs (reads per kilobase per million) were calculated by normalizing read counts for each transcript by the transcript length and the total number of reads in the corresponding sample. Counts per million (CPM) and differentially expressed (DE) transcripts were called using edgeR (Robinson et al., 2010). For the analysis of the extent of transcriptional memory in Figures 1 and 2, transcripts remained in the analysis if they had CPM > 1 in either all of the Donor or 70% of IVF or 70% of NT embryo samples. The log2 fold change (logFC) and the false discovery rate (FDR) was calculated comparing 11 IVF ectoderm samples, 12 NT ectoderm samples and 3 endoderm donor cell samples from 3 independent experiments (see Table S1). To address the effect of Kdm5b, the DE analysis was performed on 8 IVF ectoderm samples, 7 NT(Kdm5bwt)- and 8 NT(Kdm5bci) - ectoderm samples and 4 NT(Kdm5bwt)- and 4 NT(Kdm5bci)- endoderm donor cell samples from 2 independent experiments (see Table S1). The H3.3K4M DE analysis was performed on 4 IVF ectoderm samples, 4 NT(H3.3wt)- and 4 NT(H3.3K4M)- ectoderm samples and 2 NT(H3.3wt)- and 2 NT(H3.3K4M) - endoderm donor cell samples from one experiment.
Human differential gene expression
Publicly available datasets of the gene expression analysis were obtained from the Gene Expession Omnibus (GEO) DataSets, accession number GSE73362 (human) (Chung et al., 2015). The logFCs in gene expression levels were calculated in R.
DE data filter-strategy
X.Laevis, log2 fold changes (logFC) and false discovery rate (FDR) were calculated by using the R package EdgeR. These lists of transcripts were then additionally filtered the following way (note that in the Xenopus analysis “3FC” corresponds log2FC < 1.5 or log2FC > 1.5, which is more precisely a 2.8285 fold change, and approximately a 3 fold change):
DE transcripts Donor/IVF: FDRDonor/IVF < 0.05; DE between Donor/IVF and NT/IVF: FDRDonor/IVF < 0.05&FDRNT/IVF < 0.05. ON-memory: FDRDonor/IVF < 0.05, logFCDonor/IVF > 0, FDRNT/IVF < 0.05, logFCNT/IVF > 0, RPKMDonor > 1; ON-memory(3FC): FDRDonor/IVF < 0.05, logFCDonor/IVF > 0, FDRNT/IVF < 0.05, logFCNT/IVF > 1.5, RPKMDonor > 1; OFF-memory: FDRDonor/IVF < 0.05, logFCDonor/IVF < 0, FDRNT/IVF < 0.05, logFCNT/IVF < 0; OFF-memory(3FC): FDRDonor/IVF < 0.05, logFCDonor/IVF < 0, FDRDonor/NT < 0.05, logFCDonor/NT < 0, FDRNT/IVF < 0.05, logFCNT/IVF < −1.5; Reprogrammed-down: FDRDonor/IVF < 0.05, logFCDonor/IVF > 0, FDRDonor/NT < 0.05, logFCDonor/NT > 0, RPKMDonor > 1; transcripts with FDRNT/IVF < 0.05 were excluded. Note that transcripts that were transcribed in the Donor (RPKM > 1 in all Donor samples) but not in IVF and NT (RPKM < 1 in some or all samples) were kept in the analysis and considered as ON-reprogrammed as they were successfully downregulated during reprogramming. Reprogrammed-up: FDRDonor/IVF < 0.05, logFCDonor/IVF < 0, FDRDonor/NT < 0.05, logFCDonor/NT < 0, FDRNT/IVF > 0.05. Reprogrammed: FDRDonor/IVF < 0.05, FDRDonor/NT < 0.05, transcripts with FDRNT/IVF < 0.05 were excluded.
Gene ontology terms over-represented among the differentially expressed genes were found using topGO (Alexa et al., 2006).
In the Kdm5b experiments, each of the two experiments was filtered separately and then the generated lists were intersected (Figures 3, S3, and S4; Tables S2 and S3). DE transcripts between Kdm5bwt versus Kdm5bci expressing donor and H3.3wt versus H3.3K4M expressing donor cells were identified by filtering for FDR < 0.05 and then excluded from further analysis. The different gene sets upon Kdm5bci treatment were filtered the following way:
DE transcripts Donorci/IVF: FDRDonorci/IVF < 0.05; DE between Donorci/IVF and NTci/IVF: FDRDonorci/IVF < 0.05&FDRNTci/IVF < 0.05. ON-memory: FDRDonorci/IVF < 0.05, logFCDonorci/IVF > 0, FDRNTci/IVF < 0.05, logFCNTci/IVF > 0, RPKMDonorci > 1; ON-memory(3FC): FDRDonorci/IVF < 0.05, logFCDonorci/IVF > 0, FDRNTci/IVF < 0.05, logFCNTci/IVF > 1.5, RPKMDonocir > 1; OFF-memory: FDRDonorci/IVF < 0.05, logFCDonorci/IVF < 0, FDRNTci/IVF < 0.05, logFCNTci/IVF < 0; OFF-memory(3FC): FDRDonorci/IVF < 0.05, logFCDonorci/IVF < 0, FDRDonorci/NT < 0.05, logFCDonorci/NT < 0, FDRNTci/IVF < 0.05, logFCNTci/IVF < −1.5; Reprogrammed-down: FDRDonorci/IVF < 0.05, logFCDonorci/IVF > 0, RPKMDonorci > 1; transcripts with FDRNT/IVF < 0.05 were excluded. Note that transcripts that were transcribed in the Donor (RPKM > 1 in all Donor samples) but not in IVF and NT (RPKM < 1 is some or all samples) were kept in the analysis and considered as ON-reprogrammed as they were successfully downregulated during reprogramming. Reprogrammed-up: FDRDonorci/IVF < 0.05, logFCDonorci/IVF < 0, FDRNTci/IVF > 0.05. Reprogrammed: FDRDonorci/IVF < 0.05, transcripts with FDRNTci/IVF < 0.05 were excluded
The different gene sets upon Kdm5bwt, H3.3wt and H3.3K4M treatment were filtered following the same strategy as above.
In human and mouse, the values for log2 FC (logFC) were filtered using R. These lists of transcripts were then additionally filtered the following way:
DE transcripts between Donor/IVF: FCDonor/IVF > 5; DE between Donor/IVF and NT/IVF: FCDonor/IVF > 5 and FCNT/IVF > 5. ON-memory(2-5FC): logFCDonor/IVF > 2.3, 1 < logFCNT/IVF < 2.3, RPKMDonor > 1; ON-memory(5FC):, logFCDonor/IVF > 2.3, logFCNT/IVF > 2.3, FPKMDonor > 1; OFF-memory(2-5FC): logFCDonor/IVF < −2.3, −2.3 < logFCNT/IVF < −1; OFF-memory(5FC): logFCDonor/IVF < −2.3, logFCNT/IVF < −2.3; Reprogrammed-down: logFCDonor/IVF > 2.3, RPKMDonor > 1; transcripts with logFCNT/IVF > 1 were excluded. Note that genes that were transcribed in the Donor (FPKM > 1 in all Donor samples) but not in IVF and NT (FPKM < 1 in some or all samples) were kept in the analysis and considered as ON-reprogrammed as they were successfully downregulated during reprogramming. Reprogrammed-up: logFCDonor/IVF < −2.3, logFCNT/IVF > −1. Reprogrammed: union of ON- and OFF- reprogrammed transcripts.
Heatmaps and plots for gene expression
Heatmaps. The log2 fold change was calculated over the mean IVF expression level or over the mean pooled donor, IVF and NT expression level, as indicated in figure legends. These values were plotted on a heatmap and clustered by rows only or by both rows and columns, as shown in figure legends, using heatmap.2 (from R package gplots) using default settings (which is complete as agglomeration method and Euclidean distance as similarity measure). MA plot. The log2 FC in expression of transcripts between NT- to IVF-embryos was plotted against the average donor cell gene expression (log2(RPKM+1)). Box-plots show distribution of mean gene expression levels of the different sets of transcripts. The middle line in the box indicates the median, the box edges indicate the 25th/75th percentiles, the whiskers indicate the min and max.
Differences in gene expression levels between pairwise sets of genes were tested using Mann-Whitney test (equivalent to Wilcoxon rank sum test. R, wilcox.test(alternative = c(“two.sided”), paired = F)).
qPCR analysis
The indicated genes were quantified using a standard curve of embryonic cDNA (gene expression) or Xenopus genomic DNA (ChIP). For normalization, the values of the genes of interest were divided by the values for H4 (gene expression) or the values of the IP were represented as percent of the Input values (ChIP). The data were then visualized using R as a scatterplot, with the mean and the standard error of the mean (gene expression); each dot on the scatterplot corresponds to one embryo sample generated in two independent experiments) or the mean as a column bar graph with the standard error of the mean (ChIP; each bar corresponds to two values generated in two independent experiments). When indicated, significance was calculated using Mann-Whitney test (equivalent to Wilcoxon rank sum test. R, wilcox.test(alternative = c(“two.sided”), paired = F)). ∗p value < 0.05, ∗∗p value < 0.01, ∗∗∗p value < 0.001.
Principal component analysis and hierarchical transcriptome clustering
After computing CPM (count per million), genes were retained in the analysis if they had CPM > 1 in either all of the Donor or in ≥ 70% of IVF or ≥ 70% of NT embryo samples. The data were subsequently scaled two times using z-score transformation: one scaling has been performed for each batch of experiments (i.e., experiments produced at the same time). Then all batched experiments have been scaled together again to reduce the variability due to the technical batch factor. Data obtained after this step have been used for the unsupervised hierarchical clustering analysis, which was performed by using the Euclidean distance and ward.D linkage as implemented in R. The PCA analysis was performed using the R function prcomp() using the parameter cor = T.x.
ChIP-seq data analysis
Aligned data from H3K4me3 ChIP-seq were used to compute the coverage around TSS (transcriptional start site) for each of the two biological replicates separately. The histone methylation levels was computed as:
where IP is the immunoprecipitation sample, input is the input-control; NIP is the total number of aligned reads in the IP experiment and Ninput is the total number of aligned reads in the input sample.
Specifically, a region of 4kb centered on the TSSs was binned in 50bp-wide windows and the histone methylation level was computed for each bin. The average of the normalized histone methylation levels was computed for each set of genes (ON-memory, ON-reprogrammed, genome-wide (GW)) and then visualized. Differences in histone methylation levels at TSSs between the set of genes were tested with the Kolmogorov-Smirnov test (R, ks.test).
Additionally, the global (integral) histone methylation level in the 4kb window was computed and the distribution across each set of genes was visualized in box-plots and compared using ks.test. For signal tracks, count reads were computed with bedtools (version 2.25.0) genomecov (Quinlan and Hall, 2010).
Processed and normalized bigwig files relative to H3K4me3 in adult normal human dermal fibroblast (NHDF) cells were downloaded from GEO (GEO accession number GSM733650). The same strategy was used to compare the average histone methylation levels of different groups of genes in a region of 2kb around TSS as indicated.
Methylated histone regions
Histone methylated regions (peaks) were called using MACS2 (version 2.0.9) with the following options (–broad–gsize = 2.6e9 -q 0.01) for each immunoprecipitation experiment individually. Resulting peaks overlapping TSSs were used for the subsequent analysis. Peaks sizes distributions were visualized by plotting histograms. The inserts with examples of H3K4me3 regions spanning the TSS of example genes were generated using Integrative Genomics Viewer, IGV (Robinson et al., 2011, Thorvaldsdóttir et al., 2013). KS test was used to evaluate differences between peaks sizes across the different set of previously defined genes. ECDFs (empirical cumulative distribution functions) were computed for all experiments and visualized.
We downloaded peaks list of the human study in NHDF cells (GEO accession number GSM733650). Peak sizes distributions of different set of genes were compared similarly.
Developmental outcome
Statistical significance was calculated using a one tailed t test using R. The data are represented as the mean with the SEM (standard error of the mean).
Data Availability
The RNA-seq and ChIP-seq data generated in this study are: 73 samples, single-ended RNA-seq libraries from neurula stage 18 or 21 endoderm and gastrula stage 11 ectoderm samples; 2 single-ended ChIP-seq libraries from endoderm cells of neurula (stage 21) embryos with antibody for H3K4me3, and 2 replicates for each histone modification pull-down. The accession number for the RNA-seq and ChIP-seq data reported in this paper is GEO: GSE92366.
Author Contributions
Conceptualization: E.H.; Methodology: E.H. and J.J.; Software: A.S., G.E.A., and C.R.B.; Formal Analysis: E.H., A.S., and G.E.A.; Investigation: E.H.; Validation: E.H., M.F., and J.J.; Data Curation: C.R.B., A.S., and G.E.A.; Writing – Original Draft and Visualization: E.H.; Writing – Review and Editing: E.H., J.J., and J.G.; Supervision and Funding acquisition: E.H., J.J., and J.G. J.J. and J.G. contributed equally.
Acknowledgments
This work was funded by the Molecular Research Council (MR/P000479/1), the Wellcome Trust (101050/Z/13/Z and 092096/Z/10/Z), and Cancer Research UK (C6946/A14492). E.H. was a recipient of a long-term fellowship from the European Molecular Biology Organization (EMBO) and Isaac Newton Trust funding. The authors are grateful to Dave Simpson, members of the Gurdon lab and Institute, Julie Ahringer and her lab, and Thomas U. Mayer for support and critically reading the manuscript.
Published: March 30, 2017
Footnotes
Supplemental Information includes four figures and four tables and can be found with this article online at http://dx.doi.org/10.1016/j.stem.2017.03.003.
Contributor Information
Eva Hörmanseder, Email: e.hoermanseder@gurdon.cam.ac.uk.
Jerome Jullien, Email: j.jullien@gurdon.cam.ac.uk.
Supplemental Information
References
- Alexa A., Rahnenführer J., Lengauer T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006;22:1600–1607. doi: 10.1093/bioinformatics/btl140. [DOI] [PubMed] [Google Scholar]
- Benayoun B.A., Pollina E.A., Ucar D., Mahmoudi S., Karra K., Wong E.D., Devarajan K., Daugherty A.C., Kundaje A.B., Mancini E. H3K4me3 breadth is linked to cell identity and transcriptional consistency. Cell. 2014;158:673–688. doi: 10.1016/j.cell.2014.06.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blau H.M., Chiu C.P., Webster C. Cytoplasmic activation of human nuclear genes in stable heterocaryons. Cell. 1983;32:1171–1180. doi: 10.1016/0092-8674(83)90300-8. [DOI] [PubMed] [Google Scholar]
- Blelloch R., Wang Z., Meissner A., Pollard S., Smith A., Jaenisch R. Reprogramming efficiency following somatic cell nuclear transfer is influenced by the differentiation and methylation state of the donor nucleus. Stem Cells. 2006;24:2007–2013. doi: 10.1634/stemcells.2006-0050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boiani M., Eckardt S., Schöler H.R., McLaughlin K.J. Oct4 distribution and level in mouse clones: consequences for pluripotency. Genes Dev. 2002;16:1209–1219. doi: 10.1101/gad.966002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung Y.G., Matoba S., Liu Y., Eum J.H., Lu F., Jiang W., Lee J.E., Sepilian V., Cha K.Y., Lee D.R., Zhang Y. Histone demethylase expression enhances human somatic cell nuclear transfer efficiency and promotes derivation of pluripotent stem cells. Cell Stem Cell. 2015;17:758–766. doi: 10.1016/j.stem.2015.10.001. [DOI] [PubMed] [Google Scholar]
- Clouaire T., Webb S., Skene P., Illingworth R., Kerr A., Andrews R., Lee J.H., Skalnik D., Bird A. Cfp1 integrates both CpG content and gene activity for accurate H3K4me3 deposition in embryonic stem cells. Genes Dev. 2012;26:1714–1728. doi: 10.1101/gad.194209.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Consortium E.P., ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dahl J.A., Jung I., Aanes H., Greggains G.D., Manaf A., Lerdrup M., Li G., Kuan S., Li B., Lee A.Y. Broad histone H3K4me3 domains in mouse oocytes modulate maternal-to-zygotic transition. Nature. 2016;537:548–552. doi: 10.1038/nature19360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Enright B.P., Kubota C., Yang X., Tian X.C. Epigenetic characteristics and development of embryos cloned from donor cells treated by trichostatin A or 5-aza-2′-deoxycytidine. Biol. Reprod. 2003;69:896–901. doi: 10.1095/biolreprod.103.017954. [DOI] [PubMed] [Google Scholar]
- Firas J., Liu X., Polo J.M. Epigenetic memory in somatic cell nuclear transfer and induced pluripotency: evidence and implications. Differentiation. 2014;88:29–32. doi: 10.1016/j.diff.2014.09.001. [DOI] [PubMed] [Google Scholar]
- Gao S., Chung Y.G., Williams J.W., Riley J., Moley K., Latham K.E. Somatic cell-like features of cloned mouse embryos prepared with cultured myoblast nuclei. Biol. Reprod. 2003;69:48–56. doi: 10.1095/biolreprod.102.014522. [DOI] [PubMed] [Google Scholar]
- Gentsch G.E., Smith J.C. Investigating physical chromatin associations across the Xenopus genome by chromatin immunoprecipitation. Cold Spring Harb. Protoc. 2014;2014:2014. doi: 10.1101/pdb.prot080614. [DOI] [PubMed] [Google Scholar]
- Gurdon J.B. The developmental capacity of nuclei taken from differentiating endoderm cells of Xenopus laevis. J. Embryol. Exp. Morphol. 1960;8:505–526. [PubMed] [Google Scholar]
- Gurdon J.B., Elsdale T.R., Fischberg M. Sexually mature individuals of Xenopus laevis from the transplantation of single somatic nuclei. Nature. 1958;182:64–65. doi: 10.1038/182064a0. [DOI] [PubMed] [Google Scholar]
- Hirasawa R., Matoba S., Inoue K., Ogura A. Somatic donor cell type correlates with embryonic, but not extra-embryonic, gene expression in postimplantation cloned embryos. PLoS ONE. 2013;8:e76422. doi: 10.1371/journal.pone.0076422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hörmanseder E., Tischer T., Mayer T.U. Modulation of cell cycle control during oocyte-to-embryo transitions. EMBO J. 2013;32:2191–2203. doi: 10.1038/emboj.2013.164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim K., Zhao R., Doi A., Ng K., Unternaehrer J., Cahan P., Huo H., Loh Y.H., Aryee M.J., Lensch M.W. Donor cell type can influence the epigenome and differentiation potential of human induced pluripotent stem cells. Nat. Biotechnol. 2011;29:1117–1119. doi: 10.1038/nbt.2052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kishigami S., Mizutani E., Ohta H., Hikichi T., Thuan N.V., Wakayama S., Bui H.T., Wakayama T. Significant improvement of mouse cloning technique by treatment with trichostatin A after somatic nuclear transfer. Biochem. Biophys. Res. Commun. 2006;340:183–189. doi: 10.1016/j.bbrc.2005.11.164. [DOI] [PubMed] [Google Scholar]
- Lewis P.W., Müller M.M., Koletsky M.S., Cordero F., Lin S., Banaszynski L.A., Garcia B.A., Muir T.W., Becher O.J., Allis C.D. Inhibition of PRC2 activity by a gain-of-function H3 mutation found in pediatric glioblastoma. Science. 2013;340:857–861. doi: 10.1126/science.1232245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., 1000 Genome Project Data Processing Subgroup The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu W., Liu X., Wang C., Gao Y., Gao R., Kou X., Zhao Y., Li J., Wu Y., Xiu W. Identification of key factors conquering developmental arrest of somatic cell cloned embryos by combining embryo biopsy and single-cell sequencing. Cell Discov. 2016;2:16010. doi: 10.1038/celldisc.2016.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. 2011;17:10–12. [Google Scholar]
- Matoba S., Liu Y., Lu F., Iwabuchi K.A., Shen L., Inoue A., Zhang Y. Embryonic development following somatic cell nuclear transfer impeded by persisting histone methylation. Cell. 2014;159:884–895. doi: 10.1016/j.cell.2014.09.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meissner A., Jaenisch R. Mammalian nuclear transfer. Dev. Dyn. 2006;235:2460–2469. doi: 10.1002/dvdy.20915. [DOI] [PubMed] [Google Scholar]
- Muramoto T., Müller I., Thomas G., Melvin A., Chubb J.R. Methylation of H3K4 is required for inheritance of active transcriptional states. Curr. Biol. 2010;20:397–406. doi: 10.1016/j.cub.2010.01.017. [DOI] [PubMed] [Google Scholar]
- Ng R.K., Gurdon J.B. Epigenetic memory of active gene transcription is inherited through somatic cell nuclear transfer. Proc. Natl. Acad. Sci. USA. 2005;102:1957–1962. doi: 10.1073/pnas.0409813102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nieuwkoop P.D., Faber J. Garland Pub.; 1994. Normal Table of Xenopus laevis (Daudin): A Systematical and Chronological Survey of the Development from the Fertilized Egg till the End of Metamorphosis. [Google Scholar]
- Polo J.M., Liu S., Figueroa M.E., Kulalert W., Eminli S., Tan K.Y., Apostolou E., Stadtfeld M., Li Y., Shioda T. Cell type of origin influences the molecular and functional properties of mouse induced pluripotent stem cells. Nat. Biotechnol. 2010;28:848–855. doi: 10.1038/nbt.1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Probst A.V., Dunleavy E., Almouzni G. Epigenetic inheritance during the cell cycle. Nat. Rev. Mol. Cell Biol. 2009;10:192–206. doi: 10.1038/nrm2640. [DOI] [PubMed] [Google Scholar]
- Quinlan A.R., Hall I.M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson M.D., McCarthy D.J., Smyth G.K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson J.T., Thorvaldsdóttir H., Winckler W., Guttman M., Lander E.S., Getz G., Mesirov J.P. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santos-Rosa H., Schneider R., Bannister A.J., Sherriff J., Bernstein B.E., Emre N.C., Schreiber S.L., Mellor J., Kouzarides T. Active genes are tri-methylated at K4 of histone H3. Nature. 2002;419:407–411. doi: 10.1038/nature01080. [DOI] [PubMed] [Google Scholar]
- Takahashi K., Yamanaka S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell. 2006;126:663–676. doi: 10.1016/j.cell.2006.07.024. [DOI] [PubMed] [Google Scholar]
- Teperek M., Simeone A., Gaggioli V., Miyamoto K., Allen G.E., Erkek S., Kwon T., Marcotte E.M., Zegerman P., Bradshaw C.R. Sperm is epigenetically programmed to regulate gene transcription in embryos. Genome Res. 2016;26:1034–1046. doi: 10.1101/gr.201541.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thorvaldsdóttir H., Robinson J.T., Mesirov J.P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 2013;14:178–192. doi: 10.1093/bib/bbs017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trapnell C., Pachter L., Salzberg S.L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25:1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The RNA-seq and ChIP-seq data generated in this study are: 73 samples, single-ended RNA-seq libraries from neurula stage 18 or 21 endoderm and gastrula stage 11 ectoderm samples; 2 single-ended ChIP-seq libraries from endoderm cells of neurula (stage 21) embryos with antibody for H3K4me3, and 2 replicates for each histone modification pull-down. The accession number for the RNA-seq and ChIP-seq data reported in this paper is GEO: GSE92366.