Abstract
Human embryonic stem cells (hESCs) undergo epigenetic changes in vitro which may compromise function, so an epigenetic pluripotency “signature” would be invaluable for line validation. We assessed Cytosine-phosphate-Guanine Island (CGI) methylation in hESCs by genomic DNA hybridisation to a CGI array, and saw substantial variation in CGI methylation between lines. Comparison of hESC CGI methylation profiles to corresponding somatic tissue data and hESC mRNA expression profiles identified a conserved hESC-specific methylation pattern associated with expressed genes. Transcriptional repressors and activators were over-represented amongst genes whose associated CGIs were methylated or unmethylated specifically in hESCs, respectively. Knockdown of candidate transcriptional regulators (HMGA1, GLIS2, PFDN5) induced differentiation in hESCs, whereas ectopic expression in fibroblasts modulated iPSC colony formation. Chromatin immunoprecipitation confirmed interaction between the candidates and the core pluripotency transcription factor network. We thus identify novel pluripotency genes on the basis of a conserved and distinct epigenetic configuration in human stem cells.
Introduction
The application of human embryonic stem cells (hESCs) to regenerative medicine relies on maintaining appropriate gene expression controlling self renewal or lineage specification in vitro. Epigenetic modifications of DNA and chromatin control the expression patterns that define cellular identity and function during development and in differentiated tissues [1]. Methylation of cytosine in cytosine-phosphate-guanine (CpG) dinucleotides is an epigenetic mark conferring stability on gene expression states, notably by the establishment of a silent chromatin state [2]. In normal sequence, where CpGs are relatively infrequent (~ 1 per 100 bp), most CpGs are methylated, but, in Cytosine-phosphate-Guanine Islands (CGIs), where CpGs typically comprise about 1 in 10 bp for sequences ~1 to 2 kb in length, they are usually unmethylated. About 70% of CGIs are associated with gene promoters [3]. While CGIs are normally unmethylated, exceptions include those associated with imprinted genes [4], genes subject to X-chromosome inactivation [5], and transposable elements [6, 7].
Genetic changes are common in hESCs as a cell line adapts to culture [8, 9], and hESCs are also neither epigenetically homogenous nor stable in vitro. Gene promoter methylation and X-inactivation states vary between cell lines and even between different cultures of the same line [10] [11–17]. The same is true for cells induced into pluripotency, which can retain a residual epigenetic memory of their origin [18–22]. As not all epigenetic variations affect cell behaviour, it is important to identify those relevant to a pluripotent phenotype.
DNA methylation in hESCs has been studied by various methods [12, 23] [24] [25]. We have used a human CGI array (> 17,000 CGIs) to identify CGIs that are either methylated or unmethylated in hESCs, and identify CGI methylation patterns conserved between hESC lines and distinct from differentiated tissues assessed previously [26]. These conserved patterns define putative biomarkers of the pluripotent state at an epigenetic level; that is, CGIs whose methylation status is apparently unique to hESCs. Functional roles for selected candidates in regulation of stem cell phenotype were confirmed by small RNA-mediated interference and modulation of somatic cell reprogramming frequency. We propose a model whereby transcriptional communication between “secondary” pluripotency-associated factors such as the epigenetically-defined biomarkers described here and the core pluripotency network ensures that expression of both groups of factors is achieved in pluripotent cells and modulated precisely in differentiating cells.
Materials and Methods
Human embryonic stem cell lines
HESC lines were derived under license from the UK Human Fertilisation and Embryology Authority (R0136). Their identity, provenance and culture conditions are summarised in table A in S1 File and detailed previously [27, 28].
CGI analysis
For each hESC line, gDNA was prepared from 2 biological replicates (RCM1) or 4 biological replicates (RH1, 3 and 4) of 105 cells per replicate. gDNA was purified, digested and ligated to "catch linkers" prior to MBD2 column binding, elution and array hybridisation, all as described previously [26].
mRNA expression analysis
Expression analysis was performed on samples concurrent with those used to evaluate the CGI methylation of gDNA. RNA was prepared and array data analysed as described [27].
Annotation from Gene Ontology was processed through the Bioconductor package GO.db. Statistical significance for overrepresentation of expressed sets of genes within GO categories were derived through the hypergeometric test (equivalent to Fisher´s exact test). P-values were adjusted for multiple testing with the Benjamini-Hochberg procedure.
Screening of candidate gene function by RNA interference
Genes were knocked down in hESCs by transfection of siRNA molecules with RNAiMAX Lipofectamine (Invitrogen) (S8 Fig). SiRNA sequences are shown in Table M in S1 File.
RT-qPCR Analysis
Gene expression analyses were performed as described [29]. Table O in S1 File gives primer sequences.
Immunocytochemistry
For OCT4 and NANOG immunohistochemistry, hESCs were fixed and immunostained as described [30]. Immunostaining for methylated and hydroxymethylated DNA used a protocol adapted from that previously reported [31] to permit costaining with DAPI and mC or 5-hmC visualisation.
5-hmC ELISA
The 5-hmC content of genomic DNA was measured by a hydroxymethylated DNA quantification kit (Quest 5-hmC DNA ELISA Kit, ZYMO Research) according to manufacturer’s protocol.
Chromatin Immunoprecipitation
HESC chromatin was prepared using the ChIP-IT High Sensitivity kit (Active Motif), with approximately 50,000 cell equivalents and 4 μg antibody to OCT4 (Santa Cruz SC-8628) was used per reaction. Controls and data from the ChIP-IT qPCR analysis kit (Active Motif) were used to create a standard curve and calculate the number of binding events per 1000 cells in each sample. Primer sequences for the promoter regions of HMGA1, GLIS2 and PFDN5 incorporating OCT4 binding sites are shown in Table P in S1 File.
Reprogramming of Human Dermal Fibroblasts
10 independent transfections were carried out for each condition to ensure a statistically powerful experiment. Briefly, 2x105 HDFs (≤ passage 8; Cascade Biologics C0045C) were harvested by trypsinisation and transfected in a 20 μl reaction containing 0.04 pmol of each plasmid (pCXLE-OCT4sh53, pCXLE-UL, pCXLE-SK [32] and pCXLE-GW-HMGA1, pCXLE-GW-GLIS2 and/or pCXLE-GW-PFDN5 (S13 Fig) using a Nucleofector X Unit running program EN150 and nucleofection solution P2 (Lonza). Cells were plated out and cultured in fibroblast medium (KO-DMEM, 10% FCS, L-gln, penicillin & streptomycin) for 5–8 days, then replated on Matrigel and cultured in mTeSR1 medium until colonies appeared. Plates were fixed and stained for alkaline phosphatase activity using the Stemgent AP2 kit according to the manufacturer’s instructions.
Statistical Analyses
Statistical analyses were performed in Graphpad Prism, using unpaired t-tests or ANOVA, as appropriate.
Results
CGI Array Hybridisation of hESC MAP-gDNA
As a pilot study of the significance of CGI methylation to pluripotency, 4 hESC lines, differing in provenance, sex, passage number and culture conditions were assessed to identify a conserved pattern of CGI methylation status (Table A in S1 File; summary of hESC lines employed in this study). Three of these lines (RH1, RH3 and RCM1) were female, and one (RH4) was male. To investigate hESC line CGI methylation status, we probed an array of biologically-defined CGIs used previously to study tissue-specific methylation[26] (S3 Table). A CGI was designated as gene-associated (GA-CGI) if it mapped to within 1.5 kb of an annotated gene or overlapped with the gene itself. Thus of 17,387 CGIs on the array, 13,657 were gene-associated (78.5%; S3 Table).
Control (unpurified) input gDNA comparisons between lines gave similar levels of CGI fragment: probe hybridisation for the autosomes (within 0.5 log2), as expected for euploid human cell lines (S2 Fig). When RH4 (male) was compared with female lines, the X chromosome showed significantly weaker hybridisation in RH4 whereas the Y chromosome showed stronger hybridisation (S2 Fig). This difference in DNA hybridisation signal is expected for the sex chromosomes from lines of different sexes, and confirmed the CGI array system resolution to within 2-fold. To identify methylated CGIs, arrays were probed with MBD2 methyl-binding domain column-purified genomic DNA (MAP-gDNA), with hybridisation signal compared against total gDNA (See S1 Fig for schematic overview). A CGI was designated as methylated (Me-CGI) when the M value (log2[MAP-gDNA/Total gDNA]) was ≥ 1.5 and the adjusted p value ≤ 0.1; otherwise it was designated as unmethylated. The complete CGI methylation dataset is available (S5 Table). Examples of CGI methylation or lack thereof in different cell lines identified by the CGI array were confirmed by sequencing of fragments amplified from bisulphite-treated gDNA (S3 Fig).
CGI Methylation of Human Embryonic Stem Cells
HESC MAP-gDNA hybridisation data showed that 12–16% of CGIs were methylated in hESCs, depending on cell line (2119 Me-CGIs in RH4 to 2717 in RH3). For consistency, somatic tissue data reported previously[26] were reanalysed in parallel. Similar proportions of CGIs were methylated in somatic tissues as in hESCs, varying from 10–14% (1785 Me-CGIs in male blood to 2546 Me-CGIs in muscle [Fig 1A; corresponding numbers and proportions of unmethylated CGIs are listed in table Q in S1 File]). Overall CGI methylation levels are thus similar in hESCs and somatic tissues. There was no significant difference in CGI methylation rates between hESC lines and somatic tissues for CGIs generally, or for gene-associated CGIs specifically (P = 0.142 in both cases; Kruskall-Wallis).
CGI methylation varied between hESC lines. Comparison of Me-CGIs between the female lines RH1, RH3 and RCM1 revealed heterogeneity between pairs of lines (S4 Fig). There were frequent differences greater than 0.5-fold throughout the genome between RH1 and RCM1, and between RH3 and RCM1 (S4A and S4B Fig). Individual CGIs were either hypermethylated (red, M>1.5; difference between lines >0.75) or hypomethylated (blue, M<1.5; difference between lines >0.75) in RH1 and RH3 cf. RCM1, rather than one cell line in a comparison being consistently hypo- or hypermethylated with respect to the other. However, the sibling lines RH1 and RH3 showed similar levels of autosomal CGI methylation (S4C Fig). In the case of the X chromosome, RH1 CGI methylation was generally higher than that of RH3 (S4C and S4D Fig), corresponding to methylation of 137 X-linked GA-CGIs in RH1, compared to 39 for RH3, similar to that observed for the male line RH4 (33 CGIs). The female line RCM1 had 138 X-linked Me-GA-CGIs, similar to RH1 (Table B in S1 File; methylation of X-linked gene-associated CGIs in hESC lines). RH3 and RH4 (male) have similar levels of X chromosomal and autosomal CGI methylation (Table B in S1 File, no significant difference in X-linked Me-CGIs from expected for RH3 or RH4). X-linked CGI methylation for RH1 and RCM1 was significantly higher than expected (P<<0.001 for both lines; Χ2 = 203.3 [RH1] or 265.8 [RCM1]).
Despite the heterogeneity of hESC CGI methylation, correlation of Me-CGI lists for different lines identified 1111 CGIs (40.8%–52.4% Me-CGIs, depending on cell line) methylated in all hESC lines tested (Fig 1B and 1C and S1 Table and S2 Table) 311 of these CGIs are also methylated in somatic tissues. The set of 1111 CGIs was reduced to 1079 by removal of 32 CGIs for which there was more than one reporter which behaved differently in different lines (Table C in S1 File). 828 CGIs methylated in all hESC lines are gene-associated, significantly fewer than the expected proportion (78.5%) of methylated gene-associated CGIs (Me-GA-CGIs, 873 expected, Χ2 = 10.4, P = 0.0012). These 828 Me-GA-CGIs are associated with 891 genes (S2 Table). No localisation of CGI methylation to particular chromosomes was apparent (Fig 1C and 1D, P = 0.371, Χ2 = 24.6). Comparison of CGI sets that were uniformly methylated or unmethylated in hESCs with somatic tissues identified two subsets whose methylation patterns were specific to hESCs. 201 GA-CGIs (associated with 220 genes, reduced to 216 as 4 genes were associated with multiple, differentially methylated, CGIs) were methylated in hESCs and unmethylated in adult tissues; conversely, 98 CGIs (associated with 113 genes) were unmethylated in hESCs and methylated in somatic tissues (Fig 1D and 1E and Table D in S1 File). As with the Me-CGIs, genes with a CGI methylated in hESCs and unmethylated in somatic tissues showed no significant chromosomal localisation (hESC-Me-GA-CGIs, Χ2 = 22.6, P = 0.244), nor did genes with an hESC-unmethylated/somatic methylated CGI (hESC-UnMe-GA-CGIs, Χ2 = 25.199, P = 0.288).
Transcriptome Analysis of hESCs
RNA transcriptome data were obtained for the three female hESC lines RH1, RH3 and RCM1. RH4 data were not included as RH4 is male and thus hemizygous for X, unaffected by X chromosome inactivation but expressing Y-linked genes.
Triplicate RNA samples from independent biological replicates concurrent with those used to prepare gDNA for CGI methylation analysis were hybridised to Affymetrix U133Plus2 GeneChips. Similar expression distributions were seen for all arrays. Arrays clustered to individual cell lines (S5 Fig), indicating high reproducibility of microarray data. Whilst overall expression patterns were similar for the three lines (Pearson correlation coefficient r>0.99), X-linked expression for RH3 was consistently higher than RH1 and RCM1, which were similar to each other (S6 Fig). The complete transcriptome dataset is available as supplementary S6 Table.
Correlation of CGI Status with Expression
Correlation of expression data with CGI methylation data defined two smaller gene sets. Of 216 genes associated with 201 hESC-Me-GA-CGIs (1.47% of GA-CGIs), 128 (59.3%) were expressed in all three lines (Fig 1F, and Table E and Table F in S1 File). Similarly, of 109 genes associated with 98 hESC-UnMe-GA-CGIs (0.72% GA-CGIs), 56 (57.1%) were expressed in all three lines (Fig 1G, and Table E and Table G in S1 File). As with CGI methylation generally, no localisation of expressed genes with a methylated CGI was apparent (Fig 1F; expressed hESC-Me-GA-CGIs, Χ2 = 25.694, P = 0.265). However, distribution of expressed genes with an associated hESC-unmethylated CGI was apparently non-random (Fig 1G; expressed hESC-UnMe-GA-CGIs, Χ2 = 33.886, P = 0.05), due to enrichment of chromosome 16 genes (9 observed; 2.5 expected, Χ2 = 18.298, P = 1.9x10-5).
There was a significant correlation in only three cases of differential methylation and differential expression between a pair of hESC lines (RH1, RH3, RCM1; Table H in S1 File, differential CGI methylation and gene expression in female hESC lines). These were genes whose associated CGI was methylated in RH1 but unmethylated in RH3 and that were significantly more highly-expressed in RH3, genes whose associated CGI was methylated in RCM1, unmethylated in RH1 and that were more highly expressed in RH1, and genes whose associated CGI was methylated in RCM1, unmethylated in RH3 and that were more highly expressed in RH3. Thus the methylation and expression states were consistent with the established association of DNA methylation with gene silencing. Of 80 genes whose associated CGI was methylated in RH1 but unmethylated in RH3 that were expressed in RH3 but not RH1, 50 were X-linked, a very significant result (Table I in S1 File, association of X chromosome with differential methylation and expression in female hESC lines). Similarly, X-linked GA-CGIs differentially methylated between female lines and more highly expressed in the unmethylated partner were significantly overrepresented in the RCM1: RH1 and RCM1: RH3 comparisons. (Table I in S1 File and S6 Fig).
Gene Ontology Identifies Candidate Epigenetic Biomarkers of hESCs
Correlation of Me-GA-CGIs and UnMe-GA-CGIs (Table D in S1 File) with transcriptome data identified expressed genes with an hESC-specific CGI methylation state (Tables E-G in S1 File). Of 109 gene loci with an hESC-UnMe-GA-CGI, 78 genes with non-redundant Entrez IDs were expressed. Similarly, of 216 gene loci with an hESC-Me-GA-CGI, 169 unique genes were expressed. The subset of CGI-associated genes may be enriched for functional categories, so we first analysed the distribution of genes associated with CGIs on the array, and identified 11357 non-redundant Entrez IDs, about 2/3 (64.3%) of the genes annotated in Gene Ontology (17673 genes). CGI-associated genes were enriched (e.g. transcription factor, developmental process) or depleted (egg. receptors, signal transducers) for various categories (S7 Fig and Table J in S1 File); thus the functional analysis of expressed hESC-methylated or unmethylated genes was compared with the set of CGI-associated genes on the array. These hESC-specific gene sets were tested for enrichment in Gene Ontology categories relative to the proportion expected for CGI-associated genes. Transcriptional activators (GO:0016563) are significantly overrepresented among genes with an hESC-unmethylated CGI (GO:0016563, P < 0.01, FDR = 0.01; Table K in S1 File); similarly related GO categories including transcriptional regulator activity (GO:0030528), transcription factor binding (GO:0008134), DNA binding (GO:0003677) and sequence-specific DNA binding (GO:0043565) are also overrepresented (Fig 2A). Only two genes associated with transcriptional repressor activity (MSX1 and TBX3) have an hESC-UnMe-CGI. For genes with an associated hESC-Me-CGI, only two categories were enriched (FDR < 0.25): phosphoinositide binding (GO: 0035091) and transcription repressor activity (GO: 0016564); P < 0.001, FDR = 0.137 in both cases; Fig 2A).
We found no association between hESC-specific CGI methylation and genetic imprinting. With respect to the set of hESC-expressed genes whose associated CGI is methylated in hESCs and unmethylated in somatic tissues, there are no imprinted genes included. In the set of hESC-expressed genes whose associated CGI is unmethylated in hESCs and methylated in somatic tissues, there are two imprinted genes, GNAS and SLC22A3, but SLC22A3 is known to be imprinted in a limited fashion, being only monoallelically expressed in the placenta during the first trimester [33]. Comparing our epigenetic biomarkers dataset with the 231 known imprinted genes as a proportion of the most recent estimate of ~19,000 human genes [34], there was thus no significant overlap (χ2 = 0.692, P = 0.405, 1 DF).
Functional Significance of Epigenetically-Defined hESC Biomarkers
We tested the functional role of three candidate epigenetically-defined hESC biomarkers: the transcriptional activators GLIS2 and HMGA1, and the repressor PFDN5, as all are expressed significantly over background (Table L in S1 File, microarray expression data for GLIS2, HMGA1 and PFDN5). Small Interfering RNA (siRNA) transfection conditions in hESC lines were optimised (S8 Fig). RH1 was selected to represent the cell lines in this study, with results independently confirmed in H9 one of the most commonly-studied hESC lines. HESCs were transfected with Lipofectamine RNAiMAX and siRNA oligonucleotides (Table M in S1 File, sequences of siRNA oligonucleotides used in this study) twice, 24 hours apart, and samples were taken at 48 and 96 hours after the first transfection. An siRNA targeting no human transcript (IDS-NULL, directed against IDS but containing 4 point mutations) was used as a negative control for sample normalisation from the same cell line and time point. An siRNA directed against OCT4 was used as a positive control for effects on pluripotency, and an siRNA directed against YAP1 to control for responses to knockdown of an hESC-expressed gene unnecessary for pluripotency[35]. Response to knockdown of all three genes was similar (RH1, Figs 2–5 and H9, S9–S12 Figs). Target transcript knockdown was significant and strong (typically > 90%) by 24 hours post-siRNA treatment 2 (Fig 2B and S9A Fig). Despite rapid (24 hr.) downregulation of YAP1 to ~10% normal levels, we saw no effect on OCT4, NANOG or SOX2 in RH1 (Fig 2Bi) and minimal effects in H9 (S9Ai Fig). However OCT4 knockdown was accompanied by significant reductions in NANOG and SOX2 (Fig 2Bii and S9Aii Fig). These controls show that our siRNA knockdown system was (1) not “swamping” the cellular RNA degradation system, inducing non-specific effects that perturbed normal phenotype, (2) efficient, producing a measurable target knockdown, and (3) yielded expected effects when targeting genes of known function.
HMGA1, GLIS2 and PFDN5 were significantly downregulated by siRNA treatment (Fig 2Biii-v and S9Aiii-v Fig). In RH1 cells HMGA1 and PFDN5 expression remained downregulated (~10% normal transcript levels) at 72 hours, but GLIS2 transcripts recovered to approximately normal levels by this time (Fig 2Biii-v and S9Aiii-v Fig). In H9 cells, GLIS2 and PFDN5 knockdowns were very efficient, HMGA1 less so, appearing to recover and "overshoot" normal transcript levels by 72 hrs. (S9Aiii-v Fig). Whenever GLIS2, HMGA1 or PFDN5 expression was knocked down, OCT4, NANOG and SOX2 were also downregulated with respect to controls (Fig 2Aiii-v and S9Aiii-v Fig).
Immunostaining of anti-OCT4 siRNA-treated hESCs showed no OCT4 signal, as expected, or for NANOG, consistent with RT-qPCR data showing NANOG falling on OCT4 knockdown (Fig 2C and S9B Fig). Both, however, were readily detectable in cells treated with either anti-YAP1 siRNA or IDS-NULL (Fig 2C and S9B Fig). Anti-HMGA1,-GLIS2 and-PFDN5 siRNA-treated hESCs showed no detectable OCT4 signal or, in the cases of GLIS2 and HMGA1, NANOG signal. Anti-PFDN5 siRNA-treated hESCs did show a detectable NANOG signal, but noticeably weaker than negative controls (Fig 2C and S9B Fig).
OCT4, NANOG and SOX2 downregulation induced by siRNA to OCT4 or HMGA1, GLIS2, or PFDN5 was accompanied by morphology changes consistent with differentiation. In contrast, siRNAs anti-YAP1 and IDS-NULL induced no such changes (Fig 3 and S10 Fig). As stem cell differentiation is accompanied by loss of genomic DNA hydroxymethylation (Ruzov et al., 2011) we evaluated effects on this epigenetic mark and on expression of TET dioxygenases (TET1, 2, 3). Compared with anti-YAP1 and IDS-NULL, knockdown of OCT4, HMGA1, GLIS2 and PFDN5 reduced genomic levels of 5-hmC in both lines, as shown by quantitative ELISA (Fig 4B and S11B Fig; P<0.001). Consistent with these results, the immunofluorescence-detectable signal for 5-hmC in both cell lines was attenuated (Fig 4A and S11A Fig). This reduction in 5-hmC levels could not be attributed to a reduction in TET expression as transcript levels were either unchanged or only modestly so (< 2-fold; Fig 4C and S11C Fig). In some cases, most notably with the RH1 cell line (Fig 4), there was also some apparent reduction in global mC levels detectable by immunofluorescence, but it should be noted that the same effect was observed with OCT4 as well as with HMGA1, GLIS2 and PFDN5.
To determine whether GLIS2, HMGA1 and PFDN5 knockdown resulted in undirected differentiation or biased lineage selection, we quantified lineage-associated markers by RT-qPCR. In RH1 cells, OCT4 downregulation caused small reductions of most lineage markers examined, except upregulation of the mesodermal marker Brachyury and the ectodermal marker NF200 (Fig 5). H9 cells responded differently, upregulating AFP and other trophoblast and endodermal markers including CDX2, CGα and GATA4, consistent with previous data (S12 Fig, [36]). GLIS2 downregulation induced changes in several genes, including upregulation of the endodermal markers α-fetoprotein (AFP), albumin and either HNF4α (RH1) or GATA6 (H9) (Fig 5 and S12 Fig). Trophoblast markers were also upregulated in both lines on GLIS2 knockdown (Fig 5 and S12 Fig). While AFP was also upregulated in cells of both lines knocked down for HMGA1, most endodermal and trophoblast markers other than CGα (upregulated) were downregulated. PFDN5 knockdown also induced AFP upregulation but, as with HMGA1, other endodermal markers tended to be downregulated or unchanged. In RH1 cells, PFDN5 also resulted in upregulation of HAND1 (expressed by multiple early lineages) and Brachyury (mesendoderm), but H9 differed; here both were downregulated. Overall therefore, there were similarities in response, but even the OCT4 knockdown response was not uniform between lines. GLIS2 knockdown gave the most consistent response between cell lines.
As perturbation of either OCT4 or HMGA1, GLIS2 or PFDN5 (all transcriptional regulators) results in loss of stem cell phenotype, and biomarker knockdown also results in downregulation of OCT4, NANOG and SOX2, a transcriptional connection between the core pluripotency factors and biomarkers is possible. GLIS2, HMGA1 and PFDN5 all apparently contain OCT4 binding sites in their promoters (Fig 6A–6C; [37, 38] R. Young, Whitehead Institute, Cambridge, MA [unpublished dataset], and [25], and also binding sites for other transcription factors which interact with OCT4, NANOG and SOX2, including EOMES, TRIM28, E2F1, CDX2 and ETS1.
OCT4 knockdown in hESCs perturbed GLIS2, HMGA1 and PFDN5 expression (Fig 6D), but although statistically significant, the mRNA changes were smaller than those seen for OCT4, NANOG and SOX2 in response to GLIS2, HMGA1 or PFDN5 knockdown, where the pluripotency factors were rapidly and completely repressed. OCT4 knockdown downregulated GLIS2 by ~2-fold, and upregulated HMGA1 and PFDN5 by ~2.5-fold in both lines (Fig 6D). To confirm the proposed link between the epigenetic biomarkers and core pluripotency network, we performed ChIP-qPCR for transcription factor promoter binding in hESCs (Fig 6E). We used Matinspector (Genomatix software; [39]) to predict OCT4, SOX2 and NANOG binding sites in the promoters of HMGA1, PFDN5 and GLIS2, and similarly HMGA1 and GLIS2 binding sites in the promoters of OCT4, SOX2 and NANOG. Amplification of HMGA1, GLIS2 and PFDN5 promoter regions from cross-linked, sheared chromatin immunoprecipitated with an antibody to OCT4 indicated that OCT4 is DNA-bound at the predicted loci in hESCs (Fig 6E). We were unable to identify antibodies for HMGA1 and GLIS2 suitable for chromatin-immunoprecipitation to confirm their binding to the promoters of OCT4, SOX2 and NANOG.
Ectopic Expression of Epigenetic Biomarkers in Differentiated Cells
Unlike the pluripotency factors OCT4 and NANOG, expression of HMGA1, GLIS2 and PFDN5 is not limited to stem cells. To identify a role in conferring pluripotency as well as its maintenance, we reprogrammed human dermal fibroblasts by transfection with episomal plasmids expressing OCT4, KLF4, SOX2, L-MYC, LIN28 and a short hairpin RNA directed against p53 [32], and added similar plasmids expressing HMGA1, GLIS2 or PFDN5 (S13 Fig). In 4 experiments supplementing the basic reprogramming set with HMGA1, GLIS2, or HMGA1, GLIS2 and PFDN5 together, the epigenetically-identified factors either had no significant effect on the number of colonies obtained that were positive for the early pluripotency reprogramming and stem cell marker alkaline phosphatase, or reduced it. This effect was more noticeable with GLIS2 than HMGA1. PFDN5 however, apparently increased AP+ colonies, either significantly (Fig 6F) or not where greater variability in colony numbers between treatment replicates was seen, but the general trend was seen in all experiments (Fig 6F and S14 Fig).
Discussion
We have defined a CGI methylation map specific to hESCs. Whilst overall CGI methylation in hESCs is similar to adult tissues, gene-associated CGI methylation is reduced, consistent with the view that pluripotent cells possess an open chromatin structure permissive of gene expression (Fussner et al., 2010; Gaspar-Maia et al., 2011; Meshorer et al., 2006). Gene association of CGIs (within 1.5kb of or overlapping an annotated gene) included “orphan” CGIs of uncertain significance. Such CGIs may indicate novel promoters for alternative transcripts or non-coding RNAs regulating gene expression [40, 41]. Expression of a gene associated with an Me-CGI could reflect suppression of an alternative transcript or non-coding RNA, detailed sequence based maps for which now exist for hESCs [42, 43].
As in other hESC DNA methylation studies [14–17, 44], we saw substantial variation in CGI methylation between lines, probably related to differences in line provenance, cultivation method and passage number. This variation was particularly evident on the X-chromosome, probably reflecting the 3 classes of X inactivation status seen in hESC lines (See supplementary discussion A in S1 Document).
Cross-referencing of the CGI methylation map to the transcriptome yielded a panel of 184 hESC-expressed genes as putative epigenetically-defined biomarkers of a pluripotent phenotype (S15 Fig). Expressed genes whose associated CGI was methylated in hESCs were unbiasedly distributed throughout the genome, but those whose CGI was unmethylated were over-represented on chromosome 16. These included GLIS2 (16p13.3), and Cadherin Type 1 (16q22.1), a recognised pluripotency factor [45], and seven other genes implicated in signal transduction and chromatin remodelling which have not been assigned roles in pluripotency.
HESC epigenetically-defined biomarkers were significantly enriched for transcriptional control functions, generally transcriptional activators for those associated with UnMe-CGIs, and repressors for those associated with Me-CGIs. We chose the transcriptional activators GLIS2 and HMGA1, and the repressor PFDN5 as candidates to test for a role in pluripotency. GLIS2 encodes a zinc finger transcription factor which interacts with p120 catenin (a member of the pluripotency-associated WNT pathway); it is expressed modestly in hESCs (Table L in S1 File). The GLIS2 homologue GLIS1 promotes reprogramming of fibroblasts to iPS cells [46]. HMGA1 has been implicated in tumour development [47] and recently, but unknown at the time our study was initiated, as a reprogramming factor [48]. It is known to interact with molecules influencing hESC phenotype including Lin28, WNT, MYC, STAT3 and GSK3 and its transcription is modulated by KLF4 and OCT family members. PFDN5 is a molecular chaperone highly expressed in hESCs (Table L in S1 File) and interacts with both MYC and WNT, genes with important roles in growth and self renewal of ECSs [49]. Collectively, these interactions are consistent with our findings that interference with these genes initiates differentiation. Downregulation of candidate biomarkers resulted in variable effects on the expression of specific lineage-associated markers. GLIS2 is associated with both neural (ectodermal) and kidney (mesodermal) development [50]. Downregulation of GLIS2 upregulated multiple markers of endodermal and extraembryonic lineages, consistent with these roles. HMGA1 and PFDN5 knockdown also upregulated endoderm-associated AFP, but effects on other endodermal and extraembryonic markers were less pronounced or inconsistent. However, we only assessed lineage marker abundance at one timepoint, and differences may be due to variation in interference kinetics or cell response, perhaps in turn related to epigenomic differences between lines.
Interaction of Epigenetic Biomarkers with the Core Pluripotency Network
We confirm a role for GLIS2, HMGA1 and PFDN5, and by inference other genes with an hESC-specific CGI-methylation state, in the maintenance of pluripotency transcription factor expression and pluripotency-associated epigenetic marking (DNA hydroxymethylation). For GLIS2 and PFDN5, and some functions of HMGA1, these roles are novel and suggestive of an epigenetically-defined network of stem cell regulation by genes also expressed in some differentiated cells. We surmised that this network would be controlled in turn by pluripotency-determining factors, and indeed ChIP confirmed the existence of predicted OCT4 binding sites in the promoters of all three genes [25, 37, 38]. Probable binding sites for NANOG and/or SOX2 in the biomarker gene promoters were also identified (data not shown). Our observation of a modulating effect on fibroblast reprogramming (positive for PFDN5, negative for HMGA1 and GLIS2) transfected with established reprogramming factors is consistent with the hypothesis that the biomarkers interact with the core pluripotency factors at some level. The inhibitory effects of HMGA1 and GLIS2 may reflect time- or phase-dependent roles for these factors in reprogramming which our experimental design did not address, as recently described for MBD3 [51], or reflect competing interaction with other factors. HMGA1-induced augmentation of reprogramming was achieved by co-transfection with OCT4, KLF4, SOX2 and L-MYC [48], whereas our study also included LIN28 and a short hairpin RNA directed against p53. Further studies are required to confirm interactions of GLIS2 and PFDN5 with pluripotency transcription factors. Binding sites for GLIS2 are predicted in the promoters of OCT4 and NANOG (data not shown). PFDN5 is known to repress c-MYC activity which regulates genes involved in many processes including cell-cycle control, metabolism, signal transduction, and cell-fate decisions as well as self-renewal (Chappell and Dalton, 2013). HMGA1 binding sites are predicted in the promoters of OCT4 and NANOG (data not shown) and have been shown by chromatin immunoprecipitation in SOX2, LIN28 and c-MYC [48]. The reduction in DNA hydroxymethylation, not accompanied by similar falls in TET gene transcription, following interference with HMGA1, GLIS2, or PFDN5 suggests that these effects are secondary events following disruption of core pluripotency functions. Collectively, our data suggest a model as shown in Fig 7. Downregulation of biomarker expression, either experimentally by RNA interference, or by methylation changes at associated CGIs modulating gene expression, results in downregulation of the core pluripotency transcription network, directly or indirectly. Differentiation is initiated, with the particular lineage decision decided by the presence or absence of secondary transcription regulators, such as epigenetically-regulated factors identified here. Stability of the differentiation process is conferred both by downregulation of pluripotency genes (NANOG, SOX2 and OCT4), and also by epigenetic changes, e.g. OCT4 promoter [52, 53], or methylation changes in CGIs associated with particular lineages. The methylation changes confer stability and heritability on the gene expression changes, and hence on the resulting cellular phenotype.
In conclusion, we identify epigenetically defined biomarkers of a pluripotent phenotype. The methylation state of these biomarkers is independent of variables such as culture condition or derivation method, and their expression is required for pluripotency. Appraisal of the methylation state of the CGIs described here could be a useful criterion for hESC line validation, or for assessing how well adult cell-derived IPSCs have established a true embryonic stem cell epigenetic state.
Supporting Information
Acknowledgments
The authors thank Profs. Adrian Bird (University of Edinburgh) and Agapios Sachinidis (University of Cologne) for helpful comments on the manuscript, Dr Rob Illingworth for help with the initial CGI array hybridisations, Samuel Collombet for help developing 5-hmC immunohistochemistry, Andrew Malinowski and Ms Rachael Riddoch for assistance with bisulphite sequencing.
Data Availability
All relevant data are included in the paper itself, or the supplementary information.
Funding Statement
This work was supported by EUFP7 funding to PDS and ST as part of the ESNATS (Embryonic Stem Cell Novel Alternative Testing Strategies) consortium (FP7-201619) and by the Portuguese FCT to MF (PTDC/BIA-GEN/116519/2010). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Jaenisch R, Bird A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nature genetics. 2003;33 Suppl:245–54. . [DOI] [PubMed] [Google Scholar]
- 2. Wolffe AP, Matzke MA. Epigenetics: regulation through repression. Science (New York, NY. 1999;286(5439):481–6. . [DOI] [PubMed] [Google Scholar]
- 3. Saxonov S, Berg P, Brutlag DL. A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proceedings of the National Academy of Sciences of the United States of America. 2006;103(5):1412–7. . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Edwards CA, Ferguson-Smith AC. Mechanisms regulating imprinted genes in clusters. Current opinion in cell biology. 2007;19(3):281–9. . [DOI] [PubMed] [Google Scholar]
- 5. Norris DP, Brockdorff N, Rastan S. Methylation status of CpG-rich islands on active and inactive mouse X chromosomes. Mamm Genome. 1991;1(2):78–83. . [DOI] [PubMed] [Google Scholar]
- 6. Bird A. DNA methylation patterns and epigenetic memory. Genes & development. 2002;16(1):6–21. . [DOI] [PubMed] [Google Scholar]
- 7. Lee SH, Cho SY, Shannon MF, Fan J, Rangasamy D. The impact of CpG island on defining transcriptional activation of the mouse L1 retrotransposable elements. PloS one. 2010;5(6):e11353 10.1371/journal.pone.0011353 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Baker DE, Harrison NJ, Maltby E, Smith K, Moore HD, Shaw PJ, et al. Adaptation to culture of human embryonic stem cells and oncogenesis in vivo. Nature biotechnology. 2007;25(2):207–15. . [DOI] [PubMed] [Google Scholar]
- 9. Amps K, Andrews PW, Anyfantis G, Armstrong L, Avery S, Baharvand H, et al. Screening ethnically diverse human embryonic stem cells identifies a chromosome 20 minimal amplicon conferring growth advantage. Nature biotechnology. 2011;29(12):1132–44. 10.1038/nbt.2051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Hanson C, Caisander G. Human embryonic stem cells and chromosome stability. Apmis. 2005;113(11–12):751–5. . [DOI] [PubMed] [Google Scholar]
- 11. Enver T, Soneji S, Joshi C, Brown J, Iborra F, Orntoft T, et al. Cellular differentiation hierarchies in normal and culture-adapted human embryonic stem cells. Human molecular genetics. 2005;14(21):3129–40. . [DOI] [PubMed] [Google Scholar]
- 12. Maitra A, Arking DE, Shivapurkar N, Ikeda M, Stastny V, Kassauei K, et al. Genomic alterations in cultured human embryonic stem cells. Nature genetics. 2005;37(10):1099–103. . [DOI] [PubMed] [Google Scholar]
- 13. Imreh MP, Gertow K, Cedervall J, Unger C, Holmberg K, Szoke K, et al. In vitro culture conditions favoring selection of chromosomal abnormalities in human ES cells. Journal of cellular biochemistry. 2006;99(2):508–16. . [DOI] [PubMed] [Google Scholar]
- 14. Rugg-Gunn PJ, Ferguson-Smith AC, Pedersen RA. Status of genomic imprinting in human embryonic stem cells as revealed by a large cohort of independently derived and maintained lines. Human molecular genetics. 2007;16 Spec No. 2:R243–51. . [DOI] [PubMed] [Google Scholar]
- 15. Allegrucci C, Wu YZ, Thurston A, Denning CN, Priddle H, Mummery CL, et al. Restriction landmark genome scanning identifies culture-induced DNA methylation instability in the human embryonic stem cell epigenome. Human molecular genetics. 2007;16(10):1253–68. . [DOI] [PubMed] [Google Scholar]
- 16. Kim KP, Thurston A, Mummery C, Ward-van Oostwaard D, Priddle H, Allegrucci C, et al. Gene-specific vulnerability to imprinting variability in human embryonic stem cell lines. Genome research. 2007;17(12):1731–42. . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Minoguchi S, Iba H. Instability of retroviral DNA methylation in embryonic stem cells. Stem cells (Dayton, Ohio). 2008;26(5):1166–73. . [DOI] [PubMed] [Google Scholar]
- 18. Chin MH, Mason MJ, Xie W, Volinia S, Singer M, Peterson C, et al. Induced pluripotent stem cells and embryonic stem cells are distinguished by gene expression signatures. Cell stem cell. 2009;5(1):111–23. 10.1016/j.stem.2009.06.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Hu BY, Weick JP, Yu J, Ma LX, Zhang XQ, Thomson JA, et al. Neural differentiation of human induced pluripotent stem cells follows developmental principles but with variable potency. Proceedings of the National Academy of Sciences of the United States of America. 2010;107(9):4335–40. 10.1073/pnas.0910012107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Kim K, Doi A, Wen B, Ng K, Zhao R, Cahan P, et al. Epigenetic memory in induced pluripotent stem cells. Nature. 2010;467(7313):285–90. 10.1038/nature09342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Lister R, Pelizzola M, Kida YS, Hawkins RD, Nery JR, Hon G, et al. Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells. Nature. 2011;471(7336):68–73. 10.1038/nature09798 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Ruiz S, Diep D, Gore A, Panopoulos AD, Montserrat N, Plongthongkum N, et al. Identification of a specific reprogramming-associated epigenetic signature in human induced pluripotent stem cells. Proceedings of the National Academy of Sciences of the United States of America. 2012;109(40):16196–201. 10.1073/pnas.1202352109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Shen Y, Chow J, Wang Z, Fan G. Abnormal CpG island methylation occurs during in vitro differentiation of human embryonic stem cells. Human molecular genetics. 2006;15(17):2623–35. . [DOI] [PubMed] [Google Scholar]
- 24. Bibikova M, Chudin E, Wu B, Zhou L, Garcia EW, Liu Y, et al. Human embryonic stem cells have a unique epigenetic signature. Genome research. 2006;16(9):1075–83. . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Lister R, Pelizzola M, Dowen RH, Hawkins RD, Hon G, Tonti-Filippini J, et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 2009;462(7271):315–22. 10.1038/nature08514 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Illingworth R, Kerr A, Desousa D, Jorgensen H, Ellis P, Stalker J, et al. A novel CpG island set identifies tissue-specific methylation at developmental gene loci. PLoS biology. 2008;6(1):e22 10.1371/journal.pbio.0060022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. De Sousa PA, Gardner J, Sneddon S, Pells S, Tye BJ, Dand P, et al. Clinically failed eggs as a source of normal human embryo stem cells. Stem cell research. 2009;2(3):188–97. 10.1016/j.scr.2009.01.002 [DOI] [PubMed] [Google Scholar]
- 28. Fletcher JM, Ferrier PM, Gardner JO, Harkness L, Dhanjal S, Serhal P, et al. Variations in humanized and defined culture conditions supporting derivation of new human embryonic stem cell lines. Cloning and stem cells. 2006;8(4):319–34. . [DOI] [PubMed] [Google Scholar]
- 29. Zhang R, Mjoseng HK, Hoeve MA, Bauer NG, Pells S, Besseling R, et al. A thermoresponsive and chemically defined hydrogel for long-term culture of human embryonic stem cells. Nature communications. 2013;4:1335 10.1038/ncomms2341 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Ermakov A, Pells S, Freile P, Ganeva VV, Wildenhain J, Bradley M, et al. A role for intracellular calcium downstream of G-protein signaling in undifferentiated human embryonic stem cell culture. Stem cell research. 2012;9(3):171–84. 10.1016/j.scr.2012.06.007 [DOI] [PubMed] [Google Scholar]
- 31. Ruzov A, Tsenkina Y, Serio A, Dudnakova T, Fletcher J, Bai Y, et al. Lineage-specific distribution of high levels of genomic 5-hydroxymethylcytosine in mammalian development. Cell research. 2011;21(9):1332–42. 10.1038/cr.2011.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Okita K, Matsumura Y, Sato Y, Okada A, Morizane A, Okamoto S, et al. A more efficient method to generate integration-free human iPS cells. Nature methods. 2011;8(5):409–12. 10.1038/nmeth.1591 . [DOI] [PubMed] [Google Scholar]
- 33. Monk D, Arnaud P, Apostolidou S, Hills FA, Kelsey G, Stanier P, et al. Limited evolutionary conservation of imprinting in the human placenta. Proceedings of the National Academy of Sciences of the United States of America. 2006;103(17):6623–8. 10.1073/pnas.0511031103 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Ezkurdia I, Juan D, Rodriguez JM, Frankish A, Diekhans M, Harrow J, et al. Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes. Human molecular genetics. 2014;23(22):5866–78. 10.1093/hmg/ddu309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Chia NY, Chan YS, Feng B, Lu X, Orlov YL, Moreau D, et al. A genome-wide RNAi screen reveals determinants of human embryonic stem cell identity. Nature. 2010;468(7321):316–20. 10.1038/nature09531 [DOI] [PubMed] [Google Scholar]
- 36. Hay DC, Sutherland L, Clark J, Burdon T. Oct-4 knockdown induces similar patterns of endoderm and trophoblast differentiation markers in human and mouse embryonic stem cells. Stem cells (Dayton, Ohio). 2004;22(2):225–35. . [DOI] [PubMed] [Google Scholar]
- 37. Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, et al. Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nature genetics. 2010;42(7):631–4. 10.1038/ng.600 [DOI] [PubMed] [Google Scholar]
- 38. Mullen AC, Orlando DA, Newman JJ, Loven J, Kumar RM, Bilodeau S, et al. Master transcription factors determine cell-type-specific responses to TGF-beta signaling. Cell. 2011;147(3):565–76. 10.1016/j.cell.2011.08.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Cartharius K, Frech K, Grote K, Klocke B, Haltmeier M, Klingenhoff A, et al. MatInspector and beyond: promoter analysis based on transcription factor binding sites. Bioinformatics. 2005;21(13):2933–42. 10.1093/bioinformatics/bti473 . [DOI] [PubMed] [Google Scholar]
- 40. Illingworth RS, Gruenewald-Schneider U, Webb S, Kerr AR, James KD, Turner DJ, et al. Orphan CpG islands identify numerous conserved promoters in the mammalian genome. PLoS genetics. 2010;6(9):e1001134 10.1371/journal.pgen.1001134 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Maunakea AK, Nagarajan RP, Bilenky M, Ballinger TJ, D'Souza C, Fouse SD, et al. Conserved role of intragenic DNA methylation in regulating alternative promoters. Nature. 2010;466(7303):253–7. 10.1038/nature09165 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Au KF, Sebastiano V, Afshar PT, Durruthy JD, Lee L, Williams BA, et al. Characterization of the human ESC transcriptome by hybrid sequencing. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(50):E4821–30. 10.1073/pnas.1320101110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Tang X, Hou M, Ding Y, Li Z, Ren L, Gao G. Systematically profiling and annotating long intergenic non-coding RNAs in human embryonic stem cell. BMC genomics. 2013;14 Suppl 5:S3 10.1186/1471-2164-14-S5-S3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Bock C, Kiskinis E, Verstappen G, Gu H, Boulting G, Smith ZD, et al. Reference Maps of human ES and iPS cell variation enable high-throughput characterization of pluripotent cell lines. Cell. 2011;144(3):439–52. 10.1016/j.cell.2010.12.032.21295703 PubMed Central PMCID: PMC3063454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Redmer T, Diecke S, Grigoryan T, Quiroga-Negreira A, Birchmeier W, Besser D. E-cadherin is crucial for embryonic stem cell pluripotency and can replace OCT4 during somatic cell reprogramming. EMBO reports. 2011;12(7):720–6. 10.1038/embor.2011.88 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Maekawa M, Yamaguchi K, Nakamura T, Shibukawa R, Kodanaka I, Ichisaka T, et al. Direct reprogramming of somatic cells is promoted by maternal transcription factor Glis1. Nature. 2011;474(7350):225–9. 10.1038/nature10106 [DOI] [PubMed] [Google Scholar]
- 47. Cleynen I, Van de Ven WJ. The HMGA proteins: a myriad of functions (Review). International journal of oncology. 2008;32(2):289–305. . [PubMed] [Google Scholar]
- 48. Shah SN, Kerr C, Cope L, Zambidis E, Liu C, Hillion J, et al. HMGA1 reprograms somatic cells into pluripotent stem cells by inducing stem cell transcriptional networks. PloS one. 2012;7(11):e48533 10.1371/journal.pone.0048533 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Yoshida T, Kitaura H, Hagio Y, Sato T, Iguchi-Ariga SM, Ariga H. Negative regulation of the Wnt signal by MM-1 through inhibiting expression of the wnt4 gene. Experimental cell research. 2008;314(6):1217–28. 10.1016/j.yexcr.2008.01.002 [DOI] [PubMed] [Google Scholar]
- 50. Zhang F, Nakanishi G, Kurebayashi S, Yoshino K, Perantoni A, Kim YS, et al. Characterization of Glis2, a novel gene encoding a Gli-related, Kruppel-like transcription factor with transactivation and repressor functions. Roles in kidney development and neurogenesis. The Journal of biological chemistry. 2002;277(12):10139–49. . [DOI] [PubMed] [Google Scholar]
- 51. Rais Y, Zviran A, Geula S, Gafni O, Chomsky E, Viukov S, et al. Deterministic direct reprogramming of somatic cells to pluripotency. Nature. 2013;502(7469):65–70. 10.1038/nature12587 . [DOI] [PubMed] [Google Scholar]
- 52. Fouse SD, Shen Y, Pellegrini M, Cole S, Meissner A, Van Neste L, et al. Promoter CpG methylation contributes to ES cell gene regulation in parallel with Oct4/Nanog, PcG complex, and histone H3 K4/K27 trimethylation. Cell stem cell. 2008;2(2):160–9. 10.1016/j.stem.2007.12.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Yeo S, Jeong S, Kim J, Han JS, Han YM, Kang YK. Characterization of DNA methylation change in stem cell marker genes during differentiation of human embryonic stem cells. Biochemical and biophysical research communications. 2007;359(3):536–42. . [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are included in the paper itself, or the supplementary information.