Abstract
CpG methylation generally occurs on both DNA strands and is essential for mammalian development and differentiation. Until recently, hemimethylation, in which only one strand is methylated, was considered to be simply a transitory state generated during DNA synthesis. The discovery that a subset of CCCTC-binding factor (CTCF) binding sites is heritably hemimethylated suggests that hemimethylation might have an unknown biological function. Here we show that the binding of CTCF is profoundly altered by which DNA strand is methylated and by the specific CTCF binding motif. CpG methylation on the motif strand can inhibit CTCF binding by up to 7-fold, whereas methylation on the opposite strand can stimulate binding by up to 4-fold. Thus, hemimethylation can alter binding by up to 28-fold in a strand-specific manner. The mechanism for sensing methylation on the opposite strand requires two critical residues, V454 and S364, within CTCF zinc fingers 7 and 4. Similar to methylation, CpG hydroxymethylation on the motif strand can inhibit CTCF binding by up to 4-fold. However, hydroxymethylation on the opposite strand removes the stimulatory effect. Strand-specific methylation states may therefore provide a mechanism to explain the transient and dynamic nature of CTCF-mediated chromatin interactions.
Graphical Abstract
INTRODUCTION
DNA methylation patterns and their faithful inheritance during cell division are essential for appropriate gene expression (1). For vertebrates, cytosines in the CpG palindrome in duplex DNA are generally symmetrically methylated or unmethylated on both strands (2). An exception to this symmetrical methylation occurs when hemimethylated DNA is generated by DNA synthesis during S-phase. This transitory state of inherent asymmetry is generally rapidly converted to symmetrical methylation by DNA methyltransferase 1 which prefers hemimethylated DNA as a substrate (reviewed in (3)). Thus, only low levels of hemimethylation exist in most cells such as somatic mouse tissues (4). On the other hand, maintained hemimethylation of some repetitive sequences has been reported in mouse embryonic stem (ES) cells and in early mouse embryos (4,5). The concept that hemimethylation is simply a transitory state was challenged by the recent discovery of persistent and heritable hemimethylation flanking a subset of CCCTC-binding factor (CTCF) and cohesin binding sites in ES cells, but not in other somatic cell types (6). The fact that the hemimethylation state is heritable after cell division provides a strong suggestion that it might have biological significance.
CTCF has a major role in genomic organization and function (7). Also, binding to its recognition motif sequence is well known to be sensitive to symmetrical DNA methylation, for example in controlling genomic imprinting of the Igf2/H19 locus (8,9). We therefore used short oligonucleotides (oligos) and recombinant CTCF full-length (CTCF FL) and truncated proteins containing the eleven zinc finger (ZF) domains (CTCF ZF1–11) to investigate potential differential effects of asymmetric DNA methylation states on binding in vitro. Methylation on the motif strand inhibited binding; but unexpectedly, methylation on the opposite strand stimulated binding. Potentially, the asymmetry inherent in hemimethylation could be interpreted by CTCF and may serve as a signal for asymmetric cell division in stem cells.
MATERIALS AND METHODS
DNA cloning and protein expression
Human CTCF proteins were cloned from the following plasmids: CTCF ZF1–11 (pXC1441) was a gift from Drs Xiaodong Cheng and John Horton (10) and pDONR223_CTCF_WT was a gift from Drs Jesse Boehm, William Hahn, and David Root (Addgene plasmid # 81789; http://n2t.net/addgene:81789;RRID: Addgene_81789) (11). The CTCF ZF1-11 fragment or full-length proteins were cloned into pSUMO vectors for expressing 6xHis-SUMO tagged CTCF proteins. Tagged CTCF proteins were expressed in the Escherichia coli strain BL21-CodonPlus (DE3)-RIL (230245, Agilent). For large-scale (2 L) purification, a colony was inoculated into 100 mL of LB medium containing ampicillin (50 mg/mL) and 25 μM ZnCl2 and cultured overnight at 37°C at 170 rpm. Subsequently, the culture was amplified into 2 L of LB medium and was grown for 4–5 h at 28°C to OD600 of ∼0.8 and shifted to 16°C. Protein expression was induced with 0.1 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) overnight at 16°C. Cells were harvested by centrifugation at 4000 × g for 20 min at 4°C. Cell pellets were processed immediately or stored at –80°C for future purification.
Cell pellets were resuspended in 150 mL of buffer A (20 mM Tris–HCl (pH 8.0), 25 mM imidazole, 1 M NaCl, 5% (v/v) glycerol, 0.5 mM Tris(2-carboxyethyl)phosphine hydrochloride (TCEP), 25 μM ZnCl2) with 1 mM phenylmethanesulfonylfluoride (PMSF). Resuspended cells were lysed in a high-pressure homogenizer (APV) and clarified by centrifugation at 38 700 × g at 4°C for 40 min. The supernatant was collected and loaded onto a 5 ml HisTrap FF column (17525501, Cytiva) pre-equilibrated with buffer A and washed with 10 column volumes (CVs) washing buffer (20 mM Tris–HCl (pH 8.0), 50 mM imidazole, 1 M NaCl, 5% (v/v) glycerol, 0.5 mM TCEP, 25 μM ZnCl2). Bound proteins were eluted with elution buffer (20 mM Tris–HCl (pH 8.0), 250 mM imidazole, 1 M NaCl, 5% (v/v) glycerol, 0.5 mM TCEP, 25 μM ZnCl2). For 6xHis-SUMO tagged CTCF, pooled protein was subsequently concentrated and loaded onto a Superdex 200 16/600 column (GE28-9893-35, Cytiva) pre-equilibrated with size-exclusion buffer (20 mM Tris–HCl (pH 8.0), 1 M NaCl, 5% (v/v) glycerol, 0.5 mM TCEP, 25 μM ZnCl2). The 6xHis-SUMO tagged CTCF protein was eluted at a volume of 80 mL for ZF1-11 and 65 ml for full-length protein. The peak fractions were pooled, concentrated, and fresh-frozen in aliquots after dialysis against a size-exclusion buffer containing 50% glycerol.
For the fluorescence polarization DNA binding assay, both 6xHis-SUMO tagged protein (CTCF ZF1–11) and untagged protein (CTCF ZF1–11 and CTCF full-length) were used, with no change in results based on the presence or absence of the tag. For proteins with the 6xHis-SUMO tag removed, ULP1 Sumo protease was added, and the samples were dialyzed overnight to remove imidazole, then the protein was applied to a 5 ml HisTrap FF column (17525501, Cytiva) pre-equilibrated with buffer A without imidazole. Protein was eluted by a gradient increase in the concentration of imidazole to 250 mM with 10 CVs and subsequently concentrated and loaded onto a Superdex 200 16/600 column pre-equilibrated with size-exclusion buffer. The CTCF protein was eluted at a volume of 88 ml for ZF1–11 and 66 ml for full-length protein. The peak fractions were pooled, concentrated, and fresh-frozen in aliquots after dialysis against a size-exclusion buffer containing 50% glycerol. Protein purity and cleavage completion were analyzed by SDS-PAGE.
Site-directed mutagenesis
All site-directed mutagenesis was carried out using the QuikChange kit (200521, Agilent) and the constructs were confirmed by DNA sequencing.
Fluorescence polarization (FP) DNA binding assay
Single-stranded DNA oligos were purchased from IDT and the sequences are shown in Supplementary Table S1. The strand containing the CTCF motif was labeled with 6-carboxy-fluorescein (FAM). Single-stranded oligos were annealed, and the efficiency of the annealing was assessed by running the oligos on a 20% TBE (tris–borate–EDTA) gel. Annealed double-stranded oligos were diluted in DNA binding buffer (20 mM Tris–HCl (pH 7.5), 300 mM NaCl, 5% (v/v) glycerol, and 0.5 mM TCEP) and each oligo (5 nM) was incubated for 15 min at 25°C with a serial dilution of either full-length CTCF or ZF1–11 CTCF protein. The protein concentration ranged from 4000 to 0.244 nM for full-length CTCF and 8000 to 0.488 nM for ZF1-11 CTCF. Duplicate protein serial dilutions were set up for each oligo in a 384-well black assay plate (#3575, Corning) and the assay was repeated giving a total of 4–8 data sets for each oligo-protein pair. The plates were read on a Synergy Neo microplate reader (BioTek). Negative control wells containing only oligo and DNA binding buffer were used as background and subtracted from the polarization values. Polarization (P) was then converted to anisotropy (A) using the equation (A) = (2 × P)/(3 − P) (12). Graphpad Prism was used to graph the data and calculate the dissociation constants (KD) using the nonlinear regression equation for specific binding with Hill slope. A negative control oligo was included in each assay with a resulting KD of ≥900 nM.
CTCF ZF3–7 and DNA complex modeling
The CTCF ZF3–7 in complex with DNA structure (PDB ID: 5KKQ (10)) was used as the initial structure template. The DNA fragment was manually mutated to Cen-CTCF or H19 in COOT (13), and the protein coordinates and DNA coordinates were saved as receptor and ligand, separately. The docking method MDockPP (14) was run through the provided web server without any constrains. All the other docking parameters were as default. The model was further refined by NPDock DNA–protein complex refinement (15).
Molecular dynamics simulations
Simulation systems were solvated with TIP3P (transferable intermolecular potential with 3 points) water with 150 mM NaCl, minimized to convergence by steepest descent. 0.1 ns of simulation in the canonical ensemble were then used to relax the solvent and salt with the protein and DNA restrained, and subsequently equilibrated to atmospheric conditions in the isothermal–isobaric ensemble for 5 ns. All simulations used the GROMACS simulation engine (16). Production simulations were performed in the canonical ensemble at 300 K.
Discovery of hemimethylated CpG dyads within CTCF motifs
Human hemimethylation data from Xu and Corces (2018) (6) and mouse hemimethylation data from Zhao et al. (17) were downloaded from GEO database (GSE97394 for Xu and Corces, GSE48229 for Zhao et al.). CTCF genomic coordinates for both human GRCh38/hg38 and mouse GRCm38/mm10 were obtained from the R AnnotationHub resource package ‘CTCF 0.99.11’ (18) and intersected with hemimethylated CpG coordinates using Bedtools V2.30 (19). Overlapping CpGs were then filtered for hemimethylation defined as a 50% difference in methylation across a CpG dyad. For the associated genomic regions with nearby genes, gene ontology (GO) analysis was performed using GREAT V4.0.4 (http://great.stanford.edu/public/html/) (20).
RESULTS AND DISCUSSION
CTCF binding is regulated by strand-specific CpG methylation
As illustrated in Figure 1A, there are two key cytosines within the CTCF consensus recognition motif at positions 2 and 12. Wang et al. reported an enrichment of CpG dinucleotides at these two positions (called position 1 and 11 in their paper) and found that 29% of CTCF motifs contain a CpG at one or both of these positions (21). The C2 and C12 cytosines were shown to interact with the ZF7 and ZF4 domains in the crystal structures reported by Hashimoto et al. (10). To test the effect of hemimethylation of these cytosines on CTCF binding, four CTCF motifs in the form of 18–19 base pair double-stranded oligos (Figure 1) were selected for fluorescence polarization (FP) DNA binding studies (12). We chose to investigate a CTCF site located at the centromeric boundary of a differentially methylated region (DMR) which includes the only known example of a variably imprinted human gene, nc886/VTRNA2-1 (22,23). We have previously shown that methylation of the nc886 DMR and the resultant repression of nc886 is associated with lower body mass index in children and reduced survival in individuals with acute myeloid leukemia (23,24). Given that the nc886 DMR is variably methylated, we hypothesized that the centromeric CTCF site (Cen-CTCF) would be regulated by methylation. The Cen-CTCF binding motif has CpGs at both the C2 and C12 positions and was discovered using motifbreakR (https://github.com/Simon-Coetzee/MotifBreakR (25)) to predict whether the rs2346018 A/C polymorphism would affect CTCF binding at this site. The presence of an A provides a consensus nucleotide at position 6 producing a recognition motif for CTCF (Figure 1A). The binding of the Cen-CTCF motif was compared to two positive control motifs. The first is the CTCF motif located telomeric to the nc886 DMR (Telo-CTCF) which does not contain a CpG site and is frequently occupied by CTCF in multiple cell types based on ENCODE ChIP-seq data. The second positive control is a well-studied binding motif from the Igf2/H19 imprinting control region (H19) which has previously been shown to interact with CTCF (8–10,26). The H19 motif contains a CpG at position C2 and an adjacent CpG in the triplet interacting with ZF6 (Figure 1A). As a negative control, we used the DNA sequence 5′ to the Cen-CTCF motif which differs considerably from the consensus sequence and is not predicted to be recognized by CTCF.
For the in vitro binding experiments, we used both full-length CTCF (CTCF FL) protein and the DNA-binding domain of the protein (CTCF ZF1–11). As shown in Figure 1B, there was strong binding of both CTCF ZF1–11 and CTCF FL proteins (KD ∼5 nM) to the positive control, Telo-CTCF oligo. The negative control showed 200-fold less affinity (KD ∼1000 nM) establishing the specificity of the assay. The Cen-CTCF and H19 oligos showed intermediate levels of interaction with both CTCF ZF1-11 and CTCF FL proteins with dissociation constants of about 200 nM. Since the results were similar for both the full-length protein and the DNA-binding portion of the protein and only the ZF domains are required for binding to the motif (10,26), further studies to investigate the effects of DNA methylation on CTCF binding were therefore conducted with CTCF ZF1–11 protein using the Cen-CTCF and H19 oligos as ligands.
We next measured the effects of various CpG methylation patterns in the Cen-CTCF and H19 oligos on the binding of CTCF ZF1-11 protein using FP assays (Figure 2). As expected, full methylation of both strands strongly inhibited binding as shown by an increase in the KD from 163 to 1075 nM (7-fold difference) (Figure 2A). Methylation of only the motif strand C2 and C12 positions had an almost equivalent effect as full methylation of both strands (KD = 1034 nM). On the other hand, methylation of the opposite strand C2 and C12 positions increased binding to a KD of 37 nM showing a 28-fold difference in binding between the two hemimethylation states (Figure 2A). Further investigation of the roles of individual C2 and C12 methylation and combinations of methylation showed the dominant role of motif strand C2 methylation in inhibiting binding and the additive nature of opposite strand methylation in stimulating binding (Supplementary Table S2). Given the surprising nature of these observations, we verified the stimulatory effects of hemimethylation using the H19 oligo, since full methylation of this site has been well studied (8,9). As indicated in Figure 1A, this motif has a CpG at the C2 but not the C12 position. Full methylation of the C2 position on both strands or methylation of only the motif strand C2 position changed the KD from 149 nM to 294 and 346 nM respectively (Figure 2B). Once again, methylation of the opposite strand stimulated binding (KD 89 nM), and the difference between the two hemimethylation states was 4-fold (Figure 2B).
Two previous studies assessed hemimethylation of CTCF binding sites within the imprinting control region upstream of the mouse and human H19 gene but did not report a change in CTCF binding when the opposite DNA strand was methylated (9,26). One of these studies included the MS1 site from the mouse HS1 insulator (9) which contains the same motif sequence used in the H19 oligo in this paper. The gel mobility shift assay that was used in these previous studies may not have been sensitive enough to observe or quantitate an increase in binding. Here, we find that the effects of strand-specific methylation depend on the actual CTCF target sequence; and the inhibition of binding from motif strand methylation is dominant over the stimulation of binding from opposite strand methylation.
Hemimethylated CpGs occur within CTCF motifs in embryonic stem cells
Our results that CTCF binding can be regulated by strand-specific hemimethylation states led us to investigate whether any of the hemimethylated CpGs identified in H9 human embryonic stem cells by Xu and Corces (2018) (6) and in undifferentiated E14TG2a mouse embryonic stem cells by Zhao et al. (17) are located within a CTCF motif. In the Xu and Corces paper (2018), the authors found that hemimethylated CpGs often flank CTCF motifs and show an asymmetry as to which strand is methylated (6). We used their nascent bisulfite sequencing pulse-chase data to determine whether CpGs within the recognition motif itself are hemimethylated. Using the CTCF consensus sequence data sources as described in Dozmorov et al. (18), we identified 102838 CpGs that fall within a CTCF motif of which 803 of these CpGs were found to be hemimethylated across both pulse and chase using a 50% difference in methylation cut-off (Table 1 and Supplementary Table S3). Using the hairpin bisulfite sequencing data from the Zhao et al. (2014) paper, we found 189291 CpGs that fall within a CTCF motif of which 2604 are hemimethylated using a 50% difference in methylation cut-off (Table 1 and Supplementary Table S4). It is worth noting that the Cen-CTCF motif is not present in the mouse genome. In the human ES cell line, the Cen-CTCF motif does appear to contain hemimethylation; however, it was not included in our final data set because we limited our results to CpGs with hemimethylation across both pulse and chase and there was missing data at this site. The data from both studies provide evidence that about 1% of CpGs within CTCF motifs are hemimethylated in vivo, and there is approximately an even distribution of methylation on either the motif strand or the opposite strand in both mouse and human ES cells (Table 1). This suggests that strand-specific hemimethylation could potentially have opposing effects on CTCF binding in vivo as predicted by our in vitro data, but further detailed studies in living cells are needed to substantiate this.
Table 1.
Data source | Xu and Corces, 2018 (human) | Zhao et al., 2014 (mouse) |
---|---|---|
CpGs located within a CTCF motif | 102 838 | 189 291 |
Hemimethylated CpGs within a CTCF motif | 803* (0.8%) | 2604 (1.4%) |
CpGs with hemimethylation on Motif strand | 222 | 1120 |
CpGs with hemimethylation on Opposite strand | 242 | 1054 |
CpGs with hemimethylation on both Motif and Opposite strands | 339 | 430 |
5 most significant GO terms for nearest genes | Neuron differentiation, Nervous system development, Animal organ development, Cell development, Animal organ morphogenesis | Regulation of granulocyte differentiation, Positive regulation of myeloid cell differentiation, Maintenance of protein location, Positive regulation of myeloid leukocyte differentiation, Cellular response to epidermal growth factor stimulus |
Human nascent bisulfite sequencing pulse-chase hemimethylation data from Xu and Corces (2018) and mouse hairpin bisulfite sequencing hemimethylation data from Zhao et al. (2014) were downloaded from GEO database (GSE97394 and GSE48229). * Only CpG dyads with hemimethylation across both pulse and chase were included.
To assess the possible biological significance of these hemimethylated CpGs, we used gene ontology (GO) enrichment analysis of the genes which contain or are near the hemimethylated CpGs. We found that hemimethylation at CTCF motifs is enriched within/near genes involved in tissue development and cellular differentiation. The presence of hemimethylated CpGs on the opposite strands may strengthen CTCF binding and provide a mechanism to maintain CTCF at certain critical sites during cell division. Therefore, it is possible that this mechanism could be involved in regulating the differentiation program in stem cells.
CTCF V454 and S364 sense the presence of methyl groups on the opposite strand
To explore the structural basis for the surprising increase in binding of CTCF to methylation on the opposite strand, we utilized the detailed crystallographic data of Hashimoto et al. (PDB ID: 5T00 (10)), which was obtained using a different CTCF motif containing a CA instead of a CpG at the C2 position with the resultant thymine rather than a cytosine on the opposite strand. This opposite strand thymine presents a methyl group in exactly the same geometry as 5-methylcytosine (5mC) and was shown to have hydrophobic interaction with V454 of ZF7 in the existing crystallographic data (PDB ID: 5T00 (10)). In addition, the C beta from the S364 of ZF4 has a hydrophobic interaction with 5mC at the C12 position on the opposite strand. In agreement with this observation, our Molecular Dynamics (MD) simulations using the 5T00 structure and substituting in the Cen-CTCF oligo revealed that V454 and S364 remain in close proximity to the mCpG at positions C2 and C12 on the opposite strand for hundreds of nanoseconds (Supplementary Figure S1). This observation led us to hypothesize that V454 of ZF7 and S364 of ZF4 sense opposite strand methylation through hydrophobic contact with 5mC. Docking of the opposite strands of the C2 and C12 positions of Cen-CTCF was performed in silico to investigate the potential interactions between these key cytosines and ZF7 and ZF4 (Figure 3A and B). The Valine residue at position 454 and the Serine residue at position 364 in the protein were found to be in close proximities to the C2 and C12 opposite nucleotides respectively, and therefore may interact with the methyl group of the 5mC through hydrophobic interactions. Molecular modeling predicted that the addition of a methyl group to the C2 opposite position would decrease the interaction distance from 4.9 angstroms (Å) to 3.5 Å and from 5.1 to 3.6 Å for the C12 opposite position (Figure 3A and B). Similar results were found for the C2 position on the H19 motif where 5mC decreased the interaction distance from 4.2 to 3.0 Å suggesting that this mechanism might be general (Supplementary Figure S2).
The likelihood that these predicted interactions with 5mC would increase the binding of CTCF was investigated by exploring the effects of substituting the key residues (V454 and S364) in the protein with glycine (hydrophobic interaction reducing) and investigating the influence of 5mC on binding to Cen-CTCF. As expected, methylation of the motif strand inhibited binding by all mutant proteins (Figure 3C–F). On the other hand, the stimulatory effect of 5mC on the opposite strand was reduced by substitution to glycine at either the V454 or S364 positions (Figure 3C and D), but not at the S450 or V363 positions (Figure 3E and F). These data strongly support the structural predictions by showing that V454 and S364 residues in the CTCF ZFs sense the presence of methyl groups on the opposite strand C2 and C12 positions.
CTCF binding is regulated by CpG hydroxymethylation
Several studies have reported changes in 5-hydroxymethylcytosine (5hmC) levels at CTCF binding sites (27–30); however, little is known about the effects of 5hmC on CTCF binding. The 5hmC base is predicted to not only introduce a hydrophilic group, but also to sterically obstruct the interactions between the CTCF V454 and S364 with the opposite strand C2 and C12 positions of Cen-CTCF and C2 of H19 (Figure 4A and C). Not shown, is the prediction that 5hmC on the motif strand would still inhibit binding given that it is larger than 5mC. We tested these predictions using the Cen-CTCF and H19 oligos containing 5hmC at the target positions (Figure 4B and D). Full hydroxymethylation of the CpGs within the Cen-CTCF oligo or hydroxymethylation on the motif strand resulted in a 4-fold reduction in the binding affinity (Figure 4B), confirming that 5mC or 5hmC at the C2 position sterically obstructs D451 in CTCF (10). The presence of 5hmC on the opposite strand had no effect or subtle effects on the binding of CTCF (Figure 4B). For the H19 oligo, hydroxymethylation on both strands of the oligo or only on the motif strand resulted in a 2-fold reduction in the binding affinity (Figure 4D). As with the Cen-CTCF oligo, hydroxymethylation of the opposite strand of the H19 oligo had only a subtle effect with slightly diminished binding affinity relative to the unmethylated oligo (Figure 4D). Therefore, the substitution of 5hmC for 5mC on the opposite strand mitigated the stimulation, whereas it maintained the inhibitory effects on the motif strand, albeit to a lesser extent than 5mC. Our finding that hydroxymethylation can inhibit the binding of CTCF to DNA is in line with a study by Marina et al. (2016) in which they show by electrophoretic mobility shift assay that DNA probes from the CD45 and KCNA2B genes with the modifications 5mC, 5hmC, and 5fC have minimal interaction with CTCF (31). Variable hydroxymethylation might therefore add an additional level of control to CTCF binding. Previous reports have shown that hydroxymethylation is increased at weak and variable CTCF sites (29) and have suggested a role for hydroxymethylation in maintaining a poised chromatin state (28,29). Thus, hydroxymethylation may represent a dynamic intermediate that maintains the motif in a poised state ready for changes in cytosine modifications that could stimulate the binding of CTCF.
CONCLUSIONS AND IMPORTANCE
The recent discovery of heritable hemimethylation states in ES cells (6) has challenged the long-held assumption that hemimethylation is simply an intermediate to full symmetrical methylation and therefore of little biological significance. Our results that different hemimethylation states at CTCF binding sites can change binding affinity by as much as 28-fold suggests that there may indeed be a function for hemimethylation in the organization of the genome, particularly in ES cells. We were helped considerably in defining the mechanism for these effects by the detailed structures published by Hashimoto et al. (10) and were able to confirm the mechanism by mutating the relevant zinc fingers in CTCF. Our results are particularly intriguing because hemimethylation at a single CTCF motif could potentially cause large-scale alterations of the epigenome in ES cells. A recent study by Gabriele et al. demonstrated that even CTCF topologically associated domain boundaries, which are thought to be stable interactions, can be highly dynamic and transient in nature (32). The regulation of CTCF sites by specific CpG modified states in a strand-specific manner, like hemimethylation and hydroxymethylation, may provide a mechanism to explain the transient and dynamic nature of CTCF-mediated chromatin interactions.
The phenomenon of asymmetric cell division was identified by Edwin Conklin >100 years ago (33) and many mechanisms likely participate in the generation of cellular diversity (34). While there is no evidence yet that DNA methylation plays a role in this process, hemimethylation may provide a mechanism for asymmetric cell division given that it generates two sequence-identical DNA duplexes with strongly differing potentials to bind CTCF. In this regard, it is interesting that hemimethylation is much less common in the differentiated progeny of ES cells (6) suggesting that resolution of hemimethylation may be associated with cell differentiation.
DATA AVAILABILITY
All data are available from the corresponding author upon request.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Drs Scott Rothbart and Robert Vaughan (Van Andel Institute) for assistance with establishing a protocol for the FP assay, Drs Xiaodong Cheng and John Horton (MD Anderson) for providing the CTCF expression plasmids, Dr Evan Worden (Van Andel Institute) for helpful suggestions and editing, and Joshua Wierenga (Van Andel Institute student intern) for assisting with expression of the mutant CTCF proteins.
Contributor Information
Stacey L Thomas, Department of Epigenetics, Van Andel Institute, Grand Rapids, MI 49503, USA.
Ting-Hai Xu, Department of Epigenetics, Van Andel Institute, Grand Rapids, MI 49503, USA; Department of Structural Biology, Van Andel Institute, Grand Rapids, MI 49503, USA.
Brittany L Carpenter, Department of Epigenetics, Van Andel Institute, Grand Rapids, MI 49503, USA.
Steven E Pierce, Department of Neurodegenerative Science, Van Andel Institute, Grand Rapids, MI 49503, USA.
Bradley M Dickson, Department of Epigenetics, Van Andel Institute, Grand Rapids, MI 49503, USA.
Minmin Liu, Department of Epigenetics, Van Andel Institute, Grand Rapids, MI 49503, USA.
Gangning Liang, Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90089, USA.
Peter A Jones, Department of Epigenetics, Van Andel Institute, Grand Rapids, MI 49503, USA.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
National Cancer Institute [R35CA209859 to P.A.J., G.L., R50CA243878 to M.L.]. Funding for open access charge: Van Andel Institute internal funding (to P.A.J.).
Conflict of interest statement. P.A.J. is a paid consultant for Zymo Research; however, no Zymo Research products were used in this study. G.L. is a paid consultant for Pangea Laboratory.
REFERENCES
- 1. Cedar H., Bergman Y.. Linking DNA methylation and histone modification: patterns and paradigms. Nat. Rev. Genet. 2009; 10:295–304. [DOI] [PubMed] [Google Scholar]
- 2. Bird A.P. Use of restriction enzymes to study eukaryotic DNA methylation: II. The symmetry of methylated sites supports semi-conservative copying of the methylation pattern. J. Mol. Biol. 1978; 118:49–60. [DOI] [PubMed] [Google Scholar]
- 3. Jones P.A., Liang G.. Rethinking how DNA methylation patterns are maintained. Nat. Rev. Genet. 2009; 10:805–811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Arand J., Wossidlo M., Lepikhov K., Peat J.R., Reik W., Walter J.. Selective impairment of methylation maintenance is the major cause of DNA methylation reprogramming in the early embryo. Epigenetics Chromatin. 2015; 8:1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Liang G., Chan M.F., Tomigahara Y., Tsai Y.C., Gonzales F.A., Li E., Laird P.W., Jones P.A.. Cooperativity between DNA methyltransferases in the maintenance methylation of repetitive elements. Mol. Cell. Biol. 2002; 22:480–491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Xu C., Corces V.G.. Nascent DNA methylome mapping reveals inheritance of hemimethylation at CTCF/cohesin sites. Science. 2018; 359:1166–1170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Phillips J.E., Corces V.G.. CTCF: master weaver of the genome. Cell. 2009; 137:1194–1211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Bell A.C., Felsenfeld G.. Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature. 2000; 405:482–485. [DOI] [PubMed] [Google Scholar]
- 9. Hark A.T., Schoenherr C.J., Katz D.J., Ingram R.S., Levorse J.M., Tilghman S.M.. CTCF mediates methylation-sensitive enhancer-blocking activity at the H19/Igf2 locus. Nature. 2000; 405:486–489. [DOI] [PubMed] [Google Scholar]
- 10. Hashimoto H., Wang D., Horton J.R., Zhang X., Corces V.G., Cheng X.. Structural basis for the versatile and methylation-dependent binding of CTCF to DNA. Mol. Cell. 2017; 66:711–720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Kim E., Ilic N., Shrestha Y., Zou L., Kamburov A., Zhu C., Yang X., Lubonja R., Tran N., Nguyen C.et al.. Systematic functional interrogation of rare cancer variants identifies oncogenic alleles. Cancer Discov. 2016; 6:714–726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Moerke N.J. Fluorescence polarization (FP) assays for monitoring peptide-protein or nucleic acid-protein binding. Curr. Protoc. Chem. Biol. 2009; 1:1–15. [DOI] [PubMed] [Google Scholar]
- 13. Emsley P., Cowtan K.. Coot: model-building tools for molecular graphics. Acta. Crystallogr. D Biol. Crystallogr. 2004; 60:2126–2132. [DOI] [PubMed] [Google Scholar]
- 14. Huang S.Y., Zou X.. MDockPP: a hierarchical approach for protein-protein docking and its application to CAPRI rounds 15-19. Proteins. 2010; 78:3096–3103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Tuszynska I., Magnus M., Jonak K., Dawson W., Bujnicki J.M.. NPDock: a web server for protein-nucleic acid docking. Nucleic Acids Res. 2015; 43:W425–W430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Hess B., Kutzner C., van der Spoel D., Lindahl E.. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J. Chem. Theory Comput. 2008; 4:435–447. [DOI] [PubMed] [Google Scholar]
- 17. Zhao L., Sun M.A., Li Z., Bai X., Yu M., Wang M., Liang L., Shao X., Arnovitz S., Wang Q.et al.. The dynamics of DNA methylation fidelity during mouse embryonic stem cell self-renewal and differentiation. Genome Res. 2014; 24:1296–1307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Dozmorov M.G., Mu W., Davis E.S., Lee S., Triche T.J. Jr, Phanstiel D.H., Love M.I. CTCF: an R/bioconductor data package of human and mouse CTCF binding sites. Bioinform. Adv. 2022; 2:vbac097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Quinlan A.R., Hall I.M.. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010; 26:841–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. McLean C.Y., Bristor D., Hiller M., Clarke S.L., Schaar B.T., Lowe C.B., Wenger A.M., Bejerano G.. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 2010; 28:495–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Wang H., Maurano M.T., Qu H., Varley K.E., Gertz J., Pauli F., Lee K., Canfield T., Weaver M., Sandstrom R.et al.. Widespread plasticity in CTCF occupancy linked to DNA methylation. Genome Res. 2012; 22:1680–1688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Carpenter B.L., Remba T.K., Thomas S.L., Madaj Z., Brink L., Tiedemann R.L., Odendaal H.J., Jones P.A.. Oocyte age and preconceptual alcohol use are highly correlated with epigenetic imprinting of a noncoding RNA (nc886). Proc. Natl. Acad. Sci. U.S.A. 2021; 118:e2026580118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Carpenter B.L., Zhou W., Madaj Z., DeWitt A.K., Ross J.P., Gronbaek K., Liang G., Clark S.J., Molloy P.L., Jones P.A.. Mother-child transmission of epigenetic information by tunable polymorphic imprinting. Proc. Natl. Acad. Sci. U.S.A. 2018; 115:E11970–E11977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Treppendahl M.B., Qiu X., Sogaard A., Yang X., Nandrup-Bus C., Hother C., Andersen M.K., Kjeldsen L., Mollgard L., Hellstrom-Lindberg E.et al.. Allelic methylation levels of the noncoding VTRNA2-1 located on chromosome 5q31.1 predict outcome in AML. Blood. 2012; 119:206–216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Coetzee S.G., Coetzee G.A., Hazelett D.J.. motifbreakR: an R/bioconductor package for predicting variant effects at transcription factor binding sites. Bioinformatics. 2015; 31:3847–3849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Renda M., Baglivo I., Burgess-Beusse B., Esposito S., Fattorusso R., Felsenfeld G., Pedone P.V.. Critical DNA binding interactions of the insulator protein CTCF: a small number of zinc fingers mediate strong binding, and a single finger-DNA interaction controls binding at imprinted loci. J. Biol. Chem. 2007; 282:33336–33345. [DOI] [PubMed] [Google Scholar]
- 27. Azizgolshani N., Petersen C.L., Chen Y., Levy J.J., Salas L.A., Perreard L., Nguyen L.N., Christensen B.C.. DNA 5-hydroxymethylcytosine in pediatric central nervous system tumors may impact tumor classification and is a positive prognostic marker. Clin. Epigenetics. 2021; 13:176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Sun Z., Terragni J., Borgaro J.G., Liu Y., Yu L., Guan S., Wang H., Sun D., Cheng X., Zhu Z.et al.. High-resolution enzymatic mapping of genomic 5-hydroxymethylcytosine in mouse embryonic stem cells. Cell Rep. 2013; 3:567–576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Teif V.B., Beshnova D.A., Vainshtein Y., Marth C., Mallm J.P., Hofer T., Rippe K.. Nucleosome repositioning links DNA (de)methylation and differential CTCF binding during stem cell development. Genome Res. 2014; 24:1285–1295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Wiehle L., Thorn G.J., Raddatz G., Clarkson C.T., Rippe K., Lyko F., Breiling A., Teif V.B.. DNA (de)methylation in embryonic stem cells controls CTCF-dependent chromatin boundaries. Genome Res. 2019; 29:750–761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Marina R.J., Sturgill D., Bailly M.A., Thenoz M., Varma G., Prigge M.F., Nanan K.K., Shukla S., Haque N., Oberdoerffer S.. TET-catalyzed oxidation of intragenic 5-methylcytosine regulates CTCF-dependent alternative splicing. EMBO J. 2016; 35:335–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Gabriele M., Brandao H.B., Grosse-Holz S., Jha A., Dailey G.M., Cattoglio C., Hsieh T.S., Mirny L., Zechner C., Hansen A.S.. Dynamics of CTCF- and cohesin-mediated chromatin looping revealed by live-cell imaging. Science. 2022; 376:496–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Conklin E.G. The organization and cell-lineage of the ascidian egg. J. Acad. Nat. Sci. Phila. 1905; 16:1–119. [Google Scholar]
- 34. Sunchu B., Cabernard C.. Principles and mechanisms of asymmetric cell division. Development. 2020; 147:dev167650. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are available from the corresponding author upon request.