Abstract
CXXC5 is a member of the zinc-finger CXXC family proteins that interact with unmodified CpG dinucleotides through a conserved ZF-CXXC domain. CXXC5 is involved in the modulation of gene expressions that lead to alterations in diverse cellular events. However, the underlying mechanism of CXXC5-modulated gene expressions remains unclear. Proteins perform their functions in a network of proteins whose identities and amounts change spatiotemporally in response to various stimuli in a lineage-specific manner. Since CXXC5 lacks an intrinsic transcription regulatory function or enzymatic activity but is a DNA binder, CXXC5 by interacting with proteins could act as a scaffold to establish a chromatin state restrictive or permissive for transcription. To initially address this, we utilized the proximity-dependent biotinylation approach. Proximity interaction partners of CXXC5 include DNA and chromatin modifiers, transcription factors/co-regulators, and RNA processors. Of these, CXXC5 through its CXXC domain interacted with EMD, MAZ, and MeCP2. Furthermore, an interplay between CXXC5 and MeCP2 was critical for a subset of CXXC5 target gene expressions. It appears that CXXC5 may act as a nucleation factor in modulating gene expressions. Providing a prelude for CXXC5 actions, our results could also contribute to a better understanding of CXXC5-mediated cellular processes in physiology and pathophysiology.
Subject terms: Biochemistry, Cancer, Cell biology, Molecular biology
Introduction
The methylation of mammalian genomic DNA, which predominantly arises post-replicatively at the 5’ position of the cytosine base in the context of CpG dinucleotides, varies across cell types, developmental stages, physiological and pathophysiological conditions1. Acting as stable and heritable epigenetic marks, methylated CpGs present in 80% of CpGs in the genome and involve both genic and intergenic regions2. Methylated CpGs are specifically recognized and bound by methyl-CpG-binding proteins, which in turn generate a chromatin environment refractory to transcription by recruiting a variety of chromatin modifiers3,4.
Although the majority of CpGs are methylated, about 70% of annotated human gene promoters are associated with unmethylated DNA stretches called CpG islands (CGIs), which are rich in C and G nucleotides with a high density of CpG dinucleotides5,6. The structurally and functionally distinct zinc-finger (ZF)-CXXC family proteins interact with unmodified CpG dinucleotides through a highly conserved ZF-CXXC domain characterized by two consecutive cysteine-rich motifs (CXXCXXC) that associate with two Zn++ ions forming zinc-finger structures. Upon binding to unmethylated CpG motifs with varying affinities and specifies within the context of surrounding base composition7, the ZF-CXXC family proteins establish a chromatin architecture directly through chromatin-modifying enzymatic activities and/or indirectly through the recruitment of chromatin-modifiers to modulate gene expressions4,8,9.
CXXC5, also known as RINF (Retinoid-Inducible Nuclear Factor) and WID (WT1-Induced Inhibitor of Dishevelled), is a member of the ZF-CXXC family. CXXC5 is located on chromosome 5q31.2 and encodes a 322 amino-acid protein with a predicted molecular mass of 33 kDa8,10. Retinoic acid11, transforming growth factor-β12, bone morphogenetic protein 413,14, Wnt3a15–17, and estrogen18–21 modulate the expression of CXXC5 in experimental systems. The encoded CXXC5 protein alters gene expressions13,17,21–27 that result in the modulation of diverse cellular events, including signal transduction, DNA damage response, metabolism, proliferation, differentiation, and death11–13,15,21,22,24,25,27–30.
However, the underlying mechanism by which CXXC5 regulates gene expressions remains unclear. We recently showed that CXXC5 is an unmethylated CpG binder but lacks an intrinsic transcription regulatory function21. This raises the possibility that CXXC5 acts as a nucleation factor to establish a transcription state restrictive or permissive for transcription by interacting with transcription factors, transcription co-regulatory proteins, histone, and/or DNA modifiers. Consistent with this prediction, CXXC5 appears to be involved in epigenetic alterations, including CpG methylations and histone modifications by recruiting TET2 (the ten-eleven translocation methylcytosine dioxygenase 2) at a subset of CGI promoters that results in the modulation of gene expression in plasmacytoid dendritic cells23. Similarly, CXXC5 was recently shown to influence the establishment and maintenance of DNA methylation22 as well as hydroxymethylation in embryonic stem cells22 and immature erythroid cells27. It appears that CXXC5 regulates the transcription of TET1 and TET2 and interacts with TET1 and TET2 to modulate the expression of pluripotency genes, including Nanog and Oct4 (Octamer-Binding Protein 4)22. Nanog and Oct4, in turn, cooperate with CXXC5 as the protein partner at a subset of gene promoters and enhancers to modulate the expression of genes involved in the facilitation of differentiation22. Likewise, it was recently reported that the enhanced synthesis and interaction of CXXC5 with TET2 could pose challenges for the treatment of castration-resistant prostate cancer by increasing the accessibility of non-canonical DNA binding sites for androgen receptor31. Moreover, the interaction of CXXC5 with TET2 could affect the stability of TET232. It was also reported that CXXC5 inhibits the expression of CD40L (CD40 ligand gene) by interacting with histone-lysine N-methyltransferase SUV39H1 (KMT1A) at the gene promoter26. Furthermore, CXXC5 was reported to interact with FOXL2 (Forkhead Box L2)33, RBPJ (Recombination Signal Binding Protein For Immunoglobulin Kappa J Region)24, Sall4 (Spalt Like Transcription Factor 4)34,35, members of SMAD (Mothers against decapentaplegic homolog) family29,36, and VDR (Vitamin D receptor)30 to modulate gene expressions. Additionally, CXXC5 was shown to serve as a negative feedback regulator of the Wnt/β-catenin signaling pathway by interacting with the Dvl (Dishevelled) protein in the cytoplasm of dermal fibroblast14–16,37.
Since proteins perform their functions in a network of proteins whose identities and amounts change temporally and spatially in response to intrinsic and extrinsic stimuli in a lineage-specific manner, the identification of protein partners of CXXC5 would critically contribute to a better understanding of the mechanisms of CXXC5-mediated cellular processes in physiology and pathophysiology. Following its introduction, the proximity-dependent biotinylation approach (BioID) has been effectively used for the identification of interacting partners of many proteins38,39. BioID is based on the genetic fusion of a mutant E. coli biotin ligase enzyme, BirA*(R118G), which is defective in both self-association and DNA binding, to a protein-of-interest to biotinylate proximity proteins40,41. Biotinylated proteins are then selectively isolated with biotin-affinity capture and identified with mass spectrometry (MS). Using the BioID-MS approach, we found that CXXC5 interacts with a large number of proteins mainly grouped in the regulation of gene expression which further clustered into proteins involved in DNA, chromatin, and RNA modifications. Of the proteins, we selectively verified that CXXC5 through its CXXC domain interacts with EMD (Emerin), MAZ (MYC Associated Zinc Finger Protein), and MeCP2 (Methyl-CpG Binding Protein 2). We also found that an interplay between CXXC5 and MeCP2 contributes to the expression of some of the CXXC5 target genes. Since CXXC5 lacks an intrinsic transcription regulatory function and enzymatic activity but is an unmethylated CpG binder, our results, together with others, imply that CXXC5 may act as a nucleation factor/molecular scaffold for gene expressions.
Results
Expression of the CXXC5-BirA* fusion protein in MCF7 cells
Although the underlying mechanism(s) is unclear, CXXC5 as a CpG dinucleotide binder is involved in gene expressions. Since the dynamically changing network of protein environment is a critical determinant for proteins to perform their functions in a cell- and signaling pathway-dependent manner, the identification of interacting partners of CXXC5 could provide critical information about the mechanisms of target gene expressions. To begin to address this issue, we utilized the BioID-MS approach. To generate the protein components of BioID, we genetically fused the 3xFlag-CXXC5 (3F-CXXC5) cDNA to the 5’ end of sequences encoding the BirA*-HA cDNA present in the pcDNA expression vector, pcDNA3.1-BirA*(R118G)-HA (BirA*-HA). To ensure that the genetic fusion of 3F-CXXC5 to BirA*-HA does not affect the synthesis, the intracellular localization, and the biotinylation ability of the CXXC5-BirA* fusion protein, we initially carried out immunocytochemistry (ICC) and western blot (WB) analyses in transiently transfected MCF7 cells derived from a breast adenocarcinoma. The expression vector bearing the BirA*-HA, 3F-CXXC5, or 3F-CXXC5-BirA*-HA cDNA was transiently transfected into MCF7 cells for 24 h. Cells were then treated without or with 50 μM biotin and 1 mM ATP for 16 h followed by ICC (Fig. 1a) and WB (Fig. 1b & Supplementary Information Fig. S1) using an antibody specific for the Flag, HA, or biotin. Results revealed that BirA*-HA displaying an expected molecular mass (MM) of 33 kDa was primarily present in the cytoplasm; whereas, 3F-CXXC5 with about 37 kDa MM in the absence or presence of exogenously added biotin was localized in the nucleus, as we showed previously19. Similarly, 3F-CXXC5-BirA*-HA with a predicted MM of 70 kDa was localized in the nucleus independently of the exogenously added biotin. Importantly, the detection of many biotinylated proteins only in the presence of biotin in transfected cells synthesizing 3F-CXXC5-BirA*-HA assessed with WB indicates that the fusion protein is functional as well.
CXXC5 associated proteins in MCF7 cells
Based on the synthesis and intracellular location of the functional 3F-CXXC5-BirA*-HA fusion protein, we then carried out BioID assays in MCF7 cells. Cells were transiently transfected with the expression vector bearing none (EV), the BirA*-HA, or the Flag-CXXC5-BirA*-HA cDNA for 24 h. Cells were then treated in the absence or presence of 50 μM biotin and 1 mM ATP for 16 h. Biotinylated proteins in cell lysates were captured with streptavidin-conjugated magnetic beads. Protein fragments following on-bead tryptic proteolysis of the captured proteins were subjected to mass spectrometry (MS). Subtractive analyses of identified proteins from EV, BirA*-HA, and 3F-CXXC5-BirA*-HA synthesizing cells as two biological replicates with two technical repeats revealed 108 proximal interactors of CXXC5 (Supplementary Information, Table S1). It should be noted that none of the proteins we identified is coincident with reported protein partners of CXXC5 except for TNRC1842 which was present in one of the biological replicates of our BioID experiments. Differences in protein profiles are likely due to dynamically changing protein abundances and protein complex compositions in distinct cell types. It is also possible that the genetic fusion of CXXC5 to BirA* altered the native conformational features of the protein, thereby modifying the in cellula protein interaction profile of the protein.
Gene ontology analyses for biological functions using the DAVID bioinformatics tool43 suggest that CXXC5 interacts with proteins largely grouped in the regulation of gene expression, which can further be sub-grouped into proteins involved in DNA, chromatin, and RNA processes (Supplementary Information, Fig. S2). Proteins identified as transcription factors includes ADNP, AP2A (TFAP2A), BCLAF1, CTCF, CUX1, DIDO1, ELF1, EMSY (C11orf30), GRHL2, LIN54, MAZ, NFIB, NFIX, NR2C2, RFX1, RREB1, SCML2, TRPS1, ZNF148, and ZNF638. Transcription co-regulatory proteins comprise CCAR2, HCFC1, LYRIC, MKL2, SNW1, SP110, TCF20, and TIF1B (TRIM28). DNA and Chromatin modifiers include BAZ1A, CHD4, CHD8, COR1B, HMGX4, KDM2A (CXXC8), KMT2A (CXXC7), KMT2B (CXXC10), GATAD2A, GATAD2B, MeCP2, NSD2, RUVB1, SMCA5, SMHD1, TOP2A, TOP2B, TOX4, and XRCC6. The group of proteins involved in RNA processing, binding, and transport encompasses CPSF6, DDX21, DDX5, DHX9, DKC1, HNRPR, NAT10, NONO, NUCL (NCL), PAIRB, ROA2, SF3B2, SRRM1, TADBP, and THOC4. Also, the proximity interaction partners of CXXC5 include architectural proteins Emerin (EMD) as well as LAP2α and LAP2β, both of which are splice variants encoded by TMPO.
Interaction of CXXC5 with EMD and MAZ
The initial validation of interactions between the putative interactors with the endogenous CXXC5 by the use of immunoprecipitation in MCF7 cells proved to be difficult. This was due to low levels of endogenous CXXC5 synthesis and the efficiency of the available antibodies from different resources for the immunoprecipitation of CXXC5 (Supplementary Information, Fig. S3). To circumvent these problems, we used the Human Embryonic Kidney 293 (HEK293) cells that exhibit high transfection efficiency44. Initial screening of some of the CXXC5 proximity interacting partners identified with BioID by the use of transient transfections followed by co-immunoprecipitation (Co-IP) in HEK293 cells revealed that CXXC5 could interact, for example, with EMD, MAZ, and MeCP2 proteins but not with LAP2α (Thymopoietin; Lamina-Associated Polypeptide 2, Isoform alpha), RUVBL1 (RuvB Like AAA ATPase 1) or SNW1 (SNW Domain Containing 1) protein (Supplementary Information, Fig. S4). Based on these observations, we selected EMD, MAZ, and MeCP2 to verify that they are indeed interacting protein partners of CXXC5.
EMD, a member of the nuclear lamina-associated protein family45, is a serine-rich inner nuclear membrane protein with a predicted molecular mass (MM) of 28.9 kDa and is involved in the organization of chromatin structure, nuclear assembly, and gene expressions46. To validate the interaction between CXXC5 and EMD, we transiently transfected HEK293 cells with the expression vector bearing 3F-CXXC5 and/or HA-EMD cDNA for 48 h (Fig. 2). We also transiently transfected cells with expression vectors bearing cDNAs with converse tag sequences: 3F-EMD and/or HA-CXXC5 cDNA to ensure that the nature of tags does not alter the intracellular localization and/or interactions of the putative protein partners (data not shown). 3F-CXXC5 is primarily localized to the nucleus, whereas HA-EMD shows a nuclear membrane/periphery staining that partially overlaps with the staining of 3F-CXXC5 as well (Fig. 2a & Inset). HA-EMD or 3F-EMD, in transfected cells, shows three distinct protein species with discrete electrophoretic migration ranging from 35 to 55 kDa, in contrast to 3F-CXXC5 or HA-CXXC5, both of which display a single electrophoretic species with an MM of about 37 kDa (Fig. 2b,c). Although protein species of EMD were not studied further, they likely represent isoforms with differentially processed post-translational modifications47–49. Immunoprecipitation of nuclear extracts of transiently transfected cells that co-synthesize 3F-CXXC5 and HA-EMD using the HA antibody (Fig. 2d,e; Supplementary Information, Fig. S5) together with protein A and G magnetic beads followed by immunoblotting with the Flag (Fig. 2d) or the HA (Fig. 2e) antibody indicates the presence of both EMD and CXXC5 in the immunoprecipitants. This demonstrates that CXXC5 and EMD interact. Interestingly, CXXC5 displayed an interaction only with the 35 kDa EMD species (Fig. 2d,e).
MAZ is a transcription factor with C2H2-type zinc finger motifs that can bind GC-rich promoters of target genes to control transcriptional processes50. We had carried out the initial screening of the CXXC5 interaction with MAZ using a MAZ cDNA that encodes an amino-terminally truncated variant with an estimated MM of 28 kDa (MAZΔN), whereas the full-length MAZ is about 51 kDa (Fig. 3a). We observed in transiently transfected HEK293 cells that HA-MAZ and HA-MAZΔN display electrophoretic mobility of 55 and 33 kDa, respectively (Fig. 3b). In transiently co-transfected HEK293 cells, the nuclearly localized (Fig. 3c) and co-synthesized 3F-CXXC5 and HA-MAZ (Fig. 3d) or HA-MAZΔN (Fig. 3e) showed interactions, as 3F-CXXC5 was immunoprecipitated with the HA antibody (Fig. 3f) in the co-presence of HA-MAZ or HA-MAZΔN (Fig. 3g). These results indicate that MAZ, through the carboxyl-terminus, interacts with CXXC5.
Interaction of CXXC5 with MeCP2
As a member of the methyl-CpG binding protein family (MBP) with a conserved methyl-cytosine binding domain (MBD), MeCP2 with a predicted MM of 53 kDa binds to methylated CpG dinucleotides and unmethylated DNA3 in contrast to CXXC5 which preferentially interacts with unmethylated CpG dinucleotide containing DNA7,21. The binding of MBPs to DNA as DNA methylation readers alters chromatin structure by recruiting chromatin remodelers and histone modifiers to modulate gene expressions3.
To examine the interaction of CXXC5 with MeCP2, HEK293 cells were transiently transfected with an expression vector bearing the 3F-CXXC5, or HA-CXXC5, and/or HA-MeCP2, or 3F-MeCP2, cDNA for 48 h. In HEK293 cells, HA-MeCP2, as 3F-CXXC5, localizes to the nucleus (Fig. 4a). Immunoblotting of nuclear extracts of transiently transfected HEK293 cells revealed that HA-MeCP2 or 3F-MeCP2 primarily displays electrophoretic mobility of about 80 kDa, as shown previously51, when synthesized alone or together with 3F-CXXC5 or HA-CXXC5 (Fig. 4b,c). The apparent higher MM than the estimated MM of MeCP2 is likely due to different post-translational modifications of the protein52. Immunoprecipitation of 3F-CXXC5 or 3F-MeCP2 with the HA antibody (Fig. 4d) in the presence of HA-MeCP2 or HA-CXXC5 in immunoprecipitants, respectively, (Fig. 4e) suggests that CXXC5 and MeCP2 are interacting partners.
The interaction between the unmethylated CpG dinucleotide binder CXXC5 and the DNA methylation reader and histone modifier MeCP2 is perplexing and enticed us to further explore the feature of this interaction. To extend the verification that CXXC5 and MeCP2 are interacting partners, we carried out the proximity ligation assay (PLA). PLA used here utilizes species-specific secondary antibodies conjugated with distinct DNA primers. A hybridization step followed by circular DNA amplification with fluorescent probes to the conjugated DNA primers allows the visualization of proximity spots by fluorescence microscopy53. In transiently transfected HEK293 cells synthesizing HA-MeCP2 and/or 3F-CXXC5, the HA or the Flag antibody alone showed virtually no fluorescence signal in cells whereas prominent nuclear fluorescence signals were detectable when cells were probed with both antibodies (Fig. 5a, Supplementary Information Fig. S6). This observation reinforces the conclusion that CXXC5 and MeCP2 are interacting partners.
Moreover, ChIP of cell extracts from HEK293 cells transiently transfected with 3F-CXXC5 and HA-MeCP2 with the Flag antibody followed by immunoblotting using the HA antibody and re-probing with the Flag antibody suggests that CXXC5 and MeCP2 are co-present on chromatin as well (Fig. 5b).
Assessing sub-regions critical for CXXC5-MeCP2 interactions
Based on these results, we wanted to explore a sub-region(s) of CXXC5 critical for the interaction with MeCP2. We generated cDNAs encoding amino-terminally, carboxyl-terminally, or internally truncated CXXC5 proteins. Since CXXC5 localizes to the nucleus through a nuclear localization signal present at the immediate amino-terminus of the CXXC domain28, to ensure that truncated CXXC5 variants lacking the CXXC domain also localize to the nucleus by inserting an exogenous nuclear localization signal (eNLS) derived from the SV40 T antigen54 between the Flag epitope and a truncated CXXC5 variant, namely 3F-eCXXC5Δ250-322 and 3F-eCXXC5Δ1-100&250-322 (Fig. 6a). To examine the synthesis and intracellular location of CXXC5 variants, we performed WB from and ICC in transiently transfected HEK293 cells with the use of the Flag antibody. Results revealed that CXXC5 variants were synthesized at expected MMs (Fig. 6b) and were localized primarily to the nucleus (Fig. 6c). To assess a sub-region(s) of CXXC5 critical for MeCP2 interaction, HEK293 cells transiently co-transfected with the expression vector bearing a 3F-CXXC5 variant and the HA-MeCP2 cDNA for 48 h were subjected to Co-IP. The nuclear extracts were immunoprecipitated with the HA antibody and immunoblotted with the Flag antibody followed by re-probing with the HA antibody (Fig. 6d). Results showed that 3F-eCXXC5Δ250-322 with the deleted CXXC domain was not detectable in the precipitants. Similarly, 3F-eCXXC5Δ1-100 & 250–322 lacking the first 100 amino acids at the amino-terminus together with the deleted CXXC domain did not show an interaction in either immunoprecipitants or in cells as assessed with PLA (Supplementary Information Fig. S6). On the other hand, the remaining of the CXXC5 variants, including the CXXC domain alone, 3F-CXXC, were detectable with the Flag antibody in the HA, but not in the IgG, precipitated lysate. The sequential residues at the domain (threonine, glycine, histidine, and glutamine amino acids, TGHQ) are critical for the binding of the CXXC domain to DNA23,32,55. To assess whether DNA binding defective CXXC5 interacts with MeCP2, we generated full-length 3F-CXXC5DBM and 3F-CXXCDBM mutants by converting the TGHQ sequence to AAAA. 3F-CXXCDBM as the full-length 3F-CXXC5DBM retained the interaction with MeCP2. These results collectively suggest that the carboxyl-terminal CXXC domain of CXXC5 is the required region to interact with MeCP2 independently of its ability of binding to DNA.
It should be noted that although the CXXC domain is the required region of CXXC5 for protein interactions, the central region of CXXC5, aa 101–150, could contribute to the stability/affinity of protein interactions. We observed that the 3F-CXXC5Δ100-149 variant, similar to the CXXC domain alone (3F-CXXC), consistently showed a lesser degree of interaction with MeCP2 compared to, for example, 3F-CXXC5, 3F-CXXC5Δ150-199, 3F-CXXC5Δ200-249, or 3F-CXXC5Δ1-100.
To examine whether we could also locate a region of MeCP2 involved in the interaction with CXXC5, we generated MeCP2 variants. An initial screening of MeCP2 variants suggested that the carboxyl-terminus of MeCP2 is involved in the interaction with CXXC5. A previous detailed study indicated that the carboxyl-terminus of MeCP2 is required for the interaction with FBP11, the protein product of PRPF40A (Pre-mRNA Processing Factor 40 Homolog A) and HYPC, the protein product of PRPF40B (Pre-mRNA Processing Factor 40 Homolog B)56; as the truncation of the carboxyl-terminal 86 amino acids containing the WW protein interaction domain of MeCP2, which also results from a frameshift mutation present in a group of Rett syndrome patients, abrogates interactions with FBP11 and HYPC. In transiently transfected HEK293 cells, the nuclearly localized (data not shown) the HA, or the Flag (data not shown), tagged MeCP2 variant lacking 86 amino acids from the carboxyl terminus (MeCP21-400) (Fig. 7a) displays a MM of about 55 KDa (Δ) compared to the full-length (FL) HA-MeCP2 which exhibits an apparent MM of 80 kDa (Fig. 7b). Immunoprecipitation of HA-MeCP21-400, or Flag-MeCP21-400, with the HA antibody from nuclear extracts of HEK293 cells co-synthesizing 3F-CXXC5, or HA-CXXC5, revealed that the interaction of CXXC5 with MeCP21-400 decreases dramatically compared to HA-MeCP2 or Flag-MeCP2 (Fig. 7c,d; Supplementary Information Fig. S7c,d). These results suggest that the carboxyl-terminus of MeCP2 is a critical region for interaction with CXXC5 as well. Moreover, the CXXC domain of CXXC5, 3F-CXXC, appears to be sufficient for the interaction of CXXC5 with HA-MeCP2 but not HA-MeCP21-400, as the HA antibody specifically immunoprecipitated 3F-CXXC from nuclear extracts of HEK293 cells co-synthesizing HA-MeCP2 (Fig. 7e,f; Supplementary Information Fig. S7e,f).
Similarly, the immunoprecipitation of 3F-CXXC with HA antibody from cellular extracts of HEK293 cells co-synthesizing HA-EMD or HA-MAZ (Supplementary Information Fig. S8a,b) further suggests that the CXXC domain of CXXC5 is also a critical region for the interaction with EMD or MAZ.
These observations collectively suggest that the CXXC domain as the DNA binding module of CXXC5 also participates in protein interactions, supporting previous findings that CXXC5, through the CXXC domain, interacts with Dvl157, HDAC1 (Histone deacetylase 1)12, and functionally associates with ATM (ATM Serine/Threonine Kinase)28.
Assessing the possible interplay between CXXC5 and MeCP2 in gene expressions
Our findings that CXXC5 and MeCP2 are co-present on chromatin imply an interplay between these proteins that could modulate gene expressions. We previously reported that CXXC5 is involved in the expression of HDAC11 (Histone deacetylase 11), NFKBIZ (NF-kappa-B inhibitor zeta), or IL12A (Interleukin-12 subunit alpha) in MCF7 cells21 as similarly reported for the IL12A gene in plasmacytoid dendritic cells23. To explore whether the alterations in the extent of gene expression is due to the engagement of CXXC5 with the promoter region of HDAC11, NFKBIZ, or IL12A, we initially carried out bioinformatics analyses using the Eukaryotic Promoter Database58,59, which contains resources of eukaryotic RNA polymerase II promoters with experimentally defined transcription start sites, to locate the promoter region of HDAC11, NFKBIZ or IL12A. Results indicate that the promoter of HDAC11 or NFKBIZ, as reported previously60,61, or IL12A is located within a CpG island (Supplementary Information Fig. S9). We also performed bioinformatics analyses using the Cistrome Data Browser which utilizes publicly available ChIP-chip and ChIP-seq datasets for genome-wide locations of transcription factor binding from various biological resources62,63, to assess whether CXXC5 is enriched at the promoter region of HDAC11, NFKBIZ, or IL12A. We found no CXXC5 dataset generated with the use of human resources at the Cistrome Data Browser. However, our analysis of a recent ChIP-Seq dataset carried out with the use of mouse embryonic stem cells22 suggests, at least in one of two replicates of ChIP-seq results, that CXXC5 may interact with the promoter region of IL12A. Although there is no MeCP2 ChIP-Seq dataset generated with the use of MCF7 cells, analyses with datasets from IMR-90, a human lung fibroblast cell line, and HCT-166 cells derived from human colon carcinoma revealed that MeCP2 could associate with the promoter region of HDAC11, NFKBIZ, or IL12A (Supplementary Information Fig. S9).
Based on these analyses, we assessed the possible association of CXXC5 or MeCP2 with the promoter region of HDAC11, NFKBIZ, or IL12A in MCF7 cells. We also used as a negative control Exon10 of CXXC5, which is highly methylated64 and to which MeCP2 does not show binding as assessed with the Cistrome Data Browser (data not shown). Cells transiently transfected with the expression vector bearing the 3F-CXXC5 or HA-MeCP2 cDNA for 48 h were subjected to ChIP using the Flag or HA antibody followed by qPCR using primers specific to the promoter region of HDAC11, NFKBIZ, IL12A, or Exon10 of CXXC5. ChIP-qPCR results indicated that CXXC5 or MeCP2 associates with the promoter region of HDAC11, NFKBIZ, or IL12A but not with exon10 of CXXC5 (Supplementary Information Fig. S10a).
Our analyses for the simultaneous presence of CXXC5 and MeCP2 with the promoters of genes we tested by sequential-ChIP from cells co-synthesizing 3F-CXXC5 and HA-MeCP2 were inconclusive. However, ChIP-qPCRs with the use of equal aliquots of extracts from cells synthesizing 3F-CXXC5 and HA-MeCP2 with the Flag or the HA antibody suggested the presence of CXXC5 and MeCP2 on the promoter region of HDAC11, NFKBIZ, or IL12A (data not shown), as we similarly observed in cells transfected with 3F-CXXC5 or HA-MeCP2 alone (Supplementary Information Fig. S10a).
These results imply that an interplay between CXXC5 and MeCP2 could contribute to the expression of HDAC11, NFKBIZ, or IL12A. To test this possibility, we examined the alterations in the expression of HDAC11, NFKBIZ, or IL12A in MCF7 cells transiently transfected with a control siRNA (CtS), a siRNA specific to CXXC5 (#10), and/or a siRNA pool specific to MeCP2 (Me-siR). Results revealed that CtS did not affect the transcript or protein levels of CXXC5 or MeCP2. #10 specifically repressed the transcript and protein levels of CXXC5 without affecting those of MeCP2 (Fig. 8a,b; Supplementary Information, Fig. S11). Re-probing the membrane with an antibody specific to MeCP2, which also immunoprecipitates endogenous MeCP2 in MCF7 cells (Supplementary Information, Fig. S12), indicated that Me-siR specifically suppressed the transcript and protein levels of MeCP2. Co-transfection of #10 and Me-siR led to suppression of the transcript and protein levels of both CXXC5 and MeCP2 (Fig. 8a,b; Supplementary Information, Fig. S11).
Repression of CXXC5 and MeCP2 protein levels resulted in the alteration of HDAC11, NFKBIZ, or IL12A expression. Transfection of cells with #10 alone attenuated the expression of HDAC11 and IL12 but augmented the NFKBIZ expression. Me-siR alone, on the other hand, did not affect the expression of HDAC11, IL12A, or NFKBIZ. But when co-transfected, Me-siR counteracted the repressive effect of #10 on HDAC11 expression, prevented and further augmented the expression of IL12A without affecting the transcript levels of NFKBIZ enhanced by #10 (Fig. 8c). To assess whether modulations in gene expressions were due to changes in the extent of MeCP2 interactions with the promoter region of HDAC11, NFKBIZ, or IL12A, we carried out ChIP-qPCR from MCF7 cells transfected with CtS or siRNA#10 for 48 h. Cells were then subjected to ChIP using the MeCP2 antibody or an isotype-matched IgG. Results, depicted as fold changes compared to CtS following normalization to IgG, revealed that the association of MeCP2 with the promoter regions of HDAC11 and IL12A increases but does not change at the NFKBIZ promoter (Fig. 8d) or Exon10 of CXXC5 as the control (Supplementary Information, Fig. S10b).
These results collectively suggest that an interplay between CXXC5 and MeCP2 at promoter regions contributes to the modulation of a subset of CXXC5 target gene expressions. We also observed that the methylation state of the promoter region of HDAC11, NFKBIZ, or IL12A is not affected by the reduction of CXXC5 levels (Supplementary Information Fig. S13b). This implies that modulation of a subset of gene expressions by the integrated effects of CXXC5 and MeCP2 may not require changes in DNA methylation.
Correlation analysis between mRNA expressions of CXXC5 and MeCP2 in breast cancer patients
To assess whether there is a correlation between the CXXC5 and MeCP2 expressions in clinical settings particularly in breast cancer patients that could highlight the importance of our observations, we used the GEPIA (Gene Expression Profiling Interactive Analysis) webserver for the expression analyses of CXXC5 and MeCP2 based on paired normal tissue and tumor samples from the TCGA and healthy tissue samples from the GTEx databases65. We found that the mean gene expression profile of CXXC5 is higher in breast tumor samples compared to mammary tissue together with paired normal breast tissue samples (Supplementary Information Fig. S14a), while the mean expression of the MeCP2 gene does not show significant variations between breast tumor and paired normal tissue samples (Supplementary Information Fig. S14b). We also found that although CXXC5 expression is significantly deregulated, there is no strong correlation between the CXXC5 and MeCP2 expressions in healthy mammary tissue in paired normal breast samples and breast tumors (Supplementary Information Fig. S14c–e), neither in many other tissue tumors including acute myeloid leukemia, brain lower grade glioma, glioblastoma multiforme, sarcoma, prostate adenocarcinoma, and testicular germ cell carcinoma (Supplementary Information, Fig. S15).
Discussion
Our findings indicate that the proximity interaction partners of CXXC5 identified with BioID in MCF7 cells encompass proteins involved in DNA structural changes, DNA modifications, chromatin remodeling, chromatin/histone modifications, and RNA processing as well as transcription factors and transcription co-regulatory proteins. Since CXXC5 lacks an intrinsic enzymatic activity or a transcription regulatory function but is a preferential unmethylated CpG dinucleotide binder through its CXXC domain4,21,66,67, our results together with the previous studies12,22–24,26,27,29–31,33–36 suggest that CXXC5 could act as a molecular scaffold for the regulation of gene expressions leading to cellular proliferation, differentiation, and death in a cell context-dependent manner11–13,15,21,22,24,28–30. This prediction presupposes the binding of CXXC5 to unmethylated DNA. However, it is also likely that CXXC5 without interacting with DNA associates directly or indirectly as a part of a protein complex with various transcription factors and/or DNA/chromatin binders to modulate gene expressions. Previous studies12,37, as we find here, also indicate that the DNA binding feature of the CXXC domain is independent of the ability to interact with protein partners, as the DNA binding CXXC mutants retain interactions with Dvl137 or HDAC112. These findings indicate bi-functionality for the CXXC domain of CXXC5: DNA and protein interactions. Moreover, the interaction of CXXC5 with KMT1A (SUV39H1) was reported to occur through a central region (aa 101–200) of CXXC5. This, together with our observation that the efficiency of interactions of the CXXC domain with protein partners could be modulated by a centrally located region (aa 100–150) of CXXC5, suggests that in addition to the CXXC domain, CXXC5 may contain distinct protein interaction regions and/or surfaces emerging from inter/intra-molecular allosteric interactions that could control associations or interaction affinities with protein partners. This, in turn, implies that dynamic conformational fluctuations of CXXC5 with a highly disordered amino-terminus region21 as results of DNA binding, protein–protein interactions, and/or post-translational modifications in a cellular environment are critical for diverse functional features of the protein in a signal- and cell type-specific manner.
Nuclear lamina (NL) is an interwoven structure composed of lamins and lamin-associated proteins of the inner nuclear membrane (INM). NL and proteins resident in the INM form a dynamic network that regulates chromatin organization, cell cycle regulation, DNA replication, DNA repair, cell differentiation, and death68,69. The NL is composed of Lamin A and C, which are the two major splice variants of a single gene (LMNA), as well as Lamin B1 and B2 encoded by LMNB1 and LMNB2, respectively68,69. Lamins interact with chromatin either directly or indirectly through chromatin-binding proteins. Lamin B1/B2 interacts with LBR (Lamin B Receptor) and HP1 (heterochromatin protein 1) associated with heterochromatin. On the other hand, Lamin A/C interacts with members of the LEM-Domain family (LEMD) proteins, including LAP2A/B and EMD through a nucleoplasmic adaptor protein BANF1 (barrier-to-autointegration factor, BAF), which binds to histones and DNA associated with both heterochromatin and euchromatin. We observed here that the proximity interaction partners of CXXC5 include EMD and LAP2A. We found that CXXC5 interacts with EMD but not with LAP2A, which resides in the nucleoplasm rather than INM due to the lack of a membrane-spanning domain present in INM proteins. Moreover, we observed that CXXC5 retains interactions with EMD lacking the LEM domain required for BANF1 binding consequently chromatin interactions (Supplementary Information, Fig. S16). It is therefore plausible that CXXC5 as an unmethylated CpG binder could anchor DNA to the INM through interactions with EMD, thereby providing a local chromatin environment critical for the modulation of target gene expressions in a nuclear activity-dependent manner.
Similarly, the interaction of CXXC5 with MAZ, a transcription factor with six Cys2His2-type zinc finger motifs at the carboxyl-terminus that binds to the permutation of the canonical GGGAGGG DNA sequence, could be critical for reciprocal recruitment, or assisted loading, to regulatory sites of target genes. This could result in coordinated recruitment of coregulatory proteins, including epigenetic factors, exemplified with nucleosome-remodeling factor (NURF) subunit BPTF as the proximity interaction partner of CXXC5 (Supplementary Information, Table S1) and the interactor of MAZ70 for the regulation of target gene expressions.
DNA methylation is one of the mechanisms of gene silencing and it mostly occurs in CpG dinucleotides of the genome. The effect of DNA methylation on gene expression could be refractory and/or permissive, depending on the genomic region. DNA methylation at promoters represses transcription of genes, while DNA methylation of intra/intergenic regions with different degrees of CpG density appears to correlate with gene expression71. DNA methylation is intimately associated with histone modifications, leading to integrated epigenetic processes for the alteration of chromatin architecture and the subsequent modulation of gene expressions. Methylated DNA is specifically recognized by MBPs, which belong to three distinct structural families: the Methyl-binding domain (MBD), the Methyl-CpG binding zinc fingers, and the SRA (SET- and RING-associated) domain proteins72. As a member of the MBD family, MeCP2 functions as a genome-wide transcriptional modulator. Mutations in the X-linked MeCP2 gene lead to a severe neurodevelopmental disorder, Rett syndrome73. MeCP2 readily binds both methylated and unmethylated DNA. MeCP2 through the MBD domain interacts with methylated/hemimethylated CG dinucleotides as well as methylated/hemimethylated CAC tri-nucleotides. MeCP2 also interacts with methylated cytosines in the non-CG context (mCH, where H = A, C, or T) as well as nucleosomes74–82. The MBD domain of MeCP2 also binds to unmethylated 5'-CAC/GTG-3' motif-containing DNA with an affinity comparable to methylated DNA74. In addition, the ID, TRD, and CTD domains of MeCP2 exhibit methylation-independent DNA binding capabilities83, and AT-hook-like domains within the ID, TRD, and CTD alpha domains of MeCP2 bind to the minor groove of AT-rich DNA84.
These methylation-dependent and independent DNA binding capabilities, together with the ability of MeCP2 to bind to DNA cooperatively and to induce DNA bridging and looping83,85,86, are suggested to allow MeCP2 to interact with different sites on DNA simultaneously, thereby contributing to genome-wide chromatin organization52,87. Upon binding to DNA/histones at gene regulatory regions, MeCP2 could directly hinder the binding of transcription factors to cognate response elements or indirectly through sequential and ordered recruitment of distinct members of the chromatin remodeling complexes to generate a chromatin state refractory for transcription. Moreover, MeCP2 suppresses transcription by binding to methylated cytosine within transcribed regions of gene bodies thereby impeding transcriptional elongation88. Besides transcription attenuation/repression, MeCP2 also functions as an activator/enhancer of gene expressions by engaging transcriptionally active promoters81 and recruiting coactivator proteins, including CREB1 (CAMP responsive element binding protein 1)82 and MYCN (MYCN Proto-Oncogene, BHLH Transcription Factor)89.
We found here that CXXC5 interacts with MeCP2, and the interaction involves the carboxyl-terminal CXXC domain of CXXC5 independently of its ability to bind to DNA and the carboxyl-terminus of the WW domain (CTD β domain) of MeCP2. We observed that CXXC5 and MeCP2 are associated with the promoter region, located within a CGI, of HDAC11, IL12A, or NFKBIZ gene. Furthermore, we observed that an interplay between CXXC5 and MeCP2 at HDAC11, IL12A, and NFKBIZ promoter regions is critical for the magnitude of gene expressions independently of DNA methylation. These observations imply that the integrated effects of CXXC5 and MeCP2 are important for the transcriptional output of some of the CXXC5 target genes. However, underlying mechanistic features of these integrated effects remain, at this juncture, speculative. The interaction of the DNA-bound CXXC5 with MeCP2 may lead to the recruitment of chromatin remodelers and/or co-regulatory proteins to fine-tune, augment or repress, the expression of a subset of target genes. One anticipated result would then be a decrease in the association of CXXC5 with DNA when CXXC5 is knockdown should reduce the extent of DNA association of MeCP2. In contrast, we observed an increased presence of MeCP2 at the promoter regions of HDAC11 and IL12A but not at the NFKBIZ gene promoter. This suggests that the binding of CXXC5 to DNA restricts the DNA association of MeCP2 at HDAC11 and IL12A promoters independent of CXXC5 interactions, thereby increasing the repressive potential of MeCP2 for transcription when the CXXC5 levels are reduced. It is, therefore, possible that the DNA-bound CXXC5 independently of MeCP2 is involved in the magnitude of the gene expression through interactions with various chromatin modelers and/or transcription co-regulators or preventing the binding of various transcription factors to cognate response elements at the target gene promoters to establish a transcription state. It is also possible that the recruitment of MeCP2 by the DNA-associated CXXC5 or the interaction of CXXC5 with the DNA/histone-bound MeCP2 constrains the modulatory effects of MeCP2 for gene expressions by sterically blocking the ability of either interaction partner to recruit co-regulatory complexes. This could, when the levels of both proteins are reduced, result in no change, as observed with HDAC11, or in augmentation by causing accessibility to other TFs, as in the case of IL12A, in the level of gene expression. We also observed that the knockdown of CXXC5 leads to an enhanced NFKBIZ expression without affecting the extent of the association of MeCP2 with the NFKBIZ promoter. The repression of MeCP2 levels, on the other hand, did not affect the NFKBIZ expression. This suggests that the DNA-associated CXXC5 recruits, or assists the loading of, MeCP2 to repress the NFKBIZ expression. But we also observed that the expression of NFKBIZ remains elevated when both CXXC5 and MeCP2 protein levels are suppressed. This raises the possibility that the binding of various TFs to exposed cognate binding sites as a result of reduced levels of CXXC5, and thereby the associated MeCP2, increases the NFKBIZ expression. Deciphering possibilities involved in target gene expressions could further shed light on the function of CXXC5 in cellular events.
In summary, the proximity interaction analysis we employed here indicates that although lacks an intrinsic transcription regulatory activity or an enzymatic function, CXXC5 as an unmethylated CpG binder interacts with various DNA/chromatin modelers and transcription factors/co-regulators and is involved in the modulation of target gene expressions. While constituting a prelude for the identification of interaction partners of CXXC5, our findings here provide a basis for a better understanding of the regulatory mechanisms of CXXC5-mediated cellular processes in response to intrinsic and extrinsic stimuli in a lineage-specific manner in both physiology and pathophysiology that could offer clinical benefits.
Materials and methods
Cell culture and transfection
MCF7 and HEK293 cells were cultured in phenol red-free, high glucose (4.5 g/L) containing Dulbecco’s modified Eagle’s medium (DMEM, Lonza, Belgium, BE12-917F) supplemented with 10% fetal bovine serum (FBS, Lonza), 1% L-Glutamine (Lonza, BE17-605E) and 1% Penicillin/Streptomycin (Lonza, Belgium) as described previously19,21,90. MCF7 or HEK293 cells were transiently transfected with Turbofect transfection reagent (R0533; ThermoFisher, Waltham, MA, USA) for 48 h if not otherwise specified. Protein concentrations in extracts were assessed with a Bradford protein assay kit (Bio-Rad Life Sciences; 5000001). Restriction and DNA modifying enzymes were obtained from New England Bio-Labs (Beverly, MA, USA) or ThermoFisher. Chemicals were obtained from Sigma-Aldrich (Germany) or ThermoFisher. Pageruler Prestained Protein Ladder (ThermoFisher; 26616) or Pageruler Plus Prestained Protein Ladder (ThermoFisher; 26620) was used as the molecular mass (MM) marker. In all PCR-based approaches, at least two distinct primer sets, and their combinations, designed with the PrimerQuest Tool of Integrated DNA Technologies (IDT; https://www.idtdna.com/pages/tools/primerquest) were initially used for testing their efficiencies in amplifying template DNA (genomic, cDNA, or plasmid) under various conditions including varying temperatures without or with various DMSO concentrations. Based on PCR results, a primer set giving the best amplification efficiency was used in experiments and reported in Supplementary Information, Table S2.
Generation and functional analyses of protein components of BioID
To generate protein components of BioID, the 3xFlag-CXXC5 (3F-CXXC5) cDNA obtained with PCR using the wild-type CXXC5 cDNA19 as the template was genetically fused to the 5’ end of a sequence encoding the BirA*-HA cDNA present in the pcDNA expression vector, pcDNA3.1-BirA*(R118G)-HA obtained from Addgene (36047). To generate BirA*(R118G)-HA cDNA with the translation initiation codon encoding methionine within the context of Kozak sequence (CGCCATG), we used PCR with primers and BirA*(R118G)-HA cDNA as the template. The cDNA was then cloned into the pcDNA3.1 vector and sequenced to ensure the fidelity of the encoding sequences. To assess the synthesis and intracellular location of the protein components of BioID, we carried out western blot (WB) and immunocytochemistry (ICC) analyses in transiently transfected MCF7 cells derived from a breast adenocarcinoma. The expression vector pcDNA3.1 bearing none, the BirA*-HA, the 3F-CXXC5, or the 3F-CXXC5-BirA*-HA cDNA were transiently transfected into MCF7 cells for 24 h. Cells were then treated without or with 50 μM biotin (Sigma-Aldrich; B4639) and 1 mM ATP (Adenosine 5’-triphosphate disodium salt hydrate, Sigma-Aldrich; A2283) for 16 h followed by ICC and WB.
For WB analyses, cellular extracts (100 µg) using M-PER Mammalian Protein Extraction Reagent (ThermoFisher; 78,501) were subjected to SDS-%10PAGE followed by transfer onto a PVDF membrane (Advansta, WesternBright PVDF-CL, L-08008–001) using a wet transfer system at 100 V for 1 h. The membrane was blocked with buffer containing 5% skim milk (Bio-Rad, 170-6404) in 0.1% Tris Buffered Saline-Tween (TBS-T) for overnight at 4 °C followed by the incubation with the Flag antibody (SigmaAldrich; F1834; 1:1000 dilution), the HA (Abcam, ab9110; 1:1000 dilution) or the Biotin (Abcam ab53494; 1:200 dilution) antibody in the blocking buffer for 1 h at room temperature. The membrane was then washed extensively with 0.1% TBS-T followed by incubation with the blocking buffer containing an HRP conjugated goat anti-mouse IgG secondary antibody (Advansta, R-05071-500; 1:4000 dilution) for the Flag antibody or an HRP conjugated goat anti-rabbit IgG secondary antibody (Advansta, R-05072-500; 1:4000 dilution) for the HA or the Biotin antibody for one hour at room temperature. After an extensive wash with 0.1% TBS-T, the membrane was incubated for two minutes with WesternBright ECL reagent (Advansta, K-12045-D50) at 1:1 luminol-enhancer reagent:peroxide ratio. Visualizations were carried out using the ChemiDoc MP system (Bio-Rad, 1708280), and images were analyzed with Image Lab software (Bio-Rad).
For ICC, MCF7 cells (5 × 104) grown on coverslips in 12-well tissue culture plates were transiently transfected for 48 h. Cells were then washed with PBS and fixed with 3.7% formaldehyde in PBS for 30 min. The cells were permeabilized with 0.4% Triton-X-100 in PBS for 10 min followed with the incubation containing 10% normal goat serum in PBS for the HA or the Biotin antibody or 10% BSA in PBS for the Flag antibody to block the non-specific antibody binding for 1 h. Cells were then incubated with the HA (Abcam, ab9119; 1:500 dilution), the Flag (Sigma Aldrich, F-1804; 1:250 dilution), or Biotin (Abcam ab53494; 1:100 dilution) antibody in the corresponding blocking buffer for 2 h. Cells were subsequently washed with PBS and incubated with a goat anti-mouse IgG H&L (Alexa Fluor 488; green-fluorescent dye; ab150113; 1:1000 dilution) for the Flag antibody, a goat anti-rabbit IgG H&L (Alexa Fluor 594; red-fluorescent dye; ab150077; 1: 1000 dilution) for the HA antibody or the Biotin antibody at room temperature for 1 h. The cells were rinsed in PBS and mounted onto glass slides with a mounting medium containing DAPI (4,6-diamidino-2-phenylindole; blue-fluorescent stain) for nuclear staining (Abcam, ab104139). Images were viewed and captured with the Nikon Eclipse 50i Fluorescence Microscope.
Proximity-dependent biotinylation (BioID) assay
To identify the proximity interaction partners of CXXC5, we carried out the BioID assay. MCF7 cells (2.5 × 106/10 cm2 culture dish of total 10 culture dishes) grown for 48 h were transiently transfected with the expression vector pCDNA3.1 bearing none, the BirA*-HA, or the 3F-CXXC5-BirA*-HA cDNA for 24 h. Cells were then treated with 50 μM Biotin 1 mM ATP for 16 h. Cells were collected and washed with cold PBS and then lysed at room temperature in lysis buffer [50 mM Tris, pH 7.4; 500 mM NaCl; 0.4% SDS; 5 mM EDTA; 2% TritonX; 1 mM DTT with freshly added protease (Roche; 5892970001) and phosphatase (Roche; 4906845001)] inhibitors. Cell lysates were sonicated for a total of 7.5 min (with 5-s pulse and 10-s rest in between pulses) and centrifuged at 7500 rpm for 10 min at 4 °C. The supernatant was incubated with 50 µl Streptavidin magnetic beads (NEB, S1420S) overnight. Beads were collected and washed twice with Wash Buffer I (2% SDS in dH2O) for 10 min. Beads were washed once with Wash Buffer II (2% deoxycholate; 1% TritonX; 50 mM NaCl; 50 mM HEPES pH 7.5; 1 mM EDTA) for 10 min, once with Wash Buffer III (0.5% NP-40; 0.5% deoxycholate; 1% TritonX; 500 mM NaCl; 1 mM EDTA; 10 mM Tris pH 8.0) for 10 min, and once with Wash Buffer IV (50 mM Tris, pH 7.4; 50 mM NaCl) for 30 min, respectively, and all washing steps were carried out at room temperature. 10% of bound proteins were eluted from the streptavidin beads with 50 µl of Laemmli-DTT sample buffer containing 500 nM D-Biotin for WB analyses using the biotin antibody (Ab533494) and the remaining samples were subjected to Mass Spectrometry (MS) analyses.
Protein identification by mass spectrometry
MS analyses were carried out at the Koç University Proteomic Facility (Istanbul, Turkey). The protein-bound streptavidin beads were washed with 50 mM NH4HCO3, followed by reduction with 100 mM DTT in 50 mM NH4HCO3 at 56 °C for 45 min, and alkylation with 100 mM iodoacetamide at RT in the dark for 30 min. MS Grade Trypsin Protease (Pierce) was added onto the beads for overnight digestion at 37 °C (enzyme: protein ratio of 1:100). The resulting peptides were purified using C18 StageTips (ThermoFisher). Peptides were analyzed by online C18 nanoflow reversed-phase HPLC (2D nanoLC; Eksigent) linked to a Q-Exactive Orbitrap mass spectrometer (ThermoFisher). The data sets were searched against the human SWISS-PROT database version 2014_08. Proteome Discoverer (version 1.4; ThermoFisher) was used to identify proteins. The final protein lists were analyzed using the STRING v10.591 and DAVID92 databases.
Validation of interaction partners of CXXC5
Cloning
Based on BioID results, we cloned the cDNA for MeCP2 (Methyl CpG Binding Protein 2), MAZ (MYC Associated Zinc Finger Protein), or EMD (Emerin) into the pcDNA 3.1(-) vector bearing in-frame sequences at the 5’-end of the multiple cloning site that encode for an amino-terminally located 3xFlag or HA tag. The cDNA for MeCP2 was kindly provided by Dr. K. Miyake, University of Yamanashi, Japan. The EMD (HsCD00324605) or the MAZ (HsCD00377948) cDNA was obtained from the Harvard Plasmid (https://plasmid.med.harvard.edu). The MAZ cDNA encodes an amino-terminally truncated variant that uses a methionine residue at position 230 of the full-length MAZ (477 amino acid) as the first methionine, which results in the synthesis of a 248 amino acid long truncated MAZ protein with an estimated MM of 28 kDa (MAZΔN). Using cDNAs as templates we generated PCR amplicons by PCR and inserted them into the pcDNA 3.1(-) expression vector. We also cloned a MAZ cDNA encoding the full-length protein with an estimated MM of 51.1 kDa using a cDNA library from MCF7 cells and PCR into the 3xFlag- or HA-tagged pcDNA 3.1(-) vector. Similarly, EMD lacking the LEM domain, EMDΔLEM, residues of 1 through 4745 was cloned the 3xFlag- or HA-tagged pcDNA 3.1(-) vector.
To identify a sub-region(s) of CXXC5, we generated cDNAs encoding amino and/or carboxyl-terminally truncated CXXC5 proteins. To ensure that some CXXC5 variant proteins lacking the nuclear localization signal located at the carboxyl-terminus CXXC domain localize to the nucleus as the full-length CXXC5 we inserted sequences generated by PCR using the CXXC5 cDNA as the template to encode an NLS derived from the SV40 T antigen54 between sequences encoding the Flag epitope and a CXXC5 variant.
All constructs were sequenced for the fidelity of encoding sequences. Tag and Primer sequences are given in Supplementary Information, Table S2.
ICC
HEK293 cells (2.5 × 104) plated on coverslips in a well of 12-well tissue culture plates were grown for 48 h. Cells were then transiently transfected with expression vectors bearing the 3F-CXXC5 (or HA-CXXC5) cDNA alone or together with the HA or 3F tagged EMD, MAZ, or MeCP2 cDNA using Turbofect transfection reagent (Thermo Scientific, R0532). 48 h after transfections, cells were processed for ICC as described above using the Flag and/or HA antibodies followed by Alexa Fluor conjugated secondary antibodies for visualization with a Nikon Eclipse 50i Fluorescence Microscope. ImageJ software was used for image analysis.
WB
HEK293 cells (15 × 104) plated on six-well tissue culture plates for 48 h were transiently transfected with the expression vector bearing the 3F-CXXC5 (or HA-CXXC5) cDNA alone or together with the HA- or 3F-tagged EMD, MAZ, or MeCP2 cDNA using Turbofect transfection reagent (Thermo Scientific, R0532). Cells were then processed for WB as described above.
Co-Immunoprecipitation (Co-IP)
Transiently transfected HEK293 cells in six-well plates were collected with trypsin and lysed with NE-PER (ThermoFisher; 78,833) that contained freshly added protease and phosphatase inhibitors. The protein concentration of lysates was measured by the Bradford Protein Assay. To block non-specific protein binding to magnetic beads, 500 μg lysates were incubated with non-specific IgG (5 μg) together with 25 μl Protein A and G conjugated magnetic beads at 4 °C for 1 h. Lysates in 1.5 ml centrifuge tubes were then applied to a magnetic field for 30 s to pull the beads to the side of the tube. The supernatant was transferred to a clean 1.5 ml microcentrifuge tube and beads were discarded. The pre-cleared lysates were subsequently incubated with a 5 µg HA or Flag antibody at 4 °C overnight and followed by the addition of 25 μl Protein A and G conjugated magnetic beads at 4 °C for 1.5 h Beads were then washed two times with 500 μl IP buffer composed of 150 mM NaCl, 10 mM HEPES pH 7.5, 10 mM MgCl2, 0.5% Igepal, protease inhibitor, and phosphatase inhibitors. Bead pellets were resuspended in 30 μl of 2xLaemmli-SDS buffer [187.5 mM Tris–HCl (pH 6.8), 6% (w/v) SDS, 30% glycerol, 150 mM DTT, 0.03% (w/v) bromophenol blue, 2% β-mercaptoethanol] and incubated at 95 °C for 5 min. Samples were then applied to a magnetic field for 30 s and supernatants were subjected to SDS-10%PAGE for WB analysis using the Flag or the HA antibody followed by the HRP conjugated VeriBlot for IP Detection Reagent (Abcam, ab131366).
Proximity ligation assay (PLA)
Duolink In Situ Red Starter kit (Sigma-Aldrich) was used according to the manufacturer’s instructions. In brief, HEK293 cells (2.5 × 104) were grown on glass coverslips in a well of a 12-well tissue culture plate for 48 h. Cells were then transfected with the expression vectors bearing the 3F-CXXC5 and HA-MeCP2 cDNA. After transfection, cells were fixed with 3.2% PFA in PBS for 10 min, permeabilized with 0.1% Triton-X for 5 min, and then blocked with Duolink blocking solution at 37 °C for 30 min. Cells were subsequently probed with the Flag (1:500) and/or the HA (1:250) antibody overnight at 4 °C. Cells were then treated with fluorescent probes for 1 h at 37 °C. Cells were washed in wash buffer A for 10 min at RT and incubated with secondary antibodies conjugated to plus and minus PLA probes for 1 h at 37 °C. After repeating the washing step with wash buffer A for 10 min at RT, cells were incubated with the ligase for 30 min at 37 °C. After another washing cycle with wash buffer A, cells were incubated with the polymerase in the amplification buffer for 100 min at 37 °C. Finally, the cells were washed in 1X wash buffer B for 20 min and then with 0.01X wash buffer B for 1 min at RT. Duolink In Situ Mounting media with DAPI was used for nuclear staining. Images were captured with a Nikon Eclipse 50i Fluorescence Microscope. ImageJ software was used for image analysis.
Chromatin Immunoprecipitation (ChIP), ChIP-WB and ChIP-qPCR
ChIP was carried out as described19,93,94. In all ChIP experiments (ChIP-WB or ChIP-qPCR), transfected cells (10 × 106) were fixed with 1% paraformaldehyde at room temperature (RT) for 10 min with gently shaking. Glycine was then added to a final concentration of 125 mM to quench the paraformaldehyde at RT for 5 min with gently shaking. Cells were washed with PBS twice and collected in 5 ml PBS with a cell scraper. The pelleted cells were lysed with ChIP lysis buffer (1% SDS, 10 mM EDTA, 50 mM Tris–HCl, pH 8.1) and sonicated to shear DNA to lengths between 200 and 1000 bp. Cell debris was then pelleted and the supernatant was collected. The supernatant was diluted with ChIP dilution buffer (0.01% SDS, 1.1% Triton X-100,1.2 mM EDTA, 16.7 mM Tris–HCl, pH 8.1, 167 mM NaCl), supplemented with protease inhibitor (Roche). To preclear, diluted samples were incubated with 30 µl Protein A (NEB #S1425S) and 30 µl Protein G (NEB #S1430S) magnetic beads with rotation for 1 h at 4 °C. The beads were then pelleted with a magnetic separator and the supernatant was collected. 10% of supernatant was set aside as the input control. The Flag, the HA antibody, or a species-specific IgG (10 µg) were then added into the supernatant and incubated for 1 h at 4 °C with rotation. Protein A/G magnetic beads (60 µl) were then put into the mixture overnight at 4 °C with rotation. The mixture was pelleted with a magnetic separator. Immunoprecipitates were sequentially washed once each with wash buffers containing low salt (0.1% SDS, 1%Triton X-100, 2 mM EDTA, 20 mM Tris–HCl, pH 8.1, 150 mM NaCl) high salt (0.1% SDS, 1%Triton X-100, 2 mM EDTA, 20 mM Tris–HCl, pH 8.1, 500 mM NaCl), and lithium chloride (0.25 M LiCl,1% NP40, 1% deoxycholate, 1 mM EDTA, 10 mM Tris–HCl, pH 8.1) followed by two washes with Tris–EDTA buffer (10 mM Tris–HCl, 1 mM EDTA,pH 8.0).
For ChIP-WB, immunoprecipitates following washes were directly dissolved in 40 µl 6XLaemmli buffer (375 mM Tris–HCl pH 6.8, 6% SDS, 4.8% Glycerol, 9% 2-Mercaptoethanol, 0.03% Bromophenol blue) and were boiled for 10 min. Beads were removed with a magnetic stand and the supernatants were subjected to SDS-8%PAGE followed by WB.
For ChIP-qPCR, immunoprecipitates after washes were resuspended with a ChIP elution buffer (10 mM Tris–HCl pH 8.0, 1 mM EDTA) followed by the addition of NaCl (to the final concentration of 300 mM) to de-crosslink protein-DNA interactions. Samples were then incubated at 37 °C for 1 h for RNase treatment and followed by incubation at 65 °C for 4 h with Proteinase K treatment (10 mg/ml). DNA was then recovered with phenol:chloroform:isoamyl alcohol (25:24:1) followed by ethanol precipitation of DNA for qPCR.We also carried out ChIP for endogenous MeCP2 using a MeCP2-specific antibody (Proteintech Group, Inc., Rosemont, IL, USA; 10861-1-AP) in MCF7 cells transiently transfected with 10 nM of a scrambled siRNA (AllStar, CtS) for 48 h. For ChIP-WB, samples from ChIP carried out as described above were resuspended in 40 µl of 2xLaemmli-SDS buffer and incubated at 95 °C for 10 min. Supernatants were subjected to SDS-8%PAGE for WB analysis using the HA antibody. For ChIP-qPCR, following de-crosslinking and protein digestion, DNA was recovered with phenol:chloroform:isoamyl alcohol (25:24:1) followed by ethanol precipitation and was subjected to qPCR using primers specific for the promoter region of HDAC11, NFKBIZ, IL12A, or the Exon10 of CXXC5 as a negative control (Supplementary Information, Table S2).
qPCR results were normalized using percent (%) of input approach90 and depicted as fold changes compared to CtS following normalization to IgG.
siRNA transfections and RT-qPCRs
For siRNA transfection, MCF7 cells in 6-well tissue culture plates were transiently transfected with the HiPerfect transfection reagent (Qiagen) using 10 nM a scrambled siRNA (AllStar, CtS), a siRNA specific for CXXC5 (siRNA#10; FlexiTube GeneSolution, Qiagen), as we described previously19,21, and/or a siRNA pool specific to MeCP2 (sc-35892, SCBT). To equalize the total amount of siRNA (20 nM) used in co-transfection experiments, 10 nM gene-specific siRNA was used together with 10 nM CtS. Isolated total RNA was used for the cDNA synthesis (The RevertAid First Strand cDNA Synthesis Kit, ThermoFisher). The SYBR Green Mastermix (BioRad, Hercules, CA, USA) and gene-specific primers (Supplementary Information, Table S2) were used for qPCR reactions on BioRad Connect Real-Time PCR.
For the normalization of results, we used the expression of RPLP0 (60S acidic ribosomal protein P0), as we described previously95. The relative quantification of gene expressions was assessed with the comparative 2-ΔΔCT method96. For qPCR experiments, MIQE Guidelines were followed97.
Bisulfite PCR
To assess the DNA methylation state, we used bisulfite DNA sequencing. CtS siRNA or siRNA#10 transfected MCF7 cells for 48 h were subjected to genomic DNA isolation by using QIAamp DNA Mini Kit (Qiagen, 51304) according to the manufacturer’s protocol. Bisulfite conversion of 500 ng of isolated gDNA was performed with EZ-DNA Methylation Lightning Kit (Zymo Research, D5030). The bisulfite converted DNA was used as the template for PCR reaction (LongAmp Taq Polymerase, NEB, M0323) using bisulfite converted DNA specific primers designed with MethylViewer98. Amplicons were cloned into the pGEM-T vector (Promega, A3600) for sequencing. Sequences were analyzed with the QUMA99 tool (http://quma.cdb.riken.jp/).
ChIP-seq data analysis
To investigate DNA regions that CXXC5 could interact with, data from a previously carried out CXXC5 ChIP-seq experiment were analyzed22. The analyses were carried out with publicly available bioinformatics tools available on the Cancer Genomics Cloud (CGC) (Seven Bridges Genomics, Boston, USA). Briefly, the raw sequencing reads of ChIP-seq experiments (GEO accession: GSE132025) were retrieved from the NCBI Gene Expression Omnibus (GEO) database as sequence read archive (SRA) files. The SRA files were first converted to FASTQ format using the SRA Toolkit fastq-dump tool. The sequenced reads in FASTQ format were aligned on the mouse reference genome version 10 (mm10) using the Burrows-Wheeler Aligner (BWA) bwa-backtrack algorithm specialized for short reads96 and the peaks were called using the MACS2 tool version 2.1.1100. Both tools are available on the CGC platform as a workflow. We used the default parameters for both tools and used the broad peak calling functionality of MACS2 to identify binding regions.
Correlation analysis between mRNA expressions of CXXC5 and MeCP2 in breast cancer patients
To assess the possible correlation between the CXXC5 and MeCP2 expressions, we used the GEPIA (Gene Expression Profiling Interactive Analysis) webserver65 for the expression analysis of the CXXC5 and MeCP2 genes based on paired normal tissue and tumor tissue samples from the TCGA (https://www.cancer.gov/tcga) and healthy breast tissue samples from the GTEx101 databases. The gene expression profiles of CXXC5 and MeCP2 across all tumor samples and paired normal tissues as well as the correlation between mRNA expressions of CXXC5 and MeCP2 in normal and breast tumor samples were analyzed.
Statistical analysis
Experiments were repeated at least two independent times. Results, where and when appropriate, were presented as the mean ± standard error (S.E.) of three biological replicates. Statistical analyses were performed using a two-tailed unpaired t-test with a confidence interval, minimum, of 95%.
Supplementary Information
Acknowledgements
This work was supported by grants from TUBITAK-KBAG 114Z243 and METU-BAP-01-08-2016-001. Gamze Ayaz was supported by graduate study fellowships from TUBITAK 2214/A. The Seven Bridges Cancer Genomics Cloud has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, Contract No. HHSN261201400008C and ID/IQ Agreement No. 17X146 under Contract No: HHSN261201500003I. We thank Dr. K. Miyake, University of Yamanashi, Japan for generously providing the cDNA for MeCP2. We gratefully acknowledge the critical guidance and the offer for the laboratory resources of Dr. Nurhan Özlü, Koç University, Istanbul, Turkey, throughout the study. We express our gratitude to Büşra Akarlar and Dr. Zeynep Cansu Üretmen of the Koç University Proteomics Facility, Istanbul, Turkey, for the guidance and help in the execution of MS analyses and PLA. We thank Rengül Çetin Atalay and Deniz Kahraman for allowing us to use the Fluorescence Microscope facility.
Author contributions
G.A., P.Y., and M.M. designed and oversaw the study. G.A. and Ç.E.O. conducted BioID and analyzed MS data; G.A. and G.T. performed ICC. G.A., G.T., B.K., and Ö.D.D. carried Co-IP and WB experiments. G.T. performed PLA. P.Y. and G.K. carried out ChIP and DNA methylation assays. G.A., K.Y., Ç.E.O., P.Y., T.C. carried out bioinformatics analyses. All analyzed the results. G.A., P.Y., and M.M. wrote the manuscript. All contributors reviewed and approved the manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Gamze Ayaz, Gizem Turan, Çağla Ece Olgun and Pelin Yaşar.
Contributor Information
Gamze Ayaz, Email: gamze.ayazsen@nih.gov.
Mesut Muyan, Email: mmuyan@metu.edu.tr.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-97060-6.
References
- 1.Li E, Zhang Y. DNA methylation in mammals. Cold Spring Harb. Perspect. Biol. 2014;6:a019133. doi: 10.1101/cshperspect.a019133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Smith ZD, Meissner A. DNA methylation: Roles in mammalian development. Nat. Rev. Genet. 2013;14:204–220. doi: 10.1038/nrg3354. [DOI] [PubMed] [Google Scholar]
- 3.Du Q, Luu P-L, Stirzaker C, Clark SJ. Methyl-CpG-binding domain proteins: readers of the epigenome. Epigenomics. 2015;7:1051–1073. doi: 10.2217/epi.15.39. [DOI] [PubMed] [Google Scholar]
- 4.Long HK, Blackledge NP, Klose RJ. ZF-CxxC domain-containing proteins, CpG islands and the chromatin connection. Biochem. Soc. Trans. 2013;41:727–740. doi: 10.1042/BST20130028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Saxonov S, Berg P, Brutlag DL. A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proc. Natl. Acad. Sci. U.S.A. 2006;103:1412–1417. doi: 10.1073/pnas.0510310103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Deaton AM, Bird A. CpG islands and the regulation of transcription. Genes Dev. 2011;25:1010–1022. doi: 10.1101/gad.2037511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Xu C, et al. DNA sequence recognition of human CXXC domains and their structural determinants. Structure. 2018;26:85–95.e3. doi: 10.1016/j.str.2017.11.022. [DOI] [PubMed] [Google Scholar]
- 8.Xiong X, Tu S, Wang J, Luo S, Yan X. CXXC5: A novel regulator and coordinator of TGF-β, BMP and Wnt signaling. J. Cell. Mol. Med. 2019;23:740–749. doi: 10.1111/jcmm.14046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Blackledge NP, Klose RJ. CpG island chromatin: A platform for gene regulation. Epigenetics. 2011;6:147–152. doi: 10.4161/epi.6.2.13640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yaşar P, Muyan M. CXXC5 (CXXC finger protein 5) Atlas Genet. Cytogenet. Oncol. Haematol. 2014 doi: 10.4267/2042/55369. [DOI] [Google Scholar]
- 11.Pendino F, et al. Functional involvement of RINF, retinoid-inducible nuclear factor (CXXC5), in normal and tumoral human myelopoiesis. Blood. 2009;113:3172–3181. doi: 10.1182/blood-2008-07-170035. [DOI] [PubMed] [Google Scholar]
- 12.Yan X, et al. CXXC5 suppresses hepatocellular carcinoma by promoting TGF-β-induced cell cycle arrest and apoptosis. J. Mol. Cell Biol. 2018;10:48–59. doi: 10.1093/jmcb/mjx042. [DOI] [PubMed] [Google Scholar]
- 13.Kim HY, et al. CXXC5 is a transcriptional activator of Flk-1 and mediates bone morphogenic protein-induced endothelial cell differentiation and vessel formation. FASEB J. 2014;28:615–626. doi: 10.1096/fj.13-236216. [DOI] [PubMed] [Google Scholar]
- 14.Andersson T, et al. CXXC5 is a novel BMP4-regulated modulator of Wnt signaling in neural stem cells. J. Biol. Chem. 2009;284:3672–3681. doi: 10.1074/jbc.M808119200. [DOI] [PubMed] [Google Scholar]
- 15.Kim H-Y, et al. CXXC5 is a negative-feedback regulator of the Wnt/β-catenin pathway involved in osteoblast differentiation. Cell Death Differ. 2015;22:912–920. doi: 10.1038/cdd.2014.238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lee SH, et al. The Dishevelled-binding protein CXXC5 negatively regulates cutaneous wound healing. J. Exp. Med. 2015;212:1061–1080. doi: 10.1084/jem.20141601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kim MY, et al. CXXC5 plays a role as a transcription activator for myelin genes on oligodendrocyte differentiation. Glia. 2016;64:350–362. doi: 10.1002/glia.22932. [DOI] [PubMed] [Google Scholar]
- 18.Nott SL, et al. Genomic responses from the estrogen-responsive element-dependent signaling pathway mediated by estrogen receptor α are required to elicit cellular alterations. J. Biol. Chem. 2009;284:15277–156288. doi: 10.1074/jbc.M900365200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yaşar P, Ayaz G, Muyan M. Estradiol-estrogen receptor α mediates the expression of the CXXC5 gene through the estrogen response element-dependent signaling pathway. Sci. Rep. 2016;6:37808. doi: 10.1038/srep37808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Choi S, et al. CXXC5 mediates growth plate senescence and is a target for enhancement of longitudinal bone growth. Life Sci. Alliance. 2019;2:e201800254. doi: 10.26508/lsa.201800254. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ayaz G, et al. CXXC5 as an unmethylated CpG dinucleotide binding protein contributes to estrogen-mediated cellular proliferation. Sci. Rep. 2020;10:5971. doi: 10.1038/s41598-020-62912-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ravichandran M, et al. Rinf regulates pluripotency network genes and tet enzymes in embryonic stem cells. Cell Rep. 2019;28:1993–2003.e5. doi: 10.1016/j.celrep.2019.07.080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ma S, et al. Epigenetic regulator CXXC5 recruits DNA demethylase Tet2 to regulate TLR7/9-elicited IFN response in pDCs. J. Exp. Med. 2017;214:1471–1491. doi: 10.1084/jem.20161149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Aras S, et al. Oxygen-dependent expression of cytochrome c oxidase subunit 4–2 gene expression is mediated by transcription factors RBPJ, CXXC5 and CHCHD2. Nucleic Acids Res. 2013;41:2255–2266. doi: 10.1093/nar/gks1454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Li G, et al. CXXC5 regulates differentiation of C2C12 myoblasts into myocytes. J. Muscle Res. Cell Motil. 2014;35:259–265. doi: 10.1007/s10974-014-9400-2. [DOI] [PubMed] [Google Scholar]
- 26.Tsuchiya Y, et al. ThPOK represses CXXC5, which induces methylation of histone H3 lysine 9 in Cd40lg promoter by association with SUV39H1: İmplications in repression of CD40L expression in CD8 + cytotoxic T cells. J. Leukoc. Biol. 2016;100:327–338. doi: 10.1189/jlb.1A0915-396RR. [DOI] [PubMed] [Google Scholar]
- 27.Astori A, et al. The epigenetic regulator RINF (CXXC5) maintains SMAD7 expression in human immature erythroid cells and sustains red blood cells expansion. Haematologica. 2020 doi: 10.3324/haematol.2020.263558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhang M, et al. The CXXC finger 5 protein is required for DNA damage-induced p53 activation. Sci. China Ser. C Life Sci. 2009;52:528–538. doi: 10.1007/s11427-009-0083-7. [DOI] [PubMed] [Google Scholar]
- 29.Wang X, et al. CXXC5 associates with smads to mediate TNFα induced apoptosis. Curr. Mol. Med. 2013;13:1385–1396. doi: 10.2174/15665240113139990069. [DOI] [PubMed] [Google Scholar]
- 30.Marshall PA, et al. Discovery of novel vitamin D receptor interacting proteins that modulate 1,25-dihydroxyvitamin D3 signaling. J. Steroid Biochem. Mol. Biol. 2012;132:147–159. doi: 10.1016/j.jsbmb.2012.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.He Y, et al. A noncanonical AR addiction drives enzalutamide resistance in prostate cancer. Nat. Commun. 2021;12:1521. doi: 10.1038/s41467-021-21860-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ko M, et al. Modulation of TET2 expression and 5-methylcytosine oxidation by the CXXC domain protein IDAX. Nature. 2013;497:122–126. doi: 10.1038/nature12052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.L’Hôte D, et al. Discovery of novel protein partners of the transcription factor FOXL2 provides insights into its physiopathological roles. Hum. Mol. Genet. 2012;21:3264–3274. doi: 10.1093/hmg/dds170. [DOI] [PubMed] [Google Scholar]
- 34.van den Berg DLC, et al. An Oct4-centered protein interaction network in embryonic stem cells. Cell Stem Cell. 2010;6:369–381. doi: 10.1016/j.stem.2010.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gagliardi A, et al. A direct physical interaction between Nanog and Sox2 regulates embryonic stem cell self-renewal. EMBO J. 2013;32:2231–2247. doi: 10.1038/emboj.2013.161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Peng X, et al. CXXC5 is required for cardiac looping relating to TGFβ signaling pathway in zebrafish. Int. J. Cardiol. 2016;214:246–253. doi: 10.1016/j.ijcard.2016.03.201. [DOI] [PubMed] [Google Scholar]
- 37.Kim MS, et al. A novel wilms tumor 1 (WT1) target gene negatively regulates the WNT signaling pathway. J. Biol. Chem. 2010;285:14585–14593. doi: 10.1074/jbc.M109.094334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Roux KJ, Kim DI, Burke B. BioID: A screen for protein-protein interactions. Curr. Protoc. Protein Sci. 2018;91:19231–192315. doi: 10.1002/cpps.51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Sears RM, May DG, Roux KJ. BioID as a tool for protein-proximity labeling in living cells. Methods Mol. Biol. 2019;2012:299–313. doi: 10.1007/978-1-4939-9546-2_15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Roux KJ, Kim DI, Raida M, Burke B. A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells. J. Cell Biol. 2012;196:801–810. doi: 10.1083/jcb.201112098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Firat-Karalar EN, Stearns T. Probing mammalian centrosome structure using BioID proximity-dependent biotinylation. Methods Cell Biol. 2015;129:153–170. doi: 10.1016/bs.mcb.2015.03.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hein MY, et al. A human interactome in three quantitative dimensions organized by stoichiometries and abundances. Cell. 2015;163:712–723. doi: 10.1016/j.cell.2015.09.053. [DOI] [PubMed] [Google Scholar]
- 43.Szklarczyk D, et al. STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019 doi: 10.1093/nar/gky1131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wurm FM. Production of recombinant protein therapeutics in cultivated mammalian cells. Nat. Biotechnol. 2004;22:1393–1398. doi: 10.1038/nbt1026. [DOI] [PubMed] [Google Scholar]
- 45.Berk JM, Tifft KE, Wilson KL. The nuclear envelope LEM-domain protein emerin. Nucleus. 2013;4:298–314. doi: 10.4161/nucl.25751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Vlcek S, Foisner R. Lamins and lamin-associated proteins in aging and disease. Curr. Opin. Cell Biol. 2007;19:298–304. doi: 10.1016/j.ceb.2007.04.001. [DOI] [PubMed] [Google Scholar]
- 47.Leach N, et al. Emerin is hyperphosphorylated and redistributed in herpes simplex virus type 1-infected cells in a manner dependent on both UL34 and US3. J. Virol. 2007;81:10792–10803. doi: 10.1128/jvi.00196-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Berk JM, et al. O-Linked β-N-acetylglucosamine (O-GlcNAc) regulates emerin binding to barrier to autointegration factor (BAF) in a chromatin- and lamin B-enriched niche. J. Biol. Chem. 2013;288:30192–30209. doi: 10.1074/jbc.M113.503060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Yadav S, et al. EBV early lytic protein BFRF1 alters emerin distribution and post-translational modification. Virus Res. 2017;232:113–122. doi: 10.1016/j.virusres.2017.02.010. [DOI] [PubMed] [Google Scholar]
- 50.Yu ZH, et al. Dual function of MAZ mediated by FOXF2 in basal-like breast cancer: Promotion of proliferation and suppression of progression. Cancer Lett. 2017;402:142–152. doi: 10.1016/j.canlet.2017.05.020. [DOI] [PubMed] [Google Scholar]
- 51.Klose RJ, Bird AP. MeCP2 behaves as an elongated monomer that does not stably associate with the Sin3a chromatin remodeling complex. J. Biol. Chem. 2004;279:46490–46496. doi: 10.1074/jbc.M408284200. [DOI] [PubMed] [Google Scholar]
- 52.Schmidt A, Zhang H, Cardoso MC. MeCP2 and Chromatin compartmentalization. Cells. 2020;9:878. doi: 10.3390/cells9040878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Söderberg O, et al. Characterizing proteins and their interactions in cells and tissues using the in situ proximity ligation assay. Methods. 2008;45:227–232. doi: 10.1016/j.ymeth.2008.06.014. [DOI] [PubMed] [Google Scholar]
- 54.Kalderon D, Roberts BL, Richardson WD, Smith AE. A short amino acid sequence able to specify nuclear location. Cell. 1984;39:499–509. doi: 10.1016/0092-8674(84)90457-4. [DOI] [PubMed] [Google Scholar]
- 55.Xu C, Bian C, Lam R, Dong A, Min J. The structural basis for selective binding of non-methylated CpG islands by the CFP1 CXXC domain. Nat. Commun. 2011;2:227. doi: 10.1038/ncomms1237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Buschdorf JP, Strätling WH. A WW domain binding region in methyl-CpG-binding protein MeCP2: İmpact on Rett syndrome. J. Mol. Med. 2004;82:135–143. doi: 10.1007/s00109-003-0497-9. [DOI] [PubMed] [Google Scholar]
- 57.Lee I, et al. Crystal structure of the PDZ domain of mouse Dishevelled 1 and its interaction with CXXC5. Biochem. Biophys. Res. Commun. 2017;485:584–590. doi: 10.1016/j.bbrc.2016.12.023. [DOI] [PubMed] [Google Scholar]
- 58.Dreos R, Ambrosini G, Groux R, Cavin Périer R, Bucher P. The eukaryotic promoter database in its 30th year: Focus on non-vertebrate organisms. Nucleic Acids Res. 2017;45:D51–D55. doi: 10.1093/nar/gkw1069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Dreos R, Ambrosini G, Périer RC, Bucher P. The Eukaryotic Promoter Database: Expansion of EPDnew and new promoter analysis tools. Nucleic Acids Res. 2015;43:D92–D96. doi: 10.1093/nar/gku1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Voelter-Mahlknecht S, Ho AD, Mahlknecht U. Chromosomal organization and localization of the novel class IV human histone deacetylase 11 gene. Int. J. Mol. Med. 2005;16:589–598. doi: 10.3892/ijmm.16.4.589. [DOI] [PubMed] [Google Scholar]
- 61.Van Kester MS, et al. A meta-analysis of gene expression data identifies a molecular signature characteristic for tumor-stage mycosis fungoides. J. Invest. Dermatol. 2012;132:2050–2059. doi: 10.1038/jid.2012.117. [DOI] [PubMed] [Google Scholar]
- 62.Liu T, et al. Cistrome: An integrative platform for transcriptional regulation studies. Genome Biol. 2011;12:R83. doi: 10.1186/gb-2011-12-8-r83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Zheng R, et al. Cistrome data browser: Expanded datasets and new tools for gene regulatory analysis. Nucleic Acids Res. 2019;47:D729–D735. doi: 10.1093/nar/gky1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Yaşar P, et al. A CpG island promoter drives the CXXC5 gene expression. Sci. Rep. 2021;11:15655. doi: 10.1038/s41598-021-95165-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Tang Z, Kang B, Li C, Chen T, Zhang Z. GEPIA2: An enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res. 2019;47(W1):W556–W560. doi: 10.1093/nar/gkz430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Liu N, et al. Intrinsic and extrinsic connections of Tet3 dioxygenase with CXXC zinc finger modules. PLoS ONE. 2013;8:e62755. doi: 10.1371/journal.pone.0062755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Joshi HR, et al. Frontline Science: Cxxc5 expression alters cell cycle and myeloid differentiation of mouse hematopoietic stem and progenitor cells. J. Leukoc. Biol. 2020;108:469–484. doi: 10.1002/JLB.1HI0120-169R. [DOI] [PubMed] [Google Scholar]
- 68.Ho CY, Lammerding J. Lamins at a glance. J. Cell Sci. 2012;125(Pt 9):2087–2093. doi: 10.1242/jcs.087288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Almendáriz-Palacios C, et al. The nuclear lamina: Protein accumulation and disease. Biomedicines. 2020;8:188. doi: 10.3390/biomedicines8070188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Strachan GD, et al. Fetal Alz-50 clone 1 interacts with the human orthologue of the Kelch-like ech-associated protein. Biochemistry. 2004;43:12113–12122. doi: 10.1021/bi0494166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Yang X, et al. Gene body methylation can alter gene expression and ıs a therapeutic target in cancer. Cancer Cell. 2014;26:577–590. doi: 10.1016/j.ccr.2014.07.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Mahmood N, Rabbani SA. DNA methylation readers and cancer: Mechanistic and therapeutic applications. Front. Oncol. 2019 doi: 10.3389/fonc.2019.00489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Amir RE, et al. Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl- CpG-binding protein 2. Nat. Genet. 1999;23:185–188. doi: 10.1038/13810. [DOI] [PubMed] [Google Scholar]
- 74.Lei M, Tempel W, Chen S, Liu K, Min J. Plasticity at the DNA recognition site of the MeCP2 mCG-binding domain. Biochim. Biophys. Acta Gene Regul. Mech. 2019;1862:194409. doi: 10.1016/j.bbagrm.2019.194409. [DOI] [PubMed] [Google Scholar]
- 75.Lagger S, et al. MeCP2 recognizes cytosine methylated tri-nucleotide and di-nucleotide sequences to tune transcription in the mammalian brain. PLoS Genet. 2017;13:e1006793. doi: 10.1371/journal.pgen.1006793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Guo JU, et al. Distribution, recognition and regulation of non-CpG methylation in the adult mammalian brain. Nat. Neurosci. 2014;17:215–222. doi: 10.1038/nn.3607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Gabel HW, et al. Disruption of DNA-methylation-dependent long gene repression in Rett syndrome. Nature. 2015;522:89–93. doi: 10.1038/nature14319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Kinde B, Gabel HW, Gilbert CS, Griffith EC, Greenberg ME. Reading the unique DNA methylation landscape of the brain: Non-CpG methylation, hydroxymethylation, and MeCP2. Proc. Natl. Acad. Sci. U.S.A. 2015;112:6800–6806. doi: 10.1073/pnas.1411269112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Chandler SP, Guschin D, Landsberger N, Wolffe AP. The methyl-CpG binding transcriptional repressor MeCP2 stably associates with nucleosomal DNA. Biochemistry. 1999;38:7008–7018. doi: 10.1021/bi990224y. [DOI] [PubMed] [Google Scholar]
- 80.Thambirajah AA, et al. MeCP2 binds to nucleosome free (linker DNA) regions and to H3K9/H3K27 methylated nucleosomes in the brain. Nucleic Acids Res. 2012;40:2884–2897. doi: 10.1093/nar/gkr1066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Yasui DH, et al. Integrated epigenomic analyses of neuronal MeCP2 reveal a role for long-range interaction with active genes. Proc. Natl. Acad. Sci. U.S.A. 2007;104:19416–19421. doi: 10.1073/pnas.0707442104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Chahrour M, et al. MeCP2, a key contributor to neurological disease, activates and represses transcription. Science. 2008;320:1224–1229. doi: 10.1126/science.1153252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Ghosh RP, et al. Unique physical properties and interactions of the domains of methylated DNA binding protein 2. Biochemistry. 2010;49:4395–4410. doi: 10.1021/bi9019753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Lyst MJ, Connelly J, Merusi C, Bird A. Sequence-specific DNA binding by AT-hook motifs in MeCP2. FEBS Lett. 2016;590:2927–2933. doi: 10.1002/1873-3468.12328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Liu M, et al. DNA looping by two 5-methylcytosine-binding proteins quantified using nanofluidic devices. Epigenetics Chromatin. 2020;13:18. doi: 10.1186/s13072-020-00339-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Horike SI, Cai S, Miyano M, Cheng JF, Kohwi-Shigematsu T. Loss of silent-chromatin looping and impaired imprinting of DLX5 in Rett syndrome. Nat. Genet. 2005;37:31–40. doi: 10.1038/ng1491. [DOI] [PubMed] [Google Scholar]
- 87.Ragione FD, Vacca M, Fioriniello S, Pepe G, D’Esposito M. MECP2, a multi-talented modulator of chromatin architecture. Brief. Funct. Genomics. 2016;15:420–431. doi: 10.1093/bfgp/elw023. [DOI] [PubMed] [Google Scholar]
- 88.Kinde B, Wu DY, Greenberg ME, Gabel HW. DNA methylation in the gene body influences MeCP2-mediated gene repression. Proc. Natl. Acad. Sci. U.S.A. 2016;113:15114–15119. doi: 10.1073/pnas.1618737114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Murphy DM, et al. Co-localization of the oncogenic transcription factor MYCN and the DNA methyl binding protein MeCP2 at genomic sites in neuroblastoma. PLoS ONE. 2011;6:e21436. doi: 10.1371/journal.pone.0021436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Muyan M, et al. Modulation of estrogen response element-driven gene expressions and cellular proliferation with polar directions by designer transcription regulators. PLoS ONE. 2015;10:e0136423. doi: 10.1371/journal.pone.0136423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Szklarczyk D, et al. The STRING database in 2017: Quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res. 2017;45(D1):D362–D368. doi: 10.1093/nar/gkw937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Ashburner M, et al. Gene Ontology: Tool for the unification of biology. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Huang J, et al. Binding of estrogen receptor β to estrogen response element in Situ is independent of estradiol and impaired by its amino terminus. Mol. Endocrinol. 2005;19:2696–2712. doi: 10.1210/me.2005-0120. [DOI] [PubMed] [Google Scholar]
- 94.Muyan M, Callahan LM, Huang Y, Lee AJ. The ligand-mediated nuclear mobility and interaction with estrogen-responsive elements of estrogen receptors are subtype specific. J. Mol. Endocrinol. 2012;49:249–266. doi: 10.1530/JME-12-0097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Lyng MB, Lænkholm AV, Pallisgaard N, Ditzel HJ. Identification of genes for normalization of real-time RT-PCR data in breast carcinomas. BMC Cancer. 2008;8:20. doi: 10.1186/1471-2407-8-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) method. Methods. 2001;25:402–408. doi: 10.1006/meth.2001.1262. [DOI] [PubMed] [Google Scholar]
- 97.Bustin SA, et al. The MIQE guidelines: Minimum information for publication of quantitative real-time PCR experiments. Clin. Chem. 2009;55:611–622. doi: 10.1373/clinchem.2008.112797. [DOI] [PubMed] [Google Scholar]
- 98.Pardo CE, et al. MethylViewer: Computational analysis and editing for bisulfite sequencing and methyltransferase accessibility protocol for individual templates (MAPit) projects. Nucleic Acids Res. 2011;39:e5. doi: 10.1093/nar/gkq716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Lonsdale J, et al. The genotype-tissue expression (GTEx) project. Nat. Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.