Summary
Transcriptional fidelity depends on accurate promoter selection and initiation from the correct sites. In yeast, H3K36me3-mediated recruitment of the Rpd3S HDAC complex to gene bodies suppresses spurious transcription initiation. Here we describe an equivalent pathway in metazoans. PWWP2A/B is an H3K36me3 reader that forms a stable complex with HDAC1/2. We used CAGE-seq to profile all transcription initiation sites in wild-type mESCs and cells lacking PWWP2A/B. Loss of PWWP2A/B enhances spurious initiation from intragenic sites present in wild-type mESCs, and this effect is associated with increased levels of initiating Pol-II and histone acetylation. Spurious initiation events in Pwwp2a/b DKO mESCs do not overlap in genomic location or chromatin features with spurious sites that arise in Dnmt3b KO mESCs, previously reported to function in the suppression of intragenic transcriptional initiation, suggesting these pathways function cooperatively in maintaining the fidelity of transcription initiation in metazoans.
Subject Areas: Molecular Biology, Molecular Mechanism of Gene Regulation, Stem Cells Research
Graphical Abstract
Highlights
-
•
Loss of PWWP2A/B leads to increased levels of spurious transcription initiation
-
•
Spurious TSS sites are predominantly in the gene bodies of highly expressed genes
-
•
Spurious sites are marked with increased histone acetylation and initiating Pol II
-
•
PWWP2-spurious TSSs are distinct from those caused by DNMT3B loss
Molecular Biology; Molecular Mechanism of Gene Regulation; Stem Cells Research
Introduction
Trimethylation of lysine 36 on histone H3 (H3K36me3) is a highly conserved posttranslational histone modification in eukaryotic organisms and is enriched over the bodies of actively transcribed genes (Bannister et al., 2005). H3K36me3 is recognized by the PWWP domain, a motif that is present in many chromatin-modifying complexes (Dhayalan et al., 2010; Qin and Min, 2014; Tian et al., 2019; van Nuland et al., 2013; Wen et al., 2014; Xu et al., 2008). These readers are involved in a variety of biological processes, including intragenic DNA methylation, transcription elongation, DNA repair, alternative splicing, and repression of spurious transcription initiation (as reviewed in Huang and Zhu, 2018; Wagner and Carpenter, 2012). Furthermore, deregulation of H3K36 methylation and mutations of associated factors are linked to developmental disorders and cancers (as reviewed in Li et al., 2019; Zaghi et al., 2019).
Studies in yeast and mammalian cells have found that H3K36me3 is important for maintaining transcriptional fidelity through repression of spurious transcription initiation from within coding regions. Mechanistically, H3K36me3-mediated recruitment of effectors from two different epigenetic pathways, DNA methylation and histone deacetylation, has been linked to the suppression of spurious intragenic transcription initiation in actively transcribed genes. Human cancer cells treated with inhibitors against DNA methyltransferases and histone deacetylases show elevated levels of spurious transcription initiation (Brocks et al., 2017). Loss of the H3K36me3-binding DNA methyltransferase DNMT3B in mouse embryonic stem cells (mESCs) leads to increased spurious transcription initiation (Neri et al., 2017). In Saccharomyces cerevisiae, H3K36me3 recruits the Rpd3S HDAC complex to deacetylate histones across coding regions to repress spurious transcription initiation (Carrozza et al., 2005; Joshi and Struhl, 2005; Keogh et al., 2005), but mammalian equivalents of this pathway remained uncharacterized. In this study, we uncover a newly described PWWP2A/B-HDAC complex in mammalian cells that suppresses transcription initiation from the gene bodies of actively transcribed genes.
Recently, we and others have identified a variant nucleosome remodeling and deacetylase (NuRD) complex, in which the histone deacetylase subunits of NuRD (MTA1/2/3,HDAC1/2, RBBP4/7) form a highly stable and stoichiometric complex with the H3K36me3-reader PWWP2A and its paralog PWWP2B (Figure 1A) (Link et al., 2018; Zhang et al., 2018). In this complex, PWWP2A/B, MTA1/2/3, and HDAC1/2 are present at a stoichiometry of 2:2:2 with PWWP2A/B being mutually exclusive with the canonical NuRD subunits MBD2/3 and CHD3/4 (Figure 1A). Although the NuRD complex has been found to play a role in cellular differentiation and cell type commitment through regulation of pluripotency and cellular reprogramming (Luo et al., 2013; Mor et al., 2018; Reynolds et al., 2012), the role of the PWWP2-HDAC complex is still not well understood. Unlike the NuRD complex subunits MBD3 and CHD4, which are enriched at active enhancers and promoters (Bornelov et al., 2018; Shimbo et al., 2013), the PWWP2A variant NuRD complex is targeted to gene bodies. We previously showed that the PWWP2A localizes to gene bodies through binding to H3K36me3 via its C-terminal PWWP domain, where it regulates histone acetylation levels at highly transcribed genes (Zhang et al., 2018). Given that this novel HDAC complex was targeted to H3K36me3, we wondered whether the PWWP2-HDAC complex could fulfill the same function as the yeast Rpd3S HDAC complex in protecting against spurious transcription initiation. Cap analysis of gene expression (CAGE-seq) (Takahashi et al., 2012), a method widely applied to capture transcription initiation events (Andersson et al., 2014; Brocks et al., 2017; Fort et al., 2014), allowed us to precisely map and quantify all sites of transcription initiation at canonical promoters and spurious sites genome-wide in wild-type and Pwwp2a/b double knockout (DKO) mouse embryonic stem cells (mESCs). We used mESCs deleted for both PWWP2A and PWWP2B as our previous biochemical characterized showed these two paralogs can coexist in the same complex. Absence of PWWP2A/B in mESCs leads to increased levels of spurious transcription initiation particularly over the gene bodies of highly expressed genes. The profile of spurious initiation events upon PWWP2A/B depletion in mESCs contrasts with that in Dnmt3b knockout (KO) cells, suggesting that, although both gene body DNA methylation and histone deacetylation function downstream of H3K36me3 signaling, they have distinct roles in the suppression of spurious transcription initiation.
Figure 1.
Loss of PWWP2A/B Leads to Increased Levels of intragenic Spurious Transcription Initiation in mESCs
(A) Comparison of the NuRD and PWWP2A/B HDAC complex, which contains the H3K36me3-binding PWWP2A/B protein bound to the MTA1/2/3:HDAC1/2:RBBP4/7 HDAC subcomplex (top). Schematic of nuclear RNA CAGE-seq from wild-type E14 and Pwwp2a/b DKO mESCs (bottom).
(B) The percentage of consensus TSS detected by CAGE-seq in wild-type E14 and Pwwp2a/b DKO clones A1 and C7 classified into four groups: annotated promoter, exonic, intronic, or intergenic TSS. See also Table S1.
(C) The percentage of CAGE-seq TSSs in each group (annotated promoter, exon, intron, intergenic) that overlap with ATAC-seq peaks for wild-type E14 and Pwwp2a/b DKO lines A1 and C7. See also Figure S1.
(D) Annotated promoter TSSs (blue bars) detected by CAGE in mESCs are enriched in core promoter elements (GC box, CCAAT box) and in transcription factor motifs commonly found at promoters (ETS, YY1, NRF) compared with randomly generated background sequences (gray bars). The top five most abundant and significant motifs found using Homer are shown with the significance value.
(E) Spurious TSSs (blue bars) from wild-type E14 mESCs are slightly enriched in core promoter elements (DCE, GC box) and transcription factor (ETS) sequences compared with randomly generated background sequences (gray bars). The top four most significant motifs found using Homer are shown with the significance value.
(F) UCSC genome browser view of the CAGE signal across several sites. Left: CAGE captures TSSs of different isoforms of the Wtap gene. Middle: transcription initiation peaks detected in the gene body of the HnrnpK gene where arrows indicate increased spurious transcription initiation events in Pwwp2a/b DKO lines. Right: intergenic transcription initiation event that overlaps with a region of high chromatin accessibility, possibly an enhancer. See also Figure S2.
(G) CAGE-seq expression level heatmap for annotated promoter, exonic, intronic, and intergenic TSSs divided into two groups based on behavior: Up (increased CAGE expression in Pwwp2a/b DKO compared with wild-type) and Down (decreased CAGE expression in Pwwp2a/b DKO compared with wild-type). More spurious exonic and intronic TSSs go Up than Down in Pwwp2a/b DKO cells. The bottom row shows the common events shared between the two clones.
(H) CAGE expression for annotated promoter, exonic, intronic, and intergenic TSSs in wild-type E14 and Pwwp2a/b DKO lines calculated using the CAGEr package in tags per 10 million (TP10M). The p values between wild-type E14 and DKO lines were calculated using the two-sided Mann-Whitney test. The exonic TSSs shows significantly more transcription initiation in the Pwwp2a/b DKO lines.
Results
Loss of PWWP2A/B in mESCs Results in Increase Spurious Transcription Initiation within Intragenic Sites
To investigate the role of the PWWP2A/B HDAC complex in spurious transcription initiation, we performed CAGE-seq in wild-type mESCs and two previously characterized Pwwp2a/b DKO mESC lines (clones A1 and C7) (Zhang et al., 2018). CAGE-seq using the nuclear RNA fraction as input was performed in triplicate for wild-type and Pwwp2a/b DKO C7 and in duplicate for DKO A1 (Figure 1A), and the biological replicates were merged owing to high reproducibility (Figure S1A). On average, each CAGE-seq library generated more than 20 million mapped reads (Table S1). The high sequencing depth allowed us to capture thousands of transcription initiation events (19134 CAGE peaks/clusters) in mESCs corresponding to annotated transcription initiation sites, spurious initiation sites, and transcription from enhancers. CAGE peaks were categorized as annotated promoter, intragenic (exonic and intronic), or intergenic TSSs (see Transparent Methods). Although the majority (∼73%) of the reads and the consensus CAGE peaks (∼63%) corresponded to GENCODE annotated promoter-TSSs (Table S1 and Figure 1B), we detected thousands of initiation events (37% of CAGE peaks) that mapped to exonic, intronic, and intergenic regions (Figure 1B) in the wild-type and DKO mESCs.
To distinguish between spurious transcription initiation events and enhancers (eRNAs), all CAGE peaks were compared with ATAC-seq and H3K27ac ChIP-seq peaks, which mark active promoters and enhancers. As expected, the majority of annotated promoter CAGE peaks overlapped with regions of high chromatin accessibility as assayed by ATAC-seq and H3K27ac (Figures 1C and S1B). Approximately half of intergenic CAGE peaks overlapped with ATAC-seq peaks suggesting that these transcripts may correspond to eRNA production at enhancers (Figure 1C) (Andersson et al., 2014). The majority of exonic and intronic CAGE peaks did not overlap with enhancers (ATAC-seq and H3K27ac peaks) (Figures 1C and S1B) suggesting that the majority of intragenic TSSs are spurious transcription initiation events and not eRNAs.
We next examined the underlying sequence features of annotated promoter-TSSs and non-annotated spurious TSSs detected in wild-type E14 cells. As expected, canonical annotated promoter-TSSs are enriched in common core promoter elements (GC box, CCAAT box) and TF motifs commonly found at promoters (ETS, YY1, NRF) (Figure 1D). Spurious TSSs are less enriched for fewer common core promoter elements but still contain promoter-related motifs (DCE, GC box, ETS) compared with background (Figure 1E). CAGE peaks corresponding to spurious TSSs also tend to be much narrower than peaks at annotated promoter-TSSs (Figure S1C). These results suggest that by CAGE the majority of detectable peaks come from canonical eukaryotic promoters, but thousands of spurious transcription initiation events still arise from weak cryptic promoters even in wild-type mESCs.
To determine the effect of PWWP2A/B deletion in mESCs, we intersected all CAGE peaks from wild-type E14 with those from the DKO clones. The vast majority of CAGE peaks in wild-type and Pwwp2a/b DKO mESCs show extensive overlap (Figures 1F, S1D, and S1E), suggesting that PWWP2A/B loss does not have a significant impact on the number of transcription initiation sites in the genome. We next performed differential expression analysis to determine whether PWWP2A/B deletion would upregulate transcription initiation from existing spurious sites. TSS peaks that increase in expression in Pwwp2a/b DKO cells compared with wild-type are identified as “Up,” and peaks that decrease are “Down,” and we observed many intragenic TSSs being upregulated or induced in mESCs lacking PWWP2A/B. At intragenic TSSs, the number of Up sites is 14.8 and 2.2 times more than the number of Down sites for exonic and intronic TSSs, respectively (Figures 1G and Table S2). In particular, exonic TSSs show increased levels of spurious transcription initiation upon PWWP2A/B loss compared with wild-type mESCs (Figures 1F–1H and S2). Annotated TSS and intergenic TSS peaks did not show a skewed sensitivity to PWWP2A/B loss, with similar numbers of peaks increasing or decreasing in the level of transcription initiation (Figure 1G).
Spurious Sites Sensitive to PWWP2A/B Depletion Are Enriched at Intragenic Regions of Highly Expressed Genes
To further study the role of PWWP2A/B in regulating intragenic spurious initiation, we defined the subset of CAGE-seq peaks most sensitive to PWWP2A/B depletion. We removed intragenic TSS events that intersected with H3K27ac and ATAC-seq peaks, which may represent eRNAs, and classified the remaining 2,146 intragenic Up TSSs in both DKO clones as PWWP2-sensitive spurious TSSs (Figure 2A and Table S2). Owing to good correlation between the two clones A1 and C7 (Figures 2B and S3A), all downstream analysis was averaged for the two Pwwp2a/b DKO clones.
Figure 2.
Intragenic Spurious Transcription Initiation Sites Arise Predominantly from Highly Expressed Genes
(A) Transcription initiation expression value (TP10M) of all intragenic PWWP2-sensitive spurious initiation sites (Up events n = 2,146) in E14 and Pwwp2a/b DKO lines. See also Table S2.
(B) Scatterplot shows the correlation of PWWP2-sensitive spurious TSSs detected in Pwwp2a/b DKO lines A1 and C7. See also Figure S3.
(C) Percentage of annotated TSSs and PWWP2-sensitive spurious TSSs (n = 2,146) that intersect (dark green) or do not intersect (cyan) with CpG islands. The p value was calculated by a chi-square test using the frequency.
(D) Boxplot of the %GC content for annotated TSSs (n = 11,043) and PWWP2-sensitive spurious TSSs (longer than 10 nt, n = 1,224). The p value was calculated using the two-sided Mann-Whitney test.
(E) The top most enriched motifs in PWWP2-sensitive spurious TSS (blue bars) compared with annotated TSSs (gray bars), as detected by the Homer motif search analysis.
(F) Percentage of spurious PWWP2-sensitive and PWWP2-insensitive TSSs occurring in genes with high (approximately top 10%) or low PWWP2A binding as defined by ChIP-seq (Zhang et al., 2018).
(G) Percentage of spurious PWWP2-sensitive and PWWP2-insensitive TSSs arising from genes with High, Intermediate, Low, or No expression (obtained by sorting all genes by expression level in wild-type E14 mESCs and dividing into four equal groups). A majority of spurious events (both PWWP2-sensitive and insensitive) occur from highly expressed gene group, which a greater fraction of PWWP2-sensitive spurious TSS residing in the most highly expressed group. The p value was calculated by chi-square test using the frequency. See also Figure S4.
As discussed previously, motif analysis of the CAGE peaks for annotated and spurious TSSs in mESCs revealed that spurious TSSs are far less enriched for core promoter sequence motifs compared with annotated promoter-TSSs (Figures 1D and 1E). Furthermore, the majority (approximately 75%) of annotated promoter-TSSs overlapped with CpG islands and have high GC content (Figures 2C and 2D), whereas PWWP2-sensitive intragenic spurious transcription initiation peaks rarely overlap with CpG islands and are not GC-rich (Figures 2C and 2D). Lack of strong canonical promoter-like characteristics at spurious TSSs may explain the weaker transcription initiation capacity from these spurious intragenic cryptic promoter sites. Motif analysis comparing PWWP2-sensitive TSSs against annotated TSSs revealed that transcriptional repressors Tgif1/2 are significantly enriched at the flanking regions of PWWP2-sensitive spurious TSS in mESCs (Figure 2E) (Lee et al., 2015) and may suggest these cryptic promoters could be actively bound and repressed.
We next sought to characterize what type of genes were susceptible to PWWP2-sensitive spurious initiation. Gene ontology enrichment analysis revealed that genes harboring PWWP2-sensitive spurious TSSs are from a variety of biological pathways (Figure S4). Several RNA processing and surveillance pathways are also enriched, including RNA splicing, translation, and RNA nonsense-mediated decay (Figure S4). Previously we identified all genes bound by PWWP2A by ChIP-seq and classified them as being highly (9.4%) or lowly (90.6%) bound (Zhang et al., 2018). High-occupancy genes are more enriched for H3K36me3 than low-occupancy genes (Figure S3B). Here we see that all genes harboring intragenic spurious initiation sites are PWWP2A bound to some degree in mESCs, with a slightly greater proportion of PWWP2-sensitive spurious initiation events arising from high PWWP2A-occupied genes (Figure 2F). Strikingly, the greatest determinant correlated with the incidence of spurious transcription initiation is the level of gene expression, with the majority of spurious sites occurring within highly expressed genes (Figure 2G). Specifically, 88% of PWWP2A-sensitive spurious TSSs occur in the top 25% of genes ranked by expression (n = 1,108, see Transparent Methods) (Figure 2G). Together, these results indicate that PWWP2-sensitive spurious TSSs lack the underlying sequence features of canonical annotated promoters such as enrichment of many core promoter motifs, high GC and CpG island content, but contain weak promoter elements that are susceptible to spurious initiation events especially within highly expressed genes.
Loss of PWWP2A/B and DNMT3B Have Distinct Effects on Spurious Transcription Initiation
The DNA methyltransferase DNMT3B, like PWWP2A/B, is also selectively recruited to gene bodies through binding to H3K36me3 via its PWWP domain (Rondelet et al., 2016). A prior study found that loss of DNMT3B in mESCs leads to intragenic DNA hypomethylation and increased spurious transcription initiation events (Neri et al., 2017). Given that both of these pathways are downstream of H3K36me3 and contribute to regulating the epigenetic landscape at gene bodies, we sought to determine their relationship with one another. We applied our pipeline to analyze previously published data from Dnmt3b KO mESCs, which was performed using DECAP-seq, an alternative to CAGE-seq for profiling sites of transcription initiation. Consistent with the published study, we found that most DECAP TSSs correspond to Gencode annotated promoters (Figure 3A). We classified all DNMT3B TSSs following the terminology used in the previous DNMT3B study—TSSs absent in wild-type but present in Dnmt3b KO are categorized as “On,” TSSs present in wild-type cells but lost in Dnmt3b KO are “Off,” upregulated TSSs in Dnmt3b KO are “Up,” and downregulated TSSs in Dnmt3b KO are “Down” (Figures 3B and Table S3). Unlike in Pwwp2a/b DKO mESCs, where sites of spurious initiation almost perfectly overlap with wild-type but show different levels of transcription initiation, either increased (Up) or decreased (Down) (Figures 1G, S1D, and S1E), Dnmt3b KO mESCs showed many On and Off sites indicating that loss of DNMT3B-mediated DNA methylation led to a redistribution of spurious sites in mESCs (Figure 3B). Another distinction between the two pathways was that, Pwwp2a/b deletion led to preferential upregulation of intragenic spurious TSSs, whereas Dnmt3b deletion resulted in an equivalent number of Up and Down spurious sites.
Figure 3.
PWWP2A/B and DNMT3B Loss Have Different Effects on Spurious Transcription Initiation
(A) The percentage of consensus TSSs detected by DECAP-seq in wild-type and Dnmt3b KO mESCs from GSE72854 categorized as annotated promoter, exonic, intronic, intergenic TSSs.
(B) TSS peaks for wild-type and Dnmt3b KO cells categorized as Gencode annotated promoters, exonic, intronic, and intergenic TSSs and divided into four groups based on behavior: On (only present in Dnmt3b KO), Off (only present in wild-type), Up (upregulated in KO), and Down (downregulated in KO). See also Tables S1 and S3 and Figure S5.
(C) Overlap between PWWP2-sensitive spurious TSSs (n = 2,146) and DNMT3B-sensitive spurious TSSs (On n = 666 and Up = 383).
(D) Percentage of DNMT3B-sensitive spurious TSSs arising from genes with High, Intermediate, Low, or No expression (obtained by sorting all genes by expression level in wild-type E14 mESCs and dividing into four equal groups). The p values were calculated by chi-square test using the frequency.
(E) Distribution of the distance in base pairs of the closest PWWP2-sensitive spurious TSS relative to every DNMT3B-sensitive TSSs (Up and On). Dashed line indicates presence of a PWWP2-sensitive TSS within 100 bp.
We next identified DNMT3B-sensitive spurious TSSs as all intragenic On and Up TSSs after removal of sites that overlapped with ATAC-seq and H3K27ac peaks. The overlap between sites (genomic positions) of PWWP2-sensitive and DNMT3B-sensitive spurious initiation is very low (Figure 3C). However, like PWWP2-sensitive sites, DNMT3B-sensitive spurious initiation also occurs predominantly at highly expressed genes (Figure 3D) and displays similar promoter width as well as CpG Island and GC content as PWWP2-sensitive spurious TSSs (Figures 2C, 2D, and S5A–S5C). Despite both DNMT3B-sensitive and PWWP2-sensitive spurious residing within highly expressed genes, there is little overlap between sites (Figure 3C) with the vast majority of DNMT3B-sensitive spurious TSS more than 100 bp away from the closest neighboring PWWP2-sensitive spurious TSS (Figure 3E).
Transcription and Chromatin Signatures Underlying Spurious Transcription Initiation
We next explored the dynamics of Pol II binding, nascent transcription, and histone acetylation at PWWP2-sensitive and DNMT3B-sensitive spurious transcription initiation sites. Canonical transcription initiation sites are enriched for initiated RNA Polymerase II phosphorylated at serine 5 (RNA Pol II Ser5P) surrounding the TSS. Pol II Ser5P binding thus serves as an orthogonal approach to examine sites of spurious transcription initiation (Phatnani and Greenleaf, 2006). To reduce noise from neighboring peaks, only single peaks in a 2-kb window were considered for downstream analysis for both the annotated TSSs and spurious TSSs. By comparing Pol II Ser5P ChIP-seq in E14 and Pwwp2a/b DKO cells, we found that Pol II Ser5P signal slightly peaks just downstream of the spurious TSSs resembling annotated promoter-TSSs (Figure 4A), although at a much lower levels (Figures 4A and 4B). Pol II Ser5P levels are also increased in Pwwp2a/b DKO when compared with E14 cells (Figure 4B). These intragenic spurious initiation sites also showed a local increase of nascent RNA transcription compared with background as detected by 4sU nascent RNA-seq in wild-type E14 and Pwwp2a/b DKO lines (Figure 4B). In contrast to annotated TSSs, which display a strong asymmetric pattern around the peak indicative of a bias toward the sense direction and weak antisense transcription, the nascent RNA transcription around spurious transcription initiation sites is more balanced around the peak suggesting that the transcription driven by spurious TSSs is weakly expressed bidirectionally or may not produce long RNA products (Figure 4B).
Figure 4.
Elevated RNAPII and Histone Acetylation at Spurious Transcription Initiation Sites in Pwwp2a/b DKO mESCs
(A) Metagene profiles of RNA Pol II Ser5P, 4sU-nascent transcription, H3K9ac, and H3K27ac at annotated TSSs ± 1 kb (n = 8,164) of wild-type and Pwwp2a/b DKO mESCs.
(B) Metagene profiles RNA Pol II Ser5P, 4sU-nascent transcription, H3K9ac, and H3K27ac at the PWWP2-sensitive spurious TSSs ± 1 kb. Only TSSs that are 2 kb away from each other were considered (n = 1,869).
(C) PWWP2-sensitive spurious TSSs ranked by the level of upregulation in Pwwp2a/b DKO compared with wild-type. Only sites 2 kb away from each other were considered (n = 1,869). Bottom group: spurious sites that show small increase in CAGE expression upon PWWP2A/B loss (cyan). Top group: spurious sites that show large increase in CAGE expression upon PWWP2A/B loss (red). (Top and Bottom group: x = 326).
(D) Boxplots of the distribution of relative differences of the Top and Bottom PWWP2-sensitive spurious sites with respect to the level of Pol II Ser5P, nascent transcription, H3K9ac and H3K27ac. p Values were calculated and using the two-sided Mann-Whitney test between Pwwp2a/B DKO and wild-type E14 mESCs.
(E) DNMT3B-sensitive spurious TSSs ranked by the level of upregulation in Dnmt3b KO compared with wild-type. Only TSSs 2 kb away from annotated TSS and 100 bp away from PWWP2-sensitive TSSs were considered (n = 806). Red and cyan indicate top (most sensitive) and bottom group (least sensitive) sites, respectively (x = 145, the same proportion as in C).
(F) Boxplots of the distribution of relative differences between Pwwp2a/b DKO and E14 for Top and Bottom groups DNMT3B-spurious sites from (E) with respect to the level of Pol II Ser5P, nascent transcription, H3K9ac and H3K27ac. p Values were calculated and using the two-sided Mann-Whitney test Pwwp2a/B DKO and wild-type E14 mESCs. See also Figure S6.
Acetylation of H3K27 and H3K9 is enriched at active genes, peaking around the TSS. Previously we showed that loss of PWWP2A/B HDAC complex leads to increased histone acetylation at active gene promoters and gene bodies (Zhang et al., 2018). We assessed the pattern of H3K27ac and H3K9ac at PWWP2-sensitive spurious TSSs defined in this study and found the signature resembles that of annotated TSSs, with a trough at the spurious TSS and peaks at the +1 and −1 nucleosome positions (Figures 4A and 4B). Spurious TSSs show increased acetylation levels in Pwwp2a/b DKO cells compared with wild-type mESCs (Figures 4B and S6B). We calculated the difference in RNA Pol II Ser5P, 4SU-seq, H3K9ac, and H3K27ac levels between Pwwp2a/b DKO and wild-type E14 and found all of them to be enriched around the spurious TSSs in the DKO (Figures S6A and S6B).
We divided the PWWP2-sensitive spurious TSSs sites into two groups based on the degree of increase in spurious transcription initiation compared with wild-type, “Top” (large increase in DKO compared with E14) and “Bottom” (small increase in DKO compared with E14), using TP10M of each CAGE peak (Figure 4C). There are significantly higher levels of initiating Pol II Ser5P, nascent RNA transcription, and H3K9ac at Top sites compared with Bottom sites (Figure 4D). This suggests that increased histone acetylation upon PWWP2A/B loss correlates with increased initiation and transcription from spurious sites, and the degree of change correlates with sensitivity to PWWP2A/B loss. In parallel we performed the same analysis for DNMT3B-sensitive spurious TSSs and found that spurious sites more (Top) or less (Bottom) sensitive to DNMT3B loss did not show differences in the level of initiating Pol II, nascent RNA production, or histone acetylation (Figures 4E, 4F, S6C, and S6D). These features, which explain sensitivity to PWWP2A/B loss, do not explain differences between spurious sites arising from DNMT3B depletion.
In summary, our analysis of the chromatin environment at spurious TSSs show that these spurious initiation sites show increased recruitment of initiating Pol II Ser5P, elevated levels of nascent transcription, and histone acetylation relative to the surrounding background. Additionally, upon PWWP2A/B loss, increased levels of spurious initiation are correlated with elevated histone acetylation and increased engaged RNA Pol II Ser5P and nascent transcription compared with wild-type. Although PWWP2A/B appears to regulate spurious transcription initiation at the level of histone acetylation and recruitment of initiating Pol II Ser5P, the DNMT3B pathway may regulate spurious initiation through altering binding of DNA-methylation sensitive factors (as discussed in Neri et al., 2017).
Discussion
Transcription initiation is a highly coordinated process that establishes accurate patterns of gene expression and ensures the production of functional transcripts. Here we describe how cross talk between two epigenetic pathways H3K36me3 and histone acetylation suppresses spurious transcription initiation from the gene bodies of active genes. H3K36me3 is deposited co-transcriptionally with the elongating RNA Pol II and exclusively marks the gene bodies of actively transcribed genes (Kizer et al., 2005). Acetylation of H3K9 and H3K27 is enriched at active promoter regions and regulates RNA Pol II initiation and pause release (Gates et al., 2017; Stasevich et al., 2014). Here we describe a pathway in metazoans, in which H3K36me3 recruitment of the PWWP2A/B HDAC complex represses spurious transcription initiation from the gene bodies of actively transcribed genes. This prevents RNA Pol II from initiating at cryptic promoters within coding regions and the production of aberrant transcripts, which could be harmful to cellular homeostasis.
PWWP2A/B binds H3K36me3 through a C-terminal PWWP domain, a module present in many other H3K36me3-binding proteins (Qin and Min, 2014; Vermeulen et al., 2010; Vezzoli et al., 2010; Wen et al., 2014). Previously we and others showed that PWWP2A/B forms a stable complex HDAC1/2 and MTA1/2/3 (Link et al., 2018; Zhang et al., 2018). In this study, we find that loss of PWWP2A/B in mESCs leads to increases in the level of spurious transcription initiation, especially over intragenic gene body regions.
Similar to studies in S. cerevisiae, we find that, although spurious transcripts are lowly expressed, they are highly prevalent in number within gene bodies in mESCs (Lu and Lin, 2019; Neil et al., 2009). Spurious initiation sites contain some core promoter elements like the GC box, DCE, and ETS motifs but are far less enriched for these motifs than canonical eukaryotic promoters (Haberle and Stark, 2018; Hollenhorst et al., 2004; Lee et al., 2005; Roy and Singer, 2015). Additionally, spurious initiation sites also lack the high GC and CpG island content associated with eukaryotic promoters (Deaton and Bird, 2011; Haberle and Stark, 2018; Roy and Singer, 2015). The vast majority of spurious transcription initiate from highly expressed genes, and these sites are marked by initiating Pol II Ser5P, which increases further upon PWWP2A/B loss. The increased level of spurious transcripts at highly expressed genes may reflect increased levels of hyperacetylation, increased chromatin accessibility and nucleosome turnover, and a high local concentration of coactivators and transcriptional machinery, which together lead to binding and initiation of RNA Pol II within intragenic sites. Our data support the model where promiscuous initiation by Pol II occurs more frequently at highly expressed genes and that spurious transcription initiation is an inevitable by-product of transcription (Wade and Grainger, 2018). Multiple mechanisms besides the H3K36me3-HDAC pathway have been found to play a role in countering spurious initiation and the production of spurious transcripts (Gouot et al., 2018; Neri et al., 2017; Scandaglia et al., 2017), with intragenic DNA methylation by DNMT3B as another pathway downstream of H3K36me3 that suppresses spurious transcription initiation.
We find that loss of the PWWP2A/B HDAC complex has a different profile than loss of DNMT3B, and PWWP2-sensitive and DNMT3B-sensitive spurious TSSs are largely non-overlapping. The only features common to both pathways was that intragenic spurious transcription initiate predominantly from the highly expressed genes. Our data suggest that PWWP2A/B suppresses transcription initiation through regulating histone acetylation and Pol II initiation, whereas DNA methylation may regulate the binding of methylation-sensitive TFs to suppress initiation at intragenic sites. Although PWWP2A/B loss leads to increased levels of initiation from pre-existing intragenic sites found in mESCs, loss of DNMT3B leads to redistribution of sites from which spurious transcripts initiate. It appears that in mammalian organisms these two epigenetic mechanisms cooperate downstream of H3K36me3 to control transcription fidelity by ensuring that initiation occurs predominantly at strong promoter regions and is actively suppressed from intragenic regions.
Limitations of the Study
There are two points that are outside the scope of this study and will require more experimental work to fully elucidate, these being whether PWWP2A and PWWP2B have redundant functions and the role of the PWWP2A/B-HDAC complex at intergenic regions.
-
1.
Overlapping and paralog-specific differences of PWWP2A and PWWP2B
In this work, we used mESCs where both the Pwwp2a and Pwwp2b genes have been knocked out. Our justification for using the double knockout comes from biochemical characterization of the complex in our previous work (Zhang et al., 2018), which shows that two copies of PWWP2A bind to the MTA1-dimer (stoichiometry of 2:2), PWWP2A and PWWP2B can coexist in the same complex, and the PWWP domains have the same affinity for H3K36me3. However, despite high sequence conservation, there are differences in the domain composition of the two paralogs (see Figure S7) with PWWP2B lacking the N-terminal proline-rich region present in PWWP2A. Both paralogs are expressed in mESCs but could have differences in tissue-specific expression and functions. Further work is needed to address whether there are paralog-specific differences in this family of proteins.
-
2.
Intergenic regions that show sensitivity to PWWP2A/B loss
Our characterization of spurious transcription initiation focused predominantly on intragenic spurious TSSs, which were sensitive to PWWP2A/B deletion. Although intragenic spurious TSSs were much more sensitive to loss of PWWP2A/B, some intergenic spurious TSSs were also induced. These intergenic sites are not enriched for H3K36me3, so we cannot subscribe these events to H3K36me3-mediated recruitment of PWWP2A/B. Aside from the H3K36me3-binding PWWP domain, there is a H2A.Z-binding domain, which could recruit PWWP2A/B to chromatin, but whether this could explain intergenic spurious TSSs events remains to be examined.
Resource Availability
Lead Contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the corresponding author, Tianyi Zhang (tzhang17@ic.ac.uk).
Materials Availability
Materials and protocols used in this study are available from the authors upon request. This study did not generate new unique reagents.
Data and Code Availability
CAGE-seq and processed bigwig files generated in this study have been deposited in Gene Expression Omnibus (GEO) under GSE148382. Code used in this study has been deposited in github, https://github.com/guifengwei/PWWP2-CAGE-seq.
Methods
All methods can be found in the accompanying Transparent Methods supplemental file.
Acknowledgments
Work in the Brockdorff lab is supported by the Wellcome Trust (grant no. 215513). T.Z. was supported by NSERC and the Clarendon Scholarship in Oxford and is now supported by the Sir Henry Wellcome Postdoctoral Fellowship (grant no. 215933). We thank Amanda Williams from Oxford Zoology for Next-seq sequencing and Joseph Bowness and Mafalda Almeida for discussion and critical reading of the manuscript.
Author Contributions
T.Z. conceived and performed the experiments. G.W. performed all the bioinformatics analysis and generated all the figures. G.W. and T.Z. wrote and revised the manuscript with input from N.B. who supervised the whole study.
Declaration of Interests
The authors declare no competing interests.
Published: November 20, 2020
Footnotes
Supplemental Information can be found online at https://doi.org/10.1016/j.isci.2020.101741.
Contributor Information
Neil Brockdorff, Email: neil.brockdorff@bioch.ox.ac.uk.
Tianyi Zhang, Email: tzhang17@ic.ac.uk.
Supplemental Information
References
- Andersson R., Gebhard C., Miguel-Escalada I., Hoof I., Bornholdt J., Boyd M., Chen Y., Zhao X., Schmidl C., Suzuki T. An atlas of active enhancers across human cell types and tissues. Nature. 2014;507:455–461. doi: 10.1038/nature12787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bannister A.J., Schneider R., Myers F.A., Thorne A.W., Crane-Robinson C., Kouzarides T. Spatial distribution of di- and tri-methyl lysine 36 of histone H3 at active genes. J. Biol. Chem. 2005;280:17732–17736. doi: 10.1074/jbc.M500796200. [DOI] [PubMed] [Google Scholar]
- Bornelov S., Reynolds N., Xenophontos M., Gharbi S., Johnstone E., Floyd R., Ralser M., Signolet J., Loos R., Dietmann S. The nucleosome remodeling and deacetylation complex modulates chromatin structure at sites of active transcription to fine-tune gene expression. Mol. Cell. 2018;71:56–72 e54. doi: 10.1016/j.molcel.2018.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brocks D., Schmidt C.R., Daskalakis M., Jang H.S., Shah N.M., Li D., Li J., Zhang B., Hou Y., Laudato S. DNMT and HDAC inhibitors induce cryptic transcription start sites encoded in long terminal repeats. Nat. Genet. 2017;49:1052–1060. doi: 10.1038/ng.3889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carrozza M.J., Li B., Florens L., Suganuma T., Swanson S.K., Lee K.K., Shia W.J., Anderson S., Yates J., Washburn M.P. Histone H3 methylation by Set2 directs deacetylation of coding regions by Rpd3S to suppress spurious intragenic transcription. Cell. 2005;123:581–592. doi: 10.1016/j.cell.2005.10.023. [DOI] [PubMed] [Google Scholar]
- Deaton A.M., Bird A. CpG islands and the regulation of transcription. Genes Dev. 2011;25:1010–1022. doi: 10.1101/gad.2037511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dhayalan A., Rajavelu A., Rathert P., Tamas R., Jurkowska R.Z., Ragozin S., Jeltsch A. The Dnmt3a PWWP domain reads histone 3 lysine 36 trimethylation and guides DNA methylation. J. Biol. Chem. 2010;285:26114–26120. doi: 10.1074/jbc.M109.089433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fort A., Hashimoto K., Yamada D., Salimullah M., Keya C.A., Saxena A., Bonetti A., Voineagu I., Bertin N., Kratz A. Deep transcriptome profiling of mammalian stem cells supports a regulatory role for retrotransposons in pluripotency maintenance. Nat. Genet. 2014;46:558–566. doi: 10.1038/ng.2965. [DOI] [PubMed] [Google Scholar]
- Gates L.A., Shi J., Rohira A.D., Feng Q., Zhu B., Bedford M.T., Sagum C.A., Jung S.Y., Qin J., Tsai M.J. Acetylation on histone H3 lysine 9 mediates a switch from transcription initiation to elongation. J. Biol. Chem. 2017;292:14456–14472. doi: 10.1074/jbc.M117.802074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gouot E., Bhat W., Rufiange A., Fournier E., Paquet E., Nourani A. Casein kinase 2 mediated phosphorylation of Spt6 modulates histone dynamics and regulates spurious transcription. Nucleic Acids Res. 2018;46:7612–7630. doi: 10.1093/nar/gky515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haberle V., Stark A. Eukaryotic core promoters and the functional basis of transcription initiation. Nat. Rev. Mol. Cell Biol. 2018;19:621–637. doi: 10.1038/s41580-018-0028-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hollenhorst P.C., Jones D.A., Graves B.J. Expression profiles frame the promoter specificity dilemma of the ETS family of transcription factors. Nucleic Acids Res. 2004;32:5693–5702. doi: 10.1093/nar/gkh906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang C., Zhu B. Roles of H3K36-specific histone methyltransferases in transcription: antagonizing silencing and safeguarding transcription fidelity. Biophys. Rep. 2018;4:170–177. doi: 10.1007/s41048-018-0063-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joshi A.A., Struhl K. Eaf3 chromodomain interaction with methylated H3-K36 links histone deacetylation to Pol II elongation. Mol. Cell. 2005;20:971–978. doi: 10.1016/j.molcel.2005.11.021. [DOI] [PubMed] [Google Scholar]
- Keogh M.C., Kurdistani S.K., Morris S.A., Ahn S.H., Podolny V., Collins S.R., Schuldiner M., Chin K., Punna T., Thompson N.J. Cotranscriptional set2 methylation of histone H3 lysine 36 recruits a repressive Rpd3 complex. Cell. 2005;123:593–605. doi: 10.1016/j.cell.2005.10.025. [DOI] [PubMed] [Google Scholar]
- Kizer K.O., Phatnani H.P., Shibata Y., Hall H., Greenleaf A.L., Strahl B.D. A novel domain in Set2 mediates RNA polymerase II interaction and couples histone H3 K36 methylation with transcript elongation. Mol. Cell. Biol. 2005;25:3305–3316. doi: 10.1128/MCB.25.8.3305-3316.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee B.K., Shen W., Lee J., Rhee C., Chung H., Kim K.Y., Park I.H., Kim J. Tgif1 counterbalances the activity of core pluripotency factors in mouse embryonic stem cells. Cell Rep. 2015;13:52–60. doi: 10.1016/j.celrep.2015.08.067. [DOI] [PubMed] [Google Scholar]
- Lee D.H., Gershenzon N., Gupta M., Ioshikhes I.P., Reinberg D., Lewis B.A. Functional characterization of core promoter elements: the downstream core element is recognized by TAF1. Mol. Cell. Biol. 2005;25:9674–9686. doi: 10.1128/MCB.25.21.9674-9686.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J., Ahn J.H., Wang G.G. Understanding histone H3 lysine 36 methylation and its deregulation in disease. Cell. Mol. Life Sci. 2019;76:2899–2916. doi: 10.1007/s00018-019-03144-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Link S., Spitzer R.M.M., Sana M., Torrado M., Volker-Albert M.C., Keilhauer E.C., Burgold T., Punzeler S., Low J.K.K., Lindstrom I. PWWP2A binds distinct chromatin moieties and interacts with an MTA1-specific core NuRD complex. Nat. Commun. 2018;9:4300. doi: 10.1038/s41467-018-06665-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu Z., Lin Z. Pervasive and dynamic transcription initiation in Saccharomyces cerevisiae. Genome Res. 2019;29:1198–1210. doi: 10.1101/gr.245456.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo M., Ling T., Xie W., Sun H., Zhou Y., Zhu Q., Shen M., Zong L., Lyu G., Zhao Y. NuRD blocks reprogramming of mouse somatic cells into pluripotent stem cells. Stem Cells. 2013;31:1278–1286. doi: 10.1002/stem.1374. [DOI] [PubMed] [Google Scholar]
- Mor N., Rais Y., Sheban D., Peles S., Aguilera-Castrejon A., Zviran A., Elinger D., Viukov S., Geula S., Krupalnik V. Neutralizing gatad2a-chd4-mbd3/NuRD complex facilitates deterministic induction of naive pluripotency. Cell Stem Cell. 2018;23:412–425.e10. doi: 10.1016/j.stem.2018.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Neil H., Malabat C., d'Aubenton-Carafa Y., Xu Z., Steinmetz L.M., Jacquier A. Widespread bidirectional promoters are the major source of cryptic transcripts in yeast. Nature. 2009;457:1038–1042. doi: 10.1038/nature07747. [DOI] [PubMed] [Google Scholar]
- Neri F., Rapelli S., Krepelova A., Incarnato D., Parlato C., Basile G., Maldotti M., Anselmi F., Oliviero S. Intragenic DNA methylation prevents spurious transcription initiation. Nature. 2017;543:72–77. doi: 10.1038/nature21373. [DOI] [PubMed] [Google Scholar]
- Phatnani H.P., Greenleaf A.L. Phosphorylation and functions of the RNA polymerase II CTD. Genes Dev. 2006;20:2922–2936. doi: 10.1101/gad.1477006. [DOI] [PubMed] [Google Scholar]
- Qin S., Min J. Structure and function of the nucleosome-binding PWWP domain. Trends Biochem. Sci. 2014;39:536–547. doi: 10.1016/j.tibs.2014.09.001. [DOI] [PubMed] [Google Scholar]
- Reynolds N., Latos P., Hynes-Allen A., Loos R., Leaford D., O'Shaughnessy A., Mosaku O., Signolet J., Brennecke P., Kalkan T. NuRD suppresses pluripotency gene expression to promote transcriptional heterogeneity and lineage commitment. Cell Stem Cell. 2012;10:583–594. doi: 10.1016/j.stem.2012.02.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rondelet G., Dal Maso T., Willems L., Wouters J. Structural basis for recognition of histone H3K36me3 nucleosome by human de novo DNA methyltransferases 3A and 3B. J. Struct. Biol. 2016;194:357–367. doi: 10.1016/j.jsb.2016.03.013. [DOI] [PubMed] [Google Scholar]
- Roy A.L., Singer D.S. Core promoters in transcription: old problem, new insights. Trends Biochem. Sci. 2015;40:165–171. doi: 10.1016/j.tibs.2015.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scandaglia M., Lopez-Atalaya J.P., Medrano-Fernandez A., Lopez-Cascales M.T., Del Blanco B., Lipinski M., Benito E., Olivares R., Iwase S., Shi Y. Loss of Kdm5c causes spurious transcription and prevents the fine-tuning of activity-regulated enhancers in neurons. Cell Rep. 2017;21:47–59. doi: 10.1016/j.celrep.2017.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shimbo T., Du Y., Grimm S.A., Dhasarathy A., Mav D., Shah R.R., Shi H., Wade P.A. MBD3 localizes at promoters, gene bodies and enhancers of active genes. PLoS Genet. 2013;9:e1004028. doi: 10.1371/journal.pgen.1004028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stasevich T.J., Hayashi-Takanaka Y., Sato Y., Maehara K., Ohkawa Y., Sakata-Sogawa K., Tokunaga M., Nagase T., Nozaki N., McNally J.G. Regulation of RNA polymerase II activation by histone acetylation in single living cells. Nature. 2014;516:272–275. doi: 10.1038/nature13714. [DOI] [PubMed] [Google Scholar]
- Takahashi H., Lassmann T., Murata M., Carninci P. 5' end-centered expression profiling using cap-analysis gene expression and next-generation sequencing. Nat. Protoc. 2012;7:542–561. doi: 10.1038/nprot.2012.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tian W., Yan P., Xu N., Chakravorty A., Liefke R., Xi Q., Wang Z. The HRP3 PWWP domain recognizes the minor groove of double-stranded DNA and recruits HRP3 to chromatin. Nucleic Acids Res. 2019;47:5436–5448. doi: 10.1093/nar/gkz294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Nuland R., van Schaik F.M., Simonis M., van Heesch S., Cuppen E., Boelens R., Timmers H.M., van Ingen H. Nucleosomal DNA binding drives the recognition of H3K36-methylated nucleosomes by the PSIP1-PWWP domain. Epigenetics Chromatin. 2013;6:12. doi: 10.1186/1756-8935-6-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vermeulen M., Eberl H.C., Matarese F., Marks H., Denissov S., Butter F., Lee K.K., Olsen J.V., Hyman A.A., Stunnenberg H.G. Quantitative interaction proteomics and genome-wide profiling of epigenetic histone marks and their readers. Cell. 2010;142:967–980. doi: 10.1016/j.cell.2010.08.020. [DOI] [PubMed] [Google Scholar]
- Vezzoli A., Bonadies N., Allen M.D., Freund S.M., Santiveri C.M., Kvinlaug B.T., Huntly B.J., Gottgens B., Bycroft M. Molecular basis of histone H3K36me3 recognition by the PWWP domain of Brpf1. Nat. Struct. Mol. Biol. 2010;17:617–619. doi: 10.1038/nsmb.1797. [DOI] [PubMed] [Google Scholar]
- Wade J.T., Grainger D.C. Spurious transcription and its impact on cell function. Transcription. 2018;9:182–189. doi: 10.1080/21541264.2017.1381794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagner E.J., Carpenter P.B. Understanding the language of Lys36 methylation at histone H3. Nat. Rev. Mol. Cell Biol. 2012;13:115–126. doi: 10.1038/nrm3274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wen H., Li Y., Xi Y., Jiang S., Stratton S., Peng D., Tanaka K., Ren Y., Xia Z., Wu J. ZMYND11 links histone H3.3K36me3 to transcription elongation and tumour suppression. Nature. 2014;508:263–268. doi: 10.1038/nature13045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu C., Cui G., Botuyan M.V., Mer G. Structural basis for the recognition of methylated histone H3K36 by the Eaf3 subunit of histone deacetylase complex Rpd3S. Structure. 2008;16:1740–1750. doi: 10.1016/j.str.2008.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zaghi M., Broccoli V., Sessa A. H3K36 methylation in neural development and associated diseases. Front. Genet. 2019;10:1291. doi: 10.3389/fgene.2019.01291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang T., Wei G., Millard C.J., Fischer R., Konietzny R., Kessler B.M., Schwabe J.W.R., Brockdorff N. A variant NuRD complex containing PWWP2A/B excludes MBD2/3 to regulate transcription at active genes. Nat. Commun. 2018;9:3798. doi: 10.1038/s41467-018-06235-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
CAGE-seq and processed bigwig files generated in this study have been deposited in Gene Expression Omnibus (GEO) under GSE148382. Code used in this study has been deposited in github, https://github.com/guifengwei/PWWP2-CAGE-seq.