Abstract
Although it is well established that the Polycomb Group (PcG) complexes maintain gene repression through the incorporation of H2AK121ub and H3K27me3, little is known about the effect of these modifications on chromatin accessibility, which is fundamental to understand PcG function. Here, by integrating chromatin accessibility, histone marks and expression analyses in different Arabidopsis PcG mutants, we show that PcG function regulates chromatin accessibility. We find that H2AK121ub is associated with a less accessible but still permissive chromatin at transcriptional regulation hotspots. Accessibility is further reduced by EMF1 acting in collaboration with PRC2 activity. Consequently, H2AK121ub/H3K27me3 marks are linked to inaccessible although responsive chromatin. In contrast, only-H3K27me3-marked chromatin is less responsive, indicating that H2AK121ub-marked hotspots are required for transcriptional responses. Nevertheless, despite the loss of PcG activities leads to increased chromatin accessibility, this is not necessarily accompanied by transcriptional activation, indicating that accessible chromatin is not always predictive of gene expression.
Subject terms: DNA methylation, Cell fate, Plant molecular biology
Polycomb Group complexes maintain gene repression through the incorporation of H2AK121ub and H3K27me3. Here, the authors show that H2AK121ub marks less accessible but transcriptionally permissive chromatin, while H3K27me3 enforces a repressed transcriptionally less-permissive state.
Introduction
PcG complexes maintain gene repression by incorporating histone modifications within chromatin1–3. While PcG repressive complex 1 (PRC1) has histone 2A E3 ubiquitin ligase activity4–8, PRC2 has histone 3 lysine 27 (H3K27) trimethyltransferase activity9–12. Nevertheless, despite the important role of these chromatin marks in regulating transcription in eukaryotes, their effect on chromatin is not yet clear. Several pieces of evidence suggested that animal PcG proteins mediate chromatin compaction13–15. However, a recent report proposed that although PcG occupied promoters exhibit reduced accessibility, this does not rely on PcG proteins. Instead, PRC1 plays a role in increasing nucleosome occupancy and decreasing nucleosomal spacing16.
In Arabidopsis thaliana (Arabidopsis), the composition of PcG complexes is still a matter of debate. While PRC2 core components are well conserved to their animal counterparts and the catalytic activity is mainly carried out by the EZ homologs CURLY LEAF (CLF) and SWINGER (SWN) during sporophyte development3,17, the identity of PRC1 is less clear. Despite the H2A monoubiquitination module is formed by the Polycomb Group ring finger (PCGF) and RING1 homologs BMI1 (A, B, or C) and RING1 (A or B)2,18, association of the plant-specific proteins EMBRYONIC FLOWER 1 (EMF1) and LIKE-HETEROCHROMATIN PROTEIN 1 (LHP1) to this complex has turned to be controversial. EMF1 was proposed to be a PRC1 component due to its ability to in vitro mediate chromatin compaction as Drosophila melanogaster (Drosophila) PRC1 component Posterior Sex Combs (Psc) does19,20. However, recent data showed that EMF1 co-purifies with PRC2 (refs. 21–23) and is required for H3K27me3 marking19,24,25. Likewise, LHP1 was proposed to perform similar function to Drosophila PRC1 component Polycomb (Pc)26,27; however, LHP1 also co-purifies with PRC2 (refs. 21–23) and is involved in maintenance/spreading of H3K27me3 (ref. 21) but dispensable for H2AK121ub marking28. Thus, these two proteins actually seem to be PRC2-associated component.
Interestingly, although H2AK121ub and H3K27me3 marks often co-localize at genes (H2AK121ub/H3K27me3-marked genes), H3K27me3-marked peaks are generally much longer than that of H2AK121ub28. Accordingly, H3K27me3 marks usually cover the complete gene region, whereas H2AK121ub marks are enriched at the region surrounding the transcriptional start site (TSS)28. In addition, there are also genes marked with only-H2AK121ub or H3K27me3 (ref. 28) (only-H2AK121ub and only-H3K27me3, respectively), indicating that these marks may play independent roles.
H3K27me3 marks in Arabidopsis have been linked to regions displaying locally reduced DNA accessibility29. According to this, the promoters of expressed genes display high chromatin accessibility and are depleted of H3K27me3 marks30,31. However, it is not clear whether H3K27me3 has a direct role in regulating chromatin accessibility. On the other hand, the role of PRC1-mediated H2AK121ub is far from clear. Although several data support a repressive role of this modification6,8,28, it has been proposed to be associated with gene responsiveness and its repressive function seems to require PRC2 recruitment32. In any case, nothing is known about a possible role of H2AK121ub in regulating chromatin accessibility in plants.
In this work, by integrating chromatin accessibility, histone marks, and expression analyses in different Arabidopsis PcG mutants, we show that H2AK121ub marks associate with a less accessible but permissive chromatin at transcriptional regulation hotspots, which are sites enriched for the binding of transcription factors (TFs). Chromatin accessibility can be further reduced by EMF1 and PRC2 activity. However, while H2AK121ub/H3K27me3-mediated inaccessible chromatin is still transcriptionally responsive, only-H3K27me3 marked chromatin is less responsive.
Results
H2AK121ub hallmarks hotspots for transcriptional regulation
To investigate a possible role of H3K27me3 and H2AK121ub in regulating chromatin accessibility, we first performed Assay for Transposase-Accessible Chromatin sequencing (ATAC-seq) in wild-type Col-0 (WT) and different PcG mutants at 10 days after germination (DAG). We selected bmi1abc, clf28swn7, and emf1-2 severe mutants8,19,33 that cannot undergo the transition from embryonic to vegetative development after germination in the aerial part of the seedling. However, unlike bmi1abc and clf28swn7, which in addition have a stunted root displaying embryonic traits, emf1-2 is able to develop a WT-like root (Supplementary Fig. 1). We in addition included ring1ab weak and lhp1 mutants that are able to develop vegetative and floral organs in spite of several developmental alterations34,35 (Supplementary Fig. 1).
Principal component analysis (PCA) of ATAC-seq data revealed that WT, ring1ab, and lhp1 clustered together, whereas clf28swn7 and emf1-2 on one side, and bmi1abc on the other, constituted two distant and distinct clusters, indicating differences between them (Fig. 1a). We next identified Tn5 hypersensitive sites (THSs) for each genotype as those genomic regions exhibiting a significant accumulation of signal from the corresponding ATAC-seq data in the two different replicates. We found that the number of THSs in WT was lower than that in the different mutants (11,351 in WT, 12,803 in lhp1, 13,829 in ring1ab, 13,472 in emf1-2, 17,162 in clf28swn7, and 17,420 in bmi1abc; Supplementary Fig. 2). In addition, most of WT THSs were also accessible regions in mutants, indicating that loss-of-function of these PcG proteins leads to emergence of new THSs (Supplementary Fig. 2). To compare the contribution of the different mutants in increasing chromatin accessibility, THSs from the different genotypes were merged together into a list of consensus THSs. The list consisted in highly confident consensus THSs exhibiting an accessibility signal greater than 3 CPM with a q-value <0.05 in at least one of the genotypes (17,372; Supplementary Data 1). Average accessibility signal at consensus THSs was much higher in severe mutants than in ring1ab and lhp1, in which the signal was similar to WT (Fig. 1b). Consensus THSs were mostly localized within the 1 kb region upstream of the TSS (Fig. 1c). Accordingly, we found an accessibility signal peak at the region prior to the TSS in all genotypes (Fig. 1d). We then analyzed the average accessibility signal at WT THSs in the different genotypes (Fig. 1e). Interestingly, the embryonic mutants bmi1abc, clf28swn7, and emf1-2 showed higher accessibility signals at this subset of THSs than the genotypes that develop vegetative tissues, as is the case of WT, ring1ab, and lhp1 mutants. This is consistent with the fact that undifferentiated embryonic cells tend to have a more accessible chromatin state than differentiated cells36–38.
We found that consensus THSs co-localized with regions highly enriched in transcription factor (TF)-binding sites (enrichment = 7.5, p value <10−4; Fig. 1f) as determined by genome-wide identification of the binding sites of 100 TFs (Supplementary Data 2), supporting that THSs indicate the presence of transcriptional regulation hotspots30,36,39. According to this, a high number of transcriptional regulation hotspots that were inaccessible in WT became accessible in one or more mutants (Fig. 1f).
Nevertheless, the emergence of new THSs in mutants could be the result of a direct or indirect consequence of loss of PcG regulation. Therefore, we compared the genome-wide distribution of H2AK121ub and H3K27me3 marks in WT to that of consensus THSs, genotype-specific THSs, and TFs binding sites (Fig. 2a). We found that consensus THSs tend to associate with PcG-marked regions (Fig. 2a and Supplementary Fig. 3). Intriguingly, around 80% of consensus THSs showed an H2AK121ub peak in its vicinity (≤2 kb distance from THS) (Fig. 2a, b); furthermore, 50% of consensus THSs overlapped or had an H2AK121ub peak at less than 100 bp away (Fig. 2b). In contrast, this was not so evident for H3K27me3, as only around 40% of consensus THSs showed an H3K27me3 peak within the next 2 kb (Fig. 2b) and 96% of these regions co-localized with H2AK121ub (Fig. 2c). All together, these results strongly suggest that the loss of PcG function plays a role in increasing accessibility at transcriptional regulation hotspots; moreover, that H2AK121 monoubiquitination hallmarks these hotspots.
H2AK121ub and H3K27me3 are differentially affected in PcG mutants
To investigate the role of H3K27me3 and H2AK121ub marks in regulating chromatin accessibility, we next analyzed the signal levels of these modifications at H2AK121ub/H3K27me3, only-H3K27me3, and only-H2AK121ub marked genes in the different mutants (Fig. 3a–d). For this, we used previously published ChIP-seq data for some of the mutants and WT28, and generated new datasets for the remaining mutants together with a new WT dataset (see “Methods” section). We redefined H2AK121ub/H3K27me3, only-H3K27me3, and only-H2AK121ub marked genes as those genes displaying the corresponding modification/s in one of the two WT datasets, resulting in similar gene lists than the ones previously published28 (see “Methods” section).
Average H3K27me3 levels at H2AK121ub/H3K27me3-marked genes were almost undetectable in clf28swn7 and significantly decreased in bmi1abc compared to WT as previously described28 (Fig. 3a), which is consistent with a requirement of PRC1 activity for H3K27me3 marking. Accordingly, ChIP-reChIP experiments using anti-H3K27me3 and anti-H2AK121ub antibodies verified that these marks co-localize at some nucleosomes of H2AK121ub/H3K27me3-marked genes (Supplementary Fig. 4). We also found reduced levels of H3K27me3 in lhp1 and emf1-2 at these genes, although the differences with respect to WT were less significant than in bmi1abc. Conversely, ring1ab weak mutant showed WT-like levels (Fig. 3a). Nevertheless, analyzing the levels of H3K27me3 at individual genes in ring1ab, we found specific genes displaying less than 60% of WT levels (Fig. 3a and Supplementary Fig. 5), which is consistent with previous results40. The low percentage of genes showing strongly reduced levels of H3K27me3 in ring1ab (around 7%) is most probably due to the presence of RING1 activity in this knockdown mutant. All together, these data indicate that all these proteins are required for appropriate H3K27me3 marking at H2AK121ub/H3K27me3 genes. On the contrary, average global H3K27me3 levels at only-H3K27me3 genes were significantly affected in clf28swn7 and lhp1 but not in strong bmi1abc, weak ring1ab, and unexpectedly emf1-2 (Fig. 3b).
Interestingly, despite we found decreased levels of H3K27me3 at many genes in emf1-2, which supports that EMF1 plays an important role in H3K27me3 deposition19,24,25, we also found a high percentage of both H2AK121ub/H3K27me3 and only-H3K27me3 genes with increased levels of H3K27me3 in emf1-2 compared to WT (Fig. 3a, b and Supplementary Fig. 6). Phenotypic analysis of the different mutants indicated that unlike BMI1 and CLF/SWN, EMF1 might not be decisive for the incorporation of H3K27me3 marks in root tissue as emf1-2 can develop a WT-like root (Supplementary Fig. 1). Since we used whole seedlings for ChIP-seq analyses, the fact that emf1-2 displays a stunted shoot development but a WT-like root implies a higher proportion of root cells in emf1-2 samples compared to WT, which develops shoot and root tissues. Therefore, the apparently increased levels of H3K27me3 at some genes in emf1-2 might be a consequence of the different ratio of root cells between emf1-2 and WT samples, which lead to increased average H3K27me3 levels in emf1-2 at root repressed genes. Supporting this, Gene Ontology (GO) analysis of genes with increased H3K27me3 levels in emf1-2 showed an enrichment in processes involved in photosynthesis and response to light, corresponding to genes marked with H3K27me3 in both WT and emf1-2 roots (Supplementary Fig. 6).
Regarding H2AK121ub marks, average H2AK121ub signal levels at H2AK121ub/H3K27me3 and only-H2AK121ub marked genes were significantly reduced in bmi1abc, increased in clf28swn7, and unaltered in lhp1, as previously reported28 (Fig. 3c, d). The fact that clf28swn7 displayed increased H2AK121ub levels at both H2AK121ub/H3K27me3 and only-H2AK121ub marked genes (Fig. 3c, d) suggests an indirect consequence, since this effect was observed at only-H2AK121ub genes that are not CLF or SWN targets41. On the other hand, except for a few genes, we did not find significantly altered levels of H2AK121ub in ring1ab and emf1-2 (Fig. 3c, d), which is consistent with a reduced but not eliminated RING1 activity and a low impact of EMF1 in H2AK121ub marking, respectively. All together, these data show that the levels of H2AK121ub are differentially affected in all these PcG mutants.
H2AK121ub and EMF1 reduce chromatin accessibility at PcG targets
Since we observed a generalized increase in chromatin accessibility in severe mutants associated with the embryonic phenotype, we investigated whether the loss of PcG marks actually plays a role in regulating chromatin accessibility. For this, we analyzed the accessibility profile around the TSS of only-H2AK121ub, H2AK121ub/H3K27me3 and only-H3K27me3 marked genes dividing the genes in groups according to their levels of H2AK121ub or H3K27me3 in the different mutants compared to WT (Fig. 4 and Supplementary Data 3). We first focused on only-H2AK121ub-marked genes (Fig. 4a). We found that decreasing levels of H2AK121ub led to a significant progressive increase in accessibility (Fig. 4a) whereas increasing levels led to reduced accessibility, supporting the association of H2AK121ub marks with a less accessible chromatin. However, we also found that expression levels of only-H2AK121ub-marked genes in WT, although not as high as in active genes lacking PcG marks (non-PcG targets), were higher than in H2AK121ub/H3K27me3, and only-H3K27me3-marked genes (Supplementary Fig. 7). Thus, as gene expression has been correlated with accessible chromatin30,42, these results suggest that H2AK121ub marks at only-H2AK121ub-marked genes associate with a less accessible although still permissive chromatin. Then, we analyzed H2AK121ub/H3K27me3-marked genes, which are generally repressed28. Since the incorporation of H3K27me3 at these genes is dependent on H2AK121ub28, their lower expression levels (Supplementary Fig. 7) suggests a role of H3K27me3 marks in further decreasing accessibility. Accordingly, we found that reduced levels of H2AK121ub in bmi1abc induced a significant loss of H3K27me3 and an increase in accessibility (Fig. 4b). Surprisingly, in clf28swn7, in which H3K27me3 is fully eliminated and H2AK121ub increases at many genes, accessibility was not increased as much as in bmi1abc, suggesting that either H2AK121ub and/or some other factor is preventing a gain of chromatin accessibility in this mutant. Importantly, although the levels of H2AK121ub were unaltered at H2AK121ub/H3K27me3-marked genes in emf1-2, genes with reduced levels of H3K27me3 in this mutant showed higher accessibility than in clf28swn7, indicating that EMF1, acting downstream H2AK121ub, contributes in reducing chromatin accessibility. Increased chromatin accessibility in emf1-2 was also observed at only-H3K27me3-marked genes, which are not regulated by bmi1abc (Fig. 4c), supporting that EMF1 plays a role of in establishing an inaccessible chromatin.
We noted that the accessibility peak at only-H3K27me3-marked genes in emf1-2 was smaller than that at H2AK121ub/H3K27me3 genes (Fig. 4b, c), suggesting that chromatin accessibility at only-H3K27me3 genes is less susceptible to a decrease in the levels of H3K27me3. On the other hand, although H3K27me3 levels were affected in lhp1 and ring1ab, we did not find significant accessibility changes at any of the gene subsets.
Accessibility increases in mutants independently of transcription
Gene expression has been correlated with increased chromatin accessibility31,37,42. In line with this, we found that in WT the genes displaying the highest expression levels were the ones with highest chromatin accessibility (Supplementary Fig. 8). Hence, as loss of PcG function causes upregulation of a considerable number of PcG target and non-target genes26,28,33,40,43 (Supplementary Fig. 9a), PCA analysis of the transcriptome in the different mutants (Supplementary Fig. 9b) showed a similar clustering to the one observed with ATAC-seq data (Fig. 1a). However, it is also known that many PcG target genes do not become upregulated in PcG mutants despite losing PcG marks26,28,33,40,43. Therefore, we wondered whether the increased chromatin accessibility found at PcG targets in bmi1abc, clf28swn7, and emf1-2 was a consequence of transcriptional upregulation of target genes or if it was caused by the loss of PcG function independently of the transcriptional state. To investigate this, we integrated accessibility, histone mark levels, and gene expression data in the different mutants. Since accessibility changes at only-H2AK121ub, H2AK121ub/H3K27me3, or only-H3K27me3 marked genes in mutants were evident at genes displaying less than 60% of WT H2AK121ub and/or H3K27me3 levels (Fig. 4), we analyzed whether or not the expression of the genes with less than 60% of WT marks (cutoff log2FC ≤ −0.74) was altered in bmi1abc, clf28swn7, and emf1-2 (Fig. 5). Scatter plots in mutants representing accessibility versus histone mark levels confirmed that reduced levels of H2AK121ub and/or H3K27me3 led to increased chromatin accessibility (Fig. 5a–c and Supplementary Fig. 10). Interestingly, despite a high percentage of these genes were upregulated in mutants, we found a similar or even higher percentage of genes displaying unaltered expression levels. Remarkably, the percentage of genes with unaltered expression levels was particularly high in the subset of only-H3K27me3-marked genes (Fig. 5a–c). All together, these results indicate that the increased chromatin accessibility found at PcG targets is not caused by gene expression.
Discussion
PcG complexes play central roles in eukaryotic gene regulation. However, despite these complexes have been proposed to define a closed chromatin conformation44, little is known about their role in regulating chromatin accessibility, especially in plants. Instead, PcG function has been extensively correlated to gene repression44.
While the association of PRC2 marking and gene repression is widely accepted in plants, PRC1-mediated H2AK121ub has been recently proposed to be associated with gene responsiveness32. Despite PRC1 activity is required to recruit PRC2 for H3K27me3 marking at many genes, there are also genes only marked with H2AK121ub or H3K27me3 (refs. 28,32). Interestingly, average expression levels of only-H2AK121ub genes are higher than H2AK121ub/H3K27me3 or only-H3K27me3 (ref. 28), which may argue against a repressive role of this modification. In line with this, a recent report proposed that H2AK121ub at H2AK121ub/H3K27me3-marked genes does not play a repressive role by itself, as despite it allows recruitment of PRC2, and thus gene repression, the removal of H2AK121ub at these genes seems to prevent gene reactivation by interfering with active H3K27me3 demethylation32. However, we found that the expression levels of only-H2AK121ub-marked genes were significantly lower than non-PcG active genes, supporting that this modification may play a role in modulating gene expression.
Nevertheless, although it is true that inaccessible chromatin is refractory to gene expression, chromatin accessibility by itself does not impose a transcriptional state45. Therefore, to understand PcG function it is fundamental to determine whether PRC1, PRC2, or their respective hallmarks play a role in regulating chromatin accessibility. Our data revealed that indeed PcG activities regulate chromatin accessibility. We found that reduced levels of H2AK121ub led to increased chromatin accessibility, whereas increased levels of this modification caused the opposite effect, supporting a role of this modification in regulating chromatin accessibility. It might be argued that the BMI1 proteins rather than H2AK121ub marks are the ones responsible for this effect. However, this role is supported by the fact that accessibility changes associated with the levels of H2AK121ub are observed in clf28swn7. In this mutant, the expression of BMI1s is not altered and the recruitment of the BMI1s should not be affected, as PRC1 recruitment is independent of PRC2. In any case, our data cannot rule out the possibility of a direct implication of BMI1 proteins in regulating chromatin accessibility instead of H2AK121ub.
We also found that H2AK121ub associates with transcriptional regulation hotspots, which are sites enriched for the binding of a wide variety of TFs, and thus, required for transcriptional responses. Hence, we propose that H2AK121ub marks help to create a less accessible although still permissive chromatin at these regulatory hotspots (Fig. 6). This is consistent with the proposed hypothesis that H2AK121ub could provide a standby mode between activation and repression32.
In addition, our results indicate that chromatin accessibility can be further reduced by EMF1. This plant-specific protein has been shown to associate with PRC2 and to be required for H3K27me3 marking19,21–25. Accordingly, loss of EMF1 function affected H3K27me3 levels but not H2AK121ub. However, emf1-2 ChIP-seq results showed that not all H3K27me3-marked genes display reduced levels. Furthermore, emf1-2 phenotype suggests that EMF1 is dispensable for H3K27me3 marking at root tissue. Interestingly, a recent phylogenetic analysis of EMF1 showed that ancestral EMF1 gene duplicated twice resulting in three orthologous groups that, next to EMF1, contain three novel outparalogs46. This finding rises the possibility that these paralogs display EMF1-like functions in different tissues or in the regulation of different subsets of genes.
In any case, we found that the H3K27me3-marked genes that lose H3K27me3 in emf1-2 displayed higher chromatin accessibility than clf28swn7, indicating that EMF1 acting upstream H3K27me3 marking plays an important role in regulating chromatin accessibility. Interestingly, EMF1 has the ability to in vitro mediate chromatin compaction20. Local and/or higher order chromatin compaction affects chromatin accessibility45. Furthermore, local chromatin compaction has been shown to precede the establishment of H3K27me3 in mouse embryonic stem cells47, which supports the connection between EMF1 and PRC2 activity. EMF1 ability to in vitro compact chromatin is shared with Drosophila PRC1 component Psc20, which combines BMI1 and EMF1 functions in a single polypeptide48, and with vertebrate Cbx2 (ref. 13), which is one of the Pc homologs49, indicating that, unlike in animals, EMF1 function in plants is associated to PRC2.
Based on our results, we also propose that chromatin at H2AK121ub/H3K27me3-marked genes associates with inaccessible but responsive chromatin, most likely due to the presence of H2AK121ub-marked transcriptional regulation hotspots, which may be important to allow gene reprogramming (Fig. 6). Conversely, only-H3K27me3-marked genes, which are generally not linked with these hotspots, are less responsive (Fig. 6), suggesting that the lack of H2AK121ub-marked transcriptional hotspots may prevent the ectopic expression of these genes.
In vertebrate, a recent report showed that despite PcG target promoters exhibit reduced chromatin accessibility, removal of PRC1, but not PRC2, causes reduced nucleosome occupancy and increase nucleosome spacing without affecting chromatin accessibility16. We found this result surprising, as one would have predicted that these effects would yield at least locally increased chromatin accessibility. In any case, under our experimental conditions and integrating chromatin accessibility, histone marks levels, and expression data in different PcG mutants, we were able to show that the loss of PcG activities in Arabidopsis leads to increased chromatin accessibility at PcG targets. Furthermore, our findings indicate that this increased chromatin accessibility is not caused by gene expression, which explains the lack of transcriptional activation of many PcG target genes that lose PcG marks in mutant plants26,28,33,40,43.
Methods
Plant materials and growth conditions
Arabidopsis thaliana WT Col-0, bmi1abc8, clf28swn7 (ref. 33), emf1-2 (ref. 50), lhp1 (ref. 51), and ring1ab34 mutants were grown under long-day condition (16 h of light/8 h of dark) at 22 °C on MS agar plates containing 1% sucrose and 0.7% agar.
ATAC-seq and data analysis
Nuclei isolation of ATAC-seq experiments were performed as previously described following the sucrose sedimentation protocol52. Briefly, nuclei from 10-day-old whole seedlings were separated from other cellular debris by passing the nuclei suspension through a sucrose cushion during centrifugation. After centrifugation, the contaminating organelles and debris are visible at the top of the tube and nuclei at the bottom. Next, 30,000 nuclei were counted and resuspended in 50 μL transposition reaction mix using the TruePrepTM DNA Library Prep Kit V2 for Illumina® (Vazyme, TD501). Reactions were incubated for 30 min at 37 °C before isolation DNA using the VAHTSTM DNA Clean Beads (Vazyme, N411). Two independent biological replicates were processed for next-generation sequencing library preparation. Libraries were prepared by PCR amplification using two index (i5 and i7) Illumina barcodes (Vazyme, TD202) with 13 cycles. Libraries were purified with a VAHTSTM DNA Clean Beads. ATAC libraries were sequenced on Illumina Hiseq-Xten PE150 by generating 2 × 150-bp paired-end reads.
We obtained around 45 million paired-end reads for each independent biological replicates of the different genotypes. Quality trimming and adapters removal were performed using Trimmomatic53. The organelle genomes were excluded from the TAIR10 Arabidopsis reference genome before reads were mapped using bowtie. PCR duplicates were discarded from the mapped reads using Picard (Picard Toolkit, 2019, Broad Institute, GitHub Repository. http://broadinstitute.github.io/picard/; Broad Institute). Regions with an artifactual massive amount of unique mapped reads were identified. Specifically, 91 regions were collected into a list of blacklisted genomic regions (Supplementary Data 4). In order to prevent interference with subsequent analysis steps, reads mapped to the blacklisted genomic regions were eliminated using samtools. Next, THSs were identified by performing a peak calling analysis using MACS2 with parameters–nomodel–shift -100–extsize 200 -q 0.05. For each genotype, genotype-specific THSs were determined by intersecting the THSs found on each individual replicate using bedtools. A final set of consensus THSs was generated by merging the THSs identified in each genotype. Using different deepTools functions we generated bigwig files storing the genome-wide accessibility signal measured as CPM for the different genotype replicates. Average accessibility profiles for the different genotypes were generated using the bioconductor R package ChIPpeakAnno. An accessibility signal matrix was generated where rows represented THSs and columns genotype replicates. For each THS, its average accessibility signal was computed for each sample. Rows corresponding to THSs exhibiting a maximum signal level below the 5% lower percentile of the signal distribution, approximately 3 CPM were filtered out in order to reduce noise in our analysis. After log2 transformation of the filtered matrix, the bioconductor R package limma was used to identify significantly reduced or increased accessibility levels in the different mutants when compared to WT-like levels (80–120% levels). The statistical significance of differences in accessibility around TSS were assessed using the non-parametric Mann–Whitney–Wilcoxon test implemented in the Wilcox.test function in R. Regression analysis was performed using the lm (linear model) function in R. The overall significance of the regression analysis was carried out using the F-test.
The bioconductor R package ChipseekeR54 was used to study the location of THS around the TSS. The percentages of consensus THSs located nearby histone marks were determined using different functions from deepTools.
Transcription factor-binding sites (TFBS) have been determined in this study by reanalyzing in a uniform manner 100 independent ChIP-seq datasets previously published covering most TF families in Arabidopsis thaliana (see Supplementary Data 2). The TFBS enrichment at the THS was estimated using Monte Carlo simulations. First, we computed the median number of TFBS overlapping with our consensus THSs using deepTools. Then, we generated 104 sets of random genomic regions using a custom R script. These random sets were composed of as many genomic regions as consensus THSs. Moreover, the random regions have the same length as our THSs. For each one of the randomly generated sets we computed the median number of TFBS overlapping with the corresponding random genomic regions. By comparing these random values to the actual number of TFBS in our consensus THSs we estimated the corresponding p value and enrichment.
ChIP-seq and data analysis
H2AK121ub and H3K27me3 ChIP-seq data of bmi1abc, clf28swn7, and WT at 7 DAG, and H2AK121ub data of lhp1 at 7 DAG were previously generated28. H2AK121ub and H3K27me3 ChIP-seq data of WT, ring1ab, emf1-2, and H3K27me3 data of lhp1 at 10 DAG were generated for this work.
ChIP-seq experiments were performed as previously described55. In brief, whole seedlings were fixed in 1% formaldehyde. Chromatin was extracted from fixed tissue and fragmented using a Bioruptor® Pico (Diagenode) in fragments of 200–500 bp. The sheared chromatin was immunoprecipitated overnight using the following antibodies: Anti-H2Aub (Cell Signaling Technology, 8240S, dilution 1:100) and anti-H3K27me3 (Millipore, 07-449, dilution 1:300). Immunocomplexes were captures using Protein A Sepharose beads CL-4B (GE Healthcare). After washing the Protein-A beads, chromatin was eluted and the crosslinking was reversed overnight at 65 °C. The DNA from the immunoprecipitated chromatin was treated with RNase and proteinase K and purified by phenol–chloroform extraction followed by ethanol precipitation. For ChIP-seq, two immunoprecipitations from independent biological replicates were processed for next-generation sequencing library preparation. All libraries were prepared with end repair, A-tailing and ligation of Illumina-compatible adapters using the Ovation® Ultralow Library Systems (NuGEN). The ligated product was amplified with 14 cycles of PCR. DNA of a size range between 200 and 600 bp was purified from an agarose gel. Amplification was confirmed by testing an aliquot of the library before and after amplification by qPCR. Libraries were sequenced on Illumina Hiseq-Xten PE150 by generating 2 × 150-bp paired-end reads.
Each sample for the different genotypes generated approximately 12 million paired-end reads. Bowtie56 was used to map reads to the TAIR10 Arabidopsis reference genome. The tool boxes for the analysis of high-throughput sequencing data deepTools57 and samtools58 were used to generate bigwig files storing H3K27me3 and H2AK121ub genome-wide signal levels measured as CPM (counts per million). Specifically, the function bamCoverage was used to compute normalized signal levels using the parameter –normalizeUsing CPM. Mark peaks were detected using MACS2 (ref. 59) with default parameters for each individual replicate of the WT genotype. Specifically, the function bamCoverage was used to compute normalized signal levels using the parameter –normalizeUsing CPM. Mark peaks were detected using MACS2 (ref. 59) with default parameters for each individual replicate of the WT genotype. A specific input for each of the two datasets was considered, namely an input for the previously published data corresponding to 7 DAG and another specific input for the data generated in this study from 10 DAG seedlings. The final set of mark peaks were defined as the intersection of the ones detected for each replicate and the combination obtained from the 7 DAG and 10 DAG WT data using bedtools60. Target genes were associated to mark peaks using the bioconductor R package ChIPpeakAnno61 when an overlap was detected with either the gene body or promoter defined as the region 750 bp upstream from the TSS. A matrix was constructed were rows represented marked genes and columns the different genotype samples. For each marked gene a signal level in each one of the genotype samples under study was computed as the average signal across the corresponding histone mark peak. In order to remove noisy peaks, we filtered out genes with signals below 3 CPM, approximately corresponding to the 5% lower percentile of the signal distribution. After log2 transformation of the filtered matrix the bioconductor R package limma62 was used to identify significantly reduced or increased histone mark signal levels in the different mutants when compared to the WT.
Metagene plots were generated with a custom R script available from our github repository (10.5281/zenodo.4304639)69. The statistical significance of differences in metagene plots were assessed using the non-parametric Mann–Whitney–Wilcoxon test implemented in the Wilcox.test funcion in R. Similarly, the identification of binding sites for the TFs under analysis was performed using bowtie for read mapping and macs2 for peak calling.
ChIP-reChIP-qPCR analysis
ChIP-reChIP experiments were performed as previously described63. In brief, 10-day-old WT seedlings were fixed in 1% formaldehyde. Chromatin was extracted from fixed tissue and fragmented using a Bioruptor® Pico (Diagenode) in fragments of 500–1000 bp. ChIP was first performed with anti-H3K27me3 (Millipore, 07-449, dilution 1:300). Immunocomplexes were captures using Protein A Sepharose beads CL-4B (GE Healthcare). After washing the Protein-A beads, one fraction was processed as in a conventional ChIP assay in order to revert the crosslinking and purify the DNA, and another fraction was used to elute the immunoprecipitated protein–DNA complexes in Re-ChIP elution buffer (2 mM EDTA, 500 mM NaCl, 0.1% SDS, 1% NP40) by incubation for 30 min at 37 °C. The eluate was diluted 20 times with ChIP dilution buffer supplemented with 50 μg of BSA and protease inhibitor. Then, the second ChIP was performed with anti-H2Aub (Cell Signaling Technology, 8240S, dilution 1:100). Immunocomplexes were captures using Protein A Sepharose beads CL-4B (GE Healthcare). After washing the Protein-A beads, chromatin was eluted, de-crosslinked, and the DNA was extracted. The DNA obtained in the two sequential ChIPs were used as PCR template to amplify PLT5, SOC1, and AG using specific primers (Supplementary Table 1).
RNA-seq and data analysis
WT at 7 DAG RNA-seq data of bmi1abc have been previously published28. RNA-seq data of WT and the remaining mutants at 10 DAG were generated for this work.
Total RNA was extracted using E.Z.N.A.® Plant RNA Kit (Omega, R6827-01) from 10-day-old whole seedlings following the manufacturer´s instructions. Three independent biological replicates were processed for next-generation sequencing library preparation. Libraries were prepared using 1 μg of total RNA with the VAHTSTM Total RNA-seq (H/M/R) Library Prep Kit for Illumina® (Vazyme, NR603) according to the manufacturer’s instructions. Libraries were sequenced on Illumina Hiseq-Xten PE150 by generating 2 × 150-bp paired-end reads.
Each independent biological replicate of the different genotypes generated approximately 20 million paired-end reads. Quality control was performed using FASTQC. Read mapping to the TAIR10 Arabidopsis reference genome was carried out using HISAT2 (ref. 64). Transcript assembly and gene expression estimation measured as FPKM (fragments per kilobase of exon and million of mapped reads) were computed using Stringtie65 and the bioconductor R package ballgown66. Differential gene expression analysis was performed using the bioconductor R package limma62. Specifically, a log2 fold-change of ±1 and a p value of 0.05 was used to determine activated, repressed, and unaltered genes. GO enrichment analysis was performed over the different gene sets of interest using the bioconductor R package clusterProfiler67. PCA was carried out using the R package FactoMineR v2.3 (ref. 68).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work is supported by startup funds from State Key Laboratory for Protein and Plant Gene Research, School of Advanced Agricultural Sciences, Peking-Tsinghua Center for Life Sciences, Peking University, grant 31970532 from the National Natural Science Foundation of China and grants BIO2016-76457-P, PID2019-106664GB-I00, and BIO2017-84066-R from Spanish Ministry of Science and innovation. P.d.l.R. is supported by BIO2017-83629-R, granted to Dr. F. Valverde.
Source data
Author contributions
X. Yin, P.Y., and X.M. performed ATAC-seq experiment. X. Yin, G.T., and S.Z. performed ChIP-seq experiment. J.Y., Y.Z., X.M., S.Z., X. Yin, J.Y. and X. Yang performed RNA-seq experiments. F.J.R.-C. and P.d.l.R. analyzed high-throughput sequencing data (ATAC-seq, ChIP-seq and RNA-seq, and TF-binding sites, respectively); M.C., Y.Z., and F.J.R.-C. interpreted the data and M.C. wrote the manuscript. All authors read and approved the final manuscript.
Data availability
Data supporting the findings of this work are available within the paper and its Supplementary Information files. A reporting summary for this article is available as a Supplementary Information file. The datasets and plant materials generated and analyzed during the current study are available from the corresponding author upon request. ATAC-seq, ChIP-seq, and RNA-seq datasets generated in this study have been deposited in the Gene Expression Omnibus (GEO) under accession GSE155378. Previously generated ChIP-seq and RNA-seq data are under accession GSE89358. Source data are provided with this paper.
Code availability
Several custom R scripts that were used to generate signal profiles have been deposited in the GitHub repository (10.5281/zenodo.4304639)69.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Xiaochang Yin, Francisco J. Romero-Campero.
Contributor Information
Francisco J. Romero-Campero, Email: fran@us.es
Myriam Calonje, Email: myriam.calonje@ibvf.csic.es.
Yue Zhou, Email: yue_zhou@pku.edu.cn.
Supplementary information
Supplementary information is available for this paper at 10.1038/s41467-020-20614-1.
References
- 1.Ringrose L, Paro R. Epigenetic regulation of cellular memory by the Polycomb and Trithorax group proteins. Annu. Rev. Genet. 2004;38:413–443. doi: 10.1146/annurev.genet.38.072902.091907. [DOI] [PubMed] [Google Scholar]
- 2.Merini, W. & Calonje, M. PRC1 is taking the lead in PcG repression. PlantJ. Cell Mol. Biol. 83, 110–120 (2015). [DOI] [PubMed]
- 3.Förderer A, Zhou Y, Turck F. The age of multiplexity: recruitment and interactions of Polycomb complexes in plants. Curr. Opin. Plant Biol. 2016;29:169–178. doi: 10.1016/j.pbi.2015.11.010. [DOI] [PubMed] [Google Scholar]
- 4.Wang H, et al. Role of histone H2A ubiquitination in Polycomb silencing. Nature. 2004;431:873–878. doi: 10.1038/nature02985. [DOI] [PubMed] [Google Scholar]
- 5.Cao R, Tsukada Y-I, Zhang Y. Role of Bmi-1 and Ring1A in H2A ubiquitylation and Hox gene silencing. Mol. Cell. 2005;20:845–854. doi: 10.1016/j.molcel.2005.12.002. [DOI] [PubMed] [Google Scholar]
- 6.Bratzel F, López-Torrejón G, Koch M, Del Pozo JC, Calonje M. Keeping cell identity in Arabidopsis requires PRC1 RING-finger homologs that catalyze H2A monoubiquitination. Curr. Biol. 2010;20:1853–1859. doi: 10.1016/j.cub.2010.09.046. [DOI] [PubMed] [Google Scholar]
- 7.Bratzel F, et al. Regulation of the new Arabidopsis imprinted gene AtBMI1C requires the interplay of different epigenetic mechanisms. Mol. Plant. 2012;5:260–269. doi: 10.1093/mp/ssr078. [DOI] [PubMed] [Google Scholar]
- 8.Yang C, et al. VAL- and AtBMI1-mediated H2Aub initiate the switch from embryonic to postgerminative growth in Arabidopsis. Curr. Biol. 2013;23:1324–1329. doi: 10.1016/j.cub.2013.05.050. [DOI] [PubMed] [Google Scholar]
- 9.Müller J, et al. Histone methyltransferase activity of a Drosophila Polycomb group repressor complex. Cell. 2002;111:197–208. doi: 10.1016/S0092-8674(02)00976-5. [DOI] [PubMed] [Google Scholar]
- 10.Cao R, et al. Role of histone H3 lysine 27 methylation in Polycomb-group silencing. Science. 2002;298:1039–1043. doi: 10.1126/science.1076997. [DOI] [PubMed] [Google Scholar]
- 11.Makarevich G, et al. Different Polycomb group complexes regulate common target genes in. Arabidopsis. EMBO Rep. 2006;7:947–952. doi: 10.1038/sj.embor.7400760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mozgova I, Hennig L. The Polycomb group protein regulatory network. Annu. Rev. Plant Biol. 2015;66:269–296. doi: 10.1146/annurev-arplant-043014-115627. [DOI] [PubMed] [Google Scholar]
- 13.Grau DJ, et al. Compaction of chromatin by diverse Polycomb group proteins requires localized regions of high charge. Genes Dev. 2011;25:2210–2221. doi: 10.1101/gad.17288211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Trojer P, et al. L3MBTL2 protein acts in concert with PcG protein-mediated monoubiquitination of H2A to establish a repressive chromatin structure. Mol. Cell. 2011;42:438–450. doi: 10.1016/j.molcel.2011.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Deaton, A. M. et al. Enhancer regions show high histone H3.3 turnover that changes during differentiation. eLife5, e15316 (2016). [DOI] [PMC free article] [PubMed]
- 16.King HW, Fursova NA, Blackledge NP, Klose RJ. Polycomb repressive complex 1 shapes the nucleosome landscape but not accessibility at target genes. Genome Res. 2018;28:1494–1507. doi: 10.1101/gr.237180.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mozgova I, Köhler C, Hennig L. Keeping the gate closed: functions of the polycomb repressive complex PRC2 in development. Plant J. Cell. Mol. Biol. 2015;83:121–132. doi: 10.1111/tpj.12828. [DOI] [PubMed] [Google Scholar]
- 18.Calonje M. PRC1 marks the difference in plant PcG repression. Mol. Plant. 2014;7:459–471. doi: 10.1093/mp/sst150. [DOI] [PubMed] [Google Scholar]
- 19.Calonje M, Sanchez R, Chen L, Sung ZR. EMBRYONIC FLOWER1 participates in polycomb group-mediated AG gene silencing in Arabidopsis. Plant Cell. 2008;20:277–291. doi: 10.1105/tpc.106.049957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Beh LY, Colwell LJ, Francis NJ. A core subunit of Polycomb repressive complex 1 is broadly conserved in function but not primary sequence. Proc. Natl Acad. Sci. USA. 2012;109:E1063–E1071. doi: 10.1073/pnas.1118678109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Derkacheva M, et al. Arabidopsis MSI1 connects LHP1 to PRC2 complexes. EMBO J. 2013;32:2073–2085. doi: 10.1038/emboj.2013.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Liang SC, et al. Kicking against the PRCs—a domesticated transposase antagonises silencing mediated by polycomb group proteins and is an accessory component of polycomb repressive complex 2. PLoS Genet. 2015;11:e1005660. doi: 10.1371/journal.pgen.1005660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bloomer RH, et al. The Arabidopsis epigenetic regulator ICU11 as an accessory protein of Polycomb Repressive Complex 2. Proc. Natl Acad. Sci. USA. 2020;117:16660–16666. doi: 10.1073/pnas.1920621117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kim SY, Lee J, Eshed-Williams L, Zilberman D, Sung ZR. EMF1 and PRC2 cooperate to repress key regulators of Arabidopsis development. PLoS Genet. 2012;8:e1002512. doi: 10.1371/journal.pgen.1002512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Li Z, Fu X, Wang Y, Liu R, He Y. Polycomb-mediated gene silencing by the BAH-EMF1 complex in plants. Nat. Genet. 2018;50:1254–1261. doi: 10.1038/s41588-018-0190-0. [DOI] [PubMed] [Google Scholar]
- 26.Turck F, et al. Arabidopsis TFL2/LHP1 specifically associates with genes marked by trimethylation of histone H3 lysine 27. PLoS Genet. 2007;3:e86. doi: 10.1371/journal.pgen.0030086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhang X, et al. The Arabidopsis LHP1 protein colocalizes with histone H3 Lys27 trimethylation. Nat. Struct. Mol. Biol. 2007;14:869–871. doi: 10.1038/nsmb1283. [DOI] [PubMed] [Google Scholar]
- 28.Zhou Y, Romero-Campero FJ, Gómez-Zambrano Á, Turck F, Calonje M. H2A monoubiquitination in Arabidopsis thaliana is generally independent of LHP1 and PRC2 activity. Genome Biol. 2017;18:69. doi: 10.1186/s13059-017-1197-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Shu H, Wildhaber T, Siretskiy A, Gruissem W, Hennig L. Distinct modes of DNA accessibility in plant chromatin. Nat. Commun. 2012;3:1281. doi: 10.1038/ncomms2259. [DOI] [PubMed] [Google Scholar]
- 30.Tannenbaum M, et al. Regulatory chromatin landscape in Arabidopsis thaliana roots uncovered by coupling INTACT and ATAC-seq. Plant Methods. 2018;14:113. doi: 10.1186/s13007-018-0381-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Frerichs A, Engelhorn J, Altmüller J, Gutierrez-Marcos J, Werr W. Specific chromatin changes mark lateral organ founder cells in the Arabidopsis inflorescence meristem. J. Exp. Bot. 2019;70:3867–3879. doi: 10.1093/jxb/erz181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kralemann LEM, et al. Removal of H2Aub1 by ubiquitin-specific proteases 12 and 13 is required for stable Polycomb-mediated gene repression in Arabidopsis. Genome Biol. 2020;21:144. doi: 10.1186/s13059-020-02062-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lafos M, et al. Dynamic regulation of H3K27 trimethylation during Arabidopsis differentiation. PLoS Genet. 2011;7:e1002040. doi: 10.1371/journal.pgen.1002040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Xu L, Shen W-H. Polycomb silencing of KNOX genes confines shoot stem cell niches in Arabidopsis. Curr. Biol. 2008;18:1966–1971. doi: 10.1016/j.cub.2008.11.019. [DOI] [PubMed] [Google Scholar]
- 35.Gaudin V, et al. Mutations in LIKE HETEROCHROMATIN PROTEIN 1 affect flowering time and plant architecture in Arabidopsis. Dev. Camb. Engl. 2001;128:4847–4858. doi: 10.1242/dev.128.23.4847. [DOI] [PubMed] [Google Scholar]
- 36.Sijacic P, Bajic M, McKinney EC, Meagher RB, Deal RB. Changes in chromatin accessibility between Arabidopsis stem cells and mesophyll cells illuminate cell type-specific transcription factor networks. Plant J. Cell Mol. Biol. 2018;94:215–231. doi: 10.1111/tpj.13882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gaspar-Maia A, Alajem A, Meshorer E, Ramalho-Santos M. Open chromatin in pluripotency and reprogramming. Nat. Rev. Mol. Cell Biol. 2011;12:36–47. doi: 10.1038/nrm3036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Schlesinger S, Meshorer E. Open chromatin, epigenetic plasticity, and nuclear organization in pluripotency. Dev. Cell. 2019;48:135–150. doi: 10.1016/j.devcel.2019.01.003. [DOI] [PubMed] [Google Scholar]
- 39.Maher KA, et al. Profiling of accessible chromatin regions across multiple plant species and cell types reveals common gene regulatory principles and new control modules. Plant Cell. 2018;30:15–36. doi: 10.1105/tpc.17.00581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Wang H, et al. Arabidopsis flower and embryo developmental genes are repressed in seedlings by different combinations of Polycomb group proteins in association with distinct sets of cis-regulatory elements. PLoS Genet. 2016;12:e1005771. doi: 10.1371/journal.pgen.1005771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Shu J, et al. Genome-wide occupancy of histone H3K27 methyltransferases CURLY LEAF and SWINGER in Arabidopsis seedlings. Plant Direct. 2019;3:e00100. doi: 10.1002/pld3.100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Alvarez JM, et al. Local changes in chromatin accessibility and transcriptional networks underlying the nitrate response in Arabidopsis roots. Mol. Plant. 2019;12:1545–1560. doi: 10.1016/j.molp.2019.09.002. [DOI] [PubMed] [Google Scholar]
- 43.Bouyer D, et al. Polycomb repressive complex 2 controls the embryo-to-seedling phase transition. PLoS Genet. 2011;7:e1002014. doi: 10.1371/journal.pgen.1002014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Schuettengruber B, Bourbon H-M, Di Croce L, Cavalli G. Genome regulation by Polycomb and Trithorax: 70 years and counting. Cell. 2017;171:34–57. doi: 10.1016/j.cell.2017.08.002. [DOI] [PubMed] [Google Scholar]
- 45.Klemm SL, Shipony Z, Greenleaf WJ. Chromatin accessibility and the regulatory epigenome. Nat. Rev. Genet. 2019;20:207–220. doi: 10.1038/s41576-018-0089-8. [DOI] [PubMed] [Google Scholar]
- 46.Berke L, Snel B. The plant Polycomb repressive complex 1 (PRC1) existed in the ancestor of seed plants and has a complex duplication history. BMC Evol. Biol. 2015;15:44. doi: 10.1186/s12862-015-0319-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yuan W, et al. Dense chromatin activates Polycomb repressive complex 2 to regulate H3 lysine 27 methylation. Science. 2012;337:971–975. doi: 10.1126/science.1225237. [DOI] [PubMed] [Google Scholar]
- 48.Brunk BP, Martin EC, Adler PN. Drosophila genes posterior sex combs and suppressor two of zeste encode proteins with homology to the murine bmi-1 oncogene. Nature. 1991;353:351–353. doi: 10.1038/353351a0. [DOI] [PubMed] [Google Scholar]
- 49.Völkel P, Le Faou P, Vandamme J, Pira D, Angrand P-O. A human Polycomb isoform lacking the Pc box does not participate to PRC1 complexes but forms protein assemblies and represses transcription. Epigenetics. 2012;7:482–491. doi: 10.4161/epi.19741. [DOI] [PubMed] [Google Scholar]
- 50.Moon Y-H, et al. EMF genes maintain vegetative development by repressing the flower program in Arabidopsis. Plant Cell. 2003;15:681–693. doi: 10.1105/tpc.007831. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kotake T, Takada S, Nakahigashi K, Ohto M, Goto K. Arabidopsis TERMINAL FLOWER 2 gene encodes a heterochromatin protein 1 homolog and represses both FLOWERING LOCUS T to regulate flowering time and several floral homeotic genes. Plant Cell Physiol. 2003;44:555–564. doi: 10.1093/pcp/pcg091. [DOI] [PubMed] [Google Scholar]
- 52.Bajic M, Maher KA, Deal RB. Identification of open chromatin regions in plant genomes using ATAC-Seq. Methods Mol. Biol. 2018;1675:183–201. doi: 10.1007/978-1-4939-7318-7_12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Yu G, Wang L-G, He Q-Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics. 2015;31:2382–2383. doi: 10.1093/bioinformatics/btv145. [DOI] [PubMed] [Google Scholar]
- 55.Reimer JJ, Turck F. Genome-wide mapping of protein-DNA interaction by chromatin immunoprecipitation and DNA microarray hybridization (ChIP-chip). Part A: ChIP-chip molecular methods. Methods Mol. Biol. 2010;631:139–160. doi: 10.1007/978-1-60761-646-7_12. [DOI] [PubMed] [Google Scholar]
- 56.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ramírez F, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Zhu LJ, et al. ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data. BMC Bioinformatics. 2010;11:237. doi: 10.1186/1471-2105-11-237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ritchie ME, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Truax AD, Greer SF. ChIP and Re-ChIP assays: investigating interactions between regulatory proteins, histone modifications, and the DNA sequences to which they bind. Methods Mol. Biol. 2012;809:175–188. doi: 10.1007/978-1-61779-376-9_12. [DOI] [PubMed] [Google Scholar]
- 64.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Kovaka S, et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019;20:278. doi: 10.1186/s13059-019-1910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Frazee AC, et al. Ballgown bridges the gap between transcriptome assembly and expression analysis. Nat. Biotechnol. 2015;33:243–246. doi: 10.1038/nbt.3172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics J. Integr. Biol. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Lê S, Josse J, Husson F. FactoMineR: an R package for multivariate analysis. J. Stat. Softw. 2008;25:1–18. doi: 10.18637/jss.v025.i01. [DOI] [Google Scholar]
- 69.Yin, X. et al. H2AK121ub in Arabidopsis associates with a less accessible chromatin state at transcriptional regulation hotspots. Available from GitHub repository. 10.5281/zenodo.4304639 (2020). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data supporting the findings of this work are available within the paper and its Supplementary Information files. A reporting summary for this article is available as a Supplementary Information file. The datasets and plant materials generated and analyzed during the current study are available from the corresponding author upon request. ATAC-seq, ChIP-seq, and RNA-seq datasets generated in this study have been deposited in the Gene Expression Omnibus (GEO) under accession GSE155378. Previously generated ChIP-seq and RNA-seq data are under accession GSE89358. Source data are provided with this paper.
Several custom R scripts that were used to generate signal profiles have been deposited in the GitHub repository (10.5281/zenodo.4304639)69.