Abstract
RNA-mediated transcriptional silencing, in plants known as RNA-directed DNA methylation (RdDM), is a conserved process where small interfering RNA (siRNA) and long non-coding RNA (lncRNA) help establish repressive chromatin modifications. This process represses transposons and affects the expression of protein-coding genes. We found that in Arabidopsis thaliana AGO4 binding sites are often located distant from genes differentially expressed in ago4. Using Hi-C to compare interactions between genotypes, we show that RdDM-targeted loci have the potential to engage in chromosomal interactions, but these interactions are inhibited in wild-type conditions. In mutants defective in RdDM, the frequency of chromosomal interactions at RdDM targets is increased. This includes increased frequency of interactions between Pol V methylated sites and distal genes that are repressed by RdDM. We propose a model, where RdDM prevents the formation of chromosomal interactions between genes and their distant regulatory elements.
Author summary
Repetitive elements, such as transposons, comprise a large portion of the genome that is kept silent to maintain genome integrity. Transposon silencing is a canonical role of RNA-directed DNA methylation (RdDM) where small interfering RNA (siRNA) and long non-coding RNA (lncRNA) work together to place repressive chromatin marks. In addition to its role in controlling repetitive sequences, RdDM contributes to the regulation of gene expression yet molecular mechanisms utilized by RdDM to control gene expression remain mostly unknown. We show that the repression of gene expression by RdDM may involve preventing interactions between distant chromosomal regions.
Introduction
Transcriptional gene silencing (TGS) is an important pathway for transposon repression throughout Eukarya. This process involves the establishment of repressive histone modifications and de novo DNA methylation under the guidance of non-coding RNA [1,2]. This pathway’s activity is prominent in the germline of Metazoans in establishing de novo DNA methylation that can be afterwards maintained [1]. In the model flowering plant species Arabidopsis thaliana TGS, also known as RNA-directed DNA methylation (RdDM), occurs throughout development in order to consistently establish and maintain repressive chromatin marks.
RdDM in Arabidopsis uses two specialized RNA polymerases, RNA polymerase IV (Pol IV) and RNA polymerase V (Pol V) to direct repressive chromatin modifications [3]. While Pol IV is believed to produce siRNA precursors, Pol V has been proposed to produce lncRNA scaffolds which help guide proteins to chromatin [3–6]. ARGONAUTE 4 (AGO4) is one of those proteins and its interactions with siRNA, lncRNA, and Pol V largest subunit allow its recruitment to chromatin [7,8]. Binding of AGO4 and other RNA binding proteins, such as SPT5L and IDN2, is then important for binding and activity of enzymes, which establish repressive chromatin modifications [9–12].
Mechanisms and roles of DNA methylation established by Pol V-associated factors have been studied extensively [13–16]; however, the TGS / RdDM pathway is not limited to DNA methylation. It has been shown to mediate the establishment of repressive posttranslational histone modifications [7,9,17]. It has also been implicated in the control of nucleosome positioning through the recruitment of a putative ATP-dependent chromatin remodeling complex [12]. These broad effects of RdDM on chromatin structure indicate that this pathway is a master regulator of chromatin status at its target loci.
RdDM not only represses transposon activity but also affects the expression levels of protein-coding genes [8,12,18,19]. This may be partially explained by the widespread presence of short transposons upstream of transcription start sites, where RdDM components preferentially localize and establish DNA methylation [8,19,20]. Indeed, AGO4 binds to putative promoters and establishes DNA methylation [8,14], which has been proposed as a mechanism of RdDM-mediated gene expression control [8]. However, the exact manner by which this occurs remains unresolved.
One way that the RdDM pathway could directly control gene expression is through manipulation of the chromatin landscape at promoters. Stabilized nucleosomes, repressive histone modifications, DNA methylation, or occupancy by RdDM proteins could directly influence transcription factor binding and thereby inhibit transcription. In addition to promoters, many eukaryotes control expression using enhancers, which may be distant from the transcriptional start site (TSS) [21–25]. 3D spatial interactions between loci allow distal enhancers to influence transcription long range. The presence of these long range chromosomal interactions is often associated with active histone modifications and/or nucleosome free regions [21,26–29].
The presence and importance of distant enhancers in plant genomes remains mostly unknown. It has been argued that most Arabidopsis genes are controlled by proximal regulatory elements located within 3 kb from the transcription start site [30]. However, the role of longer range effects cannot be excluded, especially under non-standard conditions. Therefore, it remains possible that mechanisms controlling chromatin modifications may also affect the formation of long range chromosomal interactions in Arabidopsis, which may influence gene expression levels. Analysis of the chromatin interaction landscape around differentially expressed genes may provide an explanation for how RdDM affects the expression of protein coding genes.
We found that AGO4 binding sites are often distal from genes differentially expressed in ago4. We proposed and tested the hypothesis that RdDM affects the frequency of long range chromosomal interactions through comparison of Hi-C contacts in Col-0 (wild-type), nrpe1 (mutant for the large subunit of Pol V), and ago4. Our data indicate that RdDM reduces the frequency of long range chromosomal interactions at methylated loci. We also found that genes differentially expressed in ago4 gain interactions to previously methylated sites when AGO4 is mutated. These data support a speculative model where RdDM can control the expression of genes over long distances by inhibiting interactions between promoters and their potential distant regulatory elements.
Results
Long distance control of gene expression by RdDM
We first sought to test the hypothesis that RdDM affects the expression of genes located in close proximity to direct RdDM targets. In order to identify RdDM regulated genes we performed RNA-seq in the ago4 mutant with three biological replicates. We tested whether these genes are directly regulated by AGO4 targeting to the promoter by examining the promoters of differentially expressed genes for the presence of AGO4 as identified by ChIP-seq. Although AGO4 binding sites are generally enriched upstream of genes [8], transcriptional start sites (TSS) of genes with differential expression are generally distant from AGO4 binding to chromatin (S1A Fig). In fact, only ~5% of differentially expressed genes had an AGO4 binding site within 2.5 kb upstream of the TSS (Fig 1A). Furthermore, this small overlap was not a result of ChIP-seq peak calling stringency as a less stringent cutoff did not improve the overlapping region (Fig 1A, dashed circle). It should be noted that 2.5 kb is likely an overestimation of putative promoter length in Arabidopsis, but demonstrates the lack of overlap between ago4 differentially expressed genes and AGO4 binding sites. The large portion of differentially expressed genes without proximal AGO4 binding suggests that only a limited subset of RdDM-affected genes is controlled by RdDM within their putative promoters.
The manner by which RdDM can control gene expression from long distances is unclear. One possibility is that RdDM may directly target a transcriptional regulator and thereby change the transcription of multiple genes as an indirect downstream consequence. We tested an alternative, but not mutually exclusive hypothesis that RdDM may control the activity of distant regions via 3D chromatin organization. This could take advantage of existing chromatin interactions or involve alterations of chromatin interactions to regulate gene expression. To investigate these possibilities we performed chromosome conformation capture experiments followed by high throughput sequencing (Hi-C) [31,32]. We performed Hi-C in three biological replicates using the DpnII restriction endonuclease for fragmentation (Fig 1B), and long-distance chromatin contacts were identified by two parallel approaches. First, we used a simple approach to identify potential long distance chromatin interactions based on filtering by sequencing read count and the distance between mapped ends (see Materials and methods). We refer to this method as simple interaction calling which was applied to individual biological repeats (Fig 1B). Analysis of Hi-C often involves combining replicates in order to obtain interactions at greater depth [33–36]. We first tested the reproducibility of the Hi-C contact maps between replicates by examining the correlation in interaction intensities between individual 25 kb bins (S1B Fig). Since we saw high correlation between replicates, we decided to identify contacts from combined replicates as an alternate, deeper view of the data (Fig 1B, S1B and S1C Fig).
In order to obtain high confidence / stringent interactions, we also used a published interaction caller—Fit-Hi-C with ICE normalization (iterative correction and eigenvector decomposition) (S1C Fig) [37,38] which is stringent but not sensitive enough to work with the limited sequencing coverage available for individual biological repeats. We examined the distribution of interaction distances found by each of these methods and found that Fit-Hi-C calls at 1 kb resolution generally represent longer distances due to its use of distance normalization (S1D Fig) [38]. However, Fit-Hi-C inherently does not support inter-chromosomal interaction calling while simple calling maintains these interactions (S1E Fig). Using both methods we can evaluate results independent of the sensitivity / stringency of interaction calls. As the less stringent calling method allows comparison between biological replicates, these interaction calls are used throughout the manuscript and confirmation by Fit-Hi-C is provided in the supplemental data.
Features of wild-type chromatin organization in Arabidopsis thaliana
We started our analysis with testing if our Hi-C datasets recapitulate known features of chromosome organization in Arabidopsis. One aspect of Hi-C is approximately exponential decay of interactions with increasing distance. We plotted this decay for previously published Col-0 (wild-type) Hi-C and the corresponding decrosslinked control (Fig 1C, blue and grey) [29]. When we assigned our contacts to DpnII restriction fragments, our datasets showed much slower decay than the decrosslinked control (Fig 1C, orange) indicating that our Hi-C data may reflect real interactions.
Some studies have suggested the presence of topologically associating domains (TADs) in Arabidopsis [29], while others have not been able to confirm the presence of TADS [28,39]. Likewise, we cannot clearly see TADs in our data (S1F Fig). This could be partially due to the different developmental stages used in our study and in each of the other Arabidopsis Hi-C studies [28,29,39]. Additionally, in an effort to create Hi-C libraries with which we could compare to AGO4 ChIP-seq, we used low crosslinking conditions (0.5% formaldehyde) which may also explain any visual differences in broad raw Hi-C heatmaps (S1F Fig).
To confirm the quality and reproducibility of our Hi-C data, we tested whether we could recapitulate other known and reproducible features of wild-type Arabidopsis chromatin organization. A broad feature of Arabidopsis Hi-C is the self-association of chromosome arms [28,29,39]. To test this, we examined broad associations along chromosomes by calculating the correlation of interactions between each 1 Mb bin. We then performed a principle component analysis and took the first component (eigenvector) to calculate the clustering of genomic bins, a strategy which was originally employed in human cells [31]. Eigenvector analysis of the Hi-C data indicates two separate clusters corresponding to chromosome arms (Fig 1D, S1G Fig). This indicates that regions on chromosome arms preferentially associate with other regions on the same arm rather than between arms.
Another feature typically seen by Hi-C is an enrichment for inter-telomeric association [28,29,39]. Similar to others, when we examined telomeres we detected higher association of telomeres with each other than with the neighboring regions (Fig 1E). Additionally, strong inter-chromosomal interactions between centromeres (including pericentromeric regions) are characteristic of Hi-C data [28,39] which we also find in our data (Fig 1F). We conclude that our Hi-C data recapitulate several previously identified features of Arabidopsis chromatin organization and despite limited sequencing depth are a good measure of chromatin interactions.
Chromatin interactions are inhibited at repressive chromatin
It is interesting that centromeres, which have high levels of repressive chromatin modifications, engage in strong chromatin interactions (albeit inter-chromosomally). This suggests that chromatin with repressive modifications could be more likely to engage in specific chromosomal interactions in Arabidopsis. To test whether repressed chromatin outside of centromeric or pericentromeric regions is also associated with long range chromosomal interactions, we first identified non-centromeric sites that display marks of active or inactive chromatin. Ratios of the levels of the active mark H3K4me2 to the inactive mark H3K9me2 [40,41] in equal sized bins display a bimodal distribution which we used to identify active and inactive chromatin (S1H Fig). We then counted the frequency of identified chromatin interactions associated with active or inactive chromatin. To compare replicates with different sequencing depths and to test for enrichment, we normalized by interactions occurring in randomly permutated non-centromeric bins (expected). From these data we find that non-centromeric inactive regions engage in long distance chromosomal interactions less often than active regions and less often than random bins (expected) (Fig 1G). This difference was consistent between the combined dataset (bar) and individual biological repeats (dots), which shows that it exceeds the biological variation. Similar analyses with Fit-Hi-C called interactions and with data from Wang et al displayed the same pattern (S1I Fig) [29]. We thought it was possible that the lower numbers of interactions in inactive chromatin could stem from inaccessibility to DpnII due to crosslinked compacted chromatin. To test the efficiency of DpnII at inactive regions, we examined a previously published decrosslinked control Hi-C library [29] and saw no difference in read distribution between active and inactive chromatin (S1J Fig). This indicates that DpnII is likely to have the same activity despite the chromatin state and thus the digestion efficiency cannot fully explain the lower number of interactions anchored in inactive chromatin. Therefore, the data indicate that non-centromeric regions with repressive histone modifications are overall inhibited from participating in chromosomal interactions, which is consistent with data previously reported for Arabidopsis [28,29,39,42] and for other organisms [21,26,27].
To test the effects of DNA methylation on chromatin interactions we examined genomic bins (250 bp bins) with high or low levels of DNA methylation in the CG, CHG, and CHH sequence contexts (S1K–S1M Fig). We found that highly methylated bins participate in fewer total long-range chromatin contacts than bins with low levels of DNA methylation especially in the non-CG context (Fig 1H, S1N and S1O Fig). These differences were consistent between biological repeats. This observation is also consistent with previously published Hi-C in Arabidopsis [28,29]. This indicates that methylated regions, especially in the CHH context, are less likely to engage in detectable chromatin interactions.
RdDM inhibits chromosomal interactions
The depletion of chromosomal interactions at genomic regions with repressive chromatin modifications (Fig 1GH) suggests that pathways involved in establishing heterochromatin may also repress chromatin interactions. To test whether the activity of RdDM is correlated with the inhibition of chromatin interactions we examined interactions at bins with high levels of CHH methylation in Col-0 wild-type, which is reduced in nrpe1, a mutant in the largest subunit of Pol V. These nrpe1 differentially methylated bins (nrpe1 DMBs) had fewer interactions than the randomized control (Fig 2A, S2A Fig). In comparison, bins where CHH methylation is unchanged in nrpe1 did not show the same inhibition (Fig 2A, S2A Fig). These differences were consistent between biological repeats. This indicates that long range chromosomal interactions are inhibited at RdDM target loci.
Due to the differences in contact frequency between nrpe1 DMBs and other meCHH bins, we checked whether features other than RdDM distinguish the two categories. Although methylation levels were high in both categories (due to their definition, see Materials and methods), differences between the two are evident such that unchanged meCHH bins generally have higher methylation than nrpe1 DMBs (Fig 2B). We tested whether these differences in methylation levels between nrpe1 DMBs and other meCHH bins were responsible for the differences in interaction signal. This was done by matching methylation levels between the categories. First, we compared nrpe1 DMBs and other meCHH bins where both categories had methylation levels lower than the median nrpe1 DMB level (Fig 2B). These matched nrpe1 DMBs still had less chromatin interactions despite the similar methylation status (Fig 2C—blue). Next, we compared nrpe1 DMBs to other meCHH bins where both had methylation levels higher than the median level in the other meCHH bins (Fig 2B). Again, we found that matched nrpe1 DMBs had less chromatin interactions despite the similar methylation status (Fig 2C—purple). In each case the nrpe1 DMB interaction signal was lower than other meCHH bins irrespective of methylation level or sequence context (Fig 2C). We performed the same analysis comparing H3K9me2 / H3K4me2 levels but found similar results (S2B and S2C Fig). Overall, this indicates that long range chromatin interactions are inhibited on loci targeted by RdDM and that this inhibition is probably not a result of chromatin signatures.
To further test the effects of RdDM on long range chromosomal interactions we performed Hi-C in nrpe1 and ago4 mutants, both of which are defective in RdDM [43–46]. We performed each Hi-C experiment in three replicates and found that the replicates correlated well with each other (S2D and S2E Fig). We examined Hi-C contact maps in these mutants and found that the overall organization of chromatin was similar to that of Col-0 (wild-type) (S2F Fig) and that nrpe1 and ago4 contacts displayed decay rates similar to Col-0 (S2G Fig). To test the effects of RdDM on chromatin interactions, instead of equal sized bins we focused our analysis on published nrpe1 DMRs which are short CHH methylated regions where methylation is lost in nrpe1 (S2H Fig) [14]. We examined high confidence chromosomal interactions detected in Col-0, nrpe1 and ago4 and observed increased interaction signals at nrpe1 DMRs in nrpe1 and ago4 mutants compared to Col-0 wild-type, but no difference at other methylated regions (Fig 2D). This trend was also seen by Fit-Hi-C (S2I Fig). Furthermore, the increase in interactions in nrpe1 and ago4 was more pronounced when we consider interactions called at higher confidence (S2J Fig). These data indicate that RdDM represses the formation of chromosomal interactions.
We examined a locus displaying increased Hi-C interactions in nrpe1 and ago4 and tested the interaction by 3C-qPCR. This verified the mutant-specific increase in interaction between the loci identified by Hi-C (Fig 2E, Anchor—C) while primers on either side showed no enrichment (Fig 2E, primer A and D). This increased interaction occurred between a site enriched in AGO4 ChIP-seq signal (Fig 2E—green) and showed increased RNA-seq signal in ago4 (Fig 2E—red). We performed this same analysis at a control locus which showed no increase in interactions by Hi-C and saw no increase by 3C (Fig 2F). Additionally, by Hi-C we found an increase in an extremely long distant interaction that occurs between two loci more than 8 Mb apart (Chr4: 654624–8811559). We tested whether this increase was also identifiable by 3C-qPCR and found that these loci interact more strongly in nrpe1 and ago4 (S2K Fig—box). We tested several other DpnII fragments closer to the anchor and found no increase in interactions (S2K Fig) indicating that the long distance interaction is specific to that site. These results verify that increases in interactions occur in nrpe1 and ago4 at RdDM targets, and at least some increased interactions occur between AGO4 bound loci and genes differentially expressed in ago4.
Chromosomal interactions have been shown to be affected by nucleosome positioning [47,48] and nucleosome positioning is known to be an aspect of chromatin that is altered by RdDM [12]. We tested whether nucleosome positioning corresponds to changes in chromosomal interactions at nrpe1 DMRs. DMRs with nucleosomes reduced in nrpe1 show a greater increase in chromosomal interactions in nrpe1 and ago4 than those with unchanged nucleosomes (Fig 2G) although this difference was not present when examining the more stringent Fit-Hi-C chromatin interactions (S2L Fig). In any case, while nucleosome positioning may contribute, it cannot fully explain increased interactions in nrpe1 and ago4, as increases in chromatin interactions are still present at DMRs where nucleosomes are unchanged in nrpe1 (Fig 2G and S2L Fig). These data indicate that RdDM inhibits the formation of long range chromosomal interactions at its target loci by a mechanism independent of nucleosome remodeling.
Gene activity corresponds to interactions with transcription factor binding sites
Our observation that RdDM inhibits the formation of long range chromosomal interactions suggests that chromosome organization may be a general mechanism used by RNA-mediated transcriptional silencing pathways to control gene activity. However, it is unknown to what extent chromosome organization contributes to gene regulation in Arabidopsis. To test if the presence of long range chromosomal interactions may be functionally relevant to gene expression, we first checked whether there is a relationship between gene expression levels and the presence of chromosomal interactions. We used RNA-seq in Col-0 wild-type to identify genes that are either inactive or highly expressed and examined how often their putative promoters (defined as 1kb upstream of TSS) are engaged in chromosomal interactions. Inactive genes engage in interactions less often than active genes, and less than expected from random genes (Fig 3A, S3 Fig). This difference was consistent between biological repeats. We further tested the correlation between chromatin interactions and gene expression by analyzing genes with varying levels of interaction signal intensity in their putative promoters. Genes with low numbers of interactions at their promoter regions had lower median expression levels and, conversely, genes with high numbers of interactions had higher median expression levels (Fig 3B). This indicates a positive relationship between gene expression levels and overall chromosomal interaction frequency in Arabidopsis which is similar to findings from other organisms [49–51].
It has been proposed that chromosomal interactions may influence gene expression by bringing distantly bound transcription factors into close proximity with promoters of target genes in other organisms [22,27,52]. It is, however, unknown if a similar mechanism exists in Arabidopsis. To test this possibility, we looked for evidence of interactions between transcription factor binding sites and genes. First we examined a large set of transcription factors and their putative binding sites identified by published DAP-seq data [53]. For each transcription factor in the DAP-seq dataset, we calculated the median expression value for genes with identified interactions that connected the gene to a transcription factor’s binding site. We also performed this analysis for genes not interacting with the transcription factor’s binding sites. We found that for each transcription factor, the median expression of interacting genes was higher than genes not interacting with the transcription factor. It was also higher than the genome-wide average (Fig 3C), which suggests that these interactions may be involved in controlling gene expression.
To find further evidence of long-range control by transcription factors, we then examined published ChIP-seq and gene expression data for a well-studied specific transcription factor, SVP [54]. We identified genomic regions which interact with genes that change expression in the svp mutant. SVP binding to chromatin detected by ChIP-seq was enriched at distant interacting sites. This enrichment was only seen on sites which interact with genes with decreased expression in the svp mutant (Fig 3D).
Overall, these findings indicate that higher gene expression levels in Arabidopsis are associated with the presence of long range chromosomal interactions. Although causality remains to be proven, this indicates that the principle of regulatory regions affecting distant genes by long range chromosomal interactions may be applicable to plant genomes.
Gene activity is associated with inhibition of chromosomal interactions by RdDM
Our observations that RdDM inhibits long range chromosomal interactions and that long range interactions exist between genes and TF binding sites suggest that genes affected by RdDM may preferentially interact with distant genomic regions targeted by RdDM. To test this possibility, we analyzed contact map profiles around TSS’s of genes that are repressed by RdDM (increased expression in ago4). We tested if there was enrichment for long distance connections (> 10kb) to RdDM sites (nrpe1 DMRs). Average contact signal in Col-0 is not enriched between RdDM affected genes and nrpe1 DMRs (Fig 4A) consistent with repression of contacts at nrpe1 DMRs in Col-0 (wild-type) (Fig 2). We next performed the same analysis in the ago4 Hi-C sample and observed an enrichment of contacts between differentially expressed genes and DMRs (Fig 4B). These interactions occur within 1–2 kb of the TSS’s on one end (y-axis) and are centered around nrpe1 DMRs on the other end (x-axis). This indicates that RdDM inhibits long-range chromosomal interactions between promoters of affected genes and RdDM sites similar to what we saw by 3C (Fig 2E). Because of the narrow range around the TSS, we conclude that this increase in interaction is probably not simply a result of a general change in chromatin organization, but is more likely caused by specific chromatin interactions between RdDM sites and promoters (Fig 4B). To further examine the effect of RdDM on promoter specific interactions, we examined the relative number of genes that are repressed by RdDM and are connected to nrpe1 DMRs in each genotype. Chromosomal interactions detected in nrpe1 and ago4 identified larger numbers of genes connected to nrpe1 DMRs than in Col-0 wild-type (Fig 4C). This increase was visible in all three biological repeats (dots), which shows that the observed effects are greater than the biological variation, although it should be noted that this is supported by a relatively small number of genes (S2 Table). However, no enrichment was observed when we analyzed random sets of genes (S4A Fig). It is noteworthy that this effect is greater in the pooled data (Fig 4C—grey bars) than in each of the replicates (Fig 4C—points) suggesting that the sequencing depth is a limiting factor in detecting these interactions. These data indicate that a subset of genes affected by RdDM preferentially interacts with distant genomic regions targeted by RdDM and that these interactions are repressed by RdDM in Col-0 wild-type.
Interestingly, ago4 had a stronger effect on long range chromosomal interactions than nrpe1 (Fig 4C, see also Fig 2D). We compared the effects of nrpe1 and ago4 mutation on gene expression and found a stronger effect in ago4 (Fig 4D, S4B Fig), which is consistent with the larger effect to chromatin interactions (Fig 4C). Furthermore, this indicates that although both Pol V and AGO4 are required for RdDM, nrpe1 and ago4 mutants may differentially affect chromatin and gene expression.
The observation that genes with increased expression in ago4 also gain interactions with RdDM targets (DMRs) suggests that RdDM may control gene expression by changing chromatin organization. To further test this model, we analyzed all protein coding genes and categorized them based on the presence or absence of chromosomal interactions (detectable in ago4) to direct RdDM targets (nrpe1 DMRs). We then separated out genes that show any increases in expression and genes that show any decreases in expression. Genes that interact with RdDM targets in ago4 were more likely to have increased expression in the ago4 mutant compared to genes which do not interact with RdDM targets (Fig 4E). Similarly, reductions in gene expression in ago4 were also more frequent among interacting genes, but this trend was not statistically significant (Fig 4F). Overall the effects on genes activated by RdDM (expression reduced in ago4) were lower than on genes repressed by RdDM (expression increased in ago4) (S4A Fig, Fig 4C). These observations further confirm the association between the effects of RdDM on long range chromosomal interaction and gene expression levels.
Changes in gene expression observed in the ago4 mutant may be explained by two non-mutually exclusive mechanisms (in addition to possible indirect effects). First, RdDM directly working on gene promoters or promoter proximal elements may affect transcription locally without the involvement of chromosomal interactions. Second, RdDM targets on distant regulatory regions may affect gene expression via chromosomal interactions. To distinguish between those possibilities, we tested if AGO4 preferentially localizes to both distant and promoter ends of inhibited interactions. For these purposes, we define distal regulatory regions as located at least three DpnII restriction sites away from the gene or its proximal regulatory region (1kb upstream of TSS) and connected by detectable chromosomal interactions from Hi-C in the ago4 mutant. Many regulated genes with inhibited interactions only had AGO4 on either the promoter or distal region suggesting that either is sufficient for gene regulation (Fig 4G). We also observed no prominent distance or orientation preference between genes and RdDM targets (S4C Fig). Although causality between RdDM-repressed looping and gene expression levels remains to be shown, this is consistent with RdDM affecting gene expression by a combination of local and long range effects.
Together, these results show that RdDM negatively affects long range chromosomal interactions between RdDM-repressed genes and distant direct RdDM targets. This could be explained by RdDM-mediated changes in gene expression affecting chromosomal interactions or RdDM affecting both gene expression and chromosomal interactions in parallel. We propose that RdDM contributes to gene repression by inhibiting interactions between genes and distant regulatory regions.
Discussion
We propose a model, where RdDM represses long range chromosomal interactions. In wild-type Col-0 plants, regions directly targeted by RdDM have high levels of DNA methylation, repressive histone modifications, and stabilized nucleosomes. At these sites, RdDM inhibits the formation of chromosomal interactions (Fig 5—wild-type). When RdDM is not active in the ago4 mutant or under specific environmental conditions, repressive chromatin marks / RdDM components are not present and chromosome interactions are more likely to occur (Fig 5—no RdDM). We further speculate that in some cases these interactions may bring distant enhancers with associated transcription factors to the proximity of gene promoters and affect gene expression.
Looping between specific chromosomal regions is a conserved process found by chromosome conformation capture from bacteria to mammals [50,51,55–57] and is believed to contribute to the regulation of gene expression [22,58,59]. In Drosophila, where enhancers have been identified genome-wide, gene expression correlates well with promoter—enhancer contacts [60]. In human cells, Capture Hi-C also found that promoters of active genes participate in chromatin interactions more than inactive genes [61]. Similarly, yet independently, it was found that sites with higher RNA Pol II signal are more likely to participate in chromatin interactions [62]. Our data are consistent with this mechanism existing also in plants, in that high levels of gene expression correspond to higher confidence chromatin interactions (Fig 3A and 3B) and that differential gene expression correlates with differential chromatin contacts (Fig 4A–4C).
Our model where RdDM inhibits chromatin interactions leads to the important question of which features of RdDM directly affect looping. This may be unknown or untested chromatin modifications; however, DNA methylation, H3K9me2 / H3K4me2, and nucleosome position by themselves cannot explain differences in chromatin interactions (Fig 2C and 2G and S2C Fig). It is more likely that chromosomal interactions are directly mediated by specific transcription factors or architectural proteins that are unable to bind in the presence of RdDM. Alternatively, transcription may be an important part of chromatin organization, and differential expression could directly alter chromatin interactions.
An interesting aspect of our gene expression analysis is the difference between nrpe1 and ago4 (Fig 4D, S4B Fig). This is especially well supported by simple interaction calling, where ago4 has a more prominent effect at RdDM sites than nrpe1 (Fig 2D and 2G). One possibility is that nrpe1 and ago4 affect chromatin modifications differently at some loci. This is consistent with AGO4 binding RNAs produced by RNA polymerases other than Pol V but not engaging in RdDM [63]. A more attractive explanation is that the act of Pol V transcription in the absence of AGO4 has an effect independent of repressive chromatin modifications. In this scenario Pol V transcripts may act to promote long distance interactions as has been suggested for non-coding RNA in other organisms [52,64]. Supporting this model, several proteins required for chromatin interactions such as Mediator, Cohesin, and CTCF have been shown to interact with RNA [65–67] and it is tempting to speculate that Pol V-produced lncRNA acts in a similar manner.
It is important to note that our analysis shows genome-wide trends. Therefore, substantial locus-specific variation is still expected to occur. For example, a recent report demonstrated that RdDM and other chromatin mechanisms may actually facilitate the formation of a specific chromosome loop between the methylated APOLO locus and the PID gene in response to a plant hormone auxin [68]. However, it should be noted that while this locus is methylated in wild-type (Col-0), the methylation status at the APOLO loop anchor is unchanged in RdDM mutants (nrpe1 and ago4) (S5 Fig). Furthermore, H3K9ac does not increase in the RdDM mutant at the APOLO loop anchor [68]. This suggests the presence of unknown contributors or compensators for the regulation of this locus aside from Pol V and AGO4. Additionally, the loop between APOLO and PID occurs at a distance of 3.5 kb apart, which is difficult to detect by Hi-C. Our study mainly focused on longer-range chromosomal interactions and may thereby miss shorter contacts. Thus, it is possible that RdDM may both inhibit and stabilize chromosomal interactions depending on their distance and/or other variables.
It should also be noted that chromosome conformation capture (Hi-C), relies on in vivo crosslinking and ligation, which are not fully understood and should be carefully interpreted. We attempted to mitigate some of the limitations of Hi-C by comparing biological repeats and pooled raw data, and by utilizing two different methods of interaction calling. We also performed comparative analysis with ChIP-seq, which involved a similar crosslinking step. Additionally, our findings do not rely only on wild-type conditions, but also include comparing different genetic backgrounds.
The results presented here indicate that RNA-mediated transcriptional silencing can affect several aspects of chromatin structure including not only chromatin modifications and nucleosome positioning but also long-range chromosomal interactions. This further demonstrates that RNA is a master regulator of chromatin structure.
Materials and methods
Plant material
Arabidopsis thaliana mutant lines nrpe1 (nrpd1b-11) and ago4 (ago4-1 introgressed into Col-0) were described previously [45,69,70]. Seedling tissue was used in all experiments and only datasets from the same tissue type were used in analysis.
3C and Hi-C sample preparation
Arabidopsis seedlings were grown for 2 weeks under long day conditions after which above ground tissue was harvested and cross-linked in 0.5% formaldehyde as previously described [71]. Nuclei were extracted using the same protocol as ChIP [71], followed by DpnII digestion, ligation, and purification of chromatin contacts similar to [72]. Following this protocol, nuclei were washed in 1.2x DpnII buffer, resuspended in 550 μl 1.2x DpnII buffer with 0.26% SDS, and then incubated for 20 min at 65°C followed by 20 min at 37°C with gentle shaking. Triton-X 100 was added to 1.6% with incubation for 60 min at 37°C. Chromatin was digested with 750 units DpnII overnight at 37°C. After digestion, samples were incubated in 1% SDS for 20 min at 65°C. Samples were then diluted in 10 ml 1x T4 DNA ligase buffer (NEB) supplemented with 0.75% Triton-X 100 and incubated at 37°C for 60 min. Ligation was performed with 600 units T4 DNA ligase for 5 hours at room temperature. Decrosslinking occurred with 600 μg Proteinase K at 65°C overnight, followed by addition of 300 μg RNase A for 30 min at 37°C. DNA was then isolated using phenol:chloroform. Libraries were prepared and sequenced by the University of Michigan Sequencing Core. Primers used in 3C can be found in S1 Table.
Hi-C data analysis
Each end of paired-end reads with unique alignment was mapped to the Arabidopsis genome (TAIR10) using Bowtie and then paired. Reads were assigned to 1 kb bins and significant intra-chromosomal interactions more than 3 kb apart were called by Fit-Hi-C with ICE (Fig 1B) [37,38]. These Fit-Hi-C calls were further filtered by estimating a false discovery rate (FDR). This FDR was calculated by permutating p-values such that in a random set of interactions with equal size only .05 were present at or below the p-value cutoff. In essence all called peaks had stringent enough p-values such that they would not be likely to appear in random sets. Alternatively, to obtain a looser view of interactions, the genome was divided into 250bp windows (bin-mapping) and interactions between windows were counted keeping only those with more than three DpnII sites apart (Fig 1B). This was done not only to account for non-uniform cleavage distances, but to allow overlap of uniform datasets such as H3K4me2/H3K9me2 bins or DNA methylation bins. The top 5% interactions (read counts) of the shortest bin distance, 4 bins apart, were kept and that maximum number was applied to all subsequent bins. This approach allowed keeping only the highest confidence short range interactions while maintaining long-range interaction events with a minimum of two reads supporting independent ligation events (S1C and S1D Fig). Interaction scores were calculated from the number of reads supporting the interaction between two 250bp bins multiplied by the ratio between the total number of reads to the total number of loops called in that sample. Pearson Correlation between replicates was done in 25 kb bins using contact counts between bins normalized by sequencing depth. Eigenvector was calculated at 1 Mb resolution using juicebox [34]. Telomere association was plotted using scores from inter-chromosomal interactions. Highly scoring interactions (value > = 10) were used for overlaps between features to compare between replicates unless otherwise indicated.
Overlaps with interaction counts were normalized to total interactions for each where applicable and we have provided the raw overlap counts and normalization scheme for each in S2 Table. Significance scores for each were calculated by a two-sided paired t-test among biological replicates. Genomic regions tested for overlap with Hi-C data were first filtered based on their mappability in Hi-C. This was done by keeping only those features with reads in the decrosslinked control from Wang et al.[29]. Filtered inactive and active regions were further checked to ensure similar digestion efficiency by comparing the read counts in the decrosslinked control (S1J Fig) [29]. This was to ensure that lower mappability was not affecting the results and to ensure that restriction digestion to get mapped reads in Hi-C was able to penetrate heterochromatic regions.
Interaction enrichment plots between promoters and DMRs (Fig 4A and 4B) were plotted by 1) Filtering out interactions <10 kb apart; 2) Keeping reads that overlap bins surrounding the TSS on one side and DMRs on the other; 3) Taking the average or sum of each interaction bin in the matrix of bins according to the distance from TSS and from DMR; 4) Repeating steps 1–3 with TSS’s connected to bins surrounding random regions to calculate a randomized average and subtracting these average values from the matrix. Z-axis values and color scores are then plotted from these values obtained, as a relative score from the minimum and maximum. Interaction overlaps between promoters of differential genes and distal nrpe1 DMRs was taken using an interaction score > = 5 to obtain as many overlapping high quality interactions as possible.
RNA-seq
RNA from ago4 seedlings was isolated and rRNA-depleted in three biological repeats as described [12] and libraries were prepared by the University of Michigan Sequencing Core. Reads were mapped to the TAIR10 genome assembly using Tophat [73] and differential expression was called using EdgeR [74]. Previously published RNA-seq datasets from seedlings (Col-0 wild-type and the nrpe1 mutant) [12] (GSE38464) were obtained from plants grown, harvested, isolated, rRNA-depleted, and sequenced in parallel to the ago4 dataset. Overlaps in differential expression were calculated from EdgeR and plotted as a weighted Venn diagram using the Venneuler package in R.
Overlap with transcription factors
DAP-seq transcription factor binding sites came from the GEO accession GSE60141 [53]. Fig 3C was generated by taking interactions overlapping promoters (1kb upstream of TSS) on one end and DAP-seq transcription factor peaks on the other or no DAP-seq peak for each transcription factor tested. The median RPKM values of genes interacting in each transcription factor DAP-seq reaction were then plotted as a boxplot and provided in S3 Table. The genome-wide median was produced from total genes.
SVP ChIP-seq data were downloaded from GEO accession GSE33120 [54]. Reads were then mapped to the TAIR10 genome and immunoprecipitation vs. control were counted in 250bp windows for comparison to Hi-C which was taken at a more stringent cutoff score of 15 to compensate for variations in ChIP-seq quality. TF binding sites were defined as at least four fold enrichment in sample vs. control. EdgeR was used to call differential expression for svp from GEO accession GSE32397 [54].
Other datasets used in this study
Datasets for AGO4 ChIP-seq came from GSE35381 [8], MNase-seq came from GSE38401 [12] and ChIP-seq for histone modifications came from GSE37644 and GSE49090 [40,41]. DNA methylation data came from GSE39901 [13] and nrpe1 DMR calls came from [14].
Supporting information
Acknowledgments
We thank Gyorgyi Csankovszki for critical reading of the manuscript.
Data Availability
All high throughput sequencing datasets are available in GEO under accession number GSE64389.
Funding Statement
This work was supported by grants from the National Science Foundation (MCB 1120271) and the National Institutes of Health (R01GM108722) to ATW. MJR was partially supported by the University of Michigan Genetics Training Program (T32-GM07544) and by the Ruth L. Kirschstein National Research Service Award F32 GM113570. GB was supported by the Austrian Science Fund (FWF) fellowship J3199-B09. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding agencies. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Matzke MA, Mosher RA. RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat Rev Genet. 2014;15: 394–408. 10.1038/nrg3683 [DOI] [PubMed] [Google Scholar]
- 2.Weinberg MS, Morris KV. Transcriptional gene silencing in humans. Nucleic Acids Research. 2016;44: 6505–6517. 10.1093/nar/gkw139 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Haag JR, Pikaard CS. Multisubunit RNA polymerases IV and V: purveyors of non-coding RNA for plant gene silencing. Nat Rev Mol Cell Biol. 2011;12: 483–492. 10.1038/nrm3152 [DOI] [PubMed] [Google Scholar]
- 4.Li S, Vandivier LE, Tu B, Gao L, Won SY, Li S, et al. Detection of Pol IV/RDR2-dependent transcripts at the genomic scale in Arabidopsis reveals features and regulation of siRNA biogenesis. Genome Res. 2015;25: 235–245. 10.1101/gr.182238.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Haag JR, Ream TS, Marasco M, Nicora CD, Norbeck AD, Pasa-Tolic L, et al. In vitro transcription activities of Pol IV, Pol V, and RDR2 reveal coupling of Pol IV and RDR2 for dsRNA synthesis in plant RNA silencing. Mol Cell. 2012;48: 811–818. 10.1016/j.molcel.2012.09.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wierzbicki AT, Haag JR, Pikaard CS. Noncoding transcription by RNA polymerase Pol IVb/Pol V mediates transcriptional silencing of overlapping and adjacent genes. Cell. 2008;135: 635–648. 10.1016/j.cell.2008.09.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wierzbicki AT, Ream TS, Haag JR, Pikaard CS. RNA polymerase V transcription guides ARGONAUTE4 to chromatin. Nat Genet. 2009;41: 630–634. 10.1038/ng.365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zheng Q, Rowley MJ, Böhmdorfer G, Sandhu D, Gregory BD, Wierzbicki AT. RNA polymerase V targets transcriptional silencing components to promoters of protein-coding genes. Plant J. 2012; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rowley MJ, Avrutsky MI, Sifuentes CJ, Pereira L, Wierzbicki AT. Independent chromatin binding of ARGONAUTE4 and SPT5L/KTF1 mediates transcriptional gene silencing. PLoS Genet. 2011;7: e1002120 10.1371/journal.pgen.1002120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ausin I, Mockler TC, Chory J, Jacobsen SE. IDN1 and IDN2 are required for de novo DNA methylation in Arabidopsis thaliana. Nat Struct Mol Biol. 2009;16: 1325–1327. 10.1038/nsmb.1690 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bies-Etheve N, Pontier D, Lahmy S, Picart C, Vega D, Cooke R, et al. RNA-directed DNA methylation requires an AGO4-interacting member of the SPT5 elongation factor family. EMBO Rep. 2009;10: 649–654. 10.1038/embor.2009.31 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhu Y, Rowley MJ, Böhmdorfer G, Wierzbicki AT. A SWI/SNF Chromatin-Remodeling Complex Acts in Noncoding RNA-Mediated Transcriptional Silencing. Mol Cell. 2013;49: 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Stroud H, Greenberg MVC, Feng S, Bernatavichute YV, Jacobsen SE. Comprehensive analysis of silencing mutants reveals complex regulation of the Arabidopsis methylome. Cell. 2013;152: 352–364. 10.1016/j.cell.2012.10.054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Böhmdorfer G, Rowley MJ, Kuciński J, Zhu Y, Amies I, Wierzbicki AT. RNA-directed DNA methylation requires stepwise binding of silencing factors to long non-coding RNA. The Plant Journal. 2014;79: 181–191. 10.1111/tpj.12563 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhong X, Du J, Hale CJ, Gallego-Bartolome J, Feng S, Vashisht AA, et al. Molecular mechanism of action of plant DRM de novo DNA methyltransferases. Cell. 2014;157: 1050–1060. 10.1016/j.cell.2014.03.056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gao Z, Liu H-L, Daxinger L, Pontes O, He X, Qian W, et al. An RNA polymerase II- and AGO4-associated protein acts in RNA-directed DNA methylation. Nature. 2010;465: 106–109. 10.1038/nature09025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kuhlmann M, Mette MF. Developmentally non-redundant SET domain proteins SUVH2 and SUVH9 are required for transcriptional gene silencing in Arabidopsis thaliana. Plant Mol Biol. 2012;79: 623–633. 10.1007/s11103-012-9934-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Han Y-F, Dou K, Ma Z-Y, Zhang S-W, Huang H-W, Li L, et al. SUVR2 is involved in transcriptional gene silencing by associating with SNF2-related chromatin-remodeling proteins in Arabidopsis. Cell Res. 2014;24: 1445–1465. 10.1038/cr.2014.156 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhong X, Hale CJ, Law JA, Johnson LM, Feng S, Tu A, et al. DDR complex facilitates global association of RNA polymerase V to promoters and evolutionarily young transposons. Nat Struct Mol Biol. 2012;19: 870–875. 10.1038/nsmb.2354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zemach A, Kim MY, Hsieh P-H, Coleman-Derr D, Eshed-Williams L, Thao K, et al. The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell. 2013;153: 193–205. 10.1016/j.cell.2013.02.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Sexton T, Yaffe E, Kenigsberg E, Bantignies F, Leblanc B, Hoichman M, et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell. 2012;148: 458–472. 10.1016/j.cell.2012.01.010 [DOI] [PubMed] [Google Scholar]
- 22.Pennacchio LA, Bickmore W, Dean A, Nobrega MA, Bejerano G. Enhancers: five essential questions. Nat Rev Genet. 2013;14: 288–295. 10.1038/nrg3458 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mohan M, Bartkuhn M, Herold M, Philippen A, Heinl N, Bardenhagen I, et al. The Drosophila insulator proteins CTCF and CP190 link enhancer blocking to body patterning. EMBO J. 2007;26: 4203–4214. 10.1038/sj.emboj.7601851 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sanyal A, Lajoie BR, Jain G, Dekker J. The long-range interaction landscape of gene promoters. Nature. 2012;489: 109–113. 10.1038/nature11279 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Rowley MJ, Corces VG. The three-dimensional genome: principles and roles of long-distance interactions. Current Opinion in Cell Biology. 2016;40: 8–14. 10.1016/j.ceb.2016.01.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Jin F, Li Y, Dixon JR, Selvaraj S, Ye Z, Lee AY, et al. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature. 2013;503: 290–294. 10.1038/nature12644 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Van Bortle K, Corces VG. The role of chromatin insulators in nuclear architecture and genome function. Curr Opin Genet Dev. 2013;23: 212–218. 10.1016/j.gde.2012.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Grob S, Schmid MW, Grossniklaus U. Hi-C analysis in Arabidopsis identifies the KNOT, a structure with similarities to the flamenco locus of Drosophila. Mol Cell. 2014;55: 678–693. 10.1016/j.molcel.2014.07.009 [DOI] [PubMed] [Google Scholar]
- 29.Wang C, Liu C, Roqueiro D, Grimm D, Schwab R, Becker C, et al. Genome-wide analysis of local chromatin packing in Arabidopsis thaliana. Genome Res. 2014; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lee J-Y, Colinas J, Wang JY, Mace D, Ohler U, Benfey PN. Transcriptional and posttranscriptional regulation of transcription factor expression in Arabidopsis roots. Proc Natl Acad Sci USA. 2006;103: 6055–6060. 10.1073/pnas.0510607103 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326: 289–293. 10.1126/science.1181369 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Belton J-M, McCord RP, Gibcus JH, Naumova N, Zhan Y, Dekker J. Hi-C: a comprehensive technique to capture the conformation of genomes. Methods. 2012;58: 268–276. 10.1016/j.ymeth.2012.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Li L, Lyu X, Hou C, Takenaka N, Nguyen HQ, Ong C-T, et al. Widespread rearrangement of 3D chromatin organization underlies polycomb-mediated stress-induced silencing. Mol Cell. 2015;58: 216–231. 10.1016/j.molcel.2015.02.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159: 1665–1680. 10.1016/j.cell.2014.11.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485: 376–380. 10.1038/nature11082 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Crane E, Bian Q, McCord RP, Lajoie BR, Wheeler BS, Ralston EJ, et al. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature. 2015;523: 240–244. 10.1038/nature14450 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Imakaev M, Fudenberg G, McCord RP, Naumova N, Goloborodko A, Lajoie BR, et al. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat Methods. 2012;9: 999–1003. 10.1038/nmeth.2148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ay F, Bailey TL, Noble WS. Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts. Genome Res. 2014;24: 999–1011. 10.1101/gr.160374.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Feng S, Cokus SJ, Schubert V, Zhai J, Pellegrini M, Jacobsen SE. Genome-wide Hi-C Analyses in Wild-Type and Mutants Reveal High-Resolution Chromatin Interactions in Arabidopsis. Molecular Cell. 2014;55: 694–707. 10.1016/j.molcel.2014.07.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Greenberg MVC, Deleris A, Hale CJ, Liu A, Feng S, Jacobsen SE. Interplay between active chromatin marks and RNA-directed DNA methylation in Arabidopsis thaliana. PLoS Genet. 2013;9: e1003946 10.1371/journal.pgen.1003946 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Moissiard G, Cokus SJ, Cary J, Feng S, Billi AC, Stroud H, et al. MORC family ATPases required for heterochromatin condensation and gene silencing. Science. 2012;336: 1448–1451. 10.1126/science.1221472 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Grob S, Schmid MW, Luedtke NW, Wicker T, Grossniklaus U. Characterization of chromosomal architecture in Arabidopsis by chromosome conformation capture. Genome Biol. 2013;14: R129 10.1186/gb-2013-14-11-r129 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Pontier D, Yahubyan G, Vega D, Bulski A, Saez-Vasquez J, Hakimi M-A, et al. Reinforcement of silencing at transposons and highly repeated sequences requires the concerted action of two distinct RNA polymerases IV in Arabidopsis. Genes Dev. 2005;19: 2030–2040. 10.1101/gad.348405 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kanno T, Huettel B, Mette MF, Aufsatz W, Jaligot E, Daxinger L, et al. Atypical RNA polymerase subunits required for RNA-directed DNA methylation. Nat Genet. 2005;37: 761–765. 10.1038/ng1580 [DOI] [PubMed] [Google Scholar]
- 45.Onodera Y, Haag JR, Ream T, Costa Nunes P, Pontes O, Pikaard CS. Plant nuclear RNA polymerase IV mediates siRNA and DNA methylation-dependent heterochromatin formation. Cell. 2005;120: 613–622. 10.1016/j.cell.2005.02.007 [DOI] [PubMed] [Google Scholar]
- 46.Zilberman D, Cao X, Jacobsen SE. ARGONAUTE4 control of locus-specific siRNA accumulation and DNA and histone methylation. Science. 2003;299: 716–719. 10.1126/science.1079695 [DOI] [PubMed] [Google Scholar]
- 47.Wang J, Lan X, Hsu P-Y, Hsu H-K, Huang K, Parvin J, et al. Genome-wide analysis uncovers high frequency, strong differential chromosomal interactions and their associated epigenetic patterns in E2-mediated gene regulation. BMC Genomics. 2013;14: 70 10.1186/1471-2164-14-70 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fu Y, Sinha M, Peterson CL, Weng Z. The insulator binding protein CTCF positions 20 nucleosomes around its binding sites across the human genome. PLoS Genet. 2008;4: e1000138 10.1371/journal.pgen.1000138 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kalhor R, Tjong H, Jayathilaka N, Alber F, Chen L. Genome architectures revealed by tethered chromosome conformation capture and population-based modeling. Nat Biotechnol. 2012;30: 90–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Sexton T, Yaffe E, Kenigsberg E, Bantignies F, Leblanc B, Hoichman M, et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell. 2012;148: 458–472. 10.1016/j.cell.2012.01.010 [DOI] [PubMed] [Google Scholar]
- 51.Jin F, Li Y, Dixon JR, Selvaraj S, Ye Z, Lee AY, et al. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature. 2013;503: 290–294. 10.1038/nature12644 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Plank JL, Dean A. Enhancer Function: Mechanistic and Genome-Wide Insights Come Together. Molecular Cell. 2014;55: 5–14. 10.1016/j.molcel.2014.06.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.O’Malley RC, Huang S-SC, Song L, Lewsey MG, Bartlett A, Nery JR, et al. Cistrome and Epicistrome Features Shape the Regulatory DNA Landscape. Cell. 2016;165: 1280–1292. 10.1016/j.cell.2016.04.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Gregis V, Andrés F, Sessa A, Guerra RF, Simonini S, Mateos JL, et al. Identification of pathways directly regulated by SHORT VEGETATIVE PHASE during vegetative and reproductive development in Arabidopsis. Genome Biol. 2013;14: R56 10.1186/gb-2013-14-6-r56 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Cagliero C, Grand RS, Jones MB, Jin DJ, O’Sullivan JM. Genome conformation capture reveals that the Escherichia coli chromosome is organized by replication and transcription. Nucleic Acids Res. 2013;41: 6058–6071. 10.1093/nar/gkt325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Hou C, Li L, Qin ZS, Corces VG. Gene density, transcription, and insulators contribute to the partition of the Drosophila genome into physical domains. Mol Cell. 2012;48: 471–484. 10.1016/j.molcel.2012.08.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Duan Z, Andronescu M, Schutz K, McIlwain S, Kim YJ, Lee C, et al. A three-dimensional model of the yeast genome. Nature. 2010;465: 363–367. 10.1038/nature08973 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Harmston N, Lenhard B. Chromatin and epigenetic features of long-range gene regulation. Nucleic Acids Res. 2013;41: 7185–7199. 10.1093/nar/gkt499 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Van Bortle K, Corces VG. The role of chromatin insulators in nuclear architecture and genome function. Curr Opin Genet Dev. 2013;23: 212–218. 10.1016/j.gde.2012.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Cubeñas-Potts C, Rowley MJ, Lyu X, Li G, Lei EP, Corces VG. Different enhancer classes in Drosophila bind distinct architectural proteins and mediate unique chromatin interactions and 3D architecture. Nucleic Acids Res. 2016; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Mifsud B, Tavares-Cadete F, Young AN, Sugar R, Schoenfelder S, Ferreira L, et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nature Genetics. 2015;47: 598–606. 10.1038/ng.3286 [DOI] [PubMed] [Google Scholar]
- 62.Li G, Ruan X, Auerbach RK, Sandhu KS, Zheng M, Wang P, et al. Extensive Promoter-Centered Chromatin Interactions Provide a Topological Basis for Transcription Regulation. Cell. 2012;148: 84–98. 10.1016/j.cell.2011.12.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Böhmdorfer G, Sethuraman S, Rowley MJ, Krzyszton M, Rothi MH, Bouzit L, et al. Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin. Elife. 2016;5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Nwigwe IJ, Kim YJ, Wacker DA, Kim TH. Boundary Associated Long Noncoding RNA Mediates Long-Range Chromosomal Interactions. PLoS ONE. 2015;10: e0136104 10.1371/journal.pone.0136104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Li W, Notani D, Ma Q, Tanasa B, Nunez E, Chen AY, et al. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature. 2013;498: 516–520. 10.1038/nature12210 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Kung JT, Kesner B, An JY, Ahn JY, Cifuentes-Rojas C, Colognori D, et al. Locus-Specific Targeting to the X Chromosome Revealed by the RNA Interactome of CTCF. Mol Cell. 2015;57: 361–375. 10.1016/j.molcel.2014.12.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lai F, Orom UA, Cesaroni M, Beringer M, Taatjes DJ, Blobel GA, et al. Activating RNAs associate with Mediator to enhance chromatin architecture and transcription. Nature. 2013;494: 497–501. 10.1038/nature11884 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ariel F, Jegu T, Latrasse D, Romero-Barrios N, Christ A, Benhamed M, et al. Noncoding Transcription by Alternative RNA Polymerases Dynamically Regulates an Auxin-Driven Chromatin Loop. Molecular Cell. 2014;55: 383–396. 10.1016/j.molcel.2014.06.011 [DOI] [PubMed] [Google Scholar]
- 69.Zilberman D, Cao X, Jacobsen SE. ARGONAUTE4 control of locus-specific siRNA accumulation and DNA and histone methylation. Science. 2003;299: 716–719. 10.1126/science.1079695 [DOI] [PubMed] [Google Scholar]
- 70.Wierzbicki AT, Ream TS, Haag JR, Pikaard CS. RNA polymerase V transcription guides ARGONAUTE4 to chromatin. Nat Genet. 2009;41: 630–634. 10.1038/ng.365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Rowley MJ, Böhmdorfer G, Wierzbicki AT. Analysis of long non-coding RNAs produced by a specialized RNA polymerase in Arabidopsis thaliana. Methods. 2013;63: 160–169. 10.1016/j.ymeth.2013.05.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Louwers M, Splinter E, van Driel R, de Laat W, Stam M. Studying physical chromatin interactions in plants using Chromosome Conformation Capture (3C). Nature Protocols. 2009;4: 1216–1229. 10.1038/nprot.2009.113 [DOI] [PubMed] [Google Scholar]
- 73.Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology. 2013;14: R36 10.1186/gb-2013-14-4-r36 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26: 139–140. 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wierzbicki AT, Cocklin R, Mayampurath A, Lister R, Rowley MJ, Gregory BD, et al. Spatial and functional relationships among Pol V-associated loci, Pol IV-dependent siRNAs, and cytosine methylation in the Arabidopsis epigenome. Genes Dev. 2012;26: 1825–1836. 10.1101/gad.197772.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All high throughput sequencing datasets are available in GEO under accession number GSE64389.