Abstract
Hypoxia‐inducible factor (HIF) directs an extensive transcriptional cascade that transduces numerous adaptive responses to hypoxia. Pan‐genomic analyses, using chromatin immunoprecipitation and transcript profiling, have revealed large numbers of HIF‐binding sites that are generally associated with hypoxia‐inducible transcripts, even over long chromosomal distances. However, these studies do not define the specific targets of HIF‐binding sites and do not reveal how induction of HIF affects chromatin conformation over distantly connected functional elements. To address these questions, we deployed a recently developed chromosome conformation assay that enables simultaneous high‐resolution analyses from multiple viewpoints. These assays defined specific long‐range interactions between intergenic HIF‐binding regions and one or more promoters of hypoxia‐inducible genes, revealing the existence of multiple enhancer–promoter, promoter–enhancer, and enhancer–enhancer interactions. However, neither short‐term activation of HIF by hypoxia, nor long‐term stabilization of HIF in von Hippel–Lindau (VHL)‐defective cells greatly alters these interactions, indicating that at least under these conditions, HIF can operate on preexisting patterns of chromatin–chromatin interactions that define potential transcriptional targets and permit rapid gene activation by hypoxic stress.
Keywords: Capture‐C, chromatin, cis‐interaction, HIF, Hypoxia
Subject Categories: Chromatin, Epigenetics, Genomics & Functional Genomics; Transcription
Introduction
Hypoxia‐inducible factor (HIF) is the key transcription factor regulating transcriptional responses to hypoxia and is central to the pathogenesis of many human diseases including cancer. The DNA binding complex compromises an oxygen‐regulated alpha‐subunit, which dimerizes with a constitutively expressed beta‐subunit and binds to cis‐acting hypoxia response elements containing a core “RCGTG” consensus sequence 1, 2. Activation of HIF directs a broad range of inducible responses, which vary according to cellular and pathophysiological context 3, raising questions as to how different responses are specified.
Recent advances in high‐throughput DNA sequencing have facilitated pan‐genomic analyses of the architecture of HIF transcriptional response and are beginning to shed light on these processes 4, 5, 6, 7. These studies have revealed hundreds to thousands of HIF‐binding sites across the genome. As expected, a proportion of these sites clusters with annotated promoters. However, approximately 30% of HIF‐1 & 50% of HIF‐2 binding sites are more than 10 kb away from their nearest annotated gene 7. Tests of association with hypoxia‐inducible transcripts, such as gene set enrichment analyses (GSEA), have demonstrated clear statistical association with positive effects on gene expression, suggesting that at least some of these loci are hypoxia‐inducible transcriptional enhancers 7. However, with the exception of a small number of loci 8, 9, physical association with a hypoxia‐inducible promoter has not been defined. Nor is it known whether any such associations are induced following exposure of cells to hypoxia and/or the binding of HIF to DNA or whether they are present prior to the induction of HIF and contribute to the cell‐type‐specific operation of the HIF transcriptional response. These questions are of interest both mechanistically, since hypoxia has been reported to affect chromatin structure 10, 11, 12, 13, and medically, since specific therapeutic modulation (both positive and negative) of components of the hypoxic transcriptional response is a major focus for pharmaceutical development 14, 15.
To address these, and related questions as to the number and range of interactions made by HIF‐binding sequences under different inducing conditions, we deployed a recently developed multiplexed assay of chromatin conformation 16, 17 in normoxic and hypoxic cells, and in renal cancer cells in which HIF is activated in normoxia by inactivation of the von Hippel–Lindau (pVHL) E3 ubiquitin ligase 18. This technology enabled the detection of an average of approximately 6,000 unique ligation junctions at each of 38 viewpoints overlapping selected promoter‐distant HIF‐binding sites and gene promoters. Our findings revealed that these intergenic HIF‐binding regions commonly interact with the promoters of hypoxia‐inducible genes. They demonstrated cis‐acting chromatin interactions from HIF‐binding regions to multiple types of DNA element, including both the promoters of hypoxia‐inducible genes and other distant elements over distances of up to 250 kb. The findings also revealed that most intergenic regions that bind HIF in hypoxic cells bear the marks of active enhancers in normoxic cells and that cis‐interactions between these sites and hypoxia responsive promoters are already established, with few changes occurring during induction of the HIF response by hypoxia. Taken together with other recent analyses of the HIF response, the results support a model of gene regulation whereby many HIF‐binding sites and their target promoters are accessible, functionally annotated, active, and associated in close proximity prior to HIF binding, ready to effect a rapid transcriptional response through release of promoter‐paused RNApol2.
Results and Discussion
Chromatin environment at distal HIF‐binding sites
To investigate HIF‐binding sites that are distant from annotated gene promoters or transcribed regions of the genome 7, we first defined whether these distal sites carry histone modifications consistent with enhancers rather than unidentified promoters. ChIP‐seq was performed in MCF‐7 cells, incubated in normoxia or hypoxia (0.5% oxygen, 16 h), using antibodies to H3K4me1 and H3K27ac, and analyzed together with RNA‐seq and H3K4me3, RNApol2 and HIF ChIP‐seq datasets obtained previously under similar conditions 4 as well as DNase‐seq analyses published by ENCODE (GSE32970) (Fig 1A–N). Importantly, HIF binding was highly inducible by hypoxia, with almost no detectable levels of normoxic HIF‐1β binding (Fig 1A and B). HIF‐binding sites that were more than 10 kb from annotated transcriptional start sites were associated with negligible levels of RNA expression compared with gene bodies (Fig 1C and D). We were unable to confidently observe eRNAs, most probably because the total RNA‐seq data were not sequenced to sufficient depth. These promoter‐distal HIF‐binding sites were highly accessible compared to promoters and were strongly enriched for the enhancer mark H3K4me1, with lower levels of H3K4me3 19, 20 (Fig 1E–J).
Although levels of H3K4me1 were unchanged at promoter‐distal HIF‐binding sites in hypoxia (Fig 1G), H3K27ac and RNApol2 signals were increased (Fig 1K and M) consistent with increased enhancer activity in hypoxia 21. However, H3K27ac and RNApol2 were also present at these sites in normoxia before HIF binds. Similarly, RNApol2 was present across the gene bodies of hypoxia‐regulated genes (Fig 1N) indicating that they are transcribed, at lower levels (Fig 1D), before HIF is stabilized in hypoxia. This supports a model in which promoter‐distant HIF binding can occur at enhancers that are already partially active in normoxic cells.
Chromatin interactions with HIF‐bound enhancers
To determine how these putative HIF‐bound enhancers act on their target promoters, we next examined chromatin cis‐interactions with a subset of promoter‐distant, HIF‐binding sites (Appendix Table S1). As expected, these co‐localized with ChIP‐seq signals for H3K4me1 and H3K27ac (e.g., Fig 2A). We first studied hypoxic MCF‐7 cells and analyzed cis‐interactions with 18 intergenic HIF‐binding sites that were > 10 kb (median 74 kb, mean 125 kb) away from any annotated promoter, using Capture‐C 16, 17. Capture‐C is a derivative of the chromatin conformation capture (3C) technique 22, coupled with oligonucleotide enrichment and high‐throughput sequencing. Capture‐C allows discovery of previously unknown distant interacting elements (promoters, enhancers, CTCF sites, etc.) from multiple “viewpoints” (in this case HIF‐binding sites) in parallel and at high resolution. Each experiment was performed in duplicate and demonstrated a high degree of reproducibility (Appendix Fig S1).
At 14 of the 18 sites, strong enrichment of sequences at the “viewpoint” site was observed confirming efficient capture; sites with poor capture efficiencies were excluded from further analysis. Regions that had a re‐ligation frequency that was significantly greater than a background, modeled on the average distance‐dependent signal decay, were defined as sites of significant cis‐interaction. Each HIF‐binding site showed significant interaction with an average of seven sites (Fig 2B). These cis‐interacting sites were then correlated with UCSC gene annotations and ChIP‐seq analyses of H3K4me3, H3K4me1, H3K27ac, RNApol2, and CTCF binding to define functionally annotated regions. Approximately 50% of cis‐interacting sites contained one or more of the above marks (Fig 2B). Cis‐interacting elements lay predominantly within 250 kb of the HIF‐binding site (Fig 2C) and all were located within the same topologically associating domain (TAD) 23.
Cis‐interactions were identified between HIF‐bound enhancers and at least one annotated gene promoter lying up to 150 kb from the HIF‐binding site at nine of the 14 HIF‐binding sites assayed. Notably, this was not always the nearest gene. At three sites, cis‐interaction with the promoter of a hypoxia‐regulated gene was observed, although the number of interactions was below the level required to be statistically significant. At only two out of 14 HIF‐binding sites was no association with the promoter of a gene detected. Both had evidence for strong HIF binding, although the closest hypoxia‐regulated gene was > 1.5 Mb away. Whether these two sites represent a distinct functional class of HIF‐binding sites will require further analysis. Three sites where cis‐interactions with more than one promoter were seen were all associated with a co‐regulated coding and noncoding gene pairing (e.g., Fig 2A).
Genes whose promoters interacted with distant HIF‐binding sites were enriched among hypoxia‐upregulated transcripts in RNA‐seq analyses as determined by Gene Set Enrichment Analysis (GSEA) 24, using a ranking metric combining fold‐change in hypoxia and significance 25 (Fig 2D). By comparison, adjacent genes whose promoters did not interact were not enriched for hypoxic regulation (Fig 2E) despite the presence of H3K4me3, RNApol2, and RNA‐seq signal at these gene loci. This suggests that the observed chromatin–chromatin interactions are functional and that Capture‐C may be used to predict the target promoter(s) of intergenic HIF binding elements, even when the genes lie at large distances and are not necessarily the closest locus to the HIF‐binding element. It also provides clear evidence, across multiple gene loci that promoter‐distant HIF‐binding sites can physically interact with hypoxia‐regulated gene promoters.
In addition to showing association with gene promoters, HIF‐binding sites also interacted with an average of approximately three additional distant enhancer elements as characterized by the presence of H3K4me1 and H3K27ac (Fig 2B and Appendix Fig S2). This suggests more extensive convergence between HIF signaling and additional transcriptional regulatory pathways than is apparent from examining sequences local to the HIF‐binding site.
Chromatin interactions with HIF‐regulated promoters
In addition to observing frequent weak ChIP‐seq signal at sites interacting with a strong peak, several gene loci (e.g., NDRG1, DDIT4, ERRFI1, EGLN3) harbored multiple strong signals of roughly equal intensity, each of which contained a consensus HRE motif. These are among the most hypoxia‐upregulated genes in MCF‐7 cells (ranked 8, 28, 68, and 35, respectively) 4. This raised the question as to whether the promoters of these genes are able to interact with more than one HIF‐binding site. To answer this, we performed Capture‐C using the promoters of these genes as “viewpoint” sites. This identified clear association between the promoters of these genes and multiple HIF‐binding sites, both upstream and downstream of the promoter (Fig EV1). As multiple HIF‐bound enhancers are brought into close proximity with the promoters of these genes, this indicates that more than one HIF‐binding site may contribute to the regulation of a single gene, perhaps contributing to greater amplitude of regulation by hypoxia.
Glycolysis is one of the best‐characterized HIF‐regulated pathways, with at least one isoenzyme catalyzing each step of the pathway being a HIF target gene 4 (Appendix Fig S3). Unexpectedly, inspection of HIF ChIP‐seq signals revealed that these gene loci bound both HIF‐1 and HIF‐2 with similar intensity, at or very close to their promoter, although all but one gene was exclusively regulated by HIF‐1 (Fig EV2A–C). Interestingly, the exception, HK1, has a HIF‐binding site within an intron of its gene, which although intragenic is (unusually for glycolytic genes) approximately 50 kb from the promoter (Appendix Fig S3). This raised questions as to whether functional selectivity among bound HIF isoforms might be a general function of distance from a promoter. To test this, we interrogated our genomewide datasets 4, 7. This analysis revealed a clear difference between promoter‐proximal and promoter‐distal sites that bound both HIF isoforms. Promoters bound by both HIF‐1 and HIF‐2 were almost entirely upregulated by HIF‐1 alone, whereas promoter‐distant HIF‐binding sites were found to be associated with genes that were upregulated by either or both HIF‐1 and HIF‐2 (Tables EV1 and EV2 and Fig EV3).
Since HIF binding at glycolytic gene loci is strongly promoter associated (Fig EV2C), we next determined the distribution of other enhancers at these loci. Despite the promoter‐proximal nature of HIF binding, Capture‐C, using the promoters of these genes as “viewpoint” sites, identified frequent interactions with more distant enhancers (Fig EV2D), indicating that the pattern of HIF binding observed does not merely reflect short‐range interactions at these loci.
HIF‐binding sites are precomplexed with interacting elements before HIF binds
There are reports that certain inducible transcription factors bind to preexisting chromatin complexes 26, 27, 28, 29, while others induce new patterns of interaction 30, 31, 32, 33. However, there have been very few studies that examine the question at high resolution and at scale. Since HIF‐bound enhancers and HIF‐regulated promoters bear active histone modifications prior to HIF stabilization, we next wished to determine whether and to what extent cis‐interactions between these sites were pre‐established or to what extent they were induced by HIF binding. We therefore performed Capture‐C on MCF‐7 cells, incubated in normoxia or hypoxia using the same capture probes to HIF‐binding sites as above. The interaction frequencies in hypoxia correlated strongly with those in normoxia (Fig 3A and B). Only about 3% of cis‐interactions were significantly altered in frequency following stabilization of HIF, and the magnitude of these changes was small (1.4‐ to 1.9‐fold). Thus, induction of HIF does not greatly affect the pattern or strength of chromatin–chromatin interactions from HIF‐bound sequences. To further assess the effect of HIF activation on chromatin structure, we also used ChIP‐seq to examine binding of CTCF, a protein that has been shown to coordinate chromatin architecture 34. Consistent with our Capture‐C analysis, CTCF binding flanking HIF‐binding sites was not significantly altered by hypoxia (Fig 3C).
To extend these observations, we next examined cis‐interactions from HIF‐binding sites in the renal cancer cell line 786‐O. This cell line lacks functional pVHL, which is a common and early event, critical to the development of kidney cancer, and leads to long‐term constitutive activation of HIF. To study the effect of short‐term hypoxia in this context, wild‐type VHL was stably re‐introduced (786‐O+VHL) and cells were studied in both normoxia and hypoxia. Again, we found chromatin interactions to be largely pre‐established before hypoxic stabilization of HIF (Fig EV4). Finally, we examined whether long‐term HIF stabilization resulting from VHL inactivation might alter cis‐interactions with these HIF‐binding sites by comparing Capture‐C signals in VHL‐reconstituted (786‐O+VHL) and defective (786‐O‐VHL) 786‐O cells (Appendix Fig S4). For the large part, interaction frequencies were comparable in VHL‐reconstituted and VHL‐deficient cells, with the exception of reduced interaction with the promoter of TSC22D2 in VHL‐reconstituted cells versus VHL‐defective cells, which correlates with a lack of hypoxic inducibility of this VHL‐regulated gene in these cells 35, 36. Thus, neither short‐term activation of HIF by hypoxia, nor its long‐term stabilization by VHL inactivation greatly alters chromatin interactions with HIF‐binding sites.
Despite the potential for both HIF and hypoxia to modify chromatin through multiple interactions with histone and chromatin‐modifying proteins 10, 11, 12, neither short‐term activation by hypoxia nor long‐term activation in VHL‐defective cells appears to substantially alter patterns of chromatin conformation local to HIF‐binding sites, at least as identified in these assays and in these cancer cell lines. Taken together, this reveals a model of induction of the HIF pathway in which the chromatin structure between HIF‐binding sites and HIF‐activated promoters is present prior to HIF induction, distant active enhancer elements being precomplexed with target promoters that are themselves preloaded with paused RNApol2. The processes defining the chromatin structure in normoxic cells are unclear, but likely include DNA binding and structural proteins, including the cohesin complex, that are involved in the establishment and maintenance of the chromatin landscape within which HIF operates and which are thus important in determining the shape of the hypoxia response.
Cell‐type‐dependent patterns of chromatin interaction with HIF‐binding sites
Both the binding of HIF and its transcriptional targets vary considerably between cell types 37. We therefore examined the extent to which these cell‐type differences are reflected by differences in chromatin interactions by comparing Capture‐C signals in hypoxic MCF‐7 cells and in the VHL‐defective 786‐O cell line, in which HIF is constitutively stable (Fig 4A). This analysis showed that approximately 20% of cis‐interacting sites differed significantly between the two cell types (1.2‐ to 6.0‐fold), although many other cis‐interactions were preserved. This result is in contradistinction to the roughly 3% of cis‐interacting sites that differed between normoxia and hypoxia in MCF‐7 cells (Figs 3B and 4B).
Sites that bound HIF in both cell lines interacted with the same promoter in each cell type, suggesting that the transcriptional target of a specific binding site may be independent of cell type. Conversely, sites that bound HIF in one cell line, but not the other (Appendix Table S1), showed a more heterogeneous picture of cis‐interactions (Fig EV5). In particular, significantly reduced association with the target promoter was observed, at three loci, in the cell type, which did not bind HIF at the “viewpoint” enhancer locus (versus that in the HIF‐binding cell type; Fig 4C). However, at the remaining sites, cis‐interaction with the target promoter was clearly preserved in the cell type that lacked HIF binding (Fig 4D). These findings support a model in which preexisting HIF‐independent patterns of chromatin interaction define potential HIF targets, but also identify a second mechanism of selectivity, whereby additional factors contribute to cell‐type‐specific HIF binding at common preformed active chromatin complexes. The marked renal tissue specificity of pVHL‐associated cancer 38 has focused attention on specific components of the HIF pathway that might be oncogenic in this context. Interestingly, two genes that have been defined as contributors to VHL‐associated oncogenesis, CCND1 39, 40 and TGFA 39, 41, are representative of each group: CCND1 among those manifesting cell‐specific chromatin conformation (Fig 4C), and TGFA among those showing cell‐specific HIF binding despite similar patterns of interaction (Fig 4D). Further work aimed at defining the mechanisms underlying these differences will be of interest.
In summary, the present work has revealed complex patterns of preexisting chromatin interaction between multiple HIF‐binding enhancers and promoters that shape the transcriptional architecture of the HIF response. This supports a model in which binding of HIF to primed functional chromatin complexes may enable rapid activation of the HIF transcriptional program. Since HIF activates gene expression through release of promoter‐proximal paused RNA polymerase 2 4, 6, this may be a strategy specific to this type of gene regulation, rather than a general mechanism for enhancer‐induced transcriptional activation. It also distinguishes at least two other mechanisms that operate on top of these frameworks to define selective patterns of gene activation: cell‐type‐specific HIF binding to these preformed complexes and isoform‐specific activity of the bound HIF.
Materials and Methods
Cell culture
MCF‐7 and 786‐O cells were cultured in Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% fetal bovine serum, 2 mM L‐glutamine, 100 U/ml penicillin, and 100 μg/ml streptomycin. For hypoxia treatment, cells were incubated for 16 h in an In Vivo2 Hypoxia Work station (Ruskinn Technology) at 0.5% oxygen.
ChIP‐seq
Chromatin immunoprecipitation was performed as previously described 7 using antibodies against HIF‐1 (PM14), HIF‐2 (PM9) 42, 43, HIF‐1 (Novus NB‐100‐110), H3K4me1 (Millipore, #07‐436), H3K4me3 (#9751, Cell Signaling Technology), H3K27ac (#ab4729, Abcam), CTCF (#07‐729, Millipore), and RNApol2 (#sc‐899, Santa Cruz Biotechnology). Each ChIP was performed in duplicate. Libraries were prepared using PrepX Complete ILMN DNA library kit (#400076, Wafergen).
Capture‐C
Capture‐C libraries were prepared as previously described 16. Cells were fixed with 2% (vol/vol) formaldehyde for 10 min, quenched with 125 mM glycine in PBS, and then lysed in cold lysis buffer (10 mM Tris–HCl, pH 8, 10 mM NaCl, 0.2% IGEPAL, 1× complete protease inhibitor cocktail (Roche). Chromatin was digested with DpnII (New England Biolabs) at 37°C overnight. Fragments were then diluted and ligated with T4 DNA ligase (Thermo Scientific) at 16°C overnight. Cross‐linking was reversed by overnight incubation at 60°C with proteinase K (Bioline). The 3C libraries were then purified by phenol–chloroform and chloroform extraction followed by precipitation in ethanol at −80°C overnight. Digestion efficiency was determined by qPCR and gel electrophoresis. Sequencing libraries were prepared from 5 μg of each 3C library by sonication using a S220 focused ultrasonicator (Covaris) to a average size of 200 bp and indexed using NEBnext reagents (New England Biolabs) according to the manufacturer's protocol. Enrichment of 1–2 μg of an indexed library incubated with 13 pmol of a pool of biotinylated oligonucleotides (Integrated DNA Technologies or Sigma‐Aldrich; Appendix Table S1) was performed using the SeqCap EZ system (#06953212001, Roche/NimbleGen) following the manufacturer's instructions. Two rounds of capture employing 48–72 and 24 h hybridizations, respectively, were used. Capture enrichment was determined by qPCR. Correct library size was confirmed using a Bioanalyzer DNA 1000 or Tapestation D1000 kit (Agilent), and DNA concentrations were determined using a Qubit 2.0 Fluorometer (Thermo Fisher Scientific).
High‐throughput sequencing
All sequencing was performed on the HiSeq 2500 or HiSeq 4000 platforms using paired‐end 50‐ or 75‐bp protocols (Illumina).
Accession codes
Capture‐C data from this study are available from Gene Expression Omnibus at GSE78100 and ChIP‐seq are available at GSE28352 (HIF‐1α, HIF‐2α, and HIF‐1β ChIP‐seq in MCF‐7 cells), GSE32970 (ENCODE DNase‐seq in MCF‐7 cells), GSE67237 (HIF‐2a and HIF‐1b ChIP‐seq in 786‐O cells), GSM1011120 (FAIRE‐seq in 786‐O cells), and from EMBL‐EBI Array Express at E‐MTAB‐1994 and E‐MTAB‐1995 (RNApol2 and H3K4me3 ChIP‐seq and RNA‐seq in MCF‐7 cells) and GSE78113 (other histone modifications and CTCF ChIP‐seq).
ChIP‐seq analysis
ChIP‐seq data were processed as previously described 36; peaks were called using MACS 44 and TPIC 45. ChIP reads were counted under peaks and were normalized to the total library size and the width of the peak using RPKM 36.
Capture‐C analysis
Reads were processed, in silico digested, and aligned to the human genome using Bowtie 1.0.0 (http://bowtie-bio.sourceforge.net/index.shtml). Interaction frequencies were determined using CCanalyser2.pl (https://github.com/telenius/captureC/releases) as previously described 16 (Appendix Table S2). The presence of undigested self‐circles was examined using the method of Jin et al 26. Only small undigested self‐circles of < 2 kb were detected, likely reflecting the higher cutting frequency of DpnII. Beyond this distance, the ratios of outward/same‐strand and inward/same‐strand reads were both 0.5, consistent with efficient cutting and re‐ligation of DpnII fragments (Appendix Fig S5).
Identifying significant interaction regions
Normalization of observed counts
To identify regions of significant interaction with the capture probe, we followed an approach similar to that used previously 26.
Firstly, the observed interaction count Y x,j between a DpnII fragment x and a specific capture probe j was normalized to the total captured interactions for the capture probe Y j and to the size of the DpnII fragment W x.
where is the normalized count, and NF is a numerical normalization factor, 108, to adjust the counts to a similar mean before normalization.
Empirical estimation of random background interaction counts
The random interaction rate between the capture fragment and nearby chromatin is expected to decrease exponentially with distance. Accordingly, we used the normalized interaction counts across all experiments to empirically estimate the expected background distribution of interaction counts at a given distance.
The region surrounding each bait site, j was divided into 2‐kb bins and the normalized count for each 2‐kb bin estimated by summing the counts for each DpnII fragment, x within bin i.
where N is the number of DpnII fragments within each 2‐kb bin i.
The expected interaction frequency at bin i is then given by the interaction counts in bin i averaged across all the probes j for every experiment in that cell type (Appendix Fig S6A):
where N j is the number of bait sites.
However, background counts may vary from experiment to experiment, so the average interaction counts for a given cell type were then corrected using a scalar correction factor C e derived for each experiment, e:
using a method similar to that used in DESeq 46, where:
and where is the geometric mean of all the interaction counts across all the probes j for experiment e:
Similarly, a normalization factor, C j, for each capture probe was calculated to normalize the capture probe efficiency:
where is the geometric mean of the probe across all experiments for the same cell line, averaging over all the bins:
and is the geometric mean of all the interaction counts across all the experiments in the same cell line:
Log–log model of interaction counts
To model the interaction count at a given distance from the capture site, we then fitted a log–log power law for the relationship between the expected interaction count in bin i and its distance from the center 26 (Appendix Fig S6B) in a certain cell line. We then regressed against the distance and defined μ i as the randomly expected interaction count at distance d i as follows:
where e k − 1 is the expected signal closest to the capture site (i.e., at bin 1) or the intercept, and δ is the exponential decay parameter for the interaction signal. Using this model, δ was then calculated, for every cell line used, using the maximum likelihood estimate.
The distribution of interaction counts, , at bin i for any capture probe has been shown to follow a negative binomial distribution, which allows for fitting an extra parameter β i for the mean–variance relationship:
β i was previously observed, to be fixed 26; however, we found that this parameter is a linear function of the mean counts (Appendix Fig S6C) 47:
Accordingly, we used generalized linear modeling to empirically estimate the maximum likelihood parameters for α and k. This identified the expected distribution of counts at a given distance from the capture site as shown in Appendix Fig S6D.
Calculation of interaction significance
Having built a model of the null hypothesis distribution at every bin i, we then used the observed interaction counts to test its significance.
We first segmented the interaction space around the capture probe, into overlapping windows of width 2 kb at an equally spaced distance of 200 bp. We then calculated the interaction counts as shown before. Finally, we normalized the interaction counts for every probe to a similar mean, to avoid any false positives:
We then used the model built before to build the null hypothesis model at the observed distance of the window, to estimate the P‐value. P‐values were calculated using the cumulative distribution function (CDF) for the expected negative binomial distribution:
False discovery rate (FDR) correction
False discovery correction assumes an equal probability of significance across all the tests being conducted. However, within the same capture probe, this is a false assumption, since it is harder to call significance closer to the capture site, and at extreme distances and the distribution of P‐values at different bins will therefore differ. Accordingly, stratified FDR correction 48, correcting at every bin across all the probes being captured, was used. This is justified, since all the probes are captured in one experiment.
GC content and mappability
The effect of potential confounders, including GC content and mappability, was determined using previously described methods 26. The GC content and mappability of statistically significant cis‐interaction regions was comparable to that of control, non‐cis‐interacting, regulatory regions within 500 kb of bait sites (Appendix Fig S7).
Differential significance between conditions
To perform differential analysis, we constructed a union of the significantly interacting regions across the two conditions (i.e., normoxia and hypoxia in MCF‐7). We used any significant region that was called significant in any replicate. We then used an approach similar to that of DESeq 46.
Essentially, we used the likelihood ratio test (LRT) to examine the null hypothesis that the interaction frequencies at each site were similar across the two conditions. Specifically, a generalized linear model (GLM) with a negative binomial response was applied under the null hypothesis and compared to an unrestricted negative binomial GLM. The chi‐squared test with 1 degree of freedom was then used to find the significance of each interaction, and P‐values were adjusted for false discovery rate (FDR).
To test across different cell lines, we normalized the cell line mean interaction frequency. Accordingly, we used a similar normalization approach to the above: we calculated a correction factor for the interaction counts that would ensure that the median interaction counts across the cell lines being tested were similar:
where GY cl,j is the geometric mean across probe j in one cell line, and GY j is the geometric mean across probe j for all the cell lines. C cl is then the median of this normalization factor for the cell line cl. The interaction counts for each cell line are then obtained using:
Author contributions
JLP, RS, JRH, PJR, and DRM designed the experiments. JLP, JS, and HC performed the experiments. JLP, RS, JRH, JOJD, and DRM performed data analysis. JLP, PJR, and DRM wrote the manuscript.
Conflict of interest
PJR is a scientific co‐founder and holds equity in ReOx Ltd., a university spin‐out company that seeks to develop inhibitors of the HIF hydroxylases.
Supporting information
Acknowledgements
We thank Douglas Higgs for his help and advice with establishing the Capture‐C assay in our laboratory and Angie Green and the High‐Throughput Genomics Group at the Wellcome Trust Centre for Human Genetics (funded by the Wellcome Trust—090532/Z/09/Z) for the generation of the sequencing data. This work was supported by a Cancer Research UK (grant number A16016), the Ludwig Institute for Cancer Research, the Higher Education Funding Council for England, the Wellcome Trust (grant numbers 078333/Z/05/Z, WT091857MA) and the Deanship of Scientific Research (DSR), King Abdulaziz University, Ministry of Higher Education for Saudi Arabia.
EMBO Reports (2016) 17: 1410–1421
Contributor Information
Peter J Ratcliffe, Email: Peter.Ratcliffe@ndm.ox.ac.uk.
David R Mole, Email: drmole@well.ox.ac.uk.
References
- 1. Wenger RH, Stiehl DP, Camenisch G (2005) Integration of oxygen signaling at the consensus HRE. Sci STKE 2005: re12 [DOI] [PubMed] [Google Scholar]
- 2. Semenza GL (2012) Hypoxia‐inducible factors in physiology and medicine. Cell 148: 399–408 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Ortiz‐Barahona A, Villar D, Pescador N, Amigo J, del Peso L (2010) Genome‐wide identification of hypoxia‐inducible factor binding sites and target genes by a probabilistic model integrating transcription‐profiling data and in silico binding site prediction. Nucleic Acids Res 38: 2332–2345 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Choudhry H, Schodel J, Oikonomopoulos S, Camps C, Grampp S, Harris AL, Ratcliffe PJ, Ragoussis J, Mole DR (2014) Extensive regulation of the non‐coding transcriptome by hypoxia: role of HIF in releasing paused RNApol2. EMBO Rep 15: 70–76 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Camps C, Saini HK, Mole DR, Choudhry H, Reczko M, Guerra‐Assuncao JA, Tian YM, Buffa FM, Harris AL, Hatzigeorgiou AG et al (2014) Integrated analysis of microRNA and mRNA expression and association with HIF binding reveals the complexity of microRNA expression regulation under hypoxia. Mol Cancer 13: 13–28 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Galbraith MD, Allen MA, Bensard CL, Wang X, Schwinn MK, Qin B, Long HW, Daniels DL, Hahn WC, Dowell RD et al (2013) HIF1A employs CDK8‐mediator to stimulate RNAPII elongation in response to hypoxia. Cell 153: 1327–1339 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Schodel J, Oikonomopoulos S, Ragoussis J, Pugh CW, Ratcliffe PJ, Mole DR (2010) High‐resolution genome‐wide mapping of HIF‐binding sites by ChIP‐seq. Blood 117: e207–e217 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Schodel J, Bardella C, Sciesielski LK, Brown JM, Pugh CW, Buckle V, Tomlinson IP, Ratcliffe PJ, Mole DR (2012) Common genetic variants at the 11q13.3 renal cancer susceptibility locus influence binding of HIF to an enhancer of cyclin D1 expression. Nat Genet 44: 420–425 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Schorg A, Santambrogio S, Platt JL, Schodel J, Lindenmeyer MT, Cohen CD, Schrodter K, Mole DR, Wenger RH, Hoogewijs D (2015) Destruction of a distal hypoxia response element abolishes trans‐activation of the PAG1 gene mediated by HIF‐independent chromatin looping. Nucleic Acids Res 43: 5810–5823 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Melvin A, Rocha S (2012) Chromatin as an oxygen sensor and active player in the hypoxia response. Cell Signal 24: 35–43 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Perez‐Perri JI, Acevedo JM, Wappner P (2011) Epigenetics: new questions on the response to hypoxia. Int J Mol Sci 12: 4705–4721 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Tsai YP, Wu KJ (2014) Epigenetic regulation of hypoxia‐responsive gene expression: focusing on chromatin and DNA modifications. Int J Cancer 134: 249–256 [DOI] [PubMed] [Google Scholar]
- 13. Kirmes I, Szczurek A, Prakash K, Charapitsa I, Heiser C, Musheev M, Schock F, Fornalczyk K, Ma D, Birk U et al (2015) A transient ischemic environment induces reversible compaction of chromatin. Genome Biol 16: 246 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Chan MC, Holt‐Martyn JP, Schofield CJ, Ratcliffe PJ (2016) Pharmacological targeting of the HIF hydroxylases ‐ A new field in medicine development. Mol Aspects Med 47–48: 54–75 [DOI] [PubMed] [Google Scholar]
- 15. Koury MJ, Haase VH (2015) Anaemia in kidney disease: harnessing hypoxia responses for therapy. Nat Rev Nephrol 11: 394–410 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Davies JO, Telenius JM, McGowan SJ, Roberts NA, Taylor S, Higgs DR, Hughes JR et al (2016) Multiplexed analysis of chromosome conformation at vastly improved sensitivity. Nat Methods 13: 74–80 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Hughes JR, Roberts N, McGowan S, Hay D, Giannoulatou E, Lynch M, De Gobbi M, Taylor S, Gibbons R, Higgs DR (2014) Analysis of hundreds of cis‐regulatory landscapes at high resolution in a single, high‐throughput experiment. Nat Genet 46: 205–212 [DOI] [PubMed] [Google Scholar]
- 18. Maxwell PH, Wiesener MS, Chang GW, Clifford SC, Vaux EC, Cockman ME, Wykoff CC, Pugh CW, Maher ER, Ratcliffe PJ (1999) The tumour suppressor protein VHL targets hypoxia‐inducible factors for oxygen‐dependent proteolysis. Nature 399: 271–275 [DOI] [PubMed] [Google Scholar]
- 19. Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis CA, Doyle F, Epstein CB, Frietze S, Harrow J, Kaul R et al (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489: 57–74 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K (2007) High‐resolution profiling of histone methylations in the human genome. Cell 129: 823–837 [DOI] [PubMed] [Google Scholar]
- 21. Creyghton MP, Cheng AW, Welstead GG, Kooistra T, Carey BW, Steine EJ, Hanna J, Lodato MA, Frampton GM, Sharp PA et al (2010) Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc Natl Acad Sci USA 107: 21931–21936 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Dekker J, Rippe K, Dekker M, Kleckner N (2002) Capturing chromosome conformation. Science 295: 1306–1311 [DOI] [PubMed] [Google Scholar]
- 23. Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B (2012) Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485: 376–380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Subramanian A, Kuehn H, Gould J, Tamayo P, Mesirov JP (2007) GSEA‐P: a desktop application for Gene Set Enrichment Analysis. Bioinformatics 23: 3251–3253 [DOI] [PubMed] [Google Scholar]
- 25. Xiao Y, Hsiao TH, Suresh U, Chen HI, Wu X, Wolf SE, Chen Y (2014) A novel significance score for gene selection and ranking. Bioinformatics 30: 801–807 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Jin F, Li Y, Dixon JR, Selvaraj S, Ye Z, Lee AY, Yen CA, Schmitt AD, Espinoza CA, Ren B (2013) A high‐resolution map of the three‐dimensional chromatin interactome in human cells. Nature 503: 290–294 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Hakim O, John S, Ling JQ, Biddie SC, Hoffman AR, Hager GL (2009) Glucocorticoid receptor activation of the Ciz1‐Lcn2 locus by long range interactions. J Biol Chem 284: 6048–6052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Eijkelenboom A, Mokry M, de Wit E, Smits LM, Polderman PE, van Triest MH, van Boxtel R, Schulze A, de Laat W, Cuppen E et al (2013) Genome‐wide analysis of FOXO3 mediated transcription regulation through RNA polymerase II profiling. Mol Syst Biol 9: 638 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Melo CA, Drost J, Wijchers PJ, van de Werken H, de Wit E, Oude Vrielink JA, Elkon R, Melo SA, Leveille N, Kalluri R et al (2013) eRNAs are required for p53‐dependent enhancer activity and gene transcription. Mol Cell 49: 524–535 [DOI] [PubMed] [Google Scholar]
- 30. Pan YF, Wansa KD, Liu MH, Zhao B, Hong SZ, Tan PY, Lim KS, Bourque G, Liu ET, Cheung E (2008) Regulation of estrogen receptor‐mediated long range transcription via evolutionarily conserved distal response elements. J Biol Chem 283: 32977–32988 [DOI] [PubMed] [Google Scholar]
- 31. Fullwood MJ, Liu MH, Pan YF, Liu J, Xu H, Mohamed YB, Orlov YL, Velkov S, Ho A, Mei PH et al (2009) An oestrogen‐receptor‐alpha‐bound human chromatin interactome. Nature 462: 58–64 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Jing H, Vakoc CR, Ying L, Mandat S, Wang H, Zheng X, Blobel GA (2008) Exchange of GATA factors mediates transitions in looped chromatin organization at a developmentally regulated gene locus. Mol Cell 29: 232–242 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Papantonis A, Kohro T, Baboo S, Larkin JD, Deng B, Short P, Tsutsumi S, Taylor S, Kanki Y, Kobayashi M et al (2012) TNFalpha signals through specialized factories where responsive coding and miRNA genes are transcribed. EMBO J 31: 4404–4414 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Splinter E, Heath H, Kooren J, Palstra RJ, Klous P, Grosveld F, Galjart N, de Laat W (2006) CTCF mediates long‐range chromatin looping and local histone modification in the beta‐globin locus. Genes Dev 20: 2349–2354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Leisz S, Schulz K, Erb S, Oefner P, Dettmer K, Mougiakakos D, Wang E, Marincola FM, Stehle F, Seliger B (2015) Distinct von Hippel‐Lindau gene and hypoxia‐regulated alterations in gene and protein expression patterns of renal cell carcinoma and their effects on metabolism. Oncotarget 6: 11395–11406 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Salama R, Masson N, Simpson P, Sciesielski LK, Sun M, Tian YM, Ratcliffe PJ, Mole DR (2015) Heterogeneous effects of direct hypoxia pathway activation in kidney cancer. PLoS One 10: e0134645 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Schodel J, Mole DR, Ratcliffe PJ (2013) Pan‐genomic binding of hypoxia‐inducible transcription factors. Biol Chem 394: 507–517 [DOI] [PubMed] [Google Scholar]
- 38. Kaelin WG (2007) Von Hippel‐Lindau disease. Annu Rev Pathol 2: 145–173 [DOI] [PubMed] [Google Scholar]
- 39. Raval RR, Lau KW, Tran MG, Sowter HM, Mandriota SJ, Li JL, Pugh CW, Maxwell PH, Harris AL, Ratcliffe PJ (2005) Contrasting properties of hypoxia‐inducible factor 1 (HIF‐1) and HIF‐2 in von hippel‐lindau‐associated renal cell carcinoma. Mol Cell Biol 25: 5675–5686 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Bindra RS, Vasselli JR, Stearman R, Linehan WM, Klausner RD (2002) VHL‐mediated hypoxia regulation of cyclin D1 in renal carcinoma cells. Cancer Res 62: 3014–3019 [PubMed] [Google Scholar]
- 41. de Paulsen N, Brychzy A, Fournier MC, Klausner RD, Gnarra JR, Pause A, Lee S (2001) Role of transforming growth factor‐alpha in von Hippel‐Lindau (VHL)(‐/‐) clear cell renal carcinoma cell proliferation: a possible mechanism coupling VHL tumor suppressor inactivation and tumorigenesis. Proc Natl Acad Sci USA 98: 1387–1392 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Lau KW, Tian YM, Raval RR, Ratcliffe PJ, Pugh CW (2007) Target gene selectivity of hypoxia‐inducible factor‐alpha in renal cancer cells is conveyed by post‐DNA‐binding mechanisms. Br J Cancer 96: 1284–1292 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Wiesener MS, Jurgensen JS, Rosenberger C, Scholze CK, Horstrup JH, Warnecke C, Mandriota S, Bechmann I, Frei UA, Pugh CW et al (2003) Widespread hypoxia‐inducible expression of HIF‐2alpha in distinct cell populations of different organs. FASEB J 17: 271–273 [DOI] [PubMed] [Google Scholar]
- 44. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W et al (2008) Model‐based analysis of ChIP‐Seq (MACS). Genome Biol 9: R137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Hower V, Evans SN, Pachter L (2011) Shape‐based peak identification for ChIP‐Seq. BMC Bioinformatics 12: 15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11: R106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Ver Hoef JM, Boveng PL (2007) Quasi‐Poisson vs. negative binomial regression: how should we model overdispersed count data? Ecology 88: 2766–2772 [DOI] [PubMed] [Google Scholar]
- 48. Sun L, Craiu RV, Paterson AD, Bull SB (2006) Stratified false discovery control for large‐scale hypothesis testing with application to genome‐wide association studies. Genet Epidemiol 30: 519–530 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.