Skip to main content
. 2018 May 14;7:e34077. doi: 10.7554/eLife.34077

Figure 1. Features of TAD boundaries and TAD insulator function.

Profiles in A-E represent a normalized aggregate count of peaks or features along the length of all TADs, sub-divided into 100 equally-sized bins per TAD, where bin #1 is the 5’ start of the TAD and bin #100 is at the TAD 3’ end. Normalization was performed to allow comparison of multiple groups with variable peak numbers in a single figure. The y-axis displays the enrichment within a given bin versus the average of the five center bins (bins #48–52). In A-C, the number of binding sites in each group is shown in parenthesis. (A) Cohesin-and-CTCF (CAC) sites are enriched at TAD boundaries, while cohesin-non-CTCF (CNC) sites are weakly depleted. As the cohesin (Coh) complex is a multi-protein complex, the darker color within each group represents a stricter overlap between cohesin subunits (Rad21, Stag1 and Stag2). (B) In both mouse liver and MEFs, cohesin binding sites that are resistant to knockdown (KD) or knockout (KO) of cohesin component subunits (~40% of cohesin binding sites for liver) are strongly enriched for TAD boundaries. Cohesin sites that are sensitive to loss following KD or KO (~60% of sites for liver) are not enriched at TAD boundaries. (C) CTCF binding sites in liver that are deeply-shared across other ENCODE tissues (≥12 out of 15 other tissues examined) are strongly enriched at liver TAD boundaries, while those that are either unique to liver or shared in only one other tissue are not enriched at TAD boundaries. (D) TAD boundaries show greater hypomethylation than the TAD interior. The most hypomethylated CpGs are enriched at TAD boundaries, which likely represents a combination of hypomethylation at gene promoters and hypomethylation at CTCF binding sites. CpG methylation states, determined by liver whole genome bisulfite sequence analysis were subdivided into 10 bins based on the degree of methylation (0–10% methylated, 10–20%, etc.) prior to TAD distribution analysis. (E) 10 liver-expressed TFs are not enriched at TAD boundaries. These profiles are representative of the vast majority of the >50 publically available ChIP peak lists for liver-expressed TFs. Notable exceptions, related to promoter-associated features, marks, and transcription factors, are shown in Figure 1—figure supplement 1B,D. (F) Shown is a heat map of the distribution of the indicated activating and repressive marks and other features determined for male mouse liver across a 1 Mb window around each TAD boundary. TAD clusters, numbered at the left, were defined using k means clustering (k = 4). The boundaries between TADs transition from active to inactive chromatin compartments (or vice versa) for TAD clusters 2 and 3. In downstream analyses based on these results, a TAD was considered active if the boundary at the start of a TAD fell into clusters 1 or 2 and the boundary at the end of the same TAD fell into clusters 1 or 3; inactive TADs are those whose boundaries begin in clusters 3 or 4 and end in clusters 2 or 4 (see Materials and methods). See Supplementary file 1A for a full listing of the 3538 autosomal TADs analyzed and their active/inactive status. (G) UCSC browser screenshot for a transitional TAD boundary on chromosome 13 from TAD cluster 3 in Figure 1F. Arrows at bottom indicate CTCF motif orientation. (H) Box plots showing liver gene expression (RNA-seq) for 12,258 genes in 1930 active TADs and 4643 genes in 1000 inactive TADs (Supplementary file 1A). 939 genes in 473 of the inactive TADs are expressed at >1 FPKM (Supplementary file 1E). Genes found in active compartment TADs are more highly expressed, with the majority of genes showing >1 FPKM, than genes found in inactive TAD compartments. Genes in weakly active and weakly inactive TADs were excluded from these analyses. (I) Genes whose TSS are located in inactive TADs (‘B compartments’) are more tissue specific in their expression pattern than genes found in active TADs (‘A compartments’). The top GO category for expressed genes in the A compartment is RNA binding, while the top category for expressed genes in the B compartment is monooxygenase activity (not shown).

Figure 1.

Figure 1—figure supplement 1. Additional Features of TAD boundaries.

Figure 1—figure supplement 1.

(A) CTCF binding orientations are divergent at TAD boundaries. The top plot indicates bin enrichment relative to the TAD center, as in Figure 1A. The X-axis begins at the midpoint of one TAD, crosses the TAD boundary, and then extends to the midpoint of the subsequent TAD. The lower plot shows the in peak ChIP strength relative to the TAD boundary across a 1 Mb window. Blue shows the signal for CTCF ChIP-seq reads that overlap a (+) CTCF peak, while orange shows the CTCF ChIP-seq reads that overlap a (-) CTCF peak. Reads that either do not overlap a peak or fall in a peak with no CTCF motif are excluded. (B–F) Shown are normalized aggregate count for the indicated peaks or features along the length of all TADs, sub-divided into 100 equally-sized bins per TAD, where bin #1 is at the 5’ end of the TAD and bin #100 is at the TAD 3’ end. Normalization was performed to allow comparison of multiple groups with variable peak numbers and different TAD lengths in a single figure. The y-axis displays the enrichment within a given bin versus the average of the five center bins (bins #48–52). (B) Two TFs, TP and E2F4, show enrichment for TAD boundaries, likely due to their strong bias for binding at gene promoters. (C) The TSS’s of protein-coding genes are enriched at TAD boundaries, while liver-expressed lncRNA TSS show little or no TAD boundary enrichment. (D) Three promoter-associated histone marks, but not the enhancer mark H3K4me1, are enriched at TAD boundaries. This provides further evidence that TAD boundary regions are actively transcribed. (E) Promoters and weak promoters that lack CAC binding are enriched at TAD boundaries. Similarly, CAC that do not overlap promoters or weak promoters are enriched at TAD boundaries. Thus, we expect that promoter and CAC enrichment at TAD boundaries are distinct phenomena and not simply an artifact of CAC binding at some promoters. Promoters and weak promoters were defined by the presence of a DHS and the ratio of H3K4me3 to H3K4me1 signal. (see Materials and methods).
Figure 1—figure supplement 2. Additional Features of TAD boundaries, continued.

Figure 1—figure supplement 2.

(A) Intra-CTCF-motif CpGs are highly hypomethylated compared to the genome-wide average and to neighboring CpGs within 10 kb (‘CpG <10 kb Away’). This is most pronounced for TAD and intra-TAD anchors. The strength of CTCF binding is more strongly associated with methylation level than with motif score (data not shown). (B) Genes immediately adjacent to TAD boundaries are more tissue ubiquitous in expression (i.e., housekeeping genes). Each bar represents the indicated percentile bin into the adjacent TADs, and by three bins into the TAD from each end (or 6% of the total TAD length), the genes are not significantly (ns) more tissue ubiquitous in expression than the overall genome-wide level (horizontal dashed line). Bins were defined as above, with each TAD divided into 100 bins of equal length per TAD.