Summary
While population-level analyses revealed significant roles for CTCF and cohesin in mammalian genome organization, their contributions at the single-cell level remain incompletely understood. Here, we used a super-resolution microscopy approach to measure the effects of removal of CTCF or cohesin in mouse embryonic stem cells. Single-chromosome traces revealed cohesion-dependent loops, frequently stacked at their loop anchors forming multi-way contacts (hubs), bridging across TAD boundaries. Despite these bridging interactions, chromatin in intervening TADs was not intermixed, remaining separated in distinct loops around the hub. At the multi-TAD scale, steric effects from loop stacking insulated local chromatin from ultra-long range (>4 Mb) contacts. Upon cohesin removal, the chromosomes were more disordered and increased cell-cell variability in gene expression. Our data revise the TAD-centric understanding of CTCF and cohesin, and provide a multi-scale, structural picture of how they organize the genome on the single-cell level through distinct contributions to loop stacking.
Keywords: 3D Genome organization, CTCF, cohesin, chromosome tracing, multiplexed imaging, single-cell gene expression, single-chromosome chromatin folding, loop extrusion
Blurb:
Hafner et al. use microscopy to trace folding of individual chromosomes. They observe and quantify chromatin loops and find that loop stacking is important for 3D organization across genomic scales. CTCF is important for preferential clustering of TAD-boundary sites. Cohesin is essential for both TAD and chromosome-scale folding.
Introduction
Cohesin and CTCF are essential for the establishment and maintenance of proper genome folding and have been most recognized for folding the genome into topologically associating domains (TADs). Indeed, cohesin and CTCF binding is enriched at TAD boundaries and acute depletion of these proteins has been shown by Hi-C and imaging to lead to loss of TADs 1–5. Polymer modeling work has proposed that these two proteins work together in forming TADs through distinct roles; cohesin extrudes loops of chromatin but stalls at sites bound by CTCF 6,7, and considerable evidence supports these molecular roles 8. However, a detailed view of the 3D structures of single chromosomes and how they are perturbed upon removal of cohesin or CTCF, with resolution from TADs to whole chromosomes, is lacking. Such a high-resolution view may tease apart regimes in which cohesin and CTCF operate that are not distinguished by pairwise ensemble averages, and thus help us better understand how they affect cis-regulation of transcription. Microscopy approaches are well suited for investigating 3D structure and heterogeneity at the single-cell level.
Recent imaging approaches to study the effects of cohesin and CTCF on chromosome organization in single cells have not provided a complete or fully concordant picture. Our earlier work found single cells exhibited ‘TAD-like’ domains, which were preserved in the absence of cohesin, though the position of their borders became more randomly distributed, leading to the loss of true (population-level) TADs 3. Two more recent microscopy studies that labeled TADs with and without cohesin also observed globular chromatin volumes in both conditions, but reported conflicting observations on the effect of cohesin loss 9,10. Upon degrading cohesin, Luppino and colleagues found reduced overlap between adjacently labeled TADs, while the volume of the TADs remained the same. By contrast, the second imaging study by Szabo and colleagues found that loss of cohesin did not significantly change the overlap between adjacently labeled TADs but increased the volume of each TAD. To our knowledge to date, there are also no studies investigating how the changes at the TAD scale affect single-chromosome folding at larger genomic scales. There is also limited data on how these factors affect multiway contacts in a sequence dependent manner 3,11.
To better understand the role of cohesin and CTCF on genome structure, using a multi-scale chromosome tracing approach, similar to recent reports 12–14, we imaged chromatin in mouse embryonic stem cells, from the scale of loop-dots and TADs to the scale of chromosomes and compartments. To accelerate this investigation, we developed a method to multiplex this imaging and so combine different genetically edited cell lines and different treatment conditions into a single experiment, enabling direct measurement of 3D structure in nanometers without risk of batch effects. Our single-chromosome data provide a detailed view of how cohesin and CTCF shape the 3D structure of chromatin; enhancing local contact by forming heterogeneous loops at sub-megabase scales, preventing distal contact by loop clash at megabase scales, and facilitating TAD-border bypass by forming hubs of loop anchors.
Results
Rapid optical profiling of chromatin structure
We used auxin inducible protein degradation to acutely deplete either CTCF or cohesin 1,10 (Figure S1A) and used Optical Reconstruction of Chromatin Architecture (ORCA) 15, to measure the effects on chromatin folding. We initially focused on two regions on chromosomes 3 and 6, each ~2.5 Mb, which in mouse embryonic stem cells exhibit multiple TADs, sub-TADs, and loop-dots, along with highly studied genes (the Hoxa cluster and the Sox2 gene) (Figure 1A). These regions were divided into 73 or 93 target-segments for ORCA labeling (Tables S1 and S2), balancing throughput and genomic coverage in the assay and on par with the effective resolution of recent Hi-C data from these cells for resolving sub-TADs and loop-dots 1,4 (Figure S1B).
To enable fast imaging of multiple treatment conditions as well as to control for potential batch effects intrinsic to high-content, high-resolution assays, we developed a multiplexed ORCA approach (Figures 1B). We adapted a recently published method from single-cell sequencing16 to attach distinct oligonucleotide-barcodes to the cell surface, after which cells can be combined, hybridized with ORCA probes, and imaged within the same experiment, where cell surface barcodes are read out by sequential hybridization, similar to ORCA barcodes (Figure 1B). This reduced imaging time from 5 weeks to 1 week to cover 5 conditions: CTCF-AID and RAD21-AID endogenously tagged cells, either not treated (NT) or treated with auxin for 4 hrs (+Aux) to degrade CTCF or the RAD21 sub-unit of cohesin (Figure S1A), and the parental E14 mESC line from which these were made. We quantified for each condition the frequency that any step of the traced sequence came in contact (i.e. looped) with any other step along the trace. We used the average distance between adjacent steps as the cut-off distance for calling a contact/loop, since adjacent steps are tiled without gaps (Figure 1A). This selection takes advantage of the internal control provided by adjacent probes and avoids the need to choose an arbitrary cutoff distance. We plotted the resulting pairwise contact frequencies (loop frequencies) as a heatmap (Figure 1C). Note the units in these maps are the percentage (0–100%) of traces in the population exhibiting the given loop, not a normalized or matrix-balanced read count. Replicate experiments showed high reproducibility (Pearson’s r =0.92 – 0.97 over 24 comparisons - Figure S1C) and were merged for subsequent plots and analyses in order to increase sample size. The absolute loop frequency within each of the demultiplexed treatment conditions showed good agreement with Hi-C data from earlier studies degrading CTCF 1 or cohesin 4, both in qualitative features (Figure 1C and 1D) and Pearson’s correlation (Pearson’s r = 0.8–0.93 - Figure S1D).
We next asked how these distinctive population average loop frequencies (Figure 1C and 1D) manifest at the single cell level. We began with an unbiased clustering approach, using t-distributed stochastic neighbor embedding (t-SNE) analysis to cluster the single-chromosome pairwise distance matrices. This produced a well-connected single cloud (Figure 1E). Despite clear differences among the induced degradation conditions and untreated cells at the population average, individual traces from these populations stretched across the cloud, overlapping one another substantially (Figure 1E). Traces from cohesin-depleted cells showed the most asymmetric pattern (Figure 1E). These observations parallel our recently reported finding that the SOX9 locus shows distinct patterns of TADs, sub-TADs and stripes in embryonic stem cells compared to differentiated cranial neural crest, and yet produces overlapping clouds in dimensionality reduction 17.
To test how these results compare to current predictions of single-cell variability, we applied t-SNE to polymers produced from Langevin dynamic simulations of loop extrusion 6,18 using the open2c polychrom toolbox 19 derived from earlier works 6,18 (STAR Methods). Considerable evidence supports the model that cohesin shapes genome structure, including TADs, through loop extrusion (for review see 8,20). In this model, cohesin is loaded onto DNA and extrudes a loop until it is stalled by CTCF. As expected 6, simulations including cohesin and CTCF formed TADs between CTCF boundaries, and loop-dots between convergent CTCF sites, while simulations without CTCF or cohesin did not (Figure 1F). t-SNE analysis of simulated polymers spanning a few TADs, much like our data, revealed overlapping clouds for polymers with or without CTCF or cohesin (Figure 1F).
Thus, both our multiplexed ORCA data and simulations support the idea that TADs are population-level features that arise from a continuous and dynamic process and do not correspond to physical and static structures in single cells. We conclude that dimensionality reduction approaches such as t-SNE, popularized in their ability to distinguish discrete cell-populations marked by unique gene combinations, may not be ideal tools to highlight differences in continuum processes such as the folding of individual chromatin polymers.
CTCF and cohesin affect loop density and 3D separation
To complement unweighted dimensionality reduction of traces, we next focused on features of particular interest, starting with loops in single chromatin traces. To facilitate interpretation of 3D traces projected into a 2D page, we assigned distinct colors to each looped region, while non-looped regions were colored in gray (Figure 2A). Traces from cells without CTCF (Figure 2A, CTCF + Aux), were indistinguishable by eye in terms of looping from NT traces. By contrast, traces from cells without cohesin (RAD21 +Aux) showed fewer loops (e.g. Figure 2B).
To quantify these effects, we computed the difference in loop frequencies between +Aux and NT cells (Figure 2C and 2D for CTCF and RAD21 respectively). When such subtractions of contact maps are performed with Hi-C data, the separate normalizations applied to each condition (to reduce sequencing artifacts) may bias the magnitude and sign of the subtraction map. This is not a concern for our loop maps, which do not require normalization such as matrix balancing.
For CTCF depletion, subtraction maps showed gains in some regions, largely balanced by losses elsewhere (Figure 2C and S2A), consistent with the visual impression from single traces of similar loop numbers (Figure 2A). Loop gains were significantly enriched at between-TAD regions (Figure 2E, Binomial statistic p=4.5e-11 and p=1.12e-7 for chr 6 and chr 3 respectively), consistent with previous observations from Hi-C and microscopy 1,9,10. In contrast to prior work however, which concluded CTCF has little role within-TAD cohesion compared to TAD separation 1,10, we saw marked and significant loss of loops from within-TAD regions (Figure 2C and 2F, p=2.34e-7 and p=7.85e-7 Wilcoxon signed rank sum, for chr 6 and chr 3 respectively). Subtraction of 3D distance, showed that loss of loops within TADs led to slight decompaction within TADs (Figure 2G and 2H). Notably, loop loss within TADs was observed in published Hi-C data, but it was suggested at the time to be a normalization artifact 1. This justified interpretation rested on both consideration for the issues in Hi-C matrix balancing and subtraction, mentioned above, and an inability to detect the effect by concurrent analysis with FISH 1. Understanding this loss of loops within TADs is thus an interesting observation to try to explain with future models.
Contrasting the mixed gain and loss of loops on CTCF removal, cohesin removal reduced loops generally (Figure 2D, S2B). Subtraction of 3D distance showed more substantial and significant (p<0.05) global expansion than CTCF both within and between TADs (Figure 2I, S2C, S2D and S2E), consistent with visual impression from single-chromosome traces (Figure 2B). Counterintuitively, despite the fact that cohesin depleted cells lack TAD boundaries (Figure 1), the region between these neighboring TADs was bridged by fewer cross-TAD loops (Figure 2D) and physically more distal (Figure 2I). Our data also resolves the conflicting super-resolution data described above: the loss of loops between TADs matches the decrease in overlap fraction reported by 9 and the expansion in volume is consistent with measurements by 10.
To test whether the expansion upon depletion of cohesin was dependent in part or in full on the loss of sister chromatid cohesion, we performed immunofluorescence labeling of Geminin and ORCA in the same cells (Figure S2F). Geminin is a cell-cycle-regulated protein that is not detected in G1 and progressively accumulates through the S and G2 phases of the cell cycle. We selected cells with low Geminin levels (lowest 25% of the cells) and found that these also showed expansion upon loss of cohesin, both within and between TADs (Figure S2G). Further, we found no correlation between Geminin levels and the median pairwise separation among all imaged domains on a single-cell basis (Figure S2H) and there was not an obvious separation of G1 or G2 cells in the t-SNE analysis (Figure S2I). From this we conclude that the expansion of chromosomes in cells without cohesin is primarily driven by changes in cis-structure, and not loss of sister cohesion.
Multi-way contacts between loop anchors depend on CTCF and cohesin
Having examined changes in loop frequency and 3D distance in a pairwise fashion, we next turned to analysis of multi-way contacts. Visual inspection of traces revealed loops often shared common anchor points (Figure 3A). We noticed a tendency to find CTCF boundary sites (CBSs) clustered at these loop bases (Figure 3A). To quantify this we computed the frequency of three-way contacts among CBSs across all traces. We compared these to distance-matched, non-CBS contacts using a sliding window across the genomic region. We found three-way contacts among CBSs were more common than expected based on the measured pairwise contact frequencies (assuming pairs are independent) (Figure 3B, gray bars), and higher than for non-CBS triplets (Figure 3B, black bars). Depletion of CTCF significantly reduced the frequency of three-way contacts among both CBSs and CBS proximal sites, though made little change for CBS-distal sites (Figure 3B, blue bars and Figure 3C). Degradation of cohesin clearly reduced the preference for CBSs and CBS-proximal regions to form three-way contacts (Figure 3B, red bars, and Figure 3D).
In addition to clustering, we found chromatin traces tended to organize in a radial pattern, in which the CBS hub was positioned in the geometric center of the trace (Figure 3E). Quantifying the traces which contained CBS hubs (17–20% of traces) of at least 3 sites (out of 5 CBSs in chr 6 or chr 3), CBSs were clearly displaced towards the center relative to flanking regions (Figure 3F). Depletion of CTCF or cohesin significantly reduced the tendency for central positioning of these sites (Figure 3G, p<0.01 Wilcoxon rank sum). This suggests that cohesin may drive the centering of CTCF, possibly through the formation of loops which sterically distribute around the hub forming a rosette structure.
Furthermore we found this rosette organization supports TAD boundary function concurrent with cross-TAD hub formation. In individual traces, 1 Mb distal CBSs exhibited 3D proximity even while maintaining separation between intervening elements (Figure 3H). At the population level, TAD boundaries in hub-containing traces remained as clearly visible as in the bulk population, despite the cross-TAD interaction needed for hub formation (Figure 3H, S3A and S3B).
In the dimensionality reduction of traces using t-SNE, hub-containing traces were distributed across the map (Figure S3C), reinforcing our earlier observation on the limitations of that approach for identifying patterns in chromatin trace data.
To understand how cohesin loop extrusion may influence the preferential three-way interaction among CBSs and their positioning in the center of the domain, we returned to simulations. With sufficient density/lifetime of extruders, we observed collisions among extruders, leading to a stacking of cohesin-mediated loops (Figure 3I left). Steric effects among the loop arms encouraged the loop bases to collect in the geometric center (Figure 3I, right). As extrusion blockers (CTCF) spend more time at loop bases than other sequences, these sites are more commonly found in the resulting multi-way hubs (Figure 3J) and also centered (Figure 3K). Thus, loop extrusion can explain CBS clustering which is CTCF and cohesin dependent, geometrically centered, and not dependent on assuming adhesive interactions among CTCFs.
For the chromosome 6 region, we observed that the two strongest borders were closely linked not only to CTCF sites but also to Polycomb (Pc) domains. One of the domains spans the ~100 kb, CTCF rich, Pc-bound Hoxa gene cluster. We note that this Pc-coated domain was the only non-significantly changed region, remaining contracted upon cohesin depletion (Figure 2I and S2E, arrow). The second Pc domain marks the CTCF bordered, Hox-regulatory region, ~1 Mb distal from the Hox cluster 21–23. The two Pc regions loop together in NT cells at higher frequency than flanking regions (Figure S3D, arrow). Upon cohesin depletion, this local enhancement in looping frequency between the Pc-bound regions is still detected in the ORCA data, though at reduced strength compared to NT cells (25% of cells relative to 37% in RAD21 NT cells), suggesting it is a Pc-dependent interaction. To further test this, we used dTag13 inducible protein degradation to deplete endogenous proteins EED and Ring1b, core subunits of the two major Pc Repression Complexes 24,25 (Figure S3E). Similar to cohesin depletion, Pc depletion resulted in a reduction of the loop frequency (Figure S3F and S3G), and of the three-way contact frequency with internal CTCF marked border sites (Figure S3H) and the reduced central positioning of those sites (Figure S3I). In contrast to cohesin depletion (Figure 2I), 3D distance subtraction showed Pc loss had the strongest effect for regions outside the Pc loop (Figure S3J), highlighting a difference in the mechanism by which they affect loop frequency. Together, these observations suggest that, for this domain, Pc-Pc interactions cooperate with cohesin to facilitate long-range loop-dots and multi-way hubs, rather than opposing them.
Cohesin loss has distinct effects across genomic scales
We next asked whether disrupting the loop frequency by depleting cohesin at the TAD scale had an effect on chromatin structure at larger genomic scales. We designed probes spanning both a ~30 Mb region and the whole chromosome (~150 Mb) for both chromosomes (Figure 4A).
At these larger spatial scales, we observed that consecutive probes could be found farther apart following cohesin depletion (Figure 4B and S4A, red and blue highlighted steps), consistent with our observations at the ~2.5 Mb scale (Figure 3H). However, this increased separation did not propagate to larger length scales. More distal regions we examined often appeared closer after cohesin depletion, such as the 20 Mb-separated loci shown in Figure 4B (red & cyan) or the 100 Mb-separated loci in S4A (red & cyan). We quantified this by plotting the change in median pairwise distance (Figure S4B) among all points at both scales (Figure 4C). While pairwise distances less than ~4 Mb apart increased significantly (Figure 4C and S4C), beyond the ~4 Mb separation we found a gradual transition to decreased distance, becoming more pronounced and statistically significant as a function of linear separation (Figure 4C, blue to white to red, and S4B).
To understand the mechanisms behind these multi-scale effects of cohesin removal, we performed polymer simulations. We asked which, if any, parameter choices within the extrusion modeling framework could recapitulate our results. Adding cohesin-extruded loops to simulated chromosomes generally contracted the chromosome across all scales (Figure 4D), consistent with prior conclusions modeling mitotic and interphase chromatin 8,18, though contrasting our observations above. We next hypothesized that nuclear crowding may induce the chromatin polymer to fold back on itself more without cohesin compaction. Simulated chromosomes confined in a hard sphere, such that the polymer accounts for fully 20% of the internal volume, showed expansion at short length scales, but little systematic change in 3D distances at larger length scales (Figure 4E). Importantly, these simulations allowed for only a limited degree of steric interactions among loops, a feature intended to capture the ability of abundant topoisomerases to facilitate strand crossing events 19. In our third simulations, we reduced the probability of such crossing, allowing more steric clash between cohesin induced loops. In these simulations, we finally reproduce the switch from expansion to contraction, with a crossover point on the scale of tens of cohesin loops (~4 Mb) (Figure 4F and G). This finding parallels recent polymer theory showing that chromatin loops “experience topological repulsion” 26.
Our ORCA measurements and simulations both show that cohesin loops are not only important to give rise to TADs but are also required for maintenance of a stiffer, more ordered chromosome structure at the whole chromosome scale (Figure 4C).
Previous work using Hi-C found that depletion of cohesin led to a slight strengthening of compartments 2,4,27. Compartments refer to a plaid pattern commonly seen at the chromosome scale in Hi-C contact maps, indicating a preferential separation of the genome into two groups, commonly denoted A and B 28. The separation is readily visualized through the Pearson map, where each element of the map reports the cross-correlation of the corresponding row and column after distance normalization 28. Examining the Pearson maps for the 30-Mb and full-chromosome ORCA data, we observed a similar plaid pattern to Hi-C from mESCs, which also strengthened upon cohesin depletion (Figures 5A, S5A, S5B, S5C and S5D).
To determine to what extent this compartment strengthening is reflected on a cell-to-cell basis, we colored each chromosome trace by its A/B compartment designation (Figure 5B, Table S3). Despite the clear plaid pattern in the Pearson maps (Figure 5A), individual cells showed little evident demixing of A/B chromatin. To quantify this effect, we used density-based spatial clustering (DBSCAN) to identify A-A and B-B clusters within each trace (Figure 5C). As a metric of A/B demixing we quantified the number of clusters and the size of the largest cluster (Figures 5D and 5E) in each trace. We found a median of 3 to 4 clusters for both A and B chromatin (for both chr 3 and 6), indicating these chromosomes are not fully demixed into A/B types. Upon cohesin depletion, the distribution of the number of clusters shifted to the left (fewer clusters) and the distribution of cluster size shifted to the right (bigger clusters) indicating more demixing (Figure 5D and 5E). Polymer simulations incorporating an alternating pattern of weak sticky interactions, along with the afore-mentioned loop extrusion, confinement, and clash, were able to reproduce a strong plaid pattern at the population level while lacking clear demixing at the single trace level (Figure 5F and 5G). Removal of loop extrusion from the simulations resulted in mild strengthening of the plaid pattern (as shown previously 18) and mild increase in demixing at the single trace level (Figure 5F, 5G and 5H).
The lack of clear compartmentalization in embryonic stem cells on a cell-by-cell basis is consistent with previous observations from electron microscopy 29 and high-throughput FISH 30, and contrasts the spatial separation of compartments observed by chromatin tracing in more differentiated fibroblasts 31.
Thus, at the single trace level, we see that clear compartment signal (Pearson map) does not necessarily correspond with visually apparent demixing in single chromosomes. At the population level, the increase in compartments strength upon cohesin removal appears to be a consequence of the decreased distance between distal (>4 Mb) sites. By promoting a small number of local interactions (<4Mb contacts) while inhibiting a large selection of potential distal interactions (>4MB contacts) cohesin tends to oppose compartmentalization primarily because most “checks” in the plaid require the formation of distal contacts (>4Mb). Thus, our observation of the distance change upon cohesin depletion provides a new explanation of the established observation of compartment strengthening on cohesin removal.
Cohesin depletion increases the entropy in genome folding and variability in gene expression
To better quantify the disorder in genome folding observed across genomic scales, we next computed the change in the entropy of the ensemble of traces upon depletion of cohesin or CTCF. A highly ordered folding pattern is expected to have a more narrow distribution of 3D distances between equally spaced points along the trace. Whether these steps are large or small, a narrow distribution will have a smaller entropy (Figure 6A). In contrast, a heterogeneous ensemble of traces will have a broad distribution, and larger entropy (Figure 6A). We computed the entropy of the distance distribution from each viewpoint along the trace across all of our data, and plotted the change in this value upon CTCF or cohesin removal (Figure 6B, STAR Methods). Cohesin depletion led to increased entropy at all scales relative to NT cells. By contrast CTCF depleted cells showed more modest changes in entropy at the ~2.5 Mb scale, and little change at the ~30 Mb or chromosome scale (Figure 6B).
Previous studies have shown that degradation of cohesin has a limited immediate effect on transcription 2,4,32–34 relative to degradation of transcription factors or cofactors 24,35–37 or genetic disruptions of TAD borders 15,38–41. Given the increased entropy in genome structure, we wondered if cells would exhibit increased variability in gene expression.
We performed multiplexed single-molecule FISH to measure expression of a set of developmentally regulated pluripotency genes. We used cellular barcoding to minimize batch effects among different treatment conditions and cell lines (RAD21-AID +/− Aux, CTCF-AID +/− Aux, E14) (Figure 6C). We also added mock (DMSO) treated cells for each genetic background as a further comparison. The mean mRNA counts showed small fold changes between treatment conditions, scarcely stronger than for our mock treatment (Figure 6D). However, we found that the majority of genes tested showed a statistically significant, moderate increase in the coefficient of variation after cohesin depletion (Figure 6E). Depletion of CTCF had less effect on the coefficient of variation (Figure 6E), likely reflecting less change in genome structure (Figure 2G vs 2H, S2C vs S2D, S6A–C) and entropy (Figure 6B).
Cluster analysis by t-SNE showed that cohesin-depleted cells did not split into distinct groups, nor cluster separately from NT cells (Figure 6F). This suggests increased variability was not due to partial differentiation of the mESCs after auxin treatment for 4 hours. Indeed all the genetic backgrounds and treatment conditions clustered together (Figure 6F), with variation of cell-cycle-dependent factors aligning with one major t-SNE axis and variation in Tbx3 defining the other (Figure 6G), a gene which showed little CTCF or cohesin dependent change (Figure 6D).
Overall our data show that acute loss of cohesin leads to disordered chromosome structures with higher entropy across genomic scales ranging from a few Mb to whole chromosome leading to modestly increased cell-to-cell variability in gene expression, which is missed by bulk population assays.
Discussion
Here we introduced a multiplexed approach for ORCA, enabling simultaneous measurement of chromosome folding across multiple treatment conditions and cell-backgrounds, substantially enhancing experimental throughput. Our approach enables quantification of physical structure in absolute units, enabling quantitative comparison between conditions, overcoming a limitation of Hi-C. As the field moves increasingly to the study of perturbations, we expect the ability to measure absolute frequency differences across genomic length scales without normalization artifacts or batch effects, the ability to measure variation in single cells, and the ability to process multiple conditions at a time, will facilitate more rapid understanding. In this work, we characterized the structure of the interphase genome, expanding our understanding of the role of CTCF and cohesin in shaping that structure beyond their previously recognized roles in TAD formation, and identifying new implications for gene regulation.
Together our multi-scale experiments and simulations suggest that cohesin and CTCF organize the genome through their distinct effects on chromatin loops and loop stacking (Figure 7). Cohesin increases the density of chromatin loops, and thus proximal sequences (< 4 Mb) are brought into frequent contact, while distal (> 4 Mb) sequences are held apart by the clash of many loops. When cohesin molecules collide, the loops formed by cohesin stack together. Loop bases aggregate into a central hub, and steric effects among the projected loops drive the hub into the geometric center. As CTCF-border sites (CBSs) are more likely to be at the base of cohesin loops than other sequences, this process also positions CBSs in a hub at the domain center, creating a “rosette”. This model explains the observed high-frequency multi-way contact among CBSs and their position in the domain center, and explains the dependence of both behaviors on CTCF and cohesin. The experimentally observed “rosette” structure meanwhile explains how CTCF-proximal regions can bypass intervening CTCF-borders to contact more distal, CTCF-linked regions, without abrogating the boundary behavior of the intervening sites (discussed more below) (Figure 7).
This radial organization of chromatin around a central hub of CTCF provides evidence for a long hypothesized “chromatin rosette” model, primarily motivated by pairwise 3C data 42–46, (reviewed in 47–49). This model has since been challenged by some genetic analyses 50 and conflicting results from multicontact Hi-C data regarding multi-way interactions among CBSs 11,51. We reconcile these conflicting views by super-resolution imaging of CTCF-rosette-structures in intact nuclei. We also find them to be rare in a given population snapshot (~17–20%) and thus likely transient. While prior work hypothesizing rosettes emphasized their potential role in multi-enhancer or multi-gene hubs, we find rosettes also play a role in maintaining TAD boundaries.
This rosette organization has important implications for cis-regulatory interactions. The first is a mechanism for TAD-border bypass, which arises from the formation of the CBS-hubs. Cis-regulatory elements proximal to CBSs sites will make more frequent contact with CBS-proximal promoters, even if separated by TAD borders, since all 3 elements participate in the hub (Figure 7A). Additional promoters in this domain that are not near a CBS, even if they are linearly closer to the enhancer in genome sequence, are more likely to be radially extruded outside the hub. Thus, this structure provides a potential explanation for both boundary-bypassing enhancers and promoter-skipping enhancers (Figure 7A). It has been shown that Hoxa genes respond to developmental enhancers positioned near the CBS 1 Mb upstream of the Hoxa locus, on the far side of several unrelated genes and two prominent TAD boundaries 52 23. The updated rosette model provides a simple explanation of how this enhancer might use a chain of cohesin dependent loops to reach the Hoxa genes, without ectopically activating the intervening promoters (Figure 7A). We recently found a similar configuration at the Pitx1 locus in developing mouse limbs, another important developmental control gene 53. The rosette organization also suggests that it will be difficult to disrupt E-P interactions by inserting new CTCF-insulators between them if the enhancers and promoters are themselves CTCF bound. Interestingly, the Sox2 promoter and its mESC enhancer are both CTCF bound, and two groups who independently inserted CTCF sites between these found that it failed to efficiently block E-P communication, despite establishing a new TAD border 54,55.
The second implication of the loop-stacking is that it could protect genes from ectopic regulatory contacts with the rest of the genome (Figure 7B). While the probability of a gene ectopically encountering any particular enhancer >4Mb is extremely low, the total number of such potential ectopic encounters, across all enhancers >4Mb is considerable. By significantly increasing the separation from all elements of the chromosome >4Mb apart, cohesin reduces the potential for any ectopic enhancer-promoter interactions. With this perspective, rather than expecting a global up- or down-regulation of genes after depletion of cohesin, our ORCA measurements suggested that each gene and each cell may behave differently. This would manifest as increased cell-cell variation in mRNA levels, which we observed for most of the developmentally important genes we assayed by single molecule RNA FISH. Recent studies have similarly reported that disruptions affecting population level 3D chromatin organization increase variability in gene expression, using single-cell RNA-seq 56, RNA-FISH and flow-cytometry 57, or integrative modeling of omics data58. We build on this result by directly linking the structural heterogeneity (entropy) in chromatin folding (greater for cohesin depletion than CTCF), to the expression heterogeneity in the same cells (greater for cohesin depletion than CTCF). Whether and how these acute changes in cell-cell variability due to disruption of chromatin structure connect to functional outcomes of cohesin loss or reduction in development and disease 59–64 will require future investigation.
The structural changes we observe in genome organization at the chromosome scale also allowed us to revisit recent conclusions about the role of cohesin in aspects of genome organization, such as compartments and Polycomb loops. It has been proposed that cohesin opposes demixing of the genome into A/B compartments directly, through its ability to bridge compartment boundaries and pull together A and B chromatin 2,18,27. This proposal is supported by the observed strengthening of A/B compartments upon cohesin depletion 2,27. Our data suggest an additional mechanism. By significantly decreasing the frequency of interactions >4 Mb, through the dense stacking and clash of loops, cohesin indirectly disrupts compartmentalization by reducing the opportunity of linearly separated A or B domains to contact and mix (Figure 7B).
It has been proposed that cohesin processivity can disrupt the Polycomb (Pc) homotypic interactions which drive formation of ultra-long range contacts among Polycomb domains (1–20+ Mb), “Polycomb loops” 4. This data is supported by the observation that these loops strengthen upon cohesin degradation4. Such Polycomb loops are proposed to be important for Pc-spreading and epigenetic maintenance 65–67. As we have shown for the 1-Mb scale interaction in the Hoxa region, the shorter Pc-Pc interactions actually decrease in absolute frequency upon cohesin loss, while >4 Mb interactions increase along the entire chromosome. Thus, the effect of cohesin on Pc loops we propose is also an indirect effect of cohesin “linearizing” the chromosome (promoting short <4 Mb contacts and reducing longer range ones). Indeed, shorter range Pc loops were also reported to show less change or increase interaction by Hi-C 4, an effect readily explained by the loop-clash model that would not be expected if cohesin linear scanning were to knock apart Pc-Pc loops.
Thus our work illustrates how chromatin loops and loop-loop interactions affect genome folding, from the sub-TAD scale to the whole chromosome, with implications for transcriptional regulation and epigenetic interactions.
Limitations of the study
Although we identify multi-loop stacks that are dependent on the presence of CTCF and cohesin, our inference that cohesin sits at the base of these loops is indirect, as we do not directly visualize cohesin or CTCF on the chromatin fiber. Our data do not distinguish if the CTCF and cohesin molecules are in molecular contact – the genomic resolution of the traces and corresponding uncertainty in the nanoscale position precludes measuring protein-scale contacts, even while resolving the larger-scale loops of the chromatin fiber. Smaller scale loops along these paths, including those which may arise from short extrusion events, are also unresolved in our data. In exploring potential biophysical mechanisms that could give rise to loop stacking, and its consequent effects on TADs, TAD bypass, and multi-scale genome organization, we focused on models involving cohesin mediated loop extrusion. It is quite likely other biophysical models, such as the Strings & Binders Switch model 68,69, can produce similar agreement to our data without invoking active loop extrusion. We focus on loop extrusion due to the minimal number of parameters that must be inferred and increasing evidence in vitro and in vivo for its role in genome architecture 8,20.
STAR Methods
RESOURCE AVAILABILITY
Lead contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Alistair Boettiger (boettiger@stanford.edu).
Materials availability
This study did not generate new unique reagents.
Data and code availability
All processed ORCA data and unprocessed Western blot images generated in this study have been deposited and are publicly available as of the date of publication. Accession numbers and DOI are listed in the key resources table. The probe locations (Table S1) can also be viewed through the UCSC genome browser: https://genome.ucsc.edu/s/toniai/mm10_Hafner2023
All original code has been deposited at Github and Zenodo and is publicly available as of the date of publication. DOIs are listed in the key resources table.
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
Key resources table.
REAGENT or RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Antibodies | ||
Anti-Geminin antibody | Abcam | ab195047 |
Anti-RAD21 antibody | Abcam | ab154769 |
Anti-CTCF antibody | Abcam | ab128873 |
Anti-Actin antibody | Cell Signaling | 3700 |
Anti-HA antibody | Abcam | 9110 |
Anti-V5 antibody | Abcam | ab27671 |
Chemicals, peptides, and recombinant proteins | ||
Indole – 3 – acetic acid sodium salt (auxin analog) | Sigma-Aldrich | i2886 |
dTAG-13 | Fisher | Tocris 6605/5 |
TCO-PEG4-TFP Ester | Click Chemistry Tools | 1398–2 |
Methyltetrazine-PEG4-Amine | Click Chemistry Tools | 1012–100 |
Methyltetrazine-NHS Ester | Click Chemistry Tools | 1128–25 |
Deposited data | ||
ORCA processed data | This study | 4DN data portal: https://data.4dnucleome.org/A_Hafner_mESC_loop_stacking_chromatin_tracing |
Unprocessed Western blot images | This study | DOI:10.17632/gfk7nvgkcp.1 |
Hi-C (mESs control and RAD21 degraded cells) | Rhodes et al. 2020 | E-MTAB-7816 |
Hi-C (mESs control and CTCF degraded cells) | Nora et al. 2017 | GSE98671 |
CTCF ChIP-seq in mESCs | Bonev et al. 2017 | GSE96107 |
RAD21 ChIP-seq in mESCs | Arruda et al. 2020 | GSM4280494 |
Ring1B ChIP-seq in mESCs | Bonev et al. 2017 | GSE96107 |
Experimental models: Cell lines | ||
E14TG2a (referred to as E14) | Hooper et al. 1987 | |
E14TG2a CTCF-AID (clone) | Nora et al. 2017 | |
E14TG2a RAD21-AID (clone) | Szabo et al. 2020 | |
EED-dTAG and Ring1B-dTAG mES cells | Weber et al. 2021 | |
Oligonucleotides | ||
ORCA oligos | Table S2 | |
Cell barcode oligos (subset of ORCA barcodes with 3prime amine modification) | Table S2 | |
Software and algorithms | ||
ORCA spot calling analysis | Mateo et al. 2019, Mateo et al. 2020 | DOI:10.5281/zenodo7698979 |
Open2C polymer simulations | Imakaev et al., 2019 & This work | DOI: 10.5281/zenodo.7698987 & DOI:10.5281/zenodo.7761973 |
CellPose cell segmentation | Stringer et al., 2021 | |
Cooler | Abdennur and Mirny 2020 |
EXPERIMENTAL MODEL AND SUBJECT DETAILS
Cell lines
E14 mouse embryonic stem cells (E14Tg2a 129/Ola) and the derived Rad21-AID (clone EN272.2) and CTCF-AID (clone EN52.9.1) cell lines were a gift from Elphege Nora. Ring1B and EED dTAG cell line (derived from TC1(129) mouse ESCs) was a gift from Christopher Weber and the Crabtree lab.
METHOD DETAILS
Cell culture, treatment and collection
Cells were cultured in Knockout Dulbecco’s modified Eagles medium (Thermo Fisher, cat. No. 10829–018), 15% qualified embryonic stem cell FBS (Thermo Fisher, cat. No. 16141079), 1x GlutaMAX™ Supplement (Thermo fisher, cat. No. 16141079), 1x Gibco™ Penicillin-Streptomycin (Thermo Fisher cat. No. 15140122), 1x MEM Non-Essential Amino Acids Solution (Thermo Fisher cat. No. 11140050), 10mM HEPES (Thermo Fisher cat. No. 15630080), 0.1mM β-mercaptoethanol (Thermo Fisher cat. No. 21985023) and LIF (final 1000 U/ml, Millipore Sigma ESG1107). E14, CTCF-AID and RAD21-AID mESCs were grown on 0.1% gelatin. Ring1B and EED dTAG mESCs cells were grown on 0.1% gelatin coated dishes with EmbryoMax® Primary Mouse Embryonic Fibroblasts (Sigma, PMEF-CF).
To degrade CTCF or RAD21, a final concentration of 500uM IAA (Sigma i2886) was used for all experiments. Cells were treated for 4 hours, media was changed in the untreated condition at the same time as the fresh media with auxin was added to auxin treated cells. To degrade EED and RING1b, dTAG-13 was added for 8 h at 500 nM as previously described 24,25.
For cell collection, cells were trypsinized, counted, spun down and after removal of supernatant media, were fixed in suspension in 4%PFA in 1xPBS on a nutator for 10 minutes, washed 3 times with 1xPBS, resuspended in 70%EtOH in 1xPBS at a concentration of 1–10 000 cells/ul and stored at −20C for up to 6 months. For Ring1B and EED dTAG mESCs grown on feeder cells, serial replating was performed on 0.1% gelatin coated plates in order to deplete the feeder cells.
Western blot
Cells were collected by trypsinization, washed in 1xPBS and pellets were frozen on dry ice/EtOH bath and stored at −80C. Cells were lysed in mRIPA buffer with Protease and Phosphatase inhibitor cocktail. 20ug of protein was run on a 4–20% Mini-PORTEAN TGX precast gels in Tris-Glycine-SDS running buffer. After transfer onto the nitrocellulose membrane, and blocking in Li-Cor Intercept Blocking Buffer (TBS), blots were probed with either CTCF (Abcam 128873, 1:1000) or Rad21 (Abcam 154769, 1:1000) antibodies as well as beta-Actin (Cell Signaling 3700) in blocking buffer at 4C overnight. Membranes were then washed 3× 5min in TBST at room temperature, incubated with secondary antibodies (Li-Cor: IRDye 680RD Donkey anti-Rabbit 926–68073 and IRDye 800CW Donkey anti-Mouse, 1:10 000 diluted in blocking buffer) for 1h at room temperature. After 3× 5 min washes in TBST, and 1 wash in TBS, samples were imaged on the Li-Cor scanner.
Western blot for Ring1b and EED (Figure S3E) was reproduced from 25. The protocol is the same as above but with the following antibodies: V5 tag (abcam 27671, 1:800), targeting the V5-tagged EED); HA-tag (abcam 9110) (labeling the HA-tagged RING1b, 1:800).
Multiplexed cell labeling
The multiplexed cell labeling protocol was adapted from Gehring et al. 2020. The detailed protocol is as follows.
Part 1: Preparation of labelled oligos
3’ amine-modifed oligos were ordered from IDT (Table S2) and resuspended to 500 μM in 50 mM sodium borate buffer pH 8.5
In 1.5 mL microcentrifuge tube, add 25.0 μL 3’ amine-modified oligo in 50 mM sodium borate buffer, 8.2 μL 10 mM Methyltetrazine-NHS ester, 41.8 μL DMSO, incubate protected from light for 30 minutes with agitation
After 30 minutes, add another 8.2 μL 10 mM Methyltetrazine-NHS ester, then add once again after an additional 30 minutes
After the 90-minute reaction, quench by adding 180 μL 50 mM sodium borate buffer and 30 μL 3 M NaCl, then add 750 μL ice cold 100% EtOH
Precipitate @ −80 °C overnight
Transfer contents to ultracentrifuge tubes, balance with analytical balance, and spin at 20,000xg for 30 minutes at 4 °C
Wash twice with 750 μL 70% EtOH, ensuring not to disrupt the pellet (don’t mix pipette). Additionally, let pellet dry for ~10 minutes in fume hood until nearly all ethanol has evaporated
Resuspend pellet in 100 μL cold HEPES buffer
Quantify recovery. Typical concentration is around ~80 μM.
Part 2: Cell labeling
Between 500K and 5M fixed cells (stored in 70%EtOH in 1xPBS at −20C) were washed twice in 1xPBS and resuspended in 100ul of 1xPBS.
4 μL of 1 mM TCO-PEG4-TFP Ester was added to the cells, mixed by pipetting and incubated for 5 min @RT protected from light
6 μL of each methyltetrazine oligo amide (prepared in Part 1) was added, mixed by pipetting and incubated for 30 min @RT protected from light
Methyltetrazine-PEG4-Amine was added for final concentration of 50 μM and Tris HCl was added to 10 mM final concentration and incubated for 5 min @RT
Cells were then diluted ~twofold with 1X PBS, spun down and washed 3x in 1xPBS
Cells were resuspended in 70% EtOH in 1xPBS, and kept −20C for up to 6 months after the initial cell collection date.
Cell plating for imaging
Cells stored in suspension at −20°C were resuspended by pipetting. Barcoded cells were combined at desired ratios in a single tube. A small volume of cells (~5ul of cells at 10k/ul-20k/ul concentration) was spotted on a poly-lysine coated area in the center of the slide and let dry for 3–5 min. 1xPBS was then added and the slide was inspected to check for optimal cell density (ideally monolayer of cells evenly covering the entire slide). Cells were then used for IF or ORCA.
ORCA primary probe hybridization
For ORCA DNA experiments, we used the protocol described in Mateo et al. 2019 and in detail in Mateo et al. 2020. Briefly, cells were fixed in 4% PFA in 1x PBS for 10 min. After cell plating, cells were permeabilized with 0.5% Triton-X in 1xPBS for 10 min, followed by 2 washes with 1x PBS. Cells were incubated in 0.1M HCl for 5 min followed by 3 washes in 1xPBS. We then treated cells for 30 min with RNAse A (10 ug/ml) at 37°C followed by 3 washes in 2xSSC. Cells were then incubated for 35 min in Hybridization buffer (2xSSC, 50% formamide and 0.1% Tween). Primary probes (for ~2.5 Mb chr 6 and chr 3 probes, we used 15 ug of probe, for ~ 30 Mb and ~150 Mb scale tiling we used 3–4 ug of probe) diluted in hybridization solution (2xSSC, 50% formamide, 10% dextran sulfate, 0.1% Tween) were then added onto cells, covered with a coverglass, denatured for 3min at 90C and incubated overnight at 42C. The following day, cells were washed with 2xSSC, postfixed in 2% GA, 8% PFA in 1xPBS for 30 min to 1 h and set up on the microscope for sequential labeling and imaging. If ORCA was preceded by IF, we picked positions to match as closely as possible to cells imaged by IF. Cell barcodes were either imaged before or after probe barcodes.
RNA labeling was done as described above for DNA with the exception of the HCl, RNAse A or post fix steps.
ORCA imaging
Samples hybridized with primary probes were imaged on the custom microscopy and microfluidics setup as previously described 15,71. The details of the microscope setups are also documented on the Micro-Meta App 72. We used secondary oligos carrying either a Cy5 or a 750 dye complementary to the readout barcodes on primary probes. We used strand displacement to remove imaged barcodes prior to imaging the next barcode.
As fiducial we designed a subset of probes, either directly adjacent to the readout probes for the ~2.5Mb domains, or in the middle of the domain for the ~30 Mb and ~150 Mb domains, that carried a fiducial sequence, that we labeled with a secondary Cy3 oligo. These Cy3 fiducial probes were labeled at the first round of imaging, together with the first readout and were imaged at each round of imaging. The fiducial spots were subsequently used for spot calling and registration in the image analysis pipeline. We note that due to weak fiducial labeling for the ~2.5 Mb domains, we designed an adapter to bind to all probes for these domains.
Adapter sequences:
chr6_3Mb_fid_adapter: ATCGACCCGGCATCAACGCCAATCGACGTGGGACATCACG
chr3_3Mb_fid_adapter: ATCGACCCGGCATCAACGCCACCCGCGTCGAGCCAGTTAG
Immunofluorescence followed by ORCA
Plated cells were permeabilized and blocked in antibody dilution buffer (2%BSA, 0.1% Triton X −100 in 1xPBS) for 10 min @RT. Cells were then incubated with the Geminin primary antibody (ab195047, 1:100) diluted in antibody dilution buffer for 1 h @ RT, washed 3x with 1xPBS, incubated with secondary antibody (anti-rabbit 647 Invitrogen A31573) and DAPI diluted in antibody dilution buffer for 30 min @RT, washed 3x with 1xPBS after which they were ready to image.
QUANTIFICATION AND STATISTICAL ANALYSIS
Loop extrusion simulations
Loop extrusion simulations were performed using the “polychrom” package from “open2c” github project19, which was developed from earlier polymer models of loop extrusion written by the Mirny lab6,18, and powered by the GPU accelerated molecular simulation toolkit openMM73. All simulations were run on an NVIDIA Titan Xp card. Polychrom uses a Langevin approach to simulate the dynamics of a polymer under several user defined energy constraints. Complete details of the energetic constraints and other parameters are specified in the python simulation scripts included in our github repository for this project: https://github.com/BoettigerLab/Hafner2022/ and github.com/BoettigerLab/polychrome, and described in brief below. These can be run independently using polychrom, which may be downloaded from the polychrom github page: https://https://github.com/open2c/polychrom.
We simulated the chromosomes as 6 polymers confined in a spherical geometry representing the nucleus. Each polymer was 5,000 monomers long, corresponding to 30 kb/monomer (with monomer diameter ~50 nm) for 150 Mb chromosomes, on par with the probe size used at our 3 Mb, 30 Mb, and whole-chromosome (150 Mb) experiments. Each simulation was run in 600 independent replicates, for sufficient time that the final 3D structures were uncorrelated with the starting structure at all length scales and that the pattern of TADs and loops could be reproduced from averaging single replicates over time identically as to averaging over replicates (see below for loop extrusion and TADs).
Loop extrusion was simulated using the previously described sequential 1D and 3D simulations, implemented by polychrom. Briefly, the 1D simulations simulate the loading, unloading, and motion of loop extruders with left and right arms that walk bi-directionally on chromatin and interactions with extrusion blockers. These are followed by 3D simulations, in which short harmonic bonds between the left and right arms of the extrusion factors impose the loop. Loop extrusion blocking sites, simulating CTCF, were positioned along the 3Mb fiber as follows (in kb): [150, 870, 900, 1170, 1380, 1770, 2250, 2400, 2700], with the following probabilities for stalling extrusion factors: [.5, .95, .95, .1, .4, .4, .95, .95, .5], acting in the following respective directions, where 0 is a bidirectional bloc, 1 a blocks extruders only from the right and 2 only from the left: [1 2 1 0 0 0 2 1 0 ]. This produced the contact pattern shown in Figure 1F. This 3 Mb pattern was replicated back to back to produce the 150 Mb chromosome pattern. 250 loop extruders (cohesin) loaded randomly with uniform probability across the chromatin fiber, and walked along chromatin with a half-life corresponding to 300 kb extrusion. The confinement radius for the nucleus was chosen so 20% of the nuclear volume was filled with polymer. Confinement volumes 0.02% were used to represent the unconstrained conditioned and 0.2 the 10% of relaxation volume condition. A hard-sphere repulsion energy of 3 was used in the low density and high density simulations, and later increased to 6 in “loop clash” simulations, which simulated a substantially reduced frequency of chain crossing. To simulate A/B compartmentalization, we divided each chromosome into 6 evenly sized blocks, alternating types A and B. Both A and B types were given weak homophilic interaction energies of 0.2, which resulted in mild demixing at the single trace level. See code available on github link above for details.
It should be emphasized that these are coarse-grained simulations, intended to capture the qualitative, emergent properties of a small number of physical processes – such as loop extruders moving on a large polymer in a confined volume. The utility of these coarse grained physics models is not to capture as many features of reality as possible in a common model, but rather to reproduce key patterns in the experimental data from a minimal mechanistic hypothesis 74,75. By using as few features from reality as possible in the model rather than as many as possible, we can develop deeper intuition about the underlying processes. As a consequence of coarse-graining abstraction, the precise relation between simulation units, monomers to kb, cohesin spacing to kb, monomer radii to nanometers, etc, are approximate. We did not undertake a comprehensive sweep of the parameter space of the model. The intensive computational time for the thousands of independent simulations limit such analysis, and showing that the multiscale qualitative effects of cohesin chromatin folding can be captured by loop extrusion did not require it.
Image processing and quantification
Image processing and spot calling for ORCA data
Image processing (drift correction and localization of spots) analysis was performed as described in 15,71 using ORCA analysis tools: github.com/BoettigerLab/ORCA-public.
Cell segmentation
We used the CellPose76 for segmenting individual nuclei.
Geminin IF and cell size quantification
For Geminin IF quantification and the quantification of nuclear size, we performed all calculations on the experiments for the Hoxa and Sox2 domains for 2.2 and 2.8 Mb respectively where cells were stained for Dapi and Geminin prior to the ORCA experiments. Nuclear segmentation was done on Dapi staining using CellPose. The resulting cell masks were then used to quantify levels of Geminin immunofluorescence by computing the mean intensity per nucleus. The Geminin low population was defined as the lowest 25% of the geminin expression level across all stained cells within the experiment (combined the E14 parental cell line together with CTCF-AID and Rad21-AID auxin treated and untreated cells).
Demultiplexing cell barcodes
For each fiducial spot, we quantified the intensity of each cell barcode signal. To normalize for differences in brightness across cell barcodes, we then balanced the intensities across all values. We used the nearest neighbors approach to classify each spot into groups based on the values for all cell barcodes for that spot.
RNA FISH data analysis
RNA FISH data was quantified similarly to15. Briefly, images were maximum-z-projected and flatfield corrected. Foci corresponding to single mRNAs were identified by using a local-maxima search with manually defined thresholds. Foci positions were then overlaid with cell segmentation masks from CellPose to compute single-cell transcript mRNA counts.
ORCA analyses
Merge and filtering of ORCA data
For the ~2.5 Mb scale data, we had 2 technical replicates (performed on the same batch of collected and barcoded cells) and a biological replicate (independently thawed, treated, fixed and barcoded cells). Replicates showed a high degree of reproducibility and showed identical patterns of TADs and loops and were comparable in median distance (Figure S1C). We thus merged the data for all subsequent analyses. Barcodes with low hybridization efficiency (< 10 %) were filtered on the merged data. An additional filter was done on a dataset basis to remove obvious artifacts such as failed strand displacement events.
Determining the loop threshold
As described in text, to avoid using an arbitrary contact threshold, we used the average distance between centroids of adjacent steps as the cut-off distance for calling a contact/loop. We calculated this threshold based on all NT cells for the ~2.5 Mb sized domains on chr3 (th=256 nm) and chr6 (th=238 nm) that we imaged. For Ring1B and EED degron NT cells, th= 196 nm for plots in Figure S3.
t-SNE analyses DNA and RNA
t-SNE analyses in Figure 1, S2 were done on the 3rd replicate of the data combining all imaged conditions (CTCF-AID NT, CTCF-AID +4h Aux, RAD21-AID NT, RAD21-AID +4h Aux and E14 NT). In these cells we also labeled Geminin by immunofluorescence, as described above, to distinguish G1 (defined as cells with the lowest 25% of Geminin) and G2 cells (with Geminin levels in the highest 35%). For t-SNE analyses in Figure S3C, we used all NT cells (CTCF-AID NT, RAD21-AID NT, E14 NT) merged across all replicates to match the hub analysis in Figure 3.
To avoid computational challenges from missing data in the t-SNE analysis, any missing points in each polymer path were first linearly interpolated from the x,y,z positions of the observed flanking positions. Any traces with less than 50% of the points detected above the required confidence threshold for calling a point were excluded from the analysis. The remaining traces were converted into single cell distance maps, which were then transformed into linear vectors. We performed t-SNE using the builtin Matlab(™) R2022b function t-SNE to project the resulting array N2 of vectors into a two dimensional representation (where N=73 for the chr6 region studied and N=93 for the chr3 region studied). The cell type and cell-cycle state of each trace was denoted by color.
For the analysis of mRNA expression by t-SNE, in Figure 6, we created a vector for each cell recording the number of mRNA transcripts observed for each of the 12 genes assayed. We then used the builtin Matlab(™) R2022b function t-SNE to project these data into a 2-dimensional t-SNE representation.
Three-way contact analyses
For the three-way CTCF contact analyses in Figure 3, we picked the 3 CTCF sites for Hoxa (readouts 23, 46, 57, Tables S1 and S2) and Sox2 regions (readouts 45, 49, 61, Tables S1 and S2). We calculated the frequency of chromosomes with loops between sites 1–2 and 2–3. This frequency was normalized for detection efficiency for these pairs. We then shifted barcode positions 8 steps (with each step= 30 kb) in each direction and performed the same calculation for each shift. The expected frequencies were generated by multiplying the frequency for the pairs (frequency(1,2) * frequency (2–3)) and dividing by their detection efficiency.
We performed bootstrapping to estimate the confidence intervals for the contact frequencies. Bootstrapping was done by randomly sampling with replacement from the data. The height bars in the barplot (Figure 3B and S3H) correspond to the mean and the error bars correspond to the 25th and 75th quantiles. Thus, error bars that do not overlap reflect contact-frequencies that were distinct in 94% of resamplings (94 = 1− 0.252). We used 200 resampling draws in our bootstrapping, and noted more resamplings made little difference on the spread.
Identification of hub-containing chromosomes
We define as ‘hub traces’ all chromosomes with 3-way contacts for any 3 out of 5 CBSs (corresponding to readouts 15, 23, 37, 46, 57 for chr 6; and readouts: 21, 45, 49, 61, 76 for chr 3). For centrality analysis we calculated the centroid of each hub-trace and computed the distance (in nm) for each readout in the trace relative to the centroid. For Figure 3F, we plotted the median distance to centroid across all hub traces. We then computed the loop frequency for these CBS hub-containing chromosomes (Figure 3H, S3A) and loop difference between all CBS-hub and all NT traces (Figure S3B).
Entropy
For a given chromosome trace and each readout in the trace, we calculated the 3D distance (r) to all other readouts. Across all traces and for each pair of readouts (i,j), this became a distribution of distances Di,j, which was then normalized by the number of traces (N) to obtain: Pi,j=Di,j/N. The entropy (S) for the readout pair i,j was then computed by summing over all distances, r : Si,j= − sumr ( Pi,j × log2(Pi,j)). For each readout i, we then computed the average entropy: Si = mean(Si,j). For Figure 6B, we plotted the difference in S per barcode between NT cells and cells without CTCF or RAD21.
Compartment analysis using Pearson’s Matrix
Compartmentalization was analyzed in the ORCA data starting from the median pairwise distance matrices, O, for both the ~30 Mb scale and whole chromosome (~150 Mb) data. The “expected” distance was computed by averaging the 3D distance for all loci pairs which had the same genomic spacing. This produces a matrix, E, with a smooth decay from the main diagonal. We then divided the observed distances by the expected distances to create a new matrix, N, Ni,j = Oi,j/Ei,. Finally, we computed the Pearson Matrix P, such that Pi,j is Pearson’s correlation coefficient between row i and column j of matrix N. This parallels the original analysis of compartmentalization described for Hi-C, though using our distance measures in place of the contact-frequency per bin28.
We note that the Hi-C and ORCA compartment analysis were computed with an important difference. The Hi-C compartment analysis at the 30 and 150 Mb scale uses reads from every part of the 30Mb (150 Mb) interval. In contrast, the ORCA data looks exclusively at particular 30 kb elements that skip through that domain at 1 Mb or 5 Mb intervals. Thus, substantial parts of the genome driving the correlation signal in the Hi-C data are not included. We note that our calculation using Hi-C data that focused only on the interactions between the 30 kb windows covered by the ORCA experiments were too sparsely populated to permit detection of compartments. Expanding these to 120 kb bins resulted in only minor improvement, and thus we elect to show the Hi-C compartmentalization computed using all the data. Thus, some qualitative differences in the position/distribution of the compartment boundaries likely reflects the contributions of genomic sequences not covered in our imaging experiments, though the sparsity of the Hi-C data makes this difficult to quantify precisely.
We used compartment calls in 70 (Table S3) to map regions imaged by ORCA at the 150 Mb scale to A or B compartments.
Compartment clustering in single traces
A and B readouts were annotated according to compartment annotations (Table S3). DBSCAN was used to cluster between A or B readouts, using the built-in Matlab(™) R2022b function dbscan, with a minimum cluster size of 2 and a threshold distance equal to the average consecutive-step distance of the trace.
Published genomic data
Hi-C data
We examined Hi-C experiments in mESCs from Nora et al. 2017 1 for CTCF degradation, from Rhodes et al. 2020 4 for Rad21 degradation. This data was loaded in the processed Hi-C .cool files from either GEO (GSE98671) or ArrayExpress (E-MTAB-7816) respectively. We used Cooler77 to export normalized tables directly using the cooler ‘dump’ command for the coordinates that match our probes. We then downsampled the data by averaging bins to match the resolution of the probes used for ORCA (30 kb for the ~2.5 Mb scale, 1 Mb for the ~30 Mb scale and 5 Mb for the ~150 Mb scale probes).
ChIP data
CTCF and Ring1b ChIP-seq data is from 70 RAD21 ChIP-seq data is from 78. Processed bigwig files from GEO (GSE96107 for CTCF and RIng1b or GSM4280494 for RAD21) were viewed in the UCSC genome browser.
Supplementary Material
Highlights:
Chromosome tracing shows radially organized chromosome loops
CTCF boundary sites form hubs that depend on cohesin and CTCF
Loss of cohesin leads to expansion at the <4 Mb scale and increased mixing at >4 Mb scale
Loss of cohesin increases entropy in genome folding and variation in gene expression
Acknowledgements
We thank Max Imakaev and Neil Chowdhury for code sharing and feedback on the polymer simulations; the 4DN community for support and the opportunities to present this work; Andrea Cosolo and Rahi Navelkar (4DN) for help with formatting and uploading the data to the 4DN server; Allison Dear and Kevin Wang’s lab at Stanford for assistance with Western blots; Liang-Fu Chen, Tzuchiao Hung, and Carly Walker for comments on the manuscript; current and former members of the Boettiger lab for feedback and discussion. This work was supported by NIH grants U01DK127419 and DP2GM132935A, the Packard Foundation, and a Beckman Young Investigators Award to A.N.B. A.H. was supported by Walter and Idun V. Berry Postdoctoral fellowship. M.P. was supported by a postdoctoral fellowship from the Damon Runyon Cancer Research Foundation. S.E.B. was supported by the Stanford Graduate Fellowship. S.E.M. was supported by the Stanford Bio-X SIGF Fellowship.
Inclusion and diversity statement
We support inclusive, diverse, and equitable conduct of research
Footnotes
Declaration of interests
Authors declare no competing interests.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Nora EP, Goloborodko A, Valton A-L, Gibcus JH, Uebersohn A, Abdennur N, Dekker J, Mirny LA, and Bruneau BG (2017). Targeted Degradation of CTCF Decouples Local Insulation of Chromosome Domains from Genomic Compartmentalization. Cell 169, 930–944.e22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Rao SSP, Huang S-C, Glenn St Hilaire B, Engreitz JM, Perez EM, Kieffer-Kwon K-R, Sanborn AL, Johnstone SE, Bascom GD, Bochkov ID, et al. (2017). Cohesin Loss Eliminates All Loop Domains. Cell 171, 305–320.e24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bintu B, Mateo LJ, Su J-H, Sinnott-Armstrong NA, Parker M, Kinrot S, Yamaya K, Boettiger AN, and Zhuang X (2018). Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rhodes JDP, Feldmann A, Hernández-Rodríguez B, Díaz N, Brown JM, Fursova NA, Blackledge NP, Prathapan P, Dobrinic P, Huseyin MK, et al. (2020). Cohesin Disrupts Polycomb-Dependent Chromosome Interactions in Embryonic Stem Cells. Cell Rep. 30, 820–835.e10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kubo N, Ishii H, Xiong X, Bianco S, Meitinger F, Hu R, Hocker JD, Conte M, Gorkin D, Yu M, et al. (2021). Promoter-proximal CTCF binding promotes distal enhancer-dependent gene activation. Nat. Struct. Mol. Biol. 10.1038/s41594-020-00539-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Fudenberg G, Imakaev M, Lu C, Goloborodko A, Abdennur N, and Mirny LA (2016). Formation of Chromosomal Domains by Loop Extrusion. Cell Rep. 15, 2038–2049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sanborn AL, Rao SSP, Huang S-C, Durand NC, Huntley MH, Jewett AI, Bochkov ID, Chinnappan D, Cutkosky A, Li J, et al. (2015). Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl. Acad. Sci. U. S. A. 112, E6456–E6465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mirny L, and Dekker J (2022). Mechanisms of Chromosome Folding and Nuclear Organization: Their Interplay and Open Questions. Cold Spring Harb. Perspect. Biol. 14. 10.1101/cshperspect.a040147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Luppino JM, Park DS, Nguyen SC, Lan Y, Xu Z, Yunker R, and Joyce EF (2020). Cohesin promotes stochastic domain intermingling to ensure proper regulation of boundary-proximal genes. Nat. Genet. 52, 840–848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Szabo Q, Donjon A, Jerković I, Papadopoulos GL, Cheutin T, Bonev B, Nora EP, Bruneau BG, Bantignies F, and Cavalli G (2020). Regulation of single-cell genome organization into TADs and chromatin nanodomains. Nat. Genet. 52, 1151–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Allahyar A, Vermeulen C, Bouwman BAM, Krijger PHL, Verstegen MJAM, Geeven G, van Kranenburg M, Pieterse M, Straver R, Haarhuis JHI, et al. (2018). Enhancer hubs and loop collisions identified from single-allele topologies. Nat. Genet, 1. [DOI] [PubMed] [Google Scholar]
- 12.Liu M, Lu Y, Yang B, Chen Y, Radda JSD, Hu M, Katz SG, and Wang S (2020). Multiplexed imaging of nucleome architectures in single cells of mammalian tissue. Nat. Commun. 11, 2907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Su J-H, Zheng P, Kinrot SS, Bintu B, and Zhuang X (2020). Genome-Scale Imaging of the 3D Organization and Transcriptional Activity of Chromatin. Cell 182, 1641–1659.e26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Takei Y, Yun J, Zheng S, Ollikainen N, Pierson N, White J, Shah S, Thomassie J, Suo S, Eng C-HL, et al. (2021). Integrated spatial genomics reveals global architecture of single nuclei. Nature 590, 344–350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Mateo LJ, Murphy SE, Hafner A, Cinquini IS, Walker CA, and Boettiger AN (2019). Visualizing DNA folding and RNA in embryos at single-cell resolution. Nature 568, 49–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gehring J, Hwee Park J, Chen S, Thomson M, and Pachter L (2020). Highly multiplexed single-cell RNA-seq by DNA oligonucleotide tagging of cellular proteins. Nat. Biotechnol. 38, 35–38. [DOI] [PubMed] [Google Scholar]
- 17.Chen L-F, Long HK, Park M, Swigut T, Boettiger AN, and Wysocka J (2022). Structural elements facilitate extreme long-range gene regulation at a human disease locus. bioRxiv, 2022.10.20.513057. 10.1101/2022.10.20.513057. [DOI] [PubMed] [Google Scholar]
- 18.Nuebler J, Fudenberg G, Imakaev M, Abdennur N, and Mirny LA (2018). Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc. Natl. Acad. Sci. U. S. A. 115, E6697–E6706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Imakaev M, Goloborodko A, and Brandao H (2019). polychrom: Polymer simulations of chromosomes and generating “in silico” Hi-C maps (Github; ) https://zenodo.org/badge/latestdoi/178608195. [Google Scholar]
- 20.Davidson IF, and Peters J-M (2021). Genome folding through loop extrusion by SMC complexes. Nat. Rev. Mol. Cell Biol. 22, 445–464. [DOI] [PubMed] [Google Scholar]
- 21.Berlivet S, Paquette D, Dumouchel A, Langlais D, Dostie J, and Kmita M (2013). Clustering of Tissue-Specific Sub-TADs Accompanies the Regulation of HoxA Genes in Developing Limbs. PLoS Genet. 9, e1004018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yun H, Narayan N, Vohra S, Giotopoulos G, Mupo A, Madrigal P, Sasca D, Lara-Astiaso D, Horton SJ, Agrawal-Singh S, et al. (2021). Mutational synergy during leukemia induction remodels chromatin accessibility, histone modifications and three-dimensional DNA topology to alter gene expression. Nat. Genet. 53, 1443–1455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wilderman A, D’haene E, Baetens M, Yankee TN, Winchester EW, Glidden N, Roets E, Van Dorpe J, Vergult S, Cox TC, et al. (2022). A distant global control region is essential for normal expression of anterior HOXA genes during mouse and human craniofacial development. bioRxiv, 2022.03.10.483852. 10.1101/2022.03.10.483852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Weber CM, Hafner A, Kirkland JG, Braun SMG, Stanton BZ, Boettiger AN, and Crabtree GR (2021). mSWI/SNF promotes Polycomb repression both directly and through genome-wide redistribution. Nat. Struct. Mol. Biol. 28, 501–511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Murphy S, and Boettiger AN (2022). Polycomb repression of Hox genes involves spatial feedback but not domain compaction or demixing. bioRxiv, 2022.10.14.512199. 10.1101/2022.10.14.512199. [DOI] [PubMed] [Google Scholar]
- 26.Polovnikov K, Belan S, Imakaev M, Brandão HB, and Mirny LA (2022). Fractal polymer with loops recapitulates key features of chromosome organization. bioRxiv, 2022.02.01.478588. 10.1101/2022.02.01.478588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Schwarzer W, Abdennur N, Goloborodko A, Pekowska A, Fudenberg G, Loe-Mie Y, Fonseca NA, Huber W, Haering C, Mirny LA, et al. (2017). Two independent modes of chromatin organization revealed by cohesin removal. Nature 551, 51–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lieberman Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, et al. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ou HD, Phan S, Deerinck TJ, Thor A, Ellisman MH, and O’Shea CC (2017). ChromEMT: Visualizing 3D chromatin structure and compaction in interphase and mitotic cells. Science 357. 10.1126/science.aag0025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Finn EH, Pegoraro G, Brandão HB, Valton A-L, Oomen ME, Dekker J, Mirny L, and Misteli T (2019). Extensive Heterogeneity and Intrinsic Variation in Spatial Genome Organization. Cell 176, 1502–1515.e10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wang S, Su J-H, Beliveau BJ, Bintu B, Moffitt JR, Wu C-T, and Zhuang X (2016). Spatial organization of chromatin domains and compartments in single chromosomes. Science 353, 598–602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hsieh T-HS, Cattoglio C, Slobodyanyuk E, Hansen AS, Darzacq X, and Tjian R Enhancer-promoter interactions and transcription are maintained upon acute loss of CTCF, cohesin, WAPL, and YY1. 10.1101/2021.07.14.452365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Aljahani A, Hua P, Karpinska MA, Quililan K, Davies JOJ, and Oudelaar AM (2022). Analysis of sub-kilobase chromatin topology reveals nano-scale regulatory interactions with variable dependence on cohesin and CTCF. Nat. Commun. 13, 2139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Liu NQ, Maresca M, van den Brand T, Braccioli L, Schijns MMGA, Teunissen H, Bruneau BG, Nora EP, and de Wit E (2021). WAPL maintains a cohesin loading cycle to preserve cell-type-specific distal gene regulation. Nat. Genet. 53, 100–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bates LE, Alves MRP, and Silva JCR (2021). Auxin-degron system identifies immediate mechanisms of OCT4. Stem Cell Reports 16, 1818–1831. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Jaeger MG, Schwalb B, Mackowiak SD, Velychko T, Hanzl A, Imrichova H, Brand M, Agerer B, Chorn S, Nabet B, et al. (2020). Selective Mediator dependence of cell-type-specifying transcription. Nat. Genet. 52, 719–727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Muhar M, Ebert A, Neumann T, Umkehrer C, Jude J, Wieshofer C, Rescheneder P, Lipp JJ, Herzog VA, Reichholf B, et al. (2018). SLAM-seq defines direct gene-regulatory functions of the BRD4-MYC axis. Science 360, 800–805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lupiáñez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, Horn D, Kayserili H, Opitz JM, Laxova R, et al. (2015). Disruptions of Topological Chromatin Domains Cause Pathogenic Rewiring of Gene-Enhancer Interactions. Cell 161, 1012–1025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Flavahan WA, Drier Y, Liau BB, Gillespie SM, Venteicher AS, Stemmer-Rachamimov AO, Suvà ML, and Bernstein BE (2015). Insulator dysfunction and oncogene activation in IDH mutant gliomas. Nature, 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hnisz D, Weintraub AS, Day DS, Valton A-L, Bak RO, Li CH, Goldmann J, Lajoie BR, Fan ZP, Sigova AA, et al. (2016). Activation of proto-oncogenes by disruption of chromosome neighborhoods. Science 351, 1454–1458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hnisz D, Day DS, and Young RA (2016). Insulated Neighborhoods: Structural and Functional Units of Mammalian Gene Control. Cell 167, 1188–1200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gerasimova TI, Byrd K, and Corces VG (2000). A chromatin insulator determines the nuclear localization of DNA. Mol. Cell 6, 1025–1035. [DOI] [PubMed] [Google Scholar]
- 43.Gombert Wendy M, Farris Stephen D, Rubio Eric D, Morey-Rosler Kristin M, Schubach William H, and Anton Krumm (2003). Thec-myc Insulator Element and Matrix Attachment Regions Definethe c-myc ChromosomalDomain. Mol. Cell. Biol. 23, 9338–9348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Montavon T, Soshnikova N, Mascrez B, Joye E, Thevenet L, Splinter E, de Laat W, Spitz F, and Duboule D (2011). A Regulatory Archipelago Controls Hox Genes Transcription in Digits. Cell 147, 1132–1145. [DOI] [PubMed] [Google Scholar]
- 45.Guo Y, Monahan K, Wu H, Gertz J, Varley KE, Li W, Myers RM, Maniatis T, and Wu Q (2012). CTCF/cohesin-mediated DNA looping is required for protocadherin α promoter choice. Proc. Natl. Acad. Sci. U. S. A. 109, 21081–21086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Shih H-Y, Verma-Gaur J, Torkamani A, Feeney AJ, Galjart N, and Krangel MS (2012). Tcra gene recombination is supported by a Tcra enhancer- and CTCF-dependent chromatin hub. Proc. Natl. Acad. Sci. U. S. A. 109, E3493–E3502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Labrador M, and Corces VG (2002). Setting the boundaries of chromatin domains and nuclear organization. Cell 111, 151–154. [DOI] [PubMed] [Google Scholar]
- 48.Phillips-Cremins JE, and Corces VG (2009). CTCF: master weaver of the genome. Cell 137, 1194–1211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gómez-Díaz E, and Corces VG (2014). Architectural proteins: regulators of 3D genome organization in cell fate. Trends Cell Biol. 24, 703–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Kyrchanova O, and Georgiev P (2014). Chromatin insulators and long-distance interactions in Drosophila. FEBS Lett. 588, 8–14. [DOI] [PubMed] [Google Scholar]
- 51.Oudelaar AM, Davies JOJ, Hanssen LLP, Telenius JM, Schwessinger R, Liu Y, Brown JM, Downes DJ, Chiariello AM, Bianco S, et al. (2018). Single-allele chromatin interactions identify regulatory hubs in dynamic compartmentalized domains. Nat. Genet. 50, 1744–1751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wang XQD, Gore H, Himadewi P, Feng F, Yang L, Zhou W, Liu Y, Wang X, Chen C-W, Su J, et al. (2020). Three-dimensional regulation of HOXA cluster genes by a cis-element in hematopoietic stem cell and leukemia. bioRxiv, 2020.04.16.017533. 10.1101/2020.04.16.017533. [DOI] [Google Scholar]
- 53.Hung T-C, Kingsley DM, and Boettiger A (2023). Boundary stacking interactions enable cross-TAD enhancer-promoter communication during limb development. bioRxiv, 2023.02.06.527380. 10.1101/2023.02.06.527380. [DOI] [PubMed] [Google Scholar]
- 54.Huang H, Zhu Q, Jussila A, Han Y, Bintu B, Kern C, Conte M, Zhang Y, Bianco S, Chiariello AM, et al. (2021). CTCF mediates dosage- and sequence-context-dependent transcriptional insulation by forming local chromatin domains. Nat. Genet. 53, 1064–1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Chakraborty S, Kopitchinski N, Zuo Z, Eraso A, Awasthi P, Chari R, Mitra A, Tobias IC, Moorthy SD, Dale RK, et al. (2023). Enhancer-promoter interactions can bypass CTCF-mediated boundaries and contribute to phenotypic robustness. Nat. Genet. 10.1038/s41588-022-01295-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wang W, Ren G, Hong N, and Jin W (2019). Exploring the changing landscape of cell-to-cell variation after CTCF knockdown via single cell RNA-seq. BMC Genomics 20, 1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ren G, Jin W, Cui K, Rodrigez J, Hu G, Zhang Z, Larson DR, and Zhao K (2017). CTCF-Mediated Enhancer-Promoter Interaction Is a Critical Regulator of Cell-to-Cell Variation of Gene Expression. Mol. Cell 67, 1049–1058.e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Chiang M, Brackley CA, Naughton C, Nozawa R-S, Battaglia C, Marenduzzo D, and Gilbert N (2022). Gene structure heterogeneity drives transcription noise within human chromosomes. bioRxiv, 2022.06.09.495447. 10.1101/2022.06.09.495447. [DOI] [Google Scholar]
- 59.Cuartero S, Weiss FD, Dharmalingam G, Guo Y, Ing-Simmons E, Masella S, Robles-Rebollo I, Xiao X, Wang Y-F, Barozzi I, et al. (2018). Control of inducible gene expression links cohesin to hematopoietic progenitor self-renewal and differentiation. Nat. Immunol. 19, 932–941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Pauli A, Althoff F, Oliveira RA, Heidmann S, Schuldiner O, Lehner CF, Dickson BJ, and Nasmyth K (2008). Cell-Type-Specific TEV Protease Cleavage Reveals Cohesin Functions in Drosophila Neurons. Dev. Cell 14, 239–251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Pauli A, van Bemmel JG, Oliveira R. a., Itoh T, Shirahige K, van Steensel B, and Nasmyth K (2010). A direct role for cohesin in gene regulation and ecdysone response in Drosophila salivary glands. Curr. Biol. 20, 1787–1798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Cummings CT, and Rowley MJ (2022). Implications of Dosage Deficiencies in CTCF and Cohesin on Genome Organization, Gene Expression, and Human Neurodevelopment. Genes 13. 10.3390/genes13040583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Avagliano L, Parenti I, Grazioli P, Di Fede E, Parodi C, Mariani M, Kaiser FJ, Selicorni A, Gervasini C, and Massa V (2020). Chromatinopathies: A focus on Cornelia de Lange syndrome. Clin. Genet. 97, 3–11. [DOI] [PubMed] [Google Scholar]
- 64.Di Nardo M, Pallotta MM, and Musio A (2022). The multifaceted roles of cohesin in cancer. J. Exp. Clin. Cancer Res 41, 96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Bantignies F, Roure V, Comet I, Leblanc B, Schuettengruber B, Bonnet J, Tixier V, Mas A, and Cavalli G (2011). Polycomb-Dependent Regulatory Contacts between Distant Hox Loci in Drosophila. Cell 144, 214–226. [DOI] [PubMed] [Google Scholar]
- 66.Oksuz O, Narendra V, Lee CH, Descostes N, LeRoy G, Raviram R, Blumenberg L, Karch K, Rocha PP, Garcia BA, et al. (2018). Capturing the Onset of PRC2-Mediated Repressive Domain Formation. Mol. Cell 70, 1149–1162.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Kraft K, Yost KE, Murphy SE, Magg A, Long Y, Corces MR, Granja JM, Wittler L, Mundlos S, Cech TR, et al. (2022). Polycomb-mediated genome architecture enables long-range spreading of H3K27 methylation. Proc. Natl. Acad. Sci. U. S. A. 119, e2201883119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Conte M, Fiorillo L, Bianco S, Chiariello AM, Esposito A, and Nicodemi M (2020). Polymer physics indicates chromatin folding variability across single-cells results from state degeneracy in phase separation. Nat. Commun. 11, 3289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Barbieri M, Chotalia M, Fraser J, Lavitas L-M, Dostie J, Pombo A, and Nicodemi M (2012). Complexity of chromatin folding is captured by the strings and binders switch model. Proc. Natl. Acad. Sci. U. S. A. 109, 16173–16178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bonev B, Mendelson Cohen N, Szabo Q, Fritsch L, Papadopoulos GL, Lubling Y, Xu X, Lv X, Hugnot J-P, Tanay A, et al. (2017). Multiscale 3D Genome Rewiring during Mouse Neural Development. Cell 171, 557–572.e24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Mateo LJ, Sinnott-Armstrong N, and Boettiger AN (2021). Tracing DNA paths and RNA profiles in cultured cells and tissues with ORCA. Nat. Protoc. 16, 1647–1713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Rigano A, Ehmsen S, Öztürk SU, Ryan J, Balashov A, Hammer M, Kirli K, Boehm U, Brown CM, Bellve K, et al. (2021). Micro-Meta App: an interactive tool for collecting microscopy metadata based on community specifications. Nat. Methods 18, 1489–1495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Eastman P, Swails J, Chodera JD, McGibbon RT, Zhao Y, Beauchamp KA, Wang L-P, Simmonett AC, Harrigan MP, Stern CD, et al. (2017). OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLoS Comput. Biol. 13, e1005659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.A Conversation with Leonid Mirny (2017). Cold Spring Harb. Symp. Quant. Biol. 82, 403–405. [DOI] [PubMed] [Google Scholar]
- 75.Igoshin OA, Chen J, Xing J, Liu J, Elston TC, Grabe M, Kim KS, Nirody JA, Rangamani P, Sun SX, et al. (2019). Biophysics at the coffee shop: lessons learned working with George Oster. Mol. Biol. Cell 30, 1882–1889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Stringer C, Wang T, Michaelos M, and Pachitariu M (2021). Cellpose: a generalist algorithm for cellular segmentation. Nat. Methods 18, 100–106. [DOI] [PubMed] [Google Scholar]
- 77.Abdennur N, and Mirny LA (2020). Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics 36, 311–316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Arruda NL, Carico ZM, Justice M, Liu YF, Zhou J, Stefan HC, and Dowen JM (2020). Distinct and overlapping roles of STAG1 and STAG2 in cohesin localization and gene expression in embryonic stem cells. Epigenetics Chromatin 13, 32. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All processed ORCA data and unprocessed Western blot images generated in this study have been deposited and are publicly available as of the date of publication. Accession numbers and DOI are listed in the key resources table. The probe locations (Table S1) can also be viewed through the UCSC genome browser: https://genome.ucsc.edu/s/toniai/mm10_Hafner2023
All original code has been deposited at Github and Zenodo and is publicly available as of the date of publication. DOIs are listed in the key resources table.
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
Key resources table.
REAGENT or RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Antibodies | ||
Anti-Geminin antibody | Abcam | ab195047 |
Anti-RAD21 antibody | Abcam | ab154769 |
Anti-CTCF antibody | Abcam | ab128873 |
Anti-Actin antibody | Cell Signaling | 3700 |
Anti-HA antibody | Abcam | 9110 |
Anti-V5 antibody | Abcam | ab27671 |
Chemicals, peptides, and recombinant proteins | ||
Indole – 3 – acetic acid sodium salt (auxin analog) | Sigma-Aldrich | i2886 |
dTAG-13 | Fisher | Tocris 6605/5 |
TCO-PEG4-TFP Ester | Click Chemistry Tools | 1398–2 |
Methyltetrazine-PEG4-Amine | Click Chemistry Tools | 1012–100 |
Methyltetrazine-NHS Ester | Click Chemistry Tools | 1128–25 |
Deposited data | ||
ORCA processed data | This study | 4DN data portal: https://data.4dnucleome.org/A_Hafner_mESC_loop_stacking_chromatin_tracing |
Unprocessed Western blot images | This study | DOI:10.17632/gfk7nvgkcp.1 |
Hi-C (mESs control and RAD21 degraded cells) | Rhodes et al. 2020 | E-MTAB-7816 |
Hi-C (mESs control and CTCF degraded cells) | Nora et al. 2017 | GSE98671 |
CTCF ChIP-seq in mESCs | Bonev et al. 2017 | GSE96107 |
RAD21 ChIP-seq in mESCs | Arruda et al. 2020 | GSM4280494 |
Ring1B ChIP-seq in mESCs | Bonev et al. 2017 | GSE96107 |
Experimental models: Cell lines | ||
E14TG2a (referred to as E14) | Hooper et al. 1987 | |
E14TG2a CTCF-AID (clone) | Nora et al. 2017 | |
E14TG2a RAD21-AID (clone) | Szabo et al. 2020 | |
EED-dTAG and Ring1B-dTAG mES cells | Weber et al. 2021 | |
Oligonucleotides | ||
ORCA oligos | Table S2 | |
Cell barcode oligos (subset of ORCA barcodes with 3prime amine modification) | Table S2 | |
Software and algorithms | ||
ORCA spot calling analysis | Mateo et al. 2019, Mateo et al. 2020 | DOI:10.5281/zenodo7698979 |
Open2C polymer simulations | Imakaev et al., 2019 & This work | DOI: 10.5281/zenodo.7698987 & DOI:10.5281/zenodo.7761973 |
CellPose cell segmentation | Stringer et al., 2021 | |
Cooler | Abdennur and Mirny 2020 |