SUMMARY
The architectural protein CTCF plays a complex role in decoding the functional output of the genome. Guo et al. now show that the orientation of a CTCF site restricts its choice of interacting partner, thus creating a code that predicts the three-dimensional organization of the genome. We propose a DNA extrusion model to account for orientation-specific loop formation.
Keywords: Transcription, Chromatin, Nucleus, Cohesin, Enhancer
CTCF is a DNA-binding protein known to play a variety of roles in the regulation of transcription by forming loops in which distant elements of the genome are brought into spatial proximity within the nucleus (Ong and Corces, 2014). The formation of these loops is believed to involve homodimerization of the CTCF proteins bound to their bases. By mediating contacts between distant sequences, CTCF regulates enhancer-promoter interactions throughout the genome and appears to play a key role in the formation of Topologically Associating Domains (TADs) (Nora et al., 2012). Analysis of genome-wide interaction data obtained by Hi-C suggests that CTCF-mediated contacts occur much more frequently when the binding sites for this protein are present in the convergent forward and reverse orientations (Rao et al., 2014). Interactions between binding sites arranged in the same forward-forward or reverse-reverse orientation still occur, although less frequently, and interactions between CTCF sites in a divergent reverse-forward orientation rarely take place. In this issue, Guo et al. (Guo et al., 2015) carry out a detailed functional analysis of the role of CTCF binding site orientation in the regulation of enhancer-promoter choice underlying stochastic expression of specific protocadherin isoforms.
The protocadherin genes are subject to alternative splicing, and each variable exon contains an upstream promoter, transcription from which depends on interaction with a downstream enhancer via DNA looping. Each variable exon and enhancer has a CTCF binding site. Guo et al. noticed that the CTCF binding sites that form loops between promoters and enhancers are arranged in a convergent orientation. Using the CRISPR-Cas9 genome editing system they create inversions of key CTCF binding sites, switching their orientation. The authors then use 4C to show that the inverted CTCF binding sites now have an inverted interaction bias. This confirms the causal relationship between DNA binding site orientation and the direction of looping. Furthermore, the change in looping directionality is accompanied by changes in transcription, indicating a functional role for the CTCF mediated interactions in regulating gene expression.
The authors then expand their investigation to the entire genome using published CTCF ChIA-PET data. They find the same orientation bias in interactions between CTCF sites as previously shown with Hi-C data. These observations solidify what now appears to be one of the underlying principles by which the orientation of the DNA sequence in CTCF binding sites shapes 3D genome organization. However, this new finding raises a series of questions as to the mechanisms underlying the specificity of interactions between CTCF sites in the genome. CTCF binding sites in divergent and convergent orientations are molecularly identical and impossible to distinguish outside of the larger context of the DNA molecule. Figure 1A shows two theoretical CTCF mediated loops. The only difference between the two loops is which side of the CTCF sites the looped-out DNA is on. Despite this, the loop depicted on the left occurs much more frequently than the loop depicted on the right. This means that the mechanism by which CTCF forms loops must be aware of this context and be capable of discriminating between CTCF sites in convergent and divergent orientations. A simplistic model of loop formation that relies on random collisions in the nuclear space between CTCF bound to DNA in different orientations to form interactions is incompatible with the observations, as it could not be aware of the relative positions or orientations of the CTCF binding sites.
Figure 1. Model of Orientation Biased CTCF Looping.
(A) CTCF mediated loops in convergent and divergent orientations only differ in how they are connected by the DNA. The loop on the left occurs much more frequently than the loop on the right, suggesting the mechanism of loop formation must be able to distinguish the two cases.
(B) A loop-extrusion model would explain the orientation bias seen in CTCF mediated looping. CTCF bends DNA and could be capable of forming a loop on one side of its binding site only, due to the manner in which the DNA is bent. This loop could then be expanded in one direction via the action of cohesin and possibly also transcription, causing the CTCF site to contact other DNA elements such as other CTCF sites, cohesin-associated Mediator complexes, and cohesin-associated gene promoters more frequently in one orientation. Homodimerization of CTCF complexes in anti-parallel orientations may not be favored, leading to continued, rather than completed loop formation when two CTCF binding sites encounter each other during loop extrusions, accounting for the paucity of these interactions observed in genome interaction data.
One potential explanation for the directionality in loop formation is that the bias is created by the binding of CTCF to its recognition site, which causes a ninety degree bending in the DNA, resulting in the formation of an unusual, oriented structure that could be interpreted as a loop (MacPherson and Sadowski, 2010). As this DNA structure is formed in the same orientation as the bias in looping it seems likely that the two phenomena are causally linked. Several potential processes could then contribute to the expansion of the initial loop (Figure 1 B). Since one end of the loop would be defined by CTCF binding, cohesin, which frequently co-binds with CTCF, might function to translocate DNA on the other side of the CTCF-induced “kink” to expand the loop. This is supported by results showing that cohesin is able to extrude a loop, perhaps using energy from its ATPase activity (Alipour and Marko, 2012; Strick et al., 2004). Transcriptional activity could also contribute to the cohesin-based translocation of the DNA into the loop (Lengronne et al., 2004). The observed frequency of interactions between CTCF sites with the same orientation is relatively low (Guo et al., 2015). Perhaps as two sites with the same orientation encounter each other during loop extrusion, the anti-parallel orientations of the CTCF proteins disfavor dimerization, and loop extrusion would continue until a convergent site was met (Figure 1B). In addition, the directionality imposed by this DNA bending-initiated loop extrusion model results in a CTCF site interacting more frequently with the DNA on one side of it, explaining why divergent CTCF sites interact very infrequently (Guo et al., 2015; Rao et al., 2014). This would also explain the finding that TAD boundaries, i.e, the generally “non-looped” stretch of DNA between two TADs, are enriched in CTCF sites arranged in divergent orientations (Vietri Rudan et al., 2015; Gou et al., 2015), since these border-associated divergent sites will tend to loop towards the interior of each adjacent TAD.. This finding helps explain why only a subset of CTCF sites in the genome are able to form these boundaries, and reinforces the functional relevance of CTCF to the formation of TADs. Finally, the loop extrusion model also imposes directionality on the interactions between CTCF and transcriptional complexes and/or gene promoters (Phillips-Cremins et al., 2013). The plethora of genome-probing tools that are constantly emerging should allow rigorous experimental testing of this model, stimulated by the results of Guo et al. and others, in the near future.
References
- Alipour E, Marko JF. Self-organization of domain structures by DNA-loop-extruding enzymes. Nucleic acids research. 2012;40:11202–11212. doi: 10.1093/nar/gks925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo Y, Xu Q, Canzio D, Shou J, Li J, Gorkin DU, Jung I, Wu H, Zhai Y, Tang Y, et al. CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function. Cell. 2015 doi: 10.1016/j.cell.2015.07.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lengronne A, Katou Y, Mori S, Yokobayashi S, Kelly GP, Itoh T, Watanabe Y, Shirahige K, Uhlmann F. Cohesin relocation from sites of chromosomal loading to places of convergent transcription. Nature. 2004;430:573–578. doi: 10.1038/nature02742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacPherson MJ, Sadowski PD. The CTCF insulator protein forms an unusual DNA structure. BMC molecular biology. 2010;11:101. doi: 10.1186/1471-2199-11-101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nora EP, Lajoie BR, Schulz EG, Giorgetti L, Okamoto I, Servant N, Piolot T, van Berkum NL, Meisig J, Sedat J, et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485:381–385. doi: 10.1038/nature11049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ong CT, Corces VG. CTCF: an architectural protein bridging genome topology and function. Nature reviews Genetics. 2014;15:234–246. doi: 10.1038/nrg3663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phillips-Cremins JE, Sauria ME, Sanyal A, Gerasimova TI, Lajoie BR, Bell JS, Ong CT, Hookway TA, Guo C, Sun Y, et al. Architectural protein subclasses shape 3D organization of genomes during lineage commitment. Cell. 2013;153:1281–1295. doi: 10.1016/j.cell.2013.04.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rao SS, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, Sanborn AL, Machol I, Omer AD, Lander ES, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strick TR, Kawaguchi T, Hirano T. Real-time detection of single-molecule DNA compaction by condensin I. Current biology: CB. 2004;14:874–880. doi: 10.1016/j.cub.2004.04.038. [DOI] [PubMed] [Google Scholar]
- Vietri Rudan M, Barrington C, Henderson S, Ernst C, Odom DT, Tanay A, Hadjur S. Comparative Hi-C reveals that CTCF underlies evolution of chromosomal domain architecture. Cell reports. 2015;10:1297–1309. doi: 10.1016/j.celrep.2015.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

