Abstract
Gene expression programs determine cell fate in embryonic development and their dysregulation results in disease. Transcription factors (TFs) control gene expression by binding to enhancers, but how TFs select and activate their target enhancers is still unclear. HOX TFs share conserved homeodomains with highly similar sequence recognition properties, yet they impart the identity of different animal body parts. To understand how HOX TFs control their specific transcriptional programs in vivo, we compared HOXA2 and HOXA3 binding profiles in the mouse embryo. HOXA2 and HOXA3 directly cooperate with TALE TFs and selectively target different subsets of a broad TALE chromatin platform. Binding of HOX and tissue-specific TFs convert low affinity TALE binding into high confidence, tissue-specific binding events, which bear the mark of active enhancers. We propose that HOX paralogs, alone and in combination with tissue-specific TFs, generate tissue-specific transcriptional outputs by modulating the activity of TALE TFs at selected enhancers.
Author summary
The different anatomical features that develop along the animal body axis are determined by HOX transcription factors. Each member of the HOX family appears in a well-defined territory of the developing embryo, and instruct a specific gene expression program. In stark contrast to their specificity in vivo, which is central to developmental programs, HOX transcription factors display highly similar sequence recognition properties in vitro. To understand how HOX transcription factors control their specific transcriptional programs in vivo, we compared HOXA2 and HOXA3 binding profiles in the mouse embryo. We find that HOXA2 and HOXA3 occupy a large set of high-confidence, non-overlapping genomic regions, that are also bound by three aminoacid loop extension (TALE) transcription factors. We identify three main determinants of HOX paralog-selective binding: recognition of unique variants of the HOX-PBX motif, differential affinity at shared HOX-PBX motifs and, additionally, contribution of tissue-specific transcription factors. These mechanisms result in high-confidence, cooperative HOX-TALE binding at different genomic locations, which bear the mark of active enhancers. We propose that HOX paralogs operate, alone and in concert with tissue-specific transcription factors, to switch on TALE function at selected enhancers.
Introduction
Gene expression programs instruct and maintain cell fate in embryonic development and adult tissue homeostasis. Transcription factors (TFs) control gene expression by binding to enhancers [1,2]. However, we still have no clear idea of how TFs select their precise sets of target enhancers. While TFs contain DNA binding domains which recognize DNA in a sequence-specific manner, these interactions are typically insufficient to direct a TF to its functional targets.
Transcriptional regulation is mediated by TFs working together, rather than in isolation. The widespread occurrence of collaborative TF binding is imposed by chromatin. A single TF cannot easily compete with nucleosomes to access DNA, but multiple TFs that recognize closely spaced binding sites can effectively displace nucleosomes and indirectly facilitate each other’s binding [3,4]. Such indirect cooperativity can also result in TFs recognizing low affinity sites, i.e. sites that deviate from their optimal consensus in vitro [5]. Recent observations indicate that TF cooperativity does not end at binding enhancers: clusters of enhancer-bound TFs concentrate co-activators and other nuclear factors via dynamic fuzzy interactions, driven by their intrinsically disordered regions (IDRs). IDRs function in molecular recognition and mediate the interaction with a diversity of regulatory proteins [6,7] to promote the liquid-liquid phase transition associated with gene activation [8]. Thus, the formation, on DNA segments, of regulatory complexes made of different combinations of factors, is key to activation of gene expression. These distinct combinations of TFs produce virtually inexhaustible flavours of gene expression and cell fate [1].
HOX TFs provide an ideal model to explain how TFs select their target enhancers to direct specific transcriptional programs in vivo. They contain a homeodomain (HD), a highly conserved DNA binding moiety shared by hundreds of TFs [9,10]. HD display highly similar sequence recognition properties and bind the same core of four-base-pair sequence TAAT [11], yet HOX TFs function to establish the identity of entirely different body parts along the antero–posterior axis of all bilaterian animals [12,13]. In mammals, there are 39 Hox genes, classified into anterior (HOX1-2), central (HOX3–8), and posterior (HOX 9–13) paralog groups [14]. HOX paralogs occupy sequential positions along the chromosome, which are faithfully maintained across evolution [15]. This translates into precise HOX expression codes at different levels of the antero-posterior axis, conferring specific spatial and temporal coordinates to each cell.
HOX association with three amino acid loop extension (TALE) HD TFs PBX, and PBX partner MEIS, is a widely accepted mechanism underlying HOX target specificity [10,16,17]. HOX-TALE cooperativity increases the affinity and sequence selectivity of HOX TFs in vitro [16]. In vivo, HOXA2 extensively binds with TALE TFs [18] and Ubx and Hth (fly homologs to vertebrate central HOX and MEIS respectively) co-localize in active nuclear microenvironments, suggesting that their interaction may be critical to trigger phase separation [19]. Interestingly, Hox binding selectivity can be observed in the absence of TALE TFs, and is strongly associated with chromatin accessibility [20]. Although the concept of HOX and TALE interaction is long established, we still understand relatively little about the extent and functional significance of HOX-TALE association in vivo, where compaction of DNA into chromatin and the distribution of sequence-specific TFs (cell-specific and tissue-specific, but also ubiquitous) can considerably affect TF binding to DNA. Also, how the association with fairly ubiquitous proteins eventually translates into HOX paralog-specific transcriptional outputs in vivo, remains unclear.
To understand how HOX TFs execute their specific functions to impart different segmental identity in vivo, we compared binding of HOXA2 and HOXA3, an anterior and a central HOX proteins, in the physiological tissues where these TFs are active. Branchial arches (BA) are blocks of embryonic tissues that merge to form the face and the neck in vertebrates. The second and third branchial arch (BA2 and BA3) are the main domains of HOXA2 and HOXA3 expression respectively, and the embryonic areas most affected by inactivation of Hoxa2 and Hoxa3 in mouse [21–23]. We find that, in the BAs, HOXA2 and HOXA3 occupy a large set of high-confidence, non-overlapping genomic regions, that are also bound by TALE TFs. We identify three main determinants of HOX paralog-selective binding, resulting in high-confidence cooperative HOX-TALE binding at different genomic locations: recognition of unique variants of the HOX-PBX motif, differential affinity at shared HOX-PBX motifs and, additional contribution of tissue-specific TFs. We propose that HOX paralogs operate, alone and in concert with tissue-specific TFs, to switch on TALE function at selected enhancers.
Results
HOXA2 and HOXA3 control diverse processes by targeting different regions of the genome
HOX TFs direct highly specific gene expression programs in vivo, but recognize very similar DNA sequences in vitro. However, it remains to be determined if HOX specificity of action reflects specificity of binding across the genome in vivo, i.e. the binding of paralog HOX TFs to distinct target regions. To establish this, we compared HOXA2 and HOXA3 binding profiles in their physiological domains of expression in the mouse embryo. BAs display an antero-posterior gradient of HOX expression, which replicates Hox gene positions on the chromosome (Fig 1A and 1B): BA1 does not express any Hox gene, BA2 expresses Hox2 paralogs, BA3 Hox3 paralogs, etc. We previously characterized HOXA2 binding in BA2 [18]; here, we profiled HOXA3 binding in BA3-4-6 (hereafter referred to as posterior branchial arches, PBA), the embryonic tissues immediately posterior to the BA2 (identified by the expression of Hox paralogs 3–5, Fig 1A and 1B). Using a HOXA3-specific antibody (S1A Fig), we identified 848 peaks with fold enrichment (FE) ≥10, which largely contained a second biological replicate (S1B Fig; S1 Table). TALE TFs (PBX and MEIS) display cooperative binding with HOX and increase HOX binding specificity in vitro [16]. De novo motif discovery [24] identified HOX-PBX recognition sequence as the top enriched motif in HOXA3 peaks and uncovered MEIS binding site in the top three sequence motifs (S1C Fig). HOXA3 recognition sites in PBA correspond to HOXA2 motifs in BA2 (S1D and S1E Fig); moreover, the frequency of HOX-PBX motifs is comparable across HOXA2 and HOXA3 peaks (S1F Fig). HOX peaks without canonical HOX-PBX consensus motifs, contain similar (i.e. 1nt mismatch), potential low affinity variants of HOX-PBX sites (S1F Fig). The occurrence of high affinity sites (perfect matches) positively correlates with peak FE, and is highest in top HOXA2 and HOXA3 peaks. Low affinity sites (1 mismatch allowed in TGATNNAT) show the opposite trend and occur with higher frequency in lower confidence binding events (S1G Fig).
We overlapped HOXA2 binding in BA2 (S2 Table) with HOXA3 binding in PBA. About half of HOXA3 peaks are contained in the larger HOXA2 datasets (Fig 1C and 1D). When comparing the same number of peaks for both datasets, ranked by FE, we observed an increasing overlap at lower confidence peaks (Fig 1E), suggesting that HOXA2 and HOXA3 select different sites when binding with higher affinity and are more promiscuous at lower binding levels. Functional association of HOXA3-specific peaks in PBA and HOXA2-specific peaks in BA2 [25] (Fig 1F and 1G) highlights distinct biological processes and mouse phenotypes, including abnormal middle ear, sphenoid, temporal and squamosal bone morphologies, whose morphogenesis is controlled by HOXA2 [21,22]. Remarkably, genes dysregulated in the HOXA2 mutant [26] are significantly represented in the top biological processes and mouse phenotypes associated with HOXA2 peaks (S1H Fig). In contrast HOXA3-specific binding is almost exclusively associated with heart and cardiac muscle development and cardiovascular phenotypes, consistent with the role of HOXA3 in the formation of the main arteries [23,27] (Fig 1F). These observations are in line with HOX functional specificity and indicate that in their physiological domains of expression, HOXA2 and HOXA3 bind in the vicinity of, and potentially control, genes involved in very different processes. Hoxa2 expression displays a sharp anterior border between BA1 and BA2 and expands in the more posterior PBA (Fig 1A, Fig 4A). We profiled HOXA2 binding in PBA to understand if HOX-specific binding is determined by differences in the BA2 and PBA chromatin environment. We found that HOXA2 peaks in PBA very rarely overlap with HOXA3 ‘only’ peaks in the same tissue (1% overlap), but are largely contained in the pool of HOXA2-specific binding in BA2 and ‘common’ HOXA2 and HOXA3 binding events (Fig 1H). This, combined with the observation that HOXA2 and HOXA3 are largely co-expressed in the same cells in PBA and therefore exposed to the same chromatin environment [28] (see also Fig 4, S1A and S1B Fig) argues against differences in chromatin accessibility being a main determinant of HOX binding. In sum, analysis of HOXA2 and HOXA3 ChIP-seq in their respective domains of expression indicates that different HOX TFs control diverse and specific processes by targeting different regions of the genome in vivo. Tissue-specific chromatin accessibility does not appear to be a major determinant in HOX paralogs’ target site selection.
HOXA2 and HOXA3 select variants of the HOX/PBX motif
The observations above indicate that HOXA2 and HOXA3 select different genomic sites in vivo, while at a first glance, they recognize very similar DNA sequences. To investigate the determinants of HOX binding specificity, we focused on high confidence HOXA2 and HOXA3 peaks (top 250, ranked by FE), which display the lowest overlap across the genome (Fig 1E). As expected, both HOXA2 and HOXA3 top peaks were enriched in the HOX/PBX motif (present in 66.4% and 68.4% of HOXA2 and HOXA3 respectively). De novo motif discovery identified enrichment of a HOX-PBX variant in HOXA3 top 250 peaks, which contains a C in the second variable position (i.e. TGATNCAT) (Fig 2A). We next counted the distribution of all permutations of the TGATNNAT motif in top HOXA2 and HOXA3 peaks and found the TGATTCAT variant to be highly differentially enriched in HOXA3 peaks (Fig 2B). This sequence, which is highly represented in HOXA3 top peaks (~ 20%), is almost excluded from HOXA2 peaks (Fig 2B). Supporting functional significance, HOXA3 peaks containing TGATTCAT display increased acetylation levels (a mark of active enhancers) [29] in HOXA3-expressing tissues (Fig 2C). In addition, while HOXA2 peaks display a very high representation of TGATGGAT and TGATTGAT, HOXA3 high confidence binding allows higher variability (four variants are counted > 20 times in HOXA3 peaks as opposed to only two variants in top HOXA2 peaks) (Fig 2B). The highest differential enrichment of TGATNNAT variants is observed in top HOXA2 and HOXA3 peaks (S2A Fig), which also display minimal overlap across the genome (Fig 1E); this suggests that the ability to recognize different sequences plays a role in genomic site selections. Finally, the majority of HOXA3 (158/250) and HOXA2 (160/250) top peaks contain MEIS recognition motif, at a preferential distance of less than 20 nt from the TGATNNAT motif (S2B Fig). The Sulf2 locus exemplifies HOXA3 specific binding in PBA: HOXA3 peak summit contains a single TGATTCAT motif and displays high HOXA3 occupancy, but no detectable HOXA2 binding (Fig 2D and 2E). We used electrophoretic mobility shift assay (EMSA) to establish if HOXA3 preferentially recognizes the TGATTCAT sequence in vitro. We did not observe any HOXA2 or HOXA3 binding to the Sulf2 probe (Fig 2F). Incubation with PBX and MEIS resulted in a probe shift. Addition of HOXA3, but not HOXA2, resulted in the formation of a ternary complex, indicating that HOXA3 can bind this site in combination with PBX and MEIS, while HOXA2 cannot (Fig 2F). In support of this conclusion, converting TGATTCAT to TGATTGAT (a single nucleotide substitution in the Sulf2 probe), enables binding of HOXA2, in addition to HOXA3 (Fig 2G, S2C Fig). Finally, we observed no synergy between HOXA3, PBX and MEIS on the Sulf2 enhancer in transfection assays (S2D Fig), suggesting that additional TFs could function with HOXA3 at this site; of note this HOXA3-specific site closely resembles AP-1 TF consensus sequence [24]. These results indicate that HOXA3 and HOXA2 have diverse binding preferences and uncover the existence of sites that are exclusively recognized by HOXA3.
HOXA2 molecular control of BA2 identity
In contrast to HOXA3, which displays unique binding preferences for TGATTCAT, we did not detect HOX-PBX variants exclusively recognized by HOXA2. To investigate the mechanisms underlying HOXA2 control of BA2 identity, we examined HOXA2 binding events (top peaks) in the vicinity of well-established HOXA2 downstream targets. Meis2 and Zfp703 are associated with high levels of HOXA2 binding [18] (Fig 3A and S3A Fig) and are downregulated in Hoxa2 null BA2 [26]. In addition, consistent with Meis2 and Zfp703 expression being HOXA2-dependent, they are expressed at higher levels in BA2 than the HOX-less BA1 and the HOXA3-positive PBA (Fig 3B) [18,30]. Meis2 and Zfp703 loci exhibit high HOXA2 and HOXA3 binding in their vicinity, suggesting their associated chromatin is largely accessible in both BA2 and PBA (Fig 3A and S3A Fig). The sequence contained in Meis2 peak is active in the main domains of HOXA2 expression, the hindbrain and BAs in zebrafish (Fig 3C). When tested in luciferase assays in NIH3T3 cells, the Meis2 functional enhancer displays higher activity in the presence of HOXA2, in combination with MEIS and PBX, relative to HOXA3 (Fig 3D). Meis2 enhancer activity is largely dependent on the integrity of its HOX-PBX site (Fig 3D). Adding TALE to the Zfp703 putative enhancer also significantly increased the difference in HOXA2 and HOXA3 activation abilities (S3B Fig). However, adding HOXA2 and HOXA3 alone resulted in higher activation of the Zfp703 putative enhancer relative to Meis2 enhancer (presumably due to the presence of a TAAT site within the HOX/PBX motif) and including TALE did not significantly change HOXA2 activation. Similar to the Meis2 enhancer, disruption of the HOX/PBX site significantly decreased luciferase activity (S3B and S3C Fig). Finally, HOXD3, another representative of HOX paralog group 3, whose members have largely overlapping functions in vivo [27], also displayed a lower activating capacity than HOXA2 on Meis2 enhancer (S3D Fig). In sum, HOXA2 is more efficient at activating both target regions, in the presence of PBX and MEIS. To understand if this reflects HOXA2 and HOXA3 different DNA binding properties, we generated HOX chimeric proteins by swapping HOXA2 and HOXA3 DNA-binding HDs. We found that providing HOXA2 with HOXA3 HD did not substantially change the ability of HOXA2 to activate transcription from the Meis2 enhancer (Fig 3E). Similarly, the ability of HOXA3 to transactivate the Meis2 and Zfp703 enhancers, alone or in complex with MEIS and PBX, was not improved by swapping HOXA3 HD with HOXA2 HD (Fig 3E and S3B Fig). As HOX TFs cooperate with MEIS and PBX to activate target enhancers and activation relies on the presence of an intact HOX/PBX recognition motif, HOXA2 and HOXA3 diverse activation properties may depend on their respective abilities to interact with PBX and MEIS on DNA. On their own, HOXA2 and HOXA3 weakly bind the Meis2 enhancer, but interact with PBX and MEIS to form a ternary protein complex on DNA (Fig 3F, see also S3E and S3F Fig). A larger fraction of MEIS-PBX complex is bound by HOXA2, while addition of HOXA3 results in a less robust super-shift of the dimeric band (Fig 3F and S3G Fig). To confirm these observations, we co-translated HOXA2 and HOXA3 at different ratios and found that, at all concentrations tested, HOXA2 forms a stronger trimeric complex on the Meis2 probe, relative to HOXA3 at an equivalent concentration (Fig 3G and S3H Fig). Concomitantly, HOXA2 shifts more of the dimeric band (PBX/MEIS) into a trimeric complex (Fig 3G and S3I Fig). We did not detect any slower complexes, suggesting that this DNA sequence can only bind HOXA2 or HOXA3 at one time (Fig 3G and S3J Fig). We observed the same binding patterns using HOX chimeras: swapping HOXA3-HD with HOXA2-HD did not improve the ability of HOXA3 to form a ternary complex with PBX and MEIS, and did not affect HOXA2 ability to bind DNA in complex with MEIS and PBX (Fig 3H, S3G Fig). Next, we investigated the contribution of HOX-PBX and MEIS recognition motifs to HOX and TALE binding to DNA. We systematically introduced mutations in individual sites and tested the resulting probes in EMSA. All mutations strongly reduced the formation of PBX-MEIS dimers (Fig 3I and S3K Fig). We only detected formation of HOX-MEIS-PBX trimeric complexes when one of the two MEIS recognition motifs was still intact, and even in this case complex formation was strongly decreased (S3K Fig), highlighting an essential role of MEIS for HOX binding to DNA. Together, these results indicate that the differential ability of HOXA2 and HOXA3 to bind DNA and activate transcription does not depend on HOX-DNA binary binding. Rather, it reflects differential abilities to form functional HOX-TALE complexes on DNA and is encoded by residues outside the HOXA2 and HOXA3 HD. In summary, while HOXA2 does not exclusively access its sites (HOXA3 can bind as well, Fig 3A), HOXA2 binds more efficiently with TALE at these sites, leading to increased transcriptional activation. Consistently, shared high-confidence HOXA2 and HOXA3 binding events are largely associated with genes expressed at higher levels in the BA2 (Fig 3J). Thus, at least in part, HOXA2 instructs the formation of a BA2 by raising the expression levels of HOX-regulated genes. Crucially, among these genes is Meis2, which encodes a critical component for BA2 identity [18].
HOXA2 activity is decreased in PBA
The above results show that, on its target enhancers, HOXA2 functions more efficiently with TALE relative to HOXA3. Given that HOXA2 is expressed in both the BA2 and in the PBA, why does HOXA2 not instruct a BA2-specific program in the PBA as well? More posterior Hox genes are typically able to repress the expression (and suppress the function) of more anterior genes, a process termed ‘posterior prevalence’ [15]. Indeed, Hoxa2 highest expression is detected in the BA2, while Hoxa2 is expressed at lower levels in Hoxa3 main domain of expression, the BA3 (Fig 4A and 4B and Fig 1B). A lower nuclear concentration of HOXA2 is expected to reduce HOXA2 binding to DNA (Fig 3G and S3H Fig). However, to assess how changes in HOXA2 dose affect binding genome-wide, we compared HOXA2 binding in BA2 and in PBA. While HOXA2 binds similar locations in BA2 and PBA (Fig 1H), HOXA2 binding levels are significantly higher in BA2 (Fig 4C, see also S3A Fig). This is further confirmed by quantitative analysis of selected immunoprecipitated regions (Fig 4D). Relative to the BA2, the PBA display lower levels of HOXA2 and also express HOXA3 (Fig 1B). Indeed single cell analysis of E10.5 post-otic cranial neural crest, which includes the PBA, [28] (S4A and S4B Fig) confirms that HOXA2 and HOXA3 are largely expressed in the same cells. Therefore, we investigated the effect of decreasing HOXA2 levels while increasing HOXA3 levels on HOXA2 target enhancers. Using EMSA, we found that at lower HOXA2 levels, the addition of HOXA3 significantly reduced binding of HOXA2 to the Meis2 probe, together with PBX/MEIS (Fig 4E, S4C Fig). Similarly co-expressing HOXA2 and HOXA3 in NIH3T3 cells reduced activation of HOXA2 target enhancers in vitro (Fig 4F); for Meis2, halving the amount of HOXA2 (50% HOXA2) resulted in higher activation compared to doubling the levels of HOX proteins (50% HOXA2 + 50% HOXA3). While these observations are entirely consistent with the observed lower levels of HOXA2 targets in PBA (Meis2, Zfp703), it is unknown if expression of these genes in the PBA is still dependent on HOXA2 or HOX paralog 3. Provided expression of Meis2, Zfp703 is required for correct PBA development, as this area does not display any phenotype in the absence of HOXA2 [21,22], either HOXA2 is not required for their expression, or its function can be fully compensated by HOX paralog 3. In conclusion, both a lower dose of HOXA2 and the presence of HOXA3 (possibly other paralog 3 as well) decrease HOXA2 binding and activating abilities in the PBA. This effect, combined with the lower efficiency of HOXA3 to activate HOXA2 targets, dampens HOXA2 transcriptional program in the PBA.
HOX cooperates with MEIS
Our results indicate that HOX selectivity is displayed in concert with TALE. Generally, binding with TALE appears to be a dominant feature of HOX binding in the BAs. HOX peaks are enriched in HOX-PBX and MEIS motifs and similar to HOXA2 in BA2 [18], HOXA3 peaks overlap almost entirely with MEIS and PBX peaks in the same embryonic tissue at the same stage (Fig 5A, S5A Fig). We previously discovered that HOXA2 switches its transcriptional program by increasing binding of MEIS TFs to potentially lower-affinity sites across the genome [18]. We investigated if HOXA3 can similarly increase MEIS binding levels. The fraction of MEIS peaks that overlaps HOXA3 binding displays higher FE in PBA, relative to the HOX-free BA1 (Fig 5B). Hoxa2 is also expressed in PBA, where it could be entirely responsible for the observed increase in MEIS binding. Therefore, to assess HOXA3 unique contribution to MEIS binding increase, we extracted HOXA3-specific binding. We found that MEIS peaks in PBA that overlap HOXA3 ‘exclusive’ peaks, display higher FE (relative to MEIS non-overlapping HOX), indicating that HOXA3 also increases binding of MEIS (Fig 5C), similar to HOXA2 in BA2 [18]. Reciprocally, co-occupancy with MEIS enhances HOXA3 binding (Fig 5D). Both HOXA2 and HOXA3 interact with MEIS1 and MEIS2 in co-precipitation experiments (Fig 5E). Cooperativity, an inherent feature of TF binding to DNA, can arise by protein-protein interactions (direct cooperativity) or by collective binding of TFs to nearby sequences and nucleosome displacement (indirect cooperativity) [31]. The observations that HOX and MEIS display higher binding levels, when binding chromatin together, and interact to form a complex, identify direct cooperativity as the mechanism underlying HOX-TALE binding to chromatin. Cooperativity with MEIS appears to be a general operational principle of HOX TFs as, similar to HOXA2 and HOXA3, MEIS co-occupancy with HOXA1 and HOXA9 is associated with the highest MEIS binding levels in mouse embryonic stem cells [32] and bone marrow cells [33] respectively (S5B–S5D Fig). In sum, HOX directly cooperate with TALE on chromatin. HOXA2 and HOXA3 diverse sequence preferences and binding affinities predict that HOX paralogs preferentially cooperate with distinct subsets of TALE binding events.
MEIS ‘ubiquitous’ binding is converted into tissue-specific enhancer activity
MEIS TFs bind broadly and to largely overlapping locations across different BAs (Fig 6A). In contrast to MEIS ubiquitous occupancy, MEIS binding levels are BA-specific (Fig 6B and 6C) and are associated with distinct biological processes (S6A Fig). We systematically extracted MEIS differential binding across the BAs (S7 Fig) and found that higher MEIS binding levels in a tissue are highly predictive of increased enhancer activity in the same tissue (Fig 6D). Moreover, consistent with MEIS positive effects on transcription [34], genes preferentially expressed in one tissue display higher MEIS binding levels in their vicinity, in a pattern mirroring changes in H3K27Ac across the BAs (Fig 6E). As high MEIS binding levels are tissue-specific and linked to transcriptional activation, we next asked what causes differences in MEIS binding levels across the BAs. Consistent with the finding that HOXA2 enhances MEIS occupancy at selected sites across the genome to activate gene expression [18], regions occupied by HOXA2 in BA2, and by HOXA3 in PBA, display higher enhancer activity if associated with higher, differential MEIS binding levels in the corresponding tissue (Fig 6F). Likewise high MEIS peaks in HOX-positive areas (BA2 and PBA) are highly enriched in sequence features matching HOX-PBX motif [35]. However, HOX co-occupancy only partly accounts for high levels of MEIS binding and acetylation in BA2 and PBAs (S6B Fig), suggesting additional TFs may be involved. TF occupancy in vivo occurs in the context of chromatin, which acts as a barrier to TF binding. Multiple TFs that recognize closely spaced binding sites can more effectively compete with nucleosomes and indirectly facilitate each other’s binding. We reasoned that this indirect form of cooperativity between TFs could enhance MEIS binding, resulting in higher MEIS peaks. Therefore, to identify BA-specific TFs that facilitate MEIS binding, we searched for enriched sequence features in MEIS differential binding. CNN models identify enrichment of distinct signature motifs in MEIS differential peaks (Fig 6G), which reflect the distribution of TFs across the BAs (Fig 6H). In support of tissue-specific TFs contributing to MEIS binding levels, GATA6 occupancy in the PBA [30] overlays MEIS peaks, higher in the same tissue (S6C Fig). These observations suggest that binding of tissue-specific TFs, in addition to HOX, can locally increase MEIS binding levels.
We noticed that a set of distinctive sequence features of MEIS differential binding in PBA [NKX (HD) and FOX (Forkhead) motifs] and BA2 [(basic helix-loop-helix (bHLH) recognition sites] (Fig 6I) are also enriched in HOXA3- and HOXA2-specific peaks, respectively, suggesting that HOX and tissue-specific TFs may collaborate in binding with TALE. We decided to focus on FOX TFs, as most Fox genes are expressed at higher levels in PBA than BA2 (Fig 6H). Consistent with the three factors cooperating on chromatin, HOX and FOX recognition sites co-occur in the same differential MEIS peaks (S6D and S6E Fig). The higher levels of FOX TFs in the PBA, relative to BA2, predict FOX TFs to have stronger effects on HOXA3 and MEIS binding in PBA. CNN can be trained to predict binding intensities merely based on DNA nucleotide sequence, and the learned model can be used to predict changes in binding intensities upon random dinucleotides mutations in TF binding motifs (in silico mutagenesis). Indeed, in silico mutagenesis predicts mutations in FOX TF recognition sites to affect binding of both HOXA3 and MEIS in PBA, but not HOXA2 and MEIS in BA2 (Fig 6J, S6F Fig). In contrast, mutagenesis of GATA motifs (enriched in MEIS differential peaks, but not in HOX peaks) does not appear to affect HOX-MEIS binding (Fig 6J). Synergistic effects of FOX and HOX have been identified in the mouse central nervous system [36]. In luciferase assays, addition of FOXC1, HOX and MEIS/PBX increases transcriptional activation driven by the Sfrp2 distal region (co-occupied by HOX and FOXC1 in the BAs) [18] (Fig 6K). While naked DNA in transfection assays does not recapitulate TF cooperativity on chromatin, these results imply that TALE FOX and HOX TFs bind their predicted binding sites and together, contribute to transcriptional activation. Moreover, the presence of FOXC1 is sufficient to enhance MEIS/HOX transcriptional activation of Sfrp2 enhancer, suggesting that cooperation between these TFs could partly compensate for lack of PBX (Fig 6K).
These results identify (direct or indirect) cooperativity with tissue-specific TFs as an additional mechanism for HOX selectivity. By enhancing MEIS binding (residence time on chromatin), tissue-specific TFs also contribute to promote the stability of HOX-MEIS complexes in specific tissues. The distribution of tissue-specific motifs in HOX peaks suggests this mechanism accounts for HOX selectivity at a subset of enhancers, rather than being a general mechanism of HOX enhancer selection (S6G Fig). We propose that HOX and tissue-specific TFs (alone and in combination) increase TALE TF binding affinity and residence time at selected locations, identified using their sequence recognition motifs. As high levels of MEIS are associated with increased enhancer activity and gene expression, longer MEIS residence time at selected locations may translate into BA-specific transcriptional outputs.
Discussion
HOX TFs contain a HD, which display highly similar sequence recognition properties and is shared by hundreds of TFs, yet they instruct diverse, segment-specific transcriptional programs along the antero-posterior axis of all bilaterian animals. By profiling HOXA2 and HOXA3 binding in their physiological domains, we identify three main determinants of HOX-selective binding across the genome: 1) recognition of unique variants of the HOX-PBX motif; 2) differential affinity at ‘shared’ HOX-PBX motifs and; 3) presence of additional tissue-specific, non-TALE, TFs. These mechanisms (with the possible exception of the first) are expected to generate quantitative (rather than qualitative, i.e. binding/no binding) differences in the relative levels of HOX/TALE occupancy on commonly bound regions. Such quantitative changes are a feature of continuous networks [37], in which TFs bind a continuum of functional and non-functional sites and regulatory specificities derive from quantitative differences in DNA occupancy patterns.
HOX paralog-selective binding occurs in cooperation with TALE. The high degree of HOX and TALE interaction flexibility, mediated by paralog-specific protein signatures, has been proposed to generate paralog-specific functions of HOX TFs [38]. Here, by defining the in vivo repertoire of HOX occupied sites, we identify DNA sequence as an additional determinant of HOX-TALE functional specificity in vivo. This finding is consistent with the mechanism of latent specificity described for Drosophila Hox/Exd (PBX) interaction [39] and in vitro observations that HOX TFs bind longer, more specific sequence motifs in the presence of TALE. However, the effects of TALE on HOX binding in vivo go beyond the refinement of HOX binding sites as, at least in the BA context, binding with TALE appears to be a requirement for loading HOX on chromatin. Our observations indicate that HOXA2-A3 overwhelmingly recognize genomic sites that are enriched in HOX-PBX motifs and are also occupied by TALE TFs in vivo. Therefore, TALE provides a platform for HOX to bind; selectivity enables HOX paralogs to preferentially bind different subsets of this common platform. In agreement with our finding that BA-specific chromatin states do not seem to play a role in HOX target site selection, TALE platform is largely similar across BA1-2-PBA.
What is the functional significance of HOX-TALE interaction on chromatin and how does it contribute to paralog-specific transcriptional programs? Many examples from animal development indicate that transcriptional regulation is mediated by distinct combinations of TFs. TALE TFs operate as a hub, which assists combinatorial assembly of TF complexes. TALE platform enables HOX function to be modulated by other TFs; in doing so, it integrates positional signals (encoded by HOX) and local inputs (provided by cell type-/tissue-specific TFs) into defined transcriptional outputs. While it is possible that MEIS and PBX facilitate access of diverse TFs to relatively inaccessible chromatin, MEIS TFs differ from conventional pioneer TFs, which function to open chromatin regions but are not directly involved in enhancer activation [40,41]. Remarkably, independently of the type of TF involved (HOX or other tissue-specific TFs), positive changes in MEIS binding result in a functional effect, i.e. increased enhancer activity. High instances of MEIS binding are typically tissue-specific and highly correlated with enhancer activity. In fact, differential MEIS binding in a specific BAs is generally a very good predictor for matching changes in enhancer activity in the same tissue. Based on our observations and the well-established role of MEIS in transcriptional activation [34,42,43], we propose a model of transcriptional activation, where TALE (MEIS) TFs function as broad or general activators and HOX paralog selectivity is mainly directed at harnessing TALE functional activity at selected locations. Using their recognition motifs, HOX and/or tissue-specific TFs select specific MEIS binding locations, where they stabilize MEIS binding to generate precise functional outputs, or patterns of enhancer activation (Fig 7). Interestingly, MEIS2 interacts with PARP1 [43], a large enzyme capable of triggering phase condensation [44]. Increasing MEIS residence time (as a result of the cooperation with HOX and other TFs) may favour PARP1 recruitment at selected loci and, in turn, generate the liquid-liquid phase transitions observed to promote gene activation [8,45].
Because high instances of MEIS binding are typically associated with combinatorial TF binding, a precise identification of the critical steps for enhancer activation, and their sequential order, remains problematic. For similar reasons, MEIS and PBX shared genomic occupancy complicates dissecting their respective contributions to enhancer binding and activation. In addition to TALE, numerous other TFs are broadly, if not ubiquitously expressed during development, yet their inactivation results in tissue-specific phenotypes. It is tempting to speculate that similar principles of TF functional connectivity could explain other transcriptional networks, i.e. that cell type- tissue-specific regulators harness the activation abilities of broadly expressed TFs to generate cell type -specific gene expression programs.
Material and methods
Ethics statement
Experiments using animals were performed in accordance with legislation in the UK Animals (Scientific Procedures) Act of 1986 (PPL 70/8858 to Graham Morrissey). All procedures on zebrafish adults and embryos were approved by the University of Massachusetts Institutional Animal Care and Use Committee (IACUC).
Animal experiments
CD1 mice were time-mated to obtain BA2 or PBA from E115 embryos. Mouse experiments were carried out under ASPA 1986. Wild type zebrafish were raised in the University of Massachusetts Medical Center Zebrafish Facility. Embryos and adult zebrafish were maintained under standard laboratory conditions. Enhancers were amplified from mouse genomic DNA using the primers (listed in S3 Table), cloned into pCR8/GW/TOPO vector (Life Technologies) and recombined using the Gateway system (Life Technologies) to an enhancer test vector that includes a strong midbrain enhancer (Minitol2-GwB-zgata2-GFP-48, a kind gift from JL Skarmeta) as an internal control. Fertilized zebrafish embryos were collected from natural spawnings. Plasmid DNA was injected into the cytoplasm of one-cell stage embryos. Injected embryos were visualized intermittently by fluorescence microscopy up to 48 hr post fertilization to identify transgenic carriers. These were raised to adulthood, outcrossed to wildtype fish and the resulting F1 embryos were scored for GFP expression in order to generate stable transgenic lines.
Next-generation sequencing data and downstream analyses
ChIP-seq was performed as described [30] using rabbit polyclonal antibodies targeting HOXA3 (non-conserved N-terminal amino acids 24 to 180), HOXA2 [46], PBX1-2-3-4 (sc-25411X, Santa Cruz), MEIS1/2 (sc-10599X, Santa Cruz) and rabbit IgG (Millipore). DNA was recovered from two independent ChIP-seq experiments and purified using DiaPure columns (Diagenode). Enrichment was validated by SYBR green quantitative PCR (qPCR) using primers listed in S3 Table. DNA libraries were constructed using the MicroPlex Library Preparation Kit v2 (Diagenode) and sequenced with the Illumina next generation sequencing platform. ChIP-seq experiments were analysed using Trimmomatic for trimming [47], Bowtie2 for aligning to the mouse genome (mm9) [48], samtools [49] to remove the aligned reads with a mapping quality Q30 and MACS2 for peak calling [50] with default narrow peak calling setting for TFs and broad peak calling setting for histone modification marks. ‘findMotifGenome’ module of the HOMER package was used to detect de novo motif in 200nt summit regions [24]. Venn diagrams were generated using 200nt peak summits with an overlap of at least 1nt. GREAT standard association rule settings [25] was used to associate ChIP-seq peaks with genes and uncover events controlled by TF binding. DiffBind [51] was used to re-center MEIS and H3K27ac peaks across BA1, BA2 and PBA (S7 Fig) and calculate RPKM values and raw counts in the re-centered regions. edgeR generalized linear model (GLM) method with TMM normalization [52] was used to select differential peaks and calculate fold change in MEIS binding and H3K27ac across BAs used to generate boxplots and scatterplots. Gene expression CPM values and differential gene expression at E10.5 and E11.5 were derived from [18,30]. ggplot2 package [53] was used to generate CPM values heatmap. GALAXY [54], Bioconductor GenomicRanges package [55], and Bioconductor ChIPpeakAnno package (https://www.bioconductor.org/packages//2.10/bioc/html/ChIPpeakAnno.html) were used to intersect, modify and visualize genomic coordinates. Bioconductor Biostring [56] was used to locate fixed motif sequences in the binding regions. Distance between HOX and MEIS binding regions was calculated using GenomicRanges package and plotted with ggplot2. The Kernel density distribution of MEIS fold enrichment in HOX binding regions vs non-HOX binding regions were calculated by R kernel density distribution estimation (R core team 2013) and plotted with ggplot2.
All RNA-seq and ChIP-seq datasets are available on ArrayExpress.
Convolutional neural network models and in silico mutagenesis
MEIS differential sequence features are detected by recently published differential convolutional neural network (CNN) structure [35]. For in silico binding site knockout we trained a CNN model for multitask regression of MEIS and HOX RPKM binding level. The CNN was trained by transfer learning, using convolution parameters from a previously published 1-convolutional layer MEIS RPKM model [35]. Convolutional filters were transferred to a new model, which was then trained on a subset of MEIS regions also bound by HOX, to simultaneously predict log2RPKM values in 2 replicates of HOXA2 in BA2, 2 from HOXA3 in PBA, and one replicate of MEIS in BA1, BA2 and PBA. The training data consisted of 6795 regions of 600nt with HOX binding predicted by MACS2 in any tissue. The regression model was subsequently used to predict the change in RPKM values after binding site erasure. For simulated genomic knockout, a 25nt site containing each feature was replaced by random di-nucleotides from the remaining part of the region and RPKM levels were predicted. Random replacement was repeated 100 times for each feature, averaging the predicted RPKM change. To select candidate features for erasure, MEIS PBA up-binding features were first obtained from the previously published 3-task parallel model and subsequently filtered. Sites of HOXA3 and GATA were required to contain consensus motif “TGATNNAT” and “WGATAA” respectively, with no mismatch allowed. Forkhead sites were selected based on long distinct k-mers, derived from KSM motif representation method [57], namely exact matches to any of the following sequences: “AAAATAAACA", "AAAAATAAAC", "AATAAATCAA", "ATNAATCAACA", "AAATAAACAC", "ATAAATCAAC","GAAAATAAAC", "CAAAATAAAC", "AAAATAAACT", "AAATAAACAA". These candidate sites were identified within a +/- 250nt window centred on HOXA3 and GATA6 ChIP-seq peak summits, FE of replicates was combined with edgeR [52] and a Poisson test was performed as in MACS2 using false discovery rate (FDR) cutoff = 0.05. Only Forkhead and GATA motifs that did not contain internal matches to HOX-PBX motif were selected. Subsets of GATA and Forkhead sites located within +/- 100nt from a HOX-PBX sites were selected for mutagenesis.
Single cell analysis
Single cell transcriptomic data from [28] was retrieved from GEO under accession code GSE129114. Low-abundance genes were filtered according to [58]. Read counts are normalized according to [59]. Highly variable genes (HVGs) were selected using ‘modelGeneVar’ from scran bioconductor package [60], and used for clustering and dimension reduction. Clustering was done using Hierarchical clustering function hclust in R with ward.D2 method [61]. Dimension reduction was performed using with the ‘Rtsne’ function in R.
Electrophoretic mobility shift assays
Probes were made from primers with 5’ ATO700, and purified with QIAGEN PCR purification kit (Qiagen). Proteins were generated using TnT Quick Coupled Transcription/Translation System (Promega) and the following plasmids: pcDNA3-Hoxa2, pcDNA3-Hoxa3, pcDNA3-Meis2, containing mouse coding sequences for Hoxa2, Hoxa3 and Meis2 (isoform 1), cloned into pcDNA3 (Invitrogen); pcDNA3-PBX1a is a gift from Francesco Blasi. Reactions (4% Ficoll, 20mM HEPES, 37.5mM KCl, 1mM DTT, 0.1mM EDTA, 2μg Poly dI.dC, 16ng probe, and 2μl of TNT extracts in total volume of 10μl) were mixed by gentle flicking, and incubated at room temperature for 12 minutes before being run on 4% acrylamide gel at 70V in 0.5X TBE. Bands were quantified using ImageJ (version 1.52a). Intensity plots for each lane were generated by selecting the same region across all the lanes. Individual band intensities were plotted as histograms areas and quantified by the software. Values in triplicate were averaged and, unless stated otherwise, adjusted to the band intensity of the probe bound by PBX/MEIS in the absence of any HOX. Corresponding numerical data are provided as supporting information (data summary).
Luciferase assay
Meis2 and Zfp703 enhancers were amplified from mouse genomic DNA using primers listed in S3 Table cloned into pCR8/GW/TOPO vector (Life Technologies) and recombined using the Gateway system (Life Technologies) into pGL4.23-GW (a gift from Jorge Ferrer; Addgene plasmid # 60323; http://n2t.net/addgene:60323RRID:Addgene_60323). Enhancers were co-transfected with pcDNA3, pcDNA3-Hoxa2, pcDNA3-Hoxa3, pcDNA3-Meis2, pcDNA3-PBX1a (described above) and pcDNA3-Hoxd3 generated by GenScript. NIH3T3 cells were grown in DMEM (D6429) supplemented with 10% FBS and 5% penicillin/streptomycin, and seeded in 24-well plates at 100,000 cells in 1ml. Cells were transfected with GeneJuice Transfection Reagent (Novagen), using 250ng luciferase plasmid and 300ng pcDNA3 plasmids per well. Cells were harvested 24 hours after transfection and luciferase measured using Luciferase Assay System and the GloMax Multi-Detection System (Promega). Corresponding numerical data are provided as supporting information (data summary).
Antibody validation
Gateway entry vectors for mouse Hoxb1 and Hoxb2 [62], human HOXA3 and HOXC4 (http://horfdb.dfci.harvard.edu/hv7/) were used to generate mammalian expression vectors for FLAG-HOX (v1899 destination vector) using the gateway technology [63]. Gateway expression vectors for pExpFLAG-Hoxa1 and pExpFLAG-Hoxa2 are described in [64,65]. HEK293 cells were grown at 37°C using DMEM (D6429) supplemented with 10% FBS, 5% penicillin/streptomycin, and 5% L-glutamine. Cells were seeded in 6-well plates at 400,000 cells/well and transfected 24 hours after plating using 1μg of HOX plasmid constructs and Fugene6 (Promega) according to the manufacturer's instructions. Proteins were collected 48 hours after transfection, boiled in Laemmli buffer, run on SDS-page and visualized using anti-FLAG (M2) (#F1804, Sigma), HRP-conjugated anti-β-ACTIN (#A3854, Sigma) and anti-Hoxa3 antibody (1:2000) and HRP-conjugated secondary antibodies.
Co-precipitation experiments
Coding sequences for MEIS1b and MEIS2.1 were cloned in pEnt plasmids, confirmed by DNA sequencing and used to generate pExp mammalian expression vectors for GST-tagged proteins with the pDest-GST N-terminal destination vector using the gateway technology [66]. HEK293 cells were transfected as above, using 500ng each of FLAG/GST constructs per well. Proteins were collected 48 hours after transfection and co-precipitation performed as described in [62].
Supporting information
Acknowledgments
We thank Ian Donaldson, Andy Hayes and the other members of the Genomic Technologies, Bioinformatics and Biological Services Core Facilities at the University of Manchester. We also thank Rene Reszohazy for sharing plasmids and Samir Merabet for helpful suggestions. A special thanks to Paul Shore for sharing his lab space, when our building became inaccessible.
Data Availability
All datasets are available on ArrayExpress with the following accession numbers: Experiment ArrayExpress accession: E-MTAB-8607 Title: ChIP-seq for Hoxa3 on mouse posterior branchial arches at E11.5. https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8607 Experiment ArrayExpress accession: E-MTAB-7766 Title: ChIP-seq for Hoxa2 on mouse second branchial arches at E11.5 http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-7766 Experiment ArrayExpress accession: E-MTAB-8606 Title: ChIP-seq for Hoxa2 on mouse posterior branchial arches at E11.5 https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8606 Experiment ArrayExpress accession: E-MTAB-8608 Title: ChIP-seq for Pbx on mouse second and posterior branchial arches at E11.5 https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8608 Experiment ArrayExpress accession: E-MTAB-7767 Title: ChIP-seq for Meis on mouse branchial arches at E11.5. http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-7767.
Funding Statement
This work was supported by Medical Research Council (MRC) (http:www.mrc.ukri.org) grant MR/L009986/1 to NB, Biotechnology and Biological Sciences Research Council (BBSRC) (http:www.bbsrc.ukri.org) grant BB/N00907X/1 to NB, MR and KM and a BBSRC studentship to ML. FL and CS were supported by National Institute of Health (NIH) (http:www.nih.gov) grant NS038183 to CS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Spitz F, Furlong EE. Transcription factors: from enhancer binding to developmental control. Nat Rev Genet. 2012;13(9):613–26. Epub 2012/08/08. nrg3207 [pii] 10.1038/nrg3207 . [DOI] [PubMed] [Google Scholar]
- 2.Reiter F, Wienerroither S, Stark A. Combinatorial function of transcription factors and cofactors. Curr Opin Genet Dev. 2017;43:73–81. Epub 2017/01/23. 10.1016/j.gde.2016.12.007 . [DOI] [PubMed] [Google Scholar]
- 3.Mirny LA. Nucleosome-mediated cooperativity between transcription factors. Proc Natl Acad Sci U S A. 2010;107(52):22534–9. Epub 2010/12/15. 10.1073/pnas.0913805107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Moyle-Heyrman G, Tims HS, Widom J. Structural constraints in collaborative competition of transcription factors against the nucleosome. Journal of molecular biology. 2011;412(4):634–46. Epub 2011/08/09. 10.1016/j.jmb.2011.07.032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Farley EK, Olson KM, Zhang W, Brandt AJ, Rokhsar DS, Levine MS. Suboptimization of developmental enhancers. Science. 2015;350(6258):325–8. 10.1126/science.aac6948 WOS:000362838700042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Cumberworth A, Lamour G, Babu MM, Gsponer J. Promiscuity as a functional trait: intrinsically disordered regions as central players of interactomes. Biochem J. 2013;454(3):361–9. Epub 2013/08/31. 10.1042/BJ20130545 . [DOI] [PubMed] [Google Scholar]
- 7.Staby L, O'Shea C, Willemoes M, Theisen F, Kragelund BB, Skriver K. Eukaryotic transcription factors: paradigms of protein intrinsic disorder. Biochem J. 2017;474(15):2509–32. Epub 2017/07/14. 10.1042/BCJ20160631 . [DOI] [PubMed] [Google Scholar]
- 8.Boija A, Klein IA, Sabari BR, Dall'Agnese A, Coffey EL, Zamudio AV, et al. Transcription Factors Activate Genes through the Phase-Separation Capacity of Their Activation Domains. Cell. 2018;175(7):1842–55 e16. Epub 2018/11/20. 10.1016/j.cell.2018.10.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Burglin TR, Affolter M. Homeodomain proteins: an update. Chromosoma. 2016;125(3):497–521. 10.1007/s00412-015-0543-8 WOS:000377902300013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bobola N, Merabet S. Homeodomain proteins in action: similar DNA binding preferences, highly variable connectivity. Curr Opin Genet Dev. 2017;43:1–8. Epub 2016/10/22. 10.1016/j.gde.2016.09.008 . [DOI] [PubMed] [Google Scholar]
- 11.Noyes MB, Christensen RG, Wakabayashi A, Stormo GD, Brodsky MH, Wolfe SA. Analysis of homeodomain specificities allows the family-wide prediction of preferred recognition sites. Cell. 2008;133(7):1277–89. 10.1016/j.cell.2008.05.023 WOS:000257144600023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Krumlauf R. Hox genes in vertebrate development. Cell. 1994;78(2):191–201. Epub 1994/07/29. 10.1016/0092-8674(94)90290-9 . [DOI] [PubMed] [Google Scholar]
- 13.Pearson JC, Lemons D, McGinnis W. Modulating Hox gene functions during animal body patterning. Nat Rev Genet. 2005;6(12):893–904. Epub 2005/12/13. nrg1726 [pii] 10.1038/nrg1726 . [DOI] [PubMed] [Google Scholar]
- 14.Rezsohazy R, Saurin AJ, Maurel-Zaffran C, Graba Y. Cellular and molecular insights into Hox protein action. Development. 2015;142(7):1212–27. Epub 2015/03/26. 10.1242/dev.109785 . [DOI] [PubMed] [Google Scholar]
- 15.Duboule D. The rise and fall of Hox gene clusters. Development. 2007;134(14):2549–60. Epub 2007/06/08. 10.1242/dev.001065 . [DOI] [PubMed] [Google Scholar]
- 16.Merabet S, Mann RS. To Be Specific or Not: The Critical Relationship Between Hox And TALE Proteins. Trends Genet. 2016;32(6):334–47. Epub 2016/04/14. 10.1016/j.tig.2016.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Selleri L, Zappavigna V, Ferretti E. 'Building a perfect body': control of vertebrate organogenesis by PBX-dependent regulatory networks. Genes Dev. 2019;33(5–6):258–75. Epub 2019/03/03. 10.1101/gad.318774.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Amin S, Donaldson IJ, Zannino DA, Hensman J, Rattray M, Losa M, et al. Hoxa2 Selectively Enhances Meis Binding to Change a Branchial Arch Ground State. Dev Cell. 2015;32(3):265–77. 10.1016/j.devcel.2014.12.024 WOS:000349511800004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Tsai A, Muthusamy AK, Alves MR, Lavis LD, Singer RH, Stern DL, et al. Nuclear microenvironments modulate transcription from low-affinity enhancers. Elife. 2017;6 Epub 2017/11/03. 10.7554/eLife.28975 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Porcelli D, Fischer B, Russel S, White R. Chromatin accessibility plays a key role in selective targeting of Hox proteins. Genome Biol. 2019;20 (1):115 10.1186/s13059-019-1721-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gendron-Maguire M, Mallo M, Zhang M, Gridley T. Hoxa-2 mutant mice exhibit homeotic transformation of skeletal elements derived from cranial neural crest. Cell. 1993;75(7):1317–31. Epub 1993/12/31. 0092-8674(93)90619-2 [pii]. 10.1016/0092-8674(93)90619-2 . [DOI] [PubMed] [Google Scholar]
- 22.Rijli FM, Mark M, Lakkaraju S, Dierich A, Dolle P, Chambon P. A homeotic transformation is generated in the rostral branchial region of the head by disruption of Hoxa-2, which acts as a selector gene. Cell. 1993;75(7):1333–49. Epub 1993/12/31. 0092-8674(93)90620-6 [pii]. 10.1016/0092-8674(93)90620-6 . [DOI] [PubMed] [Google Scholar]
- 23.Manley NR, Capecchi MR. The role of Hoxa-3 in mouse thymus and thyroid development. Development. 1995;121(7):1989–2003. Epub 1995/07/01. . [DOI] [PubMed] [Google Scholar]
- 24.Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38(4):576–89. Epub 2010/06/02. 10.1016/j.molcel.2010.05.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McLean CY, Bristor D, Hiller M, Clarke SL, Schaar BT, Lowe CB, et al. GREAT improves functional interpretation of cis-regulatory regions. Nat Biotechnol. 2010;28(5):495–501. Epub 2010/05/04. nbt.1630 [pii] 10.1038/nbt.1630 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Donaldson IJ, Amin S, Hensman JJ, Kutejova E, Rattray M, Lawrence N, et al. Genome-wide occupancy links Hoxa2 to Wnt-beta-catenin signaling in mouse embryonic development. Nucleic Acids Res. 2012;40(9):3990–4001. 10.1093/nar/gkr1240 WOS:000304201300027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Manley NR, Capecchi MR. Hox group 3 paralogous genes act synergistically in the formation of somitic and neural crest-derived structures. Dev Biol. 1997;192(2):274–88. Epub 1998/01/27. 10.1006/dbio.1997.8765 . [DOI] [PubMed] [Google Scholar]
- 28.Soldatov R, Kaucka M, Kastriti ME, Petersen J, Chontorotzea T, Englmaier L, et al. Spatiotemporal structure of cell fate decisions in murine neural crest. Science. 2019;364(6444). Epub 2019/06/07. 10.1126/science.aas9536 . [DOI] [PubMed] [Google Scholar]
- 29.Creyghton MP, Cheng AW, Welstead GG, Kooistra T, Carey BW, Steine EJ, et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc Natl Acad Sci U S A. 2010;107(50):21931–6. Epub 2010/11/26. 10.1073/pnas.1016071107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Losa M, Latorre V, Andrabi M, Ladam F, Sagerstrom C, Novoa A, et al. A tissue-specific, Gata6-driven transcriptional program instructs remodeling of the mature arterial tree. Elife. 2017;6 Epub 2017/09/28. 10.7554/eLife.31362 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Morgunova E, Taipale J. Structural perspective of cooperative transcription factor binding. Curr Opin Struct Biol. 2017;47:1–8. Epub 2017/03/30. 10.1016/j.sbi.2017.03.006 . [DOI] [PubMed] [Google Scholar]
- 32.De Kumar B, Parker HJ, Paulson A, Parrish ME, Pushel I, Singh NP, et al. HOXA1 and TALE proteins display cross-regulatory interactions and form a combinatorial binding code on HOXA1 targets. Genome Res. 2017;27(9):1501–12. Epub 2017/08/09. 10.1101/gr.219386.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Huang Y, Sitwala K, Bronstein J, Sanders D, Dandekar M, Collins C, et al. Identification and characterization of Hoxa9 binding sites in hematopoietic cells. Blood. 2012;119(2):388–98. Epub 2011/11/11. 10.1182/blood-2011-03-341081 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Choe SK, Lu P, Nakamura M, Lee J, Sagerstrom CG. Meis cofactors control HDAC and CBP accessibility at Hox-regulated promoters during zebrafish embryogenesis. Dev Cell. 2009;17(4):561–7. Epub 2009/10/27. S1534-5807(09)00344-X [pii] 10.1016/j.devcel.2009.08.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Phuycharoen M, Zarrineh P, Bridoux L, Amin S, Losa M, Chen K, et al. Uncovering tissue-specific binding features from differential deep learning. Nucleic Acids Res, 2020. 48(5): p. e27 Epub 2020/01/25. 10.1093/nar/gkaa009 PubMed Central PMCID: PMC7049686 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Rousso DL, Gaber ZB, Wellik D, Morrisey EE, Novitch BG. Coordinated actions of the forkhead protein Foxp1 and Hox proteins in the columnar organization of spinal motor neurons. Neuron. 2008;59(2):226–40. Epub 2008/08/01. 10.1016/j.neuron.2008.06.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Biggin MD. Animal transcription networks as highly connected, quantitative continua. Dev Cell. 2011;21(4):611–26. Epub 2011/10/22. 10.1016/j.devcel.2011.09.008 . [DOI] [PubMed] [Google Scholar]
- 38.Dard A, Reboulet J, Jia Y, Bleicher F, Duffraisse M, Vanaker JM, et al. Human HOX Proteins Use Diverse and Context-Dependent Motifs to Interact with TALE Class Cofactors. Cell Rep. 2018;22(11):3058–71. Epub 2018/03/15. 10.1016/j.celrep.2018.02.070 . [DOI] [PubMed] [Google Scholar]
- 39.Slattery M, Riley T, Liu P, Abe N, Gomez-Alcala P, Dror I, et al. Cofactor binding evokes latent differences in DNA binding specificity between Hox proteins. Cell. 2011;147(6):1270–82. Epub 2011/12/14. 10.1016/j.cell.2011.10.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Cirillo LA, McPherson CE, Bossard P, Stevens K, Cherian S, Shim EY, et al. Binding of the winged-helix transcription factor HNF3 to a linker histone site on the nucleosome. Embo J. 1998;17(1):244–54. Epub 1998/02/28. 10.1093/emboj/17.1.244 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Jacobs J, Atkins M, Davie K, Imrichova H, Romanelli L, Christiaens V, et al. The transcription factor Grainy head primes epithelial enhancers for spatiotemporal activation by displacing nucleosomes. Nat Genet. 2018;50(7):1011–20. Epub 2018/06/06. 10.1038/s41588-018-0140-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hyman-Walsh C, Bjerke GA, Wotton D. An autoinhibitory effect of the homothorax domain of Meis2. FEBS J. 2010;277(12):2584–97. Epub 2010/06/18. 10.1111/j.1742-464X.2010.07668.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hau AC, Grebbin BM, Agoston Z, Anders-Maurer M, Muller T, Gross A, et al. MEIS homeodomain proteins facilitate PARP1/ARTD1-mediated eviction of histone H1. J Cell Biol. 2017;216(9):2715–29. Epub 2017/07/26. 10.1083/jcb.201701154 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Altmeyer M, Neelsen KJ, Teloni F, Pozdnyakova I, Pellegrino S, Grofte M, et al. Liquid demixing of intrinsically disordered proteins is seeded by poly(ADP-ribose). Nat Commun. 2015;6:8088 Epub 2015/08/20. 10.1038/ncomms9088 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Hnisz D, Shrinivas K, Young RA, Chakraborty AK, Sharp PA. A Phase Separation Model for Transcriptional Control. Cell. 2017;169(1):13–23. Epub 2017/03/25. 10.1016/j.cell.2017.02.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Kutejova E, Engist B, Self M, Oliver G, Kirilenko P, Bobola N. Six2 functions redundantly immediately downstream of Hoxa2. Development. 2008;135(8):1463–70. Epub 2008/03/07. dev.017624 [pii] 10.1242/dev.017624 . [DOI] [PubMed] [Google Scholar]
- 47.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. Epub 2014/04/04. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nature methods. 2012;9(4):357–9. Epub 2012/03/06. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. Epub 2009/06/10. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9(9):R137 Epub 2008/09/19. gb-2008-9-9-r137 [pii] 10.1186/gb-2008-9-9-r137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, et al. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature. 2012;481(7381):389–93. Epub 2012/01/06. 10.1038/nature10730 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40. Epub 2009/11/17. 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wickham. Elegant Graphics for Data Analysis: Springer; 2016. [Google Scholar]
- 54.Geocks J, Nekrutenko A, Taylor J, Galaxy Team. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010;11(8):R86 Epub 2010/08/25. 10.1186/gb-2010-11-8-r86 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Lawrence M, Huber W, Pages H, Aboyoun P, Carlson M, Gentleman R, et al. Software for computing and annotating genomic ranges. PLoS Comput Biol. 2013;9(8):e1003118 Epub 2013/08/21. 10.1371/journal.pcbi.1003118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Pagès H AP, Gentleman R, DebRoy S. Biostrings: Efficient manipulation of biological strings. 2019. [Google Scholar]
- 57.Guo Y, Tian K, Zeng H, Guo X, Gifford DK. A novel k-mer set memory (KSM) motif representation improves regulatory variant prediction. Genome Res. 2018;28(6):891–900. Epub 2018/04/15. 10.1101/gr.226852.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Bourgon R, Gentleman R, Huber W. Independent filtering increases detection power for high-throughput experiments. Proc Natl Acad Sci U S A. 2010;107(21):9546–51. Epub 2010/05/13. 10.1073/pnas.0914005107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lun AT, Bach K, Marioni JC. Pooling across cells to normalize single-cell RNA sequencing data with many zero counts. Genome Biol. 2016;17:75 Epub 2016/04/29. 10.1186/s13059-016-0947-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Lun AT, McCarthy DJ, Marioni JC. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. F1000Res. 2016;5:2122 Epub 2016/12/06. 10.12688/f1000research.9501.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Murtagh F, Legendre P. Ward's Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward's Criterion? Journal of Classification. 2014;31(3):274–95. . [Google Scholar]
- 62.Bridoux L, Bergiers I, Draime A, Halbout M, Deneyer N, Twizere JC, et al. KPC2 relocalizes HOXA2 to the cytoplasm and decreases its transcriptional activity. Biochim Biophys Acta. 2015;1849(10):1298–311. Epub 2015/08/26. 10.1016/j.bbagrm.2015.08.006 . [DOI] [PubMed] [Google Scholar]
- 63.Barrios-Rodiles M, Brown KR, Ozdamar B, Bose R, Liu Z, Donovan RS, et al. High-throughput mapping of a dynamic signaling network in mammalian cells. Science. 2005;307(5715):1621–5. Epub 2005/03/12. 10.1126/science.1105776 . [DOI] [PubMed] [Google Scholar]
- 64.Bergiers I, Bridoux L, Nguyen N, Twizere JC, Rezsohazy R. The homeodomain transcription factor Hoxa2 interacts with and promotes the proteasomal degradation of the E3 ubiquitin protein ligase RCHY1. Plos One. 2013;8(11):e80387 Epub 2013/11/19. 10.1371/journal.pone.0080387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lambert B, Vandeputte J, Remacle S, Bergiers I, Simonis N, Twizere JC, et al. Protein interactions of the transcription factor Hoxa1. Bmc Dev Biol. 2012;12 Artn 29 10.1186/1471-213x-12-29. WOS:000312127700001. 10.1186/1471-213X-12-29 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature. 2005;437(7062):1173–8. Epub 2005/09/29. 10.1038/nature04209 . [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All datasets are available on ArrayExpress with the following accession numbers: Experiment ArrayExpress accession: E-MTAB-8607 Title: ChIP-seq for Hoxa3 on mouse posterior branchial arches at E11.5. https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8607 Experiment ArrayExpress accession: E-MTAB-7766 Title: ChIP-seq for Hoxa2 on mouse second branchial arches at E11.5 http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-7766 Experiment ArrayExpress accession: E-MTAB-8606 Title: ChIP-seq for Hoxa2 on mouse posterior branchial arches at E11.5 https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8606 Experiment ArrayExpress accession: E-MTAB-8608 Title: ChIP-seq for Pbx on mouse second and posterior branchial arches at E11.5 https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8608 Experiment ArrayExpress accession: E-MTAB-7767 Title: ChIP-seq for Meis on mouse branchial arches at E11.5. http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-7767.