Abstract
Head specification by the head-selector gene, orthodenticle (otx) is highly conserved among bilaterian lineages. However, the molecular mechanisms by which Otx and other transcription factors (TFs) interact with the genome to direct head formation, are largely unknown. We employed ChIP-seq and RNA-seq approaches in Xenopus tropicalis gastrulae, and found that occupancy of the corepressor, TLE/Groucho, is a better indicator of tissue-specific cis-regulatory modules (CRMs) than the coactivator p300, during early embryonic stages. Based on TLE binding and comprehensive CRM profiling, we defined two distinct types of Otx2- and TLE-occupied CRMs. Using these devices, Otx2 and other head organizer TFs [e.g., Lim1/Lhx1 (activator) or Goosecoid (repressor)] are able to upregulate or downregulate a large battery of target genes in the head organizer. An underlying principle is that Otx marks target genes for head specification to be regulated positively or negatively by partner TFs through specific types of CRMs.
Introduction
The bilaterian head forms in the most anterior part of the developing embryo. In early embryogenesis, the head-selector, Otx (orthodenticle), a homeodomain-containing transcription factor (TF), is expressed in the head region. In contrast, homeotic selector Hox cluster TFs are expressed along the anteroposterior axis of the trunk and tail.1,2 Otx homeodomain proteins are conserved among bilaterians, from flies to humans, and their functions are essential for proper head formation.2 However, little is known about the mechanisms by which Otx proteins confer different head structures among different species, or about the types of cis-regulatory modules (CRMs) utilized by Otx proteins. To resolve these questions, we carried out comprehensive analyses of Otx target genes and characterized their CRMs, using thousands of synchronized whole Xenopus gastrula embryos.
In developmental biology, an organizer refers to a group of cells or a small piece of tissue that induces surrounding cells to develop into specific tissues or organs. During amphibian embryogenesis, the gastrula organizer known as the Spemann– Mangold organizer, initiates gastrulation movements and establishes the basic body plan. The organizer consists of two different regions - head and trunk organizers, which effect anteroposterior patterning of the neuroectoderm3 (Fig. 1A). Homeodomain proteins, Otx2, Lim1 (=Lhx1), and Goosecoid (Gsc), are expressed in the head organizer to specify head structures 4 (Fig. 1A). The transcriptional regulatory networks underlying the Xenopus organizer have been studied extensively, especially focusing on regulation of gsc.5 Previous work has shown that Otx2 and Lim1 upregulate head organizer genes such as gsc and cerberus, and that Gsc downregulates trunk genes such as brachyury and wnt8a.5–11 At present, the regulatory principle governing Otx2, Lim1, and Gsc remains unsolved.
Fig. 1. Phenotypes of loss- and gain-of-function of Otx2, Lim1, and Gsc in frogs.
(A) Expression patterns of otx2, otx5, lim1, and gsc in Xenopus gastrula embryos. Schematic representation shows co-expression of lim1, otx2, otx5 and gsc in the head organizer11,17,18 and lim1 expression in the trunk organizers.11,52 Co-expression of these genes in X. tropicalis early gastrula embryos was shown with in situ hybridization using hemisections. In anterior neuroectoderm, otx2 is weakly expressed, while otx5 is more strongly expressed. Arrowhead, dorsal blastopore; open arrowhead, expression in the anterior neuroectoderm; dashed line, boundary of ectoderm and mesoderm. (B) Loss-of-function analysis in X. tropicalis. Phenotypes of normal embryos (control morphants) and head-reduced embryos (lim1/otx2/otx5/gsc morphants) are shown. Sagittal section and whole mount in situ hybridization of the telencephalon marker (eomes) and the midbrain-hindbrain boundary marker (en2) showed that forebrain, midbrain, and foregut were shrunk in head-reduced embryos. Note that the notochord reached the anterior-most region in the morphant. Tailbud embryos are shown in lateral (anterior to the left; three left panels) or dorsoanterior (right-most panels) views. fb, forebrain; fg, foregut; no, notochord. (C) Gain-of-function analysis in X. laevis. Left panels show lateral views of a secondary head with one eye generated after ventral injection of a cocktail of mRNAs as indicated. Right panels show immunostaining of head organizer cocktail-injected embryos against notochord by MZ15 (dorsal view of the same embryo as shown in the left panel) and somites by 12/101 (lateral view), respectively.
Using Xenopus tropicalis gastrula embryos, we carried out genome-wide ChIP-seq analysis for Otx2, Lim1, Gsc, the general coactivator, p300, the general corepressor, TLE/Groucho, and histone marks. In addition, RNA-seq analysis was performed on embryos knocking down these TFs, as well as dissected embryonic tissue fragments. Our analyses revealed for the first time that TLE occupancy around the CRM is a better indicator of tissue-specific CRM activity than is p300 occupancy. Based on molecular interaction studies among Otx2, Lim1, and Gsc via specific CRMs, we propose a regulatory model, in which Otx2 binding on the genome represents marking of head induction processes in early vertebrate gastrula embryos by simultaneously upregulating a large battery of target genes in cooperation with Lim1, and downregulating others in concert with Gsc (Supplementary Fig. S1). The simplicity of this mode of head specification may explain the evolutionarily conservation of the head-selector Otx.
Results
Cooperation of Otx2, Lim1, and Gsc in head formation
Previous studies in mice deficient in Otx2, Lim1, and Gsc have shown that head formation can proceed without Gsc, but not without Otx2 and Lim1.12–16 Because lim1, gsc, otx2 and its paralog, otx5, are co-expressed in the organizer of X. tropicalis early gastrulae similar to X. laevis11,17,18 (Fig. 1A), in X. tropicalis we knocked down combinations of Otx2, Otx5, Lim1, and Gsc using antisense morpholino oligos (MOs) (Fig. 1B; see Supplementary Fig. S2 for MO specificity). Morphants injected with lim1/otx2/otx5 or otx2/otx5/gsc triple MOs or all four MOs exhibited more severe head-reduced phenotypes than single or double morphants (Supplementary Fig. S2C). Sagittal sections and brain marker gene expression confirmed that anterior structures such as the forebrain, midbrain, and foregut were shrunk in the morphants (Fig. 1B). Compared to other single morphants, otx2 morphants exhibited more severe head defects, in which about 25% of embryos had small heads with trace eyes. This may be at least partly due to otx2’s expression in the anterior neural tissue and later in the brain, in addition to the head organizer.19,20 gsc single morphants exhibited cyclopic phenotypes (Supplementary Fig. S2C), similar to those reported in Xenopus laevis.21 In mice, Lim1- or Otx2-deficient embryos display a total lack of head structures, much more severe defects than seen in lim1 or otx2 single morphants in X. tropicalis. Less serious defects in Xenopus may be due to incomplete knockdown by MOs. Nevertheless, knockdown data indicate that Otx2, Otx5, Lim1, and Gsc all contribute to head formation cooperatively, but each works in a different way.
In gain-of-function experiments, we injected combinations of mRNAs encoding Lim1, Ldb1, Ssbp3, Otx2, Gsc, and Tle1 into the ventral equatorial region (Fig. 1C). Ldb1 and Ssbp3 are Lim1 cofactors that activate Lim1.22,23 Tle1 is an Otx2- and Gsc-binding corepressor.24,25 Lim1/Ldb1/Ssbp3 together with Otx2, Gsc/Tle1 or both, induce secondary head structures that were never observed after overexpression of Lim1/Ldb1/Ssbp3 or Otx2/Gsc/Tle1 (Supplementary Fig. S3A). Furthermore, the combination of Lim1, Ldb1, Ssbp3, Otx2, Gsc, and Tle1 showed significantly higher rates of secondary head induction than other combinations (Supplementary Fig. S3A). Therefore, we refer to it as the ‘head organizer cocktail.’ This is the first demonstration that Lim1, Otx2, and Gsc synergistically generate secondary head structures in vertebrate embryos.
As demonstrated with immunostaining, the secondary axis induced by the head organizer cocktail contained the notochord and somites (Fig. 1C). This kind of phenotype is usually called a complete axis, but unlike a Wnt-induced axis, the secondary axis resulting from the head organizer cocktail exhibited a shorten trunk accompanied by a bent tail and an open blastopore (Fig. 1C). The animal cap assay demonstrated that the head organizer cocktail induced head organizer genes (chordin, gsc, and cerberus), but not the trunk organizer gene (brachyury) and the muscle marker gene (actc1) (Supplementary Fig. S3B). Thus, the head organizer cocktail induces primarily the head and secondarily the trunk, possibly due to both anteriorizing and dorsalizing activities of this cocktail.
Features of Otx2, Lim1, and Gsc occupied CRMs
To identify CRMs and in vivo target genes for Otx2, Lim1, and Gsc in the organizer, we performed ChIP-seq using X. tropicalis early gastrula embryos. At this stage of embryonic development, these three genes are strongly co-expressed in the head organizer region (Fig. 1A). Because cross-reactivity of our anti-Otx2 antibody to Otx5 protein is limited,11 and because otx2 expression in anterior neuroectoderm just began at this stage (Fig. 1A), the ChIP-seq data of Otx2 was regarded as coming mainly from the head organizer. To uncover potential enhancer and silencer functions of CRMs and their epigenetic states, we also performed ChIP-seq for p300, TLE, monomethylated histone H3 lysine 4 (H3K4me1), and acetylated histone H3 lysine 27 (H3K27ac). In addition, we utilized previously reported ChIP-seq data for trimethylated histone H3 lysine 27 (H3K27me3) and RNA polymerase II (RNAP2).26
ChIP-seq peak data were validated using qPCR (Supplementary Fig. S4A–E), another peak-calling algorithm (Supplementary Tables S1, S2; Methods for details), and the following specific and overall peak analyses: (i) ChIP-seq peaks of all TFs were detected at a well-studied CRM, gsc-U15,6,11 (Fig. 2A); (ii) Motif discovery analysis for Otx2, Lim1, or Gsc ChIP-seq peaks, showed strong enrichment of their known binding motifs6,7,11,27,28 (Fig. 2B and see Supplementary Fig. S5 for details); (iii) p300 peaks were correlated well with enhancer histone modifications, H3K4me1 and H3K27ac, but not with the repressive histone mark H3K27me3, as expected (Fig. 2C); (iv) Average peak profiles around transcription start sites (TSSs) showed that Otx2, Lim1, p300, H3K4me1 and H3K27ac, but not Gsc and TLE were strongly enriched at TSS (Fig. 2D), implying that transcriptional activators and coactivators (Otx2, Lim1, and p300), but not repressors and corepressors (Gsc and TLE) preferably associate with TSS; (v) Pearson correlation coefficient analysis using ChIP-seq enrichment data in 200 bp windows across the entire genome (Fig. 2E) showed that Lim1 was correlated with p300 (R=0.328) more strongly than with TLE (R=0.202), whereas Otx2 was correlated with TLE (R=0.572) more strongly than with p300 (R=0.473). In this matrix, Gsc was correlated with TLE (R=0.183) more strongly than with p300 (R=0.170), but only marginally; (vi) When co-occupancies of ChIP-seq peaks (overlaps of peaks) were compared, correlation between Gsc and TLE became much more prominent (6.6% vs 15.9% in the line of Gsc in Fig. 2F). Results of (iv-vi) are consistent with previous reports that Gsc and Lim1 function exclusively as a repressor and an activator, respectively, while Otx2 has both functions.6,7,24,25,29,30 These analyses validated the quality of our ChIP-seq data.
Fig. 2. Validations of ChIP-seq data.
(A) ChIP-seq data surrounding a well-known target gene, gsc. Transcriptional regulatory model of gsc gene through gsc-U1 is shown on the left. Lim1 and Otx2 activate gsc and Gsc represses itself via gsc-U1.6,11 In the genome browser representation, the ordinate shows enrichment against input control DNA for five transcription factors, or sequence tag enrichment for epigenetic marks, as indicated. Refseq gene models are shown with TSSs (bent arrows). According to the relative location of CRMs with respect to TSSs, the CRM was named as U-CRM (upstream CRM) or D-CRM (downstream CRM) and indicated by dashed boxes. Type I and type II CRMs (see text for definitions) are indicated with parentheses. (B) Enriched sequence motifs in Otx2, Lim1, and Gsc ChIP-seq peaks. Using the MEME-ChIP program, known binding motifs were found as enriched motifs. For Otx2, both monomer- and dimer-binding motifs were found. E-values were shown with the algorithm. (C) Clustered heatmap representations of ChIP-seq data around p300 peaks. Each genomic position was clustered according to p300 ChIP-seq data (k=10). H3K4me1 and H3K27ac, but not H3K27me3 data were correlated with p300 peaks. (D) DNA occupancies of TFs (left) and epigenetic marks (right) around TSS. Average z-scores of ChIP-seq signals were plotted. (E) A matrix of Pearson correlation coefficients of ChIP-seq enrichment between each pair of TFs. (F) A table of proportions of ChIP-seq peak overlaps between each pair of TFs.
Because short genomic regions targeted by multiple TFs are proposed to function as CRMs in mouse embryonic stem cells, 31 we define CRMs as overlapping binding regions for two or more of the five factors, Otx2, Lim1, Gsc, p300, and TLE (Table 1). Overlapping means that two peaks share more than 120 bases (see Methods for details). We named CRMs U1, U2, or D1, D2, and so on, according to their relative positions upstream or downstream from the transcriptional start site (TSS), respectively (see Figs 2 and 3). Notably, most head or trunk organizer genes examined, such as chordin, otx5, wnt8a and not (Fig. 3A,B), possessed multiple CRMs occupied by Lim1, Otx2, and Gsc.
Table 1.
The overview of ChIP-seq peaks and CRMs.
| ChIP-seq peaks for |
no. of CRMs or peaks |
|||||
|---|---|---|---|---|---|---|
| Otx2 | Lim1 | Gsc | p300 | TLE | ||
| overlapped with 5 peaks |
722 | |||||
| overlapped with 4 peaks |
7 | |||||
| 161 | ||||||
| 1,654 | ||||||
| 768 | ||||||
| 1 | ||||||
| overlapped with 3 peaks |
4 | |||||
| 1,004 | ||||||
| 529 | ||||||
| 20 | ||||||
| 1,556 | ||||||
| 3,506 | ||||||
| 0 | ||||||
| 3 | ||||||
| 73 | ||||||
| 94 | ||||||
| overlapped with 2 peaks |
856 | |||||
| 163 | ||||||
| 2,192 | ||||||
| 4,547 | ||||||
| 8 | ||||||
| 213 | ||||||
| 72 | ||||||
| 27 | ||||||
| 659 | ||||||
| 3,026 | ||||||
| single peaks | 8,732 | |||||
| 5,683 | ||||||
| 20,619 | ||||||
| 13,057 | ||||||
| 18,933 | ||||||
| total peaks potential CRMs |
27,095 | 11,245 | 25,002 | 26,939 | 36,403 | 88,889 |
| 17,689 | 5,307 | 4,193 | 13,307 | 17,371 | 21,865 | |
| p300+;TLE+ | 6,650 | 2,450 | 1,585 | 9,844 | 9,844 | 9,844 |
| p300+;TLE- | 3,223 | 1,224 | 54 | 3,463 | - | 3,463 |
| p300-;TLE+ | 6,793 | 765 | 2,379 | - | 7,527 | 7,527 |
| p300-;TLE- | 1,024 | 873 | 175 | - | - | 1,031 |
Fig. 3. Genome browser representations of ChIP-seq data around head and trunk organizer genes.
(A) ChIP-seq data for head organizer genes, chordin and otx5. (B) ChIP-seq data for trunk organizer genes, wnt8a and not. Genome browser representations were shown as in Fig. 2A.
The X. tropicalis genome contains thousands of “potential CRMs” (“CRMs” hereafter) bound by Otx2 (17,689), Lim1 (5,307), and Gsc (4,193), and most of them were co-occupied by p300 and TLE, suggesting that these CRMs function as enhancers, or silencers, or both in gastrula embryos (Table 1). Although ChIP-analyses were composites of binding data performed using whole embryos that comprise different cell types, it is reasonable to assume that most overlapping peaks indicate co-occupancy of Otx2, Lim1, and Gsc on the same CRM in the same cell, because these genes are predominantly coexpressed in the head organizer (Fig. 1A). We next asked whether Otx2, Lim1, or Gsc-occupied CRMs have a tendency to co-occupy with p300 or TLE, which are designated as p300+/−TLE+/− (see 4 bottom rows in Table 1). Among 4,193 Gsc-binding CRMs, only 54 were p300+TLE- (1.3%), while 2,379 were p300-TLE+ (56.7%). By contrast, among 5,307 Lim1-binding CRMs, 1,224 were p300+TLE-(23.1%; much more than Gsc) and 765 were p300-TLE+ (14.4%; much less than Gsc). Otx2 showed 3,223 p300+TLE- (18.2%; similar to Lim1) and 6,793 p300-TLE+ (38.4%; in between) among 17,689. This tendency for co-occupancy on CRMs between TFs and cofactors further confirmed the activator/repressor functions of Otx2, Lim1, and Gsc, as described above. Otx2 also colocalized frequently with Lim1 (4,937 CRMs), Gsc (3,401 CRMs), and with both (894 CRMs). However, colocalization of Lim1 and Gsc without Otx2 was much less common (12 CRMs). These observations imply that Otx2 cooperates with Lim1 and Gsc on distinct types of CRMs.
TLE marks tissue-specific CRMs
p300 occupancy can serve as a good predictor of tissue-specific enhancers by comparing different tissues isolated from mouse embryos.32 However, because whole embryos were used for ChIP-seq analysis, it is conceivable that p300 binding is associated with both tissue-specific and ubiquitous enhancers, whereas TLE binding is associated with tissue-specific CRMs (Fig. 4A). We first tested whether p300 and TLE occupancies on CRMs are functionally associated with dorsal and ventral genes. We performed ChIP-qPCR for p300 and TLE using the dorsal and ventral halves of bisected gastrula embryos (Fig. 4B; see drawing), and calculated a D/V ratio of %recovery of each CRM (that is, %recovery in the dorsal half was divided by that in the ventral half) (Fig. 4B, see Supplementary Fig. S6 for details). As expected, p300 occupancies on 10 CRMs from dorsal genes exhibited significantly higher D/V ratios than those from ventral genes (P=0.0035 in Fig. 4B; left graph). Conversely, D/V ratios of TLE occupancies tended to be lower in the dorsal genes than in ventral genes (Fig. 4B; right graph). These data suggest that a majority of p300 and TLE-bound CRMs are functionally associated with tissue-specific genes.
Fig. 4. Comprehensive analysis of tissue-specific gene regulation.
(A) Schematic representation of ChIP-seq peaks for coactivators and corepressors with whole embryos. Virtual ChIP-seq peaks from the organizer (blue), the rest of the embryos (yellow), and whole embryos are shown. Tissue-specific CRMs 1 and 2 are occupied by either p300 or other coactivators (coact) in the organizer and mediate gene activation. When the same CRMs are occupied with TLE, silencing occurs somewhere in the non-organizer tissue. If ChIP-seq is performed using whole embryos, p300 binding identifies both ubiquitous enhancers (CRM 3) and tissue-specific enhancers (CRM 1). On the other hand, TLE binding is associated with only tissue-specific CRMs (CRMs 1 and 2). (B) ChIP-qPCR analysis for p300 and TLE using dorsal or ventral halves of gastrula embryos. Ten CRMs from both dorsal and ventral genes were examined. Ratios of %recovery of each CRM between dorsal and ventral halves (D/V) were plotted. Statistical significances of the difference between CRMs from dorsal and ventral genes were examined with chi-square test. (C) Profiles of the distance between CRM and the nearest TSS. Pie charts show proportions of the distance for each group of CRMs as indicated. (D) Comparison of the tissue-specificity of p300/TLE-bound CRMs. Pie charts show the proportions of CRMs assigned to genes in each group as indicated. (E,F) Epigenetic states around CRMs. Average z-scores of ChIP-seq signals of H3K27ac (E) or H3K27me3 (F) around the CRM midpoint were plotted with 95% confidence intervals as indicated. (G,I) Epigenetic states around TSSs. Average z-scores of ChIP-seq signals of H3K27me3 around TSSs assigned with CRMs within ±100 kb were plotted as in (F). (H) Relationships between tissue-specificity and the number of TLE-bound CRMs per gene. Bars represent percentages of three categories of genes from zero to more than three TLE-bound CRMs. P values, statistic significances calculated by the chi-square test between two adjacent groups.
To further investigate relationships between CRMs and gene expression patterns, we next performed RNA-seq analysis with whole and dissected gastrula embryos (st. 10.5) (Supplementary Fig. S7A–D). Quality of dissection experiments was validated by RT-qPCR before sequencing (Supplementary Fig. S7E–G). RNA-seq short reads were mapped to the Xtev gene models.26 We categorized them into three groups: (i) low or no expression (3,392 genes), (ii) tissue-specific (6,562 genes), and (iii) ubiquitous (4,299 genes) (see Supplementary Fig. S7A–D for definition). We then assigned previously identified CRMs to these genes. Because of the limitation of embryo dissection techniques, the ubiquitous group included relatively few tissue-specific genes, but the tissue-specific group did not include ubiquitous genes. Therefore, these categories were practical enough to use for the comprehensive analysis of genes as shown below.
When we assigned each CRM to the nearest TSS, a large fraction of p300+TLE-CRMs (65%) were located within 1 kb of the nearest TSS (Fig. 4C). In contrast, only 20% of p300+TLE+ CRMs occurred within that range, and only 5% of p300-TLE+ CRMs (Fig. 4C). Furthermore, p300+TLE- CRMs within 1 kb of a TSS were strongly associated with ubiquitous genes (P=9.01E-34, chi-square test), whereas other p300+TLE- CRMs occurred almost randomly (Supplementary Fig. S8). p300+TLE+ and p300-TLE+ CRMs (that is, TLE+ CRMs) were strongly associated with tissue-specific genes as long as CRMs were located within ±100 kb of their TSS [P = 1.05E-28 (p300+TLE+) and P = 5.27E-26 (p300-TLE+) for CRMs within 30–100 kb of TSS, chi-square test, see Supplementary Fig. S8 for details]. These results suggest that p300+TLE- CRMs mainly regulate expression of ubiquitous genes close to the TSS and that p300+TLE+ and p300-TLE+ CRMs regulate expression of tissue-specific genes farther from the TSS (Fig. 4D). In other words, our data indicate that p300 occupancy alone does not predict tissue-specific CRMs and that TLE occupancy better predicts tissue-specific CRMs. Therefore, we assigned TLE-bound CRMs to the nearest TSS within a distance of 100 kb and focused on the relationship between TLE and function of tissue-specific CRMs.
We next investigated epigenetic states around p300 or TLE bound CRMs. As expected, p300-bound CRMs (p300+TLE+ and p300+TLE-) showed strong association with the active enhancer mark H3K27ac, whereas TLE showed an inverse association with it (Fig. 4E). Conversely, the repressive histone mark H3K27me3 was associated with TLE-bound CRMs (p300+TLE+ and p300-TLE+ CRMs) more strongly than with TLE-unbound CRMs (p300+TLE- CRMs) (Fig. 4F). At first inspection, it might appear problematic that z-scores of H3K27me3 around TLE-bound CRMs were much smaller than those of H3K27ac around p300-bound CRMs (Fig. 4E,F). However, because the z-score is defined as how many standard deviations a datum is from the mean, these relatively low z-scores of H3K27me3 were due to larger standard deviations of H3K27me3 (8.55-fold of its mean), compared to those of H3K27ac (2.13-fold of its mean). These relatively high standard deviations resulted from H3K27me3’s strongly biased distribution, as noted in previous reports26,33 and exemplified by its high concentrations on ~5% of CRMs (see cluster 2 in Supplementary Fig. S9). Even so, a relatively greater enrichment of H3K27me3 was also observed around TSSs of genes with TLE-bound CRMs (Fig. 4G). This is in good agreement with a previous report that enrichment of H3K27me3 around TSSs is associated with spatially regulated genes.26 We next examined the number of CRMs per gene within ±100 kb of the TSS and found that genes with more TLE-bound CRMs exhibit higher tissue-specificity (Fig. 4H) and higher enrichment of H3K27me3 around TSS (Fig. 4I). These data suggest the possibility that TLE-bound CRMs function as silencers and that multiple TLE CRMs regulate tissue-specific genes.
Combinatorial roles of Otx2, Lim1, and Gsc on TLE-marked CRMs
As mentioned above, we hypothesized that in the head organizer, in concert with Lim1 and Gsc, Otx2 activates and represses target genes, respectively. Because Otx2 target genes are supposedly tissue-specific, we focused on Otx2/TLE-occupied CRMs to which Lim1 or Gsc are bound. Motif finding analyses were performed using Otx2/Lim1/TLE-occupied CRMs (3,066) and Otx2/Gsc/TLE-occupied CRMs (3,207), which only partially overlapped (Fig. 5A). We found that bicoid-binding motifs [TAATC(C/T)],6,7,11,27 which are bound by Otx2 and Gsc, and P3C motifs [bicoid/paired type homo- or heterodimer-binding motifs (TAATCNNATTA)]28 were enriched in Otx2/Gsc/TLE CRMs (Fig. 5A, Supplementary Fig. S10). However, bicoid-binding motifs were not enriched in Otx2/Lim1/TLE CRMs, despite the bound Otx2. In contrast, Lim1-binding motifs [(C/T)TAAT(G/T)(G/A)]6,11,27 were enriched in Otx2/Lim1/TLE CRMs, but not in Otx2/Gsc/TLE CRMs (Fig. 5A, Supplementary Fig. S10). These data suggest that Lim1-, but not Otx2-binding motifs are required for function of Otx2/Lim1/TLE CRMs, and that bicoid-binding motifs or P3C motifs are required for function of Otx2/Gsc/TLE CRMs. These observations were confirmed by two different algorithms, MEME-ChIP and Weeder (Supplementary Fig. S10). We also noticed that the average number of Otx/Gsc binding motifs in Otx2/Gsc/TLE CRMs is 1.24 motifs/CRM, which is significantly higher than the average in Otx2/TLE-occupied CRMs (0.70 motifs/CRM, P=2.5E-107, chi-square test). This implies that CRMs with multiple Otx/Gsc-binding motifs are likely to function by binding both Otx2 and Gsc. Therefore, we further defined Otx2/Lim1/TLE CRMs containing Lim1-binding motifs as “type I” CRMs (1,250 CRMs), and Otx2/Gsc/TLE CRMs containing multiple bicoid sites or P3C sites as “type II” CRMs (1,039 CRMs) (Fig. 5B). We anticipate that genes with type I CRMs are activated in the head organizer, while those with type II CRMs are repressed (Fig. 5B).
Fig. 5. Characterization of type I and type II CRMs.
(A) Motif discovery analysis. Typical motifs enriched in Otx2/Lim1/TLE (3,066) and Otx2/Gsc/TLE (3,207) CRMs are shown with E-values. Algorithms used for motif discovery are indicated with parentheses (see Supplementary Fig. S10 for details). (B) Definition of type I and type II CRMs. The number of genes assigned with type I or type II CRMs alone is shown in parentheses. (C, D) Sequential ChIP assay. Embryos injected with mRNA encoding FLAG-tagged Lim1 (C) or Gsc (D) were used. qPCR data for 1st ChIP (anti-FLAG) shows significant enrichment of type I or type II CRMs bound by FLAG-tagged Lim1 or Gsc, respectively, compared with the negative control region (acta1-promoter). qPCR data for 2nd ChIP shows significant enrichment of each CRM by anti-Otx2 compared with the negative control (normal IgG). Bar graphs show %recovery of DNA ± s.d. *, P<0.05; **, P<0.01; ***, P<0.001 (t-test, two-tailed). (E, F) Epigenetic analysis. Enhancer histone marks, H3K4me1 and H3K27ac, were enriched around type I CRMs. RNA polymerase II (RNAP2) was also enriched at the midpoint of type I CRMs (E). However, these marks were relatively poor around type II CRMs (F).
To ensure that Otx2 interacts with Lim1 or Gsc on type I or type II CRMs, respectively, we performed sequential ChIP assays (Fig. 5C,D). For this assay, we analysed interactions between endogenous Otx2 proteins and exogenous Lim1 or Gsc, because %recoveries of target CRMs by ChIP-qPCR using Lim1 and Gsc antibodies were fairly low compared to those using Otx2 antibodies (see Supplementary Fig. S4A–C). mRNA for FLAG-tagged Lim1 or Gsc was injected into the dorsal equatorial region of 4-cell X. laevis embryos. Chromatin from gastrula embryos was first immunoprecipitated with anti-FLAG antibody, followed by a second ChIP with either anti-Otx2 antibody or pre-immune rabbit IgG (negative control). qPCR for selected type I or II CRMs, the sequences of which, including binding motifs, are largely conserved in X. tropicalis and X. laevis, demonstrated that FLAG-tagged Lim1 was associated with endogenous Otx2 protein on type I CRMs and that FLAG-tagged Gsc was associated with Otx2 on type II CRMs (Fig. 5C,D). These results suggest that Lim1 and Gsc can make a complex with Otx2 on CRMs, further supporting our combinatorial regulatory model (Fig. 5B).
Consistent with our expectations, epigenetic analysis of CRMs revealed that type I CRMs, but not type II CRMs, are closely associated with H3K4me1, H3K27ac, and RNAP2 (Fig. 5E,F), suggesting that type I CRMs tend to function as enhancers. However, it should be noted that our definition of types I and II are not mutually exclusive. Some CRMs (141) were categorized as both types I and II, e.g., gsc-U1. This kind of CRM might be activated by Lim1 and repressed by Gsc, as has been shown for gsc-U16 (see Fig. 2A).
Reporter analysis for type I and type II CRMs
We next examined responsiveness of individual type I and type II CRMs toward Otx2, Lim1, and Gsc, using reporter assays in which a reporter gene and the “head organizer cocktail” (see Fig. 1C) were coinjected in the animal pole region. The head organizer cocktail activated 15 of 16 luciferase reporter genes harbouring type I CRMs derived from nine head organizer genes and three trunk genes (Fig. 6A). To examine repressive activity of type II CRMs, we used SV40 late promoter for raising basal transcriptional activity (see Methods for more details of reporter constructs). In this assay, the same cocktail repressed the activity of reporter genes harbouring eight of nine type II CRMs derived from six trunk genes and three head organizer genes. These data suggest that type I and type II CRMs function as enhancers and silencers, respectively, in the head organizer, irrespective of their component genes, justifying our two-CRM classification. We next examined the dependency of responsiveness on binding motifs. Reporter constructs with mutated Lim1 or Otx2/Gsc binding motifs in CRMs exhibited significantly reduced responsiveness (Fig. 6B). Thus, reporter analysis of individual CRMs strongly supported the classification of type I and type II CRMs (Fig. 5B).
Fig. 6. Enhancer or silencer activities of type I and type II CRMs in X. laevis embryos.
(A) Luciferase reporter assay for type I and type II CRMs isolated from various genes. (B) Mutation analysis for binding motifs. Responses to the head organizer cocktail were compared between wild type (WT) and mutant (mt) CRMs as indicated. Mutated sequences of binding motifs are shown on the right. (C, D) Luciferase reporter assay for 4–9 kb upstream regions of head organizer genes (C) and trunk genes (D). Deletion constructs for each CRM were examined as indicated. Type I and type II CRMs are shown with orange and magenta boxes, respectively. For the not ΔU1 construct, the inserted SV40 promoter is indicated as a grey box (see Methods for details). Bars represent the mean ± s.e.m. of luciferase activities relative to the mean value obtained from a reporter without the head organizer cocktail. These results show that the head organizer cocktail activates and represses target genes through type I and type II CRMs, respectively, in a binding motif dependent manner. Basically, the results from an experiment are shown for each reporter construct except for not-U2 in (A) and chordin upstream region constructs in (C), in which the results of two independent experiments were combined. *, P<0.05; **, P<0.01 (t-test, two-tailed).
As mentioned above, head and trunk organizer genes have multiple CRMs, which may be type I, or type II, or both (Figs 2A and 3). To examine whether type I and type II CRMs participate in regulation of target genes, and to determine how type I and type II CRMs interact with each other, when coexisting in the same genes, we prepared reporter constructs connected with 4–9 kb upstream regions, for chordin, otx5, wnt8a, and not, which possess 3–4 TLE-bound CRMs (Fig. 6C,D). Results of reporter analyses with a series of deletion constructs showed that type I CRMs in chordin and otx5 reporter constructs are necessary for the head organizer cocktail to activate the luciferase reporter gene (Fig. 6C). Furthermore, in otx5, two type I CRMs, U1 and U2, functioned redundantly, and the type II CRM, U3, functioned as a silencer. Because Otx5 has basically the same activity as Otx2, U3 is likely to play a role in autorepression, as has been shown for gsc.6,29 Concerning trunk organizer genes, type II CRMs in wnt8a (U1) and not (U2) reporter constructs are necessary for the head organizer cocktail to repress the luciferase reporter gene (Fig. 6D). Deletion of type I wnt8a-U4 enhanced repression by the head organizer cocktail, whereas deletion of type I not-U3 did not, consistent with the high activity of wnt8a-U4 and near inactivity of not-U3 in reporter assays using a single copy of type I CRMs (see Fig. 6A). These data suggest that type I or type II CRM independently activates or represses target genes, respectively, in the head organizer context.
Type I and type II CRM function in the head organizer
To test whether type I and type II of CRMs are correlated with gene expression in the embryo, we examined expression profiles in lim1/otx2/otx5 morphants and gsc morphants using RNA-seq analysis. We categorized all expressed genes into up- or down-regulated genes in morphants compared with two controls, uninjected embryos and standard control MO-injected embryos (control morphants). Among these categories, Otx2/TLE, Lim1/TLE, and type I CRMs were enriched around down-regulated genes in lim1/otx2/otx5 morphants (Supplementary Fig. S11A). Similarly, Gsc and Gsc/TLE were enriched around both up- and down-regulated genes in gsc morphants, and type II CRMs were enriched around those up-regulated genes (Supplementary Fig. S11B). These results suggested that Otx2, Lim1 and Gsc CRMs including type I and type II CRMs were functionally correlated with proximal gene expression.
We further examined type I CRMs functions in the head organizer. Utilizing RNA-seq results of dissected head organizer regions (Supplementary Fig. S7B), we identified 87 genes that were enriched >5-fold in the head organizer region compared to other embryonic regions. We called them “head organizer genes,” because the list included all 23 known head organizer genes and 5 dorsal endodermal genes, but did not include trunk organizer, pan-mesodermal, pan-endodermal, intermediate mesodermal, or neural ectodermal genes (Supplementary Table S3). Therefore, the list appears to represent a complete compilation of head organizer genes. RNA-seq analysis using lim1/otx2/otx5 morphants showed that expression levels of the head organizer genes possessing type I CRMs were significantly reduced in lim1/otx2/otx5 morphants (Fig. 7A) compared with control morphants (P = 0.0068), and also tended to be lowered than those of genes lacking type I CRMs (P = 0.054). This result suggests that head organizer genes possessing type I CRMs are positively regulated by Lim1 and Otx2 in vivo, possibly through type I CRMs.
Fig. 7. Requirement of type I and type II CRMs for gene activation and repression in the head organizer.
(A) Head organizer genes with and without type I CRMs. Gene expression levels analysed by RNA-seq showing log2 ratios of injected embryos versus uninjected controls. Expression levels in control or lim1/otx2/otx5 morphants were analysed at the mid gastrula stage (st 11.5). (B) Trunk genes with and without type II CRMs. Expression levels in control or gsc morphants were analysed at the mid gastrula stage (st. 11). Expression levels were compared by chi-square test and P-values are shown. The results indicate that Lim1/Otx2/Otx5 upregulate head organizer genes with type I CRMs and that Gsc downregulate trunk genes with type II CRMs.
Finally, we examined type II CRM functions in trunk genes, which are supposed to be repressed in the head organizer. We identified 181 “trunk genes” (Supplementary Table S4) that were enriched >3-fold in the marginal zone (Supplementary Fig. S7C) and were not included among the 87 head organizer genes. Trunk genes include 68 known posterior mesodermal genes such as wnt8a, vegt, and brachyury. RNA-seq results showed that trunk genes possessing type II CRMs were significantly up-regulated in gsc morphants (Fig. 7B), compared with control morphants (P = 0.0030) and with trunk genes lacking type II CRMs (P = 0.00034). This result supports the idea that trunk genes possessing type II CRMs are negatively regulated by Gsc, possibly through type II CRMs.
Discussion
We have shown that the corepressor TLE/Groucho identifies tissue-specific CRMs in the embryo. This is paradoxical, but it makes sense because activating a certain gene in a specific region results in its repression elsewhere. Similarly, it was recently reported that most regions bound by PRC2 subunits Ezh2 and Jarid2 had the enhancer histone mark H3K4me1 in Xenopus embryos.33 Furthermore, we have shown the importance of multiple TLE-marked CRMs for tissue-specific expression. This finding is reminiscent of a recently reported “super-enhancer” in which clusters of enhancers are occupied by the master TFs and the Mediator co-activator complex to regulate “cell identity genes”.34 If up and down-regulation are “two sides of the same coin,” repression of cell identity genes requires TLE to bind to each CRM in the super-enhancer. Thus, TLE is a very useful marker for tissue-specific CRMs in comprehensive genome-wide analysis.
Utilizing TLE as a tissue-specific CRM marker, we were able to identify type I and type II CRMs (Fig. 5A,B) that activate or repress a large battery of genes, including head organizer and trunk genes (Figs 6 and 7). We estimated the number of Otx2 target genes that have type I or type II alone, or both, to be 584, 493, or 265, respectively, in the Xenopus gastrula (Fig. 5B). Based on these data, we propose a regulatory principle, in which the selector, Otx2, acts as a ‘molecular landmark’ of the head organizer by positively or negatively regulating hundreds of target genes in cooperation with Lim1 or Gsc, through type I or type II CRMs, respectively (Supplementary Fig. S1B). This regulatory principle may also apply to other bilaterians because of evolutionary conservation of Otx/Otd in head formation,2 but activators and repressors as partner TFs of Otx can be different from organism to organism, depending on different developmental processes for diverse bilaterian head structures.
ChIP-chip analyses of a Drosophila Hox protein, Ubx, using the haltere disc have identified Ubx peaks near 1,147 – 3,400 genes, which were either activated or repressed by Ubx.35,36 In addition, whole genome ChIP-chip analyses of Dorsal, Twist, and Snail in Drosophila, have also shown similar results,37,38 in which these TFs appeared to exert positive/negative regulation of hundreds to thousands of target genes for dorsoventral patterning, anteroposterior patterning, and mesoderm differentiation. Our ChIP-seq analyses for Otx2, Lim1, and Gsc in early vertebrate embryos are consistent with work in Drosophila. Moreover, our data highlight a general gene regulatory principle, that though the selector TF itself appears to have dual functions, selector TFs may only mark target genes to be regulated. Then partner TFs may activate or repress selector-marked target genes, depending on the biological context.
Reporter assays for type I and type II CRMs showed that Lim1 and Otx2/Gsc binding motifs in CRMs are important to regulate genes (Fig. 6B). Curiously, even though Otx2 binds to type I CRMs, 56.4% of type I CRMs do not contain typical Otx2 binding motifs, suggesting that Otx2 may bind to atypical motifs or function as a co-activator for Lim1 without directly binding to DNA. This possibility is supported by sequential ChIP assays, which confirmed that Otx2 complexes with Lim1 on three type I CRMs that do not contain typical Otx2-binding motifs (Fig. 5C). Because some homeodomain proteins, such as Gsc, Pbx1, and Meis1,39,40 can function without direct DNA binding, Otx2 may also function similarly. By contrast, for repressing target genes through type II CRMs, Otx2 probably needs to bind to bicoid sites. The switching mechanism between activator and repressor activities of Otx2 could be uncovered by deeper ChIP-seq and RNA-seq analyses in future.
Methods
Microinjection experiments in Xenopus embryos
Xenopus tropicalis and X. laevis fertilized eggs were dejellied and injected with MOs and mRNAs. For X. tropicalis embryos, MOs were injected at the animal pole of both blastomeres at the two-cell stage, because MOs diffuse throughout the blastomere to knock down target genes in the entire embryo. Phenotypes of morphants were scored at the tailbud stage. Some of them were subjected to sagittal sections stained with haematoxylin and eosin. MOs were purchased from Gene Tools. MO sequences are as follows (antisense start codons are underlined):
standard control MO, CCTCTTACCTCAGTTACAATTTATA;
lim1 MO, TTTCGCATCCAGCACAGTGAACCAT;
otx2 MO, GGTTGCTTGAGATAAGACATCATGC;
otx5 MO, AATGTAGGACATCATTTCAGTGGCC;
gsc MO, GCTGAACATGCCAGAAGGCATCACC.21
For RNA-seq with loss-of-function experiments, a cocktail of lim1, otx2, otx5 MOs, standard control MO or gsc MO was injected at 0.75 pmol/embryo each, 2.25 pmol/embryo, or 2.25 pmol/embryo, respectively. For secondary head formation, 25 pg of lim1, ldb1 and ssbp3, 20 pg of otx2 and tle1, or 10 pg of gsc mRNAs were injected in various combinations into single blastomeres in the ventral-equatorial region of four-cell X. laevis embryos. For animal cap assays, mRNAs for lim1, ldb1, ssbp3, otx2, gsc and tle1 (25, 25, 25, 20, 10 and 20 pg/embryo, respectively) or gfp (150 pg/embryo) were injected into the animal pole region of both blastomeres at the two cell stage. Animal cap assays were performed as described.41
Whole mount in situ hybridization and immunostaining
Whole mount in situ hybridization was carried out as described.42 For hemisections, rehydrated embryos were embedded in 2% low-melting point agarose with 0.3 M sucrose in 1 x PBS. Agarose-mounted embryos were bisected and refixed as described.11 Digoxigenin-labelled antisense RNA probes were transcribed from linearized plasmids containing full coding sequences of X. tropicalis genes except for gsc whose sequence is from X. laevis (92% identical with that of X. tropicalis). Whole mount immunostaining with MZ-15 and 12/101 were performed as described.41,43
qPCR
Quantitative PCR (qPCR) was performed using SYBR Green. Primer sequences for ChIP-qPCR and RT-qPCR are described in Supplementary Tables S5 and S6, respectively. For ChIP-qPCR using X. laevis embryos, PCR amplicons and primer sets were determined according to sequence conservation in X. tropicalis and X. laevis, using genome data on Xenbase (http://www.xenbase.org/common/). For RT-qPCR using X. laevis and X. tropicalis, relative expression levels of genes were calculated after normalization with the expression level of ef1α.
Antibodies
Anti-Lim1 and anti-Otx2 antibodies were raised and affinity-purified.11 Anti-p300 and anti-pan-TLE antibodies were purchased from Santa Cruz (sc-585 and sc-13373), and anti-H3K4me1 and anti-H3K27ac antibodies were purchased from Abcam (ab8895 and ab4729). Specificity of anti-Otx2 antibody was validated by immunoprecipitation and western blotting with embryos injected with otx2 mRNA at 500 pg /embryo (Supplementary Fig. S4F). Gsc antibody was raised against the N-terminus (167 aa) of the protein, and affinity-purified. Specificity of anti-Gsc antibody was validated by western blotting with embryos injected with FLAG-gsc mRNA at 250 pg /embryo (Supplementary Fig. S4G). The X. tropicalis p300 C-terminal sequence is 85% identical to the epitope derived from human p300 (17/20 aa), which includes a 10-aa match at the C-terminus. The epitope of anti-pan-TLE antibody is 100%, 89.4% and 100% identical to the C-terminus of X. tropicalis TLE1, TLE2/3, and TLE4, respectively.
ChIP-seq analysis
Chromatin extraction from X. tropicalis embryos was performed as previously described,11,44 with slight modifications. Briefly, X. tropicalis gastrula embryos were fixed at 25° C with 1% formaldehyde in 1x PBS for 30 min, further incubated for 10 min after addition of 1/9 volume of 1.25 M glycine, and rinsed with 10 mM Tris-HCl (pH 7.5) or 1x PBS twice. Fixed embryos were homogenized with a Dounce homogenizer in 2.2 M sucrose, 10 mM Tris-HCl (pH 7.5), 3 mM CaCl2, 0.5% Triton X-100, proteinase inhibitor cocktail (50 µg/ml aprotinin, 20 µg/ml pepstatin A, 40 µg/ml leupeptin, 1 mM PMSF). Homogenates were centrifuged at 4°C for 3 hr at 8,000 x g. Precipitated nuclei were rinsed twice with 250 mM sucrose, 10 mM Tris-HCl (pH 7.5), 3 mM CaCl2, and proteinase inhibitors. Nuclei were sonicated in Lysis Buffer 3 (Agilent Mammalian ChIP-on-chip Protocol) with the protease inhibitor cocktail, shearing chromatin into 100–200 bp fragments. Sheared chromatin from 2000–3000 X. tropicalis gastrula embryos (~stage 10.5) was immunoprecipitated with 30 µg of antibodies and 300 µl of Dynabeads protein A (Invitrogen), according to the Agilent protocol. ChIP efficiency was ≥3-fold higher than for a control genomic region, as determined by qPCR (Supplementary Fig. S4A–E). For ChIP-qPCR assay using bisected embryos, approximately 1,000 fixed early gastrula embryos were bisected into dorsal and ventral halves. Sheared chromatin from dorsal or ventral halves was immunoprecipitated with 10 µg of p300 or TLE antibodies. ChIP DNA and input control DNA were sequenced as previously described.45,46 Sequence tags (36 bases) were mapped to the X. tropicalis genome, Joint Genome Institute, assembly version 4.1, allowing up to two mismatches. Mapped tags were computationally extended to the average DNA fragment size used for sequencing (120 bases). Using seqMINER,47 the mapping data were subjected to k-means clustering and visualized as heatmaps. A genome representation was generated using the UCSC genome browser. To detect binding peaks, enrichment against an input control (IP/WCE) was calculated using tag concentrations per genome position.45 For calculations of z-scores and Pearson correlation coefficient, the average enrichment data of each TF in 200 bp bins were used. The definition of ChIP-seq peaks was more than 5- (Lim1 and Gsc) or 10-fold (Otx2, p300, and TLE) enrichment (over control) and more than 120 bases in length (see Supplementary Table S1). Because of the low efficiency of ChIP experiments with anti-Lim1 and anti-Gsc antibodies (see Supplementary Fig. S4A,C), the peak detection threshold was set at 5-fold for Lim1 and Gsc (FDR<0.01, see Supplementary Table S1). Our peak detection algorithm was compared with MACS (http://liulab.dfci.harvard.edu/MACS/) (Supplementary Table S2). Peaks that shared >120 bases were considered overlapping. Consecutively overlapping peaks were combined as a single CRM spanning the overlapping peaks. The number of Lim1 or bicoid/P3C binding motifs was counted for Lim1 or Otx2 peaks in CRMs. If the TSS occurred within the CRM, the distance was considered zero. Motifs enriched in ChIP-seq peaks and CRMs were analysed by MEME-ChIP48 and Weeder49 with settings to consider both strands, to find one or more motifs in some sequences, and to find the motif width from 6 to 15 for the MEME algorithm.
RNA-seq analysis
Total RNA was extracted with ISOGEN (Wako) from ~10 embryos that had been injected with MO, or from a set of dissected tissues from early gastrula embryos (see Supplementary Fig. S7B–D). RNA-seq was performed as previously described.45,50 Sequences were mapped to the 14,253 Xtev gene models26 and two JGI gene models, e_gw1.925.59.1 and gw1.925.54.1 (siamois2/twin and siamois3, respectively), were used to calculate Reads Per Kilobase of exon per Million mapped reads (RPKM) for each gene model. RPKM values were used for comparisons of expression levels of each gene.
Sequence data
Short-read sequence data in this paper are registered in DDBJ Sequence Read Archives under accession nos. DRA000505-000518, 000573-000577, 000642-000649, 000732, 001093-001095. The peak data and wiggled data are available at: http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Table_CRMs_ChIP-seq_peaks_Xenopus_gastrula.xlsx, http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_Lhx1-ChIP_fold.tar.gz, http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_Otx2-ChIP_fold.tar.gz, http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_Gsc-ChIP_fold.tar.gz, http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_P300-ChIP_fold.tar.gz, http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_TLE-ChIP_fold.tar.gz, http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_H3K4me1-ChIP.tar.gz, and http://dbtss.hgc.jp/cgi-bin/downloader2.cgi?UID=7/Xenopus_gastrula_H3K27ac-ChIP.tar.gz.
Sequential ChIP assay
For sequential ChIP assays, we fixed X. laevis gastrula embryos as described above and extracted chromatin as follows. Fixed embryos were homogenized with a Dounce homogenizer in Lysis Buffer 1 (Agilent Protocol) with proteinase inhibitor cocktail (complete, Roche), and rocked at 4°C for 10 min. Homogenates were centrifuged at 4°C for 5 min at 1,500 x g. Precipitated nuclei were resuspended in Lysis Buffer 3 (Agilent Protocol) with proteinase inhibitor cocktail and sonicated. Sheared chromatin was obtained from 300 X. laevis gastrula embryos (~stage 10.5) injected with 250 pg of mRNA encoding C-terminally FLAG-tagged Lim1 or N-terminally FLAG-tagged Gsc into two blastomeres of the dorsal equatorial region at the four cell stage. Chromatin was first immunoprecipitated with 100 µl of anti-FLAG M2 magnetic beads (Sigma, M8823) as described above, except that chromatin was eluted in 300 ul of TE with 30 mM DTT, 500 mM NaCl, and 0.1% SDS at 37°C for 30 min as previously described.51 100 µl of eluted chromatin was kept as a first ChIP sample. The other 100 µl of eluted chromatin was diluted 10-fold with Lysis Buffer 3 (Agilent Protocol) containing 1% Triton X-100 and subjected to a second immunoprecipitation with 10 µg of anti-Otx2 antibody or normal rabbit IgG (CST, #2729). Isolated DNA was purified and examined by qPCR.
Reporter constructs and luciferase assay
Luciferase reporter constructs for type I CRMs were made by inserting a genomic fragment into the pGL4.23 vector (Promega), which has an artificial, minimal promoter. To examine type II CRMs for gene repression, the minimal promoter was replaced with the SV40 late promoter. Because wnt8a-U1 includes original promoter sequences, the wnt8a-U1 genomic fragment was inserted into the pGL4.23 vector. To make reporter constructs shown in Fig. 6C,D, 5’ upstream regions from start codons of genes were connected to the start codon of the luciferase reporter gene. To make ΔU1 constructs (Fig. 6D), the previously reported minimal promoter sequence10 was left for the wnt8a construct, whereas SV40 promoter was inserted for the not construct because not-U1 overlaps the start codon and its minimal promoter sequence was unclear. Luciferase assays using X. laevis embryos were performed as previously described.7,41 Reporter plasmid DNA (50 pg/embryo) with mRNAs for lim1, ldb1, ssbp3, otx2, gsc and tle1 (25, 25, 25, 20, 10 and 20 pg/embryo, respectively) were injected together into the animal pole region of both blastomeres at the two-cell stage. Luciferase activity was analysed at the gastrula stage (st 10.5–11).
Supplementary Material
Acknowledgments
We thank Kiyomi Imamura and Kazumi Abe for sequencing analysis, Etsuko Sekimori and Terumi Horiuchi for bioinformatics analysis, and Yuuki Goya, Hiroshi Mamada, and Ichiro Hiratani for cloning of Xenopus ssbp3. Akiko Suga kindly provided pCSf107mT-XLdb1. X. tropicalis were obtained from the National Bioresource Project. We performed bioinformatics analysis using the supercomputer in the Human Genome Center, Institute of Medical Science, University of Tokyo. We thank Xi He for valuable discussions and Igor Dawid for critical reading of the manuscript. We thank Steven D. Aird for technical editing of the manuscript. This project was supported in part by Grants-in-Aid for Scientific Research from the MEXT (Ministry of Education, Culture, Sports, Science and Technology of Japan) (M. T.), by Grant-in-Aid for Scientific Research on Innovative Areas “Genome Science” from the MEXT (M. T., Y. S., and S. S.), by the Japan Society for the Promotion of Science ( Y. Y. , JSPS research fellow), and by Global COE Program (Integrative Life Science Based on the Study of Biosignaling Mechanisms) from the MEXT (N. S.) and by NIH (K.C.).
Footnotes
Author contributions
YY contributed to all experiments, analysed the data, and wrote the paper. YS and SS performed and analysed massively parallel sequencing experiments and YS helped draft the manuscript. ST and YH designed and performed experiments with X. tropicalis embryos. HS performed experiments with X. laevis embryos. NS and KC generated antibodies and KC also helped draft the manuscript. MA conducted X. tropicalis experiments. MT supervised the study and helped write the manuscript. All authors discussed results and commented on the manuscript.
References
- 1.Arendt D, Nubler-Jung K. Common ground plans in early brain development in mice and flies. Bioessays. 1996;18:255–259. doi: 10.1002/bies.950180314. [DOI] [PubMed] [Google Scholar]
- 2.Reichert H, Simeone A. Developmental genetic evidence for a monophyletic origin of the bilaterian brain. Philos Trans R Soc Lond B Biol Sci. 2001;356:1533–1544. doi: 10.1098/rstb.2001.0972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Heasman J. Patterning the early Xenopus embryo. Development. 2006;133:1205–1217. doi: 10.1242/dev.02304. [DOI] [PubMed] [Google Scholar]
- 4.Lemaire P, Kodjabachian L. The vertebrate organizer: structure and molecules. Trends Genet. 1996;12:525–531. doi: 10.1016/s0168-9525(97)81401-1. [DOI] [PubMed] [Google Scholar]
- 5.Koide T, Hayata T, Cho KW. Xenopus as a model system to study transcriptional regulatory networks. Proc Natl Acad Sci U S A. 2005;102:4943–4948. doi: 10.1073/pnas.0408125102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mochizuki T, et al. Xlim-1 and LIM domain binding protein 1 cooperate with various transcription factors in the regulation of the goosecoid promoter. Dev Biol. 2000;224:470–485. doi: 10.1006/dbio.2000.9778. [DOI] [PubMed] [Google Scholar]
- 7.Yamamoto S, Hikasa H, Ono H, Taira M. Molecular link in the sequential induction of the Spemann organizer: direct activation of the cerberus gene by Xlim-1, Xotx2, Mix.1, and Siamois, immediately downstream from Nodal and Wnt signaling. Dev Biol. 2003;257:190–204. doi: 10.1016/s0012-1606(03)00034-4. [DOI] [PubMed] [Google Scholar]
- 8.Artinger M, Blitz I, Inoue K, Tran U, Cho KW. Interaction of goosecoid and brachyury in Xenopus mesoderm patterning. Mech Dev. 1997;65:187–196. doi: 10.1016/s0925-4773(97)00073-7. [DOI] [PubMed] [Google Scholar]
- 9.Latinkic BV, et al. The Xenopus Brachyury promoter is activated by FGF and low concentrations of activin and suppressed by high concentrations of activin and by paired-type homeodomain proteins. Genes Dev. 1997;11:3265–3276. doi: 10.1101/gad.11.23.3265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yao J, Kessler DS. Goosecoid promotes head organizer activity by direct repression of Xwnt8 in Spemann’s organizer. Development. 2001;128:2975–2987. doi: 10.1242/dev.128.15.2975. [DOI] [PubMed] [Google Scholar]
- 11.Sudou N, Yamamoto S, Ogino H, Taira M. Dynamic in vivo binding of transcription factors to cis-regulatory modules of cer and gsc in the stepwise formation of the Spemann--Mangold organizer. Development. 2012;139:1651–1661. doi: 10.1242/dev.068395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Acampora D, et al. Forebrain and midbrain regions are deleted in Otx2−/−mutants due to a defective anterior neuroectoderm specification during gastrulation. Development. 1995;121:3279–3290. doi: 10.1242/dev.121.10.3279. [DOI] [PubMed] [Google Scholar]
- 13.Matsuo I, Kuratani S, Kimura C, Takeda N, Aizawa S. Mouse Otx2 functions in the formation and patterning of rostral head. Genes Dev. 1995;9:2646–2658. doi: 10.1101/gad.9.21.2646. [DOI] [PubMed] [Google Scholar]
- 14.Rivera-Perez JA, Mallo M, Gendron-Maguire M, Gridley T, Behringer RR. Goosecoid is not an essential component of the mouse gastrula organizer but is required for craniofacial and rib development. Development. 1995;121:3005–3012. doi: 10.1242/dev.121.9.3005. [DOI] [PubMed] [Google Scholar]
- 15.Shawlot W, Behringer RR. Requirement for Lim1 in head-organizer function. Nature. 1995;374:425–430. doi: 10.1038/374425a0. [DOI] [PubMed] [Google Scholar]
- 16.Ang SL, et al. A targeted mouse Otx2 mutation leads to severe defects in gastrulation and formation of axial mesoderm and to deletion of rostral brain. Development. 1996;122:243–252. doi: 10.1242/dev.122.1.243. [DOI] [PubMed] [Google Scholar]
- 17.Vignali R, et al. Xotx5b, a new member of the Otx gene family, may be involved in anterior and eye development in Xenopus laevis. Mech Dev. 2000;96:3–13. doi: 10.1016/s0925-4773(00)00367-1. [DOI] [PubMed] [Google Scholar]
- 18.Kuroda H, Hayata T, Eisaki A, Asashima M. Cloning a novel developmental regulating gene, Xotx5: its potential role in anterior formation in Xenopus laevis. Dev Growth Differ. 2000;42:87–93. doi: 10.1046/j.1440-169x.2000.00491.x. [DOI] [PubMed] [Google Scholar]
- 19.Blitz IL, Cho KW. Anterior neurectoderm is progressively induced during gastrulation: the role of the Xenopus homeobox gene orthodenticle. Development. 1995;121:993–1004. doi: 10.1242/dev.121.4.993. [DOI] [PubMed] [Google Scholar]
- 20.Pannese M, et al. The Xenopus homologue of Otx2 is a maternal homeobox gene that demarcates and specifies anterior body regions. Development. 1995;121:707–720. doi: 10.1242/dev.121.3.707. [DOI] [PubMed] [Google Scholar]
- 21.Sander V, Reversade B, De Robertis EM. The opposing homeobox genes Goosecoid and Vent1/2 self-regulate Xenopus patterning. Embo J. 2007;26:2955–2965. doi: 10.1038/sj.emboj.7601705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Agulnick AD, et al. Interactions of the LIM-domain-binding factor Ldb1 with LIM homeodomain proteins. Nature. 1996;384:270–272. doi: 10.1038/384270a0. [DOI] [PubMed] [Google Scholar]
- 23.Chen L, et al. Ssdp proteins interact with the LIM-domain-binding protein Ldb1 to regulate development. Proc Natl Acad Sci U S A. 2002;99:14320–14325. doi: 10.1073/pnas.212532399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Jimenez G, Verrijzer CP, Ish-Horowicz D. A conserved motif in goosecoid mediates groucho-dependent repression in Drosophila embryos. Mol Cell Biol. 1999;19:2080–2087. doi: 10.1128/mcb.19.3.2080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Puelles E, et al. Otx2 regulates the extent, identity and fate of neuronal progenitor domains in the ventral midbrain. Development. 2004;131:2037–2048. doi: 10.1242/dev.01107. [DOI] [PubMed] [Google Scholar]
- 26.Akkers RC, et al. A hierarchy of H3K4me3 and H3K27me3 acquisition in spatial gene regulation in Xenopus embryos. Dev Cell. 2009;17:425–434. doi: 10.1016/j.devcel.2009.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Noyes MB, et al. Analysis of homeodomain specificities allows the family-wide prediction of preferred recognition sites. Cell. 2008;133:1277–1289. doi: 10.1016/j.cell.2008.05.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wilson D, Sheng G, Lecuit T, Dostatni N, Desplan C. Cooperative dimerization of paired class homeo domains on DNA. Genes Dev. 1993;7:2120–2134. doi: 10.1101/gad.7.11.2120. [DOI] [PubMed] [Google Scholar]
- 29.Ferreiro B, Artinger M, Cho K, Niehrs C. Antimorphic goosecoids. Development. 1998;125:1347–1359. doi: 10.1242/dev.125.8.1347. [DOI] [PubMed] [Google Scholar]
- 30.Hiratani I, Mochizuki T, Tochimoto N, Taira M. Functional domains of the LIM homeodomain protein Xlim-1 involved in negative regulation, transactivation, and axis formation in Xenopus embryos. Dev Biol. 2001;229:456–467. doi: 10.1006/dbio.2000.9986. [DOI] [PubMed] [Google Scholar]
- 31.Chen X, et al. Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell. 2008;133:1106–1117. doi: 10.1016/j.cell.2008.04.043. [DOI] [PubMed] [Google Scholar]
- 32.Visel A, et al. ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature. 2009;457:854–858. doi: 10.1038/nature07730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.van Heeringen SJ, et al. Principles of nucleation of H3K27 methylation during embryonic development. Genome Res. 2013 doi: 10.1101/gr.159608.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Whyte WA, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153:307–319. doi: 10.1016/j.cell.2013.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Choo SW, White R, Russell S. Genome-wide analysis of the binding of the hox protein ultrabithorax and the hox cofactor homothorax in Drosophila. PLoS One. 2011;6:e14778. doi: 10.1371/journal.pone.0014778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Slattery M, Ma L, Negre N, White KP, Mann RS. Genome-wide tissue-specific occupancy of the hox protein ultrabithorax and hox cofactor homothorax in Drosophila. PLoS One. 2011;6:e14686. doi: 10.1371/journal.pone.0014686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Sandmann T, et al. A core transcriptional network for early mesoderm development in Drosophila melanogaster. Genes Dev. 2007;21:436–449. doi: 10.1101/gad.1509007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zeitlinger J, et al. Whole-genome ChIP-chip analysis of Dorsal, Twist, and Snail suggests integration of diverse patterning processes in the Drosophila embryo. Genes Dev. 2007;21:385–390. doi: 10.1101/gad.1509607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Izzi L, et al. Foxh1 recruits Gsc to negatively regulate Mixl1 expression during early mouse development. Embo J. 2007;26:3132–3143. doi: 10.1038/sj.emboj.7601753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Shanmugam K, Green NC, Rambaldi I, Saragovi HU, Featherstone MS. PBX and MEIS as non-DNA-binding partners in trimeric complexes with HOX proteins. Mol Cell Biol. 1999;19:7577–7588. doi: 10.1128/mcb.19.11.7577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Yasuoka Y, et al. Evolutionary origins of blastoporal expression and organizer activity of the vertebrate gastrula organizer gene lhx1 and its ancient metazoan paralog lhx3. Development. 2009;136:2005–2014. doi: 10.1242/dev.028530. [DOI] [PubMed] [Google Scholar]
- 42.Harland RM. In situ hybridization: an improved whole-mount method for Xenopus embryos. Methods in cell biology. 1991;36:685–695. doi: 10.1016/s0091-679x(08)60307-6. [DOI] [PubMed] [Google Scholar]
- 43.Suga A, Hikasa H, Taira M. Xenopus ADAMTS1 negatively modulates FGF signaling independent of its metalloprotease activity. Dev Biol. 2006;295:26–39. doi: 10.1016/j.ydbio.2006.02.041. [DOI] [PubMed] [Google Scholar]
- 44.Kato Y, Habas R, Katsuyama Y, Naar AM, He X. A component of the ARC/Mediator complex required for TGF beta/Nodal signalling. Nature. 2002;418:641–646. doi: 10.1038/nature00969. [DOI] [PubMed] [Google Scholar]
- 45.Yamashita R, et al. Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis. Genome Res. 2011;21:775–789. doi: 10.1101/gr.110254.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Johnson DS, Mortazavi A, Myers RM, Wold B. Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007;316:1497–1502. doi: 10.1126/science.1141319. [DOI] [PubMed] [Google Scholar]
- 47.Ye T, et al. seqMINER: an integrated ChIP-seq data interpretation platform. Nucleic Acids Res. 2011;39:e35. doi: 10.1093/nar/gkq1287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Machanick P, Bailey TL. MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics. 2011;27:1696–1697. doi: 10.1093/bioinformatics/btr189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Pavesi G, Mereghetti P, Mauri G, Pesole G. Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes. Nucleic Acids Res. 2004;32:W199–W203. doi: 10.1093/nar/gkh465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–628. doi: 10.1038/nmeth.1226. [DOI] [PubMed] [Google Scholar]
- 51.Yoon SJ, Wills AE, Chuong E, Gupta R, Baker JC. HEB and E2A function as SMAD/FOXH1 cofactors. Genes Dev. 2011;25:1654–1661. doi: 10.1101/gad.16800511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Taira M, Otani H, Jamrich M, Dawid IB. Expression of the LIM class homeobox gene Xlim-1 in pronephros and CNS cell lineages of Xenopus embryos is affected by retinoic acid and exogastrulation. Development. 1994;120:1525–1536. doi: 10.1242/dev.120.6.1525. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







