Abstract
Expression of brachyury in the notochord is regarded as a chordate novelty and links to the origin of the notochord, yet the evolution of this regulatory control remains unclear. Here, we uncovered a regulatory syntax (named SFZE) consisting of binding sites for four transcription factors in notochord enhancers of chordate brachyury genes. SFZE was also identified in potential brachyury enhancers in various non-chordate animals and even in Capsaspora, a unicellular relative to animals. These non-chordate SFZE-containing enhancers exhibited activity in the zebrafish notochord. Furthermore, the SFZE syntax in a non-chordate confers endoderm activity. Our results indicate the ancient association of SFZE with brachyury, likely predating the origin of animals. The emergence of notochordal brachyury expression could be attributed to co-option of upstream signals acting on the conserved SFZE syntax, which facilitates the origin of the notochord from rudimentary endodermal cells.
An ancient gene regulatory code facilitates the origin of the notochord from rudimentary gut cells by recruiting upstream factors.
INTRODUCTION
The notochord, a defining feature of chordates (amphioxus, tunicates, and vertebrates), plays critical roles in chordate development. The notochord cells differentiate from the axial mesoderm, which arises from the organizer located dorsal to the blastopore at gastrulation onset. During gastrulation, the notochord progenitors signal to surrounding tissues, shaping the basic chordate body plan (1). The two closest relatives of chordates are the hemichordates and echinoderms (together ambulacrarians); these three groups constitute the deuterostomes. The hemichordate stomochord, an anterior rod-like protrusion from the pharynx, was once considered homologous to the chordate notochord. However, molecular studies do not support stomochord-notochord homology (2). In a protostome annelid, it was shown that a population of midline mesodermal cells expresses a set of notochord-specific genes and differentiates into a medial ventral longitudinal muscle named the axochord (3). While the axochord has been proposed to be a notochord homolog, it remains possible that the two structures may have evolved convergently (4).
Brachyury encodes a T-box transcription factor (TF) essential for notochord development (5). It may represent the most ancient T-box family member, with orthologs found in various animals and non-animals (6). Non-chordate brachyury genes can transform ascidian endodermal cells into notochord cells (7), indicating protein functional equivalence between chordates and non-chordates. In contrast to the conservation of protein function, brachyury expression patterns differ between chordates and non-chordates: In chordates, the expression pattern is generally conserved. For example, brachyury transcripts in amphioxus and frog are first detected in cells surrounding the blastopore. Subsequently, the brachyury-expressing cells internalize during gastrulation to form the notochord (8, 9). In non-chordate bilaterians, a circumblastoporal expression domain is also observed in gastrulae; brachyury is additionally expressed in the hindgut and oral ectoderm, but not in axial mesodermal cells (10). These observations have led to a hypothesis that the circumblastoporal expression of brachyury is an ancestral trait in bilaterians, while axial mesodermal expression represents a chordate-specific innovation. Therefore, the evolutionary origin of the notochord likely associates with the gain of a novel brachyury expression domain in the axial mesoderm (11). Emergence of new expression domains is often attributed to births or modifications of enhancers [or cis-regulatory modules (CRMs)]. Studies in ascidian and zebrafish have identified CRMs upstream of the brachyury translation start site (TSS) that drive reporter gene expression in the notochord (12–14). It remains to be determined whether these notochord enhancers evolved de novo in the chordate ancestor or have a more ancient origin.
RESULTS
Ambulacrarian brachyury BACs are active in zebrafish
To explore the evolutionary origin of the brachyury notochord enhancer, we analyzed CRMs of brachyury genes from animals that are closely related to chordates, including the hemichordate Ptychodera flava and the sea urchin Strongylocentrotus purpuratus. The gfp knock-in bacterial artificial chromosomes (BACs) harboring the brachyury ortholog of either P. flava or S. purpuratus (Pfbra/Spbra:gfp BACs) recapitulated endogenous brachyury expression in their cognate embryos, driving expression in the blastopore, oral ectoderm, and endoderm (figs. S1 and S2) (15, 16), although some ectopic expression was also observed. These results suggest that the BACs contain CRMs sufficient to drive brachyury expression at the analyzed stages. To examine CRM activities of the ambulacrarian brachyury genes in chordates, we introduced each of the reporter BACs into zebrafish zygotes. At the onset of gastrulation (shield stage), 6.5 hours post fertilization (hpf), ~33% of the zebrafish embryos injected with the Pfbra:gfp BAC showed green fluorescent protein (GFP) signals in the embryonic shield, which is equivalent to the dorsal organizer that gives rise to midline structures, including the notochord (17). By the early segmentation stage (11 to 14 hpf), ~45% of the transgenic embryos showed GFP signal in the dorsal midline (Fig. 1, A to D, and table S2). In contrast to the strong reporter activity driven by Pfbra:gfp BAC, Spbra:gfp BAC produced weak or undetectable GFP signals, with only two of 174 embryos showing GFP expression in the organizer at the shield stage, and the signals became faint at the early segmentation stage (Fig. 1, E to G, and table S2). The lower activity of the sea urchin BAC could be due to the presence of repressive elements, lower compatibility of the sea urchin promoter in zebrafish embryos, and potential modifications of regulatory sequences (experiments shown later). Double fluorescent in situ hybridization confirmed that gfp expression driven by Pfbra:gfp BAC localized to Drntl (18) (no tail, zebrafish brachyury ortholog)–expressing cells in the shield (Fig. 1, H to K), while, at the early segmentation stage, gfp transcripts were detected in the dorsal midline underneath the Drntl-expressing notochord cells (Fig. 1, L to P). The GFP signal was confirmed to be in the hypochord [marked by Drmnx1 expression; (19)], a midline structure sharing the same progenitors with the notochord (Fig. 1, Q to U, and fig. S3) (17). Together, these results indicate that the ambulacrarian brachyury CRMs are responsive to the notochord progenitor regulatory state but not to those of the notochord at later segmentation stage.
Fig. 1. Pfbra:gfp BAC activates GFP expression in the organizer and the hypochord of zebrafish embryos.
(A) Circular Pfbra/Spbra:gfp BACs were each introduced into zebrafish zygotes, and GFP signals were observed at the shield and early segmentation stages (A, anterior; P, posterior; V, ventral; D, dorsal). (B to G) GFP signals in zebrafish embryos injected with Pfbra:gfp BAC or Spbra:gfp BAC. GFP signal in the dorsal margin (shield) at the shield stage is indicated by the red arrowhead. The white dashed boxes in (C) and (F) are magnified and shown in (D) and (G), respectively. (H to U) Double fluorescent in situ hybridization of gfp (green) and Drntl [magenta in (H) to (P)] or Drmnx1 (magenta in Q to U) in Pfbra:gfp BAC-injected zebrafish embryos at indicated stages. Nuclei were counterstained with Hoechst 33342 (blue). The areas in the white dashed boxes in (H), (L), and (Q) are enlarged and shown in (I), (M), and (R); single-channel images are shown in (J) and (K), (N) and (O), and (S) and (T), respectively. [(P) and (U)] The XZ sections along the yellow dashed lines are indicated with yellow arrows in (L) and (Q). The orientation of embryos is indicated in each panel: LV, lateral view; DV, dorsal view. All scale bars represent 100 μm. Panels that are in the same scale: (B), (C), (E), and (F); (D) and (G); (H), (L), and (Q); (I) to (K), (M) to (P), and (R) to (U). Created in BioRender. Fan, T. (2025) https://BioRender.com/n8z3595.
Hemichordate PfCRM2 is active in the notochord
To identify CRM(s) of Pfbra contributing to the transcriptional activities in zebrafish embryos, we analyzed ATAC-seq (assay for transposase accessible chromatin using sequencing) data to locate open chromatin regions at the Pfbra locus. Among the four major ATAC-seq peaks, PfCRM2 spans 717–base pair (bp) immediately upstream of the TSS and contains the promoter (Fig. 2S). We constructed reporters for PfCRM2 and the other three PfCRMs fused with the deduced Pfbra promoter (a 332-bp fragment upstream of the TSS), using an egfp vector (fig. S4A) (20). For comparison, constructs containing Pfbra promoter alone or the known notochord enhancer of Drntl [~1 kb upstream of the Drntl TSS (Drntl-1kb)] (12) were also generated (fig. S4A). At the shield stage, Drntl-1kb and PfCRM2 drove egfp expression in the hypoblast of the embryonic shield (notochord precursor) in most embryos (98% for Drntl-1kb, 80% for PfCRM2). Enhanced GFP (EGFP) signals were also observed in the dorsal germ ring for some embryos (fig. S4, B, C, H, and I; and table S3). Reporters with the Pfbra promoter alone or with PfCRM1, PfCRM3, or PfCRM4 exhibited lower activities in the hypoblast (fig. S4, D to N, and table S3). At the early segmentation stage, Drntl-1kb and PfCRM2 drove egfp expression in the notochord in 49 and 33% of EGFP-positive embryos, respectively (Fig. 2, B to E and X). In contrast, Pfbra promoter alone or with the other CRMs showed minimal or no activity in the notochord (Fig. 2X; fig. S4, O to V; and table S3). The EGFP-positive cells in the PfCRM2-injected embryos were Drntl-expressing notochord cells (fig. S4, W to Z). These results reveal that, among the four PfCRMs, only PfCRM2 exhibits comparable transcriptional activities in the hypoblast and notochord to those of Drntl-1kb. Thus, it appears that PfCRM2 shares similar regulatory features with the zebrafish notochord enhancer. Additionally, the Pfbra promoter has limited regulatory function in the hypoblast, likely accounting for the hypoblast activities of PfCRM1 and PfCRM4. Furthermore, the observation that PfCRM2, but not Pfbra:gfp BAC, is transcriptionally active in the notochord implies the presence of silencer(s) within the BAC that suppress PfCRM2 activity in the notochord. PfCRM3 likely serves as a silencer, given that it significantly reduced Pfbra promoter activity in the hypoblast and the absence of its activity in the notochord.
Fig. 2. Brachyury CRMs with SFZE/SFZE-like syntax show activity in the zebrafish notochord.
(A) Phylogenetic relationships of the analyzed species. (B to Q) Zebrafish embryos injected with SFZE/SFZE-like containing CRMs from zebrafish (Drntl-1kb), hemichordate (PfCRM2), sea urchin (SpCRM4), fruit fly (DmCRM1), sea anemone (NvCRM1, NvCRM2, and NvCRM3), and filasterea (CoCRM4). The right panels show magnified views of the notochord region (white dashed boxes in the corresponding left panels) with magenta arrowheads indicating enhanced GFP (EGFP) signals in the notochord cells. The left panels are in the same scale, as are the right panels. Scale bars in (B) and (C), 100 μm. (R to W) The putative CRMs based on published ATAC-seq (Drntl, Pfbra, and Spbra loci) and chromatin immunoprecipitation sequencing (ChIP-seq) datasets (Dmbyn, H3K4me1, and H3K27me3; Nvbra, H3K4me1; Cobra, H3K4me1, and H3K4me3). Potential CRMs are indicated with numbers, and CRMs carrying SFZE/SFZE-like motifs are shaded in pink. The TF binding sites are denoted schematically with colored arrows placed on top of each diagram to show the orientation and order. Colors of TF sites are shown in (R) and (S). Genome versions, scaffold/chromosome numbers, and positions of the genomic loci displayed in (R) to (W) are indicated below the respective panel. The samples used for generating of the ATAC-seq and ChIP-seq datasets are listed in table S7. (X) Percentages of embryos with EGFP signals in the notochord out of EGFP-positive embryos. Each colored square represents the result of one experiment. Gray columns represent the average of at least three biological replicates, with error bars serving as SDs.
SFZE syntax of brachyury notochord enhancers
To identify the regulatory features underlying the transcriptional activity in the notochord, we compared sequences of PfCRM2 with the known brachyury notochord enhancers from zebrafish (12) and the ascidian Ciona intestinalis type A (or renamed as Ciona robusta) (21, 22). Using mVISTA (23), we were unable to detect conserved regions between PfCRM2 and the notochord enhancers (fig. S5). Nevertheless, the ability of PfCRM2 to drive reporter expression in the notochord suggests that TF binding sites within PfCRM2 are sufficient to respond to particular TFs present in the zebrafish notochord. Studies on the regulatory controls of zebrafish and Ciona brachyury genes have identified TF binding sites critical for expression in the notochord. These include a Foxh1 site located within Drntl-1kb (12) and two sites for Suppressor of Hairless [Su(H)] within the proximal enhancer of the Ciona brachyury (Cibra) (13); one complies the consensus sequence (GTGRGAR) (24), while the other is less ideal (NTGRGAR). Additionally, a shadow enhancer located ~800 bp upstream of the Cibra TSS contains two Ets and one ZicL sites in a face-to-face orientation essential for enhancer activity in the notochord (21). Similarly, in another ascidian, Halocynthia roretzi, Ets and ZicN in the promoter region of brachyury (Hrbra) are required for Hrbra initiation in notochord precursors (14). We thus reasoned that rather than sequence conservation, TF sites with specific grammar may be essential for transcriptional activity in the notochord. Drntl-1kb encompasses two ATAC-seq peaks (DrCRM2 and DrCRM3 in Fig. 2R). Within DrCRM2, we found binding sites for Zic, Ets, and Su(H) adjacent to the previously identified Foxh1 site (Fig. 2R and fig. S6) (12); an additional Foxh1 site was also observed. The Zic and Ets sites of DrCRM2 are facing each other (Fig. 2R and fig. S6), akin to the shadow enhancer of Cibra (21). Additionally, we found two Foxh1 sites adjacent to the previously identified ZicL site in the Cibra shadow enhancer (figs. S6 and S7). In sum, both notochord enhancers share a specific syntax: Binding sites of the four TFs are arranged in a defined order [5′-Su(H)-Foxh1-Foxh1-Zic-Ets-3′ for zebrafish and reversed in Ciona] and with specific orientations (face-to-face orientation of Zic and Ets sites, and the same direction for Zic and the adjacent Foxh1 site). These TF sites are also in close proximity to each other, especially the Foxh1-Zic-Ets sites (span less than one nucleosome length, ~147 bp; table S4). This tight spacing likely facilitates collaborative TF recruitment for regulating brachyury expression. Hereafter, we refer to this arrangement as the “SFZE” syntax, based on the initials of the four TFs. This syntax likely represents a functional unit for driving brachyury expression in the notochord. In line with this hypothesis, we were unable to identify the syntax in the Xenopus brachyury (Xbra) promoter, which activates reporter in the blastopore but not in the notochord (25). Together, our results suggest that the SFZE syntax is conserved in notochord enhancers of chordate brachyury genes.
Deep homology of the SFZE syntax
To explore whether the SFZE syntax has an ancient origin and might account for the notochord activity of the hemichordate PfCRM2, we scanned PfCRM sequences for binding sites of the four TFs. We discovered that PfCRM2 contains two Foxh1, one Zic, two Ets, and two Su(H) sites [one is a less optimal Su(H)-like site], organized similarly to the chordate SFZE syntax (Fig. 2S and fig. S6). One difference is that the Su(H) sites in PfCRM2 are adjacent to Ets sites but to Foxh1 sites in zebrafish SFZE syntax. Thus, the notochord activity of PfCRM2 is likely attributed to the presence of the syntax, with FZE serving as the core, and the position of Su(H) sites is less critical. PfCRM3 does not contain Ets and Su(H) sites, and PfCRM4 lacks Su(H) sites, consistent with the absence of notochord activity. PfCRM1 contains all four TF sites, but the Ets and Zic sites are in the same orientation. Consistent with this slightly disrupted syntax, PfCRM1 exhibited lower notochord activity than PfCRM2 but higher activity than PfCRM3 and PfCRM4 (Fig. 2X and fig. S6). We further examined the sequences of the sea urchin Spbra CRMs. Among the putative CRMs, SpCRM4 (located in the third intron) harbors a SFZE motif. It complies with the FZE orientation rule but is partially inverted compared to the SFZE syntax (i.e., SEZF; Fig. 2T and fig. S6). Nevertheless, SpCRM4 exhibited transcriptional activity in the zebrafish notochord, albeit at a lower level (Fig. 2, F, G, and X; and table S5). These results strongly suggest the role of the syntax in the notochord activity and demonstrate that the association of SFZE with brachyury is present in non-chordate deuterostomes, predating the origins of the notochord.
To further trace the origin of the SFZE syntax, we analyzed potential CRMs of brachyury orthologs of Drosophila melanogaster (Dmbyn), a protostome, and sea anemone Nematostella vectensis (Nvbra), a non-bilaterian. We identified SFZE in DmCRM1 that mirrors the organization in hemichordate PfCRM2 (Fig. 2U and fig. S6). Furthermore, DmCRM1 drove egfp expression in the zebrafish notochord, with activity comparable to that of zebrafish reporter (Fig. 2, H, I, and X; and table S5). For Nematostella, we found that NvCRM1 contains an inverted FZE motif upstream of a Su(H) site, while these sites within NvCRM2 violate the FZE orientation rules. NvCRM3 has two consecutive SFZE motifs. The upstream motif has an inverted FZE syntax that matches the motif order within sea urchin SpCRM4. Meanwhile, the downstream motif deviates from the FZE grammar rules, with Ets and Zic sites oriented in the same direction. Additionally, these sites in NvCRM1, NvCRM2, and the downstream SFZE motif of NvCRM3 span more than one nucleosome length (table S4). Correspondingly, NvCRM3 exhibited higher transcriptional activity in zebrafish notochord than NvCRM1 and NvCRM2 (Fig. 2, J to O and X, and table S5). Its activity exceeded that of SpCRM4, likely due to a synergistic effect of the two SFZE motifs.
The deeply conserved SFZE syntax within animals led us to extend this analysis to the brachyury ortholog of the filasterean Capsaspora owczarzaki (Cobra) (26), a unicellular eukaryote that is phylogenetically close to metazoans. Within one potential CRM (CoCRM4), we found an SFZE syntax containing one Su(H)-like site with the FZE core located within an exon of a gene (CAOG_05510) neighboring Cobra (CAOG_05512) (Fig. 2W and fig. S6). This nonanimal CoCRM4 also drove egfp expression in zebrafish notochord (Fig. 2X and table S5). Together, these results showed the association of the SFZE syntax with brachyury orthologs of a wide range of animals and a unicellular eukaryote. Furthermore, CRMs with SFZE syntax exhibited transcriptional activities in zebrafish notochord. Our results thus strongly support the conclusion that the SFZE syntax is deeply conserved and it likely originated before the emergence of animals.
Foxh1 and Ets sites are functionally important
Nodal and fibroblast growth factor (FGF) signals activate zebrafish brachyury expression in the notochord via Foxh1 and Ets TFs, respectively (12, 27, 28). To evaluate the importance of Foxh1 and Ets sites within the hemichordate SFZE syntax, we removed these sites in PfCRM2. Deletion of the Foxh1 sites significantly reduced reporter activity (Fig. 3, A to E and H), and mutation of the Ets sites strongly decreased reporter activity in the notochord (Fig. 3, F to H). These results indicate that both Foxh1 and Ets sites in PfCRM2 are functional and likely respond to endogenous Nodal and FGF signaling in the notochord. Additionally, the Ets sites play a crucial role, while the Foxh1 sites confer an additive effect, strengthening the transcriptional activity in the notochord.
Fig. 3. Foxh1 and Ets sites of the hemichordate SFZE syntax are functionally important for notochord activity.
(A) Diagram of the reporter constructs of PfCRM2 with deletions of the Foxh1 sites or mutations of the Ets sites. The additional Foxh1 site in the vector is indicated by the leftmost blue arrow with black outline. (B to G) Representative images show zebrafish embryos injected with the truncated or mutated PfCRM2 reporter constructs. Magnifications of the notochord regions highlighted by white boxes in (B), (D), and (F) are shown in (C), (E), and (G), respectively. EGFP signals in the notochord cells are indicated with magenta arrowheads. Embryos are oriented in the dorsal view with anterior to the top. Top panels [(B), (D), and (F)] and bottom panels [(C), (E), and (G)] are in the same scale as indicated in (B) and (C), respectively. w/o, without; delta symbol (△), mutation. All scale bars represent 100 μm. (H) Percentages of zebrafish embryos exhibiting EGFP signals in the notochord among EGFP-positive embryos. Each yellow data point represents the result of a single experiment. The gray columns are average results from at least three biological replicates, with error bars showing SDs. *P < 0.05; ****P < 0.0001.
Sea urchin SFZE syntax confers endoderm activity
To this point, we had shown that non-chordate brachyury CRMs with SFZE have notochord activity in zebrafish embryos; however, their endogenous roles remained unknown. To explore the function of the SFZE syntax in organisms lacking a notochord, we analyzed SpCRM4 activity during sea urchin gastrulation (Fig. 4A). The promoter driven by SpCRM4 exhibited strong activity in the presumptive endoderm of mesenchyme blastula (i.e., initiation of gastrulation) (Fig. 4, B and J), recapitulating the endogenous Spbra expression (fig. S8, A to D), while Spbra promoter alone showed negligible background activity (table S6). By late gastrula stage, the SpCRM4 reporter was active in the archenteron, non-oral ectoderm, cells surrounding the blastopore, and mesenchymal cells (Fig. 4, E and K). The ectodermal expression differed from the endogenous pattern, suggesting that additional CRMs are required. Mutation of the two Foxh1 sites had no effect at the mesenchyme blastula stage but significantly decreased reporter signals in the archenteron. In contrast, disruption of the two Ets sites decreased the activity in the presumptive endoderm significantly but had little effect at late gastrula stage (Fig. 4, C to D and F to K). These results indicate that SpCRM4 functions mainly as an endoderm enhancer at the onset of gastrulation and behaves as a general enhancer across three germ layers during gastrulation. Moreover, the endodermal activity of SpCRM4 at the onset and end of gastrulation partially depends on the respective Ets and Foxh1 sites within the syntax. Given the similarity of endodermal expression patterns of brachyury in sea urchin, sea star, and hemichordate (fig. S8), the endodermal activity of the SFZE syntax is likely conserved in ambulacrarians.
Fig. 4. Foxh1 and Ets sites of the sea urchin SFZE syntax confer endodermal activity at different developmental stages.
(A) Illustrations of sea urchin embryos at the mesenchyme blastula (top left, lateral view; top right, vegetal view) and late gastrula (bottom) stages. Colored arrows and arrowheads mark different embryonic territories. (B to I) Representative EGFP-positive embryos showing reporter activities. Colored arrows and arrowheads indicate EGFP-positive cells within the territories shown in (A). Embryos are oriented as in (A). For late gastrulae, the ventral side is to the left. Insets in (B) to (D) are the vegetal view to show EGFP signals in the presumptive endoderm. delta symbol (△), mutation. Panels are all in the same scale [scale bar in (B) 100 μm]. (J and K) Percentages of mesenchyme blastula (J) and late gastrula (K) embryos showing EGFP signals in the indicated territories. PMC, primary mesenchyme cells; SMC, secondary mesenchyme cells. *P < 0.05; **P < 0.01; n.s., not significant.
SFZE syntax in vertebrate brachyury paralogs
Brachyury is generally a single copy gene in invertebrates (29). After two rounds of whole-genome duplications (2R) during vertebrate evolution (30) following lineage-specific gene loss, ray-finned fishes have tbxta and tbxtb genes (31), while tetrapods only retain tbxtb (29, 31). Because the association of SFZE with brachyury predates 2R, we anticipate that both vertebrate paralogs would have inherited SFZE. In zebrafish, the identified syntax is associated with tbxta (Drntl). At the zebrafish tbxtb locus (Drbra), we uncovered the syntax in two putative CRMs situated 4.8 kb upstream and 12 kb downstream of the TSS (figs. S6 and S7). In mouse brachyury (Mmbra/T/tbxtb), a SFZE-containing CRM (1.6 kb upstream of the TSS) lies within a 8.3-kb region that directs Mmbra expression in the primitive streak (homologous to the blastopore) but not in the notochord of the mouse embryo (32). In Xenopus, two putative CRMs containing SFZE syntax were identified. One is 140.8 kb upstream and the other 13 kb downstream of the Xbra/tbxtb TSS (figs. S6 and S7); activities of these two CRMs remain unknown. These findings demonstrate that the SFZE syntax is present in both vertebrate brachyury paralogs, reinforcing the idea of its ancestral association with brachyury and retention after 2R.
DISCUSSION
In this study, we show that the regulatory features of functionally equivalent enhancers can be preserved by specific motif grammar. We uncover a conserved SFZE syntax associated with brachyury of species across a wide range of taxa, including chordates and non-chordates, and even a nonanimal unicellular eukaryote. The proximity spacing, order, and orientation of the TF sites are maintained despite adjacent sequence divergence, suggesting the deeply conserved regulatory code under strong evolutionary constraints.
At the onset of gastrulation, brachyury expression in ambulacrarians and chordates preferentially localized on the ventral and dorsal blastoporal margins, respectively, which aligns with the hypothesis of the dorsoventral inversion in the chordate ancestor (33, 34). Our results showed that the SFZE-containing CRMs have transcriptional activity in the zebrafish dorsal margin (part of the future blastopore) and sea urchin blastopore, suggesting a conserved regulatory role of the syntax in the blastoporal region of deuterostomes. During gastrulation, chordate brachyury genes maintain expression in the dorsal axial region of the invaginating archenteron (presumptive notochord), whereas, in ambulacrarian, expression is restricted in a few ventral endodermal cells (fig. S8). The function of Nodal and FGF signals in controlling brachyury expression in the notochord appears to have arisen at least before the divergence of tunicates and vertebrates (Fig. 5). In the basal chordate amphioxus, the notochord expression of brachyury is controlled by Nodal (35, 36), but it is not strongly affected when FGF signaling is inhibited (37). Nevertheless, within the potential notochord enhancers (38) of the two amphioxus brachyury paralogs (Bfbra1 and Bfbra2), we found an SFZE motif in Bfbra1 and two pairs of Su(H) binding sites in Bfbra2 (fig. S6).
Fig. 5. Proposed scenario of SFZE evolution and the origin of the notochord.
The SFZE syntax (red rectangles) associated with brachyury orthologs could have already been present before the emergence of animals. The regulatory network that activates brachyury in the axial mesoderm by concurrent Nodal and FGF signals through the SFZE syntax may have been established in the chordate lineage, at least before the divergence of tunicates and vertebrates. The Ets and Fox family factors (activated by yet to be determined signaling pathways, indicated by question marks) likely regulate brachyury expression through SFZE in at least ambulacrarians. Some components of the regulatory inputs could have been modified due to the loss of nodal and/or foxh genes in Drosophila and ambulacrarians (cross symbols). After two rounds of whole-genome duplications (2R), the SFZE syntax remained associated with both brachyury paralogs. Other notochord enhancers (T3/C/I) were subsequently evolved to control one of the paralogs (i.e., tbxb). In tetrapods, only tbxb was retained, and the notochord activity of the mammalian SFZE may have been substituted by T3/C/I enhancers. Created in BioRender. Fan, T. (2025) https://BioRender.com/14o3zg1.
Unlike the synergistic effect of the Foxh1 and Ets sites in the zebrafish notochord, the endodermal activity of the two TF sites during sea urchin gastrulation is temporally decoupled. Intriguingly, foxh orthologs are absent in the sea urchin and hemichordate genomes (39, 40). Additionally, inhibition of Nodal or FGF signals does not affect endodermal expression of sea urchin and hemichordate brachyury (41–43). Similarly, nodal and foxh1 orthologs are missing in the Drosophila genome (44, 45). It is likely that the Foxh1 sites of ambulacrarian and Drosophila are occupied by other Fox TFs (46, 47), and the deeply conserved SFZE syntax may respond to different upstream factors. We thus propose that the notochord could have evolved from rudimentary endodermal cells, and, during chordate evolution, co-option of Nodal and FGF signals, via respective FoxH and Ets factors, reinforced brachyury expression in the dorsal axial region of the archenteron. Additional factors, such as Zic, orthologs of which are not expressed in the endodermal cells of sea urchin and hemichordate gastrulae (48, 49), could also be recruited during chordate evolution. Further recruitment of notochord differentiation genes downstream of brachyury (50) would have also been necessary for notochord evolution.
Previous studies revealed that the 5′ promoter regions of Cibra and Drntl/tbxta are sufficient for brachyury expression in the notochord. However, the corresponding regions of tbxtb in mouse (32) and Xenopus (25) lack notochord activity, instead driving expression in the primitive streak/blastopore. Several notochord-specific enhancers (T3, C, and I) have been identified in the tbxtb loci of various jaw vertebrates (51, 52). These conserved CRMs contain functional Brachyury sites for auto-regulation. However, T3, C, and I enhancers are absent in tbxta of ray-finned fishes, jawless vertebrates, and Ciona, suggesting an origin in the jawed vertebrate common ancestor (52). Notably, the notochord activity of T3, C and I is observed during somitogenesis in zebrafish, after the initial expression of Drbra/tbxtb in the dorsal margin prior to gastrulation (31). The early Drbra/tbxtb expression is thus likely regulated by the SFZE-containing CRMs that we identified. Intriguingly, the proximal enhancer of mouse Mmbra contains the SFZE syntax but is active only in the primitive streak. Thus, although SFZE-containing CRMs are active in both the blastoporal region and notochord during zebrafish gastrulation, their notochord activity may have been fully substituted by T3, C, and I enhancers in mammals.
Together, our study highlights the importance of conserved regulatory syntax in notochord evolution. The SFZE syntax has very deep evolutionary roots, likely originated before the emergence of animals. We propose that the ancestral SFZE syntax functions in endodermal cells during gastrulation. Subsequent co-option of upstream factors and functional divergence of the SFZE syntax after duplication shaped its activity during chordate evolution.
MATERIALS AND METHODS
Animal collection and embryo culture
Adult sea urchin S. purpuratus were obtained from Amro Hamdoun (University of California, San Diego) and kept at 15°C. Mature sea star Archaster typicus and hemichordate P. flava were collected during the breeding season (June to August and September to December, respectively) from Chito Bay, Penghu Islands, Taiwan. Spawning and embryo cultures were conducted as previously described (53–55) with modifications for A. typicus. In brief, starfish were weighed and injected with 1 ml of 200 μM 1-methyladenine per 100 g of weight on the dorsal side of arms to induce spawning. Starfish embryos were cultured at 33°C.
Annotation of brachyury CRMs, promoters of Pfbra and Spbra, and the SFZE syntax
JBrowse (56) (version 1.16.10) was used to visualize the presumed CRMs at the brachyury loci of zebrafish, P. flava, N. vectensis, and C. owczarzaki. The processed datasets of ATAC-seq of zebrafish and the histone chromatin immunoprecipitation sequencing (ChIP-seq) of N. vectensis and C. owczarzaki were obtained from NCBI Gene Expression Omnibus under respective accession numbers GSE130944 (57), GSE46488 (58), and GSE71131 (59). The ATAC-seq dataset of hemichordate P. flava was obtained from a previous study and analyzed using standard pipelines (60). The sequencing reads were aligned to the cognate genome assemblies of zebrafish (Zv10) (61), N. vectensis (Nemvec1) (62), C. owczarzaki (C_owczarzaki_V2, accession number in the European Nucleotide Archive: GCA_000151315.2, www.ebi.ac.uk/ena/browser/view/GCA_000151315.2), and P. flava (ptychodera_flava version 1.0.114) (63) using Bowtie2 (version 2.5.1) (64). The ChIP-seq or the ATAC-seq datasets of mouse, Xenopus, Ciona, sea urchin, and Drosophila were from the NCBI Genome Data Viewer (65), Xenbase (v10.1) (66), ANISEED (67–69), Echinobase (70), and FlyBase (71), respectively. The promoters of Pfbra and Spbra were deduced by identifying positions of the core promoter elements (TATA box, initiator, downstream TFIIB recognition element, and downstream promoter element) (72, 73) around the transcription start site, as well as the promoter-proximal elements (CCAAT and GC box) upstream of the core promoter elements (74). The SFZE motifs were identified by scanning the DNA sequences of putative CRMs to locate the binding sites of Zic and Foxh1, using the vertebrate PFMs (position frequency matrices) from the JASPAR database (75). The Ets and Su(H) binding sites were detected according to the consensus core sequences MGGAW (76, 77) and GTGTRGAR (24), respectively. The suboptimal Su(H) binding site [Su(H)-like] was recognized as NTGRGAR or GTGRGAN.
Generation of GFP knock-in BAC clones and EGFP reporter constructs
The Spbra:gfp BAC (BAC clone ID: Sp_117A03_L in the Echinobase) (70, 78) contains an insert of ~131 kb of DNA, with Spbra located 24 kb from the end. The genomic DNA library of P. flava was constructed in the pBACe3.6 vector in Eric Davidson’s laboratory at California Institute of Technology. To identify BAC clones containing Pfbra, BAC arrayed library filters were screened with a digoxigenin (DIG)-labeled Pfbra probe (PCR DIG probe synthesis kit, Roche) following the hybridization protocol (DIG high prime DNA labeling and detection starter kit II, Roche). The DNA probe spans from 3′ of exon 6 to the flanking intron with a size of 173 bp. The identified Pfbra BAC clone was obtained from the Echinobase (clone position: plate #57, well J10) (70). The insert size of the Pfbra BAC was determined to be ~146 kb by digestion with Not I–HF (New England Biolabs) and separation via pulse-field gel electrophoresis. End sequencing and mapping to the P. flava genome revealed that Pfbra is located ~23 kb from the terminal end of the insert. To create a GFP knock-in BAC construct (Pfbra:gfp BAC), the gfp coding sequence was inserted at the TSS of Pfbra using homologous recombination (79, 80). NucleoBond Xtra BAC plasmid purification kit was used to extract the BACs. Putative CRMs and promoters of Pfbra and Spbra were polymerase chain reaction (PCR) amplified from the BACs. CRMs of zebrafish, fruit fly, and N. vectensis were PCR amplified from the cognate genomic DNA. CoCRM4 was synthesized by BIOTOOLS Co. Ltd. The CRMs and promoters were constructed into the HLC (Hugo’s lamprey construct) reporter vector (20, 81) by either restriction enzyme-mediated method, Gibson assembly (New England Biolabs), or combined by fusion PCR before ligating to the reporter vector. To modify particular TF binding sites in the SFZE syntax, the Q5 Site-Directed Mutagenesis Kit (E0554, New England Biolabs) was used to mutate the Foxh1 sites or the two guanine/cytosine nucleotides in the consensus core sequence (MGGAW) of the Ets sites. However, for the PfCRM2w/oFoxh1 construct, the 5′ end of the PfCRM2 was truncated by 224 bp to eliminate the two Foxh1 binding sites. The primer sets used in this study are listed in table S8.
Microinjection
For microinjection of P. flava, zygotes were first pipetted for 10 s in filtered seawater (FSW) containing 0.25% N-acetyl-l-cysteine (NAC; Sigma-Aldrich) and incubated for 2 min to dissolve the jelly coat. The zygotes were then washed three times with FSW to remove NAC before being aligned on a protamine sulfate-coated culture dish for injection. The injection mixture used for P. flava included linearized Pfbra:gfp BAC (5 ng/μl), Dextran Alexa Fluor 555 (0.75 μg/μl; Invitrogen), and 0.12 M KCl. Microinjection of S. purpuratus and Danio rerio was conducted according to published procedures (82, 83). For sea urchins, 9 pl of injection solution containing either linearized Spbra:gfp BAC (5 ng/μl) or PCR amplicons (0.4 ng/μl) of reporter constructs, Dextran Alexa Fluor 555 (0.75 μg/μl), and 0.12 M KCl was injected into the zygotes. Additionally, carrier DNA (15.4 ng/μl; Hind III digested S. purpuratus genomic DNA) was added to the solution to facilitate the incorporation of PCR amplicons. For zebrafish, 2 nl of the injection solution containing 200 pg of circular reporter constructs and 0.03%/μl of phenol red solution (Sigma-Aldrich) in Danieau’s water was injected into the zygotes. When introducing BACs into zebrafish, 250 pg of circular BAC plasmids in 4 nl of injection solution was used. Procedures for zebrafish were approved by the Academia Sinica Institutional Animal Care and Use Committee (protocol ID: 22-06-1877). Reporter gene expression was observed and imaged with either a Zeiss Axio Observer Z1 or a Nikon SMZ18 microscope.
Statistical analyses
Results of each transgenesis experiment in hemichordates, sea urchins, and zebrafish are listed in the tables S1 to S3, S5, and S6. Statistical comparisons in activities of different CRMs or BACs were performed using two-tailed Welch’s t test with 95% confidence intervals. Bonferroni correction was applied to control type I error rate for multiple comparisons, and P values are shown in Figs. 3 and 4. Statistical graphs were generated using GraphPad Prism 10.
Whole-mount in situ hybridization
Antisense RNA probes were synthesized following the instructions of DIG RNA Labeling mix (Roche) or LabelIT DNP Labeling kit (Mirus) to generate respective DIG-labeled or DNP-labeled probes. In situ hybridization of P. flava and sea urchin embryos was conducted as described (34). A. typicus in situ hybridization was performed according to the same protocol as used for sea urchins. Single and double fluorescent in situ hybridization for sea urchin, P. flava, and zebrafish was performed following the previously described procedure (83–86), except that 100 mM sodium azide was used to quench endogenous peroxidase activity or antibody activity. Tyramide signal amplification (PerkinElmer) was applied to amplify the fluorescent signals. Embryos were imaged with either a Zeiss Axio Imager A2 or a Zeiss Axio Observer Z1 microscope. Fluorescent signals were captured using a Zeiss LSM 880 confocal microscope.
Acknowledgments
We thank the Taiwan Zebrafish Core Facility (TZCAS) for providing the zebrafish ASAB wild-type strain. We are grateful to S.-P. L. Hwang, H.-J. Hsu, and H. Watanabe for providing genomic DNA of zebrafish, fruit fly, and N. vectensis. We also thank C. Lowe for sharing the HCL reporter vector. We thank Y.-F. Lu for technical assistance in zebrafish microinjection. We also thank M. Calkins for English editing. We are grateful for the help from the core facility and the Marine Research Station of the Institute of Cellular and Organismic Biology, Academia Sinica.
Funding: This work was supported by National Science and Technology Council, Taiwan, NSTC-113-2326-B-001-004 (Y.-H.S.) and NSTC-113-2811-B001-093 (Y.-H.S.); and by Academia Sinica, Taiwan AS-GC-111-L01 (Y.H.S. and J.-K.Y.).
Author contributions: Conceptualization: T.-P.F. and Y.-H.S. Investigation: T.-P.F., J.-R.L., Y.-C.C., A.E.C., and R.A.C. Formal analysis: C.-Y.L. and T.-P.F. Visualization: T.-P.F. Funding acquisition: Y.-H.S. and J.-K.Y. Project administration: T.-P.F. and Y.-H.S. Supervision: Y.-H.S. and J.-K.Y. Writing—original draft: T.-P.F. Writing—review and editing: T.-P.F. and Y.-H.S.
Competing interests: The authors declare that they have no competing interests.
Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials.
Supplementary Materials
This PDF file includes:
Figs. S1 to S8
Tables S1 to S8
REFERENCES AND NOTES
- 1.Stemple D. L., Structure and function of the notochord: An essential organ for chordate development. Development 132, 2503–2512 (2005). [DOI] [PubMed] [Google Scholar]
- 2.Satoh N., Tagawa K., Lowe C. J., Yu J.-K., Kawashima T., Takahashi H., Ogasawara M., Kirschner M., Hisata K., Su Y.-H., Gerhart J., On a possible evolutionary link of the stomochord of hemichordates to pharyngeal organs of chordates: Stomochord and Notochord Relationship. Genesis 52, 925–934 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lauri A., Brunet T., Handberg-Thorsager M., Fischer A. H. L., Simakov O., Steinmetz P. R. H., Tomer R., Keller P. J., Arendt D., Development of the annelid axochord: Insights into notochord evolution. Science 345, 1365–1368 (2014). [DOI] [PubMed] [Google Scholar]
- 4.Hejnol A., Lowe C. J., Animal evolution: Stiff or squishy notochord origins? Curr. Biol. 24, R1131–R1133 (2014). [DOI] [PubMed] [Google Scholar]
- 5.Bruce A. E. E., Winklbauer R., Brachyury in the gastrula of basal vertebrates. Mech. Dev. 163, 103625 (2020). [DOI] [PubMed] [Google Scholar]
- 6.Sebé-Pedrós A., Ariza-Cosano A., Weirauch M. T., Leininger S., Yang A., Torruella G., Adamski M., Adamska M., Hughes T. R., Gómez-Skarmeta J. L., Ruiz-Trillo I., Early evolution of the T-box transcription factor family. Proc. Natl. Acad. Sci. U.S.A. 110, 16050–16055 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Satoh G., Harada Y., Satoh N., The expression of nonchordate deuterostome Brachyury genes in the ascidian Ciona embryo can promote the differentiation of extra notochord cells. Mech. Dev. 96, 155–163 (2000). [DOI] [PubMed] [Google Scholar]
- 8.Holland P. W., Koschorz B., Holland L. Z., Herrmann B. G., Conservation of Brachyury (T) genes in amphioxus and vertebrates: Developmental and evolutionary implications. Development 121, 4283–4291 (1995). [DOI] [PubMed] [Google Scholar]
- 9.Smith J. C., Price B. M., Green J. B., Weigel D., Herrmann B. G., Expression of a xenopus homolog of Brachyury (T) is an immediate-early response to mesoderm induction. Cell 67, 79–87 (1991). [DOI] [PubMed] [Google Scholar]
- 10.Sebé-Pedrós A., Ruiz-Trillo I., Evolution and classification of the T-box transcription factor family. Curr. Top. Dev. Biol. 122, 1–26 (2017). [DOI] [PubMed] [Google Scholar]
- 11.Satoh N., Tagawa K., Takahashi H., How was the notochord born?: Notochord and evolution of chordates. Evol. Dev. 14, 56–75 (2012). [DOI] [PubMed] [Google Scholar]
- 12.Harvey S. A., Tümpel S., Dubrulle J., Schier A. F., Smith J. C., No tail integrates two modes of mesoderm induction. Development 137, 1127–1135 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Corbo J. C., Levine M., Zeller R. W., Characterization of a notochord-specific enhancer from the Brachyury promoter region of the ascidian, Ciona intestinalis. Development 124, 589–602 (1997). [DOI] [PubMed] [Google Scholar]
- 14.Matsumoto J., Kumano G., Nishida H., Direct activation by Ets and Zic is required for initial expression of the Brachyury gene in the ascidian notochord. Dev. Biol. 306, 870–882 (2007). [DOI] [PubMed] [Google Scholar]
- 15.Rast J. P., Cameron R. A., Poustka A. J., Davidson E. H., brachyury Target genes in the early sea urchin embryo isolated by differential macroarray screening. Dev. Biol. 246, 191–208 (2002). [DOI] [PubMed] [Google Scholar]
- 16.Tagawa K., Humphreys T., Satoh N., Novel pattern of Brachyury gene expression in hemichordate embryos. Mech. Dev. 75, 139–143 (1998). [DOI] [PubMed] [Google Scholar]
- 17.Latimer A. J., Appel B., Notch signaling regulates midline cell specification and proliferation in zebrafish. Dev. Biol. 298, 392–402 (2006). [DOI] [PubMed] [Google Scholar]
- 18.Schulte-Merker S., Ho R., Herrmann B., Nüsslein-Volhard C., The protein product of the zebrafish homologue of the mouse T gene is expressed in nuclei of the germ ring and the notochord of the early embryo. Development 116, 1021–1032 (1992). [DOI] [PubMed] [Google Scholar]
- 19.Wendik B., Maier E., Meyer D., Zebrafish mnx genes in endocrine and exocrine pancreas formation. Dev. Biol. 268, 372–383 (2004). [DOI] [PubMed] [Google Scholar]
- 20.Parker H. J., Sauka-Spengler T., Bronner M., Elgar G., A reporter assay in lamprey embryos reveals both functional conservation and elaboration of vertebrate enhancers. PLOS ONE 9, e85492 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Farley E. K., Olson K. M., Zhang W., Rokhsar D. S., Levine M. S., Syntax compensates for poor binding sites to encode tissue specificity of developmental enhancers. Proc. Natl. Acad. Sci. U.S.A. 113, 6508–6513 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Brunetti R., Gissi C., Pennati R., Caicci F., Gasparini F., Manni L., Morphological evidence that the molecularly determined Ciona intestinalis type A and type B are different species: Ciona robusta and Ciona intestinalis. J. Zoolog. Syst. Evol. Res. 53, 186–193 (2015). [Google Scholar]
- 23.Frazer K. A., Pachter L., Poliakov A., Rubin E. M., Dubchak I., VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 32, W273–W279 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bailey A. M., Posakony J. W., Suppressor of hairless directly activates transcription of enhancer of split complex genes in response to Notch receptor activity. Genes Dev. 9, 2609–2622 (1995). [DOI] [PubMed] [Google Scholar]
- 25.Lerchner W., Latinkić B. V., Remacle J., Huylebroeck D., Smith J. C., Region-specific activation of the Xenopus Brachyury promoter involves active repression in ectoderm and endoderm: A study using transgenic frog embryos. Development 127, 2729–2739 (2000). [DOI] [PubMed] [Google Scholar]
- 26.Sebé-Pedrós A., de Mendoza A., Lang B. F., Degnan B. M., Ruiz-Trillo I., Unexpected repertoire of metazoan transcription factors in the unicellular holozoan Capsaspora owczarzaki. Mol. Biol. Evol. 28, 1241–1254 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Sirotkin H. I., Gates M. A., Kelly P. D., Schier A. F., Talbot W. S., Fast1 is required for the development of dorsal axial structures in zebrafish. Curr. Biol. 11, 1643 (2001). [DOI] [PubMed] [Google Scholar]
- 28.van Boxtel A. L., Economou A. D., Heliot C., Hill C. S., Long-range signaling activation and local inhibition separate the mesoderm and endoderm lineages. Dev. Cell 44, 179–191.e5 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Inoue J., Yasuoka Y., Takahashi H., Satoh N., The chordate ancestor possessed a single copy of the Brachyury gene for notochord acquisition. Zoological Lett. 3, 4 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Simakov O., Marlétaz F., Yue J.-X., O’Connell B., Jenkins J., Brandt A., Calef R., Tung C.-H., Huang T.-K., Schmutz J., Satoh N., Yu J.-K., Putnam N. H., Green R. E., Rokhsar D. S., Deeply conserved synteny resolves early events in vertebrate evolution. Nat. Ecol. Evol. 4, 820–830 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Martin B. L., Kimelman D., Regulation of canonical Wnt signaling by Brachyury is essential for posterior mesoderm formation. Dev. Cell 15, 121–133 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Clements D., Taylor H. C., Herrmann B., Stott D., Distinct regulatory control of the Brachyury gene in axial and non-axial mesoderm suggests separation of mesoderm lineages early in mouse gastrulation. Mech. Dev. 56, 139–149 (1996). [DOI] [PubMed] [Google Scholar]
- 33.Arendt D., Nübler-Jung K., Inversion of dorsoventral axis? Nature 371, 26 (1994). [DOI] [PubMed] [Google Scholar]
- 34.Su Y.-H., Chen Y.-C., Ting H.-C., Fan T.-P., Lin C.-Y., Wang K.-T., Yu J.-K., BMP controls dorsoventral and neural patterning in indirect-developing hemichordates providing insight into a possible origin of chordates. Proc. Natl. Acad. Sci. U.S.A. 116, 12925–12932 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Onai T., Yu J.-K., Blitz I. L., Cho K. W. Y., Holland L. Z., Opposing Nodal/Vg1 and BMP signals mediate axial patterning in embryos of the basal chordate amphioxus. Dev. Biol. 344, 377–389 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhang H., Chen S., Shang C., Wu X., Wang Y., Li G., Interplay between Lefty and Nodal signaling is essential for the organizer and axial formation in amphioxus embryos. Dev. Biol. 456, 63–73 (2019). [DOI] [PubMed] [Google Scholar]
- 37.Meister L., Escriva H., Bertrand S., Functions of the FGF signalling pathway in cephalochordates provide insight into the evolution of the prechordal plate. Development 149, dev200252 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tominaga H., Satoh N., Ueno N., Takahashi H., Enhancer activities of amphioxus Brachyury genes in embryos of the ascidian, Ciona intestinalis. Genesis 56, e23240 (2018). [DOI] [PubMed] [Google Scholar]
- 39.Fritzenwanker J. H., Gerhart J., Freeman R. Jr., Lowe C., The Fox/Forkhead transcription factor family of the hemichordate Saccoglossus kowalevskii. Evodevo 5, 17–17 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Tu Q., Brown C. T., Davidson E. H., Oliveri P., Sea urchin Forkhead gene family: Phylogeny and embryonic expression. Dev. Biol. 300, 49–62 (2006). [DOI] [PubMed] [Google Scholar]
- 41.Duboc V., Röttinger É., Besnardeau L., Lepage T., Nodal and BMP2/4 signaling organizes the oral-aboral axis of the sea urchin embryo. Dev. Cell 6, 397–410 (2004). [DOI] [PubMed] [Google Scholar]
- 42.Satoh N., Hisata K., Foster S., Morita S., Nishitsuji K., Oulhen N., Tominaga H., Wessel G. M., A single-cell RNA-seq analysis of Brachyury-expressing cell clusters suggests a morphogenesis-associated signal center of oral ectoderm in sea urchin embryos. Dev. Biol. 483, 128–142 (2022). [DOI] [PubMed] [Google Scholar]
- 43.Röttinger E., DuBuc T. Q., Amiel A. R., Martindale M. Q., Nodal signaling is required for mesodermal and ventral but not for dorsal fates in the indirect developing hemichordate, Ptychodera flava. Biol. Open 4, 830–842 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Grande C., Martín-Durán J. M., Kenny N. J., Truchado-García M., Hejnol A., Evolution, divergence and loss of the Nodal signalling pathway: New data and a synthesis across the Bilateria. Int. J. Dev. Biol. 58, 521–532 (2014). [DOI] [PubMed] [Google Scholar]
- 45.Mazet F., Yu J. K., Liberles D. A., Holland L. Z., Shimeld S. M., Phylogenetic relationships of the Fox (Forkhead) gene family in the Bilateria. Gene 316, 79–89 (2003). [DOI] [PubMed] [Google Scholar]
- 46.Chen X., Ji Z., Webber A., Sharrocks A. D., Genome-wide binding studies reveal DNA binding specificity mechanisms and functional interplay amongst Forkhead transcription factors. Nucleic Acids Res. 44, 1566–1578 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhou S., Zawel L., Lengauer C., Kinzler K. W., Vogelstein B., Characterization of human FAST-1, a TGFβ activin signal transducer. Mol. Cell 2, 121–127 (1998). [DOI] [PubMed] [Google Scholar]
- 48.Materna S. C., Howard-Ashby M., Gray R. F., Davidson E. H., The C2H2 zinc finger genes of Strongylocentrotus purpuratus and their expression in embryonic development. Dev. Biol. 300, 108–120 (2006). [DOI] [PubMed] [Google Scholar]
- 49.Chou C., Lin C.-Y., Lin C.-Y., Wang A., Fan T.-P., Wang K.-T., Yu J.-K., Su Y.-H., Tracing the evolutionary origin of chordate somites in the hemichordate Ptychodera flava. Integr. Comp. Biol. 64, 1226–1242 (2024). [DOI] [PubMed] [Google Scholar]
- 50.Schwaiger M., Andrikou C., Dnyansagar R., Murguia P. F., Paganos P., Voronov D., Zimmermann B., Lebedeva T., Schmidt H. A., Genikhovich G., Benvenuto G., Arnone M. I., Technau U., An ancestral Wnt-Brachyury feedback loop in axial patterning and recruitment of mesoderm-determining target genes. Nat. Ecol. Evol. 6, 1921–1939 (2022). [DOI] [PubMed] [Google Scholar]
- 51.Schifferl D., Scholze-Wittler M., Villaronga Luque A., Pustet M., Wittler L., Veenvliet J. V., Koch F., Herrmann B. G., Genome-wide identification of notochord enhancers comprising the regulatory landscape of the brachyury locus in mouse. Development 150, dev202111 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kemmler C. L., Smolikova J., Moran H. R., Mannion B. J., Knapp D., Lim F., Czarkwiani A., Hermosilla Aguayo V., Rapp V., Fitch O. E., Bötschi S., Selleri L., Farley E., Braasch I., Yun M., Visel A., Osterwalder M., Mosimann C., Kozmik Z., Burger A., Conserved enhancers control notochord expression of vertebrate Brachyury. Nat. Commun. 14, 6594 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Fan T.-P., Ting H.-C., Yu J.-K., Su Y.-H., Reiterative use of FGF signaling in mesoderm development during embryogenesis and metamorphosis in the hemichordate Ptychodera flava. BMC Evol. Biol. 18, 120 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Chen J.-H., Luo Y.-J., Su Y.-H., The dynamic gene expression patterns of transcription factors constituting the sea urchin aboral ectoderm gene regulatory network. Dev. Dyn. 240, 250–260 (2011). [DOI] [PubMed] [Google Scholar]
- 55.Chen C.-P., Tseng C.-H., Chen B.-Y., The development of the catecholaminergic nervous system in starfish and sea cucumber larvae. Zool. Stud. 34, 248–256 (1995). [Google Scholar]
- 56.Buels R., Yao E., Diesh C. M., Hayes R. D., Munoz-Torres M., Helt G., Goodstein D. M., Elsik C. G., Lewis S. E., Stein L., Holmes I. H., JBrowse: A dynamic web platform for genome visualization and analysis. Genome Biol. 17, 66 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Pálfy M., Schulze G., Valen E., Vastenhouw N. L., Chromatin accessibility established by Pou5f3, Sox19b and Nanog primes genes for activity during zebrafish genome activation. PLOS Genet. 16, e1008546 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Schwaiger M., Schönauer A., Rendeiro A. F., Pribitzer C., Schauer A., Gilles A. F., Schinko J. B., Renfer E., Fredman D., Technau U., Evolutionary conservation of the eumetazoan gene regulatory landscape. Genome Res. 24, 639–650 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Sebé-Pedrós A., Ballaré C., Parra-Acero H., Chiva C., Tena J. J., Sabidó E., Gómez-Skarmeta J. L., Di Croce L., Ruiz-Trillo I., The dynamic regulatory genome of Capsaspora and the origin of animal multicellularity. Cell 165, 1224–1237 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Pérez-Posada A., Lin C.-Y., Fan T.-P., Lin C.-Y., Chen Y.-C., Gómez-Skarmeta J. L., Yu J.-K., Su Y.-H., Tena J. J., Hemichordate cis-regulatory genomics and the gene expression dynamics of deuterostomes. Nat. Ecol. Evol. 8, 2213–2227 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Martin F. J., Amode M. R., Aneja A., Austine-Orimoloye O., Azov A. G., Barnes I., Becker A., Bennett R., Berry A., Bhai J., Bhurji S. K., Bignell A., Boddu S., Branco Lins P. R., Brooks L., Ramaraju S. B., Charkhchi M., Cockburn A., Da Rin Fiorretto L., Davidson C., Dodiya K., Donaldson S., El Houdaigui B., El Naboulsi T., Fatima R., Giron C. G., Genez T., Ghattaoraya G. S., Martinez J. G., Guijarro C., Hardy M., Hollis Z., Hourlier T., Hunt T., Kay M., Kaykala V., Le T., Lemos D., Marques-Coelho D., Marugán J. C., Merino G. A., Mirabueno L. P., Mushtaq A., Hossain S. N., Ogeh D. N., Sakthivel M. P., Parker A., Perry M., Piližota I., Prosovetskaia I., Pérez-Silva J. G., Salam A. I. A., Saraiva-Agostinho N., Schuilenburg H., Sheppard D., Sinha S., Sipos B., Stark W., Steed E., Sukumaran R., Sumathipala D., Suner M.-M., Surapaneni L., Sutinen K., Szpak M., Tricomi F. F., Urbina-Gómez D., Veidenberg A., Walsh T. A., Walts B., Wass E., Willhoft N., Allen J., Alvarez-Jarreta J., Chakiachvili M., Flint B., Giorgetti S., Haggerty L., Ilsley G. R., Loveland J. E., Moore B., Mudge J. M., Tate J., Thybert D., Trevanion S. J., Winterbottom A., Frankish A., Hunt S. E., Ruffier M., Cunningham F., Dyer S., Finn R. D., Howe K. L., Harrison P. W., Yates A. D., Flicek P., Ensembl 2023. Nucleic Acids Res. 51, D933–D941 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Putnam N. H., Srivastava M., Hellsten U., Dirks B., Chapman J., Salamov A., Terry A., Shapiro H., Lindquist E., Kapitonov V. V., Jurka J., Genikhovich G., Grigoriev I. V., Lucas S. M., Steele R. E., Finnerty J. R., Technau U., Martindale M. Q., Rokhsar D. S., Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization. Science 317, 86–94 (2007). [DOI] [PubMed] [Google Scholar]
- 63.Lin C.-Y., Marlétaz F., Pérez-Posada A., Martínez-García P. M., Schloissnig S., Peluso P., Conception G. T., Bump P., Chen Y.-C., Chou C., Lin C.-Y., Fan T.-P., Tsai C.-T., Gómez Skarmeta J. L., Tena J. J., Lowe C. J., Rank D. R., Rokhsar D. S., Yu J.-K., Su Y.-H., Chromosome-level genome assemblies of 2 hemichordates provide new insights into deuterostome origin and chromosome evolution. PLOS Biol. 22, e3002661 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Langmead B., Salzberg S. L., Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Rangwala S. H., Kuznetsov A., Ananiev V., Asztalos A., Borodin E., Evgeniev V., Joukov V., Lotov V., Pannu R., Rudnev D., Shkeda A., Weitz E. M., Schneider V. A., Accessing NCBI data using the NCBI Sequence Viewer and Genome Data Viewer (GDV). Genome Res. 31, 159–169 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Fisher M., James-Zorn C., Ponferrada V., Bell A. J., Sundararaj N., Segerdell E., Chaturvedi P., Bayyari N., Chu S., Pells T., Lotay V., Agalakov S., Wang D. Z., Arshinoff B. I., Foley S., Karimi K., Vize P. D., Zorn A. M., Xenbase: Key features and resources of the Xenopus model organism knowledgebase. Genetics 224, 1–13 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Brozovic M., Dantec C., Dardaillon J., Dauga D., Faure E., Gineste M., Louis A., Naville M., Nitta K. R., Piette J., Reeves W., Scornavacca C., Simion P., Vincentelli R., Bellec M., Aicha S. B., Fagotto M., Guéroult-Bellone M., Haeussler M., Jacox E., Lowe E. K., Mendez M., Roberge A., Stolfi A., Yokomori R., Brown C. T., Cambillau C., Christiaen L., Delsuc F., Douzery E., Dumollard R., Kusakabe T., Nakai K., Nishida H., Satou Y., Swalla B., Veeman M., Volff J.-N., Lemaire P., ANISEED 2017: Extending the integrated ascidian database to the exploration and evolutionary comparison of genome-scale datasets. Nucleic Acids Res. 46, D718–D725 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Madgwick A., Magri M. S., Dantec C., Gailly D., Fiuza U.-M., Guignard L., Hettinger S., Gomez-Skarmeta J. L., Lemaire P., Evolution of embryonic cis-regulatory landscapes between divergent Phallusia and Ciona ascidians. Dev. Biol. 448, 71–87 (2019). [DOI] [PubMed] [Google Scholar]
- 69.Satoh N., An aboral-dorsalization hypothesis for chordate origin. Genesis 46, 614–622 (2008). [DOI] [PubMed] [Google Scholar]
- 70.Telmer C., Karimi K., Chess M. M., Agalakov S., Arshinoff B. I., Lotay V., Wang D. Z., Chu S., Pells T. J., Vize P., Hinman V., Ettensohn C., Echinobase: A resource to support the echinoderm research community. Genetics 227, iyae002 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Gramates L. S., Agapite J., Attrill H., Calvi B. R., Crosby M. A., Dos Santos G., Goodman J. L., Goutte-Gattat D., Jenkins V. K., Kaufman T., Larkin A., Matthews B. B., Millburn G., Strelets V. B., FlyBase Consortium , FlyBase: A guided tour of highlighted features. Genetics 220, iyac035 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Butler J. E. F., Kadonaga J. T., The RNA polymerase II core promoter: A key component in the regulation of gene expression. Genes Dev. 16, 2583–2592 (2002). [DOI] [PubMed] [Google Scholar]
- 73.Juven-Gershon T., Kadonaga J. T., Regulation of gene expression via the core promoter and the basal transcriptional machinery. Dev. Biol. 339, 225–229 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Bucher P., Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J. Mol. Biol. 212, 563–578 (1990). [DOI] [PubMed] [Google Scholar]
- 75.Castro-Mondragon J. A., Riudavets-Puig R., Rauluseviciute I., Lemma R. B., Turchi L., Blanc-Mathieu R., Lucas J., Boddie P., Khan A., Manosalva Pérez N., Fornes O., Leung T. Y., Aguirre A., Hammal F., Schmelter D., Baranasic D., Ballester B., Sandelin A., Lenhard B., Vandepoele K., Wasserman W. W., Parcy F., Mathelier A., JASPAR 2022: The 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 50, D165–D173 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Woods D. B., Ghysdael J., Owen M., Identification of nucleotide preferences in DNA sequences recognised specifically by c-Ets-1 protein. Nucleic Acids Res. 20, 699–704 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Wei G.-H., Badis G., Berger M. F., Kivioja T., Palin K., Enge M., Bonke M., Jolma A., Varjosalo M., Gehrke A. R., Yan J., Talukder S., Turunen M., Taipale M., Stunnenberg H. G., Ukkonen E., Hughes T. R., Bulyk M. L., Taipale J., Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo. EMBO J. 29, 2147–2160 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Cameron R. A., Mahairas G., Rast J. P., Martinez P., Biondi T. R., Swartzell S., Wallace J. C., Poustka A. J., Livingston B. T., Wray G. A., Ettensohn C. A., Lehrach H., Britten R. J., Davidson E. H., Hood L., A sea urchin genome project: Sequence scan, virtual map, and additional resources. Proc. Natl. Acad. Sci. U.S.A. 97, 9514–9518 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Lee E. C., Yu D., Martinez de Velasco J., Tessarollo L., Swing D. A., Court D. L., Jenkins N. A., Copeland N. G., A highly efficient Escherichia coli-based chromosome engineering system adapted for recombinogenic targeting and subcloning of BAC DNA. Genomics 73, 56–65 (2001). [DOI] [PubMed] [Google Scholar]
- 80.Yu D., Ellis H. M., Lee E. C., Jenkins N. A., Copeland N. G., Court D. L., An efficient recombination system for chromosome engineering in Escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 97, 5978–5983 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Minor P. J., Clarke D. N., Andrade López J. M., Fritzenwanker J. H., Gray J., Lowe C. J., I-SceI Meganuclease-mediated transgenesis in the acorn worm, Saccoglossus kowalevskii. Dev. Biol. 445, 8–15 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Rast J. P., Transgenic manipulation of the sea urchin embryo. Methods Mol. Biol. 136, 365–373 (2000). [DOI] [PubMed] [Google Scholar]
- 83.Liu Y.-H., Lin T.-C., Hwang S.-P. L., Zebrafish Pax1a and Pax1b are required for pharyngeal pouch morphogenesis and ceratobranchial cartilage development. Mech. Dev. 161, 103598 (2020). [DOI] [PubMed] [Google Scholar]
- 84.Andrikou C., Pai C.-Y., Su Y.-H., Arnone M. I., Logics and properties of a genetic regulatory program that drives embryonic muscle development in an echinoderm. eLife 4, e07343 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Ikuta T., Chen Y.-C., Annunziata R., Ting H.-C., Tung C.-H., Koyanagi R., Tagawa K., Humphreys T., Fujiyama A., Saiga H., Satoh N., Yu J.-K., Arnone M. I., Su Y.-H., Identification of an intact ParaHox cluster with temporal colinearity but altered spatial colinearity in the hemichordate Ptychodera flava. BMC Evol. Biol. 13, 129 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Chang Y.-C., Pai C.-Y., Chen Y.-C., Ting H.-C., Martinez P., Telford M. J., Yu J.-K., Su Y.-H., Regulatory circuit rewiring and functional divergence of the duplicate admp genes in dorsoventral axial patterning. Dev. Biol. 410, 108–118 (2016). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figs. S1 to S8
Tables S1 to S8





