Skip to main content
eLife logoLink to eLife
. 2020 Nov 16;9:e59157. doi: 10.7554/eLife.59157

Conservation of peripheral nervous system formation mechanisms in divergent ascidian embryos

Joshua F Coulcher 1,, Agnès Roure 1,, Rafath Chowdhury 1, Méryl Robert 1, Laury Lescat 1,, Aurélie Bouin 1,§, Juliana Carvajal Cadavid 1, Hiroki Nishida 2, Sébastien Darras 1,
Editors: Shigehiro Kuraku3, Marianne E Bronner4
PMCID: PMC7710358  PMID: 33191918

Abstract

Ascidians with very similar embryos but highly divergent genomes are thought to have undergone extensive developmental system drift. We compared, in four species (Ciona and Phallusia for Phlebobranchia, Molgula and Halocynthia for Stolidobranchia), gene expression and gene regulation for a network of six transcription factors regulating peripheral nervous system (PNS) formation in Ciona. All genes, but one in Molgula, were expressed in the PNS with some differences correlating with phylogenetic distance. Cross-species transgenesis indicated strong levels of conservation, except in Molgula, in gene regulation despite lack of sequence conservation of the enhancers. Developmental system drift in ascidians is thus higher for gene regulation than for gene expression and is impacted not only by phylogenetic distance, but also in a clade-specific manner and unevenly within a network. Finally, considering that Molgula is divergent in our analyses, this suggests deep conservation of developmental mechanisms in ascidians after 390 My of separate evolution.

Research organism: C. intestinalis, Other

Introduction

The formation of an animal during embryonic development is controlled by the exquisitely precise deployment of regulator and differentiation genes in space and time. This coordinated expression of developmental genes is controlled through gene regulatory networks (GRNs). It has been proposed that these networks are encoded in the genome and control body plan, organs, and cell type formation (Levine and Davidson, 2005). It is thought that, in the course of evolution, essential GRNs or sub-modules have been conserved and control the formation of homologous structures, and that rewiring of the GRNs explains acquisition of novelties and morphological diversification (Davidson and Erwin, 2006). Understanding control of gene expression is central to comparative developmental approaches. Hence, specific domains of the non-coding genome called cis-regulatory modules (CRMs) that serve as docking platforms for transcription factors (TFs) (Buffry et al., 2016; Rebeiz and Tsiantis, 2017) have been deeply studied. It is generally considered that during evolution and species diversification the key functional role of CRMs leads to their persistence and indirectly contributes to maintaining genes together (synteny) and to the existence of conserved non-coding DNA. Indeed, the presence of TF-binding sites (TFBS) in the CRMs for genes with conserved expression and regulation defines regions of conserved sequence; and searching for conservation in non-coding DNA is often used to identify CRMs (phylogenetic footprint: defined as DNA sequence conservation between related species following alignment using tools such as lastz or lagan [Brudno et al., 2003; Schwartz et al., 2003]). Such conserved CRMs have been identified in closely related species but also to a fewer extent for very distantly related species (ultraconserved elements); and in many cases, gene regulation relies on the usage of a similar combination of TFBS.

By contrast, a number of comparative studies performed in several group of animals (chordates, insects, nematodes…) have revealed that development is flexible and that formation of homologous characters may rely on different mechanisms, a process known as developmental system drift (DSD) (Buffry et al., 2016; Arnold et al., 2014; Verster et al., 2014; True and Haag, 2001; Haag, 2014; Bradley et al., 2010; José-Edwards et al., 2015; Lusk and Eisen, 2010; Roure et al., 2014). In some cases, key GRN nodes have changed (for example the role of Snail/Slug genes in vertebrate neural crest [Nieto, 2018]). Focusing on GRN edges — gene expression regulation — various situations have been described, and conservation of CRM sequence at the DNA level does not necessarily correlate with conservation of activity. First, conservation of DNA sequence across species, does not imply a cis-regulatory activity (for example, ultraconserved elements do not necessarily function as CRMs or may be dispensable for gene expression) or an identical regulatory logic (conserved activity may not involve the same TFBS; conserved CRMs may control different expression patterns depending on the species). On the contrary, DNA sequence conservation is not a necessary feature of CRMs. First, bona fide CRMs may not be conserved, even between closely related species. Second, orthologous CRMs with conserved activity harboring similar combinations of TFBS exist despite an absence of sequence conservation because of extensive turnover of TFBS.

Ascidians are marine invertebrates that are particularly interesting for investigating the evolution of gene regulation during embryonic development. They belong to the tunicate phylum, and together with vertebrates and cephalochordates (amphioxus) they constitute the chordate super-phylum (Lemaire, 2011; Satoh, 1994; Satoh et al., 2014). These animals share a body plan characterized by a notochord and a dorsal neural tube during embryonic life. Ascidians, however, took a significantly different evolutionary path from other chordates resulting in divergent morphological, embryological, and genomic features. The development of these animals is fast (hatching of the larva after 18 hr at 18°C in Ciona robusta [Hotta et al., 2007]) and stereotyped with very few cells (around 100 cells in gastrulae and 2500 cells in larvae [Nishida, 1986; Yamada and Nishida, 1999]), and ascidian genomes have undergone compaction and extensive rearrangements when compared to vertebrates (Dehal et al., 2002; Brozovic et al., 2017). In addition, ascidians have extensively diversified (around 3000 species) and their genomes have been reshuffled to the point that there is very little DNA conservation outside of the coding parts of the genome (except for closely related species such as the ones from the same genus) (Shenkar and Swalla, 2011; Dardaillon et al., 2020). Yet, their embryos are virtually identical. This paradoxical situation represents an excellent case to study the evolution and diversification of developmental mechanisms, CRMs and DSD (José-Edwards et al., 2015; Roure et al., 2014; Stolfi et al., 2014; Madgwick et al., 2019; Colgan et al., 2019; Takahashi et al., 1999; Hudson et al., 2011; Racioppi et al., 2017; Johnson et al., 2004; Brown et al., 2007; Oda-Ishii et al., 2005). Ascidians are simple tractable organisms and are excellent models for functional genomics, in particular with plasmid DNA electroporation that allows the easy generation of thousands of transient transgenic embryos. Moreover, several species at various phylogenetic distances with sequenced and annotated genome are amenable to experimentation (Dardaillon et al., 2020; Kourakis and Smith, 2015).

Several in details developmental studies have already been performed in ascidians, and the comparative conclusions are contrasted (reviewed in Lemaire, 2011; Kourakis and Smith, 2015; Hudson and Yasuo, 2008; Lemaire et al., 2008; Lemaire and biology, 2006; Satoh, 2014; Razy-Krajka and Stolfi, 2019). First, comparisons with vertebrates suggest that the 550 My of separate evolution has led to significant differences in the early specification of major chordate tissues, such as the notochord or the central nervous system. But deep similarities have been uncovered in the GRN controlling heart or placode formation for example. Second, comparisons between distantly related ascidian species suggest that although the embryological and cellular processes are highly similar, the developmental regulators (TFs and signaling molecules) may change. Third, while conservation of developmental gene expression has been reported even at large phylogenetic distance, the mechanisms for such conservation relies on shared regulatory mechanism (i.e. conserved but shuffled TFBS in possibly non-alignable CRMs), or by contrast on different mechanisms (extensive DSD).

Comparative studies within ascidians are largely based on data from the most studied species Ciona intestinalis that have been compared with equivalent results obtained from another species belonging to a different family. Importantly, the estimated divergence times with C. intestinalis are considerable (Delsuc et al., 2018): 110 My for Ciona savignyi, 275 My for Phallusia mammillata or Corella inflata and 390 My for Halocynthia roretzi or Molgula species.

Here, we have tried to expand the phylogenetic range by comparing gene expression regulation in several ascidian species, mainly C. intestinalis, P. mammillata and Molgula appendiculata. We have studied a subset of the GRN controlling caudal peripheral nervous system (PNS) formation. Caudal PNS formation is well understood in C. intestinalis both at the level of the developmental and cellular processes, and at the level of the molecular regulators (Roure et al., 2014; Pasini et al., 2006; Pasini et al., 2012; Roure and Darras, 2016; Feinberg et al., 2019; Waki et al., 2015; Chen et al., 2011; Joyce Tang et al., 2013; Horie et al., 2008; Candiani et al., 2005; Torrence and Cloney, 1982). First, dorsal and ventral epidermis midlines are induced as neurogenic territories by Fgf and Bmp signals from the underlying endomesodermal cells, respectively. Then, the number of glutamatergic PNS neurons (caudal epidermal sensory neurons or CESNs) that form in these neurogenic midlines is controlled by the Notch pathway. Molecular markers including a number of developmental regulators have been described for these two key phases of caudal PNS formation. However, their functions and direct interactions have been determined for only a limited number (Figure 1—figure supplement 1A). Focusing on seven midline TFs, we identified CRMs that are active in neurogenic midlines from both C. intestinalis and P. mammillata. Reciprocal cross-species transcriptional assays pointed to a strong conservation of the cis-regulatory logic between these species despite a lack of sequence conservation in these CRMs. Conserved activity was also observed in two additional species that both belong to the Phlebobranchia order. However, conserved activity was observed for only a minority of CRMs for some nodes of this GRN in M. appendiculata that belongs to the Solidobranchia order. Surprisingly, CRMs isolated from another Solidobranchia, Halocynthia roretzi, did show remarkable activity in the P. mammillata midlines but not in M. appendiculata. Overall, our results suggest deep conservation of gene regulation in caudal PNS formation within ascidians and advocate for broad phylogenetic sampling in comparative studies.

Results

Identification of midline CRMs in C. intestinalis

As described above, caudal PNS formation is initiated by two inductive events: Fgf9/16/20 as the inducer of the dorsal neurogenic midline (DML) and Admp as the inducer of the ventral neurogenic midline (VML). While function and regulatory interactions have been determined for several genes, direct interactions, and associated CRMs are unknown for the most part (Figure 1—figure supplement 1A; Roure et al., 2014; Pasini et al., 2006; Roure and Darras, 2016; Waki et al., 2015; Joyce Tang et al., 2013; Bertrand et al., 2003). Here, we have focused our work on seven TFs that are expressed in both ventral and dorsal caudal neurogenic midlines (VDML) and for which we have already described regulatory interactions that generated a provisional GRN (Pasini et al., 2006; Roure and Darras, 2016). These genes are — ordered by their onset of expression in VDML — Msx, Ascl.b, Klf1/2/4, Nkxtun3, Tox, Dlx.c and Bhlhtun1. Msx, Nkxtun3 and Dlx.c code for homeodomain-containing TFs; Ascl.b and Bhlhtun1 for bHLH TFs; Klf1/2/4 for a C2H2 Zn finger TF; and Tox for a HMG TF. We aimed at identifying regions of genomic DNA that behave as midline CRMs. Previous work in the ascidian community has shown that selecting a few kb in conserved non-coding DNA generally upstream of the gene of interest had high chance of identifying most, if not all, regulatory regions. We thus selected upstream regions of up to 7 kb containing phylogenetic footprints between C. robusta and C. savignyi that we placed upstream of the minimal promoter of the Ciinte.Fog gene and LacZ as a reporter, and tested in vivo before further reduction in size. After our work was performed, open chromatin regions from ATAC-seq data were identified from whole embryos up to neurula stages (Madgwick et al., 2019) and it turned out that most of the midline CRMs that we identified correspond to conserved regions with enrichment in ATAC-seq (Figures 1, 2, 3, Supplementary file 1 and 2). Tissue-specific ATAC-seq data should thus have a high predictive power and limit the time-consuming promoter-bashing approach in future experiments seeking for active CRMs.

Figure 1. CRMs controlling Ciinte.Msx expression in VDML.

(A) Snapshot of the Ciinte.Msx locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). (B–E) Representative examples of X-gal stained embryos at late gastrula stages (B, D) and early tailbud stages (C, E) following C. intestinalis embryos electroporation of Ciinte.Msx-Intergenic (B, C) and Ciinte.Msx-up1 (D, E). Embryos are shown in dorsal view (B, D) and in lateral view with dorsal to the top (C, E), and anterior to the left. Scale bar: 100 μm. (F) Schematic representation of the various constructs and their activity at early tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined; N indicates the number of independent experiments).

Figure 1.

Figure 1—figure supplement 1. Developmental regulators of caudal PNS in C. intestinalis.

Figure 1—figure supplement 1.

(A) Provisional GRN for caudal PNS specification in C. intestinalis. The genes depicted and their interactions are based on previous publications (Roure et al., 2014; Pasini et al., 2006; Roure and Darras, 2016; Waki et al., 2015; Joyce Tang et al., 2013; Bertrand et al., 2003) and hypotheses described in the Materials and methods section. Genes whose requirement for caudal PNS formation has been shown by loss-of-function are in bold. Demonstrated direct regulations are shown in red. (B) List of matrices and consensus motifs used to identify TFBS in CRMs.
Figure 1—figure supplement 2. Model for Ciinte.Msx regulation.

Figure 1—figure supplement 2.

(A) Identification of putative TFBS for candidate upstream factors in Ciinte. Msx-up10 aligned with its counterpart from C. savignyi. All putative sites for ventral factors (SBE, BRE, Tbx2/3, Nkxtun1, Nkx2-3/5/6 and Irx.c) and dorsal factors (Msx and Su(H)/Rbpj) have been mapped, but only conserved sites are shown: 4 BRE, 3 SBE, 1 Irx.c, 1 Tbx2/3 and 2 Msx. The region deleted in up2 and up11 abuts a BRE and contains a SBE; the loss of these sites might thus be responsible of the absence of activity in the VML. The region deleted in up8 and required for VDML activity contains 1 BRE, 1 Irx.c, 1 Tbx2/3 and 1 Msx sites. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used. (B) Working model for Msx transcriptional regulation in the VDML. Msx expression is initiated in DML precursors at the 64-cell stage (Roure et al., 2014) through the proximal/early CRM (denoted b6.5 line, green) activated by Otx and Nodal (via Smad2/3). The Msx protein produced via this regulation would activate Msx transcritption in the DML at gastrula/neurula stages via the distal/late CRM (denoted up10, orange). The same distal CRM would activate Msx expression in the VML directly by Bmp signaling activated by the Admp ligand (via the SBEs and BREs) or Admp targets (Irx.c and Tbx2/3).

Figure 2. CRMs controlling Ciinte.Ascl.b expression in VDML.

(A) Snapshot of the Ciinte.Ascl.b locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). (B–C) Representative examples of X-gal stained embryos at early tailbud stages following C. intestinalis embryos electroporation of Ciinte.Ascl.b-do1 (B) And Ciinte.Ascl.b-do5 (C). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 100 μm. (D) Schematic representation of the various constructs and their activity at early tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined; N indicates the number of independent experiments).

Figure 2.

Figure 2—figure supplement 1. Putative TFBS.

Figure 2—figure supplement 1.

Identification of putative TFBS for candidate upstream factors in Ciinte.Ascl.b-do2 aligned with its counterpart from C. savignyi. All putative sites for ventral factors (SBE, BRE, Tbx2/3, Nkxtun1, Nkx2-3/5/6 and Irx.c) and dorsal factors (Msx and Su(H)/Rbpj) have been mapped, but only conserved sites are shown: 2 SBE, 3 Tbx2/3, 2 Nkx2-3/5/6 and 2 Msx sites. The region important for VML expression (deleted in do4) contains 2 Tbx2/3 and 1 Nkx2-3/5/6 sites. Since Ascl.b is expressed after Msx (Roure and Darras, 2016), it could be directly regulated by Msx in both VML and DML. However, the region required for VDML activity (deleted in do3) contains a Tbx2/3 site. This suggest that Ascl.b expression relies on additional unidentified factors. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.

Figure 3. CRMs with activity in the tail epidermis midlines for Ciinte.Klf1/2/4, Ciinte.Nkxtun3, Ciinte.Tox, and Ciinte.Dlx.c.

Representatives examples of X-gal stained embryos at tailbud stages following C. intestinalis embryos electroporation of C. intestinalis genomic regions for Klf1/2/4 (A, B), Nkxtun3 (C, D), Tox (E, F) and Dlx.c (G, H). For each gene, an example for the largest and the smallest regions with robust VDML activity are shown (the size of the region is shown between parentheses after the CRM's name). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 100 μm.

Figure 3.

Figure 3—figure supplement 1. CRMs controlling Ciinte.Klf1/2/4 expression in VDML.

Figure 3—figure supplement 1.

(Top) Snapshot of the Ciinte.Klf1/2/4 locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Schematic representation of the various constructs and their activity at tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Identification of putative TFBS for candidate upstream factors in Ciinte.Klf1/2/4-up7 (604 bp). Note that all predicted sites for Msx (6 sites) (Schwartz et al., 2003), Ascl.b (2 sites) (Davidson and Erwin, 2006), and Klf1/2/4 (2 sites) are depicted since this region does not align with the C. savignyi genome (see top panel). The abundance of sites for Msx makes it a likely activator. Up11 is a smaller derivative of up7 (235 bp fragment long) that behaves as a minimal VDML enhancer (Figure 3B) and is deleted in the up8 region (inactive). It is located 2.4 kb upstream of Klf1/2/4, corresponds to an ATAC-seq enrichment detected from late gastrula stages (Supplementary file 2) and contains the 2 Klf1/2/4 sites, suggesting it might be involved in autoregulation/maintenance. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.
Figure 3—figure supplement 2. CRMs controlling Ciinte.Nkxtun3 expression in VDML.

Figure 3—figure supplement 2.

(Top) Snapshot of the Ciinte.Nkxtun3 locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Schematic representation of the various constructs and their activity at tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Identification of putative TFBS for candidate upstream factors in Ciinte.Nkxtun3-up7 aligned with its counterpart from C. savignyi. All putative sites for Msx, Ascl.b, Klf1/2/4 and Nkxtun3 have been mapped, but only conserved sites are shown: 3 Msx, 1 Ascl.b, 2 Klf1/2/4, and 5 Nkxtun3 sites. Deletions on either end of up7 that abolish activity remove 1 Nkxtun3 site, and 1 Msx, 1 Klf1/2/4, and 1 Nkxtun3 sites, respectively. This is suggestive of activation by Msx and Klf1/2/4, and autoregulation/maintenance by Nkxtun3. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.
Figure 3—figure supplement 3. CRMs controlling Ciinte.Tox expression in VDML.

Figure 3—figure supplement 3.

(Top) Snapshot of the Ciinte.Tox locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Schematic representation of the various constructs and their activity at tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Identification of putative TFBS for candidate upstream factors in Ciinte.Tox-int1.1 aligned with its counterpart from C. savignyi. All putative sites for Msx, Ascl.b, Klf1/2/4 and Nkxtun3 have been mapped, but only conserved sites are shown: 1 Msx, 4 Ascl.b, 4 Klf1/2/4, and 3 Nkxtun3 sites. The smaller region Ciinte.Tox-int1.2 contains 3 Ascl.b and 2 Klf1/2/4 suggesting activation by Ascl.b and Klf1/2/4. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.
Figure 3—figure supplement 4. CRMs controlling Ciinte.Dlx.c expression in VDML.

Figure 3—figure supplement 4.

(Top) Snapshot of the Ciinte.Dlx.c locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Schematic representation of the various constructs and their activity at tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Identification of putative TFBS for candidate upstream factors in Ciinte.Dlx.c-up4 aligned with its counterpart from C. savignyi. All putative sites for Msx, Ascl.b, Klf1/2/4, Nkxtun3, and Dlx.c have been mapped, but only conserved sites are shown: 2 Msx, 1 Klf1/2/4, 3 Nkxtun3, and 2 Dlx.c sites. However, these sites are absent from the essential region deleted in up1. Consequently, unidentified factors are likely to regulate Ciinte.Dlx.c-up4 activity and Dlx.c expression. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.

Uncoupling of dorsal and ventral activities for the early genes

Msx

Msx is expressed in DML precursors from the 64-cell stage while it is expressed in the VML from the gastrula/neurula stages (Roure and Darras, 2016). We have previously described a Nodal/Otx-regulated CRM located immediately upstream of the gene responsible for the early expression in DML precursors (Ci-msxb-b6.5 line in Figure 1A; Roure et al., 2014). Here, we show that the entire upstream intergenic region is active early in the DML (Figure 1B) but also later in the VML (Figure 1C). A distal fragment, up1, adjacent to the early CRM was active in both VML and DML at tailbud stages but inactive at gastrula/neurula stages (Figure 1D–F). These results show that the seemingly continuous expression of Msx in the DML is in fact under the control of two separate CRMs that are likely regulated differently (see below). We next generated various deletions of this VDML CRM (Figure 1F and Supplementary file 2). We could observe VDML activity for most regions, with generally a decrease of strength when the size of the fragment was reduced, or a lack of activity. The smallest fragment active in VDML, up10, was 291 bp long. Interestingly, two overlapping regions, up2 and up11, were active only in the DML. Consequently, even if we did not identify a CRM specific for the VML, this deletion analysis suggests that ventral and dorsal expressions of Msx are uncoupled, a conclusion in agreement with the proposed role of Msx as an upstream regulator of the neurogenic GRN and an integrator of dorsal and ventral inducing cues (Roure and Darras, 2016). This scenario is further supported by the presence of conserved TFBS for candidate ventral and dorsal regulators (Figure 1—figure supplements 12; Material and methods). Ciinte.Msx-up10 contains two putative Msx-binding sites that could account for activation in the DML. Activation by Bmp signaling in the VML could be direct (4 Bmp-Responsive Elements (BRE) and 3 Smad-Binding Elements (SBE)) or indirect (sites for the ventral TFs Irx.c and Tbx2/3).

Ascl.b

Although the upstream intergenic region (0.8 kb) was not active (Supplementary file 2), the 0.7 kb region immediately downstream of the gene, do1, was robustly active in VDML (Figure 2 and Supplementary file 2). Fragments of this region were either not active, active in VDML or mainly active in the DML. Consequently, similarly to Msx, the VML and DML expression of Ascl.b appear to be distinctly regulated. While we identified putative sites for dorsal and ventral factors in the do2 fragment (Figure 2—figure supplement 1), they are not sufficient to explain VDML activity. This suggests that Ascl.b expression relies on additional unidentified factors.

Pan-midline CRMs for late midline genes

For the five other TF genes, we could also isolate genomic regions active in VDML, except for Bhlhtun1 for which the three tested regions were active in endogenous expression territories (palps, notochord…) but not in VDML (Supplementary file 2 and 3).

Klf1/2/4

A 6.6 kb region, up1, immediately upstream of the gene was strongly active in VDML (Figure 3A and Figure 3—figure supplement 1). Various deletions were generated; most were active in VDML but with a lower efficiency (Figure 3—figure supplement 1). Analysis of putative TFBS suggested activation by Msx and Ascl.b (Figure 3—figure supplement 1).

Nkxtun3

A 4.8 kb region, A, immediately upstream of the gene was strongly active in VDML (Figure 3C and Figure 3—figure supplement 2). Various deletions were generated; most were active in VDML but with a lower efficiency (Figure 3—figure supplement 2). We could identify a 326 bp fragment, up7, which was robustly active in VDML (Figure 3D). It is located 0.7 kb upstream of Nkxtun3 and corresponds to a conserved region with an ATAC-seq enrichment detected from late gastrula stages (Supplementary file 2). TFBS analysis suggested activation by the upstream factors Msx and Klf1/2/4 (Figure 3—figure supplement 2).

Tox

Tox is mainly transcribed as two isoforms that vary in their transcription start sites and first exons (Figure 3—figure supplement 3 and Supplementary file 2; Roure and Darras, 2016). We selected a 913 bp conserved region, int1, located upstream of one isoform and in the first intron of the other. This region is active in VDML (Figure 3E). Two shorter regions were generated: int1.1 (565 bp) that is more active (Figure 3F) and int1.2 (138 bp) that is less (Figure 3—figure supplement 3). Putative binding sites analysis suggested activation by the upstream factors Ascl.b and Klf1/2/4.

Dlx.c

A 4.8 kb region, A2, immediately upstream of the gene was strongly active in VDML (Figure 3G and Figure 3—figure supplement 4). Various deletions were generated; most were active in VDML but with a lower efficiency (Figure 3—figure supplement 4). The region 'upstream' (604 bp), located 1.3 kb upstream of Dlx.c, contains two blocks with higher sequence conservation and ATAC-seq enrichment, and was robustly active (Figure 3H). up7 (206 bp) containing the proximal block was active in VDML albeit at a low level (Figure 3—figure supplement 4). It does not contain any site for the putative upstream VDML regulators (Msx, Ascl.b, Klf1/2/4, and Nkxtun3), suggesting the implication of unidentified factor(s).

Interestingly, for all these four TFs coding genes (Klf1/2/4, Nkxtun3, Tox, and Dlx.c), starting from relatively large regions (up to 6.6 kb), we could reduce the size down to 200–300 bp and maintain VDML activity (Figure 3). Shortening the DNA regions was usually accompanied by a decrease in activity. However, all tested regions, despite some variability, had similar activity in both VML and DML; and we conclude that the expression of these genes is regulated by 'pan-midline' CRMs in agreement with their downstream positions in the PNS GRN (Roure and Darras, 2016).

Overall, results from promoter bashing in C. intestinalis led to three main conclusions: genomic support for VDML expression has been identified for most midline TFs (and this will be used to probe gene regulation conservation in other species); Msx and Ascl.b have been confirmed as upstream regulators of the caudal PNS GRN since dorsal and ventral expression are differently regulated; and downstream genes are regulated similarly dorsally and ventrally.

Tail PNS specification appears to be conserved in P. mammillata

We next wondered whether caudal PNS was specified by similar mechanisms in P. mammillata, an ascidian species that diverged from C. intestinalis around 275 My ago (Delsuc et al., 2018; Figure 4). We have previously shown that the early induction of DML fate by Fgf/Nodal signals is conserved and leads to the expression of the early DML precursors markers Msx and Dlk (previously known as Delta2) (Roure et al., 2014).

Figure 4. Expression and regulation of caudal PNS genes in Phallusia mammillata.

(A) Schematic summary of caudal PNS specification and molecular regulators in Ciona intestinalis (adapted from Roure et al., 2014; Pasini et al., 2006; Roure and Darras, 2016; Waki et al., 2015; Joyce Tang et al., 2013, more details can be found in Figure 1—figure supplement 1A). (B–L) Expression of the P. mammillata orthologs of the C. intestinalis caudal PNS genes. In situ hybridization of a late neurula for Msx (B), at early tailbud stages for Ascl.b (C), Klf1/2/4 (D), Nkxtun3 (E), Tox (F), Bhlhtun1 (G), Dlx.c (H) and Dlk (I); and at mid/late tailbud stages for Atonal (J), Pou4F (K), and Celf3.a (L). (M–Q) Expression of Phmamm-Klf1/2/4 at early tailbud stages in control embryos (M), following electroporation of pFog >Ciinte.Admp (N) or pFog >Ciinte.Noggin (P); or following treatment with Bmp4 protein (O) or Dorsomorphin (Q). (R–U) Expression of Phmamm.Pou4F at late mid/late tailbud stages in control embryos (R), following treatment with Bmp4 protein (S), DAPT (T) or a combination of Bmp4 and DAPT (U). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bars: 50 μm.

Figure 4.

Figure 4—figure supplement 1. Expression pattern of caudal PNS genes in Phallusia mammillata.

Figure 4—figure supplement 1.

In situ hybridization results at several developmental stages for the P. mammillata orthologs of the C. intestinalis caudal PNS genes: Msx, Ascl.b, Klf1/2/4, Nkxtun3, Bhlhtun1, Tox, Dlx.c, Dlk, Pou4F, Atonal, and Celf3.a. Scale bar: 50 μm.

Expression and regulation of caudal PNS genes

By in situ hybridization, we first determined the expression pattern of orthologous genes for the caudal PNS genes (for each gene, we could identify a single ortholog in the Phallusia genome). Importantly, we found that the seven neurogenic TFs, which are the focus of the present study, were expressed in VDML like in Ciona and with apparently the same temporal sequence (Figure 4B–H and Figure 4—figure supplement 1). Similarly, the neuronal genes Dlk, Atonal, Pou4F, and Celf3.a were also expressed as spaced spots in VDML, likely corresponding to CESNs (Figure 4I–L and Figure 4—figure supplement 1). Interestingly, other sites of expression (palps, notochord, trunk neurons, central nervous system…) were also conserved (Figure 8—figure supplement 3).

We next determined whether Bmp and Notch signaling pathways could regulate caudal PNS formation. Treatment of whole embryos with zebrafish recombinant Bmp4 protein from the eight-cell stage led to ectopic expression in the lateral tail epidermis of Msx (BSA control: 20/20 embryos in VDML, N = 1; 150 ng/ml Bmp4: 27/27 embryos with ectopic expression, N = 1), Klf1/2/4 (BSA control: 100% embryos in VDML, N = 9, n = 172; 150 ng/ml Bmp4: 100% embryos with ectopic expression, N = 6, n = 127; Figure 4M,O), and Pou4F (BSA control: 100% embryos in VDML, N = 4, n = 101; 150 ng/ml Bmp4: 94% embryos with ectopic expression, N = 4, n = 95; Figure 4R,S). Overexpression of the Ciona Bmp ligand Admp in the ectoderm by electroporation led also to ectopic Klf1/2/4 expression (pFog >Venus control: 100% embryos in VDML, N = 4, n = 75; pFog >Ciinte.Admp: 71% embryos with ectopic expression, N = 1, n = 14; Figure 4M,N). Bmp inhibition using the pharmacological inhibitor Dorsomorphin had variable effects depending on the experiment, but we observed, as in Ciona, repression of Klf1/2/4 expression in the VML (BSA control: 100% embryos in VDML, N = 9, n = 172; 20 μM Dorsomorphin: 5% to 70% embryos with VML repression with an average of 37%, N = 9, n = 200; Figure 4M,Q). Similarly overexpression of the secreted Bmp antagonist Noggin from Ciona led to VML repression of Klf1/2/4 expression (pFog >Venus control: 100% embryos in VDML, N = 4, n = 75; pFog >Ciinte.Noggin: 0% to 80% embryos with VML repression with an average of 29%, N = 4, n = 109; Figure 4M,P). Inhibition of Notch signaling with the pharmacological inhibitor DAPT led to a massive increase of the Pou4F-positive cells in VDML; and when combined with Bmp4 treatment, the cells were found throughout the tail epidermis (BSA control: 100% embryos in VDML, N = 4, n = 101; 25 μM DAPT: 99% embryos with ectopic expression in VDML, N = 3, n = 94; 25 μM DAPT + 150 ng/ml Bmp4: 93% embryos with ectopic expression, N = 4, n = 118; Figure 4R,T,U).

Identification of midline CRMs

To define VDML CRMs in P. mammillata, we undertook the same approach as the one performed in C. intestinalis but with a more limited generation of deletion constructs. We selected regions containing blocks of sequence conservation between P. mammillata and P. fumigata that we placed upstream of the minimal promoter for the Ciinte.Fog gene and LacZ as a reporter. Out of the seven targeted genes, we were unable to isolate CRMs with VDML activity for Ascl.b and Bhlhtun1 (Supplementary file 4 and 5).

Msx

Similarly to the situation in C. intestinalis, we have previously isolated Pm-msxb-b6.5 line, a CRM immediately upstream of Msx that is active early in the DML precursors (b6.5 lineage) (Roure et al., 2014). The upstream intergenic region that contains this CRM was active at tailbud stages in VDML (Figure 5A and Figure 5—figure supplement 1), and allowed to define a distal CRM, LED, that was active in VDML and corresponds to a conserved region with a weak ATAC-seq signal (Figure 5—figure supplement 1 and Supplementary file 5). The topology of the Msx locus is strikingly similar between C. intestinalis and P. mammillata: there is local synteny (the gene upstream of Msx is orthologous: KH2012:KH.C2.808 in C. intestinalis and Phmamm.g00005894 in P. mammillata) and the small intergenic region contains a proximal CRM active early in DML precursors and a distal CRM active late in VDML (Figure 1 and Figure 5—figure supplement 1). TFBS analysis of the LED fragment suggested that Phmamm.Msx could be regulated like Ciinte.Msx: a direct target of Bmp signaling in the VML and of Msx in the DML (Figure 5—figure supplement 1).

Figure 5. CRMs with activity in the tail epidermis midlines for Phmamm.Msx, Phmamm.Klf1/2/4, Phmamm.Nkxtun3, Phmamm.Tox, and Phmamm.Dlx.c.

Representatives examples of X-gal staining at tailbud stages following P. mammillata embryos electroporation of P. mammillata genomic regions for Msx (A), Klf1/2/4 (B), Nkxtun3 (C), Tox (D), and Dlx.c (E). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm.

Figure 5.

Figure 5—figure supplement 1. CRMs controlling Phmamm.Msx expression in VDML.

Figure 5—figure supplement 1.

(Top) Snapshot of the Phmamm.Msx locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between P. mammillata and P. fumigata (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Representative examples of X-gal staining at tailbud stages following P. mammillata embryos electroporation of P. mammillata genomic regions for Msx (Phmamm.Msx-UP (same picture as Figure 5A) and Phmamm.Msx-LED). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm. Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Identification of putative TFBS for candidate upstream factors in Phmamm.Msx-LED aligned with its counterpart from P. fumigata. All putative sites for ventral factors (SBE, BRE, Tbx2/3, Nkxtun1, Nkx2-3/5/6 and Irx.c) and dorsal factors (Msx and Su(H)/Rbpj) have been mapped, but only conserved sites are shown: 5 SBE, 2 BRE, 1 Nkxtun1, 1 Nkx2-3/5/6, and 1 Msx sites. When comparing with Ciinte.Msx-up10 (Figure 1—figure supplement 2), we found shared sites for direct activation by Bmp signaling (SBE and BRE), and for activation by Msx itself. However, sites for ventral TFs were different: Nkxtun1 and Nkx2-3/5/6 in Phmamm.Msx-LED and Irx.c and Tbx2/3 in Ciinte.Msx-up10. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.
Figure 5—figure supplement 2. CRMs controlling Phmamm.Klf1/2/4 and Phmamm.Nkxtun3 expression in VDML.

Figure 5—figure supplement 2.

Snapshots of the Phmamm.Klf1/2/4 and Phmamm.Nkxtun3 loci depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between P. mammillata and P. fumigata (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Representative examples of X-gal staining at tailbud stages following P. mammillata embryos electroporation of P. mammillata genomic regions for Klf1/2/4 (Phmamm.Klf1/2/4-A (same picture as Figure 5B) and Phmamm.Klf1/2/4-A3) and Nkxtun3 (Phmamm.Nkxtun3-A (same picture as Figure 5C), Phmamm.Nkxtun3-A1 and Phmamm.Nkxtun3-A3). Note that two separate VDML CRMs have been identified for Nkxtun3: a distal one, Phmamm.Nkxtun3-A1, and a proximal one, Phmamm.Nkxtun3-A3. This could be a case of redundant or shadow enhancers, but the distal CRM was preferentially active in posterior VDML while the proximal one did not show differential activity along the antero-posterior axis. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm. Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments).
Figure 5—figure supplement 3. CRMs controlling Phmamm.Tox expression in VDML.

Figure 5—figure supplement 3.

(Top) Snapshot of the Phmamm.Tox locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between P. mammillata and P. fumigata (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Representative examples of X-gal staining at tailbud stages following P. mammillata embryos electroporation of P. mammillata genomic regions for Tox (Phmamm.Tox-A (same picture as Figure 5D), Phmamm.Tox-A1, Phmamm.Tox-A1.1 and Phmamm.Tox-A2). Phmamm.Tox-A (1093 bp) was the longest region tested for Phmamm-Tox, and it was found active in tail muscle in addition to VDML. Both activities could be separated into two sub-domains of Phmamm.Tox-A: Phmamm.Tox-A1 (VDML) and Phmamm.Tox-A2 (muscle). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm. Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Identification of putative TFBS for candidate upstream factors in Phmamm.Tox-A1 aligned with its counterpart from P. fumigata. All putative sites for Msx, Ascl.b, Klf1/2/4, and Nkxtun3 have been mapped, but only conserved sites are shown: 1 Msx, 3 Ascl.b, 1 Klf1/2/4, and 2 Nkxtun3 sites. Comparison with Ciinte.Tox-int1.2 (Figure 3—figure supplement 3) was suggestive of a shared regulation by Ascl.b. Note that the size of the highlighted site for of a given TF may vary depending on the matrix used.
Figure 5—figure supplement 4. CRMs controlling Phmamm.Dlx.c expression in VDML.

Figure 5—figure supplement 4.

Snapshot of the Phmamm.Dlx.c locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between P. mammillata and P. fumigata (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Representative examples of X-gal staining at tailbud stages following P. mammillata embryos electroporation of P. mammillata genomic regions for Dlx.C (Phmamm.Dlx.c-A (same picture as Figure 5E), and Phmamm.Dlx.c-A3.2). Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm. Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments).
Klf1/2/4

We first isolated a 6.7 kb conserved region immediately upstream of Klf1/2/4, A, which was strongly active in VDML (Figure 5B, Figure 5—figure supplement 2 and Supplementary file 5). When subdivided into three large pieces, two were active in VDML, suggesting a possible redundant regulation (Figure 5—figure supplement 2). The proximal region A3 (2.5 kb) was the region with strongest activity.

Nkxtun3

We first isolated a 7.2 kb conserved region 2 kb upstream of Nkxtun3, A, which was strongly active in VDML (Figure 5C, Figure 5—figure supplement 2 and Supplementary file 5). Similarly to Phmamm-Klf1/2/4, two of three sub-regions of A were active in VDML (Figure 5—figure supplement 2). In this case, a spatial difference of activity was observed: the distal region A1 (2.5 kb) was mainly active in the posterior half of VDML while the proximal region A3 (2.8 kb) was active throughout VDML. This latter region could be reduced to 1.5 kb (A3.1) with maintaining robust activity. However, a shorter fragment A3.3 (0.6 kb) was barely active.

Tox

A short conserved region immediately upstream of Tox, A (1093 bp), was strongly active in VDML but also in tail muscle and mesenchyme (Figure 5D, Figure 5—figure supplement 3 and Supplementary file 5). Activity in muscle/mesenchyme segregated to a proximal fragment, A2 (533 bp), while VDML activity segregated to a distal fragment, A1 (657 bp), which could be reduced to 240 bp (A1.1) with maintaining a robust VDML activity. The small size of these fragments allowed TFBS analysis (Figure 5—figure supplement 3) that suggested a possible regulation by Ascl.b as for Ciinte.Tox (Figure 3—figure supplement 3).

Dlx.c

Based on RNA-seq profiling, we estimated that the predicted gene model for Dlx.c was lacking its first exon located approximately 8 kb upstream (Supplementary file 5). We thus isolated a 5.4 kb region containing 2 blocks of conservation, A, which was strongly active in VDML (Figure 5E and Figure 5—figure supplement 4). While the distal region A1.2 (1.5 kb) containing one block of conservation was weakly active in VDML, the proximal one A3.1 (1205 bp) was robustly active in VDML as was a shorter version A3.2 (1034 bp) (Figure 5—figure supplement 4).

The results of this section indicate strong similarities between C. intestinalis and P. mammillata for caudal PNS formation based on gene expression and gene regulation.

Conserved midline PNS gene regulation in Phlebobranchia

Here, we directly investigated whether the expression of caudal neurogenic midlines genes was regulated similarly between C. intestinalis and P. mammillata. To this end, we performed bi-directional 'enhancer swaps': we tested in P. mammillata embryos the activity of C. intestinalis VDML CRMs and vice versa. Strikingly, all the regions that we have tested were active in VDML in over 40–50% of the embryos in the recipient species (Figure 6A–C and Figure 6—figure supplement 1), the only exception being Phmamm.Nkxtun3-A3.1 that was already not strongly active in P. mammillata (Figure 5—figure supplement 2). Interestingly, conserved activity was not limited to VDML: for example, activity in the anterior palp region for Ciinte.Ascl.b-do1, Ciinte.Klf1/2/4-up1 and Ciinte.Dlx.c-A2 was observed when tested in both C. intestinalis and P. mammillata (Figures 2B, 3A and G, 6Cii, 6Ciii, and 6Cvi).

Figure 6. Conserved VDML activity during enhancer swaps in Phlebobranchia (A).

Representative examples of X-gal stained embryos at tailbud stages following C. intestinalis (B), P. mammillata (C), P. fumigata (D), or A. mentula (E) embryo electroporation with C. intestinalis or P. mammillata CRMs. All shown CRMs for the genes Msx (i), Ascl.b (ii), Klf1/2/4 (iii), Nkxtun3 (iv), Tox (v), and Dlx.c (vi) are active in VDML in their species of origin. The name of the electroporated CRM is indicated on each picture. Details for each experiment can be found in Figure 6—figure supplement 1 and Figure 6—figure supplement 2. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm.

Figure 6.

Figure 6—figure supplement 1. Conserved VDML activity during enhancer swaps between C. intestinalis and P. mammillata.

Figure 6—figure supplement 1.

Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments).
Figure 6—figure supplement 2. C. intestinalis and P. mammillata midline CRMs are active in VDML of P. fumigata and A. mentula embryos.

Figure 6—figure supplement 2.

Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments).

We next extended such functional assays in two other species, Phallusia fumigata and Ascidia mentula, that like C. intestinalis and P. mammillata belong to the ascidian Phlebobranchia order (Figure 6A). A few CRMs (Ciinte.Nkxtun3-up7, Phmamm.Nkxtun3-A1 and Ciinte.Tox-int1) had weak activity, below 30%, in one or the other species. However, most tested regions were active in VDML in over 40–50% of the embryos independently of their origin (i.e. from C. intestinalis or from P. mammillata) (Figure 6D–E and Figure 6—figure supplement 2). As C. intestinalis is more divergent than P. mammillata when taking the recipient species as references, this suggests that gene regulation has remained unchanged despite almost 300 My of divergence time (Figure 6A).

We further investigated CRM activity conservation in Phlebobranchia by isolating the genomic region upstream of Msx for both P. fumigata and A. mentula. Both regions were active in VDML when electroporated in C. intestinalis or P. mammillata embryos (Supplementary file 6).

Testing the transcriptional activity of midline CRMs into four ascidian species strongly supports deep conservation of caudal midline gene expression regulation in Phlebobranchia.

Minimal enhancers are not robust

Above results suggest a strong conservation of the caudal PNS GRN at the level of gene expression, gene regulation and its genomic encoding (CRMs) in Phlebobranchia ascidians. However, potentially homologous CRMs (CRMs from orthologous genes, located at similar positions in their locus and with conserved VDML activity in swap experiments), do not show significant DNA sequence conservation at large distance (i.e. between C. intestinalis and P. mammillata) (Supplementary file 2 and 5). There are previous examples in ascidians that clearly show that such conserved activity relies on conserved upstream regulators but that extensive binding site turnover explains the lack of sequence conservation (Roure et al., 2014; Madgwick et al., 2019; Colgan et al., 2019). We thus reasoned that the CRMs with conserved activity that we have isolated from C. intestinalis and P. mammillata could help in identifying conserved TFBS and shared TF regulating CRM activity. We also postulated that this approach would be effective with CRMs of small size. By applying a strict conservative approach on known putative regulators, we could hypothesize that Msx expression in the VDML from gastrula/neurula stages is directly regulated by Bmp signaling triggered by Admp in the VML, and by Msx itself in the DML (Figure 1—figure supplement 2 and Figure 5—figure supplement 1). Similarly, Tox may be regulated directly by Ascl.b (Figure 3—figure supplement 3 and Figure 5—figure supplement 3.).

We next focused on the proximal Ci-msxb-b6.5 line CRM that regulates early Ciinte.Msx expression in DML precursors (Figure 1A; Roure et al., 2014). We have previously shown that its size could be reduced from 707 bp to 273 bp (Ci-msxb-B) while maintaining a robust activity (around 50% of embryos with DML activity) (Figure 7A). Surprisingly, when the various deletions of Ci-msxb-b6.5 were tested in P. mammillata, they were barely active (<10% for constructs of 400 bp or less) (Figure 7A). Since these different constructs still contained the Otx and Smad-binding sites that are essential for Ci-msxb-B activity in C. intestinalis, we wondered whether Ci-msxb-B was faithfully recapitulating Ciinte.Msx expression. To this end, we challenged Ci-msxb-B with known regulators of Ciinte.Msx expression (Figure 7B). Similarly to endogenous Ciinte.Msx expression, Ci-msxb-B was ectopically activated in posterior (b-line) ectoderm when the Fgf pathway was activated, and it was downregulated when the Nodal pathway was inhibited. However, upon Nodal pathway activation Ci-msxb-B was ectopically activated in posterior (b-line) ectoderm instead of anterior neurectoderm (a6.5 line); and Otx overexpression led to a repression of Ci-msxb-B instead of an activation.

Figure 7. Minimal CRMs are not robust to phylogenetic and genetic challenges.

Figure 7.

(A) Structure of the early DML CRM 'Ci-msxb-b6.5 line' and its derivatives (adapted from Roure et al., 2014), and in vivo transcriptional activity in the embryos of C. intestinalis and P. mammillata (n indicates the total number of embryos examined, N indicates the number of independent experiments). (B) Effects of various overexpressions on Ci-msxb-B activity. The indicated factors (Fgf9/16/20, Lefty, Nodal and Otx) were expressed under the control of the early ectodermal driver pFog. Embryos at gastrula/neurula stages are shown in neural plate view with anterior to the top (n indicates the total number of embryos examined, N indicates the number of independent experiments). Scale bar: 50 μm. (C) Consequences of caudal midline TF overexpression on the activity of VDML CRMs. The effects were summarized as: activation (green), repression (red), no effect (gray), or not done (white). They were compared with results of endogenous gene expression from Roure and Darras, 2016.

We thus wondered whether other short CRMs that we have described here with VDML activity also behaved inappropriately when submitted to a genetic challenge. We thus determined the activity of these CRMs when a midline TF was overexpressed and compared the effects with what we have already described for the endogenous expression of the associated gene (Roure and Darras, 2016). We were very surprised to realize that while some interactions were identical, a large part was different (Figure 7C). Also, two different CRMs for Ciinte.Nkxtun3 behaved differently.

We conclude from this section that short or minimal CRMs should be taken with cautious because they may not be robust to phylogenetic (enhancer swap) or genetic (overexpression) challenges.

Variable levels of divergence in Stolidobranchia

Here we aimed at exploring whether the striking conservation of VDML GRN holds true outside Phlebobranchia. We first determined the expression patterns of several genes of this network during the development of two solitary Stolidobranchia species, Molgula appendiculata and Halocynthia roretzi (Figure 8, Figure 8—figure supplements 13). Both species are estimated to have diverged around 340 My ago, and 390 My ago with Phlebobranchia (Figure 8A; Delsuc et al., 2018). Using transcriptomic data for M. appendiculata and both transcriptomic and genomic data for H. roretzi (Dardaillon et al., 2020), we identified a single ortholog for each gene that we analyzed, except for Ascl.b for which two paralogs were found in M. appendiculata.

Figure 8. Overall conservation of midline gene expression in Stolidobranchia ascidians.

(A) Phylogenetic relationships between C. intestinalis, P. mammillata, M. appendiculata and H. roretzi (estimated divergence times come from Delsuc et al., 2018). (B–M) In situ hybridization for Msx (B, H) at neurula stages, for Ascl.b (C, I), Klf1/2/4 (D, J), Nkxtun3 (E, K), Tox (F, L), and Dlx.c (G, M) at tailbud stages in embryos of M. appendiculata (B–G) and H. roretzi (H–M). Note that all genes except Moappe.Dlx.c are expressed in tail midlines. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bars: 100 μm.

Figure 8.

Figure 8—figure supplement 1. Expression pattern of caudal PNS genes in M. appendiculata.

Figure 8—figure supplement 1.

In situ hybridization results at several developmental stages for the M. appendiculata orthologs of the C. intestinalis caudal PNS genes: Msx, Ascl.b (Ascl.bα and Ascl.bβ), Klf1/2/4, Nkxtun3, Tox, Dlx.c, Dlk, and Celf3.a. Scale bars: 100 μm.
Figure 8—figure supplement 2. Expression pattern of caudal PNS genes in H. roretzi.

Figure 8—figure supplement 2.

In situ hybridization results at several developmental stages for the H. roretzi orthologs of the C. intestinalis caudal PNS genes: Msx, Ascl.b, Klf1/2/4, Nkxtun3, Tox, Dlx.c, Dlk, and Celf3.a. Scale bar: 100 μm.
Figure 8—figure supplement 3. Comparison of midline TFs expression in C. intestinalis, P. mammillata, M. appendiculata, and H. roretzi.

Figure 8—figure supplement 3.

For each gene, the main sites of expression from late neurula stages in C. intestinalis (Roure and Darras, 2016; Imai et al., 2004) are schematized on the left. For the other species, identical pattern is represented as a green circle, expression in part of the tissue as an orange circle, and absence of expression as a red circle. For simplicity, developmental timing has not been described, but it has been taken into account for comparisons. For example, Ciinte.Ascl.b is expressed in the palps forming region from initial tailbud to late tailbud stages, while Moappe.Ascl.bα was detected in the palps only at late tailbud stages. This difference was depicted by an orange circle.

Gene expression in M. appendiculata

For the orthologs of neurogenic midline genes, three had an expression very similar to their counterparts in C. intestinalis or P. mammillata. While Moappe.Msx and Moappe.Klf1/2/4 were expressed throughout the VDML, Moappe.Nkxtun3 expression was restricted to the posterior VDML (Figure 8B, D and E and Figure 8—figure supplement 1). Moappe.Ascl.bβ and Moappe.Tox were also expressed in the VDML but only in the posterior region and as spots (Figure 8C and F and Figure 8—figure supplement 1). By contrast, Moappe.Ascl.bα and Moappe.Dlx.c were not expressed in the VDML (Figure 8G and Figure 8—figure supplement 1). We also examined genes downstream of the neurogenic program, Dlk and Celf3.a (Figure 8—figure supplement 1). While Moappe.Dlk was expressed in mesoderm and central nervous system, we did not detect a specific expression in the VDML. Moappe.Celf3.a was expressed strongly in the central nervous system as in Phlebobranchia, and also in isolated spots on surface ectoderm that likely correspond to ectodermal sensory neurons. In particular, Moappe.Celf3.a was detected in two to five spots of the posterior VDML, suggesting that the number of CESNs in this species is small. Moreover, this caudal restriction of expression is in agreement with the expression of Moappe.Nkxtun3, Moappe.Ascl.bβ, and Moappe.Tox.

Gene expression in H. roretzi

Among the orthologs of neurogenic genes, Harore.Dlx.c pattern had already been described as expressed in the VDML (Wada et al., 1999; Figure 8M and Figure 8—figure supplement 2). The other genes that we examined, Harore.Msx, Harore.Ascl.b, Harore.Klf1/2/4, Harore.Nkxtun3, and Harore.Tox, were also expressed in the VDML (Figure 8H–L and Figure 8—figure supplement 2). Among them, Harore.Ascl.b and Harore.Tox, were not expressed throughout the VDML but rather in discrete regions or spots. CESNs have been described in H. roretzi, and they express Celf3.a (also known as Etr) (Figure 8—figure supplement 2; Ohtsuka et al., 2001a; Ohtsuka et al., 2001b; Yagi and Makabe, 2001). It is also known that their formation is regulated by the Notch pathway and the ligand Delta is expressed as spots in the VDML (Akanuma et al., 2002). We could also detect the expression of another Notch ligand Dlk in the VDML (Figure 8—figure supplement 2).

Overall, comparison of gene expression shows striking conservation in patterns but also in the temporal order despite almost 400 My of divergence between Stolidobranchia and Phlebobranchia. However, the degree of conservation is weaker when considering M. appendiculata.

CRM swap

We next wondered whether conservation could extend to CRM activity. We thus tested various CRMs isolated from C. intestinalis and P. mammillata in M. appendiculata that is amenable to electroporation. A first qualitative analysis revealed that CRMs for several midline genes were found active in M. appendiculata VDML: Msx (Figure 9A), Ascl.b (Figure 9B), Nkxtun3 (Figure 9D) and Dlx.c (Figure 9F). In other cases, CRMs were active but not in VDML: anterior ectoderm and central nervous system for Ciinte.Klf1/2/4-up1 (Figure 9C), or tail muscle for Phmamm.Tox-A (Figure 9E). However, when the strength of the activity was considered (measured indirectly as the percentage of stained embryo in VDML), the situation appeared quite different. VDML activity was seldom observed for CRMs from Msx and Ascl.b (Figure 9—figure supplement 1). These were only large CRMs for Nkxtun3 and Dlx.c that were robustly active in M. appendiculata VDML. The overall low levels of activity for the tested CRMs could be explained by a non-optimal electroporation procedure. While this cannot completely be ruled out, electroporation of the regulatory sequences for the ubiquitous gene Cirobu.Ef1α (Feinberg et al., 2019) led to strong expression of the LacZ reporter gene (89% of stained embryos, n = 74 in two independent experiments) and suggested a rather efficient procedure. Interestingly, for both Nkxtun3 and Dlx.c, CRMs from both C. intestinalis and P. mammillata were active in Molgula VDML, supporting the specificity of the results. This further suggests that within the 6-genes network that we have explored, only these two genes have retained conserved regulation.

Figure 9. Divergence in gene regulation in Stolidobranchia ascidians.

(A–F) Activity of various CRMs from C. intestinalis and P. mammillata in the embryos of M. appendiculata. The name of the electroporated CRM is indicated on each picture. Details for each experiment can be found in Figure 9—figure supplement 1. (G–L) Activity of various genomic regions from H. roretzi in the embryos of M. appendiculata (G, I and K) and P. mammillata (H, J and L). Details for each experiment can be found in Figure 9—figure supplement 2. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bars: 100 μm.

Figure 9.

Figure 9—figure supplement 1. Activity of VDML CRMs from C. intestinalis and P. mammillata in the embryos of M. appendiculata.

Figure 9—figure supplement 1.

Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments).
Figure 9—figure supplement 2. Activity of genomic regions from H. roretzi in the embryos of P. mammillata and M. appendiculata.

Figure 9—figure supplement 2.

Snapshots of the Harore.Msx, Harore.Nkxtun3 and Harore.Tox loci depicting tested genomic regions, transcript models and conservation between H. roretzi and H. aurantium, M. oculata and Botryllus schlosseri (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments).

To further explore whether the global lack of conservation that we have observed in M. appendiculata could be a generality in Stolidobranchia, we have isolated genomic upstream regions for Msx, Nkxtun3, and Tox from H. roretzi (Figure 9—figure supplement 2 and Supplementary file 7). The three regions were poorly active in M. appendiculata embryos except Harore.Nkxtun3-B that was active in anterior sensory vesicle in up to 29% of the embryos (Figure 9I). Only Harore.Msx-A had a striking VDML pattern but in a minority of embryos (Figure 9G). These results could simply be that these regions do not behave as VDML CRMs since we have not determined their activity in H. roretzi. However, in P. mammillata, while Harore.Nkxtun3-B was weakly active in VDML with additional staining in lateral tail epidermis (Figure 9J and Figure 9—figure supplement 2), both Harore.Msx-A and Harore.Tox-A were specifically and strongly active in VDML (Figure 9H and L and Figure 9—figure supplement 2).

Discussion

Regulation of midline gene expression in C. intestinalis

We have reported the identification of midline CRMs for six out of the seven genes that we have tackled. Our approach was simply based on isolating conserved regions (with C. savignyi) close to the gene of interest. Similarly to numerous reports, active CRMs were found mostly within the first few kilobases immediately upstream of the gene (a single region was located downstream). Interestingly, the largest CRMs that we have tested contain activity for several expression domains where the gene is expressed and do not usually display ectopic activity. We further reduced the size of these CRMs through classical 'promoter bashing' guided by sequence conservation. Activity in different domains usually partitioned during this deletion process while we have not documented this in detail (for example for Dlx.c, palp expression in the large region Ciinte.Dlx.c-A2 is lost from the small region Ciinte.Dlx.c-upstream; Figure 3G,H). Interestingly, the conserved active CRMs that we have identified systematically matched with open chromatin as determined by enrichment in ATAC-seq (Figures 1 and 2, Figure 3—figure supplements 14, and Supplementary file 2; Madgwick et al., 2019). Tissue- and stage-specific ATAC-seq data should certainly facilitate active CRM discovery and allow exhaustive identification of CRMs regulating a given expression pattern.

Although our approach was not exhaustive, important conclusions could be drawn. First, the seemingly continuous expression of Msx in the b6.5 lineage and its derivative, the DML, is actually under the control of two separate CRM: an early proximal element active from the 64-cell stage (Roure et al., 2014) and a late distal element active from gastrula stages (Figure 1). TFBS analysis using our current understanding of Msx regulation (Figure 1—figure supplement 1) supports a model of maintenance of gene expression by autoregulation whereby Msx protein produced through the early proximal CRM activates late Msx expression via the distal CRM (Figure 1—figure supplement 2). Second, we could classify the genes in two classes: Msx and Ascl.b with uncoupling of regulation in dorsal and ventral midlines (Figures 1 and 2); and Klf1/2/4, Nkxtun3, Tox and Dlx.c regulated in a 'pan-midline' manner (Figure 3—figure supplements 14). These observations are in agreement with previous predictions based on regulatory interactions (Pasini et al., 2006; Roure and Darras, 2016; Feinberg et al., 2019; Waki et al., 2015): Msx and Ascl.b are upstream regulators that integrate inputs from the dorsal and ventral midline inducers and regulate the expression of the other four downstream factors. Surprisingly, we did not identify VML-specific CRMs. While the reason is still unclear, a possible explanation would be that there is a physical overlap between VML and DML CRMs with the DML being embedded in the VML CRM. Prediction of TFBS for putative trans acting factors in the Msx VDML minimal CRM (Ciinte.Msx-up10) supports this hypothesis: binding sites for dorsal and ventral factors are intermingled or overlapping (Figure 1—figure supplement 2). Targeted mutagenesis of these putative binding sites should enable testing such an hypothesis. Third, analysis of conserved TFBS in small active CRMs provides interesting hypotheses that will need to be tested experimentally using finer and possibly quantitative approaches. Although both Msx and Ascl.b are likely to integrate initial inducing cues, they do not seem to be regulated similarly: Msx is possibly a direct Bmp signaling target in the VML like vertebrate Msx in some tissues (Esteves et al., 2014; Brugger et al., 2004), but Ascl.b is not (Figure 1—figure supplement 2 and Figure 2—figure supplement 1). Klf1/2/4 and Nkxtun3 are possibly directly activated by Msx, and Tox by Ascl.b. Finally, it is most likely that additional TFs remain to be identified to understand gene regulation in the VDML since the activity of several CRMs and deletions cannot be explained using the TFs that we currently know.

High conservation of PNS formation in Phlebobranchia

Expression patterns, overexpression experiments and CRMs identification in P. mammillata showed high similarities with C. intestinalis (Figures 4, 5 and 10). This was further demonstrated by CRM swap experiments (Figures 6 and 10) where midline CRMs from C. intestinalis were active in P. mammillata midlines and vice versa, and also in two additional Phlebobranchia species (Figure 6).

Figure 10. Summary of comparative results obtained in the present study.

Figure 10.

For each gene, results of in situ hybridization and enhancer swaps are summarized. VDML expression (in situ hybridization) or activity (transcriptional assay) is represented as a green circle, expression in part of the VDML as an orange circle, and lack of expression or activity as a red circle. Gray circle: not done.

These observations suggest that orthologous genes are regulated similarly and that CRMs are controlled using the same regulatory logic. Since non-coding DNA does not show sequence conservation, for example between C. intestinalis and P. mammillata CRMs that we identified (Supplementary file 2 and 5), it is difficult to define homologous CRMs. This is likely explained by extensive turnover of the binding sites for upstream transcription factors, a situation that we have previously described for the regulation of the early Msx CRMs by Otx and Nodal (Roure et al., 2014). The newly identified Msx CRM from A. mentula actually contains a region dense in Otx and Smad binding sites like their counterparts in Ciona and Phallusia (not shown). This phenomenon has been described for other CRMs between C. intestinalis and other species and outside ascidians (Buffry et al., 2016; Colgan et al., 2019; Oda-Ishii et al., 2005).

Given the modular nature of cis-regulation and the increased recognition of the existence of 'redundant', 'shadow' or distributed enhancers (Barolo, 2012; Cannavò et al., 2016), namely multiple CRMs with seemingly similar activity for a given gene, caution has to be taken for pairwise comparisons in TFBS composition of genomic pieces originating from different species. We could nevertheless apply the above approach for two other CRMs. In both Ciona and Phallusia, the late Msx CRMs with VDML activity are distal and share binding sites for Msx and Bmp regulation (SBE and BRE) (Figure 1—figure supplement 2 and Figure 5—figure supplement 1). CRMs for Tox are localized upstream, close to the transcription start in both species (at least for one of the two isoforms in C. intestinalis), and are characterized by an enrichment in E-box sites, possibly mediating activation by Ascl.b (Figure 3—figure supplement 3 and Figure 5—figure supplement 3). These observations are in agreement with conserved regulation at the base of Phlebobranchia and we propose that the above CRMs are orthologous.

We thus aimed at using such a strategy using candidate orthologous minimal midline CRMs to define shared motifs and identify trans-acting factors. However, the 'swap' and overexpression experiments described in Figure 7 refrained us from pursuing since such CRMs were not robust to phylogenetic and genetic perturbations. Whether these observations are general to ascidian minimal CRMs remains to be explored. A possible interpretation could be linked to the modular nature of cis-regulation. The largest CRMs that we tested are likely to contain most, if not all, the regulatory information for VDML expression. By contrast, shorter elements contain only a subset of the regulatory information, and would consequently not behave identically to the gene following genetic challenge. The fact, that such short elements may not be active when placed in another species while larger regions are, suggests that drift has occurred at the level of individual modules making them silent. Independently of our current understanding of this phenomenom, reciprocal in vivo transcriptional assays between Ciona and Phallusia offer a direct way to identify and determine the properties of the DNA elements that confer robustness.

Global conservation of PNS formation in ascidians?

In situ hybridization in two members of the Stolidobranchia revealed strong conservation in gene expression. However, while the patterns were virtually identical between Ciona and Phallusia, we could detect several differences (Figures 8 and 10). First, in both Molgula and Halocynthia, the genes Ascl.b and Tox were not expressed throughout the VDML but had a spotted expression. Such an expression pattern is correlated with the number and location of future PNS neurons. As shown in Figure 8—figure supplements 12 and already known for Halocynthia, presumptive neurons labeled by Celf3.a expression are present in a very small number and only in the posterior tail, at least on the ventral side, when compared to Ciona (Pasini et al., 2006; Ohtsuka et al., 2001b; Yagi and Makabe, 2001; Akanuma et al., 2002). This would suggest a variation at the level of the Stolidobranchia family and that the function of Ascl.b and Tox would possibly be different from the other 'pan-midline' transcription factors and from their orthologs in Phlebrobranchia. Interestingly, in Phlebobranchia, we have seen above a possible direct regulation of Tox by Ascl.b, suggesting that PNS formation in ascidians could be controlled by the following VDML sub-networks: Ascl.b and Tox on one side, and Msx, Klf1/2/4, Nkxtun3, and Dlx.c on the other side. Second, in Molgula, Nkxtun3 expression was restricted to the posterior VDML. Third, Moappe.Dlx.c was not expressed in the VDML, while palp expression was conserved (Figure 8—figure supplement 3). These two latter observations suggest a genus- or species-specific variation.

When focusing on gene regulation, we detected again stronger conservation within Phlebobranchia than between Phlebobranchia and Stolidobranchia (Figure 10). This is particularly true when considering large CRMs of several kilobases that have better chance of capturing the widest regulatory information. This difference between ascidian orders could be the result of increased drift because of larger evolutionary distance and/or because of discrete changes linked to taxonomical level change. We were nevertheless surprised to see that CRMs from Halocynthia were active in midlines when tested in Phallusia but not in Molgula (Figures 9 and 10). Although we have obtained these results on a limited number of CRMs and while we have not tested CRMs from M. appendiculata whose genome has not been sequenced, we believe that, taking expression patterns into account, M. appendiculata is the divergent species among all species that we have used (Figure 10). Our hypothesis is in agreement with previous CRM swap experiments between Ciona and other Molgula species (Stolfi et al., 2014), with the facts that Molgula is a fast evolving genus within ascidians (Delsuc et al., 2018; Alié et al., 2018) and that Molgulids is a family where loss of chordate body plan has occurred several times independently (tail-less larvae) (Huber et al., 2000).

Surprisingly, we did not detect obvious correlations between points of conservation in gene regulation and gene expression or GRN structure. Nkxtun3 and Dlx.c are the two genes whose CRMs are active in Molgula midlines (Figure 9 and Figure 9—figure supplement 1). They are not the genes with the best conserved expression; and while Dlx.c is in a downstream position of the network, Nkxtun3 has an intermediate position (Roure and Darras, 2016). Interestingly, in M. appendiculata Nkxtun3 is found only in posterior VDML and Dlx.c is not expressed in the VDML, but CRMs from both Ciona and Phallusia were active throughout M. appendiculata VDML. Consequently, it is likely that upstream factors are likely expressed throughout the VDML and that changes in the cis-regulatory landscapes have occurred for both genes in M. appendiculata. Similarly, a CRM from Halocynthia Tox (expressed in a spotted manner) was active throughout the VDML in Phallusia.

Given the phylogenetic positions of the species (Figure 8A) and the divergent behavior of Molgula (see above), a parsimonious analysis of our results (summarized in Figure 10) implies that PNS gene expression and regulation are globally conserved in ascidians despite extensive genomic divergence in the course of almost 400 My of evolution. This is a similar time window that separates mammals from teleosts (Delsuc et al., 2018), and several developmental processes including CRM activity have been shown conserved between mouse and zebrafish for example, or even between more divergent vertebrate species (zebrafish and lamprey) (Hockman et al., 2019; Fisher et al., 2006; Gehrke et al., 2015; McGaughey et al., 2008). While some of the cited examples refer to CRM conserved activity despite lack of sequence conservation, in vertebrates CRMs are usually associated with sequence conservation. In ascidians, the situation is the opposite with an absence of non-coding DNA conservation and lack of synteny.

We would like to conclude by stressing two operational points for comparative studies in ascidians that may be relevant more broadly. The CRMs are interesting tools to probe gene regulation especially during swap experiments in species with efficient transgenesis. However, the lack of fidelity of small CRMs that we have uncovered may blur comparative conclusions. Finally, the choice of species for comparisons is also essential. While divergent species are highly informative to understand evolution at work, they may lead to inaccurate conclusions at a broader level. Although the established model species C. intestinalis and H. roretzi appear to be relevant species to instruct the status of ancestral ascidians, we would advocate for broad taxonomic sampling in functional comparative studies.

Materials and methods

Embryo obtention and manipulation

Adults from C. intestinalis (formerly referred to Ciona intestinalis type B Brunetti et al., 2015) and Ascidia mentula were provided by the Centre de Ressources Biologiques Marines in Roscoff (EMBRC-France). Phallusia fumigata and Molgula appendiculata were provided by the Centre de Ressources Biologiques Marines in Banyuls-sur-mer (EMBRC-France). Phallusia mammillata were collected during trawling by professional fishermen in the Banyuls-sur-mer area. Gametes were collected from the gonoducts, except for M. appendiculata where gonads dissociation released sperm and immature eggs that spontaneously matured in sea water within 20–30 min. Egg dechorionation was performed before fertilization for P. fumigata and A. mentula as previously described for P. mammillata (Roure et al., 2014), and after fertilization for M. appendiculata as described for C. intestinalis (Mita-Miyazawa et al., 1985). Plasmid DNA (50 μg) was electroporated as described previously (Feinberg et al., 2019) through a single 32 ms electrical pulse with the following voltage: 20V for M. appendiculata; 25V for C. intestinalis; and 37V for P. mammillata, P. fumigata, and A. mentula.

P. mammillata embryos were treated with 150 ng/ml recombinant zebrafish Bmp4 protein (1128-BM, R&D Systems Inc, stock solution in HCl 4 mM + BSA 0,1%) complemented with 0.1% BSA from the eight-cell stage, with 20 μM of the Bmp receptor inhibitor Dorsomorphin (S7306, Euromedex, stock solution in water) from early gastrula stages or with 25 μM of the γ-secretase inhibitor DAPT (D5942, Sigma-Aldrich, stock solution in DMSO) from early neurula stages. Control embryos were grown in seawater containing 0.1% BSA and 0.5% DMSO. Whole mount in situ hybridization were performed as described (Feinberg et al., 2019) with Dig-labeled probes synthesized from clones described in Supplementary file 8. These clones were selected from EST libraries or PCR-amplified based on genomic or/and transcriptomic data (Supplementary file 9; Dardaillon et al., 2020; Kawashima et al., 2000; Kawashima et al., 2002). Reference transcriptome for M. appendiculata was generated from RNA-seq data on mixed embryonic stages and will be described elsewhere. Ten to 15 control embros per stage were examined for normal expression patterns, whereas 40–60 embryos were analyzed in the case of perturbations.

In vivo transcriptional assay

Genomic regions were PCR amplified and placed upstream of the minimal promoter of the Ciinte.Fog gene (KH2012:KH.C10.574) (except for three regions directly abutting the transcribed region of the gene: Ciinte.Nkxtun3-upstream, Ciinte.Ascl.b-upstream, and Ciinte.Bhlhtun1-upstream) and the LacZ reporter gene using the Gateway technology (Invitrogen) as described previously (Roure et al., 2007). Details for newly isolated genomic regions, primers, templates, and vectors are available in Supplementary file 1. Details for already described regions (Figure 1 and Figure 5—figure supplement 1) can be found in Roure et al., 2014. In the case of Msx in A. mentula, for which no whole genome sequence is available, we reasoned that local synteny would be conserved given the phylogenetic relationships between Ascidia, Phallusia, and Ciona genera (Figure 6A). Since the gene upstream of Msx is orthologous in sequenced Ciona and Phallusia species (KH2012:KH.C2.808 in C. intestinalis; Phmamm.g00005894 in P. mammillata), we amplified the Msx upstream intergenic region using degenerate primers defined in the coding sequences of Msx and its upstream neighbor (Fwd: TTYGAYAARTAYCARTTYGA; Rev: TTYTCYTGRAAYTTRTTYTC). Sequencing of this amplicon led to designing new primers for PCR amplification and cloning (Supplementary file 1). The sequence of the amplified region, Asment.Msx-up, can be found in Supplementary file 9.

Following electroporation and development, embryos were fixed at the desired stage and stained for β-galactosidase activity using X-gal (Roure et al., 2014). For each experiment, the number of embryos with β-galactosidase-positive cells in VDML was scored (a total of 30–300 embryos were counted per experiment). The number of experiments performed and the number of embryos examined is described in the text and figures. Graphs in the figures represent average values, with error bars denoting the standard deviation.

In the case of concomitant overexpression of a TF in C. intestinalis (Figure 7), at least two independent experiments have been performed, but without scoring. Constructs for TF overexpression have been previously described (Roure and Darras, 2016).

Gene model identifiers

The genes described in this study were named according to the nomenclature in the tunicate community (Stolfi et al., 2015) and this may differ from previous publications. Homologous genes were identified using blast against genome/transcriptome and pre-computed orthologies when available from the Aniseed database (Dardaillon et al., 2020). Gene identifiers come from the following assemblies: C. robusta KH2012, P. mammillata MTP2014, P. fumigata MTP2014, and H. roretzi MTP2014 (Supplementary file 1). In some cases, gene or transcript models were absent or incorrect; we tentatively built from RNA-seq and ESTs transcript models that are available in Supplementary file 9.

Identification of putative TFBS

We first generated a tentative GRN for midline gene transcriptional regulation (Figure 1—figure supplement 1A) by incorporating known gene function and interactions (Roure et al., 2014; Pasini et al., 2006; Roure and Darras, 2016; Waki et al., 2015; Joyce Tang et al., 2013; Bertrand et al., 2003) and by formulating the following hypotheses: dorsal expression of the earliest genes, Msx and Ascl.b, may be regulated by the genes expressed earlier in DML precursors, namely Msx itself and Dlk (Notch ligand); ventral expression of the earliest genes, Msx and Ascl.b, may be regulated directly by Bmp signaling or by Bmp-activated genes/VML genes, namely Tbx2/3, Nkxtun1, Nkx2-3/5/6 and Irx.c (Waki et al., 2015; Imai et al., 2004); and midline expression of the other genes may be regulated by all midline factors expressed before. We also considered autoregulation. For each factor or pathway (except Tox that is thought to bind DNA in a sequence-independent manner [O'Flaherty and Kaye, 2003]), we assigned consensus or specificity matrices obtained for the C. intestinalis factor or orthologous gene(s) from human, mouse, or fly (identified from the Aniseed database). Matrices were retrieved from Aniseed, Jaspar, and CIS-BP databases (Dardaillon et al., 2020; Nitta et al., 2019; Weirauch et al., 2014; Fornes et al., 2020; Yao et al., 2006) and are displayed in Figure 1—figure supplement 1B. For CRM shorter than 1 kb, sequences and their counterparts from the sister species (C. robusta and C. savignyi; P. mammillata and P. fumigata) were retrieved from Aniseed and scanned using FIMO (http://meme-suite.org/tools/fimo) with a match p-value threshold of 0.001 (Grant et al., 2011). TFBS were considered conserved when they were found close to each other (within approximately 60 bp) in the aligned sequences (alignment performed using zPicture: https://zpicture.dcode.org/Ovcharenko et al., 2004). Only conserved sites were further considered in the analysis.

Acknowledgements

We thank EMBRC-France (Banyuls-sur-mer and Roscoff marine stations) and G Diaz (Port-Vendres) for providing animals. We are grateful to C Labrune, JM Amouroux and F Monniot for help in identifying M. appendiculata. Many thanks to the Phallusia genome consortium (H Yasuo, A McDougall and P Lemaire) for access to various resources (genomes, transcriptomes and cDNA clones).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Sébastien Darras, Email: sebastien.darras@obs-banyuls.fr.

Shigehiro Kuraku, RIKEN Center for Biosystems Dynamics Research, Japan.

Marianne E Bronner, California Institute of Technology, United States.

Funding Information

This paper was supported by the following grants:

  • Agence Nationale de la Recherche ANR-11-JSV2-007 to Sébastien Darras.

  • Agence Nationale de la Recherche ANR-17-CE13-0027 to Sébastien Darras.

  • Fondation des Treilles to Joshua F Coulcher.

  • Centre National de la Recherche Scientifique DBM2020 to Sébastien Darras.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Validation, Investigation, Methodology.

Conceptualization, Resources, Data curation, Validation, Investigation, Methodology.

Conceptualization, Investigation, Methodology.

Conceptualization, Investigation.

Conceptualization, Investigation.

Investigation.

Investigation.

Resources.

Conceptualization, Data curation, Supervision, Funding acquisition, Investigation, Methodology, Writing - original draft, Project administration.

Additional files

Supplementary file 1. List of all genomic regions tested in the present study.
elife-59157-supp1.xlsx (39KB, xlsx)
Supplementary file 2. Genome browser view for each locus of the seven caudal PNS neurogenic TFs in Ciona robusta.

Tested CRMs were added to the data extracted from the Aniseed website (https://www.aniseed.cnrs.fr/; Dardaillon et al., 2020).

elife-59157-supp2.pdf (394.8KB, pdf)
Supplementary file 3. Activity of various genomic regions for the gene Ciinte.Bhlhtun1.

(Top panel) Snapshot of the Ciinte.Bhlhtun1 locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). (Middle panel) Representative examples of X-gal-stained embryos at tailbud stages following electroporation of Ciinte.Bhlhtun1-upstream, Ciinte.Bhlhtun1-up1 and Ciinte.Bhlhtun1-down1. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 100 μm. (Bottom panel) Schematic representation of the various constructs and their activity at tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined, N indicates the number of independent experiments). Note that while VDML activity is rare, activity can be detected at other sites of endogenous Ciinte.Bhlhtun1 expression: anterior epidermis around the palps for Ciinte.Bhlhtun1-up1, and notochord and stomodeum for Ciinte.Bhlhtun1-down1.

elife-59157-supp3.pdf (211.3KB, pdf)
Supplementary file 4. Activity of various genomic regions for the genes Phmamm.Ascl.b and Phmamm.Bhlhtun1.

Snapshots of the Phmamm.Ascl.b and Phmamm.Bhlhtun1 loci depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between P. mammillata and P. fumigata (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Representative examples of X-gal stained embryos at tailbud stages following electroporation of Phmamm.Ascl.b-A (no activity), Phmamm.Ascl.b-B (activity in palps and anterior nervous system), and Phmamm.Bhlhtun1-A (activity in notochord, endodermal strand and tail tip) into P. mammillata embryos. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm. Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments). Note that while VDML activity is rather robust for Phmamm.Bhlhtun1-A, it was not considered further since this activity was restricted to the very posterior cells of the midlines at the tail tip.

elife-59157-supp4.pdf (258.2KB, pdf)
Supplementary file 5. Genome browser view for each locus of the seven caudal PNS neurogenic TFs in Phallusia mammillata.

Tested CRMs and predicted cDNAs were added to the data extracted from the Aniseed website (https://www.aniseed.cnrs.fr/; Dardaillon et al., 2020).

elife-59157-supp5.pdf (374.3KB, pdf)
Supplementary file 6. Identification of VDML CRMs for Phfumi.Msx and Asment.Msx genes.

(Top) Snapshot of the Phfumi.Msx locus. (Middle) Activity of Phfumi.Msx and Asment.Msx CRMs at tailbud stages in VDML (blue) of C. intestinalis and P. mammillata embryos (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Representative examples of X-gal staining at tailbud stages (embryos in lateral view with dorsal to the top and anterior to the left, scale bar: 50 μm).

elife-59157-supp6.pdf (210.1KB, pdf)
Supplementary file 7. Genome browser view for each locus for three caudal PNS neurogenic TFs in Halocynthia roretzi.

Tested CRMs were added to the data extracted from the Aniseed website (https://www.aniseed.cnrs.fr/; Dardaillon et al., 2020).

elife-59157-supp7.pdf (172.2KB, pdf)
Supplementary file 8. List of DNA clones used for in situ hybridization.
elife-59157-supp8.xlsx (55.5KB, xlsx)
Supplementary file 9. Various sequences.

Predicted cDNA sequences for genes in C. intestinalis, P. mammillata and M. appendiculata: transcripts models from RNA-seq data and ESTs sequences were used to build the cDNA sequences. Open reading frame is highlighted in bold. Sequence of the genomic region Asment.Msx-up isolated from A. mentula.

elife-59157-supp9.docx (181.7KB, docx)
Transparent reporting form

Data availability

All data generated or analyzed during this study are included in the manuscript and supporting files.

References

  1. Akanuma T, Hori S, Darras S, Nishida H. Notch signaling is involved in nervous system formation in ascidian embryos. Development Genes and Evolution. 2002;212:459–472. doi: 10.1007/s00427-002-0264-x. [DOI] [PubMed] [Google Scholar]
  2. Alié A, Hiebert LS, Simion P, Scelzo M, Prünster MM, Lotito S, Delsuc F, Douzery EJP, Dantec C, Lemaire P, Darras S, Kawamura K, Brown FD, Tiozzo S. Convergent acquisition of nonembryonic development in styelid ascidians. Molecular Biology and Evolution. 2018;35:1728–1743. doi: 10.1093/molbev/msy068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arnold CD, Gerlach D, Spies D, Matts JA, Sytnikova YA, Pagani M, Lau NC, Stark A. Quantitative genome-wide enhancer activity maps for five Drosophila species show functional enhancer conservation and turnover during cis-regulatory evolution. Nature Genetics. 2014;46:685–692. doi: 10.1038/ng.3009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barolo S. Shadow enhancers: frequently asked questions about distributed cis-regulatory information and enhancer redundancy. BioEssays. 2012;34:135–141. doi: 10.1002/bies.201100121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bertrand V, Hudson C, Caillol D, Popovici C, Lemaire P. Neural tissue in ascidian embryos is induced by FGF9/16/20, acting via a combination of maternal GATA and ets transcription factors. Cell. 2003;115:615–627. doi: 10.1016/S0092-8674(03)00928-0. [DOI] [PubMed] [Google Scholar]
  6. Bradley RK, Li XY, Trapnell C, Davidson S, Pachter L, Chu HC, Tonkin LA, Biggin MD, Eisen MB. Binding site turnover produces pervasive quantitative changes in transcription factor binding between closely related Drosophila species. PLOS Biology. 2010;8:e1000343. doi: 10.1371/journal.pbio.1000343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brown CD, Johnson DS, Sidow A. Functional architecture and evolution of transcriptional elements that drive gene coexpression. Science. 2007;317:1557–1560. doi: 10.1126/science.1145893. [DOI] [PubMed] [Google Scholar]
  8. Brozovic M, Dantec C, Dardaillon J, Dauga D, Faure E, Gineste M. ANISEED 2017: extending the integrated ascidian database to the exploration and evolutionary comparison of genome-scale datasets. Nucleic Acids Research. 2017;15:D718–D725. doi: 10.1093/nar/gkx1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Brudno M, Do CB, Cooper GM, Kim MF, Davydov E, Green ED, Sidow A, Batzoglou S, NISC Comparative Sequencing Program LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Research. 2003;13:721–731. doi: 10.1101/gr.926603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Brugger SM, Merrill AE, Torres-Vazquez J, Wu N, Ting MC, Cho JY, Dobias SL, Yi SE, Lyons K, Bell JR, Arora K, Warrior R, Maxson R. A phylogenetically conserved cis-regulatory module in the Msx2 promoter is sufficient for BMP-dependent transcription in murine and Drosophila embryos. Development. 2004;131:5153–5165. doi: 10.1242/dev.01390. [DOI] [PubMed] [Google Scholar]
  11. Brunetti R, Gissi C, Pennati R, Caicci F, Gasparini F, Manni L. Morphological evidence that the molecularly determined Ciona intestinalis type A and type B are different species: Ciona robusta and Ciona intestinalis. Journal of Zoological Systematics and Evolutionary Research. 2015;53:186–193. doi: 10.1111/jzs.12101. [DOI] [Google Scholar]
  12. Buffry AD, Mendes CC, McGregor AP. The functionality and evolution of eukaryotic transcriptional enhancers. Advances in Genetics. 2016;96:143–206. doi: 10.1016/bs.adgen.2016.08.004. [DOI] [PubMed] [Google Scholar]
  13. Candiani S, Pennati R, Oliveri D, Locascio A, Branno M, Castagnola P, Pestarino M, De Bernardi F. Ci-POU-IV expression identifies PNS neurons in embryos and larvae of the ascidian Ciona intestinalis. Development Genes and Evolution. 2005;215:41–45. doi: 10.1007/s00427-004-0444-y. [DOI] [PubMed] [Google Scholar]
  14. Cannavò E, Khoueiry P, Garfield DA, Geeleher P, Zichner T, Gustafson EH, Ciglar L, Korbel JO, Furlong EE. Shadow enhancers are pervasive features of developmental regulatory networks. Current Biology. 2016;26:38–51. doi: 10.1016/j.cub.2015.11.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chen JS, Pedro MS, Zeller RW. miR-124 function during Ciona intestinalis neuronal development includes extensive interaction with the notch signaling pathway. Development. 2011;138:4943–4953. doi: 10.1242/dev.068049. [DOI] [PubMed] [Google Scholar]
  16. Colgan W, Leanza A, Hwang A, DeBiasse MB, Llosa I, Rodrigues D, Adhikari H, Barreto Corona G, Bock S, Carillo-Perez A, Currie M, Darkoa-Larbi S, Dellal D, Gutow H, Hokama P, Kibby E, Linhart N, Moody S, Naganuma A, Nguyen D, Stanton R, Stark S, Tumey C, Velleca A, Ryan JF, Davidson B. Variable levels of drift in tunicate cardiopharyngeal gene regulatory elements. EvoDevo. 2019;10:24. doi: 10.1186/s13227-019-0137-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dardaillon J, Dauga D, Simion P, Faure E, Onuma TA, DeBiasse MB, Louis A, Nitta KR, Naville M, Besnardeau L, Reeves W, Wang K, Fagotto M, Guéroult-Bellone M, Fujiwara S, Dumollard R, Veeman M, Volff JN, Roest Crollius H, Douzery E, Ryan JF, Davidson B, Nishida H, Dantec C, Lemaire P. ANISEED 2019: 4d exploration of genetic data for an extended range of tunicates. Nucleic Acids Research. 2020;48:D668–D675. doi: 10.1093/nar/gkz955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Davidson EH, Erwin DH. Gene regulatory networks and the evolution of animal body plans. Science. 2006;311:796–800. doi: 10.1126/science.1113832. [DOI] [PubMed] [Google Scholar]
  19. Dehal P, Satou Y, Campbell RK, Chapman J, Degnan B, De Tomaso A, Davidson B, Di Gregorio A, Gelpke M, Goodstein DM, Harafuji N, Hastings KE, Ho I, Hotta K, Huang W, Kawashima T, Lemaire P, Martinez D, Meinertzhagen IA, Necula S, Nonaka M, Putnam N, Rash S, Saiga H, Satake M, Terry A, Yamada L, Wang HG, Awazu S, Azumi K, Boore J, Branno M, Chin-Bow S, DeSantis R, Doyle S, Francino P, Keys DN, Haga S, Hayashi H, Hino K, Imai KS, Inaba K, Kano S, Kobayashi K, Kobayashi M, Lee BI, Makabe KW, Manohar C, Matassi G, Medina M, Mochizuki Y, Mount S, Morishita T, Miura S, Nakayama A, Nishizaka S, Nomoto H, Ohta F, Oishi K, Rigoutsos I, Sano M, Sasaki A, Sasakura Y, Shoguchi E, Shin-i T, Spagnuolo A, Stainier D, Suzuki MM, Tassy O, Takatori N, Tokuoka M, Yagi K, Yoshizaki F, Wada S, Zhang C, Hyatt PD, Larimer F, Detter C, Doggett N, Glavina T, Hawkins T, Richardson P, Lucas S, Kohara Y, Levine M, Satoh N, Rokhsar DS. The draft genome of Ciona intestinalis: insights into chordate and vertebrate origins. Science. 2002;298:2157–2167. doi: 10.1126/science.1080049. [DOI] [PubMed] [Google Scholar]
  20. Delsuc F, Philippe H, Tsagkogeorga G, Simion P, Tilak MK, Turon X, López-Legentil S, Piette J, Lemaire P, Douzery EJP. A phylogenomic framework and timescale for comparative studies of tunicates. BMC Biology. 2018;16:39. doi: 10.1186/s12915-018-0499-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Esteves FF, Springhorn A, Kague E, Taylor E, Pyrowolakis G, Fisher S, Bier E. BMPs regulate msx gene expression in the dorsal neuroectoderm of Drosophila and vertebrates by distinct mechanisms. PLOS Genetics. 2014;10:e1004625. doi: 10.1371/journal.pgen.1004625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Feinberg S, Roure A, Piron J, Darras S. Antero-posterior ectoderm patterning by canonical wnt signaling during ascidian development. PLOS Genetics. 2019;15:e1008054. doi: 10.1371/journal.pgen.1008054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fisher S, Grice EA, Vinton RM, Bessling SL, McCallion AS. Conservation of RET regulatory function from human to zebrafish without sequence similarity. Science. 2006;312:276–279. doi: 10.1126/science.1124070. [DOI] [PubMed] [Google Scholar]
  24. Fornes O, Castro-Mondragon JA, Khan A, van der Lee R, Zhang X, Richmond PA, Modi BP, Correard S, Gheorghe M, Baranašić D, Santana-Garcia W, Tan G, Chèneby J, Ballester B, Parcy F, Sandelin A, Lenhard B, Wasserman WW, Mathelier A. JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Research. 2020;48:D87–D92. doi: 10.1093/nar/gkz1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Gehrke AR, Schneider I, de la Calle-Mustienes E, Tena JJ, Gomez-Marin C, Chandran M, Nakamura T, Braasch I, Postlethwait JH, Gómez-Skarmeta JL, Shubin NH. Deep conservation of wrist and digit enhancers in fish. PNAS. 2015;112:803–808. doi: 10.1073/pnas.1420208112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Grant CE, Bailey TL, Noble WS. FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011;27:1017–1018. doi: 10.1093/bioinformatics/btr064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Haag ES. The same but different: worms reveal the pervasiveness of developmental system drift. PLOS Genetics. 2014;10:e1004150. doi: 10.1371/journal.pgen.1004150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hockman D, Chong-Morrison V, Green SA, Gavriouchkina D, Candido-Ferreira I, Ling ITC, Williams RM, Amemiya CT, Smith JJ, Bronner ME, Sauka-Spengler T. A genome-wide assessment of the ancestral neural crest gene regulatory network. Nature Communications. 2019;10:4689. doi: 10.1038/s41467-019-12687-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Horie T, Kusakabe T, Tsuda M. Glutamatergic networks in the Ciona intestinalis larva. The Journal of Comparative Neurology. 2008;508:249–263. doi: 10.1002/cne.21678. [DOI] [PubMed] [Google Scholar]
  30. Hotta K, Mitsuhara K, Takahashi H, Inaba K, Oka K, Gojobori T, Ikeo K. A web-based interactive developmental table for the ascidian Ciona intestinalis, including 3D real-image embryo reconstructions: I. from fertilized egg to hatching larva. Developmental Dynamics. 2007;236:1790–1805. doi: 10.1002/dvdy.21188. [DOI] [PubMed] [Google Scholar]
  31. Huber JL, da Silva KB, Bates WR, Swalla BJ. The evolution of anural larvae in molgulid ascidians. Seminars in Cell & Developmental Biology. 2000;11:419–426. doi: 10.1006/scdb.2000.0195. [DOI] [PubMed] [Google Scholar]
  32. Hudson C, Ba M, Rouviere C, Yasuo H. Divergent mechanisms specify chordate motoneurons: evidence from ascidians. Development. 2011;138:1643–1652. doi: 10.1242/dev.055426. [DOI] [PubMed] [Google Scholar]
  33. Hudson C, Yasuo H. Similarity and diversity in mechanisms of muscle fate induction between ascidian species. Biology of the Cell. 2008;100:265–277. doi: 10.1042/BC20070144. [DOI] [PubMed] [Google Scholar]
  34. Imai KS, Hino K, Yagi K, Satoh N, Satou Y. Gene expression profiles of transcription factors and signaling molecules in the ascidian embryo: towards a comprehensive understanding of gene networks. Development. 2004;131:4047–4058. doi: 10.1242/dev.01270. [DOI] [PubMed] [Google Scholar]
  35. Johnson DS, Davidson B, Brown CD, Smith WC, Sidow A. Noncoding regulatory sequences of Ciona exhibit strong correspondence between evolutionary constraint and functional importance. Genome Research. 2004;14:2448–2456. doi: 10.1101/gr.2964504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. José-Edwards DS, Oda-Ishii I, Kugler JE, Passamaneck YJ, Katikala L, Nibu Y, Di Gregorio A. Brachyury, Foxa2 and the cis-Regulatory Origins of the Notochord. PLOS Genetics. 2015;11:e1005730. doi: 10.1371/journal.pgen.1005730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Joyce Tang W, Chen JS, Zeller RW. Transcriptional regulation of the peripheral nervous system in Ciona intestinalis. Developmental Biology. 2013;378:183–193. doi: 10.1016/j.ydbio.2013.03.016. [DOI] [PubMed] [Google Scholar]
  38. Kawashima T, Kawashima S, Kanehisa M, Nishida H, Makabe KW. MAGEST: maboya gene expression patterns and sequence tags. Nucleic Acids Research. 2000;28:133–135. doi: 10.1093/nar/28.1.133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kawashima T, Kawashima S, Kohara Y, Kanehisa M, Makabe KW. Update of MAGEST: maboya gene expression patterns and sequence tags. Nucleic Acids Research. 2002;30:119–120. doi: 10.1093/nar/30.1.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kourakis MJ, Smith WC. An organismal perspective on C. intestinalis development, origins and diversification. eLife. 2015;4:e06024. doi: 10.7554/eLife.06024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Lemaire P, Smith WC, Nishida H. Ascidians and the plasticity of the chordate developmental program. Current Biology. 2008;18:R620–R631. doi: 10.1016/j.cub.2008.05.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Lemaire P. Evolutionary crossroads in developmental biology: the tunicates. Development. 2011;138:2143–2152. doi: 10.1242/dev.048975. [DOI] [PubMed] [Google Scholar]
  43. Lemaire P, biology D. How many ways to make a chordate? Science. 2006;312:1145–1146. doi: 10.1126/science.1128784. [DOI] [PubMed] [Google Scholar]
  44. Levine M, Davidson EH. Gene regulatory networks for development. PNAS. 2005;102:4936–4942. doi: 10.1073/pnas.0408031102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lusk RW, Eisen MB. Evolutionary mirages: selection on binding site composition creates the illusion of conserved grammars in Drosophila enhancers. PLOS Genetics. 2010;6:e1000829. doi: 10.1371/journal.pgen.1000829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Madgwick A, Magri MS, Dantec C, Gailly D, Fiuza U-M, Guignard L, Hettinger S, Gomez-Skarmeta JL, Lemaire P. Evolution of embryonic cis-regulatory landscapes between divergent Phallusia and Ciona ascidians. Developmental Biology. 2019;448:71–87. doi: 10.1016/j.ydbio.2019.01.003. [DOI] [PubMed] [Google Scholar]
  47. McGaughey DM, Vinton RM, Huynh J, Al-Saif A, Beer MA, McCallion AS. Metrics of sequence constraint overlook regulatory sequences in an exhaustive analysis at phox2b. Genome Research. 2008;18:252–260. doi: 10.1101/gr.6929408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Mita-Miyazawa I, Ikegami S, Satoh N. Histospecific acetylcholinesterase development in the presumptive muscle cells isolated from 16-cell-stage ascidian embryos with respect to the number of DNA replications. Journal of Embryology and Experimental Morphology. 1985;87:1–12. [PubMed] [Google Scholar]
  49. Nieto MA. A snail tale and the chicken embryo. The International Journal of Developmental Biology. 2018;62:121–126. doi: 10.1387/ijdb.170301mn. [DOI] [PubMed] [Google Scholar]
  50. Nishida H. Cell division pattern during Gastrulation of the Ascidian, Halocynthia roretzi (cell division pattern/gastrulation/neurulation/ascidian embryo) Development, Growth and Differentiation. 1986;28:191–201. doi: 10.1111/j.1440-169X.1986.00191.x. [DOI] [PubMed] [Google Scholar]
  51. Nitta KR, Vincentelli R, Jacox E, Cimino A, Ohtsuka Y, Sobral D, Satou Y, Cambillau C, Lemaire P. High-Throughput protein production combined with high- Throughput SELEX identifies an extensive atlas of Ciona robusta transcription factor DNA-Binding specificities. Methods in Molecular Biology. 2019;2025:487–517. doi: 10.1007/978-1-4939-9624-7_23. [DOI] [PubMed] [Google Scholar]
  52. O'Flaherty E, Kaye J. TOX defines a conserved subfamily of HMG-box proteins. BMC Genomics. 2003;4:13. doi: 10.1186/1471-2164-4-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Oda-Ishii I, Bertrand V, Matsuo I, Lemaire P, Saiga H. Making very similar embryos with divergent genomes: conservation of regulatory mechanisms of otx between the ascidians Halocynthia roretzi and Ciona intestinalis. Development. 2005;132:1663–1674. doi: 10.1242/dev.01707. [DOI] [PubMed] [Google Scholar]
  54. Ohtsuka Y, Obinata T, Okamura Y. Induction of ascidian peripheral neuron by vegetal blastomeres. Developmental Biology. 2001a;239:107–117. doi: 10.1006/dbio.2001.0425. [DOI] [PubMed] [Google Scholar]
  55. Ohtsuka Y, Okamura Y, Obinata T. Changes in gelsolin expression during ascidian metamorphosis. Development Genes and Evolution. 2001b;211:252–256. doi: 10.1007/s004270100134. [DOI] [PubMed] [Google Scholar]
  56. Ovcharenko I, Loots GG, Hardison RC, Miller W, Stubbs L. zPicture: dynamic alignment and visualization tool for analyzing conservation profiles. Genome Research. 2004;14:472–477. doi: 10.1101/gr.2129504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Pasini A, Amiel A, Rothbächer U, Roure A, Lemaire P, Darras S. Formation of the ascidian epidermal sensory neurons: insights into the origin of the chordate peripheral nervous system. PLOS Biology. 2006;4:e225. doi: 10.1371/journal.pbio.0040225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Pasini A, Manenti R, Rothbächer U, Lemaire P. Antagonizing retinoic acid and FGF/MAPK pathways control posterior body patterning in the invertebrate chordate Ciona intestinalis. PLOS ONE. 2012;7:e46193. doi: 10.1371/journal.pone.0046193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Racioppi C, Valoroso MC, Coppola U, Lowe EK, Brown CT, Swalla BJ, Christiaen L, Stolfi A, Ristoratore F. Evolutionary loss of melanogenesis in the tunicate Molgula occulta. EvoDevo. 2017;8:11. doi: 10.1186/s13227-017-0074-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Razy-Krajka F, Stolfi A. Regulation and evolution of muscle development in tunicates. EvoDevo. 2019;10:13. doi: 10.1186/s13227-019-0125-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rebeiz M, Tsiantis M. Enhancer evolution and the origins of morphological novelty. Current Opinion in Genetics & Development. 2017;45:115–123. doi: 10.1016/j.gde.2017.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Roure A, Rothbächer U, Robin F, Kalmar E, Ferone G, Lamy C, Missero C, Mueller F, Lemaire P. A multicassette gateway vector set for high throughput and comparative analyses in Ciona and vertebrate embryos. PLOS ONE. 2007;2:e916. doi: 10.1371/journal.pone.0000916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Roure A, Lemaire P, Darras S. An otx/nodal regulatory signature for posterior neural development in ascidians. PLOS Genetics. 2014;10:e1004548. doi: 10.1371/journal.pgen.1004548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Roure A, Darras S. Msxb is a core component of the genetic circuitry specifying the dorsal and ventral neurogenic midlines in the ascidian embryo. Developmental Biology. 2016;409:277–287. doi: 10.1016/j.ydbio.2015.11.009. [DOI] [PubMed] [Google Scholar]
  65. Satoh N. Developmental Biology of Ascidians. Cambridge University Press; 1994. [Google Scholar]
  66. Satoh N, Rokhsar D, Nishikawa T. Chordate evolution and the three-phylum system. Proceedings of the Royal Society B: Biological Sciences. 2014;281:20141729. doi: 10.1098/rspb.2014.1729. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Satoh N. Developmental genomics of ascidians. Hoboken, New Jersey: John Wiley & Sons, Inc; 2014. [Google Scholar]
  68. Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, Haussler D, Miller W. Human-mouse alignments with BLASTZ. Genome Research. 2003;13:103–107. doi: 10.1101/gr.809403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Shenkar N, Swalla BJ. Global diversity of ascidiacea. PLOS ONE. 2011;6:e20657. doi: 10.1371/journal.pone.0020657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Stolfi A, Lowe EK, Racioppi C, Ristoratore F, Brown CT, Swalla BJ, Christiaen L. Divergent mechanisms regulate conserved cardiopharyngeal development and gene expression in distantly related ascidians. eLife. 2014;3:e03728. doi: 10.7554/eLife.03728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Stolfi A, Sasakura Y, Chalopin D, Satou Y, Christiaen L, Dantec C, Endo T, Naville M, Nishida H, Swalla BJ, Volff JN, Voskoboynik A, Dauga D, Lemaire P. Guidelines for the nomenclature of genetic elements in tunicate genomes. Genesis. 2015;53:1–14. doi: 10.1002/dvg.22822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Takahashi H, Mitani Y, Satoh G, Satoh N. Evolutionary alterations of the minimal promoter for notochord-specific brachyury expression in ascidian embryos. Development. 1999;126:3725–3734. doi: 10.1242/dev.126.17.3725. [DOI] [PubMed] [Google Scholar]
  73. Torrence SA, Cloney RA. Nervous system of ascidian larvae: caudal primary sensory neurons. Zoomorphology. 1982;99:103–115. doi: 10.1007/BF00310303. [DOI] [Google Scholar]
  74. True JR, Haag ES. Developmental system drift and flexibility in evolutionary trajectories. Evolution and Development. 2001;3:109–119. doi: 10.1046/j.1525-142x.2001.003002109.x. [DOI] [PubMed] [Google Scholar]
  75. Verster AJ, Ramani AK, McKay SJ, Fraser AG. Comparative RNAi screens in C. Elegans and C. briggsae reveal the impact of developmental system drift on gene function. PLOS Genetics. 2014;10:e1004077. doi: 10.1371/journal.pgen.1004077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Wada S, Katsuyama Y, Saiga H. Anteroposterior patterning of the epidermis by inductive influences from the vegetal hemisphere cells in the ascidian embryo. Development. 1999;126:4955–4963. doi: 10.1242/dev.126.22.4955. [DOI] [PubMed] [Google Scholar]
  77. Waki K, Imai KS, Satou Y. Genetic pathways for differentiation of the peripheral nervous system in ascidians. Nature Communications. 2015;6:8719. doi: 10.1038/ncomms9719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Weirauch MT, Yang A, Albu M, Cote AG, Montenegro-Montero A, Drewe P, Najafabadi HS, Lambert SA, Mann I, Cook K, Zheng H, Goity A, van Bakel H, Lozano J-C, Galli M, Lewsey MG, Huang E, Mukherjee T, Chen X, Reece-Hoyes JS, Govindarajan S, Shaulsky G, Walhout AJM, Bouget F-Y, Ratsch G, Larrondo LF, Ecker JR, Hughes TR. Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity. Cell. 2014;158:1431–1443. doi: 10.1016/j.cell.2014.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Yagi K, Makabe KW. Isolation of an early neural maker gene abundantly expressed in the nervous system of the ascidian, Halocynthia roretzi. Development Genes and Evolution. 2001;211:49–53. doi: 10.1007/s004270000118. [DOI] [PubMed] [Google Scholar]
  80. Yamada A, Nishida H. Distinct parameters are involved in controlling the number of rounds of cell division in each tissue during ascidian embryogenesis. Journal of Experimental Zoology. 1999;284:379–391. doi: 10.1002/(SICI)1097-010X(19990901)284:4&#x0003c;379::AID-JEZ4&#x0003e;3.0.CO;2-8. [DOI] [PubMed] [Google Scholar]
  81. Yao LC, Blitz IL, Peiffer DA, Phin S, Wang Y, Ogata S, Cho KW, Arora K, Warrior R. Schnurri transcription factors from Drosophila and vertebrates can mediate bmp signaling through a phylogenetically conserved mechanism. Development. 2006;133:4025–4034. doi: 10.1242/dev.02561. [DOI] [PubMed] [Google Scholar]

Decision letter

Editor: Shigehiro Kuraku1

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The authors of this study focused on four ascidian species with more than 250 million years of evolutionary divergence and performed an impressive suite of laboratory experiments and DNA sequence analyses on key molecular factors forming the peripheral nervous system of the ascidians. They report possible conservation of the regulatory mechanism over this evolutionary period for the first time.

Decision letter after peer review:

Thank you for submitting your article "Conservation of peripheral nervous system formation mechanisms in divergent ascidian embryos" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Marianne Bronner as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission. Because of the extensive textual revisions requested by all reviewers, we refer you to the detailed reviews for specific changes that should be make.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

Summary

In this study, Coulcher et al. analyzed the cis-regulatory regions for determinants of dorsal and ventral peripheral nervous system (PNS) in several species of ascidians. Starting with the Phlebobranch Ciona intestinalis, they identified active enhancers for 6 out 7 transcription factor-coding genes that were previously characterized to various extent. They showed that homologous transcription factors have conserved expression patterns in a distant Phlebobranch, P. mammillata, and identified corresponding cis-regulatory modules (CRMs) that drive reporter gene expression in the caudal PNS. Using cross-species reporter gene expression assays, between C. intestinalis and P. mammillata, but also in Ascidia mentula and Phallusia fumigata, the authors conclude that the identified regulatory DNAs have largely conserved enhancer activities within Phlebobranchs. However, minimal elements showed deprecated activity in cross-species assays, and the Ciona elements also responded differently from endogenous genes in response to perturbations of trans-acting regulators in C. intestinalis.

Next, the authors inspected the spatial and temporal expression patterns of homologous genes in two Stolidobranch species, Molgula appendiculata and Halocynthia roretzi. They observed a reduced number of caudal epidermal sensory neurons (CESN) in M. appendiculata and a restriction to the tip of the tail, which was consistent with restricted patterns of Nkx, Ascl and Tox. Finally, the authors performed CRM swaps between Phlebobranchs and Stolidobranch species, which represent longer evolutionary distances, and showed preliminary evidence that Stolidobranch species interpreted Phlebobranch sequences differently.

Reviewer #1:

This study explored the regulatory mechanisms for the peripheral nervous system of ascidians from an evolutionary viewpoint. The authors of this study performed an impressive suite of laboratory experiments and bioinformatic analyses, covering four ascidian species with more than 250 million years of evolutionary divergence. They report possible conservation of the regulatory mechanism over this evolutionary period, while they revealed some dissimilarities. I found a large value of publishing this study, while I recognized quite a few problematic descriptions of their data, lacking objective and quantitative grounds. I suggest a substantial revision of the manuscript to enhance the objectivity of their arguments based on the points listed below.

1) In many parts of the manuscript, the results are described in too obscure expressions, including “simple”, “conserved”, “divergent”, “typical”, “very few”, “fast”, and “large” that do not accompany quantitative or comparative information. For example, the word “conserved” appears seven times in the Introduction, and they do not provide any logical and testable meaning. Those parts of the manuscript should be carefully rewritten to logically convey the intension.

2) Among vertebrates, we already know quite a few cases of evolutionary conservation of regulatory mechanisms over 300 million years, e.g., between mouse and zebrafish. To me, the scope of this study seems confined to urochordate biology and evolution, but it is advisable to discuss the conservation levels across the ascidian and other lineages along the timeline.

3) The citation in paragraph four of the Introduction should be limited to the references about the divergence time, that is, Delsuc et al., 2018.

4) What ascertains the orthology between genes and genomic regions of different species analyzed in this study? State if not done in the manuscript.

5) The use of the words “player” and “actors” are too abstract and should be refrained.

6) Clarify what “phylogenetic footprints” means.

7) Enrichment of particular genomic regions revealed by ATAC-seq should be called “enrichment”, but not “peak”.

8) Splice isoforms are mentioned only for Tox. Consider mentioning the isoform multiplicity for the other genes, if any.

9) The word “manuscript” in subsection “Expression and regulation of caudal PNS genes” should be replace with “study”.

10) In subsection “Expression and regulation of caudal PNS genes”, is it clear from which species the sequence of the recombinant Bmp4 protein is derived? State which species, if not included in Materials and methods.

11) It was not clear to me what this part means “if Molgula as a divergent species is a valid hypothesis”. The word “divergent” without any quantitative basis does not provide any chance for validating it. This part needs to be elaborated.

Reviewer #2:

Coulcher et al. present an in-depth identification of PNS midline enhancers in divergent ascidians. They use cross-species transgenesis to obtain evidence that enhancer DNAs with very little sequence conservation nonetheless produce similar patterns of gene expression. The study is exhaustive and exhausting and should be of interest to specialists in the field, but the authors will need to rework the presentation in order to engage a broader audience.

A few specific comments follow:

The identification of separate early and late Msx enhancers for expression in the b6.5 lineage is a highlight of the study and should be emphasized in a revised manuscript. The authors should discuss why separate enhancers are used and whether they are likely to interact with different classes of sequence-specific transcription factors.

It is curious that the authors identified dorsal-specific midline enhancers but no ventral-specific enhancers. They do not provide a very compelling explanation for this finding. The ventral midline PNS might be a "throwback" to simpler Deuterostomes that contain both dorsal and ventral nerve cords.

I'm not convinced that the authors have done the most thorough job of identifying TFs that recognize the different enhancers. As a result, it remains an open question whether divergent enhancers use a similar logic to produce similar expression profiles, or if there have been substantive changes in the gene regulatory networks governing the specification of ventral midline in divergent ascidians.

Reviewer #3:

This is a very rich study, reporting on heroic efforts to characterize gene expression patterns, enhancers and their activity across multiple species of ascidians, while focusing on the peripheral nervous system, which is of broad interest. The specific data reported in this manuscript will thus be of interest for both specialists in the field, and outside.

For example, it is interesting to identify distinct ventral vs. dorsal elements for genes like Msx and Ascl.b, but pan-midline elements for Tox and Klf, which indeed suggests that the more "upstream" factors integrate distinct regional inputs, while they a common logic controls the core midline factors. However this model needs further support and clarification.

Besides the large amount of specific new information, it is quite intriguing to observe that minimal enhancers are more likely to fail in cross-species assays, and show more discrepancies with endogenous gene expression in experimental conditions that perturb trans-acting factors. These observations suggest that, even when the activity entire cis-regulatory systems is conserved, individual elements are subjected to drift, which is in keeping with the lack of sequence conservation and could be discussed more explicitly in the light of emerging concepts about complex cis-regulatory systems (e.g. so-called shadow enhancers).

Finally, a key merit of the comparison with Stolidobranch is to show evolutionary changes in gene expression patterns (e.g. restriction to the tip of the tail in Molgula, reduced domain of neurogenic competence in Halocynthia), in addition to CRM drift, which has been documented in several other contexts before. The two are difficult to disentangle though, and this last section seems more preliminary; the paper would thus probably benefit from focusing on the sections on Phlebobranchs.

In sum, this is a very rich manuscript with numerous strengths, which would nonetheless benefit from extensive modifications for clarity, a better focus and more in-depth analysis of key points as detailed below.

Major comments

The paper is often very difficult to follow, especially because critical information regarding endogenous and reporter gene expression, ATAC-seq peaks, deletions and reporter activity are scattered across different figures, especially when cross-species comparisons are evoked. Specific cases are listed below:

Figure 1F: difficult to relate the deletions to the peaks of conservation and accessibility if they are not aligned.

Indicate clearly (in main text and figure) what is the core promoter used.

The arrangement of figures makes it very difficult to follow. E.g. in Supplementary file 2, Klf comes after Nkx-tun3 but it comes before in the text.

Figure 3—figure supplement 2: Klf very hard to connect the pattern, with the ATAC-seq data, with the deletions and the scoring. There should be a way to better integrate these informations and make the figures easier to follow.

Nomenclature needs to be consistent across text and figures, especially for gene names (Tox vs CAGF, Dll.C vs. Dlx.c, etc…). Use published guidelines, and for TFs, the names used in the latest version of the genome.

Not clear from Figure 5 and supplement what the authors mean by "The topology of early and late Msx CRMs is strikingly similar between C. intestinalis and P. mammillata".

Main conclusion on "strong similarities" would be better illustrated by placing the diagram representing genes and CRMs of C. intestinalis and P. mammillata side-by-side to highlight said similarities, and give the reader a chance to evaluate the differences.

Summary table of stages and tissues stained for each gene between species would be useful to compare patterns across species. And differences could be presented as heatmaps, similar to other panels in the paper (e.g. Figure 7C). Currently the extensive in situ panels are very nice but difficult to compare to each other.

An important aspect of the paper is the relative contribution of individual CRMs to endogenous expression. While there is evidence that genes like Msx.b and Ascl.b are regulated differently, and part through separate elements in the ventral and dorsal midline, this should still be clarified. Specifically:

The importance of individual elements is never tested using CRISPR/Cas9 mutations followed by in situ hybridization, while this would strongly complement some of the conclusions based exclusively on reporter assays. For no gene is it entirely clear how the identified elements contribute to endogenous regulation, also since there are several other peaks in the regions that were not included in the analysis. E.g. Dlx.c had >14 peaks in the ATAC-seq data, out of which one (1) was included in the reporter constructs.

The existence of separate vs. shared CRMs is intriguing and interesting, except this is not as clear for Ascl.b, since both CiAshdo4 and CiAshdo5 are still active in the VML, and an intronic accessible region has not been tested. Moreover, Nkx.C is presented as an upstream regulator and absent from the Msxb GRN in Roure and Darras, 2016, but here it is incorporated in the core midline PNS network. Since its intron has not been tested, one cannot rule out that there is regulatory activity that would distinguish between ventral and dorsal midlines, which is also true for Dlx.c. This basically casts doubt on the model shown in Figure 4A.

For Ascl.b, the Supplementary file 2 makes it look like the intron should be added to the upstream region for thorough testing. In general, one intronic region also appears accessible for Nkx-tun3.

Multiple CRMs in Msx, Klf, Nkx and Dlx.c also begs the question as to whether these play redundant roles (like shadow enhancers), or have seemingly overlapping activities in reporter assays, but contribute to different aspects of gene expression.

X-gal staining does not report on the dynamics of enhancer activity: could the two CRMs act at different times? Different levels? Like so-called "shadow" enhancers? Different domains. This needs to be clarified to be useful.

Discussion “ this opens an interesting perspective in identifying and determining the properties of the DNA elements that confer robustness.”: this echoes the growing recognition of integrated cis-regulatory systems with multiple elements contributing to gene regulation.

In general, the authors need to integrate the sequence of elements, including conservation, putative binding sites, and known regulators to paint a more complete and clearer picture of enhancer function in the first part, and of possible conservation -or divergence- of regulatory linkages in the comparative part, especially with regards to the ventral vs. dorsal inputs and the pan-medial/core network for PNS specification.

BMP and Notch signaling regulate ventral MDL and PNS, although crude assays do not allow for refined spatial and temporal resolution, so strictly speaking the time of space at which these regulators act could vary. This is potentially important because the conclusions that will be drawn regarding CRM evolution depend on whether the trans-acting inputs are exactly the same or not.

Sequence analysis, especially in terms of known regulatory linkage within the PNS GRN would strengthen the conclusion on conservation, while clarifying the impact of sequence divergence (i.e. redistribution of binding sites, etc…).

Finally, the analysis of Stolidobranchia CRMs is cursory at this point, and one notes that presumably CRMs for the same genes could be obtained from the sequenced genomes of other tailed Molgula species (e.g. Molgula oculata or Molgula occidentalis). Considering the density of the current manuscript, the cursory analysis of Stolidobranch vs. Phlebobranch comparisons, and the fact that this has been done before, I would probably recommend focusing on the within Phlebobranch comparisons and strengthening the paper by better integrating the endogenous and reporter expression data with previously established regulatory linkage, sequences and putative TF binding sites. The latter is mentioned as future work in the Discussion, but it is not clear why it cannot be incorporated here, in a focused but clarified version of the study.

Drift between Phlebobranch and Stolidobranch enhancers for conserved gene expression patterns has been described qualitatively before, so the key novelties are the within order comparisons and the evolutionary changes, especially the seemingly reduced neurogenic compartments in Stolidobranchs.

Localized expression of Tox and Ascl.b in Halocynthia, as opposed to pan-VMDL in Phlebobranch is also a potentially interesting difference (i.e. not the whole midline has neurogenic potential). => possible conservation of the midline program, but drift in its linkage to the neurogenic program in CESNs? This would be very interesting, but difficult to address in a revision.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Thank you for submitting your article "Conservation of peripheral nervous system formation mechanisms in divergent ascidian embryos" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Marianne Bronner as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

Summary:

This study presents an elaborate analysis employing open chromatin data investigation and promoter assays to detect binding sites on a genome-wide scale for several transcription factors by encompassing multiple ascidian species. The study was conducted to elucidate the evolutionary origin of the mechanism governing the formation of the peripheral nervous system (PNS). The results highlighted the unexpected commonalities between those species that are divergent from each other over 200 million years since their split and supported the early origin of the regulatory mechanism for the PNS formation.

Revisions:

The authors took a lot of the main comments to heart in the previous revision, but the manuscript became loaded with details and unnecessarily lengthy. For the sake of clarity of keys in the original data and the conclusion, the manuscript should now be streamlined before making a decision. I advise the authors to achieve this by deleting speculative sentences in Results, for example, “the loss of these sites might thus be responsible of the absence of activity in the VML.” and “and this may explain why we did not observe CRM active only in the VML.” in the Msx section This type of information also in the other TF sections should be deleted if not directly linked to the main conclusion, or substantially shortened after moving to Discussion.

eLife. 2020 Nov 16;9:e59157. doi: 10.7554/eLife.59157.sa2

Author response


Reviewer #1:

This study explored the regulatory mechanisms for the peripheral nervous system of ascidians from an evolutionary viewpoint. The authors of this study performed an impressive suite of laboratory experiments and bioinformatic analyses, covering four ascidian species with more than 250 million years of evolutionary divergence. They report possible conservation of the regulatory mechanism over this evolutionary period, while they revealed some dissimilarities. I found a large value of publishing this study, while I recognized quite a few problematic descriptions of their data, lacking objective and quantitative grounds. I suggest a substantial revision of the manuscript to enhance the objectivity of their arguments based on the points listed below.

1) In many parts of the manuscript, the results are described in too obscure expressions, including “simple”, “conserved”, “divergent”, “typical”, “very few”, “fast”, and “large” that do not accompany quantitative or comparative information. For example, the word “conserved” appears seven times in the Introduction, and they do not provide any logical and testable meaning. Those parts of the manuscript should be carefully rewritten to logically convey the intension.

We have revised the text trying to be more precise and specific by bringing additional information and references. We have also added two summary figures (Figure 8—figure supplement 3 and Figure 10) that allow to evaluate the degree of conservation in a visual way.

2) Among vertebrates, we already know quite a few cases of evolutionary conservation of regulatory mechanisms over 300 million years, e.g., between mouse and zebrafish. To me, the scope of this study seems confined to urochordate biology and evolution, but it is advisable to discuss the conservation levels across the ascidian and other lineages along the timeline.

We have added a short reference to comparative studies in vertebrates in the Discussion.

3) The citation in paragraph four of the Introduction should be limited to the references about the divergence time, that is, Delsuc et al., 2018.

This has been corrected.

4) What ascertains the orthology between genes and genomic regions of different species analyzed in this study? State if not done in the manuscript.

To identify orthologous genes, we have performed reciprocal blast and used phylogenies available on the Aniseed database (for Ciona, Phallusia and Halocynthia). For Molgula appendiculata, we have only performed reciprocal blast but no proper phylogenetic analysis. This has been added in the Materials and methods section.

In our study, we have not directly adressed the issue of orthologous genomic regions. We have guided our choice for CRM candidates by DNA sequence conservation in non-coding regions. These “phylogenetic footprints” could be obtained from sister species (Ciona intestinalis and Ciona savignyi; Phallusia mammillata and Phallusia fumigata). However, non-coding DNA does not align between Ciona and Phallusia. This is one of the difficulty and interest of comparative CRM studies in ascidians. We have discussed this aspect in the text.

5) The use of the words “player” and “actors” are too abstract and should be refrained.

We have used more specific terms in all instances.

6) Clarify what “phylogenetic footprints” means.

Done.

7) Enrichment of particular genomic regions revealed by ATAC-seq should be called “enrichment”, but not “peak”.

This has been corrected.

8) Splice isoforms are mentioned only for Tox. Consider mentioning the isoform multiplicity for the other genes, if any.

Tox is the only case with evidence for isoforms.

9) The word “manuscript” in subsection “Expression and regulation of caudal PNS genes” should be replace with “study”.

Done.

10) In subsection “Expression and regulation of caudal PNS genes”, is it clear from which species the sequence of the recombinant Bmp4 protein is derived? State which species, if not included in Materials and methods.

We have used recombinant Bmp4 protein from zebrafish. This was included in the Material and Methods section.

11) It was not clear to me what this part means “if Molgula as a divergent species is a valid hypothesis”. The word “divergent” without any quantitative basis does not provide any chance for validating it. This part needs to be elaborated.

We are sorry of the confusion, we have modified the text. The divergent status of Molgula was developed in a previous paragraph.

Reviewer #2:

Coulcher et al. present an in-depth identification of PNS midline enhancers in divergent ascidians. They use cross-species transgenesis to obtain evidence that enhancer DNAs with very little sequence conservation nonetheless produce similar patterns of gene expression. The study is exhaustive and exhausting and should be of interest to specialists in the field, but the authors will need to rework the presentation in order to engage a broader audience.

A few specific comments follow:

The identification of separate early and late Msx enhancers for expression in the b6.5 lineage is a highlight of the study and should be emphasized in a revised manuscript. The authors should discuss why separate enhancers are used and whether they are likely to interact with different classes of sequence-specific transcription factors.

Following the reviewer's suggestion, we have examined TFBSs for the active CRMs that we have identified in this study (see below). An interesting hypothesis (schematized in Figure 1—figure supplement 2B) is as follows. The early/proximal Msx CRM is activated following neural induction and allows expression in DML precursors. The late/distal CRM would initiate Msx expression in DML at gastrula/neurula stages by Msx itself.

It is curious that the authors identified dorsal-specific midline enhancers but no ventral-specific enhancers. They do not provide a very compelling explanation for this finding. The ventral midline PNS might be a "throwback" to simpler Deuterostomes that contain both dorsal and ventral nerve cords.

We were also disappointed of not finding ventral-specific enhancers for Msx and Ascl.b through rather extensive deletion analyses. We had proposed in the first version of the manuscript that ventral and dorsal CRMs were overlapping. Following the reviewer's suggestion for TFBS identification (see below), we now show in Figure 1—figure supplement 2 that candidate sites for ventral TFs are more abundant than sites for dorsal TFs, and that both types of sites are intermingled or even overlapping. It is thus possible that “by chance” we have eliminated ventral TFBS and revealed dorsal-specific activity, while other deletions have eliminated both dorsal and ventral TFBS. Site-specific mutagenesis will be required to test this hypothesis.

Regarding the evolutionary interpretation of the ventral vs dorsal PNS, we feel it is still difficult comparing ascidian tail PNS with nervous systems from echinoderms and hemichordates. However, the existence of a Bmp-induced ventral neurogenic territory is evident in both ascidians and amphioxus, suggesting an ancestral origin at least in chordates. Since this territory does not exist in vertebrates, it is likely that it has been lost. In our group, we are currently working on a comparative project on ascidians and amphioxus to test the hypothesis that this ancestral ventral program has been co-opted at the dorsal neural plate border and has led to the emergence of vertebrate neural crest and placodes. An attractive hypothesis would be that sites for dorsal TFs (Msx for example) have appeared in an “ancestral” ventral neurogenic epidermis enhancer. However, the results that we present in the current study do not bring results strong enough to discuss this evolutionary aspect.

I'm not convinced that the authors have done the most thorough job of identifying TFs that recognize the different enhancers. As a result, it remains an open question whether divergent enhancers use a similar logic to produce similar expression profiles, or if there have been substantive changes in the gene regulatory networks governing the specification of ventral midline in divergent ascidians.

We also thought that comparing TFBS composition of enhancers with conserved activity but originating from different species would be straightforward in determining whether the regulatory logic is conserved or not. We did not show any data on this aspect in the first version because of the following weaknesses:

– the caudal PNS GRN is still poor, mainly built from expression data, overexpression of midline TFs and loss-of-function of a few genes. Hence, direct regulations and the identity of TFs that are actually involved are not known.

– gene regulation is modular with redundant, shadow or distributed enhancers. Given that DNA sequence conservation between ascidian species from different genera (Ciona intestinalis and Phallusia mammillata for example) is virtually absent, it is not possible to define orthologous CRMs. While topology, sequence conservation (between closely related species: Ciona intestinalis and Ciona savignyi) and accessibility (ATAC-seq) could be used to propose orthologous CRMS; we are not sure that we will be comparing equivalent functional units.

We nevertheless did our best to address this question:

– we have gathered available information and proposed some hypotheses for gene regulation to improve the VDML GRN (Figure 1—figure supplement 1A and Materials and methods)

– we have gathered TF binding specificities (Figure 1—figure supplement 1B) and mapped TFBS on CRMs aligned with their counterpart from closely related species (C. intestinalis and C. savignyi alignment on one side, and P. mammillata and P. fumigata alignment on the other side). We have restricted our analysis to CRMs smaller than 1 kb and to TFBS that would be conserved in the alignments (assuming that regulatory logic is conserved for close species).

This analysis in Ciona intestinalis allowed us to:

– propose some direct regulations for a number of genes

– suggest that each gene of the VDML network is regulated differently (in particular, the early genes Msx and Ascl.b do not seem to be regulated similarly)

– propose that unidentified TFs are likely to play important roles

Unfortunately, given that our in vivo analysis of CRMs is not exhaustive, we could make comparisons between Ciona and Phallusia for only 2 genes. However, this gave support for a similar logic in both species: Msx (direct activation by Bmp signaling in the VML and by Msx in the DML) and Tox (direct activation by Ascl.b).

Reviewer #3:

[…]

Major comments

The paper is often very difficult to follow, especially because critical information regarding endogenous and reporter gene expression, ATAC-seq peaks, deletions and reporter activity are scattered across different figures, especially when cross-species comparisons are evoked. Specific cases are listed below:

We acknowledge that the rather large amount of data that we present may make the reading difficult. We had tried to present the actual data without simplification, while trying at the same time to keep systematic graphical representations for clarity. We have now followed reviewer's advice and made several changes that are described below. We have also added two new summary figures (Figure 8—figure supplement 3 and Figure 10) that schematize the comparative part. We also have added new analyses (TFBS, see below) that we hope will not make the paper even more confusing.

Figure 1F: difficult to relate the deletions to the peaks of conservation and accessibility if they are not aligned.

Conservation and accessibility have now been reported in all schemes describing various CRMs and their activity.

Indicate clearly (in main text and figure) what is the core promoter used.

In all cases, except 3 CRMs (Ciinte.Nkxtun3-upstream, Ciinte.Ascl.b-upstream and Ciinte.Bhlhtun1-upstream), the core promoter is the one of Ciinte.Fog (Rothbacher et al., 2007). This information can be found in Materials and methods and Supplementary file 1. We have now added a sentence at the beginning of the Results section. We have chosen not to repeat this redundant information in every figure legend.

The arrangement of figures makes it very difficult to follow. E.g. in Supplementary file 2, Klf comes after Nkx-tun3 but it comes before in the text.

This has been fixed.

Figure 3—figure supplement 2: Klf very hard to connect the pattern, with the ATAC-seq data, with the deletions and the scoring. There should be a way to better integrate these information and make the figures easier to follow.

As mentioned above, accessibility and conservation are now shown directly below the schemes of the various tested CRMs. We hope figures are clearer and easier to follow.

Nomenclature needs to be consistent across text and figures, especially for gene names (Tox vs CAGF, Dll.C vs. Dlx.c, etc…). Use published guidelines, and for TFs, the names used in the latest version of the genome.

Gene and CRM names have been modified throughout text and figures to adhere to the proposed nomenclature (Stolfi et al., 2015).

Not clear from Figure 5 and supplement what the authors mean by "The topology of early and late Msx CRMs is strikingly similar between C. intestinalis and P. mammillata".

We have clarified this point.

Main conclusion on "strong similarities" would be better illustrated by placing the diagram representing genes and CRMs of C. intestinalis and P. mammillata side-by-side to highlight said similarities, and give the reader a chance to evaluate the differences.

We have moderated our conclusions. The idea that regulatory landscapes were similar emerged from the position of the CRMs active in VDML relative to the gene of interest, and from the Msx locus. While the Msx case is striking, the position of CRMs for other genes (within 3 kb upstream) might correspond to real similarities, but might be simply due to the fact that this is where we looked for them in the first place. Since other VDML CRMs might exist (see below), we removed this idea of “regulatory landscape”.

Summary table of stages and tissues stained for each gene between species would be useful to compare patterns across species. And differences could be presented as heatmaps, similar to other panels in the paper (e.g. Figure 7C). Currently the extensive in situ panels are very nice but difficult to compare to each other.

We have added a supplemental figure to Figure 8 (Figure 8—figure supplement 3) that schematizes the expression patterns in the four species.

An important aspect of the paper is the relative contribution of individual CRMs to endogenous expression. While there is evidence that genes like Msx.b and Ascl.b are regulated differently, and part through separate elements in the ventral and dorsal midline, this should still be clarified.

We do agree with the reviewer that transcriptional regulation is complex and involves multiple CRMs that may or may not have similar activity in time and space. But we did not claim that we have exhaustively analyzed the CRM landscape for PNS expression of our genes of interest. The reviewer has to keep in mind that resources or tools to explore CRMs is still limited in ascidians: ChIP-seq and ATAC-seq are just beginning to be used, others techniques like Chromosome Conformation Capture are not yet established. Very little was known for gene regulation and cis-regulation of the genes we studied; and we have only used phylogenetic footprints and in vivo transcriptional assays. We have nevertheless reached interesting conclusions that together with cross-species tests bring new hypotheses.

The importance of individual elements is never tested using CRISPR/Cas9 mutations followed by in situ hybridization, while this would strongly complement some of the conclusions based exclusively on reporter assays.

This is obviously the next step that we would like to reach. However, CRISPR/Cas9 has proven very frustrating in our hands, possibly because a higher polymorphism rate in C. intestinalis compared to C. robusta. In addition, we have so far reached at best 30% mutagenesis rate using electroporation. We also feel that such experiments would be part of a future study.

For no gene is it entirely clear how the identified elements contribute to endogenous regulation, also since there are several other peaks in the regions that were not included in the analysis. E.g. Dlx.c had >14 peaks in the ATAC-seq data, out of which one (1) was included in the reporter constructs.

As discussed below, currently available ATAC-seq data (whole embryos up to neurula stages) may not actually fit our needs: some important regions may not be apparent, and at the same time regions with visible enrichment may not correspond to VDML expression. Tissue-specific ATAC-seq or 3C data should prove better staring points for an extensive analysis of candidate CRMs.

The existence of separate vs. shared CRMs is intriguing and interesting, except this is not as clear for Ascl.b, since both CiAshdo4 and CiAshdo5 are still active in the VML, and an intronic accessible region has not been tested. Moreover, Nkx.C is presented as an upstream regulator and absent from the Msxb GRN in Roure and Darras, 2016, but here it is incorporated in the core midline PNS network. Since its intron has not been tested, one cannot rule out that there is regulatory activity that would distinguish between ventral and dorsal midlines, which is also true for Dlx.c. This basically casts doubt on the model shown in Figure 4A.

For Ascl.b, the Supplementary file 2 makes it look like the intron should be added to the upstream region for thorough testing. In general, one intronic region also appears accessible for Nkx-tun3.

We agree that the situation for Ascl.b is not as sharp as Msx since ventral activity is strongly reduced but not abolished for do4 and do5 CRMs. Yet we have not observed such a situation for the other four genes that we have analyzed and that are expressed later. Of course, we cannot rule out that unidentified CRMs with pan-midline regulation exist for Msx and Ascl.b and that unidentified dorsal CRMs and ventral CRMs also exist for the other midline TFs. We thus believe that deeper understanding of midline TFs transcriptional regulation will require a substantial amount of work with much finer resolution (in time and space with quantitative approaches) involving much heavier experimental approaches. For example, we propose that the DML expression at gastrula/neurula stage of Ciinte.Msx is initiated by Msx itself through the distal CRM (see below). Once Msx expression has been launched in both dorsal and ventral midlines, it is possible that the same element is used for maintenance through positive autoregulation. Thus the same genomic element would be at the same time a DML enhancer and a “pan-midline” enhancer.

Regarding the Ciinte.Ascl.b gene structure, we have fully sequenced the cDNA clone cien82323; and this indicated that the intron was wrongly predicted in the transcript models (shown in the modified Supplementary file 2). That is why we have selected the CRMs as shown. Yet this clone might be an artifact since it is not well supported by RNA-seq data. Nevertheless, a genomic region covering this region had been previously tested in transcriptional assay and was not active in the tail epidermis (https://www.aniseed.cnrs.fr/aniseed/cisreg/show_cisreg?feature_id=11170652).

The network in Roure and Darras, 2016 is based on overexpression and lacks information on direct interactions. Several structures were actually proposed with Nkxtun3 (previously Nkx-C) included in most.

Multiple CRMs in Msx, Klf, Nkx and Dlx.c also begs the question as to whether these play redundant roles (like shadow enhancers), or have seemingly overlapping activities in reporter assays, but contribute to different aspects of gene expression.

X-gal staining does not report on the dynamics of enhancer activity: could the two CRMs act at different times? Different levels? Like so-called "shadow" enhancers? Different domains. This needs to be clarified to be useful.

Discussion “ this opens an interesting perspective in identifying and determining the properties of the DNA elements that confer robustness.”: this echoes the growing recognition of integrated cis-regulatory systems with multiple elements contributing to gene regulation.

The contribution of several CRMs with seemingly similar activity is indeed a current trend in cis-regulation studies. The role of “redundant” CRMs has been first linked to robustness, their specific roles being revealed following genetic or environmental challenges. Quantitative approaches are now revealing that similar CRMs are intrinsically different. In ascidians, we are aware of only three studies reporting “shadow”/distributed enhancers (Farley et al., 2016; Madgwick et al., 2019; Harder et al., https://doi.org/10.1101/2020.08.07.242016). It seems that we have identified additional ones. However, their analysis would be a separate study.

In general, the authors need to integrate the sequence of elements, including conservation, putative binding sites, and known regulators to paint a more complete and clearer picture of enhancer function in the first part, and of possible conservation -or divergence- of regulatory linkages in the comparative part, especially with regards to the ventral vs. dorsal inputs and the pan-medial/core network for PNS specification.

BMP and Notch signaling regulate ventral MDL and PNS, although crude assays do not allow for refined spatial and temporal resolution, so strictly speaking the time of space at which these regulators act could vary. This is potentially important because the conclusions that will be drawn regarding CRM evolution depend on whether the trans-acting inputs are exactly the same or not.

Sequence analysis, especially in terms of known regulatory linkage within the PNS GRN would strengthen the conclusion on conservation, while clarifying the impact of sequence divergence (i.e. redistribution of binding sites, etc…).

Since this aspect was also suggested by reviewer 2, we have undertaken a careful analysis of VDML CRMs regarding binding site composition for putative regulators of PNS specification (see above). We would like to stress that we have not come to a definitive conclusion in terms of conservation of regulatory logic. The main reasons are 1) uncertainties of the GRN given the small number of validated direct interactions between a TF and a target gene, and the likely existence of unknown regulators, and 2) the impossibility to define homology relationships between CRMs from Ciona and Phallusia since non-coding DNA regions do not align.

Finally, the analysis of Stolidobranchia CRMs is cursory at this point, and one notes that presumably CRMs for the same genes could be obtained from the sequenced genomes of other tailed Molgula species (e.g. Molgula oculata or Molgula occidentalis). Considering the density of the current manuscript, the cursory analysis of Stolidobranch vs. Phlebobranch comparisons, and the fact that this has been done before, I would probably recommend focusing on the within Phlebobranch comparisons and strengthening the paper by better integrating the endogenous and reporter expression data with previously established regulatory linkage, sequences and putative TF binding sites. The latter is mentioned as future work in the Discussion, but it is not clear why it cannot be incorporated here, in a focused but clarified version of the study.

Drift between Phlebobranch and Stolidobranch enhancers for conserved gene expression patterns has been described qualitatively before, so the key novelties are the within order comparisons and the evolutionary changes, especially the seemingly reduced neurogenic compartments in Stolidobranchs.

We acknowledge that comparison between Phlebobranchs and Stolidobranchs is not as deep as the Ciona/Phallusia part. However, while comparative approaches have been reported for single genes between C. intestinalis and H. roretzi (Oda-Ishii et al., 2005; Matsumoto et al., 2008; Takahashi et al., 2009), or for multiple genes between C. intestinalis and Molgula species (Stolfi et al., 2014), we focus here on multiple genes in multiple species. Importantly, our results suggest that, contrary to some of the above reports (comparing possibly non-equivalent CRMs or using unusually divergent species), regulatory mechanisms might be broadly conserved in ascidians despite extensive drift. We thus consider that this part of the study is of strong interest.

Localized expression of Tox and Ascl.b in Halocynthia, as opposed to pan-VMDL in Phlebobranch is also a potentially interesting difference (i.e. not the whole midline has neurogenic potential). => possible conservation of the midline program, but drift in its linkage to the neurogenic program in CESNs? This would be very interesting, but difficult to address in a revision.

The reviewer suggests an interesting possibility. Another scenario could be “parallel” GRNs in the caudal PNS (Pan-midline: Msx->Klf1/2/4->Nkxtun3->Dlx.c and posterior midline: Ascl.b-> Tox) that would be both required for acquiring a neurogenic potential. In H. roretzi, it has been shown that activating Notch inhibits PNS neuron formation (Akanuma et al., 2002). However, Notch inhibition has not been performed, and it is not known whether the entire midline is neurogenic. Such an experiment together with midline TFs loss-of-function would be required to further explore these hypotheses.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Revisions:

The authors took a lot of the main comments to heart in the previous revision, but the manuscript became loaded with details and unnecessarily lengthy. For the sake of clarity of keys in the original data and the conclusion, the manuscript should now be streamlined before making a decision. I advise the authors to achieve this by deleting speculative sentences in Results, for example, “the loss of these sites might thus be responsible of the absence of activity in the VML.” and “and this may explain why we did not observe CRM active only in the VML.” in the Msx section This type of information also in the other TF sections should be deleted if not directly linked to the main conclusion, or substantially shortened after moving to Discussion.

Following the suggestion of two reviewers we had included an extensive analysis of transcription factor binding sites in the revised manuscript. Unfortunately, adding more information in an already dense manuscript led to a study even more difficult to follow. Following your suggestion, we have kept the description of this aspect minimal in the main text. We nevertheless believe that this new aspect of the study has brought interesting insights, and we have not simply deleted the data but transferred the details to the supplemental figure legends.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. List of all genomic regions tested in the present study.
    elife-59157-supp1.xlsx (39KB, xlsx)
    Supplementary file 2. Genome browser view for each locus of the seven caudal PNS neurogenic TFs in Ciona robusta.

    Tested CRMs were added to the data extracted from the Aniseed website (https://www.aniseed.cnrs.fr/; Dardaillon et al., 2020).

    elife-59157-supp2.pdf (394.8KB, pdf)
    Supplementary file 3. Activity of various genomic regions for the gene Ciinte.Bhlhtun1.

    (Top panel) Snapshot of the Ciinte.Bhlhtun1 locus depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between C. robusta and C. savignyi (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). (Middle panel) Representative examples of X-gal-stained embryos at tailbud stages following electroporation of Ciinte.Bhlhtun1-upstream, Ciinte.Bhlhtun1-up1 and Ciinte.Bhlhtun1-down1. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 100 μm. (Bottom panel) Schematic representation of the various constructs and their activity at tailbud stages in DML (blue) and VML (purple) (n indicates the total number of embryos examined, N indicates the number of independent experiments). Note that while VDML activity is rare, activity can be detected at other sites of endogenous Ciinte.Bhlhtun1 expression: anterior epidermis around the palps for Ciinte.Bhlhtun1-up1, and notochord and stomodeum for Ciinte.Bhlhtun1-down1.

    elife-59157-supp3.pdf (211.3KB, pdf)
    Supplementary file 4. Activity of various genomic regions for the genes Phmamm.Ascl.b and Phmamm.Bhlhtun1.

    Snapshots of the Phmamm.Ascl.b and Phmamm.Bhlhtun1 loci depicting ATAC-seq profile at mid-neurula stages, tested genomic regions, transcript models and conservation between P. mammillata and P. fumigata (from https://www.aniseed.cnrs.fr/ and Dardaillon et al., 2020; Madgwick et al., 2019). Representative examples of X-gal stained embryos at tailbud stages following electroporation of Phmamm.Ascl.b-A (no activity), Phmamm.Ascl.b-B (activity in palps and anterior nervous system), and Phmamm.Bhlhtun1-A (activity in notochord, endodermal strand and tail tip) into P. mammillata embryos. Embryos are shown in lateral view with dorsal to the top and anterior to the left. Scale bar: 50 μm. Schematic representation of the various constructs and their activity at tailbud stages in VDML (blue) (n indicates the total number of embryos examined, N indicates the number of independent experiments). Note that while VDML activity is rather robust for Phmamm.Bhlhtun1-A, it was not considered further since this activity was restricted to the very posterior cells of the midlines at the tail tip.

    elife-59157-supp4.pdf (258.2KB, pdf)
    Supplementary file 5. Genome browser view for each locus of the seven caudal PNS neurogenic TFs in Phallusia mammillata.

    Tested CRMs and predicted cDNAs were added to the data extracted from the Aniseed website (https://www.aniseed.cnrs.fr/; Dardaillon et al., 2020).

    elife-59157-supp5.pdf (374.3KB, pdf)
    Supplementary file 6. Identification of VDML CRMs for Phfumi.Msx and Asment.Msx genes.

    (Top) Snapshot of the Phfumi.Msx locus. (Middle) Activity of Phfumi.Msx and Asment.Msx CRMs at tailbud stages in VDML (blue) of C. intestinalis and P. mammillata embryos (n indicates the total number of embryos examined, N indicates the number of independent experiments). (Bottom) Representative examples of X-gal staining at tailbud stages (embryos in lateral view with dorsal to the top and anterior to the left, scale bar: 50 μm).

    elife-59157-supp6.pdf (210.1KB, pdf)
    Supplementary file 7. Genome browser view for each locus for three caudal PNS neurogenic TFs in Halocynthia roretzi.

    Tested CRMs were added to the data extracted from the Aniseed website (https://www.aniseed.cnrs.fr/; Dardaillon et al., 2020).

    elife-59157-supp7.pdf (172.2KB, pdf)
    Supplementary file 8. List of DNA clones used for in situ hybridization.
    elife-59157-supp8.xlsx (55.5KB, xlsx)
    Supplementary file 9. Various sequences.

    Predicted cDNA sequences for genes in C. intestinalis, P. mammillata and M. appendiculata: transcripts models from RNA-seq data and ESTs sequences were used to build the cDNA sequences. Open reading frame is highlighted in bold. Sequence of the genomic region Asment.Msx-up isolated from A. mentula.

    elife-59157-supp9.docx (181.7KB, docx)
    Transparent reporting form

    Data Availability Statement

    All data generated or analyzed during this study are included in the manuscript and supporting files.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES