SUMMARY
The transition from fins to limbs has been a rich source of discussion for more than a century. One open and important issue is understanding how the mechanisms that pattern digits arose during vertebrate evolution. In this context, the analysis of Hox gene expression and functions to infer evolutionary scenarios has been a productive approach to explain the changes in organ formation, particularly in limbs. In tetrapods, the transcription of Hoxd genes in developing digits depends on a well-characterized set of enhancers forming a large regulatory landscape1,2. This control system has a syntenic counterpart in zebrafish, even though they lack bona fide digits, suggestive of deep homology3 between distal fin and limb developmental mechanisms. We tested the global function of this landscape to assess ancestry and source of limb and fin variation. In contrast to results in mice, we show here that the deletion of the homologous control region in zebrafish has a limited effect on the transcription of hoxd genes during fin development. However, it fully abrogates hoxd expression within the developing cloaca, an ancestral structure related to the mammalian urogenital sinus. We show that similar to the limb, Hoxd gene function in the urogenital sinus of the mouse also depends on enhancers located in this same genomic domain. Thus, we conclude that the current regulation underlying Hoxd gene expression in distal limbs was co-opted in tetrapods from a preexisting cloacal program. The orthologous chromatin domain in fishes may illustrate a rudimentary or partial step in this evolutionary co-option.
Keywords: Hox genes, Mouse, Zebrafish, fin to limb transition, multifunctional enhancers, chromatin, cloacal development
INTRODUCTION
The organization of tetrapod limbs has been conserved since their origin, with a universal pattern of ‘segments’ along the proximal to distal axis. The stylopod is a single bone (upper arm, leg) attached at one end to the torso and at the other end to two zeugopod bones (lower arm, leg), then, most distal are the mesopod (wrist, ankle) and the autopod (hand, foot). The formation of this generic pattern began before the water-to-land transition as sarcopterygian fishes display structures clearly related to proximal tetrapod limb structures4. However, when homologies are considered between fin structures and the most distal parts of tetrapod appendages, the mesopod and the autopod, there remains debate if fishes possess homologous skeletal rudiments. While mesopodial elements and extensive distal segments are present in sarcopterygian fins, the presence of true digital homologues has remained controversial5,6.
Because the HoxA and HoxD gene clusters were shown to be instrumental in making tetrapod limbs7–10, their expression domains during fin development were used to infer the presence of an autopod-related structure in fishes. In particular, Hoxa13 and Hoxd13 were studied due to their specific autopodial expression in tetrapod limbs10 and because their combined inactivation in mice produce autopodial agenesis7. An analysis of hox13 genes in the distal teleost fin suggested that a ‘distal program’ also exists in fishes, implying that this genetic regulatory network, or part thereof, would have preceded digit formation in tetrapods11,12. This core and potentially ancestral distal pattern is however realized in formation of the dermal rays of fins, while the concurrent rudimentary endoskeleton, an array of singular radials connected to the girdle, was hypothesized to be primarily proximal5. In such a scenario, the autopods of tetrapods are proposed to form from the postaxial vestiges of an ancestral sarcopterygian fin13,14. The partial retention of expression patterns that presage the emergence of digits in ray-finned and chondrichthyan fishes is nevertheless suggestive of a common regulatory program shared amongst vertebrates, the deployment of which in different species accompanied changes in form13.
During tetrapod limb bud development, a series of enhancers within in a large regulatory landscape positioned 3’ of the HoxD gene cluster (3DOM) control the transcription of Hoxd genes up to Hoxd11 in a proximal expression domain. These expression domains encompass tissue of the future stylopod (arm) and zeugopod (forearm) (Fig. S1a, green and schemes on the left)15. Posterior-distal limb bud cells then switch off these enhancers and activate another large regulatory landscape (5DOM), located 5’ to the gene cluster. This region is enriched with conserved enhancer elements that have been found to control the formation of digits by activating Hoxd13 and its closest neighbors (Fig. S1a, blue). While the deletion of 3DOM abrogated the expression of all Hoxd genes in the proximal limb domain15, deletion of 5DOM removed all Hoxd mRNAs from the forming autopod2.
In the orthologous zebrafish hoxda cluster, genes are also expressed during early fin bud development, with progressively nested expression domains comparable to the murine situation13,16. At a later stage, transcription of both hoxd9a and hoxd10a persists in the ‘preaxial’ (anterior) part of the fin bud only (Fig. S1b, magenta), while hoxd11a, hoxd12a and hoxd13a transcripts are restricted to ‘postaxial’ (posterior) cells (Fig. S1b, orange), as is the case in the emerging fin bud16. For the latter genes, combined inactivation have revealed their function during distal fin skeletal development11,17. However, despite the analysis of distal enhancers orthologous to those of the mouse12, the functionality of the complete 3DOM and 5DOM regulatory landscapes in zebrafish has not been addressed. Hence, the existence of a comparable bimodal regulation of Hoxd genes has remained elusive, precluding any conclusion on its evolutionary origin.
We asked if the zebrafish hoxda genes were regulated by a comparable enhancer hub located at a distance from the gene cluster in a manner similar to mouse limbs12,18 (Fig. S1b, question marks). By deleting the orthologous zebrafish hoxda regulatory landscapes, we find that while the proximal appendage regulation by 3DOM is fully conserved between fish and mice, the core long-distance regulation by 5DOM underlying the distal expression is deficient in fins. The deletion of the zebrafish 5DOM revealed however that it shared with mouse an ancestral function in patterning the cloacal area. Our findings suggest this core ancestral regulatory landscape arose first in the ancient cloaca and was subsequently redeployed during the evolution and shaping of tetrapod digits and external genitals.
RESULTS
The zebrafish hoxda locus
The zebrafish hoxda locus shares a high degree of synteny with that of the HoxD locus in mammals, reflecting broad conservation given the key patterning role of this complex in development of many axial structures. The gene cluster is flanked by two gene deserts referred to as 3DOM (3’-located domain) and 5DOM (5’-located domain). As in mammals, the extents of both 3DOM and 5DOM correspond to topologically associating domains (TADs) and 3DOM is split into two sub-TADs (Fig. S2). This remarkable similarity in 3D conformations, though with a 2.6-fold difference in size between the mouse versus zebrafish locus, is further supported by the conserved position and orientation of critical CTCF binding sites within the gene clusters and their enrichment at TAD and sub-TAD borders (Fig. S2).
Interspecies genomic alignments reveal several conserved sequences within 5DOM across vertebrates, whereas little conservation was scored in 3DOM (Fig. S3a, b). Within the 5DOM comparison, we identified several of the previously annotated mouse enhancers in their zebrafish counterpart12,19. Consistent with the apparent conservation of chromatin structure, we found the same global organization of both coding and non-coding elements as in the mouse landscape. When compared to the size of the Hox cluster, the relative sizes of both gene deserts are bigger in mouse than in fish, and the zebrafish 5DOM was found to be larger than 3DOM, opposite to the mouse situation (Fig. S2c). Since the overall genomic organization of both HoxD loci is well conserved between mammals and fishes, we have concluded that these two flanking gene deserts and their topologically associating domains are ancestral features predating the divergence between ray finned fishes and tetrapods, likely conserved due to important regulatory functions. Whether these domains have, or retain Hox gene regulation as initially defined at this locus in the mouse1,20 remained nevertheless unclear.
Regulatory potential of zebrafish hoxda flanking gene deserts
To address the potential function(s) of fish hoxd gene deserts, we explored chromatin accessibility and histone modification profiles using ATAC-seq21 and CUT&RUN22 assays, respectively, with posterior trunk as a source of cells, i.e., a domain where most hox genes are active. As a control sample, we used corresponding dissected heads where hoxda genes are not expressed (Fig. S4a, b). This analysis revealed enriched ATAC-seq signals not only within the hoxda cluster but also in the two gene deserts, with stronger signals in the 3DOM region (Fig. S4c). This suggested that in zebrafish, both gene deserts may indeed serve as regulatory landscapes, with long-range acting enhancers as is the case in tetrapods. Histone profiling supported this hypothesis with the same regions enriched in H3K27ac marks, while showing a poor (if any) enrichment in the negative H3K27me3 marks, suggesting several chromatin segments engaged in active transcriptional regulation.
To assess the functional potential of both hoxda gene deserts, we generated zebrafish mutant lines carrying full deletions either of 5DOM (hoxdadel(5DOM), referred to as Del(5DOM) below), or of 3DOM (hoxdadel(3DOM) or Del(3DOM) below), using CRISPR-Cas9 chromosome editing. We first examined the impact of these large deletions on hoxd13a, hoxd10a and hoxd4a expression using whole-mount in situ hybridization (WISH), spanning from 30 hours post fertilization (hpf), i.e., from the onset of hoxd13a expression16 to 60 hpf and 72 hpf, stages when all hoxda gene expression decreases significantly in fin buds. In Del(3DOM) mutant embryos, expression of both hoxd4a and hoxd10a completely disappeared from the pectoral fin buds (Fig. 1a, right and middle panels, arrowheads). The same effect was observed at all stages analyzed (Fig. 1a). These data are consistent with similar analysis in mice where the limb proximal expression domain was no longer visible upon deletion of the 3DOM landscape15. This demonstrates that, as in tetrapods, enhancers controlling the transcription of hoxd3a to hoxd10a during fin bud development are located in the adjacent 3’ landscape. The 3DOM thus has an ancestral regulatory function in the development of proximal paired appendages. Expression of hoxd13a in post-axial cells, however, remained unchanged, with a global transcript distribution indistinguishable from wild-type fin buds (Fig. 1a, left panels, arrowheads). These data indicate that the control of hoxd13a expression is distinct from that impacting hoxd3a to hoxd10a, as is also the case in tetrapods2 (Fig. S1a).
To determine whether hoxd13a transcription was controlled by enhancers present within 5DOM, we similarly analyzed Del(5DOM) zebrafish embryos by WISH. Consistent with regional control of Hox gene transcription, neither hoxd4a nor hoxd10a expression were affected in mutant Del(5DOM) fin buds (Fig. 1b, arrowheads). Unexpectedly, though, hoxd13a transcripts were unaffected, with a pattern closely matching that of control fin buds (Fig. 1b, arrowheads; Fig. S5). This was surprising as the entire regulation required for Hoxd13 expression in tetrapods is located within this region, and previous transgenic results using components of the fish 5DOM sequences in transgenic mice showed sufficiency to drive expression12,19. At later stages of development, however, an effect of the 5DOM deletion on hoxd13a expression is suggestive, though variable. Yet hoxd13a expression remains globally similar to wildtype (Fig. 1c, d; Fig. S5).
These two genomic regions also control expression in other axial systems in mice23,24. Thus, we extended our analysis to assess shared components of regulation between these regulatory landscapes. Mutant Del(3DOM) embryos did not reveal visible differences in expression from controls in the trunk (hoxd13a, hoxda10a, hoxd4a), the pseudo-cloacal region (hoxd13a) or the branchial arches and rhombomeres (hoxd4a) (Fig. S6a). Del(5DOM) embryos also showed comparable expression to wildtype controls, except for the complete disappearance of hoxd13a transcripts from the pseudo-cloacal region (Fig. 2). We noticed a temporary reduction of hoxd13a expression in the tailbud (Fig. 2a), yet this deficit was no longer detectable at 36hpf. These results reveal that in zebrafish, 5DOM-located enhancers regulate hoxd13a genes in the cloacal area, from its onset of expression until at least 72hpf, whereas neither hoxd10a, nor hoxd4a are expressed there (Fig. S6). As previously reported for both hoxd13a and hoxa13b25,26, these transcripts were found in this region within of the nascent pronephric duct and hindgut. These structures eventually converge towards a single pseudo-cloacal complex that exits the body at adjacent openings without ever completely fusing. In 72hpf larvae, hoxd13a mRNAs also appeared in the posterior gut in both control and mutant samples, however still absent in the mutant cloacal region (Fig. 2c, black and red arrows, respectively), indicating that these two expression specificities are regulated separately.
The cloaca evolved at the base of the craniate lineage as a single orifice for the digestive, urinary and reproductive tracts, as found in birds or squamates. In mammals, a cloaca initially forms early on during embryonic development, but as the embryo grows, it divides into different openings for the urogenital and digestive systems. To evaluate whether the observed 5DOM regulation of hoxd13a in the zebrafish pseudo-cloacal region is a derived or an ancestral condition, we looked at the developing mouse urogenital sinus (UGS), a structure that derives from the mammalian embryonic cloacal area.
Hoxd gene expression and regulation in the urogenital sinus
The UGS, positioned below the urinary bladder, is derived from a cloacal rudiment originating from hindgut and ectodermal tissue27,28. During mid-gestation, as the nephric and Müllerian ducts grow towards the posterior end of the embryo, they meet and fuse with the invaginating cloaca. We performed WISH on dissected urogenital systems from control murine male and female embryos at E18.5 (Fig. 3a, b). All genes tested but Hoxd13 were detected in the anterior portions of the urogenital system including the kidneys, uterus, and deferens ducts29,30 (Fig. S7a, b). In contrast, Hoxd13 expression was restricted to the UGS in both male and female embryos, along with Hoxd12, Hoxd11 and, to a weaker extent, Hoxd10 (Figs. 3 and S7), i.e., the same four genes responding to both the digit and external genitals long-range regulations exerted by 5DOM31, thus suggesting a transcriptional control coming from this same 5’-located domain.
We verified this by using an engineered inversion that keeps the HoxD cluster linked with 5DOM, but takes them far away from 3DOM (Fig. 3c, HoxDInv(Itga6-AttP))32. In this allele, Hoxd13 transcription in UGS was unaffected (Fig. 3d). We then tested a comparable inversion, yet with a breakpoint immediately 5’ the HoxD cluster thus disconnecting 5DOM from all Hoxd genes (Fig. 3c, HoxDInv(Itga6-nsi)d11lac))33. We scored a virtually complete loss of Hoxd13 transcription (Fig. 3d), suggesting that most, if not all, UGS-specific enhancers are located within 5DOM. We confirmed this by using a large BAC transgene covering the entire HoxD cluster (Fig. S7c, d)32 introduced into mice lacking both copies of the HoxD locus34 (Fig. S7c). In this mutant allele, Hoxd13 transcription was not detected (Fig. S7c, arrows). Finally, we looked at beta-gal staining of a LacZ reporter integrated into the BAC transgene. While the reporter was strongly active in fetal kidneys as expected24, the UGS was not stained (Fig. S7d). In contrast, a LacZ reporter transgene integrated within the inversion separating 5DOM from the HoxD cluster (Fig. S7d, HoxDInv(Itga6-nsi)d11lac)) robustly stained E18.5 UGS, supporting again the presence of UGS enhancers within 5DOM (Fig. S7d).
We quantified the reduction in Hoxd gene expression in the HoxDInv(Itga6-nsi)d11lac) allele using RNA-Seq on E18.5 UGS of males and females. In both cases, Hoxd13, Hoxd12 and Hoxd10 transcription levels dropped abruptly when compared to wildtype samples, while the transcription level of other Hoxd genes was not affected (Fig. S8a). Altogether, this allelic series demonstrated that the mammalian 5DOM contains the UGS enhancers, similar to the zebrafish 5DOM containing cloacal enhancers. It also showed that Hoxd genes responsive to this regulation (Hoxd13-Hoxd10) are the same sub-group that responds to both digit and external genitals enhancers.
Identification of UGS enhancers
To identify UGS enhancers within the mouse 5DOM, we used three scanning deletion alleles covering 5DOM2 (Fig. 4a, red) and measured the change in expression by RTqPCR (Fig. S8b). In the HoxDDel(Atf2-SB1) allele, the most distal portion of 5DOM was removed with no impact on Hoxd gene expression levels. However, when either the central (HoxDDel(SB1-Rel5)) or the most proximal (HoxDDel(Rel5-Rel1)) portions of 5DOM were removed, transcription of Hoxd13, Hoxd12 and Hoxd10 were significantly reduced indicating that these two 5DOM intervals contain UGS enhancers (Fig. S8b).
We then measured chromatin accessibility by ATAC-seq and profiled H3K27ac and H3K27me3 histone marks associated with either active or inactive chromatin, respectively, by ChIP-seq on micro-dissected male UGSs (Fig. 4a). We identified a cluster of several conspicuous ATAC-seq and H3K27ac signals located approximately 200 kb upstream Hoxd13, in a region encompassing the Rel5 breakpoint, i.e., between the Del(SB1-Rel5) and the Del(Rel5-Rel1) deletions (Fig. 4a, dashed box). Within this 67 kb-large region, the ATAC and H3K27ac signals matched three elements previously characterized as enhancer sequences, yet with distinct tissue specificities; The GT2 and Island E sequences had been identified as a pan- and a proximal-dorsal genital tubercle specific enhancers, respectively35,36, whereas the CsB element was reported as a distal limb and fin enhancer element1,12. The ATAC peaks were positioned at regions that are relatively depleted for the H3K27ac mark (Fig. S9), which is a hallmark of active enhancer elements21.
For the GT2 and CsB elements, portions of the region were highly conserved across bony fish hoxda loci. In contrast, Island E contained a small portion of sequence only conserved amongst mammals (Fig. S9). We tested these three putative enhancers in an enhancer-reporter assay and all three sequences were able to drive robust lacZ expression in the UGS, closely matching the expression of posterior Hoxd genes in this area in both male and female specimens (Fig. 4b), indicating that in mammals, 5DOM contains a set of enhancer elements that control the transcriptional activation of Hoxd genes in the UGS. These three enhancers had been previously identified as specific for either distal limbs or external genitals, i.e., two structures that depend upon the 5DOM regulatory landscape as the only source of enhancers for their development. In zebrafish, while the orthologous 5DOM landscape is indeed necessary to activate posterior hoxda genes in the cloacal region, expression of these genes in the developing fin is not dramatically altered.
An ancestral regulatory landscape for an ancient function
In mammals, the combined mutation of both Hoxa13 and Hoxd13 has a drastic effect on the development of the posterior part of the digestive and urogenital systems30,37, causing an absence of any detectable UGS30. Previous studies revealed the expression of most hox13 genes in the developing zebrafish intestinal and cloacal regions (Fig. S10) that is suggestive of functional conservation38. Supporting this, 5’ hoxa genes were differentially regulated in the normal patterning of the goby cloacal region39. Therefore, we wondered whether cloacal patterning would be a core ancestral function of Hox13 terminal genes regulation. We thus asked what if hox13 gene function had specific roles in cloaca formation in zebrafish.
Wild-type zebrafish exhibit a pseudo-cloacal configuration in which the hindgut and pronephric duct exit the trunk through separate but adjacent openings. The outlet of the hindgut is anterior to that of the pronephric duct, and a septum resides in between (Fig. 5a, e). Homozygous single mutants of hoxa13a, hoxa13b, and hoxd13a are indistinguishable from the wild-type arrangement (Fig. 5b, f), as are animals triply heterozygous for these genes (Fig. S11). However, combined hoxa13a;hoxa13b double homozygous mutants exhibit connection of the hindgut with the pronephric duct before exiting the body through a single opening (Fig. 5c, g). The loss of Hoxa13 paralogs was also found to affect pronephric duct and hindgut length at the level of the median fin fold (Fig. S11). A more severe phenotype is observed in hoxa13a;hoxa13b;hoxd13a triple mutants, in which the septum is dysmorphic and the hindgut and pronephric duct are fused, resulting in a large shared lumen and outlet (Fig. 5d, h). These results reveal a conserved requirement of Hox13 function for the normal patterning of the termini of digestive and urogenital systems across vertebrates.
DISCUSSION
Hox regulatory landscapes and the fin to limb transition
The expression and function of Hoxd and Hoxa genes have been central to hypotheses attempting to explain the evolutionary change from fins to limbs (e.g.5,11,13,14). By making comparisons of their complex transcription patterns across actinopterygian, chondrichthyan, and sarcopterygian fishes, various efforts have sought relate the two types of paired appendages. These analyses have led to the conclusion that, despite being composed of different types of skeletons, the development of actinopterygian fin rays and digits have a common regulatory architecture11,40. Here, by deleting the two TADs flanking the fish hoxda cluster, we show that the essential digit regulatory landscape characterized in tetrapods indeed has a structural counterpart in teleosts (Fig. S1)18,41. However, we report that, unlike in limbs, only a small part of the regulation controlling hoxda gene expression in distal fin buds is located within this landscape. Indeed, while some enhancer(s) controlling hoxd13a transcription in developing zebrafish fins are located within 5DOM12, most of the regulatory control likely resides within the gene cluster itself, probably at the vicinity of the hoxd13a, hoxd12a and hoxd11a genes, i.e., the three genes sharing the same expression in post-axial cells16.
This observation is consistent with the absence, in the zebrafish 5DOM, of sequences related to several strong mouse digit enhancers12, but also confirms results obtained when assaying fish 5DOM conserved sequences as transgenes, either in zebrafish or in mice12,19. It also explains why the zebrafish lnpa gene, which is embedded into 5DOM, is not expressed in the emerging pectoral fin buds16, whereas the mouse counterpart has a strong distal expression due to enhancer hi-jacking1. The presence of such a partial distal landscape in teleosts may illustrate an intermediate step in the full co-option of this regulation, as achieved in tetrapods (see below). Alternatively, it may reflect a secondary loss of several distal enhancers associated with teleost whole genome duplication. These questions may be solved with a comparable deletion and epigenetic characterization of 5DOM in more basal fish species such as gar, sturgeon, or even sharks. In contrast, the deletion of the opposite 3DOM landscape, which is responsible for all proximal Hoxd expression in tetrapods limb buds42, abolished hoxda gene expression in fin buds, demonstrating the genuine ancestral character of this regulation, which must have been implemented as soon as paired appendages evolved.
An ancestral cloacal regulation
Zebrafish hoxa13a and hoxd13a, as well as other hox13 paralogs43, are strongly expressed in and around the developing cloacal region25,26. This is an area where the extremities of both the gastro-intestinal tract and the reproductive and urinary systems come together, even though their openings remain separated, unlike in some chondrichthyan fishes or other vertebrates where the tubes coalesce into a single opening (e.g., in sharks or birds). Here we report that this pseudo-cloacal structure is disrupted in zebrafish carrying hox13 mutant alleles, with an abnormal fusion between the intestinal and the pronephric opening thus giving rise to a single yet abnormal, cloacal opening. Likewise, the developing murine UGS expresses Hoxd1329 and double mouse Hoxd13-Hoxa13 mutant animals had severely malformed posterior regions30,37, with no distinguishable UGS30, illustrating that the evolutionary conservation of this regulatory landscape is accompanied by shared functional effects.
We also document that, as for zebrafish, the control of murine posterior Hoxd genes in the cloaca is achieved by enhancers located within the 5’ located regulatory landscape, i.e., in the same genomic region that regulates expression in both digits and external genitals. In mouse, several 5DOM enhancers are somewhat versatile such as the GT2 sequence, which is both UGS and genitalia-specific44, whereas CsB is UGS and digit-specific1. Other enhancer sequences, however, seem to have kept a unique specificity such as ‘island 2’, the strongest Hox digit enhancer identified thus far45, which is located in a different area of 5DOM2 and absent in zebrafish12. These observations illustrate a ‘functional adaptation’ of enhancers, which could be facilitated by a spatial proximity within the same large chromatin domain thus triggering the sharing of upstream factors (see46). Finally, all the regulatory specificities encoded in this 5DOM landscape control the same subset of posterior Hoxd genes in tetrapods (from Hoxd13 to Hoxd10), suggesting that while groups of enhancers can be reutilized for new tissue types, there is a constraint on which genes they can target.
Successive co-options of a regulatory landscape
In vertebrates, Hox13 genes are located within a topologically associated domain (TAD) distinct from that including more anterior Hox genes and their regulations15,47. This condition prevents Hox13 to be activated too early and hence too anteriorly in the body axis, a situation detrimental for the embryo due to the potent posteriorizing function of these proteins48. As a result, Hoxd13 was likely the main target gene that triggered and stabilized the various evolutionary co-options of 5DOM regulations due to its location within the 5DOM TAD and through its function to organize posterior or distal body parts together with its Hoxa13 paralog7,30,37. Our results indicate that the initial functional specificity of this regulatory landscape was to organize a cloacal region, which is the posterior part of the intestinal and urogenital systems. This conclusion is supported by the documented expression of Hox13 paralogs in the cloacal regions of paddlefish38,49, catshark49,50 and lampreys51, suggesting this patten was a characteristic of the common ancestor of craniate vertebrates.
While this ancestral function has been maintained throughout vertebrates, it is more difficult to infer the temporal sequence of co-options of this regulatory landscape along with the evolution of digits and external genitals (Fig. 6). However, as genitalia are late amniote specializations, it is conceivable that elaboration of digital character arose initially, suggesting a first regulatory co-option of this landscape (or part thereof) from a cloacal to a digital specificity. In support of this proposal, digits appeared in aquatic sarcopterygian fishes. These fishes do not have apparent reproductive structures to facilitate internal fertilization, and many tetrapod species also do not have external genitals. A second co-option of this now multi-functional regulatory landscape then might have occurred along with the emergence of external genitals. This latter step would have been facilitated both by the developmental proximity between external genitals and the embryonic cloacal region where posterior Hox genes are initially expressed52, and by the tight developmental relationships between amniotes limbs and genitals29,52. Altogether, our results indicate that the repeated redeployment of an ancient regulatory landscape, first arising along with the formation of the cloaca, serve as a foundation for the evolution and elaboration of innovations in vertebrates.
MATERIALS AND METHODS
Animal husbandry and ethics
All experiments using mice were approved and performed in compliance with the Swiss Law on Animal Protection (LPA) under license numbers GE45/20 and GE81/14. All animals were kept as a continuous backcross with C57BL6 × CBA F1 hybrids. Mice were housed at the University of Geneva Sciences III animal colony with light cycles between 07:00 and 19:00 in the summer and 06:00 and 18:00 in winter, with ambient temperatures maintained between 22 and 23 °C and 45 and 55% humidity, the air was renewed 17 times per hour. Zebrafish (Danio rerio) were maintained according to standard conditions57 under a 14/10 hours on/off light cycle at 26°C, with a set point of 7.5 and 600uS for pH and conductivity respectively. All zebrafish husbandry procedures have been approved and accredited by the federal food safety and veterinary office of the canton of Vaud (VD-H23). AB, Tu and TL were used as wild-type strains and were obtained from the European zebrafish resource center (EZRC). hoxdaDel(3DOM) and hoxdaDel(5DOM) mutants were generated for this study. Zebrafish embryos were derived from freely mating adults. Wild-type siblings, hoxdaDel(3DOM) and hoxdaDel(5DOM) homozygous embryos were obtained from crossing the corresponding heterozygous mutant. Embryos were collected within 30 minutes after spawning and incubated at 28.5°C in fish water, shifted at 20°C after reaching 80% epiboly and grown at 28.5°C to the proper developmental stage according to58. Pigmentation was prevented by treating the embryos with 0.002% N-phenylthiourea from 1 dpf onwards.
Generation of the hoxdaDel(3DOM) and hoxdaDel(5DOM) deletions in zebrafish
The hoxdaDel(3DOM) and hoxdaDel(5DOM) mutant alleles were generated using the CRISPR/Cas9 system described in59. The sequences of the crRNAs used are listed in Table S3. Loci were identified with the GRCz11 zebrafish genome assembly available on Ensembl. The corresponding genomic regions were amplified and sequenced from fin clips. Adults carrying verified target sequences were isolated and then selected for breeding to generate eggs for genome editing experiments. The gRNAs target sites were determined using the open-source software CHOPCHOP (http://chopchop.cbu.uib.no/index.php) and chemically synthesized Alt-R® crRNAs and Alt-R® tracrRNAs as well as the Alt-R® Cas9 protein were obtained from Integrated DNA Technologies (IDT). To test the efficiency of these gRNAs in generating the expected mutant alleles, we injected boluses ranging from 100 μm to 150 μm and containing 5μM of the duplex crRNAs, tracrRNA and Cas9 RNP complex into the cytoplasm of one-cell stage embryos. Injecting the RNP complex solution in a 100 μm bolus gave less than 5% mortality. With this condition, 30% of the embryos carried the 5DOM deletion and 15% carried the 3DOM deletion. For each condition we extracted the genomic DNA of 20 individual larvae at 24hpf for genotyping60. Identification of hoxdaDel(3DOM) and hoxdaDel(5DOM) mutants were performed by PCR. Amplification of evx2 was used as a control to confirm the presence or absence of the 5DOM. The PCR mix was prepared for using the Phusion® High-Fidelity DNA Polymerase (NEB) and primer sequences are listed in Table S3. In parallel, a hundred and twenty larvae per allele were raised to adulthood. To identify founders, F0 adults were outcrossed with wild-type and 25 embryos were genotyped. Three and four independent founders were obtained for the hoxdaDel(5DOM) allele and hoxdaDel(3DOM), respectively. Two founders of each deletion were verified by Sanger sequencing (File S3) and used for further experiments.
Zebrafish hox13 mutant lines, phalloidin labeling and genotyping
Frameshift loss-of-function alleles hoxa13ach307, hoxa13bch308, and hoxd13a5bp ins were previously generated9. Zebrafish lines were propagated and maintained following61. To generate compound hox13 mutants, animals triple heterozygous for hoxa13a, hoxa13b and hoxd13a were inter-crossed. Resulting larvae were fixed at 6 dpf in 4% paraformaldehyde in phosphate buffered saline (PBS) for 2 hours at room temperature with rocking agitation. After fixation, larvae were rinsed twice for 5 minutes each in PBS with added 1% Triton (PBSX). To visualize cloacal anatomy by labeling filamentous actin, larvae were then incubated in PBSX with fluorophore-conjugated phalloidin (Sigma-Aldrich P1951, phalloidin-Tetramethylrhodamine B isothiocyanate) added to a final concentration of 5U/mL overnight at 4°C with rocking agitation. Larvae were then rinsed twice for one hour each with PBSX.
For genotyping, phalloidin-labeled larvae were cut in half, separating the head, yolk, and pectoral fins from the cloaca and tail. The head half was used for genotyping and the tail half was saved at 4°C for later analysis. DNA was extracted from the head half by digesting tissue in proteinase K diluted to 1 mg/mL in 20 μl 1x PCR buffer (10 mM Tris-HCl, 50 mM KCl, 1.5 mM MgCl2) for 1 hour at 55°C followed by heat inactivation at 80°C for 20 minutes. The digested tissue was then subjected to brief vortexing and then 1 μl was used directly as template for genotyping PCR, with primers listed in Table S3. For thermocycling, after an initial step at 94°C for 2 minutes, reactions were cycled 40x though (15 s at 94°C, 15 s at 58 °C, 20 s at 72 °C) and finished with 5 minutes at 72 °C. PCR products were then heteroduplexed on a thermocycler by heating to 95°C for 10 minutes and then gradually cooled by 1°C every 10 seconds until a final temperature of 4°C was reached. Heteroduplexed PCR amplicons were then run on a high percentage agarose gel to determine genotype by product size.
To analyze cloacal morphology, fixed phalloidin-labeled tails were imaged on a Zeiss LSM 800 confocal microscope. After acquiring a full confocal stack through the cloacal region, a midline frame that demonstrated hindgut and pronephric duct morphology was selected.
Mutant mouse stocks
All mouse lines used in this study were previously reported: Inv(Itga6-nsi)d11lac33, Inv(Itga6-attP) and tgBAC(HoxD)32, Del(HoxD)34 and Del(Atf2-SB1), Del(SB1-Rel5) and Del(Rel5-Rel1)62.
Whole-mount in situ hybridization (WISH)
The zebrafish and mouse antisense probes used in this study are listed in File S1 and File S2, respectively. For zebrafish, WISH were performed as described in60, at 58°C for all riboprobes (hybridization temperature and SSC washes). Wholemount embryos were photographed using a compound microscope (SZX10, Olympus) equipped with a Normarski optics and a digital camera (DP22, Olympus). Genotyping of individual embryos was performed after photographic documentation as in60 with primers listed in Table S3. Wild-type and mutant embryos originated from the same clutch of eggs produced by heterozygote crosses and underwent WISH procedure in the same well. Murine urogenital systems were isolated from E18.5 embryos and processed following a previously reported WISH procedure63, with some specific adjustments. For the Proteinase K treatment, urogenital systems were incubated 20 minutes in proteinase K diluted to 20 μg/ml in PBST. For the refixation step, a solution of 4% PFA containing 0.2% glutaraldehyde was used. Hybridization temperature was 69°C and temperature of post-hybridization washes was 65°C. Staining was developed in BM-Purple (Roche #11442074001) for approximately 4 hours at room temperature.
Mouse genotyping.
For extemporaneous genotyping, yolk sacs were collected and placed into 1.5 ml tubes containing Rapid Digestion Buffer (10 mM EDTA pH8.0 and 0.1 mM NaOH), then placed in a thermomixer at 95 °C for 10 min with shaking at 900 rpm. While the yolk sacs were incubating, the PCR master mix was prepared using Z-Taq (Takara R006B) and primers (Table S1) and aliquoted into PCR tubes. The tubes containing lysed yolk sacs were then placed on ice to cool briefly and quickly centrifuged at high speed. The lysate (1μl) was placed into the reaction tubes and cycled 32× (2 s at 98 °C, 2 s at 55 °C, 15 s at 72 °C). Twenty microliters of the PCR reaction were loaded onto a 1.5% agarose gel and electrophoresis was run at 120 V for 10 min. When samples could be kept for some time, a conventional genotyping protocol was applied with a Tail Digestion Buffer (10 mM Tris pH8.0, 25 mM EDTA pH8.0, 100 mM NaCl, 0.5% SDS) added to each yolk sac or tail clipping at 250μl along with 4μl Proteinase K at 20 mg/ml (EuroBio GEXPRK01–15) and incubated overnight at 55 °C. The samples were then incubated at 95 °C for 15 min to inactivate the Proteinase K and stored at −20 °C until ready for genotyping. Genotyping primers (Supplementary Data 1) were combined with Taq polymerase (Prospec ENZ-308) in 25μl reactions and cycled 2× with Ta = 64 °C and then cycled 32× with Ta = 62 °C.
Mouse RT-qPCR
Urogenital sinuses (UGS) were collected from E18.5 male embryos separately and placed into 1× DEPC-PBS on ice. A little piece of the remaining embryo was collected for genotyping. The UGS were transferred into fresh 1× DEPC-PBS and then placed into RNALater (ThermoFisher AM7020) for storage at −80 °C until processing. Batches of samples were processed in parallel to collect RNA with Qiagen RNeasy extraction kits (Qiagen 74034). After isolating total RNA, first strand cDNA was produced with SuperScript III VILO (ThermoFischer 11754–050) using approximately 500 ng of total RNA input. cDNA was amplified with Promega GoTaq 2× SYBR Mix and quantified on a BioRad CFX96 Real Time System. Expression levels were determined by dCt (GOI–Tbp) and normalized to one for each condition by subtracting each dCT by the mean dCT for each wild-type set. Finally, expression was evaluated by two power this normalized dCT. Table S1 contains the primer sequences used for quantification. RT-qPCR measurements were taken from distinct embryos. Box plots for expression changes and two-tailed unequal variance t tests were produced in DataGraph 4.6.1. The boxes represent the interquartile range (IQR), with the lower and upper hinges denoting the first and third quartiles (25th and 75th percentiles). Whiskers extend from the hinges to the furthest data points within 1.5 times the IQR. The upper whisker reaches the largest value within this range, while the lower whisker extends to the smallest value within 1.5 times the IQR from the hinge.
Mouse RNA-Seq
E18.5 male and female UGS were collected with a dissection separating the bladder from the urogenital sinus but including the proximal urethra (and vagina in females). Tissues were stored in RNALater (ThermFisher AM7020) and processed in parallel with Qiagen RNeasy extraction kits (Qiagen 74034). RNA quality was assessed on an Agilent Bioanalyzer 2100, with RIN scores > 9.5. RNA sequencing libraries were prepared at the University of Geneva genomics platform using Illumina TruSeq stranded total RNA with Ribo-Zero TM Gold Ribo-deleted RNA kits to produce strand-specific 100bp single-end reads on an Illumina HiSeq 2000. Raw RNA-seq reads were processed with CutAdapt version 4.1 (-a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -q 30 -m 15)64 to remove TruSeq adapters and bad quality bases. Filtered reads were mapped on the mouse genome mm39 with STAR version 2.7.10a65 with ENCODE parameters with a custom gtf file66 based on Ensembl version 108. This custom gtf file was obtained by removing readthrough transcripts and all noncoding transcripts from a protein-coding gene. FPKM values were evaluated by Cufflinks version 2.2.167,68 with options --max-bundle-length 10000000 --multiread-correct --library-type “fr-firststrand” -b mm10.fa --no-effective-length-correction -M MTmouse.gtf -G. Boxplots depicting expression levels in distinct embryos were generated using the same methodology as for RT-qPCR.
ATAC-Seq
Mouse and fish tissues were isolated and placed into 1x PBS containing 10% FCS on ice. Collagenase (Sigma-Aldrich C9697) was added to 50ug/ml and incubated at 37° for 20 minutes with shaking at 900rpm. Cells were washed 3x in 1x PBS. The number of cells was counted and viability confirmed to be greater than 90%. An input of 50000 cells was then processed according to previous description21. Sequencing was performed at EPFL GECF on an Illumina NextSeq 500. Analysis was performed similarly to69. Raw ATAC-seq paired-end reads were processed with CutAdapt version 4.1 (-a CTGTCTCTTATACACATCTCCGAGCCCACGAGAC -A CTGTCTCTTATACACATCTGACGCTGCCGACGA -q 30 -m 15)64 to remove Nextera adapters and bad quality bases. Filtered reads were mapped on mm39 for mouse samples and danRer11 where alternative contigs were removed for fish samples with bowtie2 version 2.4.570 with the following parameters: --very-sensitive --no-unal --no-mixed --no-discordant --dovetail -X 1000. Only pairs mapping concordantly outside of mitochondria were kept (Samtools v1.16.1)71. PCR duplicates were removed by Picard version 3.0.0 (http://broadinstitute.github.io/picard/index.html). BAM files were converted to BED with bedtools version 2.30.072. Peaks were called and coverage was generated by MACS2 version 2.2.7.1 with --nomodel --keep-dup all --shift -100 --extsize 200 --call-summits -B. Coverages were normalized to million mapped reads.
Mouse ChIP-Seq
Male UGS were isolated and placed into 1x PBS containing 10% FCS on ice. ChIP-seq experiments were then performed as previously described in73. Briefly, they were fixed for 10 mn in 1% formaldehyde at room temperature and the crosslinking reaction was quenched with glycine. Subsequently, nuclei were extracted and chromatin was sheared using a water-bath sonicator (Covaris E220 evolution ultra-sonicator). Immunoprecipitation was done using the following anti- H3K27ac (Abcam, ab4729) or anti-H3K27me3 (Merck Millipore, 07–449). Libraries were prepared using the TruSeq protocol, and sequenced on the Illumina HiSeq 4000 (100 bp single-end reads) according to manufactures instructions. CTCF re-analysis from69,74. Accession numbers are listed in TableS2. Raw ChIP-seq single-reads or paired-end reads were processed with CutAdapt version 4.1 (-a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC for single-reads and -a CTGTCTCTTATACACATCTCCGAGCCCACGAGAC -A CTGTCTCTTATACACATCTGACGCTGCCGACGA -q 30 -m 15)64 to remove Truseq or Nextera adapters and bad quality bases. Filtered reads were mapped on mm39 for mouse samples and danRer11 where alternative contigs were removed for reanalysis of fish samples with bowtie2 version 2.4.570 with the default parameters. Only alignments with a mapping quality above 30 were kept (Samtools v1.16.1)75. Peaks were called and coverage was generated by MACS2 version 2.2.7.1 with with --call-summits -B (and --nomodel --extsize 200 for single-read). Coverages were normalized to million mapped reads/pairs.
Mouse enhancer-reporter assay
Transgenic embryos were generated as described in35. Primers were designed to amplify genomic DNA from the region around the observed ATAC and H3K27Ac peaks (Table S3). These primers included extra restriction sites for either XhoI or SalI at the 5’ ends. The PCR fragments were cleaned with Qiagen Gel Extraction Kit (#28704). The PCR fragment and the pSKlacZ reporter construct (GenBank X52326.1)73 were digested with XhoI or SalI and ligated together with Promega 2× Rapid Ligation kit (#C6711). Sanger sequencing confirmed the correct sequences were inserted upstream of the promoter. Maxipreps of the plasmid were prepared and eluted in 1x IDTE (#11-05-01-13). Pro-nuclear injections were performed and embryos were collected at approximately E18.5 and stained for lacZ. UGS were collected from E18.5 embryos in ice-cold 1× PBS in a 12-well plate. All steps were done with gentle shaking on a rocker plate at room temperature. Tissues were fixed for 5 min at room temperature in freshly prepared 4% PFA. After fixing, they were washed three times in 2 mM MgCl2, 0.01% Sodium Deoxycholate, 0.02% Nonidet P40, and 1× PBS, for 20 min at RT. The wash solution was replaced by X-gal staining solution (5 mM Potassium Ferricynide, 5 mM Potassium Ferrocynide, 2 mM MgCl2 hexahydrate, 0.01% Sodium Deoxycholate, 0.02% Nonidet P40, 1 mg/ml X-Gal, and 1× PBS) for overnight incubation with the plate wrapped in aluminum foil to protect from light. Tissues were then washed three times in 1× PBS and fixed in 4% PFA for long-term storage. Images of embryos were collected with an Olympus DP74 camera mounted on an Olympus MVX10 microscope using the Olympus cellSens Standard 2.1 software.
Mouse Capture HiC-seq
E18.5 male UGS were collected and collagenase-treated samples were cross-linked with 1% formaldehyde (Thermo Fisher 28908) for 10 min at RT and stored at −80° until further processing as previously described76. The SureSelectXT RNA probe design used for capturing DNA was done using the SureDesign online tool by Agilent. Probes cover the region chr2: 72240000–76840000 (mm9) producing 2X coverage, with moderately stringent masking and balanced boosting. DNA fragments were sequenced on the Illumina HiSeq 4000 and processed with HiCUP version 0.9.2 on mm39 with --re1 ĜATC77, bowtie2 version 2.4.570 and samtools version 1.16.175. The output BAM was converted to pre-juicer medium format with hic2juicer from HiCUP. The pairs with both mates on chr2:72233000–76832000 were selected, sorted and loaded into a 10kb bins matrix with cooler version 0.8.1178. The final matrix was balanced with the option --cis-only. The TADs were computed with HiCExplorer hicFindTADs version 3.7.279,80 with --correctForMultipleTesting fdr --minDepth 120000 --maxDepth 240000 --step 240000 --minBoundaryDistance 250000. Data was plotted on mm39 (chr2:73600000–75550000).
Zebrafish HiC-seq
The HiC profiles derived from a reanalysis of data from55,74. Accession numbers are listed in Table S2. Reads were mapped on danRer11 where alternative contigs were removed and no selection of reads were performed. Valid pairs were loaded into a 10kb bins matrix. TAD calling parameters were adapted to the smaller size of the genome: --chromosomes “chr9” --correctForMultipleTesting fdr --minDepth 35000 --maxDepth 70000 --step 70000 --minBoundaryDistance 50000. Data was plotted on danRer11 (chr9:1650000–2400000) and on an inverted x axis.
CUT&RUN
Zebrafish samples were processed according to (10.1038/nprot.2018.015), using a final concentration of 0.02% digitonin (Apollo APOBID3301). Approximately 0.5e6 cells were incubated with 0.1μg/100μl of anti-H3K27ac antibody (Abcam Ab4729), or 0.5 μg/100 μl of anti-H3K27me3 (Merck Millipore 07–449) in Digitonin Wash Buffer at 4 °C. The pA-MNase was kindly provided by the Henikoff lab (Batch #6) and added at 0.5 μl/100 μl in Digitonin Wash Buffer. Cells were digested in Low Calcium Buffer and released for 30 min at 37 °C. Sequencing libraries were prepared with KAPA HyperPrep reagents (07962347001) with 2.5ul of adapters at 0.3uM and ligated for 1 h at 20 °C. The DNA was amplified for 14 cycles. Post-amplified DNA was cleaned and size selected using 1:1 ratio of DNA:Ampure SPRI beads (A63881) followed by an additional 1:1 wash and size selection with HXB. HXB is equal parts 40% PEG8000 (Fisher FIBBP233) and 5 M NaCl. Sequencing was performed at EPFL GECF on an Illumina HiSeq 4000. Raw CUT&RUN paired-end reads were processed with CutAdapt version 4.1 (-a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -A GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT -q 30 -m 15) to remove TruSeq adapters and bad quality bases64. Filtered reads were mapped on danRer11 where alternative contigs were removed with bowtie2 version 2.4.570 with the following parameters: --very-sensitive --no-unal --no-mixed --no-discordant --dovetail -X 1000. Only alignments with mapping quality above 30 were kept (Samtools v1.16.1)75. PCR duplicates were removed by Picard version 3.0.0 (http://broadinstitute.github.io/picard/index.html). BAM files were converted to BED with bedtools version 2.30.072. Peaks were called and coverage was generated by MACS2 version 2.2.7.1 with --nomodel --keep-dup all --shift -100 --extsize 200 --call-summits -B. Coverages were normalized to million mapped reads.
Analyses of conserved sequences
Annotation of orthologous domains was performed using transcription start sites of orthologous genes, as reported in Table S1. To identify conserved sequences between mouse and zebrafish, a pairwise alignment was done between the mouse genomic region chr2:73600000–75550000 (mm39) and the zebrafish orthologous region chr9:1650000–2400000 (danRer11) using discontinuous mega blast. To reduce false positives, only reciprocal hits were considered. To display multispecies conservation levels, MAF files were generated between the chromosome 2 of the mouse genome (mm39) and contig chrUn_DS181389v1 of the Platypus genome (ornAna2), chromosome 7 of the chicken genome (galGal6), contig chrUn_GL343356 of Lizard genome (anoCar2), chromosome 9 of the frog genome (xenTro10), contig JH127184 of the Coelacanth genome (latCha1), chromosome 9 of the zebrafish genome (danRer11), chromosome 1 of the Fugu genome (fr3) and the whole lamprey genome (petMar3). Details for the maf generation are available on the github repository https://github.com/AurelieHintermann/HintermannBoltEtAl2024. To help the visualization, a horizontal line was plotted for each species on each region.
scRNA-seq
The matrix of the scRNA-seq atlas was downloaded from GEO (GSE223922)56 as well as the table with metadata. The matrix was loaded into a Seurat object with Seurat version 4.3.081 with R version 4.3.0. Cells attributed to the ‘tissue.name’ ‘endoderm’ were selected. Normalization and PCA was done as in ref.56 and UMAP was performed on the top 70 PCA and 50 nearest neighbors. UMAP coordinates and hox13 normalized expression of endoderm cells were exported to file and plotted with ggplot2 version 3.4.4.
Software
The phylogenic tree was generated with http://timetree.org using the following species: Mus musculus, Protopterus, Danio rerio, Carcharhinus leucas, Petromyzon marinus, Branchiostoma lanceolatum and subsequently edited with seaview 4.7. Genomic tracks from Next-Generation Sequencing were plotted with pyGenomeTracks 3.8, using custom gene annotations available at https://zenodo.org/records/7510797 (mm39) and https://zenodo.org/records/10283274 (danRer11). RT-qPCR, RNA-seq and domain size quantifications were plotted with R using the ggplot package.
Supplementary Material
ACKNOWLEDGEMENTS
We thank all members of the Duboule laboratories for comments and discussions, the Van der Goot laboratory for providing a fish cDNA library, Mikiko Tanaka for the hoxd10a probe, Jeffrey Farrell for his advices and Andy Oates for his support. The calculations were performed using the facilities of the Scientific IT and Application Support Center of EPFL. This work was supported in part using the resources and services of the Gene Expression Research Core Facility (GECF) at the School of Life Sciences of EPFL and transgene injections were performed at the transgenesis core facility of the University of Geneva, medical school. We acknowledge the support of the Brinson Family Foundation (to NS).
FUNDING
This work was supported by funds from the Ecole Polytechnique Fédérale (EPFL, Lausanne), the University of Geneva, the Swiss National Research Fund (No. 310030B_138662 and 310030B_138662 and the European Research Council grant RegulHox (No 588029) to DD, the NIH 1R01HD112906 to MPH and the NSF 2210072 to TN. Funding bodies had no role in the design of the study and collection, analysis and interpretation of data and in writing the manuscript.
Funding Statement
This work was supported by funds from the Ecole Polytechnique Fédérale (EPFL, Lausanne), the University of Geneva, the Swiss National Research Fund (No. 310030B_138662 and 310030B_138662 and the European Research Council grant RegulHox (No 588029) to DD, the NIH 1R01HD112906 to MPH and the NSF 2210072 to TN. Funding bodies had no role in the design of the study and collection, analysis and interpretation of data and in writing the manuscript.
Footnotes
COMPETING INTERESTS
The authors declare that they have no competing interests.
ETHICAL STATEMENT
All experiments involving animals were performed in agreement with the Swiss Law on Animal Protection (LPA). For mice, work was carried out under license No GE 81/14 (to DD). For zebrafish, work was carried out under a general license of the EPFL granted by the Service de la Consommation et des Affaires Vétérinaires of the canton of Vaud, Switzerland (No VD-H23).
CODE AVAILABILITY
All scripts necessary to reproduce figures from raw data are available at https://github.com/AurelieHintermann/HintermannBoltEtAl2024.
DATA AVAILABILITY
All raw and processed datasets are available in the Gene Expression Omnibus (GEO) repository under accession number GSE250267.
REFERENCES
- 1.Spitz F., Gonzalez F. & Duboule D. A global control region defines a chromosomal regulatory landscape containing the HoxD cluster. Cell 113, 405–17 (2003). [DOI] [PubMed] [Google Scholar]
- 2.Montavon T. et al. A regulatory archipelago controls Hox genes transcription in digits. Cell 147, 1132–45 (2011). [DOI] [PubMed] [Google Scholar]
- 3.Shubin N., Tabin C. & Carroll S. Deep homology and the origins of evolutionary novelty. Nature 457, 818–23 (2009). [DOI] [PubMed] [Google Scholar]
- 4.Thorogood P. The development of the teleost fin and implications for our understanding of tetrapod evolution. in Developmental Patterning of the Vertebrate Limb (ed. Hinchliffe J. ; H. J. M. ;. Summerbell D.) 347–354 (Plenum Press, New York, 1991). [Google Scholar]
- 5.Woltering J. M. & Duboule D. The origin of digits: expression patterns versus regulatory mechanisms. Developmental cell 18, 526–32 (2010). [DOI] [PubMed] [Google Scholar]
- 6.Cloutier R. et al. Elpistostege and the origin of the vertebrate hand. Nature 579, 549–554 (2020). [DOI] [PubMed] [Google Scholar]
- 7.Fromental-Ramain C. et al. Hoxa-13 and Hoxd-13 play a crucial role in the patterning of the limb autopod. Development 122, 2997–3011 (1996). [DOI] [PubMed] [Google Scholar]
- 8.Dolle P., Izpisua-Belmonte J. C., Falkenstein H., Renucci A. & Duboule D. Coordinate expression of the murine Hox-5 complex homoeobox-containing genes during limb pattern formation. Nature 342, 767–72 (1989). [DOI] [PubMed] [Google Scholar]
- 9.Davis M. C. The Deep Homology of the Autopod: Insights from Hox Gene Regulation. Integrative and comparative biology (2013) doi: 10.1093/icb/ict029. [DOI] [PubMed] [Google Scholar]
- 10.Zakany J. & Duboule D. The role of Hox genes during vertebrate limb development. Current Opinion in Genetics & Development 17, 359–66 (2007). [DOI] [PubMed] [Google Scholar]
- 11.Nakamura T., Gehrke A. R., Lemberg J., Szymaszek J. & Shubin N. H. Digits and fin rays share common developmental histories. Nature 537, 225–228 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gehrke A. R. et al. Deep conservation of wrist and digit enhancers in fish. Proceedings of the National Academy of Sciences of the United States of America 112, 803–8 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sordino P., van der Hoeven F. & Duboule D. Hox gene expression in teleost fins and the origin of vertebrate digits. Nature 375, 678–81 (1995). [DOI] [PubMed] [Google Scholar]
- 14.Woltering J. M. et al. Sarcopterygian fin ontogeny elucidates the origin of hands with digits. Science Advances 6, eabc3510 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Andrey G. et al. A switch between topological domains underlies HoxD genes collinearity in mouse limbs. Science 340, 1234167 (2013). [DOI] [PubMed] [Google Scholar]
- 16.Ahn D. & Ho R. K. Tri-phasic expression of posterior Hox genes during development of pectoral fins in zebrafish: implications for the evolution of vertebrate paired appendages. Developmental biology 322, 220–33 (2008). [DOI] [PubMed] [Google Scholar]
- 17.Hawkins M. B., Henke K. & Harris M. P. Latent developmental potential to form limb-like skeletal structures in zebrafish. Cell 184, 899–911.e13 (2021). [DOI] [PubMed] [Google Scholar]
- 18.Woltering J. M., Noordermeer D., Leleu M. & Duboule D. Conservation and divergence of regulatory strategies at Hox Loci and the origin of tetrapod digits. PLoS Biol 12, e1001773 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Schneider I. et al. Appendage expression driven by the Hoxd Global Control Region is an ancient gnathostome feature. Proceedings of the National Academy of Sciences of the United States of America 108, 12782–6 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bolt C. C. & Duboule D. The regulatory landscapes of developmental genes. Development 147, dev171736 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Buenrostro J. D., Giresi P. G., Zaba L. C., Chang H. Y. & Greenleaf W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10, 1213–1218 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Skene P. J. & Henikoff S. An efficient targeted nuclease strategy for high-resolution mapping of DNA binding sites. Elife 6, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Amândio A. R., Lopez-Delisle L., Bolt C. C., Mascrez B. & Duboule D. A complex regulatory landscape involved in the development of mammalian external genitals. eLife 9, e52962 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Di-Poi N., Zakany J. & Duboule D. Distinct roles and regulations for HoxD genes in metanephric kidney development. PLoS Genet 3, e232 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sordino P., Duboule D. & Kondo T. Zebrafish Hoxa and Evx-2 genes: cloning, developmental expression and implications for the functional evolution of posterior Hox genes. Mech Dev 59, 165–75 (1996). [DOI] [PubMed] [Google Scholar]
- 26.van der Hoeven F., Sordino P., Fraudeau N., Izpisua-Belmonte J. C. & Duboule D. Teleost HoxD and HoxA genes: comparison with tetrapods and functional evolution of the HOXD complex. Mech Dev 54, 9–21 (1996). [DOI] [PubMed] [Google Scholar]
- 27.Georgas K. M. et al. An illustrated anatomical ontology of the developing mouse lower urogenital tract. Development 142, 1893–1908 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Liaw A. et al. Development of the human bladder and ureterovesical junction. Differentiation 103, 66–73 (2018). [DOI] [PubMed] [Google Scholar]
- 29.Dolle P., Izpisua-Belmonte J. C., Brown J. M., Tickle C. & Duboule D. HOX-4 genes and the morphogenesis of mammalian genitalia. Genes Dev 5, 1767–7 (1991). [DOI] [PubMed] [Google Scholar]
- 30.Warot X., Fromental-Ramain C., Fraulob V., Chambon P. & Dolle P. Gene dosage-dependent effects of the Hoxa-13 and Hoxd-13 mutations on morphogenesis of the terminal parts of the digestive and urogenital tracts. Development 124, 4781–91 (1997). [DOI] [PubMed] [Google Scholar]
- 31.Montavon T., Le Garrec J.-F., Kerszberg M. & Duboule D. Modeling Hox gene regulation in digits: reverse collinearity and the molecular origin of thumbness. Genes Dev 22, 346–59 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Schep R. et al. Control of Hoxd gene transcription in the mammary bud by hijacking a preexisting regulatory landscape. PNAS 113, E7720–E7729 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Tschopp P. & Duboule D. A regulatory ‘landscape effect’ over the HoxD cluster. Developmental Biology 351, 288–296 (2011). [DOI] [PubMed] [Google Scholar]
- 34.Spitz F. et al. Large scale transgenic and cluster deletion analysis of the HoxD complex separate an ancestral regulatory module from evolutionary innovations. Genes Dev. 15, 2209–2214 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lonfat N., Montavon T., Darbellay F., Gitto S. & Duboule D. Convergent evolution of complex regulatory landscapes and pleiotropy at Hox loci. Science 346, 1004–1006 (2014). [DOI] [PubMed] [Google Scholar]
- 36.Lonfat N. An ancestral regulatory mechanism underlies Hoxd gene expression in both developing genitals and digits. Infoscience https://infoscience.epfl.ch/record/191170 (2013) doi: 10.5075/epfl-thesis-5997. [DOI] [Google Scholar]
- 37.Kondo T., Zákány J., Innis J. W. & Duboule D. Of fingers, toes and penises. Nature 390, 29–29 (1997). [DOI] [PubMed] [Google Scholar]
- 38.Archambeault S., Taylor J. A. & Crow K. D. HoxA and HoxD expression in a variety of vertebrate body plan features reveals an ancient origin for the distal Hox program. EvoDevo 5, 44 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Crow K. D., Sadakian A. & Kaslly N. A. The role of the 5′ HoxA genes in the development of the hindgut, vent, and a novel sphincter in a derived teleost (bluebanded goby, Lythrypnus dalli). J Exp Zool Pt B 340, 518–530 (2023). [DOI] [PubMed] [Google Scholar]
- 40.Tulenko F. J. et al. HoxD expression in the fin-fold compartment of basal gnathostomes and implications for paired appendage evolution. Sci Rep 6, 22720 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Franke M. et al. CTCF knockout in zebrafish induces alterations in regulatory landscapes and developmental gene expression. Nat Commun 12, 5415 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rodriguez-Carballo E. et al. The HoxD cluster is a dynamic and resilient TAD boundary controlling the segregation of antagonistic regulatory landscapes. Genes Dev. 31, 2264–2281 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Sur A., Wang Y., Capar P., Margolin G. & Farrell J. A. Single-Cell Analysis of Shared Signatures and Transcriptional Diversity during Zebrafish Development. http://biorxiv.org/lookup/doi/10.1101/2023.03.20.533545 (2023) doi: 10.1101/2023.03.20.533545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Lonfat N., Montavon T., Darbellay F., Gitto S. & Duboule D. Convergent evolution of complex regulatory landscapes and pleiotropy at Hox loci. Science 346, 1004–6 (2014). [DOI] [PubMed] [Google Scholar]
- 45.Bolt C. C. et al. Context-dependent enhancer function revealed by targeted inter TAD relocation. Nat Commun 13, 3488 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Darbellay F. & Duboule D. Topological Domains, Metagenes, and the Emergence of Pleiotropic Regulations at Hox Loci. Curr Top Dev Biol 116, 299–314 (2016). [DOI] [PubMed] [Google Scholar]
- 47.Berlivet S. et al. Clustering of tissue-specific sub-TADs accompanies the regulation of HoxA genes in developing limbs. PLoS genetics 9, e1004018 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Young T. et al. Cdx and Hox genes differentially regulate posterior axial growth in mammalian embryos. Developmental cell 17, 516–26 (2009). [DOI] [PubMed] [Google Scholar]
- 49.Tulenko F. J. et al. Fin-fold development in paddlefish and catshark and implications for the evolution of the autopod. Proc. R. Soc. B. 284, 20162780 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Freitas R., Zhang G. & Cohn M. J. Biphasic Hoxd gene expression in shark paired fins reveals an ancient origin of the distal limb domain. PloS one 2, e754 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kuraku S. et al. Noncanonical role of Hox14 revealed by its expression patterns in lamprey and shark. Proc Natl Acad Sci U S A 105, 6679–83 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Tschopp P. et al. A relative shift in cloacal location repositions external genitalia in amniote evolution. Nature 516, 391–4 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Sordino P., Duboule D. & Kondo T. Zebrafish Hoxa and Evx-2 genes: cloning, developmental expression and implications for the functional evolution of posterior Hox genes. Mechanisms of Development 59, 165–175 (1996). [DOI] [PubMed] [Google Scholar]
- 54.Ahn D. & Ho R. K. Tri-phasic expression of posterior Hox genes during development of pectoral fins in zebrafish: Implications for the evolution of vertebrate paired appendages. Developmental Biology 322, 220–233 (2008). [DOI] [PubMed] [Google Scholar]
- 55.Wike C. L. et al. Chromatin architecture transitions from zebrafish sperm through early embryogenesis. Genome Res. 31, 981–994 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sur A. et al. Single-cell analysis of shared signatures and transcriptional diversity during zebrafish development. Developmental Cell S1534580723005774 (2023) doi: 10.1016/j.devcel.2023.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Westerfield M. The Zebrafish Book. A Guide for the Laboratory Use of Zebrafish (Danio Rerio). (University of Oregon Press, 2000). [Google Scholar]
- 58.Kimmel C. B., Ballard W. W., Kimmel S. R., Ullmann B. & Schilling T. F. Stages of embryonic development of the zebrafish. Developmental Dynamics 203, 253–310 (1995). [DOI] [PubMed] [Google Scholar]
- 59.Hoshijima K. et al. Highly Efficient CRISPR-Cas9-Based Methods for Generating Deletion Mutations and F0 Embryos that Lack Gene Function in Zebrafish. Dev Cell 51, 645–657.e4 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Narayanan R. & Oates A. C. Detection of mRNA by Whole Mount in situ Hybridization and DNA Extraction for Genotyping of Zebrafish Embryos. Bio Protoc 9, e3193 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Nüsslein-Volhard C. & Dahm R. R. Zebrafish, A Practical Approach. (2002).
- 62.Montavon T. et al. A Regulatory Archipelago Controls Hox Genes Transcription in Digits. Cell 147, 1132–1145 (2011). [DOI] [PubMed] [Google Scholar]
- 63.Woltering J. M. et al. Axial patterning in snakes and caecilians: Evidence for an alternative interpretation of the Hox code. Developmental Biology 332, 82–89 (2009). [DOI] [PubMed] [Google Scholar]
- 64.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10–12 (2011). [Google Scholar]
- 65.Dobin A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Lopez-Delisle. Customized gtf file from Ensembl version 108 mm39. Zenodo 10.5281/ZENODO.7510796 (2023). [DOI] [Google Scholar]
- 67.Trapnell C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature biotechnology 28, 511–5 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Roberts A., Trapnell C., Donaghey J., Rinn J. L. & Pachter L. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome biology 12, R22 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Hintermann A. et al. Developmental and evolutionary comparative analysis of a regulatory landscape in mouse and chicken. Development 149, dev200594 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Langmead B. & Salzberg S. L. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Danecek P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Quinlan A. R. & Hall I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Beccari L. et al. A role for HOX13 proteins in the regulatory switch between TADs at the HoxD locus. Genes Dev. 30, 1172–1186 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Franke M. et al. CTCF knockout in zebrafish induces alterations in regulatory landscapes and developmental gene expression. Nat Commun 12, 5415 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Danecek P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, giab008 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Yakushiji-Kaminatsui N. et al. Similarities and differences in the regulation of HoxD genes during chick and mouse limb development. PLoS Biol. 16, e3000004 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Wingett S. et al. HiCUP: pipeline for mapping and processing Hi-C data. F1000Res 4, 1310 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Abdennur N. & Mirny L. A. Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics 36, 311–316 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Ramirez F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic acids research 44, W160–5 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Wolff J. et al. Galaxy HiCExplorer 3: a web server for reproducible Hi-C, capture Hi-C and single-cell Hi-C data analysis, quality control and visualization. Nucleic Acids Research 48, W177–W184 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Hao Y. et al. Dictionary learning for integrative, multimodal and scalable single-cell analysis. Nat Biotechnol (2023) doi: 10.1038/s41587-023-01767-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All raw and processed datasets are available in the Gene Expression Omnibus (GEO) repository under accession number GSE250267.