Abstract
Skates are cartilaginous fish whose body plan features enlarged wing-like pectoral fins, enabling them to thrive in benthic environments1,2. However, the molecular underpinnings of this unique trait remain unclear. Here we investigate the origin of this phenotypic innovation by developing the little skate Leucoraja erinacea as a genomically enabled model. Analysis of a high-quality chromosome-scale genome sequence for the little skate shows that it preserves many ancestral jawed vertebrate features compared with other sequenced genomes, including numerous ancient microchromosomes. Combining genome comparisons with extensive regulatory datasets in developing fins—including gene expression, chromatin occupancy and three-dimensional conformation—we find skate-specific genomic rearrangements that alter the three-dimensional regulatory landscape of genes that are involved in the planar cell polarity pathway. Functional inhibition of planar cell polarity signalling resulted in a reduction in anterior fin size, confirming that this pathway is a major contributor to batoid fin morphology. We also identified a fin-specific enhancer that interacts with several hoxa genes, consistent with the redeployment of hox gene expression in anterior pectoral fins, and confirmed its potential to activate transcription in the anterior fin using zebrafish reporter assays. Our findings underscore the central role of genome reorganization and regulatory variation in the evolution of phenotypes, shedding light on the molecular origin of an enigmatic trait.
Subject terms: Evolutionary developmental biology, Embryogenesis, Evolutionary genetics, Genome evolution
Skate-specific changes in the epigenome and its three-dimensional organization contributed to the evolution of the batoid fin morphology.
Main
The origin and diversification of vertebrates was accompanied by the appearance of key developmental innovations2,3. Among them, paired appendages show an exquisite diversity of forms and adaptations not only in tetrapods, but also in chondrichthyans (cartilaginous fish) in which fin structures are considerably diverse2. The wing-like appendages of batoid fishes (skates and rays) (Fig. 1a) are fascinating examples, in which the pectoral fins extend anteriorly and fuse with the head. This unique structure creates power for forward propulsion and led to the emergence of swimming mechanisms that enabled skates to colonize the sea floor1. Transcriptomic analysis of skate developing fins revealed a major reorganization of signalling gradients relative to other vertebrates1. The redeployment of developmental transcription factors, such as 3′ hox genes, initiates an anterior signalling centre analogous to the posterior apical ectodermal ridge (AER). These changes arose ~286–221 million years ago (Fig. 1b) after the divergence between sharks and skates. Nevertheless, the genomic and regulatory changes underlying these novel expression domains have remained elusive.
Many vertebrate evolutionary innovations were influenced by the substantial genomic reorganizations caused by two rounds of whole-genome duplication (WGD). The ancestral chordate chromosomes were duplicated and rearranged to give rise to the diversity of existing karyotypes in vertebrates4. Concomitantly, the pervasive loss of paralogous genes after WGDs produced gene deserts enriched in regulatory elements5. Compellingly, those genomic alterations were paralleled by marked changes in gene regulation, contributing to an increase in pleiotropy in developmental genes5 and to the complexity of their regulatory landscapes6. In vertebrates, regulatory landscapes are spatially organized into topologically associating domains (TADs)7,8. TADs correspond to large genomic regions with increased self-contact that promote the interaction between cis-regulatory elements (CREs) and cognate promoters to constitute precise transcriptional patterns. While TADs constrain the evolution of gene order9, genomic rearrangements that alter these domains can be a source for developmental phenotypes10 and evolutionary innovation11,12. Yet the importance of TAD organization for the evolution of gene regulation and the emergence of lineage-specific traits after vertebrate WGDs remains largely unexplored.
To gain insights into the evolution of the jawed vertebrate (gnathostome) karyotypes and of wing-like appendages, we generated a chromosome-scale assembly of the little skate L. erinacea and performed extensive functional characterization of its developing fins. Our analyses revealed a karyotype configuration resembling the gnathostome ancestor, characterized by slower paralogue loss and smaller chromosomes than other jawed vertebrates, which suggests fewer fusion events after the second round (2R) of WGD in the skate lineage. We find evidence that three-dimensional (3D) genome organization in skate arises from an interplay between transcription-based A/B compartments and TADs formed by loop extrusion, as described in mammals13. The comparison of the 3D organization of α and β chromosomes after the gnathostome-specific WGD revealed a prominent loss of complete TADs, probably contributing to karyotype stabilization. By combining RNA sequencing (RNA-seq) and assay for transposase-accessible chromatin with sequencing (ATAC–seq) data, we identified the planar cell polarity (PCP) pathway and hox gene regulation as key contributors to skate fin morphology, which we further validated using functional assays in zebrafish and skate. Our study illustrates how comparative multi-omics approaches can be effectively used to elucidate the molecular underpinnings of evolutionary traits.
Genome sequencing and comparative genomics
We assembled the little skate genome at the chromosome scale by integrating long- and short-read genome sequencing with chromatin conformation capture (Hi-C) data. Our assembly includes 40 chromosome-scale (>2.5 Mb) scaffolds, with 19 macrochromosomes (>40 Mb), 14 mesochromosomes (between 20 and 40 Mb) and 7 microchromosomes (<20 Mb) that together represent 91.7% of the 2.2 Gb assembly. This chromosome number is within the range reported for other Rajidae species14. Despite technical challenges due to high polymorphism levels (1.6% heterozygosity) and a repeat content dominated by recently expanded LINE retrotransposons (Extended Data Fig. 1), our assembly showed a similar or higher degree of completeness with respect to gene content compared to other sequenced chondrichthyans (BUSCO; Supplementary Table 1).
We annotated 26,715 protein-coding genes using extensive transcriptome resources15, with 23,870 possessing homologues in other species. Using comparative analysis with 20 other sequenced vertebrates we reconstructed the complete set of skate gene evolutionary histories (the phylome) and used it to infer patterns of gene duplication and loss, as well as orthology and paralogy relationships (Supplementary Table 2; resources are available at PhylomeDB and MetaPhoRs16,17). We used phylogenomic methods to reconstruct jawed vertebrate phylogeny and infer divergence times, finding a more ancient divergence between sharks and skates (around 286 million years ago) than previously estimated18 (Fig. 1b). Compared with other reported chondrichthyan genomes, L. erinacea displays the lowest number of species-specific gene losses (616 losses; Supplementary Fig. 1). Similar to sharks (selachians)19,20, the little skate has larger introns than tetrapods (median size, 2,167 bp versus 1,586 bp in human), although these are not enriched in a particular repeat category (Extended Data Fig. 1).
Skate microchromosomes have an overall higher gene density compared with macro-chromosomes (Extended Data Fig. 1a–c,g), suggesting that, as in birds, these small chromosomes are prone to GC-biased gene conversion21. Skate microchromosomes also show a higher degree of interchromosomal contacts compared with other chromosomes (Fig. 1d,e), as also found in snakes and other tetrapods22.
Chromosome evolution
We surveyed the arrangement of syntenic chromosomal segments derived from ancestral chordate linkage groups (CLGs) in skate, gar and chicken, using amphioxus as an unduplicated outgroup23, and found that the chromosomal organization of the skate genome closely resembles that of the most recent jawed vertebrate common ancestor (Fig. 2a and Extended Data Fig. 2). By analysing the chromosomal locations of single-copy orthologues, we designated chromosomal segments according to their origin at 1R (1 or 2) or 2R (α or β) vertebrate WGDs23 (Fig. 2b). The relatively large number of elasmobranch chromosomes (≥40) reflects the ancestral condition among gnathostomes; with the exception of the losses of two ancestral segments in the skate lineages, and one secondary fusion on chromosome 1, the skate possessed 37 out of the 39 ancestral vertebrate linkages (Supplementary Table 3). The evolution of reduced chromosome number in osteichthyan (bony fishes) lineages is therefore due to subsequent chromosomal fusions.
The smaller vertebrate chromosomes often show a reciprocal correspondence across species and correspond to a single ancestral gnathostome unit23–25 (10 chromosomes have a 1:1:1 orthology between skate, gar and chicken; Fig. 2b). The trios LER25≡LOC20≡GGA15 and LER28≡LOC22≡GGA19 represent two surviving copies of CLG-G from the 1R event. Other trios such as LER21≡LOC18≡GGA20 and LER29≡LOC19≡GGA28 derive from CLG fusions, and the occurrence of some in all gnathostome genomes implies that they happened between the pan-vertebrate 1R and the gnathostome-specific 2R23,25 (Fig. 2b).
In many gnathostomes, larger chromosomes also derive from fusions of CLGs. The skate often represents an ancestral state among jawed vertebrates, with subsequent fusions in bony fishes, including in chicken (for example, GGA5), in gar (for example, LOC5) or in their common ancestor (for example, LER 2 and 4; see below). For example, ancestral gnathostome chromosomes resembling skate LER9, LER12 and LER18 fused in different ways to form chromosomes in gar and chicken. Similarly, LER10≡GGA8 and LER23≡GGA18 (≡BFL8) probably represent ancestral units that fused in gar chromosome LOC10 through a centric Robertsonian fusion (Fig. 2b). Notably, these two chromosomes are also preserved in their ancestral condition in the bowfin, the sister group of gar, implying that fusion occurred specifically in this lineage26.
Alternatively, ancestral chromosomes resembling LER2 and LER4 probably fused in the bony fish ancestor to give rise to chicken GGA2, whereas gar LOC9 and LOC11 are secondarily split from this fused ancestral chromosome. This may have involved a Robertsonian fission that split a metacentric chromosome at the centromere into two acro- or telocentric products. We also observe cases in which microchromosomes have been added to macro-chromosomes recently by terminal translocation, such as the addition of a chromosome similar to LER35≡GGA22 to the start of LOC1, or a LER12-like chromosome to the end of GGA4 (a recent translocation not found in other birds)27.
The extensive conservation of chromosomal identity and gene order between the little skate and the bamboo shark28, despite over 300 million years of divergence, indicates that most chondrichthyans may share this ancestral chromosomal organization (Fig. 1b,c and Extended Data Fig. 2). Notably, gene order collinearity across cartilaginous fish is more extensively conserved than within clades of comparable divergence, such as mammals and frogs29. By contrast, gene order is heavily disrupted between chondrichthyans (such as skate or shark) and osteichthyans (gar or chicken; Fig. 2a,b and Supplementary Fig. 2).
Evolution of the gene complement
The gene complement of the little skate, as in other chondrichthyans, evolved slower than that of Osteichthyes with respect to gene loss (Supplementary Fig. 1). Using species-tree-aware phylogenetic methods, we found that the retention of ohnologues (paralogues derived from vertebrate-specific WGDs) was higher than that observed in bony fishes (Fig. 2c,d and Extended Data Fig. 1h). According to the auto-then-allotetraploidy scenario for jawed vertebrate evolution23, the chromosomes derived from 2R behave distinctly, with beta segments showing increased loss and higher rates of molecular evolution (Fig. 2c,e and Extended Data Fig. 1i).
On the basis of patterns of duplication and loss, we found 68 cases in which one ohnologue was differentially retained in varying jawed vertebrate lineages, 19 genes retained in chondrichthyans but lost in bony fishes, 17 retained in chondrichthyans and coelacanth, and 24 retained in chondrichthyans and actinopterygians (ray-finned fish) but lost in lobe-finned fish (Supplementary Table 3). Some of these retained ancestral ohnologues, including previously characterized genes such as wnt6b20 or novel genes such as chondroitin sulfate proteoglycan 5 (cspg5), show distinct expression patterns among stages and organs (Fig. 2e).
Conservation of 3D regulatory principles
We investigated 3D chromatin organization in skates using Hi-C analysis of developing pectoral fins. We found a type II architecture30 with chromosomes preferentially occupying individual territories within the nucleus (Supplementary Fig. 3), consistent with a complete set of condensin II subunits (smc2, smc4, caph2, capg2 and capd3) in the genome. At higher resolution, skate chromosomes are organized into two distinct compartments, as described in other animals31. The A compartment displays higher gene density, chromatin accessibility and gene expression levels compared with the B compartment (Extended Data Fig. 3).
At the sub-megabase scale, the skate genome is organized into TADs with a median size of 800 kb (Extended Data Fig. 4a,b), an intermediate regime between mammals and teleosts (Supplementary Fig. 4). Aggregate analyses revealed that skate TADs are associated with chromatin loops at the upper corner of domains (Fig. 3a). Chromatin accessibility (ATAC–seq) and motif enrichment analysis revealed binding sites for the architectural factor CTCF at skate TAD boundaries (Extended Data Fig. 4c,d), in comparable proportions to mammals and teleosts (Supplementary Fig. 5). These CTCF sites display an orientation bias with motifs oriented towards the interior of TADs, suggesting that these domains are formed by loop extrusion (Fig. 3b and Extended Data Fig. 4c). Notably, the critical genes involved in loop extrusion are present in the skate genome, including ctcf and those encoding cohesin complex subunits (smc1a, smc3, scc1 and two copies of scc3). An example of skate TAD organization can be observed at the hoxa and hoxd clusters (Fig. 3c and Extended Data Fig. 4d), which display the characteristic bipartite TAD configuration of jawed vertebrates32. Manual microsynteny analysis confirmed that the 3′ and 5′ TADs found at both skate hox loci are orthologous to those described in mammals and teleosts. Such deeply conserved 3D organizations reflect the existence of regulatory constraints that influenced TAD evolution across the whole jawed vertebrate clade.
To investigate enhancer–promoter interactions, we used Hi-C combined with immunoprecipitation (HiChIP) to associate H3K4me3-rich active promoters with potential regulatory loci in the anterior and posterior skate pectoral fin. Notably, these fin regions display transcriptional signatures that differ from other vertebrates. In particular, several 3′ hoxa and hoxd genes are preferentially expressed in the anterior pectoral fin, whereas 5′ hoxa and hoxd genes are located in the posterior pectoral domain. This pattern of expression has been consistently found in other batoid species1,33. HiChiP analyses revealed 50,601 interactions associated with 7,887 different promoters (6.4 interactions per active promoter). Interactions connecting promoters with distal ATAC–seq peaks (χ2, P < 10−138; Extended Data Fig. 5a) and intra-TAD interactions were enriched (empirical P < 10−4; Extended Data Fig. 5b). Differential analysis revealed similar looping patterns between tissues (Pearson correlation > 0.96; Extended Data Fig. 5c), with only 9 and 5 interactions statistically enriched in anterior and posterior fins, respectively (Extended Data Fig. 5d). Promoters with differential looping included hoxa and hoxb genes and the transcription factor alx4 (Extended Data Fig. 5e–g), which are involved in limb development. To confirm those interactions, we performed Hi-C in anterior and posterior pectoral fins, finding only minor variations. Compartment differences were subtle and restricted to less than 10% of the genome (Extended Data Fig. 6a–d). TADs were also extremely similar (Fig. 3d,e and Extended Data Fig. 6e), with insulation score correlations of above 0.98 (Extended Data Fig. 6f). Similarly, high correlations were observed for chromatin loops (Extended Data Fig. 6g) and differential analysis revealed a single significantly stronger loop in the posterior pectoral fin (Extended Data Fig. 6h,i). Notably, the differential contacts predicted by HiChIP were not noticeable (Fig. 3d,e and Extended Data Fig. 6j). The differences in HiChIP data are therefore probably derived from variations in H3K4me3 occupancy, consistent with the selective activation of the hoxa cluster in anterior fins. Overall, both analyses indicate that 3D chromatin folding is largely maintained in the different pectoral fin territories.
To investigate possible regulatory constraints on TAD evolution, we considered 1,464 microsyntenic pairs of genes (that is, consecutive orthologues) conserved between skate, mouse and gar. In skates, such conserved gene pairs shared TADs more often than other consecutive genes (98% versus 95%, χ2, P = 3.7 × 10−13; Extended Data Fig. 7a). Those pairs were present in 718 out of the 1,678 skate TADs (42%), highlighting that individual TADs are constrained but not invariant across deep evolutionary timescales (Extended Data Fig. 7b). TADs containing deeply conserved microsyntenic pairs are significantly larger and contain more distal ATAC–seq peaks and putative promoter–enhancer interactions, as defined on the basis of HiChIP analysis, compared with non-conserved TADs (Extended Data Fig. 7c; Mann–Whitney U-test, P = 1.23 × 10−24, 3.81 × 10−36 and 1.04 × 10−41, respectively). This suggests that the deep conservation of individual TADs emerges from regulatory constraints (Extended Data Fig. 7d,e).
Our results suggest that 3D chromatin organization in skates results from the interplay of two mechanisms—compartmentalization driven by transcriptional state and TADs formed by loop extrusion. Such organization is similar in bony fishes/tetrapods, indicating that TAD formation through loop extrusion was present in the gnathostome ancestor. As the appearance of this common ancestor was temporally close to 2R, we explored the regulatory fate of homologous TADs in relation to this duplication event. We found that, although the size and gene density of TADs is similar between α and β chromosomes, there are notably fewer TADs in beta (Fig. 3f,g and Extended Data Fig. 7f). Regulatory landscapes derived from H3K4me3 HiChIP experiments followed a similar trend (Extended Data Fig. 7g,h). We confirmed that the lower number of TADs in beta could not be explained by TAD fusions in beta or boundary gains in α segments (Extended Data Fig. 7i). These results indicate that many TADs disappeared from the early gnathostome genome after 2R, while those that persist are comparable in size (Fig. 3g). Whether losses in beta segments were caused by the deletion of whole redundant TADs or the progressive erosion and pseudogenization of their genes is difficult to ascertain.
PCP pathway as a driver of fin expansion
To examine whether genomic rearrangements could have driven skate pectoral fin evolution through TAD alterations, as reported for other mammalian traits11, we identified synteny breaks by aligning six jawed vertebrate genomes (Fig. 4a). As expected, the number of (micro)syntenic changes between species increases with phylogenetic distance (Fig. 4a and Extended Data Fig. 8a), from 18 breaks in L. erinacea that occurred after the split of the two skate lineages to 1,801 between cartilaginous and bony fishes (around 2 breaks per million years).
As anterior expansion of the pectoral fin is a defining characteristic of skates, we focused on the 123 synteny breaks shared by the little and thorny skate genomes relative to other vertebrates. We found an enrichment of synteny breaks near TAD boundaries—42 breaks occurred within 50 kb of a TAD boundary, compared with 15 expected under a random break model (empirical P < 1 × 10−4; Fig. 4b). This enrichment supports the hypothesis that genome rearrangements that interrupt TADs are evolutionarily disfavoured owing to deleterious enhancer–promoter rewiring9.
Conversely, we hypothesized that the 81 breaks that interrupt TADs could be enriched for enhancer–promoter rewiring associated with gene regulatory changes. Interrupted TADs include 2,041 genes and, by filtering those with interactions across synteny breaks on the basis of anterior fin H3K4me3 HiChIP analysis, we identified 180 genes that are potentially associated with pectoral fin expansion. Signalling pathway analysis revealed enrichment for Wnt/PCP pathway components (Fig. 4c and Extended Data Fig. 8b,c), including the important regulator prickle1 (Fig. 4d) and other potentially relevant genes such as the hox gene activator psip134 (Extended Data Fig. 8d–g). Among eight candidate genes of which we determined the expression using whole-mount in situ hybridization (WISH), only prickle1 and psip1 exhibited clear anteriorly enriched expression patterns (Fig. 4e and Supplementary Fig. 6).
To test whether alterations in TADs drove changes in gene expression, we performed comparative WISH analysis of prickle1 between skate and chain catshark (S. retifer) embryos at equivalent stages (Fig. 4e). prickle1 expression was higher in the anterior pectoral fin of skates compared to a weak expression without spatial enrichment in shark fins (Supplementary Fig. 7). Similarly, we found differential expression for Psip1, suggesting a potential involvement of Hox-related pathways in the skate fin phenotype (Extended Data Fig. 8f,g).
Given the specific pattern of prickle1 expression, we examined the function of the PCP pathway in anterior fin expansion using cell shape analysis, and found that anterior mesenchymal cells are more oval than those in the central and posterior regions (Supplementary Fig. 8). Treatment with a Rho-kinase (ROCK) inhibitor from stage 29 to 31 showed that the overall number of fin rays associated to each tribasal bone of the skate fin (propterygium, mesopterygium and metapterygium) was reduced in the ROCK-inhibited embryos compared with in the controls, with greater losses in the anterior than in the posterior fin region (Fig. 4f,g, Extended Data Fig. 9 and Supplementary Figs. 9 and 10). Despite significant variation across stage and treatment (Extended Data Fig. 9 and Supplementary Fig. 10), geometric morphometric analyses suggest that ROCK-inhibitor-treated embryos showed a less pronounced anterior expansion of the pectoral fin, in contrast to control embryos in which it extends anteriorly towards the eye by stage 31 (Extended Data Fig. 10). To rule out a general delay in body growth, we implanted acrylic beads soaked in ROCK inhibitor into the anterior pectoral fins at stage 29 and investigated fin rays at stage 31 (Extended Data Fig. 11). In contrast to control embryos with DMSO beads, specimens with ROCK inhibitor exhibited aberrant branching, fusion and loss of fin rays near beads or at potential bead implantation sites (6 out of 9 embryos for 100 μM and 6 out of 10 for 1 mM inhibitor beads). Taken together, these findings suggest that TAD rearrangements had a role in recruiting and repurposing genes and pathways during the evolution of the unique batoid fin morphology.
HOX-driven gli3 repression in skate fins
To examine the transcriptional drivers of skate fin morphology, we generated and compared RNA-seq datasets between pectoral fins and pelvic fins, which exhibit a characteristic tetrapod gene expression pattern1. We identified 193 and 117 genes preferentially expressed in pectoral and pelvic fins, respectively (Supplementary Table 4), including several transcription factors and components of different signalling pathways. To identify changes in the appendage gene regulatory network, we compared differentially expressed genes in skate fins with corresponding mouse fore- and hindlimb RNA-seq data35,36 (Fig. 5a and Supplementary Fig. 11a). Key genes in determining anterior and posterior paired appendages, such as tbx5 and tbx4, display a similar expression pattern, suggesting a conserved function across jawed vertebrates33. However, several genes, including hox genes or the master regulator of vertebrate hindlimb specification pitx137, displayed clear differences between skates and mice (Supplementary Figs. 11a and 12), suggesting that altered regulation of appendage-related factors may contribute to skate pectoral fin expansion.
To examine the transcriptional changes associated with skate pectoral fins, we analysed available anterior and posterior pectoral fin RNA-seq data1. In skates, hox genes show distinctive expression differences between the anterior and posterior pectoral fin (Supplementary Table 5 and Supplementary Fig. 11b). Anterior expression of the hoxa and hoxd genes forms a secondary AER-like organizer that is probably involved in the overgrowth of the skate pectoral fins1,38,39. Secondary AER formation is associated with changes in the expression of gli3—a key regulator of hedgehog signalling in appendage patterning40,41. Specifically, gli3 is expressed in the posterior pectoral fin versus predominantly anterior expression in pelvic fins, as in several vertebrate species1 (Fig. 5b). Recently, it has been shown that (1) the Hoxa13 and Hoxd13 genes downregulate Gli3 expression for proper thumb formation42 in the mouse limb, (2) HOX13 proteins bind to and repress Gli3 limb enhancers and (3) compound Hox13 mutants cause anterior extension of Gli3 expression42. Anterior Hox genes may also have a role in GLI3 transcriptional regulation, as Hoxa2 binds to several enhancers within the Gli3 locus (shown by ChIP–seq data43; Extended Data Fig. 12a). Overexpression of hoxa2 in zebrafish pectoral fins also induces transcription of wnt3 (an AER marker gene) potentially inhibiting gli3 expression1. Some of these hox genes, including hoxa13 and hoxa2, are strongly expressed in skate anterior pectoral fins (Supplementary Fig. 11b).
On the basis of this evidence, and considering the redundancy between Hoxd13 and Hoxa13 proteins44–46, we explored the Hox–Gli3 relationship using a validated hoxd13a-GR overexpression construct in zebrafish47. After dexamethasone treatment, overexpression of Hoxd13a caused increased fin proliferation, distal expansion of chondrogenic tissue and fin fold reduction45. Furthermore, 35% of the injected zebrafish embryos showed a decrease in gli3 fin expression (Extended Data Fig. 12b). Moreover, a gli3 loss-of-function mutant in medaka fish shows multiple radials and rays in a pattern similar to the polydactyly of mouse gli3 mutants, but also to pectoral skate fins48. These findings, together with the anterior expression of 3′ hox genes, suggest that Gli3 downregulation, mediated by Hox repression, is a potential mechanism underlying the striking pectoral skate fin shape.
A skate-specific hoxa fin enhancer
We hypothesized that the anteroposterior expression differences found in other vertebrates but not in skates could arise from changes in cis-regulation. To identify CREs, we performed ATAC–seq analysis in anterior and posterior pectoral fins, as well as in whole pelvic fins. DNA methylation profiling (Supplementary Fig. 16a) revealed that differentially accessible ATAC peaks are hypomethylated in developing pectoral and pelvic fins and remain hypomethylated in adult fins (Supplementary Fig. 16b,c), suggesting epigenetic memory as reported in other vertebrates48–50. We used our HiChIP datasets to associate CREs with target genes, and identified many differentially accessible ATAC peaks clustered around genes that are critical for appendage patterning, such as, tbx5, tbx4, pitx1 and hox genes (Supplementary Tables 6 and 7). Notably, Pitx1 displays a similar regulatory landscape in skate pectoral and pelvic fins (Supplementary Figs. 12 and 13), contrasting with the tissue-specific regulatorion in mouse51.
To further investigate anterior Hox gene regulation in skate pectoral fins, we integrated our anterior and posterior pectoral fin ATAC–seq data with existing RNA-seq data from these tissues1. The few differentially accessible CREs were associated with differentially expressed genes relevant for patterning, such as hoxa2, pax9, tbx2 and alx4 anteriorly, as well as chordin, hoxa9, hoxd10, hoxd11, hoxd12 and grem1 in the posterior region (Supplementary Table 5). Notably, a region located between hoxa1 and hoxa2 is more accessible in anterior pectoral than in posterior pectoral or pelvic fins (Fig. 5c). Zebrafish transgenic assays confirmed enhancer activity for this open chromatin region, which drives gene expression in anterior pectoral fins (Fig. 5d). This element is conserved in cartilaginous fishes but not found in bony fishes (Supplementary Fig. 14). Importantly, the orthologous region in catshark does not promote transgene expression in zebrafish (Fig. 5d), suggesting that, although this region is conserved in different chondrichthyan species, only the skate sequence is functionally active during early development. As this potential enhancer lies close to the hoxa2 promoter, we examined whether it is specific for hoxa2 or shared with other hox genes. Using H3K4me4 HiChIP, HiC and virtual 4C data, we observed that this enhancer forms robust interactions with most genes of the hox cluster in the anterior pectoral fin (Fig. 5c and Supplementary Fig. 15a), including hoxa13 located in the 5′ adjacent TAD (Figs. 3c and 5c) and expressed in the anterior pectoral fin (Fig. 5b and Supplementary Fig. 15b). Overall, these results demonstrate the existence of skate-specific CREs that can be linked to the formation of a secondary AER-like domain in the anterior pectoral fin.
Discussion
Here we combined genomic and functional approaches to uncover fundamental principles of genome regulation in the skate lineage and provide a molecular basis for the formation of wing-like batoid fins2. The position of skates in the vertebrate evolutionary tree, and their slow rate of genome evolution, revealed new insights into karyotype stabilization after two rounds of WGD. Gene loss and karyotype evolution dynamics have occurred at a different pace across jawed vertebrate lineages. Analysis of the elephant shark genome found a slower rate of evolution and reduced gene loss compared with tetrapods25,52. Here we showed that skate not only possesses comparably low rates of change, but also retains numerous ancestral gnathostome chromosomes, and that the smaller chromosome numbers of chicken and spotted gar arose by fusion of these ancestral units. This process was accompanied by considerable gene order rearrangement between cartilaginous and bony fishes, despite extensive conservation of TAD gene contents. Conservation of TADs in the absence of a globally colinear gene order emphasizes the impact of regulatory constraints in maintaining gene groupings.
The skate genome is functionally constrained by 3D regulatory mechanisms that parallel those described in bony fishes and tetrapods, including the presence of a CTCF-orientation code and associated loop extrusion13. Our findings imply that these mechanisms emerged early in vertebrate evolution, probably influencing the appearance of phenotypic novelties. These mechanisms further constrain genome evolution, as most skate-specific chromosome rearrangements occur at TAD boundaries, resulting in limited effects on gene regulation, as reported in mammals53. Notably, we observed the complete disappearance of TADs in the paralogous regions prone to gene loss after 2R (beta segments), with the remaining β and α TADs having the same average size and gene number. Although asymmetric paralogue loss after WGDs is considered to be a key factor in the emergence of novel gene regulation5, the loss of TADs in beta regions indicates that entire paralogous regulatory units can be lost after WGDs and stresses the importance of regulatory constraints in shaping genome organization. It remains to be seen whether the regulatory potential of missing TADs is incorporated into other regulatory landscapes and enhances pleiotropy.
Related to novel skate morphology, we found lineage-specific TAD-disrupting rearrangements affecting genes involved in PCP signalling—an ancient developmental pathway54 that is essential for cell orientation and patterning. We found that the main effector of this pathway, prickle1, has anteriorized pectoral fin expression as well as in anterior pelvic fins and in the clasper (Fig. 4e and Supplementary Fig. 6)—two structures that also extend laterally and posteriorly during skate development55. Importantly, unique pectoral and pelvic fin morphologies evolved simultaneously during batoid diversification, suggesting a deployment of similar/same genetic cascades during paired fin development56, as suggested by the presence of common markers like wnt3 and hoxa111,39. The tissue-specific modulation of the PCP pathway through redeployment of a main pathway effector (prickle1) provides a compelling example of how existing gene networks can evolve new functions through genomic rearrangements.
Finally, we implicate altered regulation of 3′ hox genes and their activator psip1 in novel skate pectoral fin development. Although these genes show posterior expression in most vertebrate appendages (including skate pelvic fins), they are notably expressed in skate anterior pectoral fin. Our hoxd13a overexpression experiments (Extended Data Fig. 12b) suggest that the increased levels of hox gene expression in anterior pectoral fins, together with other regulatory changes, downregulates Gli3, leading to substantially altered morphology and illustrating the plasticity of the Shh–Gli3–Ptch1 pathway in the evolution of vertebrate appendage morphology46,56–59. The identified skate-specific hoxa fin enhancer suggests a cis-regulatory basis for altered Shh–Gli3–Ptch1 signalling. Overall, our study shows how changes in CREs and 3D chromatin organization act as essential forces driving adaptative evolution.
Methods
Animal use
All fish work, including experiments with skate embryos, was conducted according to standard protocols approved by the Institutional Animal Care and Use Committee (IACUC) of Rutgers University (protocol number, 201702646), the IACUC of Marine Biological Laboratory (protocol number, 18-36) and the University of Chicago IACUC (protocol number, 71033). Danio rerio embryos were obtained from AB and Tübingen strains, and manipulated according to protocols approved by the Ethics Committee of the Andalusia Government (license number, 182-41106) and the national and European regulation established. Zebrafish procedures were reviewed and approved by the ethical committees from the University Pablo de Olavide, CSIC, and the Andalusian government, and performed in compliance with all relevant ethical regulations.
Genomic DNA extraction and library construction
Skate DNA was isolated using extensive proteinase K digestion and phenol–chloroform extraction from the muscle of a single L. erinacea specimen. For genome assembly, we generated both accurate short reads and noisy long reads. A contiguous long read (CLR) library for Pacbio sequencing was prepared and sequenced at the Vincent J. Coates Genomics Sequencing Laboratory at UC Berkeley. A total of 32 cells were sequenced on the Pacbio Sequel instrument using the V7 chemistry and yielded a total 10.2 million Pacbio reads totalling 163 Gb with a median size of 10.9 kb and a read N50 of 29 kb.
A paired-end Illumina library with a 600 bp insert was also sequenced for 2 × 250 bp in rapid run mode on the HiSeq 2500 instrument at BGI yielding 641 million reads and 160.3 Gb of sequence.
Genome assembly
Genome size was estimated by analysing a k-mer spectrum with a mer size of 31. By fitting a multimodal distribution using Genomescope 2.0, and estimated a genome size of 2.13 Gb (as well as an heterozygosity of 1.56%)62. To take advantage of both short and long reads, we opted for a hybrid assembly strategy. First, we generated de Brujin graph contigs using megahit (v.1.1.1) using a multi-k-mer approach (31, 51, 71, 91 and 111-mers) and filtering out k-mers with a multiplicity lower than 5 (--min-count 5). We obtained 2,750,419 contigs with an N50 of 1,129 bp representing a total of 2.23 Gb. We then used these contigs to prime the alignment and assembly of the Pacbio reads using dbg2olc (c. 10037fa)63 using a k-mer of 17 (k 17), a threshold on k-mer coverage of 3 (KmerCovTh 3), a minimal overlap of 30 (MinOverlap 30) and an adaptive threshold of 0.01 (AdaptiveTh 0.01) and removing chimeric reads from the dataset (RemoveChimera 1). This assembler generated an uncorrected backbone of overlapping reads with an N50 of 4.96 Mb and a total size of 2.25 Gb. To correct sequencing errors, we processed this sequence file to two successive rounds of consensus by aligning Pacbio reads with minimap2 (v.2.12, map-pb setting)64 and Racon (v.1.3.1) using the default parameters followed by one final round of consensus using the Illumina reads. We evaluated the progress of the polishing process with the BUSCO tool (v.3.0.2) that seeks widely represented single-copy gene families in the assembly65. Our final polished assembly contained 95.1% of vertebrate BUSCO genes (Supplementary Table 1). To exclude residual haploid contigs from the assembly, we aligned Illumina reads once more using bwa and computed a distribution of coverage that showed some residual positions at half coverage (31×). We used purge_haplotigs (v.1.0.2)66 by defining a coverage threshold between haploid and diploid contigs at 40× (and a minimum of 10× and maximum of 100×). The filtered assembly has a size of 2.19 Gb, an N50 of 5.35 Mb and 2,595 contigs in total, and the same BUSCO statistics as the unfiltered one (Supplementary Table 1).
This assembly was then scaffolded using chromatin-contact evidence obtained from Hi-C sequencing analysis of L. erinacea fins (see below) at Dovetail Genomics using the HiRise pipeline67. The accuracy of the resulting scaffolded assembly was verified and proofread by carefully inspecting the contact map in Juicebox68 and HiGlass browser69. This assembly comprises 50 scaffolds larger than 1 Mb that represent 92% of the assembly size and 39 scaffolds larger than 10 Mb that show mostly internal contacts. Despite no karyotyping evidence is directly available for L. erinacea, closely related species show a haploid number of 49 chromosomes, which is consistent with the observed number of chromosomes14.
As the final assembly size was smaller than the experimentally assessed genome size of 3.5 Gb, we performed gap closing on the final assembly using PBjelly70 that proceeds through alignment of the PacBio reads on each gap border and local reassembly. The effect on the assembly statistics was marginal, but we used this assembly as our final one (Supplementary Table 1).
Annotation
RNA-seq reads of strand-specific libraries from five bulk embryonic stages and 13 organs were aligned to the genome using STAR (v.2.5.2b)71 and each library assembled independently using stringtie (v.1.3.3)72. Stringtie assemblies were then merged using TACO (v.0.7.3)73. RNA-seq reads were also assembled de novo using Trinity (v.2.8.4)74. Finally, the iso-seq protocol was applied to generate full-length transcripts using Pacbio long-reads. Both Trinity assembled transcripts and iso-seq transcripts were aligned to the genome using GMAP (v.2018-07-04)75. Then, both TACO assembled transcripts and aligned de novo transcripts were leveraged using Mikado (v.1.2.1)76 to generate one consensus reference transcriptome, while predicting coding loci using Transdecoder (v.5.5.0). Using selected transcripts (2 introns or more, complete CDS, single hit against swissprot), we built an Augustus (v.3.3.3) hidden Markov model (HMM) profile for ab initio gene prediction77. We predicted skate genes using this profile and hints derived from (1) the mikado transcript assembly (exon hints); (2) intron hits obtained using bam2hints on a merged bam alignment of the RNA-seq data after filtering spurious junctions with portcullis (v.1.2.0)78; and (3) an alignment of human protein using exonerate (v.2.2.0)79.
A repeat library was constructed using Repeatmodeler and repeats were masked in the genome using Repeatmasker (v.4.0.7). We filtered out gene models that overlap massively with mobile elements and obtained 30,489 genes models. For these genes, isoforms and untranslated regions were added by two rounds of reconciliation with an assembled transcriptome using PASA80. Our set of coding genes includes 5,800 PFAM domains, a similar value to other well-annotated vertebrate genomes. To further examine the validity of gene models, we assessed (1) whether their coding sequence showed similarity to that of another species using gene family reconstruction (see below); (2) whether they possessed an annotated PFAM domain; and (3) whether they are expressed above 2 FPKMs in at least one RNA-seq dataset. These criteria reduced the number of bona fide coding genes to 26,715.
Gene family, synteny and phylogenetic analyses
We performed gene family reconstruction using OMA (v.2.4.1)81 between selected vertebrate species to identify single-copy orthologues. These orthologues were used to infer gene phylogeny after processing as described previously82: HMM profiles were built for each orthologous gene family and searched against translated transcriptomes using the HMMer tool (v.3.1b2)83. Alignments derived from each orthologue were aligned using MAFFT (v.7.3)84, trimmed for misaligned regions using BMGE (v.1.12)85 and assembled in a supermatrix. Phylogeny was estimated using IQTREE (v.2.1.1) assuming a C60+R model and divergence times estimated using Phylobayes (v.4.1e)86 assuming a CAT+GTR substitution, and a CIR clock model, soft constraints and a birth-death prior on divergence time. Calibrations were taken from previous papers18,87.
We identified conserved segments across vertebrates, by counting single-copy copy genes derived from OMA clustering sharing the same set of chromosomal locations in selected species, to identify putative ancestral vertebrate units. We examined conserved syntenic orthology by identifying sets of genes shared by pairs of chromosomes in distinct species using reciprocal best hits computed using Mmseq288. We performed a Fisher test to detect pairs of chromosomes showing significant enrichment, and assigned ancestral linkage groups (ALG) based on comparison with amphioxus and sea scallop. We computed gene family composition and analysed patterns of gene loss and duplications using reconstructed gene trees derived from gene families established with Broccoli89 and subjected to species-tree aware gene tree inference using Generax90.
Hi-C
The Hi-C protocol was performed as described previously with minor modifications91–93. Two biological replicates of L. erinacea Stg.30 pectoral fin buds, each consisting of ten fins, were fixed in a final concentration of 1% PFA for 10 min at room temperature. Fixation was stopped by placing the samples on ice and by adding 1 M glycine up to a concentration of 0.125 M. The quenched PFA solution was then removed and the tissue was resuspended in ice-cold Hi-C Lysis Buffer (10 mM pH 8 Tris-HCl, 10 mM NaCl, 0.2% NP-40 and 1× Roche Complete protease inhibitor). The lysis was helped with a Dounce Homogenizer Pestle A on ice (series of 10 strokes in 10 min intervals). Nuclei were then pelleted by centrifugation for 5 min, 750 rcf at 4 °C, washed twice with 500 µl of 1× PBS and finally resuspended with water to final volume of 50 µl. A total of 50 µl of 1% SDS was then added and the sample incubated 10 min at 62 °C. The SDS was then quenched by adding 292 µl water and 50 µl of 10% Triton X-100. Chromatin was then digested by adding 50 µl of 10× DpnII buffer and 8 µl of 50 U µl−1 DpnII (NEB, R0543M) followed by incubation at 37 °C overnight in a ThermoMixer with shaking (800 rpm). DpnII was then heat-inactivated at 65 °C for 20 min with no shaking. Chromatin sticky ends were then filled-in and marked with biotin by adding 50 µl of Fill-in Master Mix (5 µl of 10× NEBuffer2, 1.5 µl of 10 mM mix of dCTP, dGTP and dTTP, 37.5 µl of 0.4 mM biotin-dATP (Thermo Fisher Scientific, 19524016) and 10 µl of 5 U µl−1 Klenow (NEB, M0210)) and incubating for 1 h at 37 °C with rotation. Filled-in chromatin was then ligated by adding 500 µl of ligation master mix (100 µl of 10× NEB T4 DNA ligase buffer with ATP (NEB, B0202), 100 µl of 10% Triton X-100, 10 µl of 10 mg ml−1 BSA and 6.5 µl of 400 U µl−1 of T4 DNA ligase (NEB, M0202)) and incubated 4 h at 16 °C with mixing (800 rpm, 30 s pulses every 4 min). Ligated chromatin was then reverse-cross-linked by adding 50 µl of 10 mg ml−1 proteinase K and incubating the sample at 65 °C for 2 h. De-cross-linking was completed by adding 50 µl extra of proteinase K and incubating overnight at 65 °C. DNA from the reverse-cross-linked chromatin was purified using phenol–chloroform extraction and ethanol precipitation. Pelleted DNA was resuspended in 100 µl of TLE. Biotin removal from unligated ends was performed in a final volume of 130 µl with 5 µg of the purified DNA, 13 µl of 10× NEBuffer2.1, 3.25 µl of 1 mM dNTPs, 5 µl of 3 U µl−1 T4 DNA polymerase (NEB, M0203L). The sample was incubated in a thermocycler 4 h at 20 °C and the reaction subsequently stopped by adding EDTA to a final concentration of 10 mM followed by 20 min at 75 °C. A total of 130 µl was used for DNA sonication in a M220 Covaris Sonicator (peak power, 50; duty factor, 20%; cycles/burst, 200; duration, 65 s). After sonication, DNA was size-selected using AMPure XP beads (Agencourt, A63881). In brief, in a first selection, 0.6× bead mix was used and the supernatant was recovered. In the second selection, 1.2× bead mix was used and the bead fraction was recovered. Size-selected DNA was resuspended in 50 µl of TLE and then processed for end repair. End repair was performed by adding 20 µl of the end repair mix (7 µl of 10× NEB ligation buffer, 1.75 µl of 10 mM dNTP mix, 2.5 µl of T4 DNA polymerase (3 U µl−1 NEB M0203), 2.5 µl of T4 PNK (10 U µl−1, NEB M0201) and 0.5 µl of Klenow DNA polymerase (5 U µl−1, NEB, M0210)) and incubating in a thermocycler with the following program: 15 °C for 15 min, 25 °C for 15 min and 75 °C for 20 min. Biotinylated ligation ends were then pulled down using 10 µl of Dynabeads MyOne Streptavidin C1 (Invitrogen, 650.01) per µg of DNA. The beads were washed twice with Tween wash buffer (85 mM Tris-HCl pH 8, 0.5 mM EDTA, 1 M NaCl, 0.05% Tween-20) before being resuspended in 400 µl of 2× bead binding buffer (10 mM Tris-HCl pH 8, 1 mM EDTA, 50 mM NaCl) and incubated for 15 min with rotation with 400 µl of the end repaired sample (70 µl of end repair reaction plus 330 µl of TLE). The beads were then washed once with 400 µl of 1× bead binding buffer and once with 100 µl TLE before being finally resuspended in a final volume of 41 µl. A-tailing was then performed in a total volume of 50 µl by adding 5 µl of 10× NEBuffer2.1, 1 µl of 10 mM dATP and 3 µl of 5 U µl−1 Klenow fragment 3′→5′ exo- (NEB, M0212) in the thermocycler with the following program: 37 °C for 30 min then 75 °C for 20 min. A-tailed sample was then washed with 400 µl of 1× T4 ligase buffer and resuspended in 40 µl of the same buffer to prepare it for the adaptor ligation, which was performed by adding 1 µl of 10× T4 ligation buffer, 4 µl of T4 DNA ligase and 5 µl of 15 µM Illumina paired-end pre-annealed adapters. The reaction was incubated for 2 h at room temperature and the beads were then washed twice with 1× NEBuffer2.1. The beads were resuspended in 50 µl of the final library PCR reaction for library generation (25 µl of NEBNext High-Fidelity 2× PCR Mix, 0.5 µl of PE1 primer 25 µM and 0.5 µl of PE2 primer 25 µM plus milliQ water). The PCR was performed in a thermocycler with the following program: 98 °C for 60 s; 5–10 cycles of 98 °C for 10 s, 65 °C for 30 s, 72 °C for 30 s and 72 °C for 5 min. Test PCRs were used to determine the number of cycles. Final single-sided AMPure XP bead purification was performed to eliminate primer-dimers (1.1× proportion). Final libraries were sent for paired-end sequencing.
Hi-C analysis
Hi-C paired-end reads were mapped to the skate genome using BWA94. Ligation events (Hi-C pairs) were then detected and sorted, and PCR duplicates were removed using the pairtools package (https://github.com/mirnylab/pairtools). Unligated and self-ligated events (dangling and extra-dangling ends, respectively) were filtered out by removing contacts mapping to the same or adjacent restriction fragments. The resulting filtered pairs file was converted to a .tsv file that was used as input for Juicer Tools 1.13.02 Pre, which generated multiresolution .hic files95. These analyses were performed using previously published custom scripts (https://gitlab.com/rdacemel/hic_ctcf-null): the hic_pipe.py script was first used to generate .tsv files with the filtered pairs, and the filt2hic.sh script was then used to generate Juicer .hic files. Visualization of normalized Hi-C matrices and other values described below, such as insulation scores, TAD boundaries, aggregate TAD, Pearson’s correlation matrices and eigenvectors, were calculated and visualized using FAN-C96 and custom scripts available in the GitLab repository (https://gitlab.com/skategenome). The observed–expected interchromosomal matrix (Fig. 1d) was calculated counting interchromosomal normalized interactions in the 1 Mb KR normalized matrix (with the two replicates merged). Expected matrix was calculated as if interchromosomal interactions between two given chromosomes were proportional to the total number of interchromosomal interactions of these two chromosomes. A/B compartments were first called in each of the replicates separately using the first eigenvector of the 500 kb KR normalized matrix. Eigenvector correlation was high (r = 0.91, Extended Data Fig. 3b) and the replicates were then merged. The first eigenvector was calculated again and oriented according to open chromatin using the amount of ATAC–seq signal in the anterior pectoral fin sample. The same strategy was used to look at compartment differences between anterior and posterior fin Hi-C, but this time using 250 kb resolution (Extended Data Fig. 6). ATAC–seq, percentage of GC, gene models and RNA-seq signal overlaps with compartments were calculated using bedtools intersect97. Compartment calling and the different overlaps are available in Supplementary Table 8. The saddle plot was calculated using FAN-C. To define TADs, insulation scores were also calculated separately in the 25 kb resolution KR matrices of each of the replicates (using FAN-C and as described previously98 with a window size of 500 kb). Again, correlation between insulation scores of both replicates was high (r > 0.94, Extended Data Figs. 4b and 6f). Definitive boundaries and TADs were then calculated in a merged 25 kb matrix with a window size of 500 kb and using a boundary score cut-off of 1 (Supplementary Table 9) or no cut-off for interspecies comparison analyses with mouse and zebrafish. CTCF motifs and their relative orientations were mined inside ChIP–seq peaks in mouse and zebrafish or merged ATAC–seq peaks between the anterior and posterior pectoral fin samples using Clover99 or FIMO100 (MA0139.1 Jaspar PWM, PWM score threshold of 8). They were later overlapped with previously calculated boundaries. Boundary feature heat maps from Supplementary Fig. 5 were generated using profileplyr101 (https://bioconductor.org/packages/release/bioc/html/profileplyr.html) after binning the different signals in 5 kb windowed bigwig files. Chromatin loops were called using HICCUPS95 with the default parameters in merged replicates of the anterior and posterior fin Hi-C experiments, and in a megamap merging anterior and posterior fin Hi-C maps. A consensus set of loops was then calculated using hicMergeLoops from the HiCExplorer suite102 and reads were counted in the different replicate 10 kb resolution Hi-C maps to perform the differential loop analysis with EdgeR103. Virtual 4C-seqs were plotted from 10-kb-resolution Hi-C matrices using custom scripts.
HiChIP
HiChIP assays were performed as previously described104, with some modifications. In brief, 10 anterior and posterior pectoral fins of stg. In total, 30 skate embryos were fixed in a final concentration of 1% PFA for 10 min at room temperature. Fixation was quenched with 1 M glycine up to a concentration of 0.125 M. The tissue was then resuspended in 5 ml cell lysis buffer and homogenized using a Douncer on ice. After the lysis, nuclei were pelleted by centrifuging at 2,500 rcf, and washed in 500 µl of lysis buffer. Chromatin digestion and ligation, ChIP, tagmentation and library preparation were performed as previously described92. The antibody used was a ChIP-grade anti-histone H3 trimethyl K4 from Abcam (ab8580). The total amount of antibody used was 20 µg, at a dilution of 1 µg µl−1.
HiChIP analysis
Paired-end reads from HiChIP experiments were aligned to the skate genome using the TADbit pipeline105 with the default settings. In brief, duplicate reads were removed, DpnII restriction fragments were assigned to resulting read pairs, valid interactions were kept by filtering out unligated and self-ligated events and multiresolution interaction matrices were generated. Dangling-end read pairs were used to create 1D signal bedfiles that are equivalent to those of ChIP–seq experiments. Coverage profiles were then generated in the bedgraph format using the bedtools genomecov tool (https://bedtools.readthedocs.io/en/latest/content/tools/genomecov.html), and bedgraph to bigwig conversions were also performed for visualization using the bedGraphToBigWig tool from UCSC Kent Utils (https://github.com/ucscGenomeBrowser/kent). 1D signal bedgraph files were used to call peaks with MACS2106 using the no model and extsize 147 parameters and FDR < 0.01.
FitHiChIP107 was used to identify ‘peak-to-all’ interactions at 10 kb resolution using HiChIP-filtered pairs and peaks derived from dangling ends. Loops were called using a genomic distance of between 20 kb and 2 Mb, and coverage bias correction was performed to achieve normalization. FitHiChIP loops with q values smaller than 0.1 were retained for further analyses. Further filtering was performed to enrich enhancer–promoter interactions. First, loops established by two H3K4me3 peaks (likely promoter–promoter interactions) or no H3K4me3 peaks (likely enhancer–enhancer and others) were filtered out. Second, loops related to the H3K4me3 peak of the same gene promoter are grouped together into a common ‘regulatory landscape’, composed of a promoter anchor and several distal anchors. Then, regulatory landscapes with only one distal anchor were filtered out. Third, to filter out further spurious interactions, we used the rationale that genomic bins that interact with a given promoter rarely do so in isolation. We therefore calculated a distance cut-off for ‘interaction gaps’ in regulatory landscapes. Regulatory landscapes containing interaction gaps bigger than the distance cut-off were trimmed and the distal anchors beyond the interaction gap were discarded. The cut-off was determined for each sample independently by calculating the distribution of the biggest gaps (calculating the biggest gap for each regulatory landscape) and setting the cut-off to the sum of the third quartile plus twice the interquartile range (classic outlier definition). Overlaps with ATAC–seq peaks in the pectoral fin were calculated using bedtools intersect (Extended Data Fig. 5a). Inter-TAD loops were also calculated using bedtools intersect -c using the TADs and the loops. Loops intersecting more than one TAD were considered inter-TAD loops. Randomized controls were generated shuffling TAD positions before the intersection using bedtools shuffle. For differential analysis between the anterior and the posterior fin, filtered distal anchors were fused when closer than 20 kb using GenomicRanges reduce108. The loops with the merged distal anchors are provided in Supplementary Table 10. To perform the differential analysis, the number of reads supporting the union set of loops was extracted for each of the sample replicates. Correlations shown in Extended Data Fig. 5c and the differential analysis performed using EdgeR103 (Extended Data Fig. 5d) were calculated with this table. An FDR cut-off of 0.1 was chosen to consider a loop to be significantly stronger in either the anterior or the posterior fin. Custom code used for enhancer–promoter loop filtering and differential analysis is included in the GitLab repository (https://gitlab.com/skategenome).
RNA-seq
RNA-seq experiments from anterior and posterior pectoral and whole pelvic skate fins were performed as previously described6. In brief, two anterior or posterior pectoral and two pelvic fins of stage 31 skate embryos were used for each biological replicate. Total RNA was extracted from each sample using Direct-zol RNA MiniPrep (Zymo Research) and sent for library preparation and sequencing.
RNA-seq analysis
For the RNA-seq data analysis, we used the nf-core/rnaseq pipeline (v.1.4)109 for read alignment, read count and quality control of the results. After this, we performed a differential gene expression analysis using the DESeq2 R library (v.1.30.1)110. Gene Ontology term enrichment analysis was performed using TopGO R library (v.2.42.0)111, with the elim algorithm and Fisher test, retaining terms with P < 0.01.
ATAC–seq
ATAC–seq experiments from anterior and posterior regions of pectoral skate fins and whole pelvic fins were performed as previously described6,112. After dissecting the pectoral fins, one anterior and one posterior regions were used for each biological replicate. In the case of pelvic fins, one fin was used for each biological replicate. Tissue was homogenized using a Pellet Pestle Motor (Kimble) coupled to a plastic pestle, and treated with lysis buffer. Individual cells were counted, and 75,000 cells were tagmented. ATAC–seq libraries were generated by PCR, using 13 cycles of amplification, purified and sent for external sequencing.
ATAC–seq analysis
ATAC–seq data analysis was performed using the nf-core/atacseq pipeline (v.1.0.0)109, which runs Nextflow (v.19.10.0)113, for quality controls, read alignment against the new skate assembly, filtering, data visualization, peak calling, read count and differential accessibility analysis. To compare whole pectoral and pelvic fin samples, we merged the anterior and posterior pectoral samples into one single pectoral fin sample.
Microsyntenic pair analysis
The analysis of microsyntenic pairs shared across the gnathostome lineage was based on a previously described analysis114. In brief, we used the genome assembly and annotation presented in this paper for the little skate in combination with public assemblies and annotations for mouse and garfish downloaded from ensembl (www.ensembl.org; Mus musculus: GRCm38v101; Lepisosteus oculatus: LepOcu1v104). Annotations in .gtf format were converted to genepred with gtfToGenePred (UCSC Kent Utils). Then, for each pair of consecutive genes in skates, we determined whether the orthologue pairs of genes in mouse and garfish were also consecutive (allowing 4 intervening gene models as described previously114). The intergenic space between pairs of genes categorized as syntenic and non-syntenic in skates was overlapped with TAD boundaries and with TADs again using bedtools intersect. TADs were categorized according to the presence or absence of conserved microsyntenic pairs and then the overlap between the different TADs with ATAC–seq peaks or HiChIP loops was calculated again using bedtools intersect. A list of conserved microsyntenic pairs is provided in Supplementary Table 11 and the code is available in the GitLab repository (https://gitlab.com/skategenome).
TAD rearrangements in the skate lineage
To identify skate-specific TAD rearrangements, global alignments were performed with lastz115 against six different gnathostome genomes using as a reference the little skate assembly presented in this study. The chosen species were the thorny skate Amblyraja radiata, two species of shark (the white shark Carcarodon carcarias and the white-spotted bamboo shark Chiloscyllium plagiosum), one chimera (the elephant shark Callorhinchus milii) and a bony fish (the spotted gar Lepisosteus oculatus).The parameters of lastz were adapted to the phylogenetic distance with skate according to previous recommendations116 (see assemblies, substitution matrices and lastz parameters used in Supplementary Table 12). Syntenic chains and nets were then devised as proposed elsewhere117 and further polished using chainCleaner118. Synteny breaks were then defined as the junctions between syntenic nets of any level, excluding those that were caused by the end of a scaffold for such genome assemblies that were not chromosome grade (white shark, elephant shark). The overlap between synteny breaks of different species was inferred using bedtools multiinter. Breaks that were found to be common in sharks, chimeras and a bony fish (garfish) were further considered. The distance between candidate synteny breaks and TAD boundaries (Supplementary Table 9) was next determined using bedtools closest -d and breaks that were located closer than 50 kb to a TAD boundary were discarded. Randomized analysis of the overlap between synteny breaks and TAD boundaries (Fig. 4b) was performed, combining bedtools closest and bedtools shuffle. Finally, we selected candidate genes that displayed enhancer–promoter HiChIP interactions in the anterior or the posterior pectoral fin samples that crossed the synteny break, using again bedtools intersect. Enrichment of signalling pathways of candidate genes was performed using the ReactomePA119 and ClusterProfiler120 R packages. A list of the final synteny breaks and candidate genes is provided in Supplementary Table 13, and the exact code used is provided at the GitLab code repository (https://gitlab.com/skategenome).
WISH
Skate and shark embryos were recovered from egg cases at stage 27 and 30 and fixed by 4% PFA at 4 °C overnight. The next day, the embryos were rinsed three times with PBS-0.1% Tween-20, soaked in 100% methanol and stored at −80 °C. WISH was conducted as previously described1, except for hybridizing the embryos and probes at 72 °C.
Gain of function analysis
Experiments were performed as previously described47. Zebrafish eggs were injected at the one-cell stage with hoxd13a-GR mRNA (70 pg per embryo). Dexamethasone at 10 nM (Sigma-Aldrich, D4902) was added to the medium at 24 h after fertilization, and embryos were fixed at 48 h after fertilization.
RT–qPCR
The pectoral fins of three shark juveniles (S. retifer) were dissected out in DEPC-PBS at stage 30. Three replicates were prepared. Total RNA was separately extracted from each replicate by Trizol (Invitrogen). cDNA was synthesized from the total RNA using the iScript cDNA Synthesis Kit (Bio-Rad). Then, quantitative reverse transcription PCR (RT–qPCR) analysis of gapdh and prickle1 was conducted using the KAPA HiFi HotStart ReadyMix PCR Kit (Kapa Biosystems) and the Applied Biosystems 7300 Real time PCR system. A list of the primers used in this study is provided in Supplementary Table 14. The obtained Ct value from RT–qPCR was converted to arbitrary gene expression values.
Cell elongation analysis
Pectoral fins were dissected from stage 29 skate embryos and fixed by 4% paraformaldehyde overnight. The next day, the fins were rinsed with PBS including 0.1% Triton X-100 and incubated in the blocking buffer (10% sheep serum and 0.1% BSA in PBS-0.1% Triton X-100) at room temperature for 1 h. The blocking buffer was then replaced with blocking buffer including CellMask Deep Red Plasma membrane Stain (1/1,000 dilution, Invitrogen) and DAPI (1:4000 dilution), and incubated at 4 °C overnight. Subsequently, the fins were washed five times for 1 h with PBS-Triton X-100 and mounted onto glass slides. The fins were then scanned using a confocal microscope (Zeiss, LSM510 META). The scanned images were incorporated into Fiji and cell outlines in fin mesenchyme were manually traced. The cell elongation ratio was automatically calculated by the macro ‘Tissue Cell Geometry Stats’ included in Fiji.
ROCK inhibitor treatment
To test the function of the PCP pathway in the pectoral fin development, skate embryos were treated with Y27632—a ROCK inhibitor—from stage 29 to stage 31 and investigated for their fin morphology. ROCK inhibitor (500 µl; stock 50 mM, final 50 µM, Selleck chemicals) or DMSO solution (negative control) was added to 500 ml of artificial saltwater (Instant Ocean), and five skate embryos at stage 29 for each condition were kept submerged in these solutions. Once the negative control embryos reached stage 31, all embryos were fixed by 4% PFA and their total body length was measured under a stereomicroscope. The embryos were stained with Alcian Blue as previously described121 (n = 5 per condition).
To locally inhibit the PCP pathway by the ROCK inhibitor, the beads soaked in the inhibitor solution (100 μM or 1 mM in DMSO) or DMSO were repeatedly implanted into the anterior pectoral fin from stage 29 (one bead per week, three times as total for each embryo). The embryos were raised up to stage 31 in artificial saltwater, fixed by 4% PFA and stained with Alcian Blue (the replicates were 9 or 10 embryos per condition).
Morphometrics analysis of skate fins
Skate embryos at each stage were photographed from the ventral side. A landmark scheme was designed to capture the shape of the pectoral fin (Extended Data Fig. 10). Six homologous landmarks and three curves were assessed in each sample; curves were used to generate sliding semi-landmarks. The samples were digitized in R using the package Stereomorph122. Digitized files were then uploaded to ShinyGM123,124, in which all downstream analyses were performed. The samples were aligned using a generalized Procrustes analysis to account for shape differences due to differences in specimen size, specimen orientation and scaling. A morphospace was generated using these aligned landmark coordinates; deformation grids were generated for the control stage 31 and ROCK-inhibited stage 31 samples (Extended Data Fig. 10). A linear model was run to test for the effect of length, treatment and stage on shape. Both treatment and stage were significantly associated with shape (P = 0.002 and P = 0.001, respectively); as expected, total length was not significantly associated with the size-corrected shapes (P = 0.711).
Transgenic enhancer activity assay
Shark and skate hoxa enhancers were cloned into pCR8/GW/TOPO vector (Invitrogen) by PCR. A list of the primers is provided in Supplementary Table 14. The cloned enhancers were transferred into the pXIG-cfos-EGFP vector by Gateway LR Clonase II (Invitrogen)53. The created vectors were injected into one-cell-stage zebrafish eggs with tol2 mRNA as previously described125. The injected embryos were observed under a stereo-type fluorescent microscope and photographed at 48 h after fertilization.
Phylome reconstruction
The phylome of L. erinacea, meaning the collection of phylogenetic trees for each protein-coding gene in its genome, was reconstructed using an automated pipeline that mimics the steps that one would take to build a phylogenetic tree and based on the PhylomeDB pipeline126. First, a database with the proteomes (that is, full set of protein-coding genes) of 21 species was built that included L. erinacea (a full list of species included is provided in Supplementary Table 1). A BLASTp search was then performed against this database starting from each of the proteins included in the L. erinacea genome. BLAST results were filtered using an e-value threshold of 1 × 10−5 and a query sequence overlap threshold of 50%. The number of hits was limited to the best 250 hits for each protein. A multiple sequence alignment was performed for each set of homologous sequences. Three different programs were used to build the alignments (Muscle (v.3.8.1551)127, mafft (v.7.407)128 and kalign (v.2.04)129) and the alignments were performed in forward and in reverse, resulting in six different alignments. From this group of alignments, a consensus alignment was obtained using M-coffee from the T-coffee package (v.12.0)130. Alignments were then trimmed using trimAl v1.4.rev15 (consistency-score cut-off 0.1667, gap-score cut-off 0.9)131. IQTREE (v.1.6.9)132 was then used to reconstruct a maximum-likelihood phylogenetic tree. Model selection was limited to five models (DCmut, JTTDCMut, LG, WAG, VT) with freerate categories set to vary between 4 and 10. The best model according to the BIC criterion was used. Then, 1,000 rapid bootstrap replicates were calculated. A second phylome starting from D. rerio was also reconstructed according to the same approach. All trees and alignments are stored in phylomedb126 with phylomeIDs 247 for the L. erinacea phylome and 275 for the D. rerio phylome (http://phylomedb.org).
Species tree reconstruction
A species tree was reconstructed using a gene concatenation approach. The trimmed alignments of 102 protein families with a single orthologue per species were concatenated into a single multiple-sequence alignment. IQTREE132 was then used to reconstruct the species tree using the same parameters as above. The final alignment contained 48,958 positions. The model selected for tree reconstruction was JTTDCMut+F+R5. Moreover, duptree133 was used to reconstruct a second species tree using a super tree method. Duptree searches for the species tree that minimizes the number of duplications inferred when each gene is reconciled with the species tree. All trees built during the phylome reconstruction process were used to reconstruct this species tree. The two topologies were fully congruent.
Skate MethylC-seq library preparation
MethylC-seq library preparation was performed as described previously134. In brief, 1,000 ng of genomic DNA extracted from the embryonic stage 31 and adult skate pelvic and pectoral fins was spiked with unmethylated λ phage DNA (Promega). DNA was sonicated to ~300 bp fragments using the M220 focused ultrasonicator (Covaris) with the following parameters: peak incident power, 50 W; duty factor, 20%; cycles per burst, 200; treatment time, 75 s. Sonicated DNA was then purified, end-repaired using the End-It DNA End-Repair Kit (Lucigen) and A-tailed using Klenow fragment (3′→5′ exo-) (New England Biolabs) followed by the ligation of NEXTFLEX Bisulfite-Seq Adapters. Bisulfite conversion of adaptor-ligated DNA was performed using the EZ DNA Methylation-Gold Kit (Zymo Research). Library amplification was performed using KAPA HiFi HotStart Uracil+ DNA polymerase (Kapa Biosystems). Library size was determined using the Agilent 4200 Tapestation system. The libraries were quantified using the KAPA library quantification kit (Roche).
Skate methylome data analysis
Embryonic stage31 and adult skate pelvic and pectoral fin DNA methylome libraries were sequenced on the Illumina HiSeq X platform (150 bp, paired-end). Elephant shark C. milii raw whole genome bisulphite sequencing data (adult liver) were downloaded from NCBI BioProject (PRJNA379367)135. Zebrafish D. rerio raw whole genome bisulphite sequencing data (adult liver) were downloaded from the GEO (GSE122723)136. Sequenced reads in FASTQ format were trimmed using the Trimmomatic software (ILLUMINACLIP:adapter.fa:2:30:10 SLIDINGWINDOW:5:20 LEADING:3 TRAILING:3 MINLEN:50). Trimmed reads were mapped to the Leri_hhj.fasta genome reference (containing the lambda genome as chrLambda) using WALT137 with the following settings: -m 10 -t 24 -N 10000000 -L 2000. Mapped reads in SAM format were converted to BAM format; BAM files were sorted and indexed using SAMtools138. Duplicate reads were removed using Picard Tools (v.2.3.0; http://broadinstitute.github.io/picard/). Genotype and methylation bias correction were performed using MethylDackel (MethylDackel extract Leri_hhj_lambda.fasta $input_bam -o $output --mergeContext --minOppositeDepth 5 --maxVariantFrac 0.5 --OT 10,110,10,110 --OB 40,140,40,140) (https://github.com/dpryan79/MethylDackel). Methylated and unmethylated calls at each genomic CpG position were determined using MethylDackel (MethylDackel extract Leri_hhj_lambda.fasta $input_bam -o output --mergeContext). DNA methylation profiles at differentially accessible ATAC–seq peaks between embryonic pelvic and pectoral fin samples were generated using deepTools2 computeMatrix reference-point and plotHeatmap139.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-023-05868-1.
Supplementary information
Acknowledgements
We thank R. Schneider, D. Sherwood and the staff of the Marine Biological Laboratory Embryology course for providing laboratory space; L. Bertrand and the staff at Leica Microsystems for microscopy support; D. Remsen, S. Bennett, D. Calzarette, and the staff of the Marine Biological Laboratory and MBL Marine Resources Center for technical and animal husbandry assistance; A. Gillis for support and advice with RNA-seq and skate functional experiments; and A. Shindo for technical support of image analysis. T.N., D.N. and A.A. were supported by institutional support provided by the Rutgers University School of Arts and Sciences and the Human Genetics Institute of New Jersey, a Whitman Center Fellowship (Marine Biological Laboratory) and the National Science Foundation under grant no. 2210072. D.N. was further supported by the NIH-IRACDA funded INSPIRE program at Rutgers University; F.M. and D.S.R. by funding from the Okinawa Institute for Science and Technology; D.S.R. by the Marthella Foskett-Brown Chair in Computational Biology; D.G.L. and R.D.A. by a grant from the Deutsche Forschungsgemeinschaft (LU 242672-1) and by a Helmholtz ERC Recognition Award grant from the Helmholtz-Gemeinschaft (ERC-RA1045 0033); R.D.A. and C.P. by EMBO Postdoctoral Fellowships (EMBO ALTF 537-2020 and ALTF 346-2020, respectively); P.M.M.G. by a postdoctoral fellowship from Junta de Andalucía (DOC_00397); J.J.T. and J.L.G.-S. by the European Research Council (ERC, grant no. 740041) and the Spanish Ministerio de Economía y Competitividad (grant no. PID2019-103921GB-I00); J.D. by the NIH grant HG003143; F.M. by the Royal Society (URF\R1\191161); V.A.S. by a Wolfson College Junior Research Fellowship and Marine Biological Laboratory Whitman Early Career Fellowship; J.L.-R. by the Spanish Ministerio de Ciencia e Innovacion (PID2020-113497GB-I00); and A.V. and F.D. by NIH grants R01DE028599 and R01HG003988. Research conducted at the E.O. Lawrence Berkeley National Laboratory was performed under US Department of Energy contract DE-AC02-05CH11231, University of California. J.D. is an investigator of the Howard Hughes Medical Institute.
Extended data figures and tables
Author contributions
J.L.G.-S., F.M., J.J.T., T.N., D.S.R. and D.G.L. conceived the study and designed the experiments. F.M., E.d.l.C.-M., N.S. and J.L.G.-S. coordinated the sequencing of the little skate genome and F.M. assembled and annotated the genome. R.D.A., P.M.M.-G., L.Y., J.H.G., J.D. and D.G.L. performed analyses on 3D chromatin organization. F.M. and D.S.R. designed and performed synteny and comparative analyses. M.M.H., F.M. and T.G. performed phylogenetic and phylogenomic analyses. E.d.l.C.-M., C.P., S.N., R.D.A., J.J.T., I.C., L.G.-F., I.S. and J.L.-R. performed and analysed transgenics, ATAC–seq, RNA-seq and Hi-C experiments. V.A.S. and C.H. performed and analysed additional RNA-seq experiments. F.D. and A.V. performed additional functional assays. K.S., P.E.D., A.G.-R. and O.B. performed and analysed DNA methylation experiments. D.N., A.A. and T.N. conducted embryonic experiments of skates and sharks. J.L.G.-S., J.J.T., F.M., D.S.R. and D.G.L. wrote the manuscript with input from all of the authors.
Peer review
Peer review information
Nature thanks Chris Amemiya and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
Raw and processed sequencing data were deposited at the Gene Expression Omnibus (GEO; GSE188980 and GSE190730) and SRA (PRJNA783899). Mouse hindlimb RNA-seq data used for comparative analyses are publicly available at the GEO (GSE104459) and mouse forelimb RNA-seq data at the GEO (GSE136437). Zebrafish and elephant shark bisulphite sequencing data used for comparison were downloaded from NCBI BioProject (PRJNA379367) and the GEO (GSE122723136), respectively. Skate RNA-seq data are publicly available at NCBI BioProject (PRJNA288370 and PRJNA686126).
Code availability
Code used is available at GitLab (https://gitlab.com/skategenome).
Competing interests
J.D. is on the scientific advisory board of Arima Genomics and of Omega Therapeutics. The other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Ferdinand Marlétaz, Elisa de la Calle-Mustienes, Rafael D. Acemel, Christina Paliou
These authors jointly supervised this work: Tetsuya Nakamura, Juan J. Tena, Darío G. Lupiáñez, Daniel S. Rokhsar, José Luis Gómez-Skarmeta
Deceased: José Luis Gómez-Skarmeta
Contributor Information
Ferdinand Marlétaz, Email: ferdinand.marletaz@gmail.com.
Tetsuya Nakamura, Email: nakamura@dls.rutgers.edu.
Juan J. Tena, Email: jjtenagu@upo.es
Darío G. Lupiáñez, Email: Dario.Lupianez@mdc-berlin.de
Daniel S. Rokhsar, Email: dsrokhsar@gmail.com
Extended data
is available for this paper at 10.1038/s41586-023-05868-1.
Supplementary information
The online version contains supplementary material available at 10.1038/s41586-023-05868-1.
References
- 1.Nakamura T, et al. Molecular mechanisms underlying the exceptional adaptations of batoid fins. Proc. Natl Acad. Sci. USA. 2015;112:15940–15945. doi: 10.1073/pnas.1521818112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Turner N, et al. The evolutionary origins and diversity of the neuromuscular system of paired appendages in batoids. Proc. Biol. Sci. 2019;286:20191571. doi: 10.1098/rspb.2019.1571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Shimeld SM, Holland PW. Vertebrate innovations. Proc. Natl Acad. Sci. USA. 2000;97:4449–4452. doi: 10.1073/pnas.97.9.4449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Simakov O, et al. Deeply conserved synteny and the evolution of metazoan chromosomes. Sci. Adv. 2022;8:eabi5884. doi: 10.1126/sciadv.abi5884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Touceda-Suárez, M. et al. Ancient genomic regulatory blocks are a source for regulatory gene deserts in vertebrates after whole genome duplications. Mol. Biol. Evol.10.1093/molbev/msaa123 (2020). [DOI] [PMC free article] [PubMed]
- 6.Marlétaz F, et al. Amphioxus functional genomics and the origins of vertebrate gene regulation. Nature. 2018;564:64–70. doi: 10.1038/s41586-018-0734-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Dixon JR, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376–380. doi: 10.1038/nature11082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nora EP, et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485:381–385. doi: 10.1038/nature11049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Berthelot C, Muffato M, Abecassis J, Roest Crollius H. The 3D organization of chromatin explains evolutionary fragile genomic regions. Cell Rep. 2015;10:1913–1924. doi: 10.1016/j.celrep.2015.02.046. [DOI] [PubMed] [Google Scholar]
- 10.Lupiáñez DG, et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell. 2015;161:1012–1025. doi: 10.1016/j.cell.2015.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Real, F. M. et al. The mole genome reveals regulatory rearrangements associated with adaptive intersexuality. Science10.1126/science.aaz2582 (2020). [DOI] [PMC free article] [PubMed]
- 12.Acemel RD, Gómez-Skarmeta JL. Reprogramming nuclear architecture: just a TAD. Cell Stem Cell. 2019;24:679–681. doi: 10.1016/j.stem.2019.04.007. [DOI] [PubMed] [Google Scholar]
- 13.Rowley MJ, Corces VG. Organizational principles of 3D genome architecture. Nat. Rev. Genet. 2018;19:789–800. doi: 10.1038/s41576-018-0060-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Stingo V, Rocco L. Selachian cytogenetics: a review. Genetica. 2001;111:329–347. doi: 10.1023/A:1013747215866. [DOI] [PubMed] [Google Scholar]
- 15.Hirschberger C, Sleight VA, Criswell KE, Clark SJ, Gillis JA. Conserved and unique transcriptional features of pharyngeal arches in the skate (Leucoraja erinacea) and evolution of the jaw. Mol. Biol. Evol. 2021;38:4187–4204. doi: 10.1093/molbev/msab123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chorostecki U, Molina M, Pryszcz LP, Gabaldón T. MetaPhOrs 2.0: integrative, phylogeny-based inference of orthology and paralogy across the tree of life. Nucleic Acids Res. 2020;48:W553–W557. doi: 10.1093/nar/gkaa282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Fuentes D, et al. PhylomeDB V5: an expanding repository for genome-wide catalogues of annotated gene phylogenies. Nucleic Acids Res. 2021;50:D1062–D1068. doi: 10.1093/nar/gkab966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Irisarri I, et al. Phylotranscriptomic consolidation of the jawed vertebrate timetree. Nat. Ecol. Evol. 2017;1:1370–1378. doi: 10.1038/s41559-017-0240-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hara Y, et al. Shark genomes provide insights into elasmobranch evolution and the origin of vertebrates. Nat. Ecol. Evol. 2018;2:1761–1771. doi: 10.1038/s41559-018-0673-5. [DOI] [PubMed] [Google Scholar]
- 20.Kuraku S. Shark and ray genomics for disentangling their morphological diversity and vertebrate evolution. Dev. Biol. 2021;477:262–272. doi: 10.1016/j.ydbio.2021.06.001. [DOI] [PubMed] [Google Scholar]
- 21.Duret L, Galtier N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genomics Hum. Genet. 2009;10:285–311. doi: 10.1146/annurev-genom-082908-150001. [DOI] [PubMed] [Google Scholar]
- 22.Perry BW, Schield DR, Adams RH, Castoe TA. Microchromosomes exhibit distinct features of vertebrate chromosome structure and function with underappreciated ramifications for genome evolution. Mol. Biol. Evol. 2021;38:904–910. doi: 10.1093/molbev/msaa253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Simakov O, et al. Deeply conserved synteny resolves early events in vertebrate evolution. Nat. Ecol. Evol. 2020;4:820–830. doi: 10.1038/s41559-020-1156-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nakatani Y, Takeda H, Kohara Y, Morishita S. Reconstruction of the vertebrate ancestral genome reveals dynamic genome reorganization in early vertebrates. Genome Res. 2007;17:1254–1265. doi: 10.1101/gr.6316407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Nakatani Y, et al. Reconstruction of proto-vertebrate, proto-cyclostome and proto-gnathostome genomes provides new insights into early vertebrate evolution. Nat. Commun. 2021;12:4489. doi: 10.1038/s41467-021-24573-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Thompson AW, et al. The bowfin genome illuminates the developmental evolution of ray-finned fishes. Nat. Genet. 2021;53:1373–1384. doi: 10.1038/s41588-021-00914-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dalloul RA, et al. Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol. 2010;8:e1000475. doi: 10.1371/journal.pbio.1000475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhang Y, et al. The white-spotted bamboo shark genome reveals chromosome rearrangements and fast-evolving immune genes of cartilaginous fish. iScience. 2020;23:101754. doi: 10.1016/j.isci.2020.101754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Mitros T, et al. A chromosome-scale genome assembly and dense genetic map for Xenopus tropicalis. Dev. Biol. 2019;452:8–20. doi: 10.1016/j.ydbio.2019.03.015. [DOI] [PubMed] [Google Scholar]
- 30.Hoencamp C, et al. 3D genomics across the tree of life reveals condensin II as a determinant of architecture type. Science. 2021;372:984–989. doi: 10.1126/science.abe2218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Rowley MJ, et al. Evolutionarily conserved principles predict 3D chromatin organization. Mol. Cell. 2017;67:837–852. doi: 10.1016/j.molcel.2017.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Acemel RD, et al. A single three-dimensional chromatin compartment in amphioxus indicates a stepwise evolution of vertebrate Hox bimodal regulation. Nat. Genet. 2016;48:336–341. doi: 10.1038/ng.3497. [DOI] [PubMed] [Google Scholar]
- 33.Gibson-Brown JJ, et al. Evidence of a role for T-box genes in the evolution of limb morphogenesis and the specification of forelimb/hindlimb identity. Mech. Dev. 1996;56:93–101. doi: 10.1016/0925-4773(96)00514-X. [DOI] [PubMed] [Google Scholar]
- 34.Pradeepa MM, Sutherland HG, Ule J, Grimes GR, Bickmore WA. Psip1/Ledgf p52 binds methylated histone H3K36 and splicing factors and contributes to the regulation of alternative splicing. PLoS Genet. 2012;8:e1002717. doi: 10.1371/journal.pgen.1002717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Onimaru K, et al. Developmental hourglass and heterochronic shifts in fin and limb development. eLife. 2021;10:e62865. doi: 10.7554/eLife.62865. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wang JS, Infante CR, Park S, Menke DB. PITX1 promotes chondrogenesis and myogenesis in mouse hindlimbs through conserved regulatory targets. Dev. Biol. 2018;434:186–195. doi: 10.1016/j.ydbio.2017.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.DeLaurier A, Schweitzer R, Logan M. Pitx1 determines the morphology of muscle, tendon, and bones of the hindlimb. Dev. Biol. 2006;299:22–34. doi: 10.1016/j.ydbio.2006.06.055. [DOI] [PubMed] [Google Scholar]
- 38.Swenson JD, Klomp J, Fisher RA, Crow KD. How the devil ray got its horns: the evolution and development of cephalic lobes in myliobatid stingrays (Batoidea: Myliobatidae) Front. Ecol. Evol. 2018;6:181. doi: 10.3389/fevo.2018.00181. [DOI] [Google Scholar]
- 39.Barry SN, Crow KD. The role of HoxA11 and HoxA13 in the evolution of novel fin morphologies in a representative batoid (Leucoraja erinacea) Evodevo. 2017;8:24. doi: 10.1186/s13227-017-0088-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lopez-Rios J, et al. GLI3 constrains digit number by controlling both progenitor proliferation and BMP-dependent exit to chondrogenesis. Dev. Cell. 2012;22:837–848. doi: 10.1016/j.devcel.2012.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tanaka M. Fins into limbs: autopod acquisition and anterior elements reduction by modifying gene networks involving 5′Hox, Gli3, and Shh. Dev. Biol. 2016;413:1–7. doi: 10.1016/j.ydbio.2016.03.007. [DOI] [PubMed] [Google Scholar]
- 42.Bastida MF, et al. The formation of the thumb requires direct modulation of Gli3 transcription by Hoxa13. Proc. Natl Acad. Sci. USA. 2020;117:1090–1096. doi: 10.1073/pnas.1919470117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Amin S, et al. Hoxa2 selectively enhances Meis binding to change a branchial arch ground state. Dev. Cell. 2015;32:265–277. doi: 10.1016/j.devcel.2014.12.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Fromental-Ramain C, et al. Hoxa-13 and Hoxd-13 play a crucial role in the patterning of the limb autopod. Development. 1996;122:2997–3011. doi: 10.1242/dev.122.10.2997. [DOI] [PubMed] [Google Scholar]
- 45.Sheth R, et al. Distal limb patterning requires modulation of cis-regulatory activities by HOX13. Cell Rep. 2016;17:2913–2926. doi: 10.1016/j.celrep.2016.11.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Nakamura T, Gehrke AR, Lemberg J, Szymaszek J, Shubin NH. Digits and fin rays share common developmental histories. Nature. 2016;537:225–228. doi: 10.1038/nature19322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Freitas R, Gómez-Marín C, Wilson JM, Casares F, Gómez-Skarmeta JL. Hoxd13 contribution to the evolution of vertebrate appendages. Dev. Cell. 2012;23:1219–1229. doi: 10.1016/j.devcel.2012.10.015. [DOI] [PubMed] [Google Scholar]
- 48.Letelier J, et al. The Shh/Gli3 gene regulatory network precedes the origin of paired fins and reveals the deep homology between distal fins and digits. Proc. Natl Acad. Sci. USA. 2021;118:e2100575118. doi: 10.1073/pnas.2100575118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Bogdanović O, et al. Active DNA demethylation at enhancers during the vertebrate phylotypic period. Nat. Genet. 2016;48:417–426. doi: 10.1038/ng.3522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Hon GC, et al. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nat. Genet. 2013;45:1198–1206. doi: 10.1038/ng.2746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kragesteen BK, et al. Dynamic 3D chromatin architecture contributes to enhancer specificity and limb morphogenesis. Nat. Genet. 2018;50:1463–1473. doi: 10.1038/s41588-018-0221-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Venkatesh B, et al. Elephant shark genome provides unique insights into gnathostome evolution. Nature. 2014;505:174–179. doi: 10.1038/nature12826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Krefting J, Andrade-Navarro MA, Ibn-Salem J. Evolutionary stability of topologically associating domains is associated with conserved gene regulation. BMC Biol. 2018;16:87. doi: 10.1186/s12915-018-0556-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Schenkelaars Q, Fierro-Constain L, Renard E, Borchiellini C. Retracing the path of planar cell polarity. BMC Evol. Biol. 2016;16:69. doi: 10.1186/s12862-016-0641-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Maxwell EE, Fröbisch NB, Heppleston AC. Variability and conservation in late chondrichthyan development: ontogeny of the winter skate (Leucoraja ocellata) Anat. Rec. 2008;291:1079–1087. doi: 10.1002/ar.20719. [DOI] [PubMed] [Google Scholar]
- 56.Carrier, J. C., Musick, J. A. & Heithaus, M. R. Biology of Sharks and Their Relatives 2nd edn (CRC Press, 2012).
- 57.Kvon EZ, et al. Progressive loss of function in a limb enhancer during snake evolution. Cell. 2016;167:633–642. doi: 10.1016/j.cell.2016.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Leal F, Cohn MJ. Loss and re-emergence of legs in snakes by modular evolution of Sonic hedgehog and HOXD enhancers. Curr. Biol. 2016;26:2966–2973. doi: 10.1016/j.cub.2016.09.020. [DOI] [PubMed] [Google Scholar]
- 59.Lopez-Rios J, et al. Attenuated sensing of SHH by Ptch1 underlies evolution of bovine limbs. Nature. 2014;511:46–51. doi: 10.1038/nature13289. [DOI] [PubMed] [Google Scholar]
- 60.Enny A, Flaherty K, Mori S, Turner N, Nakamura T. Developmental constraints on fin diversity. Dev. Growth Differ. 2020;62:311–325. doi: 10.1111/dgd.12670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Gehrke AR, et al. Deep conservation of wrist and digit enhancers in fish. Proc. Natl Acad. Sci. USA. 2015;112:803–808. doi: 10.1073/pnas.1420208112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ranallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 2020;11:1432. doi: 10.1038/s41467-020-14998-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ye C, Hill CM, Wu S, Ruan J, Ma ZS. DBG2OLC: efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies. Sci. Rep. 2016;6:31900. doi: 10.1038/srep31900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 66.Roach MJ, Schmidt SA, Borneman AR. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinform. 2018;19:460. doi: 10.1186/s12859-018-2485-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Putnam NH, et al. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. 2016;26:342–350. doi: 10.1101/gr.193474.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Dudchenko O, et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356:92–95. doi: 10.1126/science.aal3327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kerpedjiev P, et al. HiGlass: web-based visual exploration and analysis of genome interaction maps. Genome Biol. 2018;19:125. doi: 10.1186/s13059-018-1486-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.English AC, et al. Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS ONE. 2012;7:e47768. doi: 10.1371/journal.pone.0047768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Pertea M, et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015;33:290–295. doi: 10.1038/nbt.3122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Niknafs YS, Pandian B, Iyer HK, Chinnaiyan AM, Iyer MK. TACO produces robust multisample transcriptome assemblies from RNA-seq. Nat. Methods. 2017;14:68–70. doi: 10.1038/nmeth.4078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Grabherr MG, et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wu TD, Reeder J, Lawrence M, Becker G, Brauer MJ. GMAP and GSNAP for genomic sequence alignment: enhancements to speed, accuracy, and functionality. Methods Mol. Biol. 2016;1418:283–334. doi: 10.1007/978-1-4939-3578-9_15. [DOI] [PubMed] [Google Scholar]
- 76.Venturini L, Caim S, Kaithakottil GG, Mapleson DL, Swarbreck D. Leveraging multiple transcriptome assembly methods for improved gene structure annotation. Gigascience. 2018;7:giy093. doi: 10.1093/gigascience/giy093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Stanke M, et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 2006;34:W435–W439. doi: 10.1093/nar/gkl200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Mapleson D, Venturini L, Kaithakottil G, Swarbreck D. Efficient and accurate detection of splice junctions from RNA-seq with Portcullis. Gigascience. 2018;7:giy131. doi: 10.1093/gigascience/giy131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Slater GSC, Birney E. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 2005;6:31. doi: 10.1186/1471-2105-6-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Haas BJ, et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008;9:R7. doi: 10.1186/gb-2008-9-1-r7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Roth ACJ, Gonnet GH, Dessimoz C. Algorithm of OMA for large-scale orthology inference. BMC Bioinform. 2008;9:518. doi: 10.1186/1471-2105-9-518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Marlétaz F, Peijnenburg KTCA, Goto T, Satoh N, Rokhsar DS. A new spiralian phylogeny places the enigmatic arrow worms among gnathiferans. Curr. Biol. 2019;29:312–318. doi: 10.1016/j.cub.2018.11.042. [DOI] [PubMed] [Google Scholar]
- 83.Eddy SR. Accelerated profile HMM searches. PLoS Comput. Biol. 2011;7:e1002195. doi: 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Katoh K, Misawa K, Kuma K-I, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–3066. doi: 10.1093/nar/gkf436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Criscuolo A, Gribaldo S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 2010;10:210. doi: 10.1186/1471-2148-10-210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Lartillot N, Lepage T, Blanquart S. PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics. 2009;25:2286–2288. doi: 10.1093/bioinformatics/btp368. [DOI] [PubMed] [Google Scholar]
- 87.Benton, M. J., Donoghue, P. C. J. & Asher, R. J. in The Timetree Of Life (ed. Kumar, S. B. H.) 35–86 (Oxford Univ. Press, 2009).
- 88.Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 2017;35:1026–1028. doi: 10.1038/nbt.3988. [DOI] [PubMed] [Google Scholar]
- 89.Derelle R, Philippe H, Colbourne JK. Broccoli: combining phylogenetic and network analyses for orthology assignment. Mol. Biol. Evol. 2020;37:3389–3396. doi: 10.1093/molbev/msaa159. [DOI] [PubMed] [Google Scholar]
- 90.Morel B, Kozlov AM, Stamatakis A, Szöllősi GJ. GeneRax: a tool for species-tree-aware maximum likelihood-based gene family tree inference under gene duplication, transfer, and loss. Mol. Biol. Evol. 2020;37:2763–2774. doi: 10.1093/molbev/msaa141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Belaghzal H, Dekker J, Gibcus JH. Hi-C 2.0: an optimized Hi-C procedure for high-resolution genome-wide mapping of chromosome conformation. Methods. 2017;123:56–65. doi: 10.1016/j.ymeth.2017.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Franke M, et al. CTCF knockout in zebrafish induces alterations in regulatory landscapes and developmental gene expression. Nat. Commun. 2021;12:5415. doi: 10.1038/s41467-021-25604-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Rao SSP, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Durand NC, et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016;3:95–98. doi: 10.1016/j.cels.2016.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Kruse K, Hug CB, Vaquerizas JM. FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data. Genome Biol. 2020;21:303. doi: 10.1186/s13059-020-02215-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Crane E, et al. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature. 2015;523:240–244. doi: 10.1038/nature14450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Frith MC, et al. Detection of functional DNA motifs via statistical over-representation. Nucleic Acids Res. 2004;32:1372–1381. doi: 10.1093/nar/gkh299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Grant CE, Bailey TL, Noble WS. FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011;27:1017–1018. doi: 10.1093/bioinformatics/btr064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Barrows, T. C. A. profileplyr (Bioconductor, 2019); 10.18129/B9.BIOC.PROFILEPLYR
- 102.Wolff J, Backofen R, Grüning B. Loop detection using Hi-C data with HiCExplorer. Gigascience. 2022;11:giac061. doi: 10.1093/gigascience/giac061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Mumbach MR, et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods. 2016;13:919–922. doi: 10.1038/nmeth.3999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Serra F, et al. Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors. PLoS Comput. Biol. 2017;13:e1005665. doi: 10.1371/journal.pcbi.1005665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Bhattacharyya S, Chandra V, Vijayanand P, Ay F. Identification of significant chromatin contacts from HiChIP data by FitHiChIP. Nat. Commun. 2019;10:4221. doi: 10.1038/s41467-019-11950-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Lawrence M, et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Ewels PA, et al. The nf-core framework for community-curated bioinformatics pipelines. Nat. Biotechnol. 2020;38:276–278. doi: 10.1038/s41587-020-0439-x. [DOI] [PubMed] [Google Scholar]
- 110.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Alexa, A. & Rahnenfuhrer, J. topGO; 10.18129/B9.BIOC.TOPGO (Bioconductor, 2017).
- 112.Fernández-Miñán A, Bessa J, Tena JJ, Gómez-Skarmeta JL. Assay for transposase-accessible chromatin and circularized chromosome conformation capture, two methods to explore the regulatory landscapes of genes in zebrafish. Methods Cell. Biol. 2016;135:413–430. doi: 10.1016/bs.mcb.2016.02.008. [DOI] [PubMed] [Google Scholar]
- 113.Di Tommaso P, et al. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 2017;35:316–319. doi: 10.1038/nbt.3820. [DOI] [PubMed] [Google Scholar]
- 114.Irimia M, et al. Extensive conservation of ancient microsynteny across metazoans due to cis-regulatory constraints. Genome Res. 2012;22:2356–2367. doi: 10.1101/gr.139725.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Harris, R. S. Improved Pairwise Alignment of Genomic DNA. PhD thesis, Pennsylvania State Univ. (2007).
- 116.Hiller M, et al. Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish. Nucleic Acids Res. 2013;41:e151. doi: 10.1093/nar/gkt557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D. Evolution’s cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc. Natl Acad. Sci. USA. 2003;100:11484–11489. doi: 10.1073/pnas.1932072100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Suarez HG, Langer BE, Ladde P, Hiller M. chainCleaner improves genome alignment specificity and sensitivity. Bioinformatics. 2017;33:1596–1603. doi: 10.1093/bioinformatics/btx024. [DOI] [PubMed] [Google Scholar]
- 119.Yu G, He Q-Y. ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization. Mol. Biosyst. 2016;12:477–479. doi: 10.1039/C5MB00663E. [DOI] [PubMed] [Google Scholar]
- 120.Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Dahn RD, Davis MC, Pappano WN, Shubin NH. Sonic hedgehog function in chondrichthyan fins and the evolution of appendage patterning. Nature. 2006;445:311–314. doi: 10.1038/nature05436. [DOI] [PubMed] [Google Scholar]
- 122.Olsen AM, Westneat MW. StereoMorph: an R package for the collection of 3D landmarks and curves using a stereo camera set‐up. Methods Ecol. Evol. 2015;6:351–356. doi: 10.1111/2041-210X.12326. [DOI] [Google Scholar]
- 123.Baken, E. K., Collyer, M. L., Kaliontzopoulou, A. & Adams, D. C. geomorph v4.0 and gmShiny: Enhanced analytics and a new graphical interface for a comprehensive morphometric experience. Methods Ecol. Evol.12, 2355–2363 (2021).
- 124.Adams, D., Collyer, M., Kaliontzopoulou, A. & Baken, E. geomorph: geometric morphometric analyses of 2D/3D landmark data. R package version 4.0.1 (2021).
- 125.Suster ML, Abe G, Schouw A, Kawakami K. Transposon-mediated BAC transgenesis in zebrafish. Nat. Protoc. 2011;6:1998–2021. doi: 10.1038/nprot.2011.416. [DOI] [PubMed] [Google Scholar]
- 126.Huerta-Cepas J, Capella-Gutierrez S, Pryszcz LP, Marcet-Houben M, Gabaldon T. PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome. Nucleic Acids Res. 2014;42:D897–D902. doi: 10.1093/nar/gkt1177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Katoh K, Kuma K, Toh H, Miyata T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 2005;33:511–518. doi: 10.1093/nar/gki198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Lassmann T, Sonnhammer EL. Kalign—an accurate and fast multiple sequence alignment algorithm. BMC Bioinform. 2005;6:298. doi: 10.1186/1471-2105-6-298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Wallace IM, O’Sullivan O, Higgins DG, Notredame C. M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Res. 2006;34:1692–1699. doi: 10.1093/nar/gkl091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Wehe A, Bansal MS, Burleigh JG, Eulenstein O. DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony. Bioinformatics. 2008;24:1540–1541. doi: 10.1093/bioinformatics/btn230. [DOI] [PubMed] [Google Scholar]
- 134.Urich MA, Nery JR, Lister R, Schmitz RJ, Ecker JR. MethylC-seq library preparation for base-resolution whole-genome bisulfite sequencing. Nat. Protoc. 2015;10:475–483. doi: 10.1038/nprot.2014.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Peat JR, Ortega-Recalde O, Kardailsky O, Hore TA. The elephant shark methylome reveals conservation of epigenetic regulation across jawed vertebrates. F1000Research. 2017;6:526. doi: 10.12688/f1000research.11281.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Skvortsova K, et al. Retention of paternal DNA methylome in the developing zebrafish germline. Nat. Commun. 2019;10:3054. doi: 10.1038/s41467-019-10895-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Chen H, Smith AD, Chen T. WALT: fast and accurate read mapping for bisulfite sequencing. Bioinformatics. 2016;32:3507–3509. doi: 10.1093/bioinformatics/btw490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Ramírez F, Dündar F, Diehl S, Grüning BA, Manke T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 2014;42:W187–W191. doi: 10.1093/nar/gku365. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw and processed sequencing data were deposited at the Gene Expression Omnibus (GEO; GSE188980 and GSE190730) and SRA (PRJNA783899). Mouse hindlimb RNA-seq data used for comparative analyses are publicly available at the GEO (GSE104459) and mouse forelimb RNA-seq data at the GEO (GSE136437). Zebrafish and elephant shark bisulphite sequencing data used for comparison were downloaded from NCBI BioProject (PRJNA379367) and the GEO (GSE122723136), respectively. Skate RNA-seq data are publicly available at NCBI BioProject (PRJNA288370 and PRJNA686126).
Code used is available at GitLab (https://gitlab.com/skategenome).