Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Nov 24.
Published in final edited form as: Cell Stem Cell. 2020 Sep 28;27(5):765–783.e14. doi: 10.1016/j.stem.2020.09.001

Loss of extreme long-range enhancers in human neural crest drives a craniofacial disorder

Hannah K Long 1,2,3, Marco Osterwalder 4,5, Ian C Welsh 6, Karissa Hansen 6, James O J Davies 7, Yiran E Liu 8, Mervenaz Koska 1,2, Alexander T Adams 1,9, Robert Aho 6, Neha Arora 1, Kazuya Ikeda 10, Ruth M Williams 11, Tatjana Sauka-Spengler 11, Matthew H Porteus 10, Tim Mohun 12, Diane E Dickel 4, Tomek Swigut 1, Jim R Hughes 7, Douglas R Higgs 7,13, Axel Visel 4,14,15, Licia Selleri 6, Joanna Wysocka 1,2,3,16,17
PMCID: PMC7655526  NIHMSID: NIHMS1628626  PMID: 32991838

Summary

Non-coding mutations at the far end of a large gene desert surrounding the SOX9 gene result in a human craniofacial disorder called Pierre Robin sequence (PRS). Leveraging a human stem cell differentiation model, we identify two clusters of enhancers within the PRS-associated region that regulate SOX9 expression during a restricted window of facial progenitor development, at distances up to 1.45 Mb. Enhancers within the 1.45 Mb cluster exhibit highly synergistic activity that is dependent on the Coordinator motif. Using mouse models, we demonstrate that PRS phenotypic specificity arises from the convergence of two mechanisms: confinement of Sox9 dosage perturbation to developing facial structures through context-specific enhancer activity, and a heightened sensitivity of the lower jaw to Sox9 expression reduction. Overall, we characterize the longest-range human enhancers involved in congenital malformations, directly demonstrate that PRS is an enhanceropathy, and illustrate how small changes in gene expression can lead to morphological variation.

Keywords: Enhancer, Enhanceropathy, Pierre Robin Sequence, SOX9, Gene dosage, Long-range regulation, Neural crest, Craniofacial, Transcription, Non-coding mutation

eTOC Blurb

Non-coding mutations over a megabase from SOX9 cause the craniofacial disorder Pierre Robin sequence (PRS). Long et al. leverage a human neural crest model to demonstrate that PRS is caused by loss of extreme long-range enhancers active during a restricted developmental window, and explore mechanisms underlying specificity of disease manifestations.

Graphical Abstract

graphic file with name nihms-1628626-f0001.jpg

Introduction

Distal regulatory sequences called enhancers control gene transcription at a distance and play a critical role in directing developmental gene expression patterns (Long et al., 2016). Increasingly, non-coding mutations are being implicated in human disease (Franke et al., 2016; Laugsch et al., 2019; Lupiáñez et al., 2015) and, in particular, perturbations of enhancers have been documented as causative due to their effects on gene regulation during development (Spitz, 2016). While mutations of protein-coding sequences often affect multiple tissues in which a given gene is active, mutations in non-coding regulatory regions can perturb target gene expression selectively in specific tissue contexts. For example, SOX9 is an HMG-box transcription factor that plays numerous important roles during embryogenesis including sex determination, chondrogenesis and craniofacial development (Lee and Saint-Jeannet, 2011; Lefebvre and Dvir-Ginzberg, 2016). Heterozygous loss-of-function mutations in the SOX9 coding sequence cause a severe congenital disorder called campomelic dysplasia, which is associated with bowed long-limbs, disorders of sex determination, and craniofacial defects (Wagner et al., 1994). Interestingly, SOX9 is the sole protein-coding gene within an unusually large ~2Mb topologically associating domain (TAD) (Bagheri-Fam et al., 2006; Gordon et al., 2009). A multitude of non-coding mutations have been described within this gene desert, including large deletions, translocations, and duplications, which cause a range of defects that recapitulate distinct aspects, but not all features, of campomelic dysplasia leading to the hypothesis that cell-type specific enhancers are disrupted in these tissue-selective disorders (Baetens et al., 2017; Kurth et al., 2009; Sanchez-Castro et al., 2013). In some cases, the perturbed enhancers have been mapped and characterised, for example an SRY-responsive regulatory element essential for sex determination (Gonen et al., 2018).

A cluster of large genomic deletions and translocation breakpoints at the centromeric far end of the SOX9 TAD are associated with isolated Pierre Robin sequence (PRS), a congenital craniofacial disorder characterized by a single primary phenotype: underdevelopment of the lower jaw or mandible (micrognathia) that leads to secondary phenotypes including retraction of the tongue (glossoptosis), obstruction of the airways and, with incomplete penetrance, horseshoe-shaped cleft palate (Paletta et al., 1994; Tan and Farlie, 2013). This sequence of anomalies, in turn, results in feeding and breathing difficulties and failure to thrive (Rathé et al., 2015). It has been proposed that PRS-associated mutations perturb the function of key SOX9 long-range enhancers active during craniofacial development (Amarillo et al., 2013; Benko et al., 2009; Gordon et al., 2009, 2014), however functional characterization of putative craniofacial enhancers and direct demonstration that SOX9 is the target gene are still lacking. Given the specificity of the developmental defects in PRS and the well-documented requirement for SOX9 function in the neural crest (Cheung and Briscoe, 2003; Mori-Akiyama et al., 2003; Spokony et al., 2002), we hypothesized that the centromeric far end of the SOX9 TAD harbors enhancers active in cranial neural crest cells (CNCCs), a transient population of multipotent progenitor cells that give rise to the majority of vertebrate craniofacial structures, including the jaw (Bronner and LeDouarin, 2012; Minoux and Rijli, 2010; Trainor et al., 2003).

Leveraging a well-characterized in vitro differentiation model of human CNCCs (hCNCCs) (Bajpai et al., 2010; Prescott et al., 2015; Rada-Iglesias et al., 2012) we uncover two clusters of hCNCC-specific enhancers overlapping PRS mutations and demonstrate that they regulate SOX9 transcription within a defined developmental window and over extremely large genomic distances of 1.45 Mb and 1.25 Mb, respectively. To model the sensitivity of craniofacial development to changes in Sox9 gene dosage we generate an allelic series in mice with increasing severity of Sox9 perturbation. Together, we propose a mechanism of disease aetiology, whereby two features of Sox9 regulation converge to confine disease phenotypes to the lower jaw. Firstly, loss of the tissue-specific activity of PRS locus enhancers restricts Sox9 dosage perturbation to the developing facial structures, and secondly, a heightened sensitivity of the lower jaw to Sox9 level reduction further confines PRS-associated malformations.

Results

Three clusters of candidate human cranial neural crest enhancers overlap sequences lost in PRS

Many large non-coding deletions identified in PRS patients map to the SOX9 locus, yet are mostly non-overlapping, suggesting the presence of multiple regulatory elements with non-redundant functions whose loss leads to similar phenotypic outcomes (Amarillo et al., 2013; Benko et al., 2009; Gordon et al., 2014). Additionally, numerous translocation breakpoints have been identified that displace much of the distal SOX9 gene desert away from the remainder of the locus (Figure 1A) (Benko et al., 2009). To identify candidate hCNCC enhancer elements that map within regions of the SOX9 gene desert lost in PRS patients, we used ChIP-seq and ATAC-seq datasets from in vitro derived hCNCCs ((Prescott et al., 2015) and this study) (Figure 1A). Among the candidate enhancers identified within the SOX9 TAD, three enhancer clusters were located at the far centromeric end of the SOX9 gene desert upstream of the PRS translocation breakpoint region and overlapped with at least one of the large deletions seen in patients with PRS. Each cluster contained two or more discrete binding peaks for the general coactivator p300, was enriched for the active enhancer marks H3K27ac and H3K4me1 and corresponded to regions of open chromatin (Prescott et al., 2015) (Figures 1A and 2A) (Calo and Wysocka, 2013). All three putative enhancer clusters were located over 1Mb upstream of the SOX9 gene (Figure 1A), and were named to reflect their genomic arrangement: enhancer cluster 1.45 (EC1.45) is 1.45Mb upstream of SOX9, EC1.35 is 1.35Mb upstream of SOX9 and EC1.25 is 1.25Mb upstream of SOX9 (Figures 1A and 2A).

Figure 1: Human cranial neural crest-specific enhancers are associated with Pierre Robin sequence patient mutations.

Figure 1:

(A) ChIP-seq and ATAC-seq for hESCs (upper) and P4 hCNCCs (lower) at human SOX9 locus. Three putative hCNCC-specific enhancer clusters overlap the PRS locus: EC1.45, EC1.35 and EC1.25. PRS patient deletions (red), translocation breakpoints (blue), topological domains (TADs) (Dixon et al., 2012) and protein-coding genes are shown. Centro, centromeric; telo, telomeric.

(B) Capture-C from the SOX9 promoter (see anchor) in hESC, neuroectodermal spheres (NEC), early (D11) and late (P4) hCNCCs.

See also Figures S1 and S2.

Figure 2: PRS locus enhancer clusters EC1.45 and EC1.25 are active in hCNCCs and during mouse craniofacial development.

Figure 2:

(A) ChIP-seq and ATAC-seq for PRS locus putative enhancer clusters EC1.45 (p300 peaks P1 and P2), EC1.35 (S1 and S2) and EC1.25 (S3, S4, S5 and S6).

(B) Luciferase reporter assays for EC1.45, EC1.35 (S1-S2) and EC1.25 (S3-S4-S5-S6) in hCNCC (left) and hESC (right).

(C) Schematic outlining craniofacial domains at E9.5 and E11.5. BA1–2, branchial arch 1–2; FNP, frontonasal prominence; LNP, lateral nasal process; MdP, mandibular process; MNP, medial nasal process; MxP, maxillary process.

(D) In situ hybridization (ISH) for Sox9 at E9.5 and E11.5.

(E) Mouse lacZ reporter assay for EC1.45, EC1.35 (S1-S2) and EC1.25 (S3, S4, S5, S6 tested individually) at E9.5 and E11.5.

(F) HREM for EC1.45 LacZ reporter embryo at E11.5 (frontal view, upper; parasagittal section, lower). White arrow, activity in MdP.

See also Figure S3 and Tables S12.

Importantly, the three clusters of putative enhancers were not marked by active chromatin marks in human embryonic stem cells (hESC) (Figure 1A), nor other available profiled cell-types (Figure S1A), except for human fetal craniofacial tissues (Figure S1B) (Wilderman et al., 2018), suggesting that the putative enhancers exhibit cell type-specific activity in the neural crest and developing face. Indeed, activation of these putative enhancers coincided with a strong increase in SOX9 expression during the transition from hESCs through NEC to hCNCCs (Figures S1CE). Together, epigenomic signatures identified three putative hCNCC enhancer clusters overlapping sequences lost in PRS patients.

PRS region candidate enhancer clusters make long-range contacts with the SOX9 promoter

To determine whether the PRS region candidate enhancer clusters make contact with the SOX9 promoter over more than a megabase of genomic space, we performed SOX9 promoter-anchored Capture-C assays (Davies et al., 2015) in hESCs, neuroectodermal spheres, early migrating hCNCCs (hereafter called early hCNCC) or late-passage hCNCCs (hereafter called late hCNCC) (Prescott et al., 2015). In hESCs, the SOX9 promoter formed contacts that spanned the previously defined TAD (Dixon et al., 2012, 2015), with the majority of interactions confined to the telomeric side, and also showed frequent interactions with CTCF/cohesin sites across the locus (Figures 1AB and S1F). A strong shift in interaction frequencies was apparent during the hCNCC differentiation. In particular, extreme long-range interactions with the EC1.45, EC1.35 and EC1.25 putative enhancer clusters at the far centromeric end of the TAD substantially increased in late hCNCCs as compared to hESCs (Figures 1B and S2AB), albeit the dynamics for each of the enhancer clusters were distinct. Specifically, EC1.35 already contacted the SOX9 promoter in hESCs (Figures S2AB) and notably was occupied by CTCF and cohesin in both hESCs and hCNCCs. This was mirrored by a similarly bound CTCF site 2kb upstream of the SOX9 promoter, suggesting that these genome organizing proteins may facilitate a developmentally stable long-range interaction between the SOX9 promoter and the distal region of the TAD (Arzate-Mejía et al., 2018; Guo et al., 2015; Ren et al., 2017; Schoenfelder and Fraser, 2019; Wit et al., 2015). By comparison, EC1.25 and EC1.45 did not interact frequently with the SOX9 promoter in hESCs by Capture-C (Figures S2AB), and only in early and late hCNCCs did contact the frequency increase.

To confirm these cell-type specific interactions, we performed reciprocal Capture-C experiments anchored at each of the three PRS region candidate enhancer clusters. Again, we observed an increase in contact frequencies with the SOX9 promoter in late hCNCCs compared to hESCs (Figure S2C). Importantly, Capture-C performed from other gene promoters in nearby TADs, including KCNJ2, COG1 and SDK2 did not reveal interaction with the PRS region putative enhancer clusters, and did not cross the SOX9 TAD boundaries (Figure S1F). Together, the extreme long-range candidate enhancers at the PRS locus make selective contacts with the SOX9 gene promoter in a disease-relevant cell type.

Two PRS locus candidate enhancer clusters drive reporter gene expression in human CNCCs and in developing mouse facial structures

To investigate the regulatory potential of the putative PRS region enhancer clusters, we tested their capacity to activate transcription in a luciferase assay. We cloned the entire human EC1.45 region, including both p300 peaks, upstream of a luciferase reporter gene with an SV40 minimal promoter. Given their greater size, we combined the two p300 peaks for EC1.35 (S1-S2) and the four p300 peaks for EC1.25 (S3-S6) (Figure 2A). EC1.45 and EC1.25 were found to be extremely strong drivers of transcription in hCNCCs, rivalling the activity of the viral SV40 enhancer positive control (Figure 2B, left panel). However, despite harboring epigenetic marks suggestive of active enhancer identity (albeit weaker than EC1.45 or EC1.25), EC1.35 was not active in the luciferase assay suggesting it is not a strong driver of transcription, at least in the examined context. As expected, none of the three enhancer clusters was active in hESCs (Figure 2B, right panel).

To characterize the spatiotemporal activity of the PRS locus enhancers during development, we utilized an in vivo LacZ enhancer reporter assay at two mouse embryonic stages, gestational days (E)9.5 and E11.5 (Figure 2C). Both human EC1.45 and a number of the constituent p300 peaks for human EC1.25 were active during mouse development, exhibiting reproducible activity patterns that mirrored distinct spatiotemporal subdomains of endogenous SOX9 craniofacial expression (Figure 2D). This included activity in embryonic domains that will form the mandible: the first branchial arch at E9.5 (i.e. S3 and S5 of EC1.25) and the mandibular process at E11.5 (i.e. EC1.45 and S3 and S6 of EC1.25; Figures 2C, 2E and S3AF and Tables S12). Mandibular activity of EC1.45 was further confirmed by high resolution episcopic microscopy (HREM) (Figures 2F and S3GH). Similar to the in vitro luciferase assay, human EC1.35 did not display activity in the developing facial structures nor any other tissues at either developmental stage (Figure 2E).

Heterozygous ablation of PRS region enhancer clusters causes allele-specific reduction in SOX9 expression

To directly characterize the contribution of the human EC1.45 and EC1.25 enhancers to SOX9 gene regulation during hCNCC differentiation, we generated hESC lines with heterozygous deletions of either EC1.45 or EC1.25 (Figures 3A and S4AD) (Ikeda et al., 2018). To determine the impact of these deletions on SOX9 expression, we developed an allele-specific reverse transcriptase digital droplet PCR (RT-ddPCR) assay that distinguished a single nucleotide polymorphism (SNP) in the 3’ UTR of the SOX9 gene (T or C) (Figures 3B and S4E) and linked this in cis to the presence or absence of EC1.45 or EC1.25 via genome-wide phasing (Table S3).

Figure 3: Heterozygous PRS enhancer deletion in vitro impacts SOX9 expression during a restricted window of development.

Figure 3:

(A) Overview of differentiation, including early hCNCCs at day 11 (D11), passage 1–2 early hCNCCs (P1–2), P3–4 late hCNCCs, and chondrocytes at day 5 and 9.

(B) Schematic of allele-specific RT-ddPCR indicating primers and LNA probes (HEX/FAM) for T/C SNP (rs74999341) in SOX9 3’UTR. Wildtype (left), heterozygous EC1.45 deletion (right).

(C) RT-ddPCR for wildtype (green boxplot) and EC1.45 heterozygous deletion (red), plotting SOX9 C:T expression ratio.

(D) ATAC-seq reveals hCNCC-specific accessibility for EC1.45. Representative traces from 3–4 replicates.

(E) Luciferase assay for late hCNCCs (left) and chondrocytes (right). A COL2A1 enhancer is active in both cell types, while EC1.45 and EC1.25 become inactive in chondrocytes.

See also Figure S4 and Table S3.

During CNCC differentiation, the two alleles of SOX9 were expressed at nearly equivalent levels in wildtype cells (Figures 3C and S4F, green boxplots), regardless of changes in overall SOX9 expression (Figures S1CE). By contrast, enhancer deletion was associated with a striking allelic skew in SOX9 expression, indicating that loss of either EC1.45 or EC1.25 disrupted normal regulation of SOX9 (Figures 3C and S4F, red boxplots). The effect of EC1.45 enhancer deletion on SOX9 expression was larger than that of EC1.25, especially in P3-P4 late hCNCCs where it led to a 50–55% lower expression of mutated allele. This greater effect on SOX9 expression in late hCNCCs was in keeping with an observed ~4.5 fold increase in EC1.45 activity between P2 and P4 hCNCCs (Figure S4G). Together, EC1.45 and EC1.25 are required for normal expression of SOX9 in cranial neural crest, thereby establishing some of the longest-range functional enhancer-gene interactions reported to date in the human genome.

PRS region enhancers are decommissioned in cranial chondrocytes

SOX9 plays two sequential critical roles in the development of the mandible – firstly in the specification and migration of CNCCs, and secondly during chondrogenesis and formation of Meckel’s cartilage, the developmental precursor of the lower jaw (Amano et al., 2010; Wyganowska-Świątkowska and Przystańska, 2011). We therefore tested whether the EC1.45 and EC1.25 enhancer clusters also regulate SOX9 expression in cranial chondrocytes derived from hCNCCs (Figure 3A; differentiation validated in Figure S4HI). Remarkably, cranial chondrocytes derived from hCNCCs heterozygous for EC1.45 and EC1.25 enhancer deletions did not show allelic skew in SOX9 expression, indicating that the requirement for these enhancers in the regulation of SOX9 transcription is highly cell-type restricted (Figures 3C and S4F). In agreement, ATAC-seq analysis revealed that the EC1.45 and EC1.25 enhancers lost hypersensitivity during differentiation of hCNCCs to chondrocytes (Figures 3D and S4J), and luciferase reporter assays further confirmed that the regulatory potential of EC1.45 and EC1.25 was sharply reduced in chondrocytes (Figure 3E). Together, these data reveal that, despite high SOX9 expression in chondrocytes, the PRS-associated enhancers have a restricted and transient activity during CNCC development and become decommissioned during chondrogenesis, hence defining a developmental window for disease aetiology.

Two short segments act synergistically to drive the majority of EC1.45 enhancer activity

To interrogate sequence features critical for hCNCC-specific activity of the PRS enhancer clusters, we focused on EC1.45, whose deletion is associated with greater allelic imbalance in SOX9 expression in hCNCCs. Firstly, we tested the enhancer activity of the two constituent EC1.45 p300 peaks (Peak1 and Peak2, Figure 4A) in luciferase reporter assays. Intriguingly, individually the two p300 peaks exhibited only weak enhancer activity, while the combined Peak1+Peak2 led to an activation greater than the sum of the two regions, indicative of synergistic activity (Figures S5AC). To further refine regions of enhancer activity within EC1.45, we performed a tiling deletion screen across Peak1+Peak2 (Figure S5A) and identified two minimal regions (overlapping deletions 3–4 in Peak1 and deletions 10–11 in Peak2) whose loss lead to a significant reduction in luciferase reporter activity, min1 and min2 respectively.

Figure 4: Dissection of EC1.45 enhancer region uncovers a core role for the Coordinator motif and TWIST1 binding in developmental enhancer regulation.

Figure 4:

(A) TWIST1 ChIP-seq peaks (marked under track) at EC1.45 overlap p300 peaks Peak1 and Peak2 and minimally active sequences (min1 and min2).

(B) Luciferase assay for EC1.45 min1 and min2 tested separately and combined, along with Coordinator mutant sequences, schematic of constructs on left.

(C) Coordinator motif (upper, Prescott et al, 2015) compared to motif enriched at TWIST1 binding sites in hCNCCs (lower).

(D) Luciferase assay for heterologous enhancer sequence for human min1 plus vertebrate min2, schematic of constructs on left. A scatter plot depicts luciferase signal compared to the sum of Coordinator scores (ANOVA, p = 0.00035, right).

(E) TWIST1 is upregulated during hCNCC differentiation and reduced in chondrocytes (fragments per million, FPM).

(F) Schematic of plasmids, primers and probes for ChIP-ddPCR for wildtype (WT) and Coordinator mutant (4x mut) min1+min2 plasmids. F, forward; R, reverse.

(G) TWIST1 ChIP-ddPCR for P4 late hCNCCs transfected with plasmids in (F), normalized to input, and wildtype adjusted to 1. Two biological replicates are depicted.

See also Figure S5.

Importantly, min1+min2 recapitulated both the activity and the synergy of combined Peak1 and Peak2 and accounted for nearly the full activity of EC1.45 (Figures 4B and S5B). Of interest, three of the constituent putative enhancers from EC1.25 also act independently as enhancers in late hCNCCs, while combination of all four individual elements appears to similarly drive synergistic activation of luciferase expression (Figure S5C). Unsurprisingly, the two constituent EC1.35 p300 peaks were not active enhancers by luciferase assay (Figure S5C). Therefore, we identify two core enhancer elements within EC1.45 (and three within EC1.25) that are individually weak enhancers but work together in a robustly synergistic manner to activate gene expression.

Coordinator motifs are essential for activity and synergy of EC1.45 enhancers

In our previous study investigating sequence features associated with divergence of enhancer activity between human and chimpanzee CNCCs, we identified a long bipartite sequence, which we termed ‘Coordinator’ (Prescott et al., 2015) (Figure 4C, upper). Of all motifs tested, Coordinator had the greatest effect on surrounding chromatin features as well as impacting the highest number of enhancers (Prescott et al., 2015), suggesting a privileged role in the establishment of enhancer competence in CNCCs. Strikingly, there are seven Coordinator motifs within the EC1.45 Peak1 plus Peak2 region, four of which fall within min1+min2 (Figure S5D). Mutations of all four motifs in min1+min2 diminished activity to the level of the empty vector, while mutation of the Coordinator sequence in min1 brought the activity of min1+min2 down to a level similar to min2 alone (Figure 4B). Similarly, mutation of the three Coordinator sequences in min2 brought the activity of min1+min2 down to a level similar to min1 alone (Figure 4B), indicating that the Coordinator motif is essential for both the activity and the synergistic function of the EC1.45 enhancers. A mutation screen of each of the seven motifs within the Peak1+Peak2 region further supported that the most substantial contribution of Coordinator motifs to overall enhancer activity are within the min1 and min2 regions, and revealed that mutation of all seven Coordinator motifs led to a reduction of activity below the baseline level of the minimal promoter control (p=0.027). Notably, this suggests that repressive sequence features exist within the enhancer region that are unmasked by the loss of Coordinator sites (Figure S5E) and may be harboured within the del1-del2 region (Figure S5A, p<0.0063).

Coordinator motif content in deeply conserved region of EC1.45 correlates with enhancer activity across species

Interestingly, EC1.45 min2 is conserved at the sequence level from human to the lobe-finned fish coelacanth, across ~400 million years of evolution (Figure S5FG). To examine the relationship between the Coordinator motif content (estimated from fimo, Figure S5H (Grant et al., 2011)) of orthologous min2 regions and their enhancer activity, we cloned the min2 sequences from mouse, opossum, platypus, chicken, lizard, frog and coelacanth downstream of the human min1 sequence, and assessed their combined activity by luciferase assay (Figure 4D). Strikingly, an increased Coordinator score was associated with increased enhancer activity (Figure 4D, right panel). These changes in activity did not simply recapitulate the phylogenetic relationship between the examined species as, for example, the most distantly related coelacanth sequence was relatively high in both Coordinator content and enhancer activity, suggesting that the presence of Coordinator motif rather than merely evolutionary drift drive the observed changes in activity.

TWIST1 regulates EC1.45 enhancers in a Coordinator-dependent manner

We next sought to uncover the trans-regulatory inputs that control EC1.45 activity. The Coordinator sequence resembles an E-box and Homeobox-like motif, separated by 6 bp, though the factors that bind are heretofore unknown. E-box motifs are recognized by the basic helix-loop-helix (bHLH) transcription factors, and we noted that TWIST1 was among the most highly expressed bHLH factors in hCNCCs. TWIST1 levels tightly coincided with EC1.45 enhancer activity, being strongly upregulated during hCNCC differentiation (with highest expression in late hCNCCs) and downregulated during chondrogenesis (Figure 4E, compare to EC1.45 enhancer activity in Figure 3C). TWIST1 is also known to play an essential role in neural crest biology and craniofacial development (Bildsoe et al., 2009; Parsons et al., 2014; Qin et al., 2011). Furthermore, Twist1 inactivation in NCCs populating the mandibular arch in mice leads to micrognathia and cleft palate (Zhang et al., 2012), phenotypes overlapping those seen in PRS.

Genome-wide analysis of TWIST1 ChIP-seq performed in hCNCCs revealed that the top enriched sequence matched the Coordinator motif (Figure 4C, lower), followed by a canonical TWIST1 E-box motif (Figure S5I), suggesting that in hCNCCs a substantial fraction of TWIST1 chromatin binding occurs in the context of the Coordinator motif. In keeping with the presence of multiple Coordinator motifs, TWIST1 binds to both EC1.45 constituent enhancers in hCNCCs (Figure 4A). To assess whether this binding is dependent on the Coordinator sequence, we developed an episomal TWIST1 ChIP-ddPCR assay that distinguished transfected plasmids containing wildtype or Coordinator mutant min1+min2 sequences (Figure 4F). In this assay, the strong TWIST1 binding observed for the wildtype min1+min2 sequence was greatly diminished by mutation of the Coordinator sequences (Figure 4G). These results establish that TWIST1 binds to the min1 and min2 regulatory sequences in CNCCs in a Coordinator motif-dependent manner.

Mouse mandibular development is highly sensitive to changes in Sox9 gene dosage

With the identification of two enhancer clusters at the PRS locus that regulate SOX9 gene dosage in hCNCCs, we next turned to mouse models to probe the morphological impact of Sox9 dosage perturbation on craniofacial development. Previous work showed that heterozygous deletion of Sox9 recapitulates many aspects of campomelic dysplasia (Bi et al., 2001). To characterize the impact of neural crest-specific Sox9 haploinsufficiency, we crossed mice carrying a floxed (F) Sox9 allele (Akiyama et al., 2002) with mice carrying the second generation Wnt1::Cre2 driver (C) which directs Cre expression in the neural crest just before or during delamination from the neural tube (Lewis et al., 2013) (Figure S6A). Many heterozygous Wnt1::Cre2;Sox9F/+ (CF: Cre Flox) animals died in the neonatal period from postnatal day (P)0–12, or failed to gain weight at the same rate as their wildtype siblings (Figures 5A and S6B). To investigate the cause of neonatal lethality and reduced fitness, we performed microCT to monitor craniofacial skeletal development at E18.5 (Figure 5B). Clefting was detected in the maxilla and palatine bones in 50% of mutant embryos, a phenotype in PRS patients thought to be a secondary consequence of mandibular hypoplasia (Tan and Farlie, 2013). Although this link remains to be established in the mouse, the observed cleft palate is most likely the cause of postnatal lethality due to feeding or breathing difficulties (Tan and Farlie, 2013).

Figure 5: Conditional neural-crest specific Sox9 heterozygous mutant embryos have craniofacial defects and fail to thrive in the neonatal period.

Figure 5:

(A) Boxplot of postnatal growth rate in g/day for mutant Wnt1::Cre2;Sox9F/+ (CF) and wildtype Sox9F/+ (F) pups (ANOVA p-val = 1.793e-07).

(B) MicroCT scans of E18.5 wildtype (upper) and mutant (lower) embryos, maximum intensity projection (left) and segmented hemimandibles (right).

(C) Boxplot of distance measurements for wildtype (F) and mutant mandibles with (CF cleft) and without (CF) cleft palate. Data from two litters (17 embryos).

(D) PCA of mandible landmarks following Procrustes analysis. Mutant (CF) and wildtype (F) mandibles are separated by PC1 regardless of clefting.

(E) Morphometric landmarks for wildtype (upper, F) and mutant (lower, CF) mandibles projected onto a thin plate spline. All 18 landmarks differed significantly between wildtype and mutant mandibles by Hotelling test (p < 0.0006).

(F) Boxplot of distance measurements for wildtype (F) and mutant midfacial elements with (CF cleft) and without (CF) cleft palate. PM, premaxilla; Mx, maxilla; Pal, palatine bones.

(G) PCA of skull landmarks following Procrustes analysis. Mutant skulls without cleft (CF) cluster with wildtype skulls (F).

(H) Wireframe outline of nasal bone, premaxilla, maxilla and palatine bone for half a skull for wildtype (F, dark pink) and mutant skulls without cleft (CF, yellow, left) or with cleft (CF, brown, right).

For PCA, different shape markers represent independent litters.

See also Figure S6 and Table S4.

To further quantify craniofacial defects, we performed morphometric landmarking (Ho et al., 2015; Welsh et al., 2018) focusing first on the mandible, as micrognathia is both a diagnostic characteristic of PRS, as well as a feature of campomelic dysplasia (Foster, 1996; Paletta et al., 1994; Tan and Farlie, 2013) (Figures S6CD). From this analysis, we quantified a reduction in mandible length for Wnt1::Cre2;Sox9F/+ embryos and gross changes in the shape of the ramus, including a dramatically hypoplastic coronoid process and reduced condylar process width, recapitulating aspects of human patient phenotypes with SOX9 haploinsufficiency (Figure 5C). Notably, these differences in mandibular shape and size were fully penetrant in all embryos analyzed regardless of the presence of cleft palate as illustrated by principal component analysis (PCA) based on calculated Procrustes distance (Figures 5CD). The dramatic changes in mandibular morphology can be illustrated by projecting mandibular landmarks onto a thin plate spline (Figure 5E).

We next analyzed the remaining skull morphology (Figures S6EG) and interestingly, while Wnt1::Cre2;Sox9F/+ mutant embryos with cleft palate displayed a number of measurable skull anomalies, we did not detect significant alterations in skull length or width nor midfacial length in mutant embryos without a cleft (Figures 5F and S6H). Indeed, PCA revealed that skull shapes of non-clefted Wnt1::Cre2;Sox9F/+ animals cluster with those of the wildtype embryos and away from the clefted heterozygotes (Figure 5G), with no significant change in overall skull shape (Figure 5H). Therefore, despite the broad expression and function of Sox9 throughout developing craniofacial structures, the mandible exhibits a heightened and fully penetrant sensitivity to a 50% reduction of Sox9 gene dosage during mouse neural crest development compared to other craniofacial structures where phenotypes are of variable expressivity.

Mouse orthologous EC1.45 sequence exhibits conserved spatiotemporal activity pattern but weakened contribution to Sox9 expression relative to human EC1.45

To assess whether the spatiotemporal activity of the EC1.45 and EC1.25 elements was conserved for the orthologous mouse sequences (located 1.21 Mb and 1.04 Mb from the mouse Sox9 promoter respectively; for clarity, we refer to these regions as mEC1.45 and mEC1.25) we again utilised in vivo LacZ reporter assays. Similar to the human sequence, mEC1.45 was active in the frontonasal prominence at E9.5, and also in the maxillary and mandibular processes and limb buds at E11.5 (Figures 6AB and S7AB). Of note, a sub-region of this sequence was previously tested in the VISTA Enhancer Browser (mm628) (Visel et al., 2007) (Figures S7A and S7C). By contrast, for the three human EC1.25 constituent enhancers with craniofacial activity (Figure 2E), there was no reproducible activity for mouse orthologous S3 and S5 enhancers, and a reduced craniofacial activity for the S6 ortholog at E11.5 in a domain not including precursors of the mandible (Figure S7D). To compare the activity of human EC1.45 and its mouse ortholog in a more quantitative assay we performed parallel luciferase assays in hCNCCs. Although mEC1.45 is indeed an active enhancer in hCNCCs, it is a much less potent activator of luciferase expression than the human ortholog (around 15-fold lower), indicating a divergence in enhancer strength (Figure 6C), consistent with a reduced Coordinator content for mouse min2 (Figures 4D and S5H).

Figure 6: Reduction in Sox9 activity impacts mouse craniofacial development in a dosage-dependent manner.

Figure 6:

(A) Schematic of mouse orthologous mEC1.45 with liftover of human EC1.45, Peak1, Peak2, min1 and min2 sequences, and human to mouse MultiZ alignment.

(B) Mouse lacZ reporter assay for mEC1.45 at E11.5.

(C) Luciferase assay for human EC1.45 and mouse mEC1.45.

(D) Location of sgRNAs and Founder 1 and 2 mEC1.45 deletions (aligned with A). (E) Digital droplet RT-PCR for Sox9 from wildtype and mEC1.45del/+ dissected E11.5 craniofacial tissues. Plotted as C:G allelic ratio, mEC1.45 deleted on C-allele. M/LNP, medial and lateral nasal processes; MxP, maxillary process; MdP, mandibular process. * pval < 0.05; ** pval < 0.01.

(F) Schematic of Sox9 heterozygous conditional knock-out Wnt1::Cre2;Sox9F/+ (CFW) and compound heterozygous Wnt1::Cre2;Sox9F/mEC1.45del (CFD) mice with Sox9 deleted in CNCCs on one allele, and mEC1.45 deleted on the other. Purple triangles, loxP sites; neo, neomycin resistance.

(G) Boxplot of postnatal growth rate (P20-P25, g/day) for CFW and CFD animals, ANOVA p-val = 0.01676.

(H) Landmarks for CFW (upper) and CFD (lower) mandibles projected onto a thin plate spline. Landmarks that differ significantly by Hotelling test are highlighted in red (p < 1E-04).

(I) PCA plot of mandible landmarks 12–13 following Procrustes analysis at E18.5 for 5 litters (23 embryos) of CFW (yellow) and CFD (brown) embyros.

(J) Procrustes transformed average mandible wireframes for wildtype (dark pink, FW), CFW (yellow) and CFD (brown) embryos.

(K) Measurements of width and length of the condylar process for CFW (yellow) and CFD (brown) mandibles. For condylar width, ANOVA p-val = 1.52E-07 and for condylar length, ANOVA p-val = 1.11E-04.

(L) As for K, two measurements of mandible length (2–13), ANOVA p-val = 0.00143 and (4–9), ANOVA p-val = 0.00687.

(M) PCA plot of all mandible landmarks following Procrustes analysis for wildtype (mEC1.45+/+, blue, WW) and homozygous mutant (mEC1.45del/del, orange, DD) embryos at E18.5 for 5 litters (32 embryos).

(N) Landmarks for WW and DD mandibles projected onto a thin plate spline. Landmarks that differ significantly by Hotelling test are highlighted in pink (p < 0.05) and red (p < 1E-05).

(O) Boxplot of postnatal growth rate (P20-P25, g/day) for WW and DD embryos. Two replicate groups plotted as residuals of linear regression, ANOVA p-val = 0.01473.

For PCA plots, different shape markers represent independent litters.

See also Figure S7 and Tables S12.

Based on the conserved, albeit weak, activity of mEC1.45, we performed pronuclear injection of CRISPR/Cas9 ribonucleoprotein complexes (RNPs) to target the region for deletion (Figure 6D), and established two distinct founder lines (Figures 6D and S7EF). To determine the impact of mEC1.45 deletion on Sox9 expression, we crossed mEC1.45del/+ FVB mice with wildtype C57BL/6 mice, dissected craniofacial processes from embryos at E11.5 and performed allele-specific RT-ddPCR for Sox9 utilising strain-specific SNPs (Figure S7G). Analysis of Sox9 expression in the combined medial and lateral nasal processes (M/LNP), maxillary process (MxP) and mandibular process (MdP) revealed that in wildtype embryos Sox9 was expressed at similar levels from the FVB and C57BL/6 alleles (Figure 6E, green boxplots). By contrast, in mEC1.45del/+ embryos Sox9 expression was significantly reduced for the FVB allele carrying the enhancer deletion, with the greatest reduction observed for the mandibular process (Figure 6E, red boxplots, p<0.032). Consistent with the weaker activity of mEC1.45 compared to human, deletion of the mEC1.45del caused quantitatively milder (8% in M/LNP, 12% in MxP and 13% in MdP) reduction in Sox9 expression from the mutant allele, compared to a much greater reduction of SOX9 expression in EC1.45del/+ late hCNCCs (~50–55%). At an earlier stage of development, E9.5, we also observed a modest reduction in Sox9 expression from the mEC1.45del mutant allele, with the most significant effect seen for the frontonasal prominence (FNP), consistent with the enhancer activity pattern at this stage (Figures S7B and S7H). Therefore, while the spatiotemporal activity of mEC1.45 is conserved, there is a substantially diminished strength of activity and input into Sox9 expression compared to the human sequence.

Deletion of mouse orthologous EC1.45 affects mandible morphology and exacerbates PRS-like phenotypes associated with Sox9 heterozygosity

Considering the overall weaker regulatory activity of mEC1.45 compared to human, we chose a sensitized background strategy to first assess a possible function of mEC1.45 in craniofacial development. We therefore crossed Wnt1::Cre2;mEC1.45del/+ females and Sox9F/F males and compared Wnt1::Cre2;Sox9F/mEC1.45del compound heterozygous mice (CFD, Cre Sox9-Flox Delete-mEC1.45) for which all Sox9 transcripts in CNCCs and derivatives are expressed from the allele with mEC1.45 deleted (Figure 6F) to Wnt1::Cre2;Sox9F/+ conditional Sox9 knock-out animals (CFW, Cre Sox9-Flox Wildtype-mEC1.45) for which the remaining allele expressing Sox9 is wildtype. In this sensitized setting, we may predict exacerbated phenotypes compared to those seen in conditional Sox9 heterozygotes.

We initally weighed surviving pups up to weaning and observed a decreased growth rate for the compound mutant (CFD) animals compared to the Sox9 heterozygous (CFW) animals (Figure 6G). We next performed microCT and landmarking for E18.5 mandibles from the same cross, and Procrustes analysis followed by a Hotelling test revealed a landmark at the condylar process as most morphometrically distinct between genotypes (p<1e-04, Figure 6H). Indeed, PCA analysis using landmarks at the condylar process clearly separated mandibular morphology for CFW and CFD embryos (Figure 6I). These results show that additional loss of the mEC1.45 enhancer exacerbates the changes in mandible morphology observed in the conditional heterozygous Sox9 mutant (Figures 5E and 6J). Furthermore, quantification of condylar process length and width revealed a reduction for CFD compared to CFW embryos (p<1.1e-4, ANOVA; Figure 6K), while overall mandible length was also reduced by 3–5% (p<0.007, ANOVA; Figure 6L). Therefore, ablation of a developmental enhancer cluster that intersects a human disease locus exacerbates PRS-like phenotypes in a sensitized genetic background.

To determine whether mEC1.45 enhancer deletion alone, which even in a homozygous setting is expected to cause only a 13% reduction in Sox9 expression (Figure 6E), results in altered jaw morphology, we performed microCT analysis for E18.5 embryos obtained from a cross between heterozygous mEC1.45del/+ animals. Using all 18 mandibular landmarks, we were able to separate the wildtype (WW) and mEC1.45 homozygous knockout (DD) embryos by PCA indicating a reproducible phenotypic alteration of mandibular shape when the mEC1.45 enhancer is ablated (Figure 6M). A Hotelling test once again revealed that the ramus was the mandibular structure most affected by changes in Sox9 dosage (Figure 6N). Albeit milder, these alterations in mandibular ramus morphology are reminiscent of phenotypes observed in PRS patients, as quantified by a number of studies (Bienstock et al., 2016; Chen et al., 2015; Chung et al., 2012; Suri et al., 2010; Susarla et al., 2017; Volk et al., 2020; Zellner et al., 2017) (Table S4 and Figure S6C). Finally, to address if enhancer knockout results in failure to thrive, we weighed pups up to weaning age (P20-P25) and detected a reduction in weight gain for mEC1.45 knockout animals (Figure 6O). Collectively, these data show that even a subtle reduction in gene dosage, resulting from enhancer loss, can lead to alterations of craniofacial morphology and result in a reduced ability of an organism to thrive.

Discussion

Given the phenotypic overlap between craniofacial abnormalities of campomelic dysplasia and PRS, it had been long speculated, but not formally demonstrated, that regulatory elements harbored by the PRS region deletions might regulate SOX9 (Amarillo et al., 2013; Benko et al., 2009; Gordon et al., 2009, 2014). Furthermore, several distinct hypotheses have been put forth regarding the cellular origins of the disease (Tan et al., 2013). In this study, we shed light onto these long-standing questions, identifying and characterizing two clusters of enhancers 1.25 and 1.45 megabases upstream of the SOX9 gene that fall within the PRS locus, make long-range contacts with the SOX9 promoter and dynamically regulate its expression during cranial neural crest development. These enhancers become inactive following hCNCC differentiation to chondrocytes, defining a developmental window for the aetiology of craniofacial phenotypes observed in PRS (Figure 7A).

Figure 7: Summary of PRS locus enhancer activity, with a proposed model for PRS aetiology and associated Neanderthal DMR evolution.

Figure 7:

(A) A model of EC1.45 and EC1.25 hCNCC-specific regulation of SOX9 expression at extreme long-distance followed by decommissioning in chondrocytes. A Neanderthal-specific hypomethylated region (HMR) overlaps EC1.45. Two minimal elements in EC1.45 have synergistic activity, i.e. (min1+min2) > (min1)+(min2). Coordinator motifs in min1 and min2 sequences are central for their activity and are bound by TWIST1.

(B) EC1.45 and EC1.25 are active in the developing face.

(C) A model for PRS aetiology, whereby by two features converge to confine disease phenotypes to the lower jaw.

(D) Phylogenetic tree of the inferred regulatory evolution for an EC1.45 Neanderthal-specific hypomethylated region (HMR, green). From left to right: anatomically modern humans (AMH), Denisovans, Neanderthals and chimpanzee. Million years ago, mya.

BA1–2, branchial arch 1–2; FNP, frontonasal prominence; LNP, lateral nasal process; MdP, mandibular process; MNP, medial nasal process; MxP, maxillary process. </p/> See also Figure S6.

Unlike the case of some enhanceropathies, where a number of patient-specific mutations overlap to reveal a single minimal element that is disrupted in the disorder e.g.(Gonen et al., 2018; Kortüm et al., 2011), many of the PRS deletions described to date are non-overlapping and harbor one or the other of the enhancer clusters identified here (Amarillo et al., 2013; Benko et al., 2009; Gordon et al., 2014; Rainger et al., 2014). Interestingly, PRS patients with translocation breakpoints, in which both EC1.25 and EC1.45 are lost, appear to display more severe phenotypes (Benko et al., 2009). This suggests that loss of distinct enhancers that each quantitatively affect SOX9 dosage in CNCCs can lead to similar disease outcomes, while loss of the broader regulatory region via chromosomal translocation can have an additive effect on both SOX9 gene dosage and lower jaw morphology. Indeed, our mouse modelling revealed that lower jaw development is sensitive to even small perturbations in Sox9 gene dosage, with a range of phenotypes of increasing severity observed over a range of reductions in Sox9 expression (Figures 5 and 6).

In our analysis of the 1.45 Mb enhancer cluster, the two constituent p300-binding regions within EC1.45 are individually weak enhancers but display striking combinatorial synergy far greater than the sum of the individual activities (Figure 7A). Previous studies looking at the relationship between multiple enhancers within super enhancer clusters have supported an additive or redundant, rather than synergistic activity of the constituent enhancer elements (Dukler et al., 2017; Hay et al., 2016; Moorthy et al., 2017; Shin et al., 2016).

The regulatory elements identified in our study represent the longest-range developmental enhancers involved in congenital malformations that have been described to date, at a distance of nearly 1.5 Mb from the regulated gene promoter. These enhancers provide a valuable paradigm for continuing investigation of long-range developmental gene regulation and its perturbation in human disease and they join a small class of documented extreme long-range regulatory sequences that activate transcription at over a megabase genomic interval, such as the Shh ZRS and the Myc BENC and MNE enhancers (e.g. (Bahr et al., 2018; Herranz et al., 2014; Lettice et al., 2003; Uslu et al., 2014)). The enormous genomic distance begs questions about how the PRS-associated enhancer clusters can communicate with the SOX9 promoter to drive tissue-specific regulation in a precise and robust manner. Interestingly, one of the PRS-associated candidate enhancer clusters, EC1.35, that does not harbor activity in reporter assays, contains a constitutive CTCF binding site and interacts with the SOX9 promoter already in hESCs. This interaction is further augmented in hCNCCs along with the formation of interactions between all three PRS-associated enhancers and between EC1.45 and EC1.25 and the SOX9 promoter. Remarkably, a recent study in which the centromeric Sox9 TAD boundary was deleted in mice showed no significant impact on either Sox9 expression or examined phenotypes (Despang et al., 2019) suggesting that TAD integrity may not be required for these long-range interactions and Sox9 regulation. However, while the EC1.35 element may not act as a canonical enhancer, it may instead participate in organizing extreme long-range contacts at the SOX9 locus via formation of CTCF-cohesin mediated chromatin loops.

When attempting to model human non-coding mutations at the orthologous PRS locus in mice there were a number of challenges to consider. First, due to extensive reshaping of mammalian genomes during evolution by transposons and other genomic forces, many functional human enhancer regions do not have orthologous sequences in mice (Chuong et al., 2017; Villar et al., 2015; Yue et al., 2014). Second, even when the orthologous sequence exists, its regulatory activity may not be conserved or can differ in strength or relative contribution to the target gene dosage (Denas et al., 2015; Shen et al., 2012). This second challenge is well illustrated by EC1.45: while the orthologous sequence is present in mouse and its spatiotemporal activity is conserved, mEC1.45 is a substantially weaker enhancer compared to its human counterpart, perhaps compensated for by additional mouse-specific CNCC enhancers at the locus (Figure S7I). Consequently, deletion of mEC1.45 results in only ~ 6–13% decrease in Sox9 expression level, compared to the 50–55% SOX9 reduction seen for human EC1.45 deletion. Nonetheless, it is quite remarkable that even such a slight reduction in Sox9 gene dosage results in measurable changes in lower jaw shape and reduction in postnatal growth.

Despite the caveats outlined above, from our combined human and mouse results we can propose a model for the specificity of PRS manifestations whereby mutations at the far end of the SOX9 gene desert perturb broadly active craniofacial developmental enhancer clusters and impact SOX9 expression across the cranial neural crest. However, the heightened sensitivity of the mandible to SOX9 gene dosage further restricts the manifestations to micrognathia, which can in sequence lead to additional PRS-associated phenotypes (Figure 7C). Our work raises the interesting question of why the mandible is more sensitive to Sox9 dosage perturbation, despite the broad expression of Sox9 across craniofacial structures. We suggest two potential hypotheses. In the first, we note that distinct transcription factors and signaling components are expressed in the future upper and lower jaw during development, for example high levels of Hand2 and Dlx5/6 are expressed in the mandibular but not maxillary process (Beverdam et al., 2002; Funato et al., 2016). Loss of this patterning through ablation of the upstream Edn1/Ednra signalling pathway leads to a striking jaw transformation (Minoux and Rijli, 2010). Therefore, if spatially restricted morpho-regulatory programs such as these are differentially sensitive to Sox9 activity, this could lead to tissue selective impacts on craniofacial development. An alternate hypothesis for the observed mandibular sensitivity to Sox9 perturbation could be related to the differences in trajectory of craniofacial skeletal development. In a somewhat atypical process, the formation of the mandible is intimately associated with a cartilage “template” called Meckel’s, while by contrast the midfacial skeleton forms strictly via intramembranous ossification independent of any cartilage precursor. Therefore, should perturbation of Sox9 expression in CNCCs impact the propensity or ability to differentiate into chondrocytes, that could account for the selective impact on mandibular development.

From an evolutionary standpoint, the mandible is extremely interesting as it is has widely divergent forms related to feeding and predation (Albertson and Kocher, 2006; Martinez et al., 2018). Furthermore, mandible shape evolution in hominins appears exceptionally rapid as compared to any other primate clade (Raia et al., 2018), and includes shape changes within the ramus (including the condylar and coronoid processes, and gonial angle) (Meloro et al., 2015; Terhune et al., 2014, 2018) – structures which are especially sensitive to slight alterations in Sox9 expression in our mouse models. It is therefore tempting to speculate that some of this morphological divergence could be mediated by regulatory changes leading to minor differences in SOX9 expression levels during CNCC development. In fact the EC1.45 enhancer cluster featured in this study overlaps a Neanderthal-specific hypomethylated region from bone samples (based on reconstructed DNA methylation maps (Gokhman et al., 2014, 2020)), (Figures 7A, 7D and S6I). While somewhat speculative, this suggests that the Neanderthal enhancer element might have retained regulatory activity for longer during development (as DNA methylation is generally associated with silencing), as compared to the human enhancer, which becomes decommissioned during chondrogenesis and is hypermethylated in human bones of various origin (see Figures 3CE). Together, the PRS locus enhancers represent a fascinating locus for future investigation of extreme long-range gene regulation in development, disease and across evolutionary time.

Limitations of the Study

As outlined above, there are a number of challenges and limitations when attempting to model a human enhanceropathy in mice due to remodeling of the enhancer landscape across evolutionary time. In our study, this is exemplified by the weakened enhancer activity of mE1.45 as compared to the human counterpart, and associated lower contribution to Sox9 expression. An additional limitation relates to the mouse strains used in the study, as the Wnt1::Cre and Sox9F/F mice are on an C57BL/6J background, while the mEC1.45 deletion was generated on the FVB background. These different genetic backgrounds may cause a differential sensitivity to Sox9 perturbation due to other modifying variants in the genome.

STAR Methods

RESOURCE AVAILABILITY

Lead Contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Joanna Wysocka (wysocka@stanford.edu).

Materials Availability

DNA constructs and other research reagents generated by the authors will be distributed upon request to other researchers.

Data and Code Availability

The ChIP-seq, ATAC-seq and RNA-seq datasets generated in this study are available at Gene Expression Ombnibus (GEO) accession GSE145327 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE145327. The 10X Genomics linked-read sequencing is available at Sequence Read Archive (SRA) BioProject PRJNA648128.

EXPERIMENTAL MODEL AND SUBJECT DETAILS

Mouse models and husbandry

C57BL/6J (RRID: IMSR_JAX:000664), FVB/NJ (RRID: IMSR_JAX:001800), C57BL/6 Wnt1::Cre2 (RRID: IMSR_JAX:022501) (Lewis et al., 2013) and C57BL/6 Sox9 flox (RRID: IMSR_JAX:013106) (Akiyama et al., 2002) mice were obtained from Jackson Labs and mEC1.45 deletion lines were generated and characterized on the FVB background as described in Method Details. Mice were housed in RAFII facility at Stanford University, with free access to food and water. All animal protocols were approved by the Administrative Panel on Laboratory Animal Care at Stanford University. Of note, the second generation Wnt1::Cre2 driver avoids ectopic Wnt activation and impact on midbrain development, which might have confounded studies performed with the first-generation Wnt1::Cre driver (Lewis et al., 2013). Only females were used to propagate the Wnt1::Cre2 driver.

For breeding, two female mice were introduced into a cage with a single male and monitored for timed pregnancies. To generate compound heterozygous mice carrying a deletion of mEC1.45 on one allele and a conditionally deletable Sox9 gene on the other allele, we crossed the mEC1.45 enhancer deletion lines to Wnt1::Cre2 driver females, and then crossed the resultant Wnt1::Cre2;mEC1.45del/+ females to Sox9F/F males. Mice were genotyped by tail clipping, lysis with Proteinase K in tail buffer (0.2M NaCl, 0.2% SDS, 0.05M EDTA and 0.1M Tris-HCl pH 8.0), precipitation with isopropanol, and analytical PCR using genotype-specific primers and Dream Taq Master Mix (Thermo Fisher Scientific). To monitor weight-gain, mice were weighed from birth to post-natal day 25 (P0-P25) using a scale with 2 decimal places (accuracy +/− 0.01g). Males and females were included for embryonic assays at E9.5 and E11.5, and for post-natal weighing.

Culture of H9 human embryonic stem cells (hESCs)

Female H9 (WA09, RRID: CVCL_9773) human embryonic stem cells (hESCs) were obtained from ATCC and cultured in mTeSR (Stem Cell Technologies) and grown on Matrigel Growth Factor Reduced (GFR) Basement Membrane Matrix (Corning) at 37°C. hESCs were fed every day and passaged every 5–6 days using ReLeSR (Stem Cell Technologies).

METHOD DETAILS

Mouse genome editing using CRISPR/Cas9

Mouse orthologous mEC1.45 was deleted in vivo using CRISPR-Cas9 editing in the FVB strain as previously described (Osterwalder et al., 2018). Briefly, pairs of sgRNAs were designed to target upstream and downstream of the enhancer sequence to be deleted using CHOPCHOP (Labun et al., 2016) and Benchling (https://www.benchling.com/). sgRNAs were generated using a modified version of a previously published oligo assembly protocol (Varshney et al., 2015). In this process an oligo encoding a T7 promoter and the guide RNA sequence were annealed to a second, generic oligo, and Phusion polymerase (NEB) was used for extension. The guide RNA was synthesised using HiScribe T7 Quick High Yield RNA Synthesis kit (NEB) and purified using the MEGAclear Transcription Clean-up kit (Ambion) prior to quantification. A mix containing Cas9 protein (final concentration of 20 ng/ul; IDT Cat. No. 1074181) and four sgRNAs (12.5 ng/μL each) in an injection buffer (10 mM Tris, pH 7.5; 0.1 mM EDTA) was injected into the pronucleus of FVB mouse embryos at the single-cell stage. F0 founder mice were genotyped using primers spanning the desired deletion region and High-Fidelity Platinum Taq polymerase (Thermo Fisher Scientific) to identify deletion breakpoints, which were validated and mapped using Sanger sequencing. Deletions were validated in second generation F1 animals, and heterozygous animals were crossed to generate homozygous and heterozygous animals for breeding. Two founder lines were established with deletions differing by 58bp. Founder 1 has a 3572bp deletion, and Founder 2 has a 3630bp deletion.

Generation of CRISPR/Cas9 genome-edited cell lines

Human ESCs were targeted for enhancer deletion using two strategies. In the first strategy, H9 hESCs were transfected using FuGENE 6 (Promega) with a targeting construct containing Blasticidin selection cassette, flanked by FRT sites, and homology arms for either side of EC1.45 along with a plasmid encoding Cas9 plus single guide RNAs (sgRNAs) flanking EC1.45. Transfected hESCs were grown to confluency and split onto a new plate before selection with 1 μg/mL Blasticidin until all cells died on a mock/GFP transfected control well. Surviving colonies were picked into a 48-well plate, expanded, split and screened for enhancer deletion using a genomic primer and a primer in the targeting cassette. Heterozygous enhancer deleted clones were infected with 1E+08 pfu/mL Adeno-flippase (Ad5CMVFlpO, Fred Hutchinson Cancer Research Center) and clones screened for excision of the selection cassette by PCR. For screening, genomic DNA was extracted using QE buffer (Lucigen) and PCR was performed using Q5 polymerase (NEB). Heterozygous enhancer deletions were generated to be in keeping with the heterozygous deletions seen in PRS patients, and to enable allele-specific SOX9 gene expression analysis.

In the second targeting strategy, a scar-less editing methodology was performed (Ikeda, 2018). A targeting construct was designed to insert adjacent to EC1.25 in H9 hESCs, containing a hUbC promoter driving expression of an eGFP-T2A-tCD8 cassette. H9 hESCs were nucleofected with 3μg this construct and 3ȝg of a plasmid encoding Cas9 and a single guide RNA targeting one side of EC1.25 using an Amaxa 4D nucleofector (pulse code CB150). GFP positive cells were isolated by FACS after 7–12 days and plated onto a 6-well plate. GFP positive colonies were then picked, expanded and screened for integration of the targeting cassette using primers within the targeting cassette and flanking genomic sequence. Genomic DNA was extracted using QE buffer (Lucigen) and PCR screening was performed using PrimeSTAR GXL DNA Polymerase (Takara) and heterozygous targeted clones were isolated. A second cassette was designed to excise the first targeting construct as well as EC1.25 using enhancer-flanking homology arms with no extra exogenous sequences, to leave a scar-less deleted enhancer region. To generate matched wildtype clones, a wildtype homology template was used to excise the targeting cassette. Colonies were selected by screening for loss of tCd8 by magnetic activated sorting (MACS), plated onto a 6-well plate, and following dilute re-plating, colonies were picked into a 48-well plate. Clones were passaged and screened using primers flanking EC1.25 to identify positive clones with EC1.25 deleted on one allele, or excision of the targeting construct for the matched wildtype controls. Genomic DNA was extracted using QE buffer (Lucigen) and PCR screening was performed using Q5 polymerase (NEB).

Differentiation of hESC to CNCCs and chondrocytes

hESCs were differentiated to cranial neural crest cells (CNCCs) using a protocol described previously (Prescott, 2015). Briefly, hESCs were grown for 5–6 days until large colonies formed, then were disaggregated using collagenase IV and gentle pipetting. Clumps of ~200 hESCs were washed in PBS and transferred to a 10cm petri dish in neural crest differentiation media (NDM). NDM: 1:1 ratio of DMEM-F12 and Neurobasal, 0.5x Gem21 NeuroPlex Supplement With Vitamin A (Gemini, 400–160), 0.5x N2 NeuroPlex Supplement (Gemini, 400–163), 1x antibiotic/antimycotic, 0.5x Glutamax, 20ng/ml bFGF (PeproTech, 100–18B), 20ng/ml EGF (PeproTech, AF-100–15) and 5ug/ml bovine insulin (Gemini Bio-Products, 700–112P). After 7–8 days, neural crest emerged from neural spheres attached to the petri dish, and after 11 days, neural crest cells were passaged onto fibronectin-coated 6-well plates using accutase and fed with neural crest maintenance media (NMM). NMM: 1:1 ratio of DMEM-F12 and neurobasal, 0.5x Gem21 NeuroPlex Supplement with Vitamin A (Gemini, 400–160), 0.5x N2 NeuroPlex Supplement (Gemini, 400–163), 1x antibiotic/antimycotic, 0.5x Glutamax, 20ng/ml bFGF, 20ng/ml bFGF EGF and 1mg/ml BSA (Gemini). After 2–3 days, neural crest cells were split 1:3 and the following day cells were fed with neural crest long-term media. Long term media: neural crest maintenance media + 50pg/ml BMP2 (PeproTech, 120–02) + 3uM CHIR-99021 (Selleck Chemicals, S2924) (BCh media). CNCCs were then passaged twice to passage 4 when the majority of assays were performed, or cells were further differentiated to chondrocytes.

To differentiate CNCCs to chondrocytes, passage 3 CNCCs were passaged to passage 4, and the following day were transitioned to chondrocyte media without TGFb3 (ChM). ChM: DMEM-HG, 5% FBS, 1x ITS premix, 1mM sodium pyruvate, 50 μg/mL ascorbic acid, 0.1 μM dexamethasone and 1x antibiotic/antimycotic. The following day, cells were fed with chondrocyte media with TGFb3 (ChMT). ChMT: ChM + 10 ng/mL TGFb3. Cells were fed every subsequent 3 days with ChMT. Cells were harvested at day 5 and/or 9 of the differentiation for the majority of assays.

To evaluate the chondrogenic differentiation, we performed qRT-PCR for two independent experiments, and assessed the differentiation from P4 hCNCCs to Day 9 of chondrogenic differentiation. COL2A1 and Aggrecan (ACAN) are known to be directly regulated by SOX9 during chondrogenesis, COL2A1 is an early marker of the chondrocyte lineage, while ACAN is a marker of overtly differentiated chondrocytes. SOX5 and SOX6 are two SOX family transcription factors that are co-expressed with SOX9 in chondrocytes, and all three factors often form a trio at regulatory elements to promote chondrocyte differentiation. Notably, SOX5 is induced early during our in vitro differentiation. BMP2 is a marker of hypertrophic chondrocytes and is essential for chondrocyte proliferation and maturation.

Alcian Blue Staining

Alcian Blue stains extracellular matrix proteoglycan components associated with chondrocytes. hCNCCs were differentiated to chondrocytes and fixed with 4% PFA for 15 minutes. Cells were washed three times with 1x PBS, and incubated overnight in 20% sucrose at 4°C. The following day, cells were stained with Alcian Blue solution (pH 2.5) for 30 minutes. Alcian Blue solution was prepared by diluting 1g Alcian Blue, 8GX in 100mL 3% Acetic Acid solution, pH was adjusted to 2.5 with acetic acid. Following Alcian Blue staining, cells were washed with 3% acetic acid for 3 minutes, followed by several washes with 95% ethanol. Ethanol was removed and cells were imaged.

Capture-C

Capture-C was performed as previously described (Davies et al., 2015). Briefly, cells were crosslinked for 10 min in 2% formaldehyde in PBS, quenched with 125mM glycine for 5 min, scraped, collected and pelleted by centrifugation (500 rcf, 5 min, 4°C). Cells were washed with 5mL cold PBS, pelleted and resuspended in 5mL cold lysis buffer (10mM Tris pH8, 10mM NaCl, 0.2% Ipegal CA-630 in water with cOmplete Protease Inhibitor Cocktail) for 20 min. Cells were pelleted, washed with 5mL cold PBS, pelleted, resuspended in 1mL cold PBS, flash frozen in liquid nitrogen and stored at −80°C. For 3C library preparation, samples were defrosted on ice, pelleted and resuspended in 1xDpnII digestion buffer before extended digestion with DpnII overnight with addition of extra enzyme. DpnII was then heat inactivated at 65°C and samples were ligated with T4 Ligase for ~22 hours. DNA was extracted by sequential Proteinase K and RNase digestion followed by phenol-chloroform isoamylalcohol extraction and precipitation with ethanol and sodium acetate. Efficiency of digestion and ligation was assessed by gel electrophoresis and digestion efficiency was further assessed by qPCR. For addition of Illumina sequencing adaptors, samples were sheared by Covaris sonication and purified by XP SPRI bead clean-up. Sequencing adaptors were annealed using NEB Ultra II kit. Libraries were PCR amplified using Herculase II polymerase (Agilent) in duplicate to add indexing primers and purified by XP SPRI bead clean-up. Samples were pooled and quantified using Qubit dsDNA BR assay kit.

Biotinylated oligos for capture were designed using the Capsequm online tool (http://apps.molbiol.ox.ac.uk/CaptureC/cgi-bin/CapSequm.cgi) and ordered from IDT. 1–2μg indexed 3C library was mixed with 5μg COT DNA, 1nmol TS-HE Universal Oligo and 1nmol of TS-HE Index Oligo (Nimblegen SeqCap EZ HE-oligo and Accessory kit) and dried by vacuum centrifugation at 55°C until completely dry. The 3C library plus blocking oligos were then carefully resuspended in 7.5μL 2X hybridization buffer and 3μL Hybridization buffer A and denatured at 95°C for 10min. The 3C library was then transferred into a preheated PCR tube at 47°C containing 4.5μL of pooled Biotinylated oligonucleotide capture probes at 2.9μM. After a brief mix and centrifugation, the 3C library oligo mix was incubated at 47°C for 18–20 hrs. Following this incubation, the 3C library was washed and recovered by streptavidin bead (M-270 Dynabeads, Invitrogen) pull down using the Nimblegen SeqCap EZ Hybridization and wash kit. Following the recovery of the captured material, the captured DNA was amplified on the streptavidin beads using KAPA HiFi HotStart ReadyMix and POST-LM PCR oligo 1&2 and purified by XP SPRI bead clean-up. For improved capture efficiency, a second round of capture was performed on the total amplified DNA from the first capture. Following the second round of capture, the library was quantified using KAPA library quantification kit using the average size calculated from Bioanalyzer. Libraries were then sequenced on Illumina HiSeq-4000 (2× 150bp).

RNA isolation and preparation of RNA-seq libraries

Total RNA was extracted from a 6-well of hESC, early (P1 and 2) and late (P3 and P4) hCNCCs and day 9 chondrocytes differentiated from hCNCCs using Trizol reagent (Invitrogen) for four independent differentiations. 10μg RNA was purified twice by Dyna1 oligo(dT) beads (Invitrogen) to enrich for poly(A)+ mRNA. The mRNA was then fragmented using 10X fragmentation buffer (Ambion) for exactly 5 min and purified by ethanol precipitation with sodium acetate and RNase-free glycogen. First strand synthesis was performed using Random Hexamer Primers (Invitrogen) and SuperScript II (Invitrogen), followed by second strand synthesis using DNA PolI and RNaseH (Invitrogen), and cDNA was purified using Nucleospin Gel and PCR Cleanup (Takara). All of the cDNA was used for library preparation by end repair, A-tailing and adapter ligation (NEB). The samples were treated with USER enzyme, purified using XP SPRI beads then subjected to dual size selection using XP SPRI beads using bead ratios 0.55x to remove >700bp followed by 0.85x to recover >200bp sized cDNA. Size-selected cDNA was amplified using NEBNext HiFi 2X PCR mix and Dual Index Primers (NEB, E7600) for 7–10 cycles (as determined by qPCR). Libraries were then purified using XP SPRI beads, and quantified using Qubit dsDNA HS assay kit and pooled for sequencing using average library size (bp) from Bioanalyzer and concentration from KAPA quantification (Kapa Biosystems). Libraries were sequenced using NextSeq 500 (2× 75 bp).

10X Genomics Linked-Read sequencing

High molecular weight genomic DNA (HMW gDNA) was generated from H9 hESCs by the salting out method (10x Genomics, manual CG000116) and quality was checked on a FEMTO pulse instrument (Agilent). Linked read libraries were prepared according to the manufacturer’s instructions (10x Genomics, manual CG00043) and sequenced on a HiSeq 4000 instrument (2 lanes, 2× 150 bp).

cDNA preparation and reverse transcriptase digital droplet PCR (RT-ddPCR)

Total RNA was extracted from early (Day 11, P1 and 2) and late (P3 and P4) hCNCCs and day 5 and 9 chondrocytes differentiated from hCNCCs for wildtype or enhancer mutant cell-lines using Trizol reagent (Invitrogen) for at least four differentiations, or from dissected craniofacial prominences from at least four embryos. 100ng – 1ug RNA was used to generate cDNA using the SuperScript Vilo IV MasterMix with ezDNase enzyme (Invitrogen). Primers and locked nucleic acid (LNA) probes were designed by IDT’s custom design service to the human SOX9 or mouse Sox9 3’UTR. For H9 hESC samples, LNA probes were centered on the rs74999341 T/C SNP – a HEX LNA probe detects the T-allele and FAM detects the C-allele. For mouse samples, LNA probes were centred on a G/C SNP in the C57BL6 vs FVB mouse strains respectively – a HEX LNA probe detects the G-allele and FAM detects the C-allele. cDNA dilution factor was determined using qPCR with 1X PrimeTime Gene Expression Master Mix (IDT), 500nM primers and 250nM probes, run on LightCycler 480 (Roche). ddPCR reactions were performed using diluted cDNA (10–100X diluted), 900nM primers and 250nM probes and 1X ddPCR Supermix for probes (no dUTP, BioRad). ddPCR droplets were generated using the QX200 Droplet Generator (BioRad) and droplets were read using QX200 Droplet Reader (BioRad) and analyzed using the QuantaSoft Software (BioRad).

Chromatin immunoprecipitation (ChIP)

5–15 million cells were cross-linked per ChIP experiment in 2mL PBS per 6-well with 1% methanol-free formaldehyde for 5–10 min and quenched with a final concentration of 0.125M glycine for 5 min with nutation. Cross-linked cells were washed with PBS, scraped and pelleted by centrifugation, flash-frozen in liquid nitrogen and stored at −80°C. Samples were defrosted on ice and resuspended in 5mL LB1 (50 mM Hepes-KOH pH 7.5, 140 mM NaCl, 1 mM EDTA, 10% glycerol, 0.5% NP-40, 0.25% Triton X-100, with 1X cOmplete Protease Inhibitor Cocktail and optionally 1mM PMSF) and rotated vertically for 10 min at 4°C. Samples were centrifuged for 5 min at 1350 × g at 4°C, and resuspended in 5mL LB2 (10 mM Tris, 200 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, with 1X cOmplete Protease Inhibitor Cocktail and optionally 1mM PMSF) and rotated vertically for 10 min at 4°C. Samples were centrifuged for 5 min at 1350 × g at 4°C, and resuspended in 1mL LB3 per 10 million cells (maximum concentration of cells for Covaris sonication), or 1 mL per ChIP. Samples were sonicated in 1mL AFA tubes for 5 min on E220 evolution Covaris with settings Peak power = 140, Duty Factor = 10, Cycles per burst = 200 to achieve chromatin sized approximately 500–2000bp.

Following sonication, samples were re-combined (if aliquoted for sonication), Triton X-100 was added to the fragmented chromatin to a final concentration of 1%, and the chromatin divided for input (1–2%) and ChIP samples. 5ȝg anti-histone antibody or 5–9μg anti-transcription factor antibody was added per ChIP sample, and incubated overnight at 4°C. Antibodies used include TWIST1 (Abcam, ab50887), RAD21 (Abcam, ab992), CTCF (Cell Signaling, 2899S), H3K4me1 (Active Motif, 39297), H3K4me3 (Active Motif, 39159), H3K27ac (Active Motif, 39133) and p300 (Santa Cruz). Protein G Dynabeads (ThermoFisher) were first blocked with Block solution (0.5% BSA (w/v) in 1X PBS) and then added to cleared chromatin to bind antibody-bound chromatin for a 4–6 hour incubation. Chromatin-bound Dynabeads were washed at least 6 times with chilled RIPA wash buffer (50 mM Hepes-KOH pH 7.5, 500 mM LiCl, 1 mM EDTA, 1% NP-40, 0.7% Na-Deoxycholate), followed by a wash with chilled TE + 50 mM NaCl. Chromatin was eluted for 15–30 min in Elution Buffer (50 mM Tris, 10 mM EDTA, 1% SDS) at 65°C with frequent vortexing. The ChIP and input samples were then incubated at 65°C overnight to reverse cross-links (12–16 hours). Samples were diluted and sequentially digested with RNase A (0.2 mg/mL) for 2 hours at 37°C followed by Proteinase K (0.2 mg/mL) for 2 hours at 55°C for 2–4 hours to digest protein. ChIP and input samples were purified by phenol-chloroform-isoamylalcohol extraction and precipitation with final concentration 70% ethanol, 0.3M NaOAc pH 5.2 and 1.5μL glycogen.

For library preparation, samples were quantified by Qubit dsDNA HS assay kit, and 10–30ng of ChIP DNA was used for library preparation with end repair, A-tailing, and adaptor ligation (NEB). Following USER enzyme treatment, samples were purified using Nucleospin Gel and PCR Cleanup (Takara) and separated by gel electrophoresis and size-selected for 220–500 bp by gel extraction. Libraries were then amplified to add indices using NEBNext HiFi 2X PCR mix and NEBNext Multiplex Oligos for Illumina kit (NEB, E7335S) with 9–15 cycles (as determined by qPCR). ChIP libraries were purified by two rounds of XP SPRI bead clean-up to deplete adaptors. Library concentration and quality was assessed by Bioanalyzer (to determine size) and KAPA qPCR was used to pool multiple libraries. Samples were sequenced using NextSeq or HiSeq 4000 platform (2× 75bp).

Episomal ChIP-ddPCR

Around 5 million passage 4 hCNCCs were transfected with 1.5μg wildtype (WT) and 1.5μg Coordinator mutant (4x mut) luciferase min1-min2 reporter plasmid in 300μL optimum with 9μL Fugene-6. Cells were fixed after 24 hours and ChIP performed as described above. 9μg TWIST antibody was used per ChIP (Abcam, ab50887). Primers were designed to amplify across the min2-plasmid backbone junction, and plasmid-specific probes were designed to distinguish between the wildtype and Coordinator mutant sequences. ChIP and input dilution factors were determined using qPCR with 1X PrimeTime Gene Expression Master Mix (IDT), 500nM primers and 250nM probes, run on LightCycler 480 (Roche). ddPCR reactions were performed using ChIP DNA (40X diluted) and input DNA (640X diluted), 900nM primers and 250nM probes and 1X ddPCR Supermix for probes (no dUTP, BioRad). ddPCR droplets were generated using the QX200 Droplet Generator (BioRad) and droplets read using QX200 Droplet Reader (BioRad) and analyzed using the QuantaSoft Software (BioRad).

ATAC-seq

ATAC-seq was performed as described previously (Buenrostro et al., 2013; Corces et al., 2017) Briefly, cells were dissociated and treated with DNaseI (Worthington) and 50,000 viable cells were sorted as DAPI negative. Cells were pelleted at 500 RCF for 5 min at 4°C and resuspended in ATAC-resuspension buffer containing 0.1% NP40, 0.1% Tween20, and 0.01% Digitonin and incubated on ice for 3 minutes. Following wash-out with cold ATAC-Resuspension Buffer (RSB, 10 mM Tris-HCl pH 7.4, 10mM NaCl, 3mM MgCl2 in sterile water) containing 0.1% Tween20, cells were pelleted and resuspended in 50μL transposition mix (25 μL 2x TD buffer, 2.5 μL transposase (100nM final), 16.5 μL PBS, 0.5 μL 1% digitonin, 0.5 μL 10% Tween20, 5 μL H2O) and incubated for 30 minutes at 37°C with shaking. The reaction was purified using the Zymo DNA Clean and Concentrator Kit, and amplified using PCR primers defined in (Buenrostro et al., 2013). Libraries were purified using the Zymo DNA Clean and Concentrator Kit, quantified using the KAPA Library Quantification kit and quality assessed by Bioanalyzer (also used to determine average size for pooling libraries). Samples were sequenced on the HiSeq4000 platform (2× 75bp).

Cloning PRS locus enhancers for luciferase assay, including Coordinator mutant and vertebrate species min2 sequences

EC1.45, as well as combined constituent enhancers for EC1.35 (S1 and S2) and EC1.25 (S3, S4, S5 and S6) were PCR-amplified from H9 hESC genomic DNA and cloned into the pGL3 luciferase reporter vector. To generate Coordinator mutant sequences, Coordinator motifs were identified in EC1.45 using fimo from the meme suite (Grant et al., 2011). Coordinator motifs were then mutated in silico at positions with greatest information content in the PWM, to resemble sequence changes associated with reduction in enhancer activity during human-chimpanzee divergence (Prescott et al., 2015). To synthesize orthologous min2 sequences, Multiz Alignments of 100 Vertebrates (UCSC) was used to identify orthologous sequences for mouse, opossum, platypus, lizard, chicken, frog and coelacanth which were then extended to each be 267 bp long. Sequences containing mutant EC1.45 Coordinator motifs and vertebrate min2 sequences were ordered from TWIST Bioscience (or IDT for coelacanth sequence) and cloned into the pGL3 luciferase reporter vector.

Luciferase assay

Luciferase assays were performed as described previously (Prescott, 2015). Briefly, H9 hESC were differentiated following the CNCC differentiation protocol and passaged to passage 3. For CNCCs, cells were transfected immediately following passaging to passage 4 in 24-well plates. For hESCs, cells were split the day before into a 24 well plate in ROCKi (Y27632) and transfected the following day. For chondrocytes, passage 3 CNCCs were split into a 24-well plate, and the now passage 4 CNCCs were transitioned to ChM media for 1 day, and ChMT media for 4 more days before being transfected for the luciferase assay. Transfections were performed in technical triplicate in a single experiment, with each well receiving 10ng of pGL3 plasmid, 0.5ng of control pRL firefly renilla plasmid, 89.5μL carrier DNA (circularized pGEMT plasmid) and 0.3 μL Fugene 6 in 50μL of optimum. The pGL3 plasmid contains the firefly luciferase gene driven by an SV40 promoter with either a control SV40 enhancer downstream, or a test enhancer sequence cloned upstream (Promega), the pRL plasmid acts as a transfection control with Renilla luciferase driven by an upstream CMV enhancer and CMV promoter (Promega). 24 hours after transfection, cells were washed in PBS, and lysed in 150μL 1X passive lysis buffer (in PBS) for 15 min (Promega). 20μL lysate was then transferred to an opaque flat-bottomed plate for reading with a luminometer (Veritas). An automated injector added 100μL LARII reagent and the well was read using the following parameters: 2 sec delay, 10s integration. 100μL Stop-and-Glow reagent was then injected into the well and read using the same parameters. Luciferase assays were repeated in biological duplicate or triplicate with at least two different DNA preps; empty vector and SV40 enhancers were included in each experiment as negative and positive controls, respectively.

LacZ in vivo reporter assay

A second generation LacZ reporter vector was cloned with a Hsp68 promoter driving expression of LacZ-P2A-tdTomato flanked by core insulator sequences to minimize position effects of site of integration and with a multiple cloning site upstream for cloning enhancer sequences to be tested. EC1.45 was subcloned into this vector from the luciferase reporter, as was S1-S2 combined EC1.35 enhancer cluster. The constituent enhancers from EC1.25 (S3, S4, S5 and S6) were triplexed and inserted into the multiple cloning site. The reporter plasmids were linearized and injected into fertilized mouse oocytes, implanted into recipient females and allowed to develop to two distinct mouse embryonic stages, E9.5 or E11.5, that represent distinct periods of craniofacial development (Cyagen – for EC1.45, EC1.35 and EC1.25-S5; LBNL for remaining constructs). Embryos were harvested and processed for X-gal staining to reveal spatial expression pattern of the LacZ gene under the control of the putative cloned enhancer sequence. To be considered reproducible, craniofacial expression patterns had to be observed in at least three embryos (Visel et al., 2007). Embryos were imaged using a Leica M205 FA Stereo Microscope coupled to a Leica DFC7000T digital camera or a Leica MZ16 microscope coupled to a Leica DFC420 digital camera.

Preparation of in situ hybridization probes

Murine cDNA probes for Sox9 corresponding to nucleotides 2443–3588 of RefSeq NM_011448 were TA-cloned from E14.5 C57BL/6 cDNA. The Sox9 probe was linearized and used to transcribe a DIG-labelled antisense probe using an in vitro transcription kit (Roche), following the manufacturer’s instructions.

In situ hybridization

In situ hybridization was performed as described previously (Welsh et al., 2018). Embryos (C57BL/6J) were collected at E9.5 and E11.5, dissected into cold PBS and fixed overnight in 4% PFA in PBS at 4°C. Embryos were dehydrated through a methanol-PBST (PBS + 0.1% Tween20) series and stored at −20°C in 100% methanol. For hybridization, embryos were bleached in 6% hydrogen peroxide in methanol for 1 hour, followed by rehydration, treatment with Proteinase K (10 μg/mL) for 10 minutes at 37°C, and then washed twice in ice cold PBST. Embryos were re-fixed in 4% PFA/0.2% glutaraldehyde in PBST for 20 minutes then washed twice in PBST. Embryos were then transferred to a prehybridization solution for 1 hour at 68°C followed by overnight incubation with the Sox9 riboprobe at 68°C. The following day, embryos were first rinsed with hybridization buffer, then washed 7 times (30 minutes each) with 2XSSC/50%formamide/0.1% Tween at 68°C. Subsequently, embryos were washed twice with TBST, twice with MABT, and blocked in 10% blocking solution (Roche) diluted in MABT for 1 hour at room temperature. Embryos were then incubated with anti-digoxigenin-AP antibody (Roche) at a concentration of 1:4000 in 1% blocking solution overnight at 4°C with rocking. The following day, embryos were washed extensively in TBST buffer at room temperature for 4 days. Colorimetric detection was performed with BM-Purple Chromogenic Reagent (Roche). Lastly, embryos were washed in PBST and post-fixed with 4% PFA and stored in PBS containing 0.05% sodium azide. Embryos were imaged using a Leica M205 FA Stereo Microscope and a Leica DFC7000T digital camera. A minimum of three embryos were processed per stage.

High resolution episcopic microscopy (HREM)

HREM was performed for a representative LacZ reporter embryo for human EC1.45. In HREM, the embedding medium is made highly fluorescent via addition of eosin. The embryo signal is detected as suppression of this fluorescence, in addition chromogenic substrates, for example those used in our LacZ reporter embryos are detected at distinct wavelengths from tissue morphology (Mohun and Weninger, 2012a). The embryo was embedded in JB-4 embedding solution as previously described (Mohun and Weninger, 2012b), sectioned with an SM2500 motorized microtome (Leica) and the block face was concurrently imaged using custom imaging apparatus. Visualization of HREM images was performed in Amira.

MicroCT and mandibular morphometry

Mouse embryos were collected at E18.5 of development and fixed in 4% PFA. MicroCT was performed using a Bruker Skyscan at 15um resolution, 0.25mm Al filter, 415ms exposure, 2k resolution and images were acquired every 0.5 degrees for 180 degrees.

QUANTIFICATION AND STATISTICAL ANALYSIS

Analysing and plotting luciferase data

Multiple independent luciferase experiments were performed with independent plasmid preparations and distinct cell passages or differentiations. The number of independent replicates for each luciferase assay is indicated in the figure legend, or in the plot as n=. For each biological replicate experiments, technical triplicate transfections were performed for each plasmid. Empty vector and SV40 enhancer were included in all experiments as negative and positive controls, respectively. When testing enhancer activity in chondrocytes, an additional COL2A1 intron1 enhancer was included which is active in both hCNCCs and chondrocytes. To plot multiple luciferase experiments on the same plot, linear regression was performed in R and residuals were plotted as boxplots. For ease of visualization, the median value for the empty vector control was set to 0. Significance of activity change were determined by t-test.

Fimo analysis was used to detect Coordinator motifs in the vertebrate min2 sequences, and the fimo scores were summed to estimate the relative number and match to consensus of Coordinator motifs for each species. The summed fimo score was compared to the luciferase signal and linear regression analysis performed to define the relationship between the two measures and ANOVA was used to determine the significance.

Analysing and plotting ddPCR data

Concentration of the two alleles of SOX9 or Sox9 was determined by RT-ddPCR using allele-specific HEX/FAM probes from the QuantaSoft Software. To plot allelic skew, concentration for the mutated allele was divided by the concentration for the wildtype allele and plotted as a ratio (red boxplots). For matched wildtype cells (green boxplots), the same ratio was plotted; this revealed the presence of an allelic skew even in unedited cells suggesting that polymorphisms between the two alleles can drive differences in allelic expression. For Figure 3C, samples were collected for two wildtype cell lines and two mutant EC1.45del/+ cell lines for two independent differentiations. For Figure S4F, samples were collected for five wildtype cell lines and five mutant EC1.25del/+ cell lines for three independent differentiations.

To calculate the % change in Sox9 expression upon mEC1.45 deletion, the mean allelic skew for the edited embryo facial tissues (mEC1.45del/+) was divided by the mean allelic skew from the matched wildtype tissues to account for the normal level of allelic skew detected in unedited wildtype samples. For Figure 6E, four wildtype and four mEC1.45del/+ embryos were dissected at E11.5. For Figure S7H, six wildtype and seven mEC1.45del/+ embryos were dissected at E9.5 (from two litters). Significant differences in allelic skew were determined by t-test.

For TWIST1 ChIP-ddPCR, the concentration of wildtype or Coordinator mutant plasmid were calculated for ChIP and input samples from the QuantaSoft Software using plasmid-specific HEX/FAM probes and primers spanning the min2 sequence and the plasmid backbone. A ratio was calculated between the TWIST ChIP and input samples, and normalised to 1 for the wildtype plasmid, revealing that TWIST1 binding was dramatically reduced on the Coordinator mutant plasmid.

External datasets

External next generation sequencing data were downloaded from the Sequence Read Archive SRA and analysed as below.

ChIP-seq analysis

ChIP sequencing reads were trimmed using skewer and aligned to the human genome (hg19) using bowtie2. For normalization, bedgraph files were generated from aligned bam files using bedtools genomecov with -scale option to normalize to 1 million reads. For visualization, bigwig files were generated using bedgraphToBigWig. Peak calling was performed using macs1.4, for replicate experiments the intersect tool from bedtools was used to identify peaks present in both replicates.

ATAC-seq analysis

Nextera adapter sequences were trimmed from ATAC sequencing reads using cutadapt and aligned to the human genome (hg19) using bowtie2. For normalization, bedgraph files were generated from aligned bam files using bedtools genomecov with -scale option to normalize to 1 million reads. For visualization, bigwig files were generated using bedgraphToBigWig. Peak calling was performed using macs1.4, for replicate experiments the intersect tool from bedtools was used to identify peaks present in both replicates.

RNA-seq analysis

RNA sequencing reads were trimmed using cutadapt and aligned to human gene models (hg38) using HISAT2. Read counts per gene were quantified using featureCounts, gene expression was normalized to fragments per million and plotted in R.

Capture-C analysis

Capture-C analysis was performed using bespoke analysis scripts outlined here: http://userweb.molbiol.ox.ac.uk/public/telenius/captureManual/oligofile.html. For comparison across samples, Capture-C profiles were plotted as the number of unique interactions per restriction fragment normalized to 10,000 interactions in cis. For quantification, normalized interactions in cis were extracted for DpnII fragments overlapping the feature of interest and plotted as a boxplot in R. Statistical significance was determined by t-test for successive stages of hCNCC differentiation.

10X Linked-Read analysis

10X Linked-Read sequencing data was analyzed using Long Ranger (longranger-2.2.2) and visualized using the 10X loupe genome browser.

Motif discovery

To identify de novo DNA sequence motifs enriched at TWIST1 binding sites, we called peaks from the TWIST1 ChIP-seq data using MACS2 and identified de novo motifs underlying these peaks using the SeqPos tool in Cistrome. Known motifs were identified using the TOM-TOM motif comparison tool (Gupta et al., 2007). Consensus DNA binding motifs were plotted using R package ggseqlogo and plotted using ggplot2.

MicroCT mandibular morphometry, quantification and plotting

Reconstruction of microCT data was performed using Bruker Recon software. Hemimandible, pre-maxilla, maxilla, palatine and occipital bones were segmented and landmarks were placed using Amira software (Ho et al., 2015; Welsh et al., 2018). x-y-z coordinates were imported into the R Geomorph package that was used to calculate Procrustes distances, calculate inter-landmark absolute distances and perform ANOVA statistics to determine significance. For boxplots representing inter-landmark distance measurements, a significant reduction in size of the mutant mandibles is labelled as a percentage of the wildtype mandible. Hotelling tests were performed using Procrustes transformed data, and the hotelling.test function in R. Plotting was performed in R. Numbers of embryos analysed is indicated in the figure legends.

Comparison to other datasets

To determine similarity of the epigenomic landscape at the SOX9 locus to that of other human cell-types, we downloaded over 50 publicly available H3K27ac ChIP-seq datasets from a number of cell-types (see (Prescott et al., 2015)). We compared these datasets to our in vitro hCNCC H3K27ac ChIP-seq data and determined that activity of the PRS-associated enhancer clusters EC1.45, EC1.35 and EC1.25 was restricted to hCNCCs as they were not marked in other cell-types analyzed.

Analysis of differentially methylated regions (DMRs)

A Neanderthal-specific significantly hypomethylated region was identified at genomic coordinates chr17:68668482–68674772 (hg19) from previously published datasets (Gokhman et al., 2014, 2020) that overlaps the EC1.45 enhancer cluster. A heatmap was generated representing DNA methylation at CpG dinucleotides spanning the DMR locus for one chimp (rib), seven anatomically modern humans (femur, crania, teeth), one Denisovan (finger) and two Neanderthal (femur and toe) bone samples. DNA methylation levels were determined by whole genome bisulfite sequencing (WGBS) or reconstructed using previously published methods which leverages spontaneous C → T deamination for ancient samples (Gokhman et al., 2014, 2020). For reconstructed DNA methylation maps, the C → T ratio was calculated for each CpG dinucleotide and then translated via a linear transformation (based on modern fully hypomethylated or fully methylated sites) to a methylation percentage. All samples were smoothed using a sliding window of 25 CpG dinucleotides. DNA methylation is marked in the heatmap from green (0% methylation) to red (100% methylation), while white indicates no information.

Supplementary Material

2

Supplemental Table S5. A list of primers and probes used for ChIP-ddPCR, RT-ddPCR and RT-qPCR analysis, primers for genotyping, and cloning guide RNAs and HDR templates, Capture-C probes, and coordinates of PRS locus enhancer loci, Related to STAR Methods.

3

KEY RESOURCES TABLE.

REAGENT or RESOURCE SOURCE IDENTIFIER
Antibodies
Rabbit polyclonal p300 (discontinued) – ChIP Santa Cruz Biotechnology Cat# sc-585, RRID:AB_2231120
Rabbit polyclonal H3K4me1 – ChIP Abcam Cat# ab8895, RRID:AB_306847
Rabbit polyclonal H3K27ac – ChIP Active Motif Cat# 39133, RRID:AB_2561016
Rabbit polyclonal H3K4me3 – ChIP Active Motif Cat# 39159, RRID:AB_2615077
Rabbit polyclonal CTCF – ChIP Cell Signalling Cat# 2899, RRID:AB_2086794
Rabbit polyclonal RAD21 – ChIP Abcam Cat# ab992, RRID:AB_2176601
Mouse monoclonal TWIST1 – ChIP Abcam Cat# ab50887, RRID:AB_883294
Chemicals, Peptides, and Recombinant Proteins
mTeSR Stem Cell Technologies Cat# 85850
Matrigel Growth Factor Reduced (GFR) Basement Membrane Matrix Corning Cat# 356231
ReLeSR Stem Cell Technologies Cat# 05872
Collagenase IV Gibco Cat# 17104019
DMEM/F12 1:1 medium, with L-glutamine; without HEPES GE Healthcare Cat# SH30271.FS
Neurobasal Medium Thermo Fisher Scientific Cat# 21103049
Gem21 NeuroPlex Supplement With Vitamin A Gemini Bio-Products Cat# 400-160
N2 NeuroPlex Supplement Gemini Bio-Products Cat# 400-163
Antibiotic-Antimycotic (100X) Gibco Cat# 15240062
GlutaMAX Supplement (100X) Life Technologies Cat# 35050061
Recombinant Human FGF-basic (154 a.a.) PeproTech Cat# 100-18B
Animal-Free Recombinant Human EGF Peprotech Cat# AF-100-15
Bovine Insulin Powder Gemini Cat# 700-112P
Human Plasma Fibronectin Purified Protein MilliporeSigma Cat# FC01010MG
Accutase Sigma-Aldrich Cat# A6964-100ML
Bovine Serum Albumin (BSA), Fraction V—Serum Replacement Grade Gemini Bio-Products Cat# 700-104P
Recombinant Human/Murine/Rat BMP-2 (E.coli derived) PeproTech Cat# 120-02
CHIR-99021 (CT99021) HCl Selleck Chemicals Cat# S2924
DMEM/High glucose with L-glutamine, sodium pyruvate Cytiva (formerly GE Healthcare) Cat# SH30243.01
Corning ITS+ Premix Universal Culture Supplement Corning Cat# 354352
Sodium pyruvate Life Technologies Cat# 11360070
Ascorbic acid Sigma-Aldrich Cat# A4403-100MG
Dexamethasone Thermo Fisher Scientific Cat# AAA1759003
Recombinant Human TGF-β3 PeproTech Cat# 100-36E
Y-27632 RHO/ROCK pathway inhibitor Stem Cell Technologies Cat# 72304
KnockOut DMEM Gibco Cat# 10829018
Alcian Blue 8GX Sigma-Aldrich Cat# A3157-10G
cOmplete, EDTA-free Protease Inhibitor Cocktail MilliporeSigma Cat# 11873580001
Blasticidin (Solution), 100 mg Invivogen Cat# NC9016621
QuickExtract DNA Extraction Solution Lucigen QE09050
Bacterial and Virus Strains
Adeno-flippase (Ad5CMVFlpO) Fred Hutchinson Cancer Research Center VVC-U of Iowa-530 (MTA)
Critical Commercial Assays
NEBNext Ultra II DNA Library Prep Kit for Illumina New England BioLabs Cat# E7645S
SeqCap EZ Accessory Kit v2 Roche Cat# 07145594001
SeqCap EZ Hybridization and Wash Kit Roche Cat# 05634261001
SeqCap EZ HE-Oligo Kit A Roche Cat# 06777287001
KAPA Library Quantification Kit Illumina platforms, qPCR Master Mix optimized for LightCycler 480 Kapa Biosystems Cat# KK4854
TRIzol Reagent Invitrogen Cat# 15596018
FuGENE 6 Promega Cat# E2691
NEBNext Multiplex Oligos for Illumina kit New England BioLabs Cat# E7335S
AMPure XP Beckman Coulter Cat# A63881
Dynabeads mRNA Purification Kit (for mRNA purification from total RNA preps) Invitrogen Cat# 61006
Dynabeads Protein G for Immunoprecipitation Invitrogen Cat# 10004D
NEBNext Multiplex Oligos for Illumina (Dual Index Primers Set 1) New England BioLabs Cat# E7600
Qubit dsDNA HS Assay Kit Invitrogen Cat# Q32854
SuperScript IV VILO Master Mix with ezDNase Enzyme Invitrogen Cat# 11766050
Dual-Luciferase Reporter Assay System Promega Cat# E1960
HiScribe T7 Quick High Yield RNA Synthesis Kit New England BioLabs Cat# E2050S
MEGAclear Transcription Clean-up kit Ambion Cat# AM1908
Deposited Data
ChIP-seq, ATAC-seq, RNA-seq and Capture-C data This paper GEO: GSE145327
H9 hESC 10X Genomics linked-read sequencing This paper Sequence Read Archive (SRA) BioProject PRJNA648128
Experimental Models: Cell Lines
Human: Female H9 human embryonic stem cells (hESCs) WiCell WA09, RRID: CVCL_9773
Experimental Models: Organisms/Strains
Mouse: C57BL/6J The Jackson Laboratory RRID: IMSR_JAX:000664
Mouse: FVB/NJ The Jackson Laboratory RRID: IMSR_JAX:001800
Mouse: B6.Cg-E2f1Tg(Wnt1-cre)2Sor/J The Jackson Laboratory, (Lewis et al., 2013) RRID: IMSR_JAX:022501
Mouse: B6.129S7-Sox9tm2Crm/J The Jackson Laboratory, (Akiyama et al., 2002) RRID: IMSR_JAX:013106
Mouse: FVB-mEC1.45del-founder1 This paper N/A
Mouse: FVB-mEC1.45del-founder2 This paper N/A
Oligonucleotides
Primers for qRT-PCR, CRISPR-Cas9, LNA probes, see Table S5 This paper N/A
RNA sequence: mEC1.45 upstream guide RNA1 (E1-45del_sg2U): AACAAGGTAGCGCCTCCTTA This paper N/A
RNA sequence: mEC1.45 downstream guide RNA1 (E1-45del_sg2D): ATATCAAGCACAAGGAGTGC This paper N/A
RNA sequence: mEC1.45 upstream guide RNA2 (CR50_sg3U): gatgttatggaaccttaagg This paper N/A
RNA sequence: mEC1.45 downstream guide RNA2 (CR53_sg3D): gaacaattacaaccaaacag This paper N/A
Recombinant DNA
Super piggyBac Transposase expression vector System Biosciences (SBI) Cat# PB210PA-1
Plasmid: pGL3-SV40_control Promega
Plasmid: pRL Promega
Plasmid: pGL3-noSV40-humanEC1.45 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_min1 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_min2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45 min1-2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.35_S1-2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.35_S1 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.35_S2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.25_S3-4-5-6 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.25_S3 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.25_S4 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.25_S5 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.25_S6 This paper N/A
Plasmid: pGL3-noSV40-humanCOL2A1enhancer This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45 min1-2_4XCoordinatorMutant This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_min1-2_1XCoordinatorMutant This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_min1-2_3XCoordinatorMutant This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_min1-mouse_min2 This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_min1-opossum_min2 This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_mini-platypus_min2 This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_min1-chicken_min2 This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_min1-lizard_min2 This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_min1-frog_min2 This paper N/A
Plasmid: pGL3-noSV40-EC1.45-human_min1-coelacanth_min2 This paper N/A
Plasmid: pGL3-noSV40-mouseEC1.45 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del1 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del3 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del4 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del5 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del6 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del7 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del8 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del9 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del10 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del11 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_del12 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#1 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#2 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#3 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#4 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#5 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#6 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_CoordinatorMutant#7 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_4XCoordinatorMutant#1-2-3-4 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_3XCoordinatorMutant#5-6-7 This paper N/A
Plasmid: pGL3-noSV40-humanEC1.45_p300peak1-2_7XCoordinatorMutant This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_humanEC1.45 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_humanEC1.35-S1-2 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_humanEC1.25-S3x3 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_humanEC1.25-S4x3 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_humanEC1.25-S5x3 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_humanEC1.25-S6x3 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_mouseEC1.45 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_mouseEC1.25-S3x3 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_mouseEC1.25-S5x3 This paper N/A
Plasmid: pHsp68-LacZ-P2A-tdTomato-coreinsulator_mouseEC1.25-S6x3 This paper N/A
Plasmid: Sox9 in situ plasmid From Ian Welsh N/A
Plasmid: pX458-dual-U6prom-sgRNA-EC1.45_CAGprom-Cas9-GFP This paper N/A
Plasmid: EC1.45-HAs_FRT-EF1a-mCherry-T2A-Blast-FRT-FRT3 This paper N/A
Plasmid: pX458-U6prom-sgRNA-EC1.25_CAGprom-Cas9-GFP This paper N/A
Plasmid: EC1.25-firstDonor_HAs-hUbCprom-eGFP-tCD8 This paper N/A
Plasmid: EC1.25-secondDonor_HAs_only This paper N/A
Software and Algorithms
CHOPCHOP (Labun et al., 2019) https://chopchop.cbu.uib.no/
Benchling Benchling [Biology Software]. (2017) https://www.benchling.com/
Bruker Recon software Bruker N/A
Amira software ThermoFisher Scientific https://www.thermofisher.com/us/en/home/industrial/electron-microscopy/electron-microscopy-instruments-workflow-solutions/3d-visualization-analysis-software/amira-life-sciences-biomedical.html
The R package for Statistical Computing R Core Team (2019), R version 3.6.0 https://www.r-project.org/
R Geomorph package (Adams and Otárola-Castillo, 2013) https://cran.r-project.org/web/packages/geomorph/index.html
R hotelling.test function https://cran.r-project.org/web/packages/Hotelling/Hotelling.pdf
skewer (Jiang et al., 2014) https://github.com/relipmoc/skewer
bowtie2 (Langmead and Salzberg, 2012) http://bowtie-bio.sourceforge.net/bowtie2/index.shtml
bedtools (Quinlan and Hall, 2010) https://github.com/arq5x/bedtools2
bedgraphToBigWig https://github.com/ENCODE-DCC/kentUtils
macs1.4 (Zhang et al., 2008) https://github.com/taoliu/MACS
cutadapt (Martin, 2011) https://cutadapt.readthedocs.io/en/stable/
HISAT2 (Kim et al., 2019) https://daehwankimlab.github.io/hisat2/
featureCounts (subread package) (Liao et al., 2014) http://subread.sourceforge.net/
CapSequm2 (Hughes et al., 2014) http://apps.molbiol.ox.ac.uk/CaptureC/cgi-bin/CapSequm.cgi
Long Ranger (longranger-2.2.2) 10X Genomics https://support.10xgenomics.com/genome-exome/software/pipelines/latest/what-is-long-ranger
Macs2 (macs2 2.1.1.20160309) (Zhang et al., 2008) https://github.com/taoliu/MACS
SeqPos (Cistrome Project) (Liu et al., 2011) http://cistrome.org/
ggseqlogo (Wagih, 2017) https://cran.r-project.org/web/packages/ggseqlogo/ggseqlogo.pdf
ggplot2 (Wickham, 2016) https://ggplot2.tidyverse.org/
UCSC (Kent et al., 2002) https://genome.ucsc.edu/
Samtools (v1.3.1) (Li et al., 2009) http://samtools.sourceforge.net/
Bioanalyzer 2100 Expert Software Agilent https://www.agilent.com/en/product/automated-electrophoresis/bioanalyzer-systems/bioanalyzer-software/2100-expert-software-228259
QuantaSoft Software BioRad https://www.bio-rad.com/en-us/sku/1864011-quantasoftsoftware-regulatory-edition?ID=1864011
Other
Leica imaging stereoscope Leica N/A
Covaris sonicator E220 Covaris N/A
Bruker Skyscan 1276 MicroCT (purchased with an NIH S10 Shared Instrumentation Grant, 1S10OD02349701, PI Timothy C. Doyle) Bruker https://www.bruker.com/products/microtomography/in-vivo-micro/skyscan-1276/overview.html
Veritas Microplate Luminometer Turner Biosystems N/A
Leica M205 FA Stereo Microscope coupled to a Leica DFC7000T digital camera Leica N/A
Leica MZ16 microscope coupled to a Leica DFC420 digital camera Leica N/A
QX200 Droplet Generator BioRad https://www.bio-rad.com/en-gu/sku/1864002-qx200-droplet-generator?ID=1864002
QX200 Droplet Reader BioRad https://www.bio-rad.com/en-us/sku/1864003-qx200-droplet-reader?ID=1864003
Chromium controller 10X Genomics https://www.10xgenomics.com/instruments/chromium-controller/
HiSeq4000 (purchased with funds from NIH under award number S10OD018220) Illumina N/A
NextSeq500 Illumina N/A
Amaxa 4D nucleofector Lonza https://bioscience.lonza.com/lonza_bs/US/en/Transfection/p/000000000000203684/4D-Nucleofector-Core-Unit
A&D Weighing EJ-120 Newton Portable Balance, 120g x 0.01g; 115V A&D Weighing N/A
LightCycler 480 Roche N/A
High Resolution Episcopic Microscope Tim Mohun lab N/A
Bioanalyzer Agilent https://www.agilent.com/en/product/automated-electrophoresis/bioanalyzer-systems/bioanalyzer-instrument/2100-bioanalyzer-instrument-228250

Highlights.

  • Extreme long-range enhancer clusters overlap PRS patient mutations at the SOX9 locus

  • PRS enhancers drive stage-specific SOX9 expression in the cranial neural crest

  • Mandible development has heightened sensitivity to perturbation of SOX9 gene dosage

  • Deletion of mouse EC1.45 leads to quantitative changes in mandible morphology

Acknowledgements

We acknowledge Stanford Functional Genomics Facility (SFGF) for next generation sequencing (NIH S10 Shared Instrumentation Grant S10OD018220), and Stanford Center for Innovation in In-Vivo Imaging (SCI3) for use of the Bruker Skyscan 1276 (NIH S10 Shared Instrumentation Grant 1S10OD02349701, PI Timothy C. Doyle). We thank Dr David Gokhman for sharing hominin DNA methylation data, Fabian Suchy for ddPCR advice, Dr Jaaved Mohammed for genomics advice, and Dr Marta Losa Llabata and Dr Antoine Zalc for help and expertise with mouse work. We thank Dr Seungsoo Kim and Dr Sahin Naqvi for critical reading of the manuscript. This work was supported by HHMI (JW), Pitch Johnson Fund (JW), U01 DE024430 FaceBase2 (JW and LS), Wellcome Trust Sir Henry Wellcome fellowship 106051/Z/14/Z (HL), NIDCR F32DE026950 fellowship (IW), MRC Clinician Scientist Fellowship MR/R008108 (JD), the Chan-Zuckerberg Biohub Chan-Zuckerberg Investigator (MHP), the Medical Research Council MC_UU_00016 and MR/N00969X/1 (DRH and JH), U01 DE024427 and R01 DE028599 (AV). Research conducted at the E.O. Lawrence Berkeley National Laboratory was performed under Department of Energy Contract DE-AC02-05CH11231, University of California (AV).

Footnotes

Declaration of Interests

Joanna Wysocka is a member of CAMP4 scientific advisory board and ISSCR board of directors. Jim R. Hughes and James O. J. Davies are founders and on the board of directors of Nucleome Theraputics.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Adams DC, and Otárola-Castillo E (2013). Geomorph: An r package for the collection and analysis of geometric morphometric shape data. Methods Ecol. Evol 4, 393–399. [Google Scholar]
  2. Akiyama H, Chaboissier MC, Martin JF, Schedl A, and De Crombrugghe B (2002). The transcription factor Sox9 has essential roles in successive steps of the chondrocyte differentiation pathway and is required for expression of Sox5 and Sox6. Genes Dev. 16, 2813–2828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Albertson RC, and Kocher TD (2006). Genetic and developmental basis of cichlid trophic diversity. Heredity (Edinb). 97, 211–221. [DOI] [PubMed] [Google Scholar]
  4. Amano O, Doi T, Yamada T, Sasaki A, Sakiyama K, Kanegae H, and Kindaichi K (2010). Meckel’s cartilageௗ: Discovery, embryology and evolution -overview of the specificity of meckel’s cartilage-. J. Oral Biosci. 52, 125–135. [Google Scholar]
  5. Amarillo IE, Dipple KM, and Quintero-Rivera F (2013). Familial Microdeletion of 17q24.3 Upstream of SOX9 Is Associated With Isolated Pierre Robin Sequence Due to Position Effect. Am. J. Med. Genet. Part A 161, 1167–1172. [DOI] [PubMed] [Google Scholar]
  6. Arzate-Mejía RG, Recillas-Targa F, and Corces VG (2018). Developing in 3D: the role of CTCF in cell differentiation. Development 145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baetens D, Mendonça BB, Verdin H, Cools M, and De Baere E (2017). Non-coding variation in disorders of sex development. Clin. Genet 91, 163–172. [DOI] [PubMed] [Google Scholar]
  8. Bagheri-Fam S, Barrionuevo F, Dohrmann U, Günther T, Schüle R, Kemler R, Mallo M, Kanzler B, and Scherer G (2006). Long-range upstream and downstream enhancers control distinct subsets of the complex spatiotemporal Sox9 expression pattern. Dev. Biol 291, 382–397. [DOI] [PubMed] [Google Scholar]
  9. Bahr C, von Paleske L, Uslu VV, Remeseiro S, Takayama N, Ng SW, Murison A, Langenfeld K, Petretich M, Scognamiglio R, et al. (2018). A Myc enhancer cluster regulates normal and leukaemic haematopoietic stem cell hierarchies. Nature 553, 515–520. [DOI] [PubMed] [Google Scholar]
  10. Bajpai R, Chen D. a, Rada-Iglesias A, Zhang J, Xiong Y, Helms J, Chang C-P, Zhao Y, Swigut T, and Wysocka J (2010). CHD7 cooperates with PBAF to control multipotent neural crest formation. Nature 463, 958–962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Benko S, Fantes J, Amiel J, Kleinjan D-J, Thomas S, Ramsay J, Jamshidi N, Essafi A, Heaney S, Gordon CT, et al. (2009). Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence. Nat. Genet 41, 359–364. [DOI] [PubMed] [Google Scholar]
  12. Beverdam A, Merlo GR, Paleari L, Mantero S, Genova F, Barbieri O, Janvier P, and Levi G (2002). Jaw transformation with gain of symmetry after Dlx5/Dlx6 inactivation: Mirror of the past? Genesis 34, 221–227. [DOI] [PubMed] [Google Scholar]
  13. Bi W, Huang W, Whitworth DJ, Deng JM, Zhang Z, Behringer RR, and de Crombrugghe B (2001). Haploinsufficiency of Sox9 results in defective cartilage primordia and premature skeletal mineralization. Proc. Natl. Acad. Sci 98, 6698–6703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bienstock D, Ayyala R, Eisig SB, and Perrino MA (2016). Evaluation of the Pierre Robin Sequence Mandible Using Computed Tomography. J. Oral Maxillofac. Surg 74, e52–e53. [Google Scholar]
  15. Bildsoe H, Loebel DAF, Jones VJ, Chen Y, Behringer RR, and Tam PPL (2009). Requirement for Twist1 in frontonasal and skull vault development in the mouse embryo. Dev. Biol 331, 176–188. [DOI] [PubMed] [Google Scholar]
  16. Bronner ME, and LeDouarin NM (2012). Development and evolution of the neural crest: An overview. Dev. Biol 366, 2–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, and Greenleaf WJ (2013). Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Calo E, and Wysocka J (2013). Modification of enhancer chromatin: what, how, and why? Mol. Cell 49, 825–837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chen W, Davidson EH, MacIsaac ZM, and Kumar A (2015). Mapping the Mandibular Lingula in Pierre Robin Sequence: A Guide to the Inverted-L Osteotomy. J. Craniofac. Surg 26, 1847–1852. [DOI] [PubMed] [Google Scholar]
  20. Cheung M, and Briscoe J (2003). Neural crest development is regulated by the transcription factor Sox9. Development 130, 5681–5693. [DOI] [PubMed] [Google Scholar]
  21. Chung MT, Levi B, Hyun JS, Lo DD, Montoro DT, Lisiecki J, Bradley JP, Buchman SR, Longaker MT, and Wan DC (2012). Pierre Robin sequence and Treacher Collins hypoplastic mandible comparison using three-dimensional morphometric analysis. J. Craniofac. Surg 23, 1959–1963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Chuong EB, Elde NC, and Feschotte C (2017). Regulatory activities of transposable elements: From conflicts to benefits. Nat. Rev. Genet 18, 71–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Corces MR, Trevino AE, Hamilton EG, Greenside PG, Sinnott-Armstrong NA, Vesuna S, Satpathy AT, Rubin AJ, Montine KS, Wu B, et al. (2017). An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods 14, 959–962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Davies JOJ, Telenius JM, McGowan SJ, Roberts NA, Taylor S, Higgs DR, and Hughes JR (2015). Multiplexed analysis of chromosome conformation at vastly improved sensitivity. Nat. Methods 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Day A, Dong J, Funari V. a., Harry B, Strom SP, Cohn DH, and Nelson SF (2009). Disease gene characterization through large-scale co-expression analysis. PLoS One 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Denas O, Sandstrom R, Cheng Y, Beal K, Herrero J, Hardison RC, and Taylor J (2015). Genome-wide comparative analysis reveals human-mouse regulatory landscape and evolution. 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Despang A, Schöpflin R, Franke M, Ali S, Jerkoviü I, Paliou C, Chan W-L, Timmermann B, Wittler L, Vingron M, et al. (2019). Functional dissection of the Sox9-Kcnj2 locus identifies nonessential and instructive roles of TAD architecture. Nat. Genet 51, 1263–1271. [DOI] [PubMed] [Google Scholar]
  28. Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, and Ren B (2012). Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Dixon JR, Jung I, Selvaraj S, Shen Y, Antosiewicz-Bourget JE, Lee AY, Ye Z, Kim A, Rajagopal N, Xie W, et al. (2015). Chromatin architecture reorganization during stem cell differentiation. Nature. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Foster JW (1996). Mutations in SOX9 cause both autosomal sex reversal and campomelic dysplasia. Acta Paediatr. Jpn. (Overseas Ed 38, 405–411. [DOI] [PubMed] [Google Scholar]
  31. Franke M, Ibrahim DM, Andrey G, Schwarzer W, Heinrich V, Schöpflin R, Kraft K, Kempfer R, Jerkoviü I, Chan W-L, et al. (2016). Formation of new chromatin domains determines pathogenicity of genomic duplications. Nature 538, 265–269. [DOI] [PubMed] [Google Scholar]
  32. Funato N, Kokubo H, Nakamura M, Yanagisawa H, and Saga Y (2016). Specification of jaw identity by the Hand2 transcription factor. Sci. Rep 6, 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Gokhman D, Lavi E, Prüfer K, Fraga MF, Riancho JA, Kelso J, Pääbo S, Meshorer E, and Carmel L (2014). Reconstructing the DNA Methylation Maps of the Neandertal and the Denisovan. Science (80-. ). 344, 523–528. [DOI] [PubMed] [Google Scholar]
  34. Gokhman D, Nissim-Rafinia M, Agranat-Tamir L, Housman G, García-Pérez R, Lizano E, Cheronet O, Mallick S, Nieves-Colón MA, Li H, et al. (2020). Differential DNA methylation of vocal and facial anatomy genes in modern humans. Nat. Commun 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gonen N, Futtner CR, Wood S, Garcia-Moreno SA, Salamone IM, Samson SC, Sekido R, Poulat F, Maatouk DM, and Lovell-Badge R (2018). Sex reversal following deletion of a single distal enhancer of Sox9. Science 360, 1469–1473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Gordon CT, Tan TY, Benko S, Fitzpatrick D, Lyonnet S, and Farlie PG (2009). Long-range regulation at the SOX9 locus in development and disease. J. Med. Genet 46, 649–656. [DOI] [PubMed] [Google Scholar]
  37. Gordon CT, Attanasio C, Bhatia S, Benko S, Ansari M, Tan TY, Munnich A, Pennacchio L. a., Abadie V, Temple IK, et al. (2014). Identification of Novel Craniofacial Regulatory Domains Located far Upstream of SOX9 and Disrupted in Pierre Robin Sequence. Hum. Mutat 35, 1011–1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Grant CE, Bailey TL, and Noble WS (2011). FIMO: Scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Guo Y, Xu Q, Canzio D, Shou J, Li J, Gorkin DU, Jung I, Wu H, Zhai Y, Tang Y, et al. (2015). CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function. Cell 162, 900–910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Gupta S, Stamatoyannopoulos JA, Bailey TL, and Noble WS (2007). Quantifying similarity between motifs. Genome Biol 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Herranz D, Ambesi-Impiombato A, Palomero T, Schnell SA, Belver L, Wendorff AA, Xu L, Castillo-Martin M, Llobet-Navás D, Cordon-Cardo C, et al. (2014). A NOTCH1-driven MYC enhancer promotes T cell development, transformation and acute lymphoblastic leukemia. Nat. Med 20, 1130–1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Ho T, Iwata J, Anh H, Grimes WC, Park S, Sanchez-lara PA, and Chai Y (2015). Integration of comprehensive 3D microCT and signaling analysis reveals differential regulatory mechanisms of craniofacial bone development. Dev. Biol 400, 180–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hughes JR, Roberts N, Mcgowan S, Hay D, Giannoulatou E, Lynch M, De Gobbi M, Taylor S, Gibbons R, and Higgs DR (2014). Analysis of hundreds of cis-regulatory landscapes at high resolution in a single, high-throughput experiment. Nat. Genet 46, 205–212. [DOI] [PubMed] [Google Scholar]
  44. Ikeda K, Uchida N, Nishimura T, White J, Martin RM, Nakauchi H, Sebastiano V, Weinberg KI, and Porteus MH (2018). Efficient scarless genome editing in human pluripotent stem cells. Nat. Methods 15, 1045–1047. [DOI] [PubMed] [Google Scholar]
  45. Jiang H, Lei R, Ding SW, and Zhu S (2014). Skewer: A fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics 15, 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, and Haussler a. D. (2002). The Human Genome Browser at UCSC. Genome Res 12, 996–1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kim D, Paggi JM, Park C, Bennett C, and Salzberg SL (2019). Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol 37, 907–915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Kurth I, Klopocki E, Stricker S, van Oosterwijk J, Vanek S, Altmann J, Santos HG, van Harssel JJT, de Ravel T, Wilkie AOM, et al. (2009). Duplications of noncoding elements 5’ of SOX9 are associated with brachydactyly-anonychia. Nat. Genet 41, 862–863. [DOI] [PubMed] [Google Scholar]
  49. Labun K, Montague TG, Gagnon JA, Thyme SB, and Valen E (2016). CHOPCHOP v2: a web tool for the next generation of CRISPR genome engineering. Nucleic Acids Res 44, W272–W276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Labun K, Montague TG, Krause M, Torres Cleuren YN, Tjeldnes H, and Valen E (2019). CHOPCHOP v3: Expanding the CRISPR web toolbox beyond genome editing. Nucleic Acids Res 47, W171–W174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Langmead B, and Salzberg SL (2012). Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Laugsch M, Bartusel M, Rehimi R, Alirzayeva H, Karaolidou A, Crispatzu G, Zentis P, Nikolic M, Bleckwehl T, Kolovos P, et al. (2019). Modeling the Pathological Long-Range Regulatory Effects of Human Structural Variation with Patient-Specific hiPSCs. Cell Stem Cell 24, 736–752.e12. [DOI] [PubMed] [Google Scholar]
  53. Lee YH, and Saint-Jeannet JP (2011). Sox9 function in craniofacial development and disease. Genesis 49, 200–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Lefebvre V, and Dvir-Ginzberg M (2016). SOX9 and the many facets of its regulation in the chondrocyte lineage. Connect. Tissue Res. 8207, 03008207.2016.1183667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lettice L. a., Heaney SJH, Purdie L. a., Li L, de Beer P, Oostra B. a., Goode D, Elgar G, Hill RE, and de Graaff E (2003). A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. Hum. Mol. Genet 12, 1725–1735. [DOI] [PubMed] [Google Scholar]
  56. Lewis AE, Vasudevan HN, O’Neill AK, Soriano P, and Bush JO (2013). The widely used Wnt1-Cre transgene causes developmental phenotypes by ectopic activation of Wnt signaling. Dev. Biol 379, 229–234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, and Durbin R (2009). The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Liao Y, Smyth GK, and Shi W (2014). FeatureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930. [DOI] [PubMed] [Google Scholar]
  59. Limberg MM, Zumhagen S, Netter MF, Coffey AJ, Grace A, Rogers J, Böckelmann D, Rinné S, Stallmeyer B, Decher N, et al. (2013). Non dominant-negative KCNJ2 gene mutations leading to Andersen-Tawil syndrome with an isolated cardiac phenotype. Basic Res. Cardiol 108, 1–15. [DOI] [PubMed] [Google Scholar]
  60. Liu T, Ortiz JA, Taing L, Meyer CA, Lee B, Zhang Y, Shin H, Wong SS, Ma J, Lei Y, et al. (2011). Cistrome: an integrative platform for transcriptional regulation studies. Genome Biol 12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Long HK, Prescott SL, and Wysocka J (2016). Ever-Changing Landscapes: Transcriptional Enhancers in Development and Evolution. Cell 167, 1170–1187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Lupiáñez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, Horn D, Kayserili H, Opitz JM, Laxova R, et al. (2015). Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Martin M (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal 17, 10–12. [Google Scholar]
  64. Martinez CM, McGee MD, Borstein SR, and Wainwright PC (2018). Feeding ecology underlies the evolution of cichlid jaw mobility. Evolution (N. Y). 72, 1645–1655. [DOI] [PubMed] [Google Scholar]
  65. Meloro C, Cáceres NC, Carotenuto F, Sponchiado J, Melo GL, Passaro F, and Raia P (2015). Chewing on the trees: Constraints and adaptation in the evolution of the primate mandible. Evolution (N. Y). 69, 1690–1700. [DOI] [PubMed] [Google Scholar]
  66. Minoux M, and Rijli FM (2010). Molecular mechanisms of cranial neural crest cell migration and patterning in craniofacial development. Development 137, 2605–2621. [DOI] [PubMed] [Google Scholar]
  67. Mohun TJ, and Weninger WJ (2012a). Episcopic three-dimensional imaging of embryos. Cold Spring Harb. Protoc. 7, 641–646. [DOI] [PubMed] [Google Scholar]
  68. Mohun TJ, and Weninger WJ (2012b). Embedding embryos for high-resolution episcopic microscopy (HREM). Cold Spring Harb. Protoc 7, 678–680. [DOI] [PubMed] [Google Scholar]
  69. Mori-Akiyama Y, Akiyama H, Rowitch DH, and de Crombrugghe B (2003). Sox9 is required for determination of the chondrogenic cell lineage in the cranial neural crest. Proc. Natl. Acad. Sci 100, 9360–9365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Osterwalder M, Barozzi I, Tissières V, Fukuda-Yuzawa Y, Mannion BJ, Afzal SY, Lee EA, Zhu Y, Plajzer-Frick I, Pickle CS, et al. (2018). Enhancer redundancy provides phenotypic robustness in mammalian development. Nature. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Paletta CE, Dehghan K, Hutchinson RL, and Klaw BA (1994). A fall of the base of the tongue considered as a new cause of nasopharyngeal respiratory impairment: Pierre robin sequence, a translation. Plast. Reconstr. Surg 93, 1301–1303. [PubMed] [Google Scholar]
  72. Parsons TE, Weinberg SM, Khaksarfard K, Howie RN, Elsalanty M, and Yu JC (2014). Craniofacial Shape Variation in Twist1 1 / 2 Mutant Mice. 833, 826–833. [DOI] [PubMed] [Google Scholar]
  73. Prescott SL, Srinivasan R, Marchetto MC, Grishina I, Narvaiza I, Selleri L, Gage FH, Swigut T, and Wysocka J (2015). Enhancer Divergence and cis-Regulatory Evolution in the Human and Chimp Neural Crest. Cell 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Qin Q, Xu Y, He T, Qin C, and Xu J (2011). Normal and disease-related biological functions of Twist1 and underlying molecular mechanisms. Nat. Publ. Gr 22, 90–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Quinlan AR, and Hall IM (2010). BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Rada-Iglesias A, Bajpai R, Prescott S, Brugmann S. a., Swigut T, and Wysocka J (2012). Epigenomic annotation of enhancers predicts transcriptional regulators of human neural crest. Cell Stem Cell 11, 633–648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Raia P, Boggioni M, Carotenuto F, Castiglione S, Di Febbraro M, Di Vincenzo F, Melchionna M, Mondanaro A, Papini A, Profico A, et al. (2018). Unexpectedly rapid evolution of mandibular shape in hominins. Sci. Rep 8, 1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Rathé M, Rayyan M, Schoenaers J, Dormaar JT, Breuls M, Verdonck A, Devriendt K, Poorten V Vander, and Hens, G. (2015). Pierre Robin sequence: Management of respiratory and feeding complications during the first year of life in a tertiary referral centre. Int. J. Pediatr. Otorhinolaryngol 79, 1206–1212. [DOI] [PubMed] [Google Scholar]
  79. Ren G, Jin W, Cui K, Rodrigez J, Hu G, Zhang Z, Larson DR, and Zhao K (2017). CTCF-Mediated Enhancer-Promoter Interaction Is a Critical Regulator of Cell-to-Cell Variation of Gene Expression. Mol. Cell 67, 1049–1058.e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Sanchez-Castro M, Gordon CT, Petit F, Nord AS, Callier P, Andrieux J, Guérin P, Pichon O, David A, Abadie V, et al. (2013). Congenital Heart Defects in Patients with Deletions Upstream of SOX9. [DOI] [PubMed] [Google Scholar]
  81. Schoenfelder S, and Fraser P (2019). Long-range enhancer–promoter contacts in gene expression control. Nat. Rev. Genet [DOI] [PubMed] [Google Scholar]
  82. Shen Y, Yue F, Mc Cleary DF, Ye Z, Edsall L, Kuan S, Wagner U, Dixon J, Lee L, Ren B, et al. (2012). A map of the cis-regulatory sequences in the mouse genome. Nature 488, 116–120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Spitz F (2016). Gene regulation at a distance: From remote enhancers to 3D regulatory ensembles. Semin. Cell Dev. Biol 57, 57–67. [DOI] [PubMed] [Google Scholar]
  84. Spokony RF, Aoki Y, Saint-Germain N, Magner-Fink E, and Saint-Jeannet J-P (2002). The transcription factor Sox9 is required for cranial neural crest development in Xenopus. Development 129, 421–432. [DOI] [PubMed] [Google Scholar]
  85. Suri S, Ross RB, and Tompson BD (2010). Craniofacial morphology and adolescent facial growth in Pierre Robin sequence. Am. J. Orthod. Dentofac. Orthop 137, 763–774. [DOI] [PubMed] [Google Scholar]
  86. Susarla SM, Vasilakou N, Kapadia H, Egbert M, Hopper RA, and Evans KN (2017). Defining mandibular morphology in Robin sequence: A matched case-control study. Am. J. Med. Genet. Part A 173, 1831–1838. [DOI] [PubMed] [Google Scholar]
  87. Tan TY, and Farlie PG (2013). Rare syndromes of the head and face-Pierre Robin sequence. Wiley Interdiscip. Rev. Dev. Biol 2, 369–377. [DOI] [PubMed] [Google Scholar]
  88. Tan TY, Kilpatrick N, and Farlie PG (2013). Developmental and genetic perspectives on pierre robin sequence. Am. J. Med. Genet. Part C Semin. Med. Genet 163, 295–305. [DOI] [PubMed] [Google Scholar]
  89. Tawil R, Ptacek LJ, Pavlakis SG, DeVivo DC, Penn AS, Özdemir C, and Griggs RC (1994). Andersen’s syndrome: PotassiumǦsensitive periodic paralysis, ventricular ectopy, and dysmorphic features. Ann. Neurol 35, 326–330. [DOI] [PubMed] [Google Scholar]
  90. Terhune CE, Robinson CA, and Ritzman TB (2014). Ontogenetic variation in the mandibular ramus of great apes and humans. J. Morphol 275, 661–677. [DOI] [PubMed] [Google Scholar]
  91. Terhune CE, Ritzman TB, and Robinson CA (2018). Mandibular ramus shape variation and ontogeny in Homo sapiens and Homo neanderthalensis. J. Hum. Evol 121, 55–71. [DOI] [PubMed] [Google Scholar]
  92. Trainor P. a., Melton KR, and Manzanares M (2003). Origins and plasticity of neural crest cells and their roles in jaw and craniofacial evolution. Int. J. Dev. Biol 47, 541–553. [PubMed] [Google Scholar]
  93. Uslu VV, Petretich M, Ruf S, Langenfeld K, Fonseca N. a, Marioni JC, and Spitz F (2014). Long-range enhancers regulating Myc expression are required for normal facial morphogenesis. Nat. Genet 46, 1–24. [DOI] [PubMed] [Google Scholar]
  94. Varshney GK, Pei W, Lafave MC, Idol J, Xu L, Gallardo V, Carrington B, Bishop K, Jones M, Li M, et al. (2015). High-throughput gene targeting and phenotyping in zebrafish using CRISPR/Cas9. Genome Res 25, 1030–1042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Villar D, Berthelot C, Aldridge S, Rayner TF, Lukk M, Pignatelli M, Park TJ, Deaville R, Erichsen JT, Jasinska AJ, et al. (2015). Enhancer Evolution across 20 Mammalian Species. Cell 160, 554–566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Visel A, Minovitsky S, Dubchak I, and Pennacchio LA (2007). VISTA Enhancer Browser - A database of tissue-specific human enhancers. Nucleic Acids Res 35, 88–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Volk AS, Davis MJ, Narawane AM, Abu-Ghname A, Dempsey RF, Lambert EM, Tran B, Wirthlin JO, and Buchanan EP (2020). Quantification of Mandibular Morphology in Pierre Robin Sequence to Optimize Mandibular Distraction Osteogenesis. Cleft Palate-Craniofacial J [DOI] [PubMed] [Google Scholar]
  98. Wagih O (2017). Ggseqlogo: A versatile R package for drawing sequence logos. Bioinformatics 33, 3645–3647. [DOI] [PubMed] [Google Scholar]
  99. Wagner T, Wirth J, Meyer J, Zabel B, Held M, Zimmer J, Pasantes J, Bricarelli FD, Keutel J, Hustert E, et al. (1994). Autosomal sex reversal and campomelic dysplasia are caused by mutations in and around the SRY-related gene SOX9. Cell 79, 1111–1120. [DOI] [PubMed] [Google Scholar]
  100. Welsh IC, Hart J, Brown JM, Hansen K, Rocha Marques M, Aho RJ, Grishina I, Hurtado R, Herzlinger D, Ferretti E, et al. (2018). Pbx loss in cranial neural crest, unlike in epithelium, results in cleft palate only and a broader midface. J. Anat 233, 222–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Wickham H (2016). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. [Google Scholar]
  102. Wilderman A, VanOudenhove J, Kron J, Noonan JP, and Cotney J (2018). High-Resolution Epigenomic Atlas of Human Embryonic Craniofacial Development. Cell Rep 23, 1581–1597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Wit E. De, Vos ESM, Holwerda SJB, Valdes-quezada C, Verstegen MJAM, Teunissen H, Splinter E, Wijchers PJ, Krijger PHL, and Laat W. De (2015). CTCF Binding Polarity Determines Chromatin Looping. Mol. Cell 1–9. [DOI] [PubMed] [Google Scholar]
  104. Wyganowska-Świątkowska and Przystańska A (2011). The Meckel’s cartilage in human embryonic and early fetal periods. Anat. Sci. Int 86, 98–107. [DOI] [PubMed] [Google Scholar]
  105. Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, Sandstrom R, Ma Z, Davis C, Pope BD, et al. (2014). A comparative encyclopedia of DNA elements in the mouse genome. Nature 515, 355–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Zeevaert R, Foulquier F, Dimitrov B, Reynders E, Van Damme-Lombaerts R, Simeonov E, Annaert W, Matthijs G, and Jaeken J (2009). Cerebrocostomandibular-like syndrome and a mutation in the conserved oligomeric Golgi complex, subunit 1. Hum. Mol. Genet 18, 517–524. [DOI] [PubMed] [Google Scholar]
  107. Zellner EG, Reid RR, and Steinbacher DM (2017). The Pierre Robin Mandible is Hypoplastic and Morphologically Abnormal. J. Craniofac. Surg 28, 1946–1949. [DOI] [PubMed] [Google Scholar]
  108. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nussbaum C, Myers RM, Brown M, Li W, et al. (2008). Model-based analysis of ChIP-Seq (MACS). Genome Biol 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Zhang Y, Blackwell EL, Mcknight MT, Knutsen GR, Vu WT, and Ruest LB (2012). Specific inactivation of Twist1 in the mandibular arch neural crest cells affects the development of the ramus and reveals interactions with hand2. Dev. Dyn 241, 924–940. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

2

Supplemental Table S5. A list of primers and probes used for ChIP-ddPCR, RT-ddPCR and RT-qPCR analysis, primers for genotyping, and cloning guide RNAs and HDR templates, Capture-C probes, and coordinates of PRS locus enhancer loci, Related to STAR Methods.

3

Data Availability Statement

The ChIP-seq, ATAC-seq and RNA-seq datasets generated in this study are available at Gene Expression Ombnibus (GEO) accession GSE145327 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE145327. The 10X Genomics linked-read sequencing is available at Sequence Read Archive (SRA) BioProject PRJNA648128.

RESOURCES