Skip to main content
Human Genetics and Genomics Advances logoLink to Human Genetics and Genomics Advances
. 2023 Aug 25;4(4):100234. doi: 10.1016/j.xhgg.2023.100234

Trio-based GWAS identifies novel associations and subtype-specific risk factors for cleft palate

Kelsey Robinson 1, Trenell J Mosley 1,14, Kenneth S Rivera-González 2, Christopher R Jabbarpour 2, Sarah W Curtis 1, Wasiu Lanre Adeyemo 3, Terri H Beaty 4, Azeez Butali 5, Carmen J Buxó 6, David J Cutler 1, Michael P Epstein 1, Lord JJ Gowans 7, Jacqueline T Hecht 8, Jeffrey C Murray 9, Gary M Shaw 10, Lina Moreno Uribe 11, Seth M Weinberg 12, Harrison Brand 13, Mary L Marazita 12, Robert J Lipinski 2, Elizabeth J Leslie 1,15,
PMCID: PMC10502411  PMID: 37719664

Summary

Cleft palate (CP) is one of the most common craniofacial birth defects; however, there are relatively few established genetic risk factors associated with its occurrence despite high heritability. Historically, CP has been studied as a single phenotype, although it manifests across a spectrum of defects involving the hard and/or soft palate. We performed a genome-wide association study using transmission disequilibrium tests of 435 case-parent trios to evaluate broad risks for any cleft palate (ACP) (n = 435), and subtype-specific risks for any cleft soft palate (CSP), (n = 259) and any cleft hard palate (CHP) (n = 125). We identified a single genome-wide significant locus at 9q33.3 (lead SNP rs7035976, p = 4.24 × 10−8) associated with CHP. One gene at this locus, angiopoietin-like 2 (ANGPTL2), plays a role in osteoblast differentiation. It is expressed both in craniofacial tissue of human embryos and developing mouse palatal shelves. We found 19 additional loci reaching suggestive significance (p < 5 × 10−6), of which only one overlapped between groups (chromosome 17q24.2, ACP and CSP). Odds ratios for the 20 loci were most similar across all 3 groups for SNPs associated with the ACP group, but more distinct when comparing SNPs associated with either subtype. We also found nominal evidence of replication (p < 0.05) for 22 SNPs previously associated with orofacial clefts. Our study to evaluate CP risks in the context of its subtypes and we provide newly reported associations affecting the broad risk for CP as well as evidence of subtype-specific risks.

Keywords: palate, palatogenesis, osteogenesis, genome sequencing, GWAS, subtype, phenotypic heterogeneity, hard palate, soft palate


Cleft palate is a common birth defect for which few genetic causes are known. Here, we identify a locus associated with cleft hard palate at 9q33.3 near ANGPTL2, which is expressed in the developing mouse palate and plays a role in bone formation.

Introduction

Orofacial clefts (OFCs) are the most common craniofacial congenital anomalies in humans, occurring at a birth prevalence of ∼1 in 1,000 live births.1,2 Prognosis is favorable with surgical intervention, although individuals with OFCs face many healthcare challenges such as multiple corrective surgeries, abnormal dentition, hearing and speech problems, and increased morbidity and mortality throughout life.1,3 As such, these anomalies are associated with increased healthcare costs and long-term psychosocial burdens for individuals and their families.4

OFCs are typically classified into groups including cleft lip (CL), cleft lip with cleft palate (CLP), and cleft palate (CP). Etiologically, cleft lip with or without a cleft palate (CL/P) is considered anatomically and epidemiologically distinct from CP, although both are highly heritable,5,6 with estimates up to 90% for both CL/P and CP.7 However, compared with dozens of known risk loci for CL/P,8,9 common risk variants for CP remain largely undiscovered. Large-scale studies of CP, including seven genome-wide association studies (GWASs),10,11,12,13,14,15,16 have mostly identified variants occurring either at low frequency or in specific populations. For example, a missense variant in GRHL3 occurs at ∼3% only in Europeans,10 a variant near CTNNA2 occurs at ∼1.5% in an African population,14 and a variant in an enhancer region of IRF6 occurs almost exclusively in Finnish and Estonian Europeans.16

Historically, CP has been evaluated as a single group, but it encompasses a phenotypic spectrum including overt clefts of the hard and/or soft palate and submucous CP, which may arise due to genetic differences among subtypes. Development of the secondary palate in humans begins around gestational week 6. Initially, there are paired vertical outgrowths from the maxillary processes called the palatal shelves. These shelves first flank the tongue, and then elevate horizontally and begin to grow toward the midline as the tongue descends. At midline, the palatal shelves meet and fuse to each other as well as to the primary palate and nasal septum. This is complete by week 12 of gestation, resulting in a complete separation of the oral and nasal cavities. Throughout this process, there are complex and distinct molecular patterns across the anterior-posterior axis as the anterior region develops into the bony, hard palate, and the posterior region becomes the muscular, soft palate.17,18 The phenotypic spectrum of CP is hypothesized to be caused, in part, by perturbation of genes expressed in specific compartments. For example, in mice, Shox2 expression is observed only in the anterior region of the palatal shelves, and knockout of this gene results in a cleft of the anterior (hard) palate only with normal posterior palatal fusion. In human embryos at 42 days post conception, SHOX2 is expressed in the anterior palatal shelves, and absent in the posterior region. In contrast, expression of Msx1 is restricted to the anterior portion of the palate shelves in mice, although knockout results in a complete cleft due to a proliferation defect.19

Given these findings, we hypothesized additional CP risk variants may be detected in a subtype-specific manner. To investigate this further, we performed GWAS of three groups of case-parent trios via transmission disequilibrium tests (TDTs). These groups included all cases of CP regardless of subtype, cases involving the soft palate (i.e., hard and soft palate, soft palate only, and submucous CP), and cases involving the hard palate (i.e., hard and soft palate plus hard palate only).

Materials and methods

Study population and phenotyping

The study population comes from multiple domestic and international sites where recruitment and phenotypic assessment occurred following institutional review board approval for each local recruitment site and the coordinating center (University of Iowa, University of Pittsburgh, and Emory University). We assembled a collection of 435 case-parent trios ascertained on proband affection status (e.g., CP) (Table S1). The majority of trios (96%) consist of affected CP probands with unaffected parents. Although probands/trios were not excluded based on additional clinical features consistent with a syndromic diagnosis, only 45 trios were classified as possibly or probably syndromic based on a reported presence of additional major or minor clinical features. Trios represent all major ancestry groups affected by CP including those with European ancestry (recruited from Spain, Turkey, Hungary, the United States), as well as understudied populations from Latin America (Puerto Rico, Argentina), Asia (China, Singapore, Taiwan, the Philippines), and Africa (Nigeria, Ghana). All probands and parents were assessed for the presence of a CP with ∼2/3 of the assembled samples undergoing additional phenotyping to assess the location and severity of the CP. Here, we designate these probands as having a cleft of the hard and soft palate (n = 82), cleft of the hard palate only (n = 43), cleft of the soft palate only (n = 152), and submucous CP (n = 25). For the purposes of analysis, submucous CP was grouped with cleft soft palate.

Sample preparation and whole-genome sequencing

Whole-genome sequencing was performed at the Center for Inherited Disease Research (CIDR) at Johns Hopkins University (Baltimore, MD). Prior to sequencing, samples were tested for adequate quantity and quality of genomic DNA using a Fragment Analyzer system and were processed with an Illumina InfiniumQCArray-24v1-0 array to confirm sex, relatedness, and known duplicates. For each sample, 500–750 ng of genomic DNA was sheared to 400–600 bp fragments, then processed with the Kapa Hyper Prep kit for End-Repair, A-Tailing, and Ligation of IDT (Integrated DNA Technologies) unique dual-indexed adapters according to the Kapa protocol to create a final PCR-free library.

A NovaSeq 6000 platform using 150 bp paired-end runs was used for sequencing followed by base calling through the Illumina Real-Time Analysis software (version 3.4.4). Files were demultiplexed from binary format (BCL) to individual fastq files with Illumina Isas bcl2fastq (version 1.37.1) and aligned to the human hg38 reference sequence. The DRAGEN Germline version 3.7.5 pipeline on the Illumina BaseSpace Sequence Hub platform was used for alignment, variant calling, and quality control, which produced single-sample VCF files. The DRAGEN contamination detection tool was used to check for any cross-human sample contamination. Genotype concordance with existing array-based genotypes was performed using CIDRSeqSuite (version 7.5.0), and genotype concordance checks among replicate samples was performed in Picard GenotypeConcordance (Picard 2019). Following data quality steps and confirmation of adequate coverage (at least 80% of the genome at 20× or autosomal coverage at 30×), joint variant calling was performed, generating a multi-sample VCF file.

Quality control

Variants aligning outside of standard chromosomes (1–22, X, Y), those with a filter flag, and variants with a minor allele count of <2 were removed. Genotypes with a quality score of <20 or a read depth of <10 were set to missing, and sites with missingness values of >10% were subsequently filtered out. Sample-level quality control metrics included transition/transversion ratio, silent/replacement rate, and heterozygous/homozygous ratio; outlier samples with values outside of 3 standard deviations from the cohort mean were discarded. Samples with high missing data (>5% missing) were removed.

Principal-component analysis

A principal-component (PC) analysis was performed for probands only using PLINK (version 2.0) to determine genetic ancestry within our cohort. Variants were excluded if the minor allele frequency (MAF) < 15% or if any site was missing a genotype (missingness > 0%). We performed linkage disequilibrium (LD) pruning for R2 > 0.1 prior to analysis. Ultimately, there were 67,584 variants for which we generated 15 PCs. After visualization of PC plots, PCs 1–3 were used to group by ancestry (Figures S1 and S2A).

Statistical analysis

We performed TDTs to statistically analyze common variant associations with CP. The TDT, originally described by Spielman et al.,20 tests for the rate of transmission of the minor allele to an affected proband using McNemar’s test (i.e., a modified chi-squared test for paired data). Because it tests transmission rather than allele frequency, it is robust to population stratification; however, it is only informative for sites at which parents are heterozygous for a variant. As differences in heterozygosity between populations may mask signals, we also performed a chi-squared test of homogeneity on our three CP groups to verify there were no significant differences in population makeup. Following filtering steps described above, the multisample VCF was imported into PLINK (version 1.90b53). Trios for which all individuals had a genotype missingness rate <5% and a Mendelian error rate <2% were included. Variants were included if they met the following criteria: MAF of ≥3%, Mendelian error rate <0.1%, Hardy-Weinberg exact test p value of >1 × 10−6, and missingness rate of <5%. Results were considered genome-wide significant at p < 5 × 10−8, and suggestive of significance at p < 5 × 10−6. We report our odds ratio (OR) in reference to the alternate allele, and list the effect allele as that which increases risk for CP. Following TDT output, we applied FINEMAP 21 to our genome-wide significant locus. In brief, FINEMAP uses a stochastic shotgun search to calculate the posterior probability of SNP association with disease based on effect size (for which we used the natural log of the OR), MAF, and an LD matrix (generated in PLINK). We ran FINEMAP on SNPs within 1 Mb in either direction of the lead SNP using default settings.

DECIPHER variants

We queried the DECIPHER database 22 for copy number variants (CNVs) affecting ANGPTL2 for individuals with phenotypes related to both palate and limb abnormalities. Terms included for palatal phenotypes were: CP, high palate, narrow palate, narrow mouth, and micrognathia. Terms included for limb phenotypes were: two- to three-toe syndactyly, arachnodactyly, camptodactyly, long toe, abnormality of finger, tapered finger, upper limb undergrowth, and short foot. We then compared the rate of these phenotypes in individuals with CNVs to the general population based on the EUROCAT prevalence data using a two tailed Fisher’s exact test.

Animal studies and gene expression assays

Animal studies were conducted in strict accordance with recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. The protocol was approved by the University of Wisconsin School of Veterinary Medicine Institutional Animal Care and Use Committee (protocol no. 13-081.0). C57BL/6J mice were purchased from The Jackson Laboratory and housed in rooms maintained at 22°C ± 2°C and 30%–70% humidity on a 12 h dark cycle. Mice were fed Irradiated Soy Protein-Free Extruded Rodent Diet (catalog no. 2920x; Envigo Teklad Global) until day of plug, when dams received Irradiated Teklad Global 19% Protein Extruded Rodent Diet (catalog no. 2919; Envigo Teklad Global). For timed matings, one or two nulliparous female mice were placed with a single male mouse for 1–2 h and then examined for copulation plugs. The beginning of the mating period was designated as gestational day 0 (GD0), and pregnancy was confirmed by assessing weight gain between GD7 and GD10, as described previously.23 Dams were euthanized by carbon dioxide inhalation followed by cervical dislocation between GD10 and GD14.5 ± 1 h for embryo collection. One cohort of embryos collected for in situ hybridization assays were dissected in PBS and fixed in 4% paraformaldehyde for 18 h. Embryos subsequently underwent graded dehydration (1:3, 1:1, 3:1, v/v) into 100% methanol and were stored at −20°C indefinitely. Riboprobes were synthesized with gene-specific primers (Table S2), and in situ hybridization was performed as described previously.24,25 Embryos were subsequently embedded in 4% agarose gel and cut in sections (130 μM for head, 60 μM for limb) using a vibrating microtome. Images were captured using a MicroPublisher 5.0 camera (QImaging) mounted on an Olympus SZX-10 stereomicroscope. Another cohort of embryos was generated for quantitative gene expression analysis. Embryos were collected and microdissected in PBS, and enzymatic separation and isolation of the mesenchyme from maxillary process (GD10–GD12) or palatal shelf tissue (GD13–GD14) was performed as described previously.26,27 RNA was isolated using the QIAGEN RNeasy Mini Kit with on-column DNase I digestion according to the manufacturer’s recommendations. cDNA was synthesized from 100 ng of total RNA using the GoScript Reverse Transcription Reaction Kits (Promega). Singleplex quantitative real-time PCR was performed using SSoFast EvaGreen Supermix (Bio-Rad) on a Bio-Rad CFX96 Real-Time PCR Detection System (Bio-Rad). Real-time qPCR primers were designed using PrimerQuest (IDT), and sequences are listed in Table S3. Target gene specificity was confirmed using the National Center for Biotechnology Information Primer Basic Local Alignment Search Tool (NCBI Primer-BLAST). Gapdh was used as the housekeeping gene, and analyses were conducted with the 2−ΔΔCt method.

Replication of previously published SNPs

We searched for previously published SNPs associated with OFCs using the NHGRI-EBI GWAS catalog,28 which reports any SNPs with p values less than 1 × 10−5. We initially identified 202 SNPs from GWAS data; however, after filtering for duplicates (i.e., SNPs reported in multiple studies) there were 166 SNPs of interest. When reporting the p value for duplicated SNPs, we chose the most significant value. We then evaluated these variants for association with CP or CP subtypes in our current dataset and found 139 SNPs with data in at least one analysis. When comparing the two datasets, we reported the effect allele as the allele with increased OR as found in our current study.

Results

GWAS of any CP type

We performed TDT using all 435 CP case-parent trios (Table S1) for 6,946,419 variants (Figure S3). In the combined analysis of all trios with CP (hereafter referred to as any CP, or ACP), no loci reached genome-wide significance (p < 5 × 10−8), although there were 10 loci with at least one SNP surpassing a suggestive threshold of p < 5 × 10−6 (Table 1). Several GWASs of CP have been published to date10,11,12,13,14,15,16; however, we did not find any of these previous loci beyond nominal significance in this study, as further discussed below (Table S4).

Table 1.

Suggestive and significant loci from any CP type and subtype-specific GWAS

Locus (nearest gene) Lead SNP Ref/Alta ACP OR (95% CI) ACP p value CSP OR (95% CI) CSP p value CHP OR (95% CI) CHP p value gnomAD v.3.1.2 alternate allele frequencies
EAS (%) SAS (%) AFR (%) EUR (%)
ACP

1q41 (LYPLAL1) rs59611530 G/GAAT 1.75 (1.41–2.17) 3.11 × 10−7 1.76 (1.34–2.32) 4.07 × 10−5 2.27 (1.51–3.39) 4.48 × 10−5 68.2 61.4 39.3 49.5
2p21 (MTA3) rs57081889 C/T 1.65 (1.34–2.04) 2.55 × 10−6 1.55 (1.19–2.02) 1.15 × 10−3 2.07 (1.34–3.20) 8.49 × 10−4 22.6 37.3 51.6 36.7
4p14 (UBEK2, PDS5A) rs10000967 T/C 2.60 (1.76–3.84) 6.07 × 10−7 2.63 (1.55–4.46) 1.90 × 10−4 2.54 (1.34–4.82) 3.19 × 10−3 7.7 14.6 9.6 11.3
5q14.1 (HOMER1) rs79156100 T/C 2.81 (1.80–4.39) 2.32 × 10−6 3.00 (1.70–5.28) 6.33 × 10−5 1.83 (0.91–3.70) 8.64 × 10−2 12.2 4.0 0.1 0.5
9p24.3 (DMRT2) rs12002920 G/T 2.96 (1.88–4.66) 8.45 × 10−7 3.15 (1.69–5.89) 1.39 × 10−4 2.75 (1.22–6.18) 1.06 × 10−2 2.7 3.2 18.7 5.4
11q24.2 (OR8B8) rs375612889 C/T 0.31 (0.19–0.52) 2.66 × 10−6 0.49 (0.27–0.87) 1.26 × 10−2 0.30 (0.13–0.71) 3.49 × 10−3 1.7 13.0 3.9 9.0
12q21.1 rs7955287 G/A 1.90 (1.47–2.46) 6.34 × 10−7 2.18 (1.53–3.10) 9.33 × 10−6 1.78 (1.07–2.97) 2.45 × 10−2 88.8 59.6 69.8 62.2
14q22.1 (FRMD6-AS2) rs10431684 C/A 1.68 (1.35–2.02) 3.42 × 10−6 1.83 (1.35–2.48) 8.17 × 10−5 1.73 (1.15–2.59) 7.22 × 10−3 70.8 67.2 59.1 52.9
17p13.2 (ANKFY1) rs58695167 G/A 1.70 (1.35–2.13) 4.26 × 10−6 1.67 (1.26–2.22) 3.44 × 10−4 1.82 (1.19–2.78) 5.11 × 10−3 76.4 64.5 23.6 61.5
17q24.2 (ARSG) rs75850252 C/T 0.50 (0.37–0.68) 3.86 × 10−6 0.42 (0.28–0.62) 5.99 × 10−6 0.42 (0.23–0.79) 5.58 × 10−3 14.3 2.4 25.2 1.5

CSP

1q32.1 (LINC00862) rs61824892 C/T 1.58 (1.19–2.11) 1.42 × 10−3 2.47 (1.66–3.68) 4.17 × 10−6 1.10 (0.67–1.78) 7.10 × 10−1 17.4 8.8 2.8 11.6
4p15.1 (LINC02497) rs61795400 T/C 1.76 (1.20–2.58) 3.54 × 10−3 3.77 (2.05–6.95) 4.83 × 10−6 2.50 (0.97–6.44) 4.95 × 10−2 11.7 6.3 10.3 3.9
9q22.31 (FAM120A) rs41274384 G/A 2.14 (1.38–3.32) 5.41 × 10−4 4.27 (2.22–8.24) 2.28 × 10−6 1.30 (0.57–2.97) 5.32 × 10−1 7.1 4.1 0.6 3.3
11q14.3 (DISC1FP1) rs11019136 C/T 1.58 (1.27–1.97) 3.05 × 10−5 1.97 (1.48–2.63) 2.88 × 10−6 1.68 (1.12–2.52) 1.20 × 10−2 30.0 23.1 25.8 38.3
12q13.11 (LOC105369747) rs855134 T/C 0.75 (0.61–0.92) 6.34 × 10−3 0.53 (0.41–0.70) 3.92 × 10−6 0.95 (0.65–1.38) 7.74 × 10−1 41.0 37.4 20.5 35.5
15q26.2 (SPATA8-AS1) rs36062094 T/C 1.65 (1.32–2.05) 7.70 × 10−6 1.97 (1.47–2.64) 3.42 × 10−6 1.75 (1.16–2.64) 6.66 × 10−3 32.0 17.5 20.5 22.5
17q24.2 (ARSG, WIPI1) rs3785607 T/C 0.49 (0.35–0.69) 2.07 × 10−5 0.35 (0.23–0.54) 9.44 × 10−7 0.50 (0.27–0.93) 2.54 × 10−2 17.0 0.9 0.2 0.1
17q25.3 (RPTOR) rs1468036 C/G 1.74 (1.15–2.64) 7.96 × 10−3 4.67 (2.27–9.59) 3.82 × 10−6 8.00 (1.84–34.79) 9.67 × 10−4 0.2 5.8 36.6 2.5

CHP

5q15 (LINC02234) rs72781553 C/T 0.75 (0.54–1.04) 8.12 × 10−2 0.71 (0.46–1.08) 1.08 × 10−1 0.21 (0.10–0.44) 3.82 × 10−6 5.3 14.2 16.9 6.7
9q33.3b (RALGPS1, ANGPTL2) rs7035976 T/C 0.74 (0.61–0.89) 1.79 × 10−3 0.62 (0.48–0.80) 2.31 × 10−4 0.32 (0.21–0.49) 4.24 × 10−8 38.8 35.9 71.5 48.7

EAS, East Asian; SAS, South Asian; AFR, African; EUR, European.

a

Odds ratio reported for the alternate allele.

b

Genome-wide significance (p < 5 × 10−8).

Subtype-specific GWAS

CP is phenotypically heterogeneous, where clefts can occur in the hard palate (the bony, anterior portion) and/or the soft palate (the muscular, posterior portion). Given that such phenotypic distinctions arise from different developmental origins, we hypothesized some of the relative lack of associated SNP signals could be attributed to this underlying heterogeneity. To determine if CP subtypes were associated with unique loci, we performed TDTs on two subgroups: “any cleft of the soft palate,” which included 259 trios with clefts of the hard and soft palate, soft palate only, or submucous cleft (CSP) (Figure S4), and “any cleft of the hard palate,” which included 125 trios with hard and soft palate or hard palate only (CHP) (Figure S5). Those with clefts of both the hard and soft palate were included in both analyses because that group may be etiologically heterogeneous and may have hard palate and/or soft palate risk factors.

The CSP analysis included 7,286,217 variants. There were no loci reaching genome-wide significance, although there were 8 loci of suggestive significance (Table 1). Our CHP analysis included 7,337,001 variants, and we identified a genome-wide locus on chromosome 9q33.3 (Figures 1A and 1B) spanning the genes RALGPS1 and ANGPTL2, as well as one locus of suggestive significance (Table 1). We used FINEMAP21 to perform statistical fine-mapping of the 9q33.3 locus, which identified three SNPs within the credible set for which there was 100% confidence of at least one is associated with disease: rs2417050, rs777676, and rs12350252 (Figure 1C; Table 2).

Figure 1.

Figure 1

Genome-wide significant locus at 9q33.3 spans craniofacially expressed gene ANGPTL2 and a craniofacial superenhancer

(A) Regional association plot for 9q33.3. The labeled SNPs were identified by FINEMAP with 100% confidence of belonging to the credible set of SNPs associated with disease.

(B) UCSC genome browser tracks for craniofacial-specific gene expression and regulatory regions.

(C) Zoomed-in view of the region with high density of SNPs in linkage disequilibrium (LD) with SNPs labeled in (A). Point color corresponds to LD (r2) with rs2417050 across all populations. For browser tracks, yellow indicates an enhancer region (darker shades represent stronger elements), green indicates active transcription, and red indicates a transcription start sites. CNCC, cranial neural crest cells; CS, Carnegie stage.

Table 2.

Credible sets from fine-mapping and probability for inclusion of listed SNPs

Set (log10(BF)) 1 (393)a 2 (392)a 3 (333)a 4 (260) 5 (105)
Chr:BP rsID:Ref/Alt 9:127166841 rs2417050:A/G 1 9:127153352 rs12350252:C/A 1 9:126929634 rs777676:T/A 1 9:126617239 rs10819189:G/A 0.25 9:126994978 rs144622817:A/G 0.33
N/A N/A N/A 9:126623752 rs35622144:ACCT/A 0.25 9:126953456 rs7854363:C/T 0.33
N/A N/A N/A 9:126615117 rs2235057:A/G 0.25 9:126927357 rs78702307:G/A 0.33
N/A N/A N/A 9:126608676 rs10819188:G/A 0.25 N/A

N/A, not applicable.

a

SNPs are labeled in Figure 1A.

We next compared the CSP and CHP analyses with each other and with the ACP group to determine to what extent these loci were associated with a specific CP subtype. The only overlap in suggestive loci was 17q24.2 (near ARSG), shared by the ACP (lead SNP rs75850252, 2.89 × 10−6) and CSP analyses (lead SNP rs3785607, p = 9.44 × 10−7). The 17q24.2 signal was driven by the CSP group but also showed nominal evidence of association in CHP (p = 0.006). Due to the overlap in samples, however, we cannot distinguish between an association of this locus with any type of CP or with a cleft of both the hard and soft palate. There were additional regions, such as the 1q41 locus shown in Figure 2A, in which the association patterns were similar between ACP, CSP, and CHP, but did not reach the suggestive threshold in the subtype analyses. In contrast, we observed loci with stronger association signals in one subtype versus the other or in ACP. For example, the top locus in CHP, 9q33.3, reached genome-wide significance but this signal was less significant in CSP (p = 2.28 × 10−4) and ACP (p = 1.79 × 10−3) (Figure 2B), suggesting that this signal was driven by clefts of the hard palate (either with or without cleft soft palate). Similarly, the 9q22.31 locus identified in CSP is less significant in ACP (p = 5.41 × 10−4) and was essentially absent when looking at CHP (p = 0.53) (Figure 2C), suggesting that it is driven exclusively by cleft soft palate.

Figure 2.

Figure 2

Regional association plots illustrate differences between groups

(A) The 1q41 locus spanning LYPAL1 with index SNP rs10779347, (B) the 9p33.3 locus spanning RALGPS1 with index SNP rs7035976, and (C) the 9q22.31 locus spanning FAM120A with index SNP 4127438 demonstrating similar association patterns (A) and subtype specific associations (B and C). Point color corresponds to LD (r2) and the blue lines represent linkage block boundaries.

We then compared the estimated ORs for the lead SNPs in each region showing suggestive significance for the three subgroups (Figure 3) to identify subtype-specific risks suggested by comparisons of p values. As predicted, the ORs were very similar across all groups for loci identified in the ACP analysis. However, for loci identified in either CSP or CHP, the differences in ORs were more pronounced. Although most confidence intervals overlap between CSP and CHP, there was some evidence for subtype-specific effects. For the SNPs identified from the CHP analysis, there was no overlap in the range of effects for the 5q15 locus in CHP and CSP (which also contained 1), indicating that this locus is specific to clefts involving the hard palate. It is less clear for the SNPs identified in the CSP analysis. For 4 of the 10 SNPs, the confidence interval for CHP contained 1, which allows the possibility of no effect in that group. In addition, all of the confidence intervals overlapped, indicating that these loci may not be subtype specific. However, taken altogether, these findings suggest that there are subtype-specific genetic risks for CP.

Figure 3.

Figure 3

Comparison of odds ratios for any suggestive loci demonstrates subtype-specific effects

Loci associated with any cleft palate convey similar ORs for both subtypes (left). Loci associated with a specific subtype (right) carry less extreme ORs and/or are insignificant in the opposing group. Loci marked with an asterisk are featured in Figure 2. ACP, any cleft palate; CSP, cleft soft palate; CHP, cleft hard palate.

Finally, because our cohort represents diverse populations, we evaluated population-specific signals. Although our test of homogeneity did not show any significant differences between each analysis (p = 0.137), differences in site heterozygosity between ancestries can mask signals. Using PC analysis, we subdivided groups into Asian, European, and African ancestry groups (Figure S1); however, only the Asian subgroup contained enough trios (N = 262) to perform TDT for 6,491,466 variants (Figure S1). The results from this analysis did not reveal any additional loci not already present in the full cohort, but did demonstrate that some signals in the full cohort, such as 5q14.1, are likely driven by this population. This is also supported by the population frequencies in Table 1.

Expression during mouse palatogenesis and limb development

We further investigated the 9q33.3 locus, the only genome-wide significant locus, to locate a candidate gene for CP. Based on available expression and chromatin segmentation data from human embryonic craniofacial tissue,29 ANGPTL2 is actively transcribed in neural crest cells and human craniofacial tissue during embryogenesis at Carnegie stages 13, 14, 15, and 17, whereas this pattern is not apparent for RALGPS1 (Figure 1B). We therefore hypothesized that ANGPTL2 was more likely than RALGPS1 to be involved in palatal development (Figure 1B). We performed in situ hybridization on mouse embryos at two key stages of palatogenesis: initial vertical outgrowth of the palatal shelves and subsequent horizontal outgrowth just prior to their fusion.18 Angptl2 staining was observed in the palatal shelves and the lateral aspects of the upper lip at both stages of development (Figures 4A and 4B). Tissue sectioning revealed Angptl2 staining in the mesenchymal compartment of the palatal shelves (Figures 4Aʹ and 4B′). Subsequent quantitative assessment demonstrated that mesenchymal Angptl2 expression increases during palatogenesis in a manner similar to that of Runx2, an established marker of osteogenic differentiation (Figure 4C); this is consistent with our observation that the association was driven by clefts of the hard palate.

Figure 4.

Figure 4

Angptl2 expression during mouse palate and limb development

(A–E) Mouse embryos at gestational day 13 (GD13) or GD14.5 were stained by in situ hybridization to visualize Angptl2 expression. Whole-mount tissues were imaged to view the developing palatal shelves and upper lip (A and B). Subsequent coronal sections illustrate prominent staining in mesenchymal tissue of the palatal shelves (A′ and B′). qPCR was conducted on mesenchyme isolated from maxillary process/palatal shelf tissue from mice at indicated time points (C). Each value represents the mean ± SEM of n = 3 samples isolated from individual embryos. Forelimbs from GD13 and 14.5 mouse embryos were also stained by in situ hybridization to visualize Angptl2 expression (D and E). A section through the first digit illustrates staining restricted to the mesenchyme adjacent to apical ectodermal ridge. Similar domains of expression were observed in hindlimbs. Scale bars, 0.50 mm.

We also observed strong Angptl2 staining during limb development in a domain restricted to the mesenchyme adjacent to the apical ectodermal ridge (Figures 4D and 4E). The apical ectodermal ridge secretes signals that maintain the adjacent mesenchyme (i.e., progress zone) in a highly proliferative state, driving proximodistal outgrowth of the limb and digits.30 We therefore investigated overlapping phenotypes using the DECIPHER database.22 We found 38 reported individuals with CNVs affecting ANGPTL2 and adjacent genes in this region. Of these, 13/38 (34.2%) had craniofacial phenotypes (e.g., CP, narrow palate, high palate) and 15/38 (39.5%) had limb abnormalities (e.g., camptodactyly of finger, arachnodactyly) with an overlap of 10/38 (26.3%) with clinical features of both craniofacial and limb abnormalities. Using prevalence data from EUROCAT, we compared the rate of OFCs (14.95 per 10,000 births) and limb anomalies (38.18 per 10,000 births) in the general population with that of patients with CNVs of ANGPTL2 and found a significant difference between these two groups (p < 2.2 × 10−16, Fisher’s exact two-tailed test) for both OFCs and limb anomalies.

Attempted replication of previous published SNPs

We investigated SNPs from previously published studies of OFCs for evidence of replication in our dataset. Using a list from the GWAS catalog,28 we tested 139 unique SNPs for association with ACP, CSP, or CHP. There were 22 SNPs within 18 loci achieving nominal significance (p < 0.05) in at least one group from our analyses (Tables 3 and S4). When we applied Bonferroni multiple-test correction (p < 3.59 × 10−4), a single SNP (rs1838105, p = 2.95 × 10−4) remained significant for replication in the ACP group.

Table 3.

Previous published OFC-associated SNPs with evidence of replication in the current study

Previous study
Current study
Variant Locus (hg38) Alleles Effect allelea OR (95% CI) Trait PubMed ID CP OR (95% CI) CP p value CSP OR (95% CI) CSP p value CHP OR (95% CI) CHP p value
rs12065278 1p36.11 A/G G 2.43 (1.66–3.56) CP 27018472 1.18 (0.92–1.52) 2.00E−01 1.22 (0.89–1.67) 2.25E−01 1.63 (1.04–2.57) 3.25E−02
rs481931 1p22.1 G/T G 1.25 CLP 28232668 1.25 (1.01–1.54) 3.96E−02 1.22 (0.93–1.61) 1.61E−01 1.08 (0.70–1.61) 7.51E−01
rs4839542 1p13.3 C/T C 0.83 CLP 28232668 1.23 (0.95–1.59) 1.14E−01 1.22 (0.88–1.72) 2.33E−01 1.96 (1.20–3.12) 5.27E−03
rs2235371 1q32.2 C/T T 1.49 (1.37–1.61) CL/P 25775280 N/A N/A 1.38 (1.0–1.89) 4.60E−02 1.53 (1.0–2.33) 4.64E−02
rs6540559 1q32.2 G/A A 1.67 (1.47–1.85) CL/P 28054174 1.24 (1.01–1.52) 3.57E−02 1.30 (1.0–1.70) 5.11E−02 1.48 (0.99–2.2) 5.62E−02
rs75477785 1q32.2 T/G G 1.75 (1.54–2.04) CL/P 28054174 N/A N/A 1.38 (1.0–1.9) 5.16E−02 1.61 (1.04–2.48) 3.10E−02
rs72741048 1q32.2 A/T T 1.31 CP 31609978 1.35 (1.09–1.66) 5.58E−03 1.28 (0.97–1.67) 7.55E−02 1.67 (1.15–2.44) 6.85E−03
rs11119394 1q32.2 A/G G 1.30 CP 32758111 N/A N/A 1.48 (1.09–2.01) 1.16E−02 1.46 (0.98–2.18) 5.87E−02
rs3815854 2q35 C/T C 0.80 (0.74–0.87) CL/P 25775280 1.25 (1.03–1.52) 2.45E−02 1.35 (1.06–1.72) 1.38E−02 1.19 (0.83–1.72) 3.53E−01
rs9347594 6q26 T/C T 2.22 (1.60–3.08) CP 27018472 N/A N/A 1.11 (0.83–1.47) 4.77E−01 1.49 (1.0–2.22) 4.77E−02
rs11774066 8q22.2 C/T C 0.86 CLP 28232668 1.28 (1.03–1.61) 2.44E−02 1.16 (0.87–1.54) 3.07E−01 1.09 (0.741.59) 6.92E−01
rs1487022 8q22.2 G/T G 1.17 CLP 28232668 1.45 (1.16–1.85) 1.36E−03 1.47 (1.08–2.0) 1.58E−02 1.14 (0.76–1.72) 5.27E−01
rs4246129 8q24.3 C/G G 2 (1.47–2.63) CP 28054174 1.28 (1.03–1.61) 2.75E−02 1.47 (1.1–1.97) 8.65E−03 1.16 (0.77–1.74) 4.73E−01
rs7928246 11q22.3 A/G A 0.41 CP 32758111 1.77 (1.05–2.99) 2.95E−02 2.18 (1.07–4.45) 2.80E−02 N/A N/A
rs730643 14q13.3 G/A A 1.35 CP 31609978 1.34 (1.09–1.66) 5.85E−03 1.38 (1.05–1.82) 2.18E−02 1.60 (1.05–2.44) 2.77E−02
rs4901118 14q22.1 G/A Gb N/R CL/P 28087736 1.41 (1.16–1.72) 4.57E−04 1.45 (1.12–1.89) 4.04E−03 1.19 (0.84–1.69) 3.29E−01
rs152745 16p12.2 G/A G 0.46 (0.33–0.64) CP 27018472 N/A N/A 1.19 (0.88–1.59) 2.57E−01 1.61 (1.04–2.50) 2.91E−02
rs57933945 16p12.1 C/T C 2.86 (1.85–4.35) CP 28054174 1.92 (1.32–2.86) 6.25E−04 2.5 (1.45–4.35) 6.70E−04 2.22 (1.04–4.55) 3.39E−02
rs9911652 17p13.1 C/T C 0.64 (0.55–0.74) CL/P 28054174 1.33 (1.01–1.75) 3.94E−02 1.3 (0.92–1.85) 1.35E−01 1.37 (0.85–2.17) 1.92E−01
rs3785888 17q21.32 C/T T 0.88 (0.58–0.97) CL/P 28054174 1.28 (1.06–1.55) 9.13E−03 1.19 (0.93–1.51) 1.59E−01 1.08 (0.76–1.53) 6.60E−01
rs1838105 17q21.32 A/G G 0.82 CLP 28232668 1.44 (1.18–1.75) 2.40E−04 1.34 (1.05–1.72) 2.00E−02 1.32 (0.9–1.93) 1.51E−01
rs227731 17q22 T/G G 0.81 (0.72–0.89) CL/P 28054174 1.30 (1.06–1.59) 1.07E−02 1.27 (0.97–1.64) 8.24E−02 1.59 (1.08–2.33) 1.77E−02

Bold indicates p < 0.05.

N/A indicates absence of data at this SNP location in the CPSeq data.

N/R indicates absence of data from original publication.

a

Effect allele is reported for CPSeq data: underlines indicate opposite effect reported in original publication.

b

Effect allele for original publication is not reported.

Discussion

CP has historically been evaluated as a single phenotype, but here we have identified CP subtype-specific risk factors, including one genome-wide significant locus and 19 regions of suggestive significance. Our genome-wide significant locus—found in the CHP group—spans both RALGPS1 and ANGPTL2 genes on 9q33.3. Although fine-mapping analysis did not implicate any single gene as all three of the SNPs in the credible set fall within intronic regions of RALGPS1; two SNPs (rs2417050 and rs12350252), as well as the lead SNP at this locus, also fall within a region considered a craniofacial super enhancer upstream of ANGPTL2. We then showed using in situ hybridization that Angptl2 was expressed in the developing mouse palate.

Specifically, we found that Angptl2 is expressed during initial vertical and subsequent horizontal outgrowth of the palatal shelves (Figures 4A and 4B) prior to the approximation and fusion at the midline that forms the secondary palate. Expression appeared restricted to the mesenchymal compartment, which is primarily comprised of cranial neural crest cells that rapidly proliferate to drive palatal shelf outgrowth and differentiate into osteoblasts that form the bones of the hard palate. As palatogenesis proceeded, mesenchymal Angptl2 expression increased along with osteoblast marker Runx2 (Figure 4C). This is consistent with previous evidence suggesting that ANGPTL2 positively regulates osteoblast differentiation via positive activation of Osterix,31 and its dysfunction may result in reduced proliferation and/or osteoblast differentiation. These data are also consistent with bulk RNA sequencing from human craniofacial tissue at CS 13, 14, 15, and 17 showing a similar increase with time, and single-cell RNA sequencing of the mouse palate showing that Angptl2 is expressed in cells consistent with osteoprogenitors at GD15.5.32 These findings, in combination with significantly higher rates of OFCs with CNVs of this locus, support a role for ANGPTL2 in palatal development, particularly as related to the hard palate.

The most strongly associated locus in ACP was located in an intergenic region at 12q21.1, closest to the gene TRHDE (∼330 kb upstream). The second most significant locus was on chromosome 4p14. Although also in an intergenic region, the nearest genes are UBEK2 and PDS5A (∼20 kb upstream and downstream, respectively), both of which are strongly expressed in craniofacial tissue during embryogenesis.29 However, PDS5A, which plays a role in sister chromatid cohesion during mitosis, is of particular interest: both null and heterozygous loss in mice leads to a CP phenotype with variable expressivity and penetrance.33

In the CSP group, the top non-overlapping locus was at 9q22.31 spanning FAM120A, and RNA-binding protein.34 Although its function has been primarily studied in the context of gastric carcinoma, FAM120A plays a role in protecting against oxidative stress-induced apoptosis. In addition, it has been shown to directly bind insulin-like growth factor II (IGF-II) mRNA,35 which is spatially and temporally expressed in developing murine secondary palate,36 and can result in CP—among other clinical features—when dysregulated.37 FAM120A is expressed in both mouse soft palate tissue32 and in human craniofacial tissue during embryological development.29

Another locus of interest from this analysis is at 12q13.11, in which the lead SNP is approximately 650 kb away from COL2A1. Variants within this gene are well established as causal for Stickler syndrome (OMIM: 108300), in which CP is a common feature38; however, our associated signal does not appear to be within the same topological-associated domain as COL2A1, at least in embryonic stem cells, so additional evaluation of this finding is needed.

Because differences in sample sizes (and statistical power) prevent direct comparisons of p values, we compared ORs between analysis groups to better understand subtype-specific risks. We found that, for all loci, the risks between the ACP, CHP, and CSP groups were either in the same direction (14 loci) or included the null for at least on group (6 loci) (i.e., no loci conveyed opposite effects for different groups). For loci belonging to the ACP group, the ORs and confidence intervals were similar for CHP and CSP (Figure 3, left); however, for loci identified in the CSP or CHP analyses, differences in ORs were more pronounced (Figure 3, right). For example, the 5q14.1 region was associated solely with CHP with no overlap in ACP or CSP. Differences between estimated ORs of the CSP loci were less apparent, since none were entirely distinct from ACP or CHP. Because so many individuals in the dataset had a cleft hard and soft palate and were included in both the CHP and CSP analyses, the overlapping confidence intervals were expected. However, of the eight SNPs of suggestive significance for CSP, four of them contained the null for CHP, and of those three failed to reach even nominal significance (p < 0.05). Thus, while they are unlikely to be associated with CHP, we cannot fully rule out influence of CSP SNPs on CHP as much larger sample sizes would be required to contrast cleft hard palate only and cleft soft palate only due to the infrequency of cleft hard palate only. Overall, our results suggest that there are variants contributing more risk to a specific type of CP versus the other.

Although the allelic TDT is not confounded by population stratification, examination of allele frequencies across populations suggests that some of our findings may be, in part, driven by certain populations. The most pronounced of these occurs for rs1468036 (effect allele G) at 17q25.3, which is found at a frequency of 36.6% in African populations, and less than 6% in other populations studied here. Another example is the 5q14.1 locus near HOMER1, which reached suggestive significance in the ACP group (p = 2.32 × 10−6) but is more significant in the Asian ancestry-stratified study (p = 8.35 × 10−7) and occurred with a minor allele frequency of 12.2% in East Asian populations compared with 4% in South Asians and <0.5% in the remaining study populations. Presently, ancestry-specific analysis was only possible for the Asian population, but future studies on both additional ancestral risks as well as combined stratification by ancestry and subtype would be of interest; however, limited sample sizes in this study preclude these evaluations here.

All of the risk loci identified in this study were of novel association with CP, although three of these of these loci have been reported previously in studies associated with CL/P. Both 9q22.31 and 5p14.1 have been reported as suggestive in a consanguineous GWAS of 40 families,39 and the 12q21.1 region was reported in a Chinese Han population12; however, the lead SNPs for each of these loci are approximately 1 Mb away from each of our lead SNPs. In addition, none of the identified SNPs within the same region are in LD, and therefore may or may not be tagging the same causal variant(s).

We were able to show nominal evidence of replication for 22 previously published SNPs associated with OFCs. Interestingly, there were only two SNPs from previous studies that replicated in all three of our studies (i.e., p < 0.05 in ACP, CSP, and CHP), both of which were originally published as associated with CP. There were nine previously published SNPs replicating at nominal significance in any two of our groups, only one of which was shared between only CSP and CHP. Unfortunately, deeper phenotype data are not available to classify subtypes from previous studies for more detailed comparisons, but this general lack of overlap may further support subtype-specific differences. An additional striking finding was that, for all seven previously published SNPs with reported ORs associated with CL/P, the risk allele conveyed opposite effects for CP in our dataset, whereas this was only true for two of nine in SNPs previously associated with CP. Although some of these findings could be a result of unclear effect allele reporting, there is evidence of opposite effects for the same allele in CL versus CP, as previously reported near IRF6116 (rs72741048, in Table 3), indicating additional investigation of these SNPs in the context of CL/P versus CP is warranted.

Our results support the hypothesis there are subtype-specific risks for CP, although this study has limitations. First, due to sample size we chose to evaluate clefts affecting both the hard and soft palate in both subtype groups, rather than as three separate groups. Given that both structures are affected in these cases, it is likely that they share risks for hard or soft palate clefts as suggested by our distinct findings between analyses. A lack of genome-wide significant signal in the ACP group could be due to our sample being underpowered to identify common variants of modest effect, or may result from SNPs of opposite effects negating signal when all phenotypes are combined. Alternatively, this may support a more prominent role for rare as opposed to common variants in the pathogenesis of CP in general, or there may be environmental effects not captured by our study. We also failed to replicate some well-established risk loci, such as GRHL3 or CTNNA2, although this is likely explained by our study population. These risk variants occur in ∼3% of Europeans and ∼2% of Africans, respectively. In individuals of Asian ancestry, the MAF for rs41268753 in GRHL3 is <1% (SAS) and <0.02% (EAS), and the MAF for both reported variants near CTNNA2 (rs113691307 and rs80004662) is ∼4% (SAS) and <0.04% (EAS); therefore, our cohort is unlikely to harbor these variants at a rate detectable above our filter for common variants at MAF >3%. Despite these limitations, this study evaluates CP risks in the context of its subtypes and our findings show there are broad factors affecting the risk for CP in general, as well as variants influencing the risk of specific CP subtypes.

Data and code availability

The sequence and phenotype data generated for this study are available from the Database of Genotypes and Phenotypes, dbGaP: phs002220.v1.p1.

Acknowledgments

We are incredibly grateful to participating families and colleagues who have made this research possible. Sequencing services were provided by the Center for Inherited Disease Research (CIDR). CIDR is fully funded through a federal contract from the National Institutes of Health to The Johns Hopkins University, contract no. HHSN268201700006I. Patient recruitment, assembly of phenotypic information, sequencing services, and data analysis were supported by National Institutes of Health (NIH) grants: X01-HG010835 (to E.J.L.), R01-DE016148 (to M.L.M. and S.M.W.), R01-DE030342 (to E.J.L.), R01-DE011931 (to J.T.H.), R01-DE028300 (to A.B.), R01-DE014581 (to T.H.B.), R37-DE008559 (to T.J.M.), R00-DE024571 (to C.J.B.), S21-MD001830 (to C.J.B.) U54-GM133807 (to C.J.B.), T32-GM008490 (to K.R.), F31-DE032588 (to K.R.), R56-DE030917 (to R.J.L.), and F32-DE032260 (to S.W.C.). Some of this work was supported through cooperative agreements under PA 96043 from the Centers for Disease Control and Prevention to the Centers for Birth Defects Research and Prevention participating in the NBDPS. The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention or the California Department of Public Health. This article was prepared while T.J.M. was employed at Emory University. The opinions expressed in this article are the author’s own and do not reflect the view of the National Institutes of Health, the Department of Health and Human Services, or the United States government. This study makes use of data generated by the DECIPHER community. A full list of centers who contributed to the generation of the data is available from https://deciphergenomics.org/about/stats and via email from contact@deciphergenomics.org. Funding for the DECIPHER project was provided by Wellcome (grant no. WT223718/Z/21/Z).

Declaration of interests

M.P.E. is a member of the Human Genetics and Genomics Advances Editorial Board.

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.xhgg.2023.100234.

Web resources

Hg38 human genome reference, https://www.ncbi.nlm.nih.gov/assembly/GCF_000001405.39

Plink 2.0, https://www.cog-genomics.org/plink/2.0/

Plink 1.9, https://www.cog-genomics.org/plink/1.9/

FINEMAP, http://www.christianbenner.com

DECIPHER database, https://www.deciphergenomics.org

EUROCAT database, https://eu-rd-platform.jrc.ec.europa.eu/eurocat/eurocat-data/prevalence_en

NHGRI-EBI GWAS catalog, https://www.ebi.ac.uk/gwas/

UCSC genome browser, https://genome.ucsc.edu/cgi-bin/hgGateway

OMIM database, https://www.omim.org

Supplemental information

Document S1. Figures S1–S5 and Tables S1–S3
mmc1.pdf (561.5KB, pdf)
Table S4. All replicated SNPs
mmc2.xlsx (37.6KB, xlsx)
Document S2. Article plus supplemental information
mmc3.pdf (3.6MB, pdf)

References

  • 1.Mossey P.A., Modell B. Epidemiology of oral clefts 2012: an international perspective. Front. Oral Biol. 2012;16:1–18. doi: 10.1159/000337464. [DOI] [PubMed] [Google Scholar]
  • 2.Mai C.T., Isenburg J.L., Canfield M.A., Meyer R.E., Correa A., Alverson C.J., Lupo P.J., Riehle-Colarusso T., Cho S.J., Aggarwal D., et al. National population-based estimates for major birth defects, 2010-2014. Birth Defects Res. 2019;111:1420–1435. doi: 10.1002/bdr2.1589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Burg M.L., Chai Y., Yao C.A., Magee W., 3rd, Figueiredo J.C. Epidemiology, Etiology, and Treatment of Isolated Cleft Palate. Front. Physiol. 2016;7:67. doi: 10.3389/fphys.2016.00067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Berk N.W., Marazita M.L. Oxford University Press; 2002. Cleft Lip and Palate: From Origin to Treatment. [Google Scholar]
  • 5.Sivertsen A., Wilcox A.J., Skjaerven R., Vindenes H.A., Abyholm F., Harville E., Lie R.T. Familial risk of oral clefts by morphological type and severity: population based cohort study of first degree relatives. BMJ (Clinical research ed.) 2008;336:432–434. doi: 10.1136/bmj.39458.563611.AE. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Grosen D., Chevrier C., Skytthe A., Bille C., Mølsted K., Sivertsen A., Murray J.C., Christensen K. A cohort study of recurrence patterns among more than 54,000 relatives of oral cleft cases in Denmark: support for the multifactorial threshold model of inheritance. J. Med. Genet. 2010;47:162–168. doi: 10.1136/jmg.2009.069385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Grosen D., Bille C., Petersen I., Skytthe A., Hjelmborg J.v.B., Pedersen J.K., Murray J.C., Christensen K. Risk of oral clefts in twins. Epidemiology. 2011;22:313–319. doi: 10.1097/EDE.0b013e3182125f9c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Leslie E.J., Marazita M.L. Genetics of cleft lip and cleft palate. Am. J. Med. Genet. C Semin. Med. Genet. 2013;163C:246–258. doi: 10.1002/ajmg.c.31381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dixon M.J., Marazita M.L., Beaty T.H., Murray J.C. Cleft lip and palate: understanding genetic and environmental influences. Nat. Rev. Genet. 2011;12:167–178. doi: 10.1038/nrg2933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Leslie E.J., Liu H., Carlson J.C., Shaffer J.R., Feingold E., Wehby G., Laurie C.A., Jain D., Laurie C.C., Doheny K.F., et al. A Genome-wide Association Study of Nonsyndromic Cleft Palate Identifies an Etiologic Missense Variant in GRHL3. Am. J. Hum. Genet. 2016;98:744–754. doi: 10.1016/j.ajhg.2016.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Leslie E.J., Carlson J.C., Shaffer J.R., Butali A., Buxó C.J., Castilla E.E., Christensen K., Deleyiannis F.W.B., Leigh Field L., Hecht J.T., et al. Genome-wide meta-analyses of nonsyndromic orofacial clefts identify novel associations between FOXE1 and all orofacial clefts, and TP63 and cleft lip with or without cleft palate. Hum. Genet. 2017;136:275–286. doi: 10.1007/s00439-016-1754-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Yu Y., Zuo X., He M., Gao J., Fu Y., Qin C., Meng L., Wang W., Song Y., Cheng Y., et al. Genome-wide analyses of non-syndromic cleft lip with palate identify 14 novel loci and genetic heterogeneity. Nat. Commun. 2017;8 doi: 10.1038/ncomms14364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Huang L., Jia Z., Shi Y., Du Q., Shi J., Wang Z., Mou Y., Wang Q., Zhang B., Wang Q., et al. Genetic factors define CPO and CLO subtypes of nonsyndromicorofacial cleft. PLoS Genet. 2019;15 doi: 10.1371/journal.pgen.1008357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Butali A., Mossey P.A., Adeyemo W.L., Eshete M.A., Gowans L.J.J., Busch T.D., Jain D., Yu W., Huan L., Laurie C.A., et al. Genomic analyses in African populations identify novel risk loci for cleft palate. Hum. Mol. Genet. 2019;28:1038–1051. doi: 10.1093/hmg/ddy402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.He M., Zuo X., Liu H., Wang W., Zhang Y., Fu Y., Zhen Q., Yu Y., Pan Y., Qin C., et al. Genome-wide Analyses Identify a Novel Risk Locus for Nonsyndromic Cleft Palate. J. Dent. Res. 2020;99:1461–1468. doi: 10.1177/0022034520943867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rahimov F., Nieminen P., Kumari P., Juuri E., Nikopensius T., Karjalainen J., Kurki M., Palotie A., FinnGen N., Heliövaara A. 2021. High Incidence and Regional Distribution of Cleft Palate in Finns Are Associated with a Functional Variant in an IRF6 Enhancer. in press. [Google Scholar]
  • 17.Smith T.M., Lozanoff S., Iyyanar P.P., Nazarali A.J. Molecular signaling along the anterior-posterior axis of early palate development. Front. Physiol. 2012;3:488. doi: 10.3389/fphys.2012.00488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bush J.O., Jiang R. Palatogenesis: morphogenetic and molecular mechanisms of secondary palate development. Development. 2012;139:231–243. doi: 10.1242/dev.067082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Yu L., Gu S., Alappat S., Song Y., Yan M., Zhang X., Zhang G., Jiang Y., Zhang Z., Zhang Y., Chen Y. Shox2-deficient mice exhibit a rare type of incomplete clefting of the secondary palate. Development. 2005;132:4397–4406. doi: 10.1242/dev.02013. [DOI] [PubMed] [Google Scholar]
  • 20.Spielman R.S., McGinnis R.E., Ewens W.J. Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM) Am. J. Hum. Genet. 1993;52:506–516. [PMC free article] [PubMed] [Google Scholar]
  • 21.Benner C., Spencer C.C.A., Havulinna A.S., Salomaa V., Ripatti S., Pirinen M. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics. 2016;32:1493–1501. doi: 10.1093/bioinformatics/btw018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Firth H.V., Richards S.M., Bevan A.P., Clayton S., Corpas M., Rajan D., Van Vooren S., Moreau Y., Pettett R.M., Carter N.P. DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am. J. Hum. Genet. 2009;84:524–533. doi: 10.1016/j.ajhg.2009.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Heyne G.W., Plisch E.H., Melberg C.G., Sandgren E.P., Peter J.A., Lipinski R.J. A Simple and Reliable Method for Early Pregnancy Detection in Inbred Mice. J. Am. Assoc. Lab. Anim. Sci. 2015;54:368–371. [PMC free article] [PubMed] [Google Scholar]
  • 24.Abler L.L., Mehta V., Keil K.P., Joshi P.S., Flucus C.L., Hardin H.A., Schmitz C.T., Vezina C.M. A high throughput in situ hybridization method to characterize mRNA expression patterns in the fetal mouse lower urogenital tract. J. Vis. Exp. 2011:2912. doi: 10.3791/2912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Everson J.L., Sun M.R., Fink D.M., Heyne G.W., Melberg C.G., Nelson K.F., Doroodchi P., Colopy L.J., Ulschmid C.M., Martin A.A., et al. Developmental Toxicity Assessment of Piperonyl Butoxide Exposure Targeting Sonic Hedgehog Signaling and Forebrain and Face Morphogenesis in the Mouse: An in Vitro and in Vivo Study. Environ. Health Perspect. 2019;127 doi: 10.1289/EHP5260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Everson J.L., Fink D.M., Yoon J.W., Leslie E.J., Kietzman H.W., Ansen-Wilson L.J., Chung H.M., Walterhouse D.O., Marazita M.L., Lipinski R.J. Sonic hedgehog regulation of Foxf2 promotes cranial neural crest mesenchyme proliferation and is disrupted in cleft lip morphogenesis. Development. 2017;144:2082–2091. doi: 10.1242/dev.149930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Li H., Williams T. Separation of mouse embryonic facial ectoderm and mesenchyme. J. Vis. Exp. 2013 doi: 10.3791/50248. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Buniello A., MacArthur J.A.L., Cerezo M., Harris L.W., Hayhurst J., Malangone C., McMahon A., Morales J., Mountjoy E., Sollis E., et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47:D1005–D1012. doi: 10.1093/nar/gky1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wilderman A., VanOudenhove J., Kron J., Noonan J.P., Cotney J. High-Resolution Epigenomic Atlas of Human Embryonic Craniofacial Development. Cell Rep. 2018;23:1581–1597. doi: 10.1016/j.celrep.2018.03.129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Rubin L., Saunders J.W., Jr. Ectodermal-mesodermal interactions in the growth of limb buds in the chick embryo: constancy and temporal limits of the ectodermal induction. Dev. Biol. 1972;28:94–112. doi: 10.1016/0012-1606(72)90129-7. [DOI] [PubMed] [Google Scholar]
  • 31.Takano A., Fukuda T., Shinjo T., Iwashita M., Matsuzaki E., Yamamichi K., Takeshita M., Sanui T., Nishimura F. Angiopoietin-like protein 2 is a positive regulator of osteoblast differentiation. Metabolism. 2017;69:157–170. doi: 10.1016/j.metabol.2017.01.006. [DOI] [PubMed] [Google Scholar]
  • 32.Han X., Feng J., Guo T., Loh Y.H.E., Yuan Y., Ho T.V., Cho C.K., Li J., Jing J., Janeckova E., et al. Runx2-Twist1 interaction coordinates cranial neural crest guidance of soft palate myogenesis. Elife. 2021;10 doi: 10.7554/eLife.62387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zhang B., Chang J., Fu M., Huang J., Kashyap R., Salavaggione E., Jain S., Kulkarni S., Deardorff M.A., Uzielli M.L.G., et al. Dosage effects of cohesin regulatory factor PDS5 on mammalian development: implications for cohesinopathies. PLoS One. 2009;4 doi: 10.1371/journal.pone.0005232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kelly T.J., Suzuki H.I., Zamudio J.R., Suzuki M., Sharp P.A. Sequestration of microRNA-mediated target repression by the Ago2-associated RNA-binding protein FAM120A. RNA. 2019;25:1291–1297. doi: 10.1261/rna.071621.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Tanaka M., Sasaki K., Kamata R., Hoshino Y., Yanagihara K., Sakai R. A novel RNA-binding protein, Ossa/C9orf10, regulates activity of Src kinases to protect cells from oxidative stress-induced apoptosis. Mol. Cell Biol. 2009;29:402–413. doi: 10.1128/MCB.01035-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ferguson M.W., Sharpe P.M., Thomas B.L., Beck F. Differential expression of insulin-like growth factors I and II (IGF I and II), mRNA, peptide and binding protein 1 during mouse palate development: comparison with TGF beta peptide distribution. J. Anat. 1992;181:219–238. [PMC free article] [PubMed] [Google Scholar]
  • 37.Eggenschwiler J., Ludwig T., Fisher P., Leighton P.A., Tilghman S.M., Efstratiadis A. Mouse mutant embryos overexpressing IGF-II exhibit phenotypic features of the Beckwith-Wiedemann and Simpson-Golabi-Behmel syndromes. Genes Dev. 1997;11:3128–3142. doi: 10.1101/gad.11.23.3128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hoornaert K.P., Vereecke I., Dewinter C., Rosenberg T., Beemer F.A., Leroy J.G., Bendix L., Björck E., Bonduelle M., Boute O., et al. Stickler syndrome caused by COL2A1 mutations: genotype-phenotype correlation in a series of 100 patients. Eur. J. Hum. Genet. 2010;18:872–880. doi: 10.1038/ejhg.2010.23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Camargo M., Rivera D., Moreno L., Lidral A.C., Harper U., Jones M., Solomon B.D., Roessler E., Vélez J.I., Martinez A.F., et al. GWAS reveals new recessive loci associated with non-syndromic facial clefting. Eur. J. Med. Genet. 2012;55:510–514. doi: 10.1016/j.ejmg.2012.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S5 and Tables S1–S3
mmc1.pdf (561.5KB, pdf)
Table S4. All replicated SNPs
mmc2.xlsx (37.6KB, xlsx)
Document S2. Article plus supplemental information
mmc3.pdf (3.6MB, pdf)

Data Availability Statement

The sequence and phenotype data generated for this study are available from the Database of Genotypes and Phenotypes, dbGaP: phs002220.v1.p1.


Articles from Human Genetics and Genomics Advances are provided here courtesy of Elsevier

RESOURCES