Skip to main content
BMC Genomics logoLink to BMC Genomics
. 2013 Feb 20;14:113. doi: 10.1186/1471-2164-14-113

Systematic analysis of palatal transcriptome to identify cleft palate genes within TGFβ3-knockout mice alleles: RNA-Seq analysis of TGFβ3 Mice

Ferhat Ozturk 1,4, You Li 2, Xiujuan Zhu 1, Chittibabu Guda 2,3, Ali Nawshad 1,
PMCID: PMC3618314  PMID: 23421592

Abstract

Background

In humans, cleft palate (CP) accounts for one of the largest number of birth defects with a complex genetic and environmental etiology. TGFβ3 has been established as an important regulator of palatal fusion in mice and it has been shown that TGFβ3-null mice exhibit CP without any other major deformities. However, the genes that regulate cellular decisions and molecular mechanisms maintained by the TGFβ3 pathway throughout palatogenesis are predominantly unexplored. Our objective in this study was to analyze global transcriptome changes within the palate during different gestational ages within TGFβ3 knockout mice to identify TGFβ3-associated genes previously unknown to be associated with the development of cleft palate. We used deep sequencing technology, RNA-Seq, to analyze the transcriptome of TGFβ3 knockout mice at crucial stages of palatogenesis, including palatal growth (E14.5), adhesion (E15.5), and fusion (E16.5).

Results

The overall transcriptome analysis of TGFβ3 wildtype mice (C57BL/6) reveals that almost 6000 genes were upregulated during the transition from E14.5 to E15.5 and more than 2000 were downregulated from E15.5 to E16.5. Using bioinformatics tools and databases, we identified the most comprehensive list of CP genes (n = 322) in which mutations cause CP either in humans or mice, and analyzed their expression patterns. The expression motifs of CP genes between TGFβ3+/− and TGFβ3−/− were not significantly different from each other, and the expression of the majority of CP genes remained unchanged from E14.5 to E16.5. Using these patterns, we identified 8 unique genes within TGFβ3−/− mice (Chrng, Foxc2, H19, Kcnj13, Lhx8, Meox2, Shh, and Six3), which may function as the primary contributors to the development of cleft palate in TGFβ3−/− mice. When the significantly altered CP genes were overlaid with TGFβ signaling, all of these genes followed the Smad-dependent pathway.

Conclusions

Our study represents the first analysis of the palatal transcriptome of the mouse, as well as TGFβ3 knockout mice, using deep sequencing methods. In this study, we characterized the critical regulation of palatal transcripts that may play key regulatory roles through crucial stages of palatal development. We identified potential causative CP genes in a TGFβ3 knockout model, which may lead to a better understanding of the genetic mechanisms of palatogenesis and provide novel potential targets for gene therapy approaches to treat cleft palate.

Keywords: RNA-Seq, Next-generation sequencing, TGFβ3, Knockout, Transcriptome, Cleft palate, Palatogenesis, Palate, Craniofacial

Background

Cleft palate (CP) is the second most common birth defect (1/800 live births) in humans, and is caused by the lack of fusion of the embryonic palatal shelves early in gestation (6–10 weeks) [1]. The formation of a continuous palate is a complex process involving multiple steps, including: palatal shelf growth, elevation, attachment, and fusion. The stages of palatogenesis are regulated by numerous genes that are essential for normal palate development. Our laboratory has a long-standing interest in identifying the genetic and molecular mechanisms that regulate palatogenesis in order to understand the factors involved in the development of orofacial clefts [2-11]. It has been shown that both genetic and environmental elements contribute to the development of cleft palate [12,13]. We previously presented that Transforming Growth Factor (TGF)-β isoforms play essential roles in the regulation of palatal morphogenesis, including the finding that TGFβ3 mediates palatal fusion both in primary cells and organ culture [5,14,15]. According to several population-based mutation screening studies, TGFβ3 is considered a candidate gene for non-syndromic CP in humans [16-22]. Furthermore, TGFβ3 knockout mice are born with CP but lack other major defects [23-25].

In our earlier studies [26], we examined gene expression during palatal fusion in normal mice using microarray analysis and detected several genes essential for completion of palatal development. As genome sequencing technologies advanced, it has become feasible to systematically analyze global transcriptomal changes and identify the key molecular components in the developing palate during crucial stages of palatogenesis. Failure to regulate functional or structural genes during these stages may result in cleft palate. Next-generation sequencing (NGS) technologies, or RNA-Seq, have recently emerged as a revolutionary tool of transcriptomics [27] by revealing the complex landscape of the transcriptome with high-throughput at an incomparable level of sensitivity and accuracy [27,28]. The results of RNA-Seq demonstrate high levels of reproducibility for both technical and biological replicates [29,30]. The analysis of TGFβ3 knockout mice by RNA-Seq provides a valuable resource to facilitate our understanding of the complex genetic and molecular mechanisms of palatogenesis. This technology also allows us to detect allelic and splice variants of some of the genes involved in palate development in TGFβ3 knockout mice, which is currently beyond the scope of this article.

Evidence suggests that TGFβ3 is essential to palato-genesis, especially during embryonic days (E) E14.5 to E16.5. However, the cellular mechanisms maintained and regulated by TGFβ3 signaling have not been extensively explored to enable an understanding of those genes functionally regulated by TGFβ3 during normal palatogenesis or those genes deregulated in TGFβ3-null mice. Our objective in this study was to analyze transcriptome changes and their contribution to the development of CP among different gestational ages and TGFβ3 knockout alleles. In this study, we analyzed the transcriptome of TGFβ3 mice by RNA-Seq at crucial stages of palatogenesis, including: palatal convergence, adhesion, and fusion; and we characterized some crucial transcripts that may play key regulatory roles throughout palatal development. We used palatal shelves extracted from E14.5, E15.5 and E16.5 allelic mice to analyze differential expression patterns of their transcriptome throughout the palatal developmental process. The overall transcriptome analysis of TGFβ3 WT mice revealed that almost 6000 genes were upregulated during the transition from E14.5 to E15.5 and more than 2000 were downregulated from E15.5 to E16.5, which suggests that regulation of these transcripts is essential during the stages of palate development. In order to streamline the analysis of the high number of transcripts, we downsized the list of genes to CP genes only. Using biological databases, we categorized a comprehensive inventory of CP genes both in human and mouse genomes and focused on their differential expressions and biological interpretations. Unexpectedly, the expression patterns of CP genes between TGFβ3+/− and TGFβ3−/− were not significantly different from each other, and the expression of the majority of CP genes remained unchanged from E14.5 to E16.5. Using this data, we identified 8 unique genes (Chrng, Foxc2, H19, Kcnj13, Lhx8, Meox2, Shh, and Six3), which may regulate cleft palate formation in TGFβ3−/− mice. We used Ingenuity Pathway Analysis (IPA) software to analyze downstream effects of these genes on biological functions, molecular networks, and regulatory pathways- particularly TGFβ signaling. This is the first genome scale analysis of the palatal transcriptome of mice generated by RNA-Seq technology during critical stages of palatogenesis. This data will allow the comprehensive analysis of TGFβ signaling during palatogenesis and provide information on the chronological regulation of downstream TGFβ-activated transcription factors that function to induce cellular differentiation, transformation, and apoptosis.

Results

Analysis of RNA-seq data

As outlined in the experimental workflow of our study (Figure  1), RNA samples were extracted from mouse palatal tissues of TGFβ3 knockout mice (homozygous (−/−), heterozygous (+/−), and wildtype (+/+)) at three developmental stages (E14.5, E15.5, and E16.5); libraries were prepared, converted to cDNA, and sequenced by using the Illumina HiSeq2000 next generation sequencer. The bioinformatics analysis of RNA-Seq data was processed using the Tuxedo protocol [31], which includes TopHat, Cufflinks and CummeRbund packages from R. The CP genes were identified from OMIM and MGI databases and downstream effects were analyzed using IPA.

Figure 1.

Figure 1

Experimental workflow of the study. RNA samples were extracted from mouse palatal tissues of TGFβ3 knockout mice (homozygous, heterozygous, and wildtype) at three developmental stages (E14.5, E15.5, and E16.5); libraries were prepared, converted to cDNA, and sequenced by using the Illumina HiSeq2000 next generation sequencer. Fastq files were uploaded to the server for quality control analysis of sequence reads using FastQC. There were no sequence manipulation processes performed for any fastq file, given that the quality score was high at both the 3 and 5 ends. All of the reads were mapped to the reference genome (mm9, built name NCBIM37) by using TopHat. Quantification of transcripts, statistics tests for differential expression, and detection of splice variants were performed using Cufflinks; quality assessments of biological and technical replicates were performed using CummeRbund; and pairwise comparisons of samples and differential expression of transcripts were analyzed using CuffDiff. Venn diagrams of all significantly altered (FC ≥ 2.0) transcripts were drawn using the R VennDiagram package [75]. CP related genes (n = 322) were extracted from human (OMIM) and mouse (MGI) genome databases. CP genes were classified based on their patterns of expression (n = 9) for each genotype and their heatmaps were generated using the R pheatmap package. Expression pattern groups of CP genes were uploaded to IPA software as datasets and core analyses were run to detect downstream effects: canonical pathways and biological functions relevant to sample datasets.

Our data clearly demonstrate that TGFβ3 wildtype (+/+) palates from E15.5 expressed increased TGFβ3 protein, double the expression of heterozygous (+/−) and as expected the homozygous (−/−) palates were completely devoid of any TGFβ3 expression (Additional file 1). Extracted RNA samples were evaluated by RNA integrity numbers (RIN) to monitor quality and degradation with BioAnalyzer (Agilent Technologies, Palo Alto, CA). We confirmed that all of the palatal RNA samples had an RIN ≥ 8.4, indicating the high quality and low degradation of our experimental samples (Additional file 2). The RNA libraries were prepared for cDNA conversion and the samples were sequenced by Illumina techniques using HiSeq 2000. We obtained total reads from the RNA-Seq samples ranging from 14 million to 96 million (Additional file 3). Among all samples, only 4 samples have less than 40 million reads. We did not apply sequence manipulations on either 3 or 5 ends because the RNA-Seq sequencing quality was high across all samples based on our FastQC data quality control analysis. The relative abundance of transcripts was normalized to the total read number and shown as fragments per kilobase per million (FPKM) and mapped via Cufflinks.

In order to evaluate the reproducibility of our experimental samples and procedures, two independent biological and technical replicates for each gestational age and each allele resulting in 36 (n = 9 × 2 × 2) paired-end samples were assayed [32]. Using the scatter plots generated by CummeRbund, biological and technical replicates of our samples were validated based on the comparison of their read numbers per transcript. We detected a high correlation among both the biological and technical replicates of each gestational age and allele (Figure  2) (r = 0.98, Pearson’s correlation; p < 0.001). The replicates of E14.5 of TGFβ3−/− mice are presented in Figure  2. Additional scatter plots for all of the other samples with their replicates are shown in Additional file 4. Because the expression values (FPKM) of certain CP related genes differed significantly among the gestational ages, we transformed the FPKM value of each gene into log10 (FPKM + 1), rounded up to 2 decimal places, and used transformed values for further analysis.

Figure 2.

Figure 2

Validation of biological and technical replicates using CummeRbund. The read numbers per transcript between the (A) biological and (B) technical replicates were compared. Since the read number per transcript (FPKM value) ranged from 0 to >10,000, the read number + 1 was transformed by log10. Each dot represents data from one transcript. There was a high correlation between the two sequencing results (r = 0.98, Pearson’s correlation; p < 0.001 two tail). Additional scatter plots for the other samples are shown in the Additional file 4.

Overall transcriptome expression profile

Using the R VennDiagram package, we created Venn diagrams to illustrate the number of transcripts in each experimental group that passed the significance and fold change (FC ≥ 2.0) thresholds designated by Cufflinks (Additional file 5). From our overall transcriptome analyses of TGFβ3 knockout mice alleles (Table  1), in all genotypic groups, the majority of the differentially expressed transcripts between E14.5 and E15.5 were upregulated: 62% for TGFΒ3−/− (n = 1721), 53% for TGFΒ3+/− (n = 1063), and 80% for WT (n = 5946). Out of 1721 transcripts upregulated in TGFΒ3−/−, 1030 (60%) were shared with WT. Interestingly, a significant portion of the upregulated genes in WT (30%, n = 1792) were downregulated between E15.5 and E16.5, but not in TGFΒ3−/−’s E15.5 to E16.5 (Table  1). The names of the genes significantly increased from E14.5 to E15.5 followed by downregulation in E16.5 and their corresponding Venn Diagrams are presented in Additional file 6. This first-up-then-down pattern of expression suggests that these transcripts may be crucial at E15.5, which corresponds to the stage of midline epithelial seam (MES) degradation and palatal fusion. Since it was not feasible to analyze all significantly altered genes during palatal stages, we have shortlisted the genes to be analyzed to CP-causing genes only.

Table 1.

Number of transcript changes among TGFβ3 alleles

All Significant E14.5 - E15.5 E14.5 - E16.5 E15.5 - E16.5
TGFβ3−/−
1806
2096
747
TGFβ3+/−
1170
1262
783
WT
5966
1903
2256
Upregulated
 
 
 
TGFβ3−/−
1721
1941
629
TGFβ3+/−
1063
1181
574
WT
5945
1835
417
Downregulated
 
 
 
TGFβ3−/−
86
155
118
TGFβ3+/−
108
81
209
WT
21
68
1839
TGFβ3−/−: Homozygous
 
 
 
TGFβ3+/−: Heterozygous
 
 
 
WT: Wild-type      

Identification of CP related genes and their expression patterns

Following the identification of genes causing CP in humans and mice, as described in the methods, we generated a Venn diagram to detect common genes between humans and mice that cause CP after becoming dysfunctional (Additional file 7). We identified 37 common genes that cause CP in both species. Within the human CP list, there were 194 transcripts that share orthologous copies with mice. We approached these genes as putative CP genes, considering that their KO models have not been generated yet. Overall, the CP gene list we developed includes 322 genes: 128 genes derived from mouse and 230 genes derived from human databases (Table  2). For an enhanced visualization of the expression values (FPKM) of all CP genes (n = 322), a Circos image was drawn based on the circular composition of CP genes located on human chromosomes [33] (Additional file 8).

Table 2.

List of genes causing CP according to MGI and OMIM databases (n = 322)

Acvr1 Lhx8 Cask ATR FLNB MEGF10 RPL11
Akap8
Luzp1
Cdkn1c
ATRX
FLVCR2
MID1
RPL5
Ap2b1
Meox2
Chd7
B3GALTL
FRAS1
MKS1
RPS17
Apaf1
Mmp14
Col2a1
B3GAT3
FREM2
MLL2
RPS19
Bmp7
Mmp16
Dhcr7
B9D1
FTO
MSX2
RPS7
Bnc2
Mn1
Dlx5
BCOR
G6PC3
MTHFR
SALL4
Boc
Mnt
Efnb1
BMP2
GABRB3
MUTYH
SC5DL
Cacna1s
Msc
Eya1
BMP4
GATA6
MYH3
SCARF2
Cdon
Ncoa6
Fgf10
BMPER
GCMB
NBS1
SEMA3E
Chrd
Ncor2
Fgfr1
BRAF
GDF1
NEB
SF3B4
Chuk
Osr2
Fgfr2
BRIP1
GDF6
NEK1
SHFM3
Crebbp
Pax9
Flna
BUB1B
GJA1
NIPBL
SLC26A2
Crk
Pbx1
Foxc2
CANT1
GJB2
NKX2-5
SLC35D1
Ctgf
Pcgf2
Foxe1
CASR
GLI3
NKX2-6
SMOC1
Ctnnb1
Pdgfc
Gad1
CDC6
GPC3
NSD1
SMS
Dlg1
Pdgfra
Hoxa2
CDH1
GUSB
OFC4
SOX2
Dlx1
Pds5a
Impad1
CFC1
H19
OFD1
SPECC1L
Dlx2
Pds5b
Inpp5e
CHRNA1
HCA1
ORC1
SPINT2
Ednrb
Phc1
Irf6
CHRND
HPGD
OTX2
SPTLC1
Ext1
Phc2
Kcnj2
CHRNG
HVEC
PAX3
SRY
Fgf18
Pkdcc
Msx1
CHST14
HYAL1
PAX8
STRA6
Fgf9
Prdm16
Pitx1
CHSY1
HYLS1
PEX7
TAB2
Fign
Prrx2
Prrx1
COL11A1
ICK
PHF8
TBX15
Foxf2
Ptprf
Ror2
COL11A2
IL1B
PIGL
TBX19
Foxg1
Ptprs
Runx2
COL9A2
IL1RN
PIGV
TBX4
Fst
Rad23b
Satb2
COLEC11
IMAGE
PIK3CA
TBX5
Fuz
Rfng
Shh
COMT
IRF1
PLCB4
TCTN2
Fzd1
Rspo2
Six3
CRLF1
KAL1
POLR1C
TGFβ1
Fzd2
Ryk
Sox9
DGCR2
KAT6B
POLR1D
TGFβR1
Gab1
Ryr1
Sumo1
DHCR24
KCNJ13
POMT1
TMCO1
Gli2
Sfn
Tbx1
DHODH
KIAA1289
POMT2
TMEM216
Gpr124
Shox2
Tbx22
DIS3L2
KIF22
PORCN
TMEM67
Grb2
Sim2
Tcof1
DOK7
KIF7
PQBP1
TRIM37
Gsk3b
Skor2
Tfap2a
DXS423E
KLF6
PROKR2
TRPS2
Hand2
Slc32a1
TGFβ3
DYNC2H1
KRAS
PTCH1
TWIST1
Hoxa1
Snai1
TGFβr2
EFTUD2
L1CAM
PTEN
UBB
Hs2st1
Snai2
Tp63
EPHB2
LMNA
PTH
VCAN
Ift172
Sox11
ABCD3
ERBB2
LMX1B
PTPN11
WDR65
Ift88
Sox5
ACAN
ERCC5
LRP4
PXMP3
WNT3
Inhba
Tbx10
ACTB
ESCO2
MADH3
RAI1
WNT5A
Inhbb
Tbx2
ALX1
FAM123B
MAP2K1
RAPSN
WNT7A
Itgav
Tbx3
ALX3
FAM20C
MAP2K2
RB1
WT1
Itgb8
TGFβ2
ARHGAP31
FBN1
MASP1
RBM10
ZEB2
Jag2
Tshz1
ARID1B
FGD1
MBTPS2
RECQL4
ZIC2
Jmjd6
Wdr19
ARNT
FGF8
MCOPS8
RIPK4
ZIC3
Kif3a Zeb1 ASXL1 FGFR3 MED12 RPGRIP1L ZMPSTE24

On the basis of our differential expression profile data between genotypes and gestational ages (Table  1), we decided to group and analyze the potential CP genes based on their expression patterns throughout different stages of palatogenesis. We categorized the 322 CP genes identified above into 9 possible expression patterns based on the comparison of their expression values (upregulated, downregulated, or unchanged) across the 3 different gestational age groups analyzed by RNA-seq (E14.5, E15.5, and E16.5) (Figure  3). The patterns (p) represent the change in expression value of the transcript first from E14.5 to E15.5, and second from E15.5 to E16.5, which is explained in detail in Figure  3. The genes in each expression pattern group are listed and common genes among them are diagrammed in Additional file 9.

Figure 3.

Figure 3

Expression patterns of CP genes through palatal stages: FPKM values of each CP gene were transformed into log10 (FPKM + 1) and rounded up to 2 decimal places. These 322 CP genes were categorized into 9 possible expression patterns (p) based on the comparison of the transformed FPKM values with FC ≥ 2.0 across 3 different gestational groups for each genotype (n = 3). The patterns represent the expression value of the transcript first from E14.5 to E15.5, then from E15.5 to E16.5. Each pattern is color-coded to differentiate patterns for further analysis. p1 unchanged-unchanged (remain unchanged); p2 upregulated-unchanged; p3 unchanged-upregulated; p4 upregulated-downregulated; p5 upregulated-upregulated; p6 downregulated-unchanged; p7 downregulated-upregulated; p8 unchanged-downregulated; and p9 downregulated-downregulated.

Considering the differential expression of CP genes throughout palatogenesis, we generated individual heatmaps according to the expression patterns of CP related genes and grouped them into corresponding genotypes (Figure  4). In TGFΒ3−/−, the majority of transcripts (n = 294; 92%) were placed in p1, which denotes unchanged expression of genes throughout gestation. Similarly, in TGFΒ3+/− samples, the majority of transcripts were in p1 (n = 293; 91%). Surprisingly, the list of genes in p1 from TGFΒ3−/− and TGFΒ3+/− were highly comparable (92%) (Additional file 9). The genes clustered in p1 unique to TGFΒ3−/−, but not in TGFΒ3+/−, were Chrng, Foxc2, H19, Kcnj13, Lhx8, Meox2, Shh, and Six3 (Table  3). We concluded that these 8 genes demand additional attention due to their unchanged level of expression during crucial stages of palatogenesis.

Figure 4.

Figure 4

CP genes represented in pattern-oriented heatmaps. Transcripts were clustered based on expression patterns (n = 9) and heatmaps were constructed using the R pheatmap package. Each column corresponds to the transformed FPKM value of transcripts in duplicate, and each row corresponds to a gene. None of the genotypes had transcripts clustered in p9. Additionally, p4 in TGFΒ3−/− and p5 in TGFΒ3+/− did not contain transcripts. Therefore, the total number of CP gene patterns/genotype groups is 22. The increasing intensity of red to yellow signifies a higher level of expression (FPKM) in the given sample of a specific transcript. The list of genes for each particular pattern and genotype are shown in Additional file 9.

Table 3.

Unique TGFβ3−/− genes following the p1 pattern

  TGFβ3−/− TGFβ3 +/− WT Name Type
Chrng
p1
p3
p4
Cholinergic receptor, nicotinic, gamma (muscle)
Transmembrane glycoprotein
Foxc2
p1
p8
p6
Forkhead box C2
Transcription factor
H19
p1
p7
p2
Imprinted maternally expressed transcript H19
Non-protein coding RNA
Kcnj13
p1
p2
p2
Potassium inwardly-rectifying channel, subfamily J, member 13
Inwardly rectifying potassium channel family of proteins
Lhx8
p1
p6
p4
LIM homeobox 8
Transcription factor
Meox2
p1
p6
p8
Mesenchyme homeobox 2
Antennapedia-like homeobox-containing genes
Shh
p1
p3
p4
Sonic hedgehog
Morphogen – Secreted protein
Six3 p1 p2 p3 SIX homeobox 3 Transcription factor

Finally, in WT samples, the transcripts were more equally dispersed into p1, p2 and p4 as 29%, 32%, and 23% respectively. These results suggest that in normal mice, the majority of CP transcripts are required to be upregulated between E14.5 and E15.5 (denoted as p2 and p4), which corresponds to the most crucial stages of palatogenesis- the adhesion and the fusion stages [34]. Some of these upregulated genes were eventually downregulated (p4) or remained at the same expression levels (p2), which suggest that they may contribute to the final phase of palate development.

Downstream pathway analysis of CP genes

We used the IPA software to determine how the expression pattern-clustered CP genes interconnected with each other to facilitate cellular biofunctions and signaling. According to the core analysis of all CP genes (n = 322) using the IPA, the most significant top 5 cellular functions were cellular development, gene expression, cellular growth and proliferation, cell signaling and cell morphology (Table  4). Although the numbers of molecules were higher in some cellular functions, the ranking was based on p-values calculated by IPA according to relevance of datasets with IKB. Differentially expressed molecules clustered into GO of these cellular biofunctions are tabulated in Additional file 10. The pattern-oriented distribution of the top cellular biofunctions (n = 8) in relation to the TGFβ pathway are illustrated with a color coded matrix with the darker color representing higher relevance between cellular functions and uploaded CP gene pattern datasets (Figure  5).

Table 4.

Differentially expressed molecules clustered into GO of cellular biofunctions

Category p-Value # Mols
Gene Expression
1.23E-30
151
Cellular Growth and Proliferation
1.48E-23
199
Cell Cycle
2.58E-14
76
Cellular Movement
2.74E-12
107
Cell Signaling
3.20E-22
25
Cell Death
3.44E-12
168
Cell Morphology
4.03E-16
103
TGFβ Signaling 5.02E-18 24

Figure 5.

Figure 5

Downstream effects analysis of CP gene patterns using IPA. The effects of TGFβ3 alleles on CP genes during palatogenesis were examined in the context of differentially expressed human and mouse genes using Ingenuity Pathway Analysis (IPA). Entrez IDs of each CP gene pattern/genotype group (n = 22) were uploaded to IPA as individual datasets. Each identifier was mapped to its corresponding gene object in the Ingenuity Knowledge Base (IKB). Interactions were then queried between these datasets and all other gene objects stored within IPA to generate a set of overlapping direct interaction networks. The significant genes were categorized, compared to functional categories in the IPA database, and ranked according to their p-values. The cellular processes that are most significantly altered in our datasets were: gene expression, cellular development, cellular growth and proliferation, cell signaling, cell morphology, cell death and survival, cellular movement, and cell cycle. The pattern-oriented distribution of the top cellular biofunctions were ranked and illustrated in a color coded matrix: the darker color represents greater relevance between cellular function and CP gene datasets.

The CP genes unique to p1 of TGFΒ3−/− (n = 8) may have necessary functions for the complete development of palatal shelves. Therefore, we analyzed the interaction of these genes with each other, as well as with key biofunctions and TGFβ signaling using IPA (Figure  6). We found that 5 of these genes (Shh, Foxc2, Six3, Meox2, and H19) are closely related to each other, with Tnf (tumor necrosis factor) and Ccnd1 (Cyclin d1) being their intersecting molecules. These genes were mostly associated with cell morphology and cell death biofunctions, and were indirectly associated with TGFβ signaling through Pias4 (protein inhibitor of activated STAT, 4).

Figure 6.

Figure 6

Cellular biofunction and pathway analysis of CP genes unique to p1 of TGFΒ3−/− (n = 8). Entrez IDs of CP genes unique to p1 of TGFΒ3−/− were identified using the Venn diagram and uploaded to IPA as an individual dataset. Core analysis was run by default settings on IPA, and the top cellular biofunctions, diseases, and canonical pathways were detected. IPA Path Designer was used to generate the network of these genes through an overlay of the biofunctions and pathways. Genes belonging to the Unique TGFΒ3−/− p1 list were merged into a network (Foxc2, H19, Meox2, Six3, Shh) and gray-shaded. Relationships are shown with colored arrows for easy gene identification.

According to IPA core analysis, there were 19 molecules associated with TGFβ signaling [Ratio: 19/89 (21%)] among all of the CP genes (n = 322). TGFβ signaling molecules along the pathway and their cellular locations are illustrated in Figure  7. TGFβ signaling transcripts expressed in both TGFΒ3−/− and TGFΒ3+/− were (n = 18): Acvr1, Bmp2, Bmp4, Bmp7, Crebbp, Grb2, Inhba, Inhbb, Kras, Map2k1, Map2k2, Nkx2-5, Runx2, Smad3, TGFβ2, TGFβ3, TGFβr1, TGFβr2 (Table  3). All of these molecules followed the p1 expression pattern, which represents a uniform level of expression throughout all time points. However, within WT transcripts, TGFβ-associated molecules were almost equally distributed among p1, p2, p4, and p8. All represented TGFβ pathway molecules (n = 19) were grouped into genotypes according to their expression patterns (Table  5). When the significantly altered CP genes were overlaid with TGFβ signaling, all of these genes followed the Smad-dependent pathway (Figure  7).

Figure 7.

Figure 7

Characterization of CP genes in relation to the canonical TGFβ signaling pathway. The cellular localization of CP genes which function in the canonical TGFβ pathway was illustrated using IPA Path Designer. CP gene datasets uploaded to IPA were analyzed using default settings for the core analysis. A total of 19 CP transcripts from these datasets were associated with the TGFβ canonical pathway: ACVR1, BMP2, BMP4, BMP7, CREBBP, GRB2, INHBA, INHBB, KRAS, MAP2K1, MAP2K2, NKX2-5, RUNX2, SMAD3, TGFβ2, TGFβ3, TGFβR1, and TGFβR2. These datasets (n = 7) were selected and overlaid with each other using the IPA Path Designer. Color codes reflect the genotype and expression pattern of CP genes as follows: Orange lines: both TGFβ3−/−p1 and TGFΒβ3+/−p1; green filled: WTp1; light blue filled: WTp2; orange filled: WTp4; and purple filled: WTp8. The types of molecules are annotated in the legend.

Table 5.

TGFβ pathway molecules grouped into genotypes with patterns

TGFΒ3−/− TGFΒ3+/− WT
p1
p2
p1
p1
p2
p4
p8
Acvr1
Tgfβ1
Acvr1
Bmp2
Acvr1
Inhba
Bmp4
Bmp2
 
Bmp2
Map2k2
Bmp7
Kras
Crebbp
Bmp4
 
Bmp4
TGFβr1
Grb2
Map2k1
Smad3
Bmp7
 
Bmp7
 
Inhbb
Nkx2-5
Tgfβ2
Crebbp
 
Crebbp
 
Runx2
 
 
Grb2
 
Grb2
 
Tgfβ1
 
 
Inhba
 
Inhba
 
Tgfβ3
 
 
Inhbb
 
Inhbb
 
TGFβr2
 
 
Kras
 
Kras
 
 
 
 
Map2k1
 
Map2k1
 
 
 
 
Map2k2
 
Map2k2
 
 
 
 
Nkx2-5
 
Nkx2-5
 
 
 
 
Runx2
 
Runx2
 
 
 
 
Smad3
 
Smad3
 
 
 
 
Tgfβ2
 
Tgfβ1
 
 
 
 
Tgfβ3
 
Tgfβ2
 
 
 
 
TGFβr1
 
Tgfβ3
 
 
 
 
TGFβr2
 
TGFβr1
 
 
 
 
    TGFβr2        

Discussion

Global transcriptome of TGFβ3 alleles

Palatogenesis is a complex process involving multiple steps, including: palatal shelf growth, elevation, adhesion and fusion (Additional file 11). It requires the orchestration of several key cellular functions (such as cellular proliferation, differentiation, movement, communication, and death) regulated by numerous genes and signaling pathways. It has been well documented that disruption of structural and regulatory genes involved in any stage of palatogenesis, which occurs between E11.5 and E16.5 in mice, contributes to the formation of cleft palate [34]. On the basis of our studies and others [5,6,35], the stages from E14.5 to E16.5 have been identified as the most critical phases. During these stages, the palatal shelves adhere to each other with a seam and the seam begins to disintegrate in order to complete fusion, resulting in a confluent palate. Therefore, we anticipated that the majority of the genes contributing to physiological palatogenesis would be progressively regulated during these phases. In this study, we took advantage of high-throughput sequencing technology, RNA-Seq, to quantitatively analyze the expression of transcripts in mouse embryonic palatal tissues at three developmental stages.

The integrity and quality of the isolated RNA are the primary factors contributing to successful RNA-Seq studies [36], therefore we ensured that our palatal RNA samples were of high quality based on their RIN (>8.4). In our study, we used the “paired end” approach for RNA-Seq, wherein a single 100bp-long read is sequenced at both ends, which allows the tracking of alternate splice junctions, as well as further analysis of SNPs, insertions and deletions; thus providing more accurate and informative sequencing results from our samples [37,38]. We submitted the raw reads of RNA-Seq data (fastq files) to SRA and the assembly data to the Transcriptome Shotgun Assembly Database.

Our global transcriptome analyses of TGFβ3 alleles showed that significant numbers of transcripts (~6000 genes) were differentially expressed between E14.5 and E15.5, particularly within the WT fetuses, indicating functional roles of these genes during this time period of palatogenesis (Table  1). Interestingly, we found lower numbers of transcriptome alterations between the same time points within the TGFβ3−/− mice (n = 2770), which can be explained by the consequential suppression of gene clusters due to inactivation of the TGFβ3 gene. However it was surprising to observe that the lowest number of transcriptome changes were in TGFβ3+/− mice, n = 1993, which are phenotypically identical to WT mice [25]. This may suggest that only this number of gene alterations is sufficient to enhance the palatal growth and fusion.

Furthermore, in normal mice (WT) the majority of transcripts were upregulated during the early transition of palatogenesis (E14.5 - E15.5) and downregulated during the later transitions (E15.5 - E16.5), suggesting that these are critical stages of palatogenesis requiring extensive gene regulation. This pattern of upregulated-downregulated expression suggests that particular CP genes may follow the same expression pattern to stimulate disintegration of the palatal seam, resulting in a confluent palate.

Understanding the expression of CP genes

The OMIM and MGI databases provide up-to-date and adequately classified information about genetic diseases in humans and mice, respectively; therefore, the identification of CP related genes from either species was uncomplicated. However the merger of the information from these databases required further computer analysis to identify common and individual CP genes between the two species (Additional file 7). To our knowledge, our list of CP genes (Table  2) covers the most comprehensive number of genes (n = 322), which cause CP when mutated in either species alone or both species [39,40].

It was expected that TGFβ3 allelic mice follow 9 different expression pattern combinations during the E14.5-E15.5 and the E15.5-E16.5 transitions (Figure  3). Interestingly, none of the genes followed pattern 9, which represents the continuous downregulation of the transcript throughout all palatal stages. We observed that the majority of the CP genes were equally clustered into p2, p4, and p8 in WT samples (WT), representing normal palate development (Figure  4). Conversely, in both the TGFΒ3−/− and TGFΒ3+/− samples, almost all CP transcripts (91%) were accumulated in p1, and demonstrated high similarity between datasets (92%) (Additional file 9). This suggests that expression regulation of CP genes in both of these allelic mice was either delayed or abandoned.

Unique TGFΒ3−/− genes following the p1 pattern

Although the pattern of transcriptional expression between TGFΒ3−/− and TGFΒ3+/− are unexpectedly similar, phenotypic expression of CP occurs only in TGFΒ3−/− and not in TGFΒ3+/− pups [24,25]. This suggests that the unique p1 genes of TGFΒ3−/− (n = 8; Chrng, Foxc2, H19, Kcnj13, Lhx8, Meox2, Shh, and Six3) may play crucial roles in the completion of palatogenesis. Therefore, we analyzed their specific functions within the cell, their roles in palatogenesis, and their interactions with the TGFβ signaling pathway using IPA and GeneCards [41].

Chrng (Cholinergic receptor, nicotinic, gamma) is a transmembrane glycoprotein and has been shown to play a role in neuromuscular organogenesis and ligand binding [42]. Mutation of Chrng causes the developmental disorder multiple pterygium syndrome in humans, exhibited by isolated CP, short stature, vertebral (spine) defects, joint contractures, and webbing of the neck, armpit, elbow, and knee [42,43]. Pterygium is also associated with Irf6 and p63 mutations in humans [44], and both Irf6 and p63 are known to be closely associated with TGFβ signaling. Nevertheless, according to current literature, we could not find any relationships between Chrng and other molecules of the TGFΒ3−/− unique p1 list (Figure  6), or the TGFβ pathway.

Foxc2 (Forkhead box C2) is a member of the forkhead box (FOX) family of transcription factors and has been shown to be functional during development of mesenchymal tissues [45]. Mutations in Foxc2 are responsible for the hereditary lymphedema-distichiasis syndrome with CP observed in some patients [46]. In addition, Foxc2 is also involved in cancer metastases. In particular, expression of Foxc2 is induced when epithelial cells undergo epithelial-mesenchymal transition (EMT) [47]. Recently, Lindley at al. (2010) showed that Foxc2 may play a role in EMT maintenance when human mammary epithelial cells are treated with TGFβ [45]. (The Foxc2 knockout mouse is not available, according to KOMP). Overall, these relationships suggest that Foxc2 could be a vital element of palatal confluency through facilitation of EMT. Furthermore, failure of proper Foxc2 transcript regulation may result in CP in TGFβ3−/− fetus.

H19 is an imprinted maternally expressed transcript and expresses a long noncoding RNA. It has been shown that an enhancer deletion affects the expression of both H19 and Insulin-like growth factor 2 (Igf2), thus the Igf2 and the H19 genes are proposed to utilize a set of common enhancers [48,49]. When Thomas et al. (1997) produced transgenic mice that express the Igf2 gene under the control of the H19 enhancers, a large fraction of homozygous mice developed CP [44]. In our network analysis (Figure  6), we found that H19 is indirectly related with TGFβ signaling through TNF, which suggests that it may play a crucial role during palatogenesis under regulation of TGFβ.

The Kcnj13 gene encodes a member of the inwardly rectifying potassium channel family of proteins and regulates ion transmembrane transport and mutations in Kcnj1 are associated with snowflake vitreoretinal degeneration [50]. There were no significant relationships found between Kcnj12 and other molecules of the TGFΒ3−/− unique p1 list (Figure  6), or the TGFβ pathway. However, this does not rule out its potential regulatory interaction with TGFβ signaling, and therefore demands additional studies to determine its role during palate development.

Lhx8 is a member of the LIM homeobox family of transcription factor proteins, which are involved in patterning and differentiation of various tissue types. Lhx8 is expressed in the mesenchyme of the mouse palatal structures throughout their development [11]. Zhao et al. (1999) showed that in Lhx8 homozygous mutant embryos, the bilateral primordial palatal shelves formed and elevated normally, but they often failed to make contact and to fuse properly, resulting in a cleft secondary palate [51]. Inoue M et al. (2006) proposed that the L3/Lhx8 gene contributes to epithelial mesenchymal interactions during facial morphogenesis and that Fgf-8b and Tgfβ3 were responsible for Lhx8 expression in the maxillary process [52]. Our RNA-Seq study of TGFβ3 −/− fetuses further confirmed the requirement of upregulated expression of Lhx8 throughout palatal morphogenesis.

The Meox2 gene (mesenchyme homeobox 2) encodes a member of a subfamily of non-clustered, diverged, antennapedia-like homeobox-containing genes [53,54]. Jin et al. (2006) found that Meox2 is highly expressed in the posterior region of the developing palate, and the reduction of Meox2 gene levels increased the susceptibility of mice to cleft palate through a novel post-fusion mechanism [53]. The posterior cleft in E15.5 Meox2 −/− mice contained no epithelial cells, therefore they proposed that Meox2 may function to strengthen the fusing zone of palatal shelves. The lack of strong fusion of epithelial cells during stable expression of Meox2 may explain why the palatal shelves of TGFβ3−/− mice attach to each other, but fail to fuse [25]. Recently, Valcourt et al. (2007) showed that ectopic Meox2 suppressed epithelial cell proliferation in cooperation with Tgfβ1, and mediated induction of the cell cycle inhibitor gene p21 [54]. Furthermore, they showed that Meox2 failed to promote EMT and partially blocked Tgfβ1-induced EMT [54]. Considering the requirement of EMT during palatal fusion, Meox2 expression needs to be downregulated at the later stages of palatogenesis, as observed in both WT and TGFβ3+/− mice in our study.

Sonic hedgehog, Shh, is a member of the hedgehog family of secreted proteins and has been implicated as the key inductive morphogen in patterning of vertebrates during organogenesis; particularly the ventral neural tube, the anterior-posterior limb axis, and the ventral somites [55]. Shh signaling plays essential roles in craniofacial development by regulating a number of transcription factors and signaling interactions that take place between the epithelium and mesenchyme during normal palatogenesis [56]. Lan et al. (2009) showed that Shh is predominantly expressed in palatal epithelia and signals directly to the palatal mesenchyme to regulate palatal mesenchyme cell proliferation through maintenance of cyclin D1 and D2 (Ccnd1 and Ccnd2) expression [56]. Recently, Sasaki et al. (2007) showed that Shh expression in the palates of Tgfβ3-null mice was reduced throughout E12.5–E15.5, and thereby proposed that Shh may be involved in TGFβ3 regulation of normal palatal fusion [57]. Our results further support that upregulated expression of Shh is necessary for the successful completion of palatogenesis, as observed in the TGFβ3+/− and WT mice, but not in TGFβ3−/−.

SIX homeobox 3 (Six3) is a transcription factor crucial to embryonic development by providing the necessary instructions for the formation of the forebrain and eye development [58]. Similar to Shh [59], mutations in the homeodomain of the human Six3 gene cause holoprosencephaly and are associated with midline facial cleft - tessier cleft 14 [60]. In mice, it has been shown that Six3 protein increases expression of Ccnd1 protein [61], which is directly related to TGFβ signaling (Figure  6). Our results indicate that Six3 expression has to be upregulated after E14.5 for the precise progression of palate development. Therefore, the interaction of Six3 with TGFβ3 and their mechanism of activation deserve further analysis using biological techniques.

Cellular biofunctions and TGFβ pathway molecules

The expression pattern-clustered CP genes interconnect each other to facilitate cellular biofunctions and canonical pathways. The relevance of these groups of gene datasets was detected using IPA core analysis (Figure  5).

Our primary objective in this study was to analyze the transcriptomal differences among the alleles of TGFβ3 knockout mice. Thus, it was critical to find the relationship of CP genes with the TGFβ signaling pathway, which encompasses 89 molecules (Figure  7). TGFβ3 controls the fusion of palatal shelves by EMT and apoptosis [5,14,62]. Furthermore, TGFβ1 and TGFβ2 regulate mesenchymal cell proliferation and extracellular matrix synthesis in the palate [63]. Although not all of the molecules of the TGFβ signaling pathway are associated with CP genes, it is noteworthy that all of the related CP genes followed the Smad-dependent pathway. This may indicate that TGFβ3 regulates the CP genes through Smad-signaling, with ancillary involvement of ERK, PI3K, JNK, MAPK, etc. Thus, the promoter regions of these CP genes should be analyzed for Smad-binding elements using ChIP-PCR or ChIP-Seq. The majority of these CP genes followed the p8 expression pattern in WT mice, which represents the unchanged-downregulated expression pattern. This suggests that TGFβ molecules are required during E14.5 and E15.5, but are less necessary during E16.5, which correlates with our previous findings demonstrating the sequential function of TGFβ3 throughout the degradation of the palatal seam [5].

Overall, these sequencing and bioinformatics analysis studies are used as gene identification tools for future experiments to further understand the detailed mechanism of palate development. The 8 putative CP causing genes and Smad-dependent TGFβ signaling molecules will be analyzed in detail using gene and protein expression studies, as well transcription factor and promoter binding assays.

Conclusions

As a result of the complexity of TGFβ signaling, apparently simple but highly important questions regarding the development of cleft palate remain unanswered, including: what is the genetic footprint of TGFβ3 in normal palatogenesis and how does the deregulation of TGFβ3 signaling cause cleft palate? In this study, we have attempted to answer these questions through the extensive quantitative transcriptome analysis of the palates of TGFβ3 knockout mice through RNA sequencing technology. Our study represents the first detailed analysis of palatal transcriptomes generated by RNA-Seq technology during critical stages of palatogenesis. The dataset described here will provide an enriched resource for searching transcripts that may play key regulatory roles at different stages of palatal development. Furthermore, we identified 8 key genes which may play fundamental roles in the development of cleft palate in TGFβ3−/− mice. Our combined results provide an initial understanding of the complex genetic mechanism of TGFβ3-mediated palatogenesis, and may lead to the identification of genes for the targeted prenatal treatment of cleft palate.

Methods

Mouse breeding and genotyping

The generation and genotyping of TGFβ3−/− mice have been described previously [25]. TGFβ3 heterozygous (+/−) C57BL/6J male and female mice were provided by Tom Doetschman (BIO5 Institute, University of Arizona, AZ). Mice were housed in accredited animal facilities at the University of Nebraska Medical Center (UNMC). All procedures were approved by the IACUC (Institutional Animal Care and Use Committee) of UNMC. Homozygous (−/−) embryos were obtained by mating TGFβ3+/− male and female mice. The morning that the vaginal plug was pulled was considered embryonic day (E) 0.5. Tissue samples were individually labeled and embryonic tails were used for genotyping by PCR. Genomic DNA was prepared using the Wizard Genomic DNA Purification Kit (Promega – A1120). The primers used for TGFβ3 were as follows:

TGFβ3 Forward 5 TGGGAGTCATGGCTGTAACT 3

TGFβ3 Reverse 5 CACTCACACTGGCAAGTAGT 3

pMC1-Neo 5 GCCGAGAAAGTATCCATCAT 3

These primers amplified 600 bp and 400 bp fragments for the mutated and WT alleles, respectively. PCR conditions were 30 cycles of 95°C for 30 seconds, 57°C for 45 seconds, 72°C for 1 minute, followed by one cycle of 72°C for 10 minutes.

Typically, 8 fetuses were obtained from each pregnancy in a Mendelian-fashion 3 allelic-genotype (1:2:1) as 2 TGFβ3−/−, 4 TGFβ3+/−, and 2 WT. Palates were extracted from fetuses as described earlier [2] at 3 time points (E14.5, E15.5, and E16.5) as duplicates and the tissues were stored in RNAlater RNA Stabilization Reagent (QIAGEN, CA) to preserve transcript integrity. In order to evaluate the reproducibility of our experimental samples and procedures, two independent biological and technical replicates were assayed for each gestational age and each allele resulting in 36 (n = 3 × 3 × 2 × 2) samples for sequencing.

Protein, RNA extraction, construction of small RNA libraries, and RNA-Seq

Total proteins were extracted from the palatal shelves collected from embryonic (E) 15.5 and western blot was performed as previously done in our lab and described in [9,10] using antibody against TGFβ3 (R & D Systems, MN). We extracted the RNA using the RNeasy Kit (QIAGEN, CA) and evaluated the RNA for purity and concentration by ultraviolet spectroscopy (NanoDrop, Wilmington, DE). RNA integrity numbers (RIN) [64,65], based on an algorithm for judging the integrity of RNA samples, were evaluated using the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA). Libraries were prepared using the Illumina mRNA-Seq Sample Preparation Kit (San Diego, CA) according to the manufacturer’s instructions. Briefly, poly(A) + RNA was recovered from 1 μg of total RNA using two rounds of isolation with oligo-dT-coated Sera-Mag magnetic beads. The recovered poly (A) + RNA was then chemically fragmented. RNA fragments were converted to cDNA using SuperScript II and random primers. The second strands were synthesized using RNaseH and DNA Pol I. The ends of the cDNA were repaired using T4 DNA polymerase, T4 polynucleotide kinase, and Klenow DNA polymerase. A single adenosine was added to the 3 end using Klenow fragment (3 to 5 exo minus). Adaptors were attached to both ends of the cDNA using T4 DNA ligase. RNA fragments were extracted from a 2% low range ultra agarose sizing gel. The fragments were amplified by 15 cycles of PCR using Phusion DNA polymerase. Libraries were validated with Agilent Bioanalyzer (Palo Alto, CA). Libraries were diluted to 10 pM and applied to an Illumina flow cell using the Illumina Cluster Station. Sequencing was performed on Illumina HiSeq 2000 according to the manufacturer’s instructions [66,67].

Mapping and quantification of transcripts

RNA-Seq datasets were analyzed by following the recently published RNA-Seq protocol [31]. The 36 samples (paired-end reads, including biological replicates and technical replicates; n = 3 × 3 × 2 × 2) were individually mapped onto the mouse genome (mm9, build name NCBIM37) by using TopHat v1.4.1 [68-70]. Preprocessing steps were not applied to the raw data (fastq) files because the sequencing quality was very high in both 3 and 5 ends according to the FastQC reports [71]. The relative abundance of reads was normalized to the total read number and shown as fragments per kilobase per million mapped (FPKM) [71]. After the alignment, Cufflinks v1.3.0 [72-74] was used to estimate the abundances of transcripts. Cuffdiff from Cufflinks repository was used to perform pairwise comparisons between different genotypes and gestational groups. The dispersion between each biological replicate was also tested by Cuffdiff, as well as the dispersion between each technical replicate. CummeRbund ( http://compbio.mit.edu/cummeRbund/) software [31] provides functions for creating commonly used expression plots, such as: volcano, scatter and box plots, using processed data from Cuffdiff. We used CummeRbund to generate scatter plots to use for quality assessments of both biological and technical replicates.

We used the VennDiagram [75], an R package that enables the automated generation of highly-customizable, high-resolution Venn diagrams, to illustrate the number of transcripts in each experimental group which passed the significance and fold change (FC ≥ 2.0) filtrations through Cufflinks. Pattern-oriented listings of CP genes were illustrated with GeneVenn, a web application for comparing gene lists (Additional file 5).

Identification of cleft palate genes

We used the OMIM (Online Mendelian Inheritance in Man) database to retrieve 280 genes and loci involved in genetic diseases associated with CP syndromes in humans. We used information from developmental biology studies demonstrating CP in knockout mouse models and the Mouse Genome Informatics (MGI) website with the mammalian phenotype browser, the international database resource for the laboratory mice, to retrieve 128 genes (out of 223 genotypes) causing CP in mice. The pattern-oriented heatmaps of CP genes were illustrated by using R pheatmap package

Biological interpretation and pathway analysis

The effects of TGFβ alleles on CP genes during palatogenesis were examined in the context of differentially expressed human and mouse genes using Ingenuity Pathway Analysis (IPA), (Ingenuity Systems, CA), a web-delivered application used to discover, visualize, and explore relevant networks. After performing statistical analysis and filtering of the RNA-Seq data, the Entrez IDs and FPKM values of each group were uploaded to IPA as a dataset. Each identifier was mapped to its corresponding gene object in the IKB. Interactions were then queried between these datasets and all other gene objects stored within IPA to generate a set of direct interaction networks that were overlapped. The significant genes were categorized, compared to functional categories in the IPA database, and ranked according to their p-values. P-values less than 0.05 indicate a statistically significant, nonrandom association between a set of significant genes and a set of all genes related to a given function in the IKB [76]. Through the assessment of differentially expressed genes, cellular processes that were most significantly altered in our dataset were: gene expression, cellular development, cellular growth and proliferation, cell signaling, cell morphology, cell death and survival, cellular movement, and cell cycle. Based on these IPA analyses, the cellular biofunction categories mentioned above and the TGFβ pathway were filtered among other cellular processes and canonical pathways; then ranked according to their p-values and represented as a color coded matrix. Datasets associated with the TGFβ pathway were overlaid using the IPA Pathway Designer.

Abbreviations

TGFβ3: Transforming growth factor β 3; NGS: Next generation sequencing; RNA-Seq: RNA Sequencing; FC: Fold change; IPA: Ingenuity pathway analysis; E: Embryonic day; CP: Cleft palate; TGFΒ3−/−: Homozygous; WT: Wild-type; TGFΒ3+/−: Heterozygous; FPKM: Fragments per kilo base per million mapped.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

FO carried out the study design, animal breeding, sample collection, RNA extraction, identification of CP genes, IPA analyses, and drafted the manuscript. YL undertook RNA-Seq data quality control, processing, alignment, visualization, bioinformatics analyses, and helped to draft the manuscript. XZ performed animal breeding, collected palatal tissues and extract RNA CG and AN supervised the design and coordination of the study, and edited the manuscript. All authors read and approved the final manuscript.

Authors’ information

FO is a postdoctoral research associate in AN’s group and has a strong background in cell biology and bioinformatics. He designed the research, identified CP genes, and analyzed the transcriptome. YL is a graduate student in CG’s group with training in computer science. He performed all RNA-Seq data processing and several bioinformatics studies. XZ is a graduate student in AN’s group studying palatal signaling. CG (Associate professor) has an interdisciplinary background both in molecular and computational biology. He has published a number of computational prediction methods on protein subcellular localization since 2004. AN (Associate professor) is a craniofacial developmental biologist and has extensive experience in cell signaling that governs palate development in TGFβ3 mice.

Supplementary Material

Additional file 1

TGFβ3 protein expression in the alleles of TGFβ3 knockout mice. Western blot analysis show that while homozygous (HM; −/−) palates did not show any expression TGFβ3 protein in the palate collected at E15.5 heterozygous (HT; +/−) palates expressed limited levels of the TGFβ3 protein while the wild-type (WT; +/+) palates expressed higher level of TGFβ3 protein.

Click here for file (390.4KB, tiff)
Additional file 2

Table listing RNA integrity numbers (RIN) to monitor quality and degradation of RNA samples.

Click here for file (13.3KB, xlsx)
Additional file 3

Table showing the number of reads per replicate of RNA samples.

Click here for file (14.3KB, xlsx)
Additional file 4

Validation of biological and technical replicates using CummeRbund. The read numbers per transcript between the biological replicates were compared. Since the read number per transcript (FPKM value) ranged from 0 to >10,000, the read number + 1 was transformed by log10. Each dot represents data from one transcript. There was a high correlation between the two sequencing results (r = 0.98, Pearson’s correlation; p < 0.001 two tail).

Click here for file (1.5MB, tiff)
Additional file 5

Venn diagrams of all significantly differentially expressed genes. Differential expression of transcripts within TGFβ3 alleles (TGFΒ3−/−, TGFΒ3+/−, and WT) were compared using Venn diagrams. Lists of differentially expressed genes with their EntrezIDs were uploaded to the R VennDiagram package [75], the query was run, and the detected numbers and gene lists were exported to MS Excel. Relationships among the lists of differentially expressed genes within TGFβ3 allelic mice are depicted as intersections or uniqueness and grouped with their corresponding gestational age (E14.5, E15.5, and E16.5). A significant overlap of differentially expressed genes was observed among the allelic variants at each gestational age. Venn diagrams were not drawn per scale.

Click here for file (684.7KB, tiff)
Additional file 6

Table listing the names of the genes significantly increased from E14.5 to E15.5 followed by downregulation in E16.5 and their.

Click here for file (251.2KB, xlsx)
Additional file 7

Venn diagram of cleft palate (CP) related genes acquired from OMIM and MGI Using the OMIM (Online Mendelian Inheritance in Man) database, we identified 280 genes and loci involved in genetic diseases associated with CP syndromes in humans. Several studies have developed knockout mouse models which exhibit CP [77]. We used the Mouse Genome Informatics (MGI) international database resource for laboratory mice to identify 128 genes (out of 223 genotypes) that cause CP in mice. The mouse genes were filtered based on gene names only, because some of the same genes were either studied by different labs or have splice variants, and as a result they were listed several times as genotypes in the MGI database. The human and mouse CP-associated genes were compared to find common genes, and the Homologene tool of NCBI [78] was used to compare and detect the orthologs of human and mouse CP related genes. We determined that 50 (out of 280) human CP related genes and loci were not represented, i.e. non-orthologs, within the mouse genome.

Click here for file (308.4KB, tiff)
Additional file 8

Ideogram of CP genes on chromosomes. A zoomable Circos image is drawn based on circular composition of CP genes on chromosomes. Data tracks appear inside and/or outside the circular layout. When the PDF file is magnified, tracks include FPKM values of CP genes as histograms and labeled from the outer circle to inner circle as “gene name, 14.5HM, 15.5HM, 15.5HM, 14.5HT, 15.5HT, 16.5HT, 14.5WT, 15.5WT, and 16.5WT”.

Click here for file (3.3MB, pdf)
Additional file 9

Table listing the all the CP genes sorted according to their pattern.

Click here for file (187.2KB, xlsx)
Additional file 10

Table showing differentially expressed molecules clustered into GO of cellular biofunctions.

Click here for file (13.9KB, xlsx)
Additional file 11

Stages of palatogenesis in the mouse. (A) Time-course of palate development in mice. (B-F) Scanning electron micrographs showing oral views of the secondary palate at representative developmental stages [reprinted from Kaufman (Kaufman, 1992) with permission]. Orange lines mark sites of fusion between the medial nasal processes and maxillary processes, white arrowheads point to initial outgrowths of the primary palate, white arrows point to the initial outgrowth of the secondary palatal shelves, red arrowheads mark the initial site of palatal adhesion and fusion, and the yellow arrowhead points to the gap between the primary and secondary palates that will disappear following fusion between these tissues. (G-U) Representative histological frontal sections from anterior (G-K), middle (L-P), and posterior (Q-U) regions of the developing palate at each indicated stage. The middle palate region is flanked by the developing upper molar tooth germs (black arrows in M-P) and corresponds to the palatine region of the future hard palate. The posterior palate region corresponds to the future soft palate. At E11.5 (G,L,Q), the palatal shelf outgrowths arise from the oral surface of the maxillary processes. At E13.5 (H,M,R), the palatal shelves exhibit distinct shapes along the AP axis. By E14.5 (I,N,S), the palatal shelves have elevated to the horizontal position. At ~ E15.0 (J,O,T), the palatal shelves make contact at the midline and initiate fusion by formation of the midline epithelial seam (MES) in the mid-anterior region (arrowhead in O). By E15.5 (K,P,U), palatal shelf fusion is evident in the middle and posterior regions, with complete removal of the MES (black arrowheads in P,U). Remnants of the MES can still be seen in the anterior region (K) at this stage and the palatal shelves also fuse superiorly with the nasal septum. Magnification is not equivalent between stages. MdbP, mandibular process; MNP, medial nasal process; MxP, maxillary process; NS, nasal septum; PP, primary palate; PS, palatal shelf; SP, secondary palate; T, tongue. Adapted from Bush J O and Jiang R. Development 139, 231–243 (2012).

Click here for file (3.9MB, tiff)

Contributor Information

Ferhat Ozturk, Email: fozturk@unmc.edu.

You Li, Email: you.li@unmc.edu.

Xiujuan Zhu, Email: xzhu@unmc.edu.

Chittibabu Guda, Email: babu.guda@unmc.edu.

Ali Nawshad, Email: anawshad@unmc.edu.

Acknowledgement

This research is supported by NIDCR, NIH grant (R01DE017986) to Dr. Ali Nawshad. The UNMC DNA Sequencing Core receives partial support from the NCRR (1S10RR027754-01, 5P20RR016469, RR018788-08) and the NIGMS (8P20GM103427, GM103471-09). This publication’s contents are the sole responsibility of the authors and do not necessarily represent the official views of the NIH or NIGMS.

References

  1. Schutte BC, Murray JC. The many faces and factors of orofacial clefts. Hum Mol Genet. 1999;8(10):1853–1859. doi: 10.1093/hmg/8.10.1853. [DOI] [PubMed] [Google Scholar]
  2. Nawshad A, Hay ED. TGFbeta3 signaling activates transcription of the LEF1 gene to induce epithelial mesenchymal transformation during mouse palate development. J Cell Biol. 2003;163(6):1291–1301. doi: 10.1083/jcb.200306024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Nawshad A, LaGamba D, Olsen BR, Hay ED. Laser capture microdissection (LCM) for analysis of gene expression in specific tissues during embryonic epithelial-mesenchymal transformation. Dev Dyn. 2004;230(3):529–534. doi: 10.1002/dvdy.20064. [DOI] [PubMed] [Google Scholar]
  4. Nawshad A, Lagamba D, Polad A, Hay ED. Transforming growth factor-beta signaling during epithelial-mesenchymal transformation: implications for embryogenesis and tumor metastasis. Cells Tissues Organs. 2005;179(1–2):11–23. doi: 10.1159/000084505. [DOI] [PubMed] [Google Scholar]
  5. Ahmed S, Liu CC, Nawshad A. Mechanisms of palatal epithelial seam disintegration by transforming growth factor (TGF) beta3. Dev Biol. 2007;309(2):193–207. doi: 10.1016/j.ydbio.2007.06.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Nawshad A, Medici D, Liu CC, Hay ED. TGFbeta3 inhibits E-cadherin gene expression in palate medial-edge epithelial cells through a Smad2-Smad4-LEF1 transcription complex. J Cell Sci. 2007;120(Pt 9):1646–1653. doi: 10.1242/jcs.003129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Nawshad A. Palatal seam disintegration: to die or not to die? that is no longer the question. Dev Dyn. 2008;237(10):2643–2656. doi: 10.1002/dvdy.21599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Iordanskaia T, Nawshad A. Mechanisms of transforming growth factor beta induced cell cycle arrest in palate development. J Cell Physiol. 2011;226(5):1415–1424. doi: 10.1002/jcp.22477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Jalali AZX, Liu CC, Nawshad A. Induction of palate epithelial mesenchymal transition by Tranforming Growth Factor b3 signaling. Dev Growth Differ. 2012;54(6):633–648. doi: 10.1111/j.1440-169X.2012.01364.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Zhu X, Ozturk F, Liu C, Oakley GG, Nawshad A. Transforming growth factor-beta activates c-Myc to promote palatal growth. J Cell Biochem. 2012;113(10):3069–3085. doi: 10.1002/jcb.24184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Zhu X, Ozturk F, Pandey S, Guda CB, Nawshad A. Implications of TGFbeta on Transcriptome and Cellular Biofunctions of Palatal Mesenchyme. Frontiers Physiol. 2012;3:85. doi: 10.3389/fphys.2012.00085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Guo Z, Huang C, Ding K, Lin J, Gong B. Transforming growth factor beta-3 and environmental factors and cleft lip with/without cleft palate. DNA Cell Biol. 2010;29(7):375–380. doi: 10.1089/dna.2009.1009. [DOI] [PubMed] [Google Scholar]
  13. Dixon MJ, Marazita ML, Beaty TH, Murray JC. Cleft lip and palate: understanding genetic and environmental influences. Nat Rev Genet. 2011;12(3):167–178. doi: 10.1038/nrg2933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Nawshad A, LaGamba D, Hay ED. Transforming growth factor beta (TGFbeta) signalling in palatal growth, apoptosis and epithelial mesenchymal transformation (EMT) Arch Oral Biol. 2004;49(9):675–689. doi: 10.1016/j.archoralbio.2004.05.007. [DOI] [PubMed] [Google Scholar]
  15. Iordanskaia T, Nawshad A. Mechanisms of transforming growth factor b induced cell cycle arrest in palate development. J Cell Physiol. 2011;226(5):1415–1424. doi: 10.1002/jcp.22477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Ichikawa E, Watanabe A, Nakano Y, Akita S, Hirano A, Kinoshita A, Kondo S, Kishino T, Uchiyama T, Niikawa N. PAX9 and TGFB3 are linked to susceptibility to nonsyndromic cleft lip with or without cleft palate in the Japanese: population-based and family-based candidate gene analyses. J Hum Genet. 2006;51(1):38–46. doi: 10.1007/s10038-005-0319-8. [DOI] [PubMed] [Google Scholar]
  17. Shaw D, Ray A, Marazita M, Field L. Further evidence of a relationship between the retinoic acid receptor alpha locus and nonsyndromic cleft lip with or without cleft palate (CL +/− P) Am J Hum Genet. 1993;53(5):1156–1157. [PMC free article] [PubMed] [Google Scholar]
  18. Marazita ML, Mooney MP. Current concepts in the embryology and genetics of cleft lip and cleft palate. Clin Plast Surg. 2004;31(2):125–140. doi: 10.1016/S0094-1298(03)00138-X. [DOI] [PubMed] [Google Scholar]
  19. Marazita ML, Murray JC, Lidral AC, Arcos-Burgos M, Cooper ME, Goldstein T, Maher BS, Daack-Hirsch S, Schultz R, Mansilla MA. Meta-analysis of 13 genome scans reveals multiple cleft lip/palate genes with novel loci on 9q21 and 2q32-35. Am J Hum Genet. 2004;75(2):161–173. doi: 10.1086/422475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Lidral AC, Murray JC, Buetow KH, Basart AM, Schearer H, Shiang R, Naval A, Layda E, Magee K, Magee W. Studies of the candidate genes TGFB2, MSX1, TGFA, and TGFB3 in the etiology of cleft lip and palate in the Philippines. Cleft Palate Craniofac J. 1997;34(1):1–6. doi: 10.1597/1545-1569_1997_034_0001_sotcgt_2.3.co_2. [DOI] [PubMed] [Google Scholar]
  21. Lidral AC, Romitti PA, Basart AM, Doetschman T, Leysens NJ, Daack-Hirsch S, Semina EV, Johnson LR, Machida J, Burds A. Association of MSX1 and TGFB3 with nonsyndromic clefting in humans. Am J Hum Genet. 1998;63(2):557–568. doi: 10.1086/301956. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Mitchell H, Choudhury A, Pagano RE, Leof EB. Ligand-dependent and -independent transforming growth factor-beta receptor recycling regulated by clathrin-mediated endocytosis and Rab11. Mol Biol Cell. 2004;15(9):4166–4178. doi: 10.1091/mbc.E04-03-0245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Doetschman T. Interpretation of phenotype in genetically engineered mice. Lab Anim Sci. 1999;49(2):137–143. [PubMed] [Google Scholar]
  24. Kaartinen V, Voncken JW, Shuler C, Warburton D, Bu D, Heisterkamp N, Groffen J. Abnormal lung development and cleft palate in mice lacking TGF-beta 3 indicates defects of epithelial-mesenchymal interaction. Nat Genet. 1995;11(4):415–421. doi: 10.1038/ng1295-415. [DOI] [PubMed] [Google Scholar]
  25. Proetzel G, Pawlowski SA, Wiles MV, Yin M, Boivin GP, Howles PN, Ding J, Ferguson MW, Doetschman T. Transforming growth factor-beta 3 is required for secondary palate fusion. Nat Genet. 1995;11(4):409–414. doi: 10.1038/ng1295-409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. LaGamba D, Nawshad A, Hay ED. Microarray analysis of gene expression during epithelial-mesenchymal transformation. Dev Dyn. 2005;234(1):132–142. doi: 10.1002/dvdy.20489. [DOI] [PubMed] [Google Scholar]
  27. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10(1):57–63. doi: 10.1038/nrg2484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Marguerat S, Bahler J. RNA-seq: from technology to biology. Cell Mol Life Sci. 2010;67(4):569–579. doi: 10.1007/s00018-009-0180-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008;320(5881):1344–1349. doi: 10.1126/science.1158441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Cloonan N, Forrest AR, Kolle G, Gardiner BB, Faulkner GJ, Brown MK, Taylor DF, Steptoe AL, Wani S, Bethel G. Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat Methods. 2008;5(7):613–619. doi: 10.1038/nmeth.1223. [DOI] [PubMed] [Google Scholar]
  31. Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012;7(3):562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Auer PL, Doerge RW. Statistical design and analysis of RNA sequencing data. Genetics. 2010;185(2):405–416. doi: 10.1534/genetics.110.114983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Bush JO, Jiang R. Palatogenesis: morphogenetic and molecular mechanisms of secondary palate development. Development. 2012;139(2):231–243. doi: 10.1242/dev.067082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Taya Y, O’Kane S, Ferguson MW. Pathogenesis of cleft palate in TGF-beta3 knockout mice. Development. 1999;126(17):3869–3879. doi: 10.1242/dev.126.17.3869. [DOI] [PubMed] [Google Scholar]
  36. Winnepenninckx B, Van de Peer Y, Backeljau T, De Wachter R. CARD: a drawing tool for RNA secondary structure models. Biotechniques. 1995;18(6):1060–1063. [PubMed] [Google Scholar]
  37. Williams LJ, Tabbaa DG, Li N, Berlin AM, Shea TP, Maccallum I, Lawrence MS, Drier Y, Getz G, Young SK. Paired-end sequencing of Fosmid libraries by Illumina. Genome Res. 2012;22(11):2241–2249. doi: 10.1101/gr.138925.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Sengupta S, Bolin JM, Ruotti V, Nguyen BK, Thomson JA, Elwell AL, Stewart R. Single read and paired end mRNA-Seq Illumina libraries from 10 nanograms total RNA. JVE. 2011;56:e3340. doi: 10.3791/3340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Gritli-Linde A. p63 and IRF6: brothers in arms against cleft palate. J Clin Invest. 2010;120(5):1386–1389. doi: 10.1172/JCI42821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Moretti F, Marinari B, Lo Iacono N, Botti E, Giunta A, Spallone G, Garaffo G, Vernersson-Lindahl E, Merlo G, Mills AA. A regulatory feedback loop involving p63 and IRF6 links the pathogenesis of 2 genetically different human ectodermal dysplasias. J Clin Invest. 2010;120(5):1570–1577. doi: 10.1172/JCI40267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Stelzer G, Dalah I, Stein TI, Satanower Y, Rosen N, Nativ N, Oz-Levi D, Olender T, Belinky F, Bahir I. In-silico human genomics with GeneCards. Hum Genomics. 2011;5(6):709–717. doi: 10.1186/1479-7364-5-6-709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Vogt J, Morgan NV, Rehal P, Faivre L, Brueton LA, Becker K, Fryns JP, Holder S, Islam L, Kivuva E. CHRNG genotype-phenotype correlations in the multiple pterygium syndromes. J Med Genet. 2012;49(1):21–26. doi: 10.1136/jmedgenet-2011-100378. [DOI] [PubMed] [Google Scholar]
  43. Morgan NV, Brueton LA, Cox P, Greally MT, Tolmie J, Pasha S, Aligianis IA, Van Bokhoven H, Marton T, Al-Gazali L. Mutations in the embryonal subunit of the acetylcholine receptor (CHRNG) cause lethal and Escobar variants of multiple pterygium syndrome. Am J Hum Genet. 2006;79(2):390–395. doi: 10.1086/506256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Thomason HA, Zhou H, Kouwenhoven EN, Dotto GP, Restivo G, Nguyen BC, Little H, Dixon MJ, Van Bokhoven H, Dixon J. Cooperation between the transcription factors p63 and IRF6 is essential to prevent cleft palate in mice. J Clin Invest. 2010;120(5):1561–1569. doi: 10.1172/JCI40266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lindley LE, Briegel KJ. Molecular characterization of TGFbeta-induced epithelial-mesenchymal transition in normal finite lifespan human mammary epithelial cells. Biochem Biophys Res Commun. 2010;399(4):659–664. doi: 10.1016/j.bbrc.2010.07.138. [DOI] [PubMed] [Google Scholar]
  46. Fang J, Dagenais SL, Erickson RP, Arlt MF, Glynn MW, Gorski JL, Seaver LH, Glover TW. Mutations in FOXC2 (MFH-1), a forkhead family transcription factor, are responsible for the hereditary lymphedema-distichiasis syndrome. Am J Hum Genet. 2000;67(6):1382–1388. doi: 10.1086/316915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Hader C, Marlier A, Cantley L. Mesenchymal-epithelial transition in epithelial response to injury: the role of Foxc2. Oncogene. 2010;29(7):1031–1040. doi: 10.1038/onc.2009.397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Leighton PA, Saam JR, Ingram RS, Stewart CL, Tilghman SM. An enhancer deletion affects both H19 and Igf2 expression. Genes Dev. 1995;9(17):2079–2089. doi: 10.1101/gad.9.17.2079. [DOI] [PubMed] [Google Scholar]
  49. Bartolomei MS, Zemel S, Tilghman SM. Parental imprinting of the mouse H19 gene. Nature. 1991;351(6322):153–155. doi: 10.1038/351153a0. [DOI] [PubMed] [Google Scholar]
  50. Hejtmancik JF, Jiao X, Li A, Sergeev YV, Ding X, Sharma AK, Chan CC, Medina I, Edwards AO. Mutations in KCNJ13 cause autosomal-dominant snowflake vitreoretinal degeneration. Am J Hum Genet. 2008;82(1):174–180. doi: 10.1016/j.ajhg.2007.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Zhao Y, Guo YJ, Tomac AC, Taylor NR, Grinberg A, Lee EJ, Huang S, Westphal H. Isolated cleft palate in mice with a targeted mutation of the LIM homeobox gene lhx8. Proc Natl Acad Sci USA. 1999;96(26):15002–15006. doi: 10.1073/pnas.96.26.15002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Inoue M, Kawakami M, Tatsumi K, Manabe T, Makinodan M, Matsuyoshi H, Kirita T, Wanaka A. Expression and regulation of the LIM homeodomain gene L3/Lhx8 suggests a role in upper lip development of the chick embryo. Anat Embryol. 2006;211(3):247–253. doi: 10.1007/s00429-006-0078-0. [DOI] [PubMed] [Google Scholar]
  53. Jin JZ, Ding J. Analysis of Meox-2 mutant mice reveals a novel postfusion-based cleft palate. Dev Dyn. 2006;235(2):539–546. doi: 10.1002/dvdy.20641. [DOI] [PubMed] [Google Scholar]
  54. Valcourt U, Thuault S, Pardali K, Heldin CH, Moustakas A. Functional role of Meox2 during the epithelial cytostatic response to TGF-beta. Mol Oncol. 2007;1(1):55–71. doi: 10.1016/j.molonc.2007.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Cobourne MT, Green JB. Hedgehog signalling in development of the secondary palate. Front Oral Biol. 2012;16:52–59. doi: 10.1159/000337543. [DOI] [PubMed] [Google Scholar]
  56. Lan Y, Jiang R. Sonic hedgehog signaling regulates reciprocal epithelial-mesenchymal interactions controlling palatal outgrowth. Development. 2009;136(8):1387–1396. doi: 10.1242/dev.028167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Sasaki Y, O’Kane S, Dixon J, Dixon MJ, Ferguson MW. Temporal and spatial expression of Pax9 and Sonic hedgehog during development of normal mouse palates and cleft palates in TGF-beta3 null embryos. Arch Oral Biol. 2007;52(3):260–267. doi: 10.1016/j.archoralbio.2006.09.012. [DOI] [PubMed] [Google Scholar]
  58. Granadino B, Gallardo ME, Lopez-Rios J, Sanz R, Ramos C, Ayuso C, Bovolenta P. Rodriguez de Cordoba S: Genomic cloning, structure, expression pattern, and chromosomal location of the human SIX3 gene. Genomics. 1999;55(1):100–105. doi: 10.1006/geno.1998.5611. [DOI] [PubMed] [Google Scholar]
  59. Schell-Apacik C, Rivero M, Knepper JL, Roessler E, Muenke M, Ming JE. SONIC HEDGEHOG mutations causing human holoprosencephaly impair neural patterning activity. Hum Genet. 2003;113(2):170–177. doi: 10.1007/s00439-003-0950-4. [DOI] [PubMed] [Google Scholar]
  60. Wallis DE, Roessler E, Hehr U, Nanni L, Wiltshire T, Richieri-Costa A, Gillessen-Kaesbach G, Zackai EH, Rommens J, Muenke M. Mutations in the homeodomain of the human SIX3 gene cause holoprosencephaly. Nat Genet. 1999;22(2):196–198. doi: 10.1038/9718. [DOI] [PubMed] [Google Scholar]
  61. Goudreau G, Petrou P, Reneker LW, Graw J, Loster J, Gruss P. Mutually regulated expression of Pax6 and Six3 and its implications for the Pax6 haploinsufficient lens phenotype. Proc Natl Acad Sci USA. 2002;99(13):8719–8724. doi: 10.1073/pnas.132195699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Kaartinen V, Cui XM, Heisterkamp N, Groffen J, Shuler CF. Transforming growth factor-beta3 regulates transdifferentiation of medial edge epithelium during palatal fusion and associated degradation of the basement membrane. Dev Dyn. 1997;209(3):255–260. doi: 10.1002/(SICI)1097-0177(199707)209:3<255::AID-AJA1>3.0.CO;2-H. [DOI] [PubMed] [Google Scholar]
  63. Fitzpatrick DR, Denhez F, Kondaiah P, Akhurst RJ. Differential expression of TGF beta isoforms in murine palatogenesis. Development. 1990;109(3):585–595. doi: 10.1242/dev.109.3.585. [DOI] [PubMed] [Google Scholar]
  64. Schroeder A, Mueller O, Stocker S, Salowsky R, Leiber M, Gassmann M, Lightfoot S, Menzel W, Granzow M, Ragg T. The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol Biol. 2006;7:3. doi: 10.1186/1471-2199-7-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Fleige S, Pfaffl MW. RNA integrity and the effect on the real-time qRT-PCR performance. Mol Aspects Med. 2006;27(2–3):126–139. doi: 10.1016/j.mam.2005.12.003. [DOI] [PubMed] [Google Scholar]
  66. Bottomly D, Walter NA, Hunter JE, Darakjian P, Kawane S, Buck KJ, Searles RP, Mooney M, McWeeney SK, Hitzemann R. Evaluating gene expression in C57BL/6J and DBA/2J mouse striatum using RNA-Seq and microarrays. PLoS One. 2011;6(3):e17820. doi: 10.1371/journal.pone.0017820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, Hall KP, Evers DJ, Barnes CL, Bignell HR. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456(7218):53–59. doi: 10.1038/nature07517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Kim D, Salzberg SL. TopHat-Fusion: an algorithm for discovery of novel fusion transcripts. Genome Biol. 2011;12(8):R72. doi: 10.1186/gb-2011-12-8-r72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5(7):621–628. doi: 10.1038/nmeth.1226. [DOI] [PubMed] [Google Scholar]
  72. Roberts A, Pimentel H, Trapnell C, Pachter L. Identification of novel transcripts in annotated genomes using RNA-Seq. Bioinformatics. 2011;27(17):2325–2329. doi: 10.1093/bioinformatics/btr355. [DOI] [PubMed] [Google Scholar]
  73. Roberts A, Trapnell C, Donaghey J, Rinn JL, Pachter L. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biol. 2011;12(3):R22. doi: 10.1186/gb-2011-12-3-r22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, Van Baren MJ, Salzberg SL, Wold BJ, Pachter L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28(5):511–515. doi: 10.1038/nbt.1621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Chen H, Boutros PC. VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics. 2011;12:35. doi: 10.1186/1471-2105-12-35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Mori R, Xiong S, Wang Q, Tarabolous C, Shimada H, Panteris E, Danenberg KD, Danenberg PV, Pinski JK. Gene profiling and pathway analysis of neuroendocrine transdifferentiated prostate cancer cells. Prostate. 2009;69(1):12–23. doi: 10.1002/pros.20851. [DOI] [PubMed] [Google Scholar]
  77. Meng L, Bian Z, Torensma R, Von den Hoff JW. Biological mechanisms in palatogenesis and cleft palate. J Dent Res. 2009;88(1):22–33. doi: 10.1177/0022034508327868. [DOI] [PubMed] [Google Scholar]
  78. Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2007;35(Database issue):D5–12. doi: 10.1093/nar/gkl1031. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional file 1

TGFβ3 protein expression in the alleles of TGFβ3 knockout mice. Western blot analysis show that while homozygous (HM; −/−) palates did not show any expression TGFβ3 protein in the palate collected at E15.5 heterozygous (HT; +/−) palates expressed limited levels of the TGFβ3 protein while the wild-type (WT; +/+) palates expressed higher level of TGFβ3 protein.

Click here for file (390.4KB, tiff)
Additional file 2

Table listing RNA integrity numbers (RIN) to monitor quality and degradation of RNA samples.

Click here for file (13.3KB, xlsx)
Additional file 3

Table showing the number of reads per replicate of RNA samples.

Click here for file (14.3KB, xlsx)
Additional file 4

Validation of biological and technical replicates using CummeRbund. The read numbers per transcript between the biological replicates were compared. Since the read number per transcript (FPKM value) ranged from 0 to >10,000, the read number + 1 was transformed by log10. Each dot represents data from one transcript. There was a high correlation between the two sequencing results (r = 0.98, Pearson’s correlation; p < 0.001 two tail).

Click here for file (1.5MB, tiff)
Additional file 5

Venn diagrams of all significantly differentially expressed genes. Differential expression of transcripts within TGFβ3 alleles (TGFΒ3−/−, TGFΒ3+/−, and WT) were compared using Venn diagrams. Lists of differentially expressed genes with their EntrezIDs were uploaded to the R VennDiagram package [75], the query was run, and the detected numbers and gene lists were exported to MS Excel. Relationships among the lists of differentially expressed genes within TGFβ3 allelic mice are depicted as intersections or uniqueness and grouped with their corresponding gestational age (E14.5, E15.5, and E16.5). A significant overlap of differentially expressed genes was observed among the allelic variants at each gestational age. Venn diagrams were not drawn per scale.

Click here for file (684.7KB, tiff)
Additional file 6

Table listing the names of the genes significantly increased from E14.5 to E15.5 followed by downregulation in E16.5 and their.

Click here for file (251.2KB, xlsx)
Additional file 7

Venn diagram of cleft palate (CP) related genes acquired from OMIM and MGI Using the OMIM (Online Mendelian Inheritance in Man) database, we identified 280 genes and loci involved in genetic diseases associated with CP syndromes in humans. Several studies have developed knockout mouse models which exhibit CP [77]. We used the Mouse Genome Informatics (MGI) international database resource for laboratory mice to identify 128 genes (out of 223 genotypes) that cause CP in mice. The mouse genes were filtered based on gene names only, because some of the same genes were either studied by different labs or have splice variants, and as a result they were listed several times as genotypes in the MGI database. The human and mouse CP-associated genes were compared to find common genes, and the Homologene tool of NCBI [78] was used to compare and detect the orthologs of human and mouse CP related genes. We determined that 50 (out of 280) human CP related genes and loci were not represented, i.e. non-orthologs, within the mouse genome.

Click here for file (308.4KB, tiff)
Additional file 8

Ideogram of CP genes on chromosomes. A zoomable Circos image is drawn based on circular composition of CP genes on chromosomes. Data tracks appear inside and/or outside the circular layout. When the PDF file is magnified, tracks include FPKM values of CP genes as histograms and labeled from the outer circle to inner circle as “gene name, 14.5HM, 15.5HM, 15.5HM, 14.5HT, 15.5HT, 16.5HT, 14.5WT, 15.5WT, and 16.5WT”.

Click here for file (3.3MB, pdf)
Additional file 9

Table listing the all the CP genes sorted according to their pattern.

Click here for file (187.2KB, xlsx)
Additional file 10

Table showing differentially expressed molecules clustered into GO of cellular biofunctions.

Click here for file (13.9KB, xlsx)
Additional file 11

Stages of palatogenesis in the mouse. (A) Time-course of palate development in mice. (B-F) Scanning electron micrographs showing oral views of the secondary palate at representative developmental stages [reprinted from Kaufman (Kaufman, 1992) with permission]. Orange lines mark sites of fusion between the medial nasal processes and maxillary processes, white arrowheads point to initial outgrowths of the primary palate, white arrows point to the initial outgrowth of the secondary palatal shelves, red arrowheads mark the initial site of palatal adhesion and fusion, and the yellow arrowhead points to the gap between the primary and secondary palates that will disappear following fusion between these tissues. (G-U) Representative histological frontal sections from anterior (G-K), middle (L-P), and posterior (Q-U) regions of the developing palate at each indicated stage. The middle palate region is flanked by the developing upper molar tooth germs (black arrows in M-P) and corresponds to the palatine region of the future hard palate. The posterior palate region corresponds to the future soft palate. At E11.5 (G,L,Q), the palatal shelf outgrowths arise from the oral surface of the maxillary processes. At E13.5 (H,M,R), the palatal shelves exhibit distinct shapes along the AP axis. By E14.5 (I,N,S), the palatal shelves have elevated to the horizontal position. At ~ E15.0 (J,O,T), the palatal shelves make contact at the midline and initiate fusion by formation of the midline epithelial seam (MES) in the mid-anterior region (arrowhead in O). By E15.5 (K,P,U), palatal shelf fusion is evident in the middle and posterior regions, with complete removal of the MES (black arrowheads in P,U). Remnants of the MES can still be seen in the anterior region (K) at this stage and the palatal shelves also fuse superiorly with the nasal septum. Magnification is not equivalent between stages. MdbP, mandibular process; MNP, medial nasal process; MxP, maxillary process; NS, nasal septum; PP, primary palate; PS, palatal shelf; SP, secondary palate; T, tongue. Adapted from Bush J O and Jiang R. Development 139, 231–243 (2012).

Click here for file (3.9MB, tiff)

Articles from BMC Genomics are provided here courtesy of BMC

RESOURCES