Abstract
In adult tissue, stem and progenitor cells must tightly regulate the balance between proliferation and differentiation to sustain homeostasis. How this exquisite balance is achieved is an area of active investigation. Here, we show that epidermal genes, including ~30% of induced differentiation genes already contain stalled Pol II at the promoters in epidermal stem and progenitor cells which is then released into productive transcription elongation upon differentiation. Central to this process are SPT6 and PAF1 which are necessary for the elongation of these differentiation genes. Upon SPT6 or PAF1 depletion there is a loss of human skin differentiation and stratification. Unexpectedly, loss of SPT6 also causes the spontaneous transdifferentiation of epidermal cells into an intestinal-like phenotype due to the stalled transcription of the master regulator of epidermal fate P63. Our findings suggest that control of transcription elongation through SPT6 plays a prominent role in adult somatic tissue differentiation and the inhibition of alternative cell fate choices.
Subject terms: Transcription, Skin stem cells, Stem-cell differentiation
The role of RNA polymerase II (Pol II) recruitment during adult stem cell differentiation is well understood, but not that of Pol II elongation. Here, the authors show that 30% of epidermal stem cell differentiation genes depend on SPT6 and PAF1 for Pol II elongation, and SPT6 loss leads to an intestine-like phenotype.
Introduction
Stem and progenitor cells in the basal layer of the epidermis must maintain a tight balance between self-renewal and differentiation to achieve homeostasis1. Failure to properly differentiate is the basis for more than 100 human genetic skin diseases that can impact up to 20% of the population2. Thus, a tremendous amount of effort has been focused on elucidating the pathways and factors involved in the differentiation process. As basal layer stem and progenitor cells differentiate, they exit out of the cell cycle, detach from the basement membrane and progressively migrate upwards to form the spinous and then the granular layers of the epidermis. Terminal differentiation results in the formation of the stratum corneum which is essential for skin barrier function. Central to this process is the transcription factor P63 which is necessary for both epidermal growth and differentiation with mutations in this gene leading to a variety of skin genetic disorders such as Ectrodactyl-Ectodermal Dysplasia Clefting Syndrome (EEC)3–7. Work by our lab and others have also shown a prominent role for the transcription factors ZNF750, KLF4, MAF, MAFB, GRHL1, OVOL1, GRHL3, EHF, and CEBP alpha/beta in promoting differentiation8–16. The differentiation program can be controlled at the transcription initiation stage through the recruitment of RNA polymerase II (Pol II) to promoters by specific DNA binding transcription factors. However, there may be additional regulatory steps downstream of transcription initiation such as promoter proximal pausing/transcription elongation that could potentially be a major regulator of the differentiation process17.
Pol II pausing was first characterized in Drosophila heat shock genes and subsequently in the human MYC gene where there are much higher levels of Pol II near their promoters than gene bodies suggesting control at the elongation level18–20. Pol II can then escape pause and proceed into productive elongation upon the appropriate stimulus such as stress (heat shock genes) or serum (MYC). More recent genome-wide studies have found that promoter proximal pausing regulates developmental genes in both Drosophila embryos and murine and human embryonic stem cells (ESCs)21–24. Actively transcribed genes in ESCs were also subject to promoter-proximal pausing suggesting the importance of this mode of gene regulation21. It is currently unclear whether there is any role for control of transcription elongation in adult cell fate transitions such as during somatic tissue differentiation.
Pol II pauses after being recruited to gene promoters by transcription factors on a significant portion of metazoan genes25. It pauses within the first 20–60 bp downstream of the transcription start site (TSS) after synthesis of a short nascent RNA and remains paused until receiving further signals for elongation26. Promoter paused Pol II is enriched for serine 5 (Ser5) phosphorylation in its C terminal domain (CTD) and bound by the DRB sensitivity-inducing factor (DSIF: SPT4 and SPT5) and negative elongation factor (NELF)27,28. Productive elongation occurs when positive elongation factor b (P-TEFb: CDK9 and Cyclin T) phosphorylates DSIF, NELF, serine 2 (Ser2) of Pol II’s CTD29 and CTD linker30. Phosphorylated NELF dissociates from the complex which allows Ser2 Pol II, DSIF, PAF complex, and SPT6 to enter in the elongation phase of transcription30,31. Loss of NELF binding allows PAF association with Pol II, while CTD linker phosphorylation recruits SPT630. Association with SPT6 and PAF allows conformational changes in the DSIF clamps, which may drive the polymerase forward to promote elongation30. Elongation can also be enhanced by association with the super elongation complex (ELL1/2/3, EAF1/EAF2, AFF1/AFF4, AF9/ENL, and pTEFb), which is necessary for the expression of rapidly induced genes17. Upon promoter pause escape, the elongating Pol II must also overcome the nucleosome barrier through association with the histone chaperones FACT (SPT16 and SSRP1) and SPT632–34. FACT and SPT6 both have histone chaperone activity allowing them to facilitate the disassembly and reassembly of nucleosomes as Pol II passes through resulting in productive elongation.
SPT6 interacts with both histones H3 and H4 to disassemble and reassemble nucleosomes to allow passage of Pol II during elongation35,36. Spt6 has also been found to increase the elongation rate of Pol II on naked DNA and in Drosophila S2 cells37,38. It can also regulate the expression of lncRNAs through the redistribution of H3K36me3 marks39. Spt6 is necessary to promote murine ESC self-renewal by preventing the polycomb repressive complex from silencing super enhancers controlling the expression of pluripotency transcription factors40. Surprisingly, whether SPT6 has any role in adult stem cell homeostasis and lineage differentiation is unknown.
Here, we show that epidermal genes, including ~30% of induced differentiation genes already contain paused Pol II in undifferentiated, proliferating epidermal cells, which is released into productive elongation upon differentiation. Through a small RNAi screen of factors involved in transcription elongation, we identify SPT6 as being necessary to promote epidermal differentiation. SPT6 promotes the elongation of genes that encode transcription factors and structural proteins that are essential for the differentiation process. In the absence of SPT6, Pol II is stalled at promoter/TSS regions of differentiation genes which inhibits human skin stratification and differentiation. In the same genetic pathway as SPT6, PAF1 depletion blocks epidermal differentiation due to a loss of transcription elongation of differentiation genes. Unexpectedly, SPT6 loss not only prevents skin stratification but also results in the transdifferentiation of epidermal cells into an intestinal-like phenotype. This is due to Pol II pausing and subsequent loss of expression of the master regulator of epidermal fate, P63, upon SPT6 depletion. Importantly, ectopic expression of P63 in SPT6 depleted cells can block the expression of intestinal genes. Our results suggest that control of transcription elongation through SPT6 plays a prominent role in adult somatic cell fate choices.
Results
Promoter proximal pausing and subsequent transcription elongation regulates epidermal differentiation
To investigate whether Pol II is already bound to differentiation genes in epidermal stem and progenitor cells and how it changes upon induction of differentiation, we performed Pol II ChIP-Seq in primary human keratinocytes cultured in proliferation (subconfluent) and differentiation (confluent with 1.2 mM calcium for 3 days) conditions (Fig. 1a and Supplementary Data 1 and 2). MACS (model-based analysis of ChIP-Seq with cutoff q value of 0.05) was used to call significant peaks in both proliferation and differentiation conditions. There was similar Pol II binding across genic regions in both proliferation and differentiation conditions (Supplementary Fig. 1a). Increased accumulation of Pol II at promoter proximal regions relative to the gene body has previously been used to provide evidence that promoter pausing/transcription elongation is an important mode of gene regulation21–23,41. To characterize whether there is promoter proximal pausing, we calculated the travel ratio (TR) of the genes bound by Pol II in proliferation and differentiation conditions. TR is defined as the relative ratio between Pol II density in the gene body compared to the promoter proximal regions (Fig. 1b)21,41. A TR of 1 suggests the amount of elongating Pol II is the same as promoter paused Pol II, whereas a TR of 0.25 suggests that there is four times more Pol II paused than elongating. Thus, we have divided these genes into highly (TR ≤ 0.25), moderately (0.25 < TR < 1) and non-paused (TR ≥ 1), according to their TR values. We found that a total of 1617 genes (19.55%) were highly paused, 5275 genes (63.78%) were moderately paused, and 1377 genes (16.65%) were non-paused in proliferating cells (Fig. 1c). Interestingly, in differentiation conditions the percentage of highly paused genes dropped to 4.15% (496 genes), while non-paused genes increased to 31.74% (3789 genes) suggesting that Pol II pausing/transcription elongation may play a role in epidermal differentiation (Fig. 1c). To orthogonally validate our Pol II results, we used a previously published timecourse of Pol II binding during epidermal differentiation (days 0, 2, 4, and 7)42. Similar to our Pol II data, highly paused genes decreased during differentiation from 17.73% in undifferentiated cells down to 11.21% by day 4 of differentiation (Supplementary Fig. 1b). There was also an increase in non-paused genes by day 7 of differentiation (Supplementary Fig. 1b). To address whether Pol II pausing regulates epidermal differentiation, we analyzed (FDR < 0.05 and ≥2-fold change) a previously published RNA-Seq dataset comparing primary human keratinocytes cultured in proliferation and differentiation conditions using the same timepoints as our work12. In total, 2307 genes were upregulated during differentiation which were enriched for gene ontology (GO) terms such as epidermis development (Supplementary Fig. 1c, d and Supplementary Data 3). In total, 2129 genes were downregulated during differentiation with enrichment for proliferation related GO terms (Supplementary Fig. 1c, e and Supplementary Data 3). Of the 2307 upregulated epidermal differentiation genes, ~37% (850/2307) of those genes already had Pol II binding in proliferation conditions (Fig. 1d). The high prevalence of Pol II bound differentiation genes in progenitor cells is not due to their premature differentiation as early differentiation protein K1 is not detectable in proliferation conditions but stains the vast majority of differentiated cells (Supplementary Fig. 1f, g). Approximately 82% (694/850) of the differentiation genes that already had Pol II binding in progenitor cells were either highly paused or moderately paused, while a minority (18.35%) were not paused (Fig. 1d). The vast majority (>95%) of these genes had little to no expression with RPKM values of <50, suggesting that Pol II pausing prevents productive elongation of these genes (Supplementary Fig. 1h). We decided to focus on these differentiation genes that are paused in progenitor cells to see if they are released during differentiation. A plot of the 155 highly paused and the 539 moderately paused genes showed high levels of promoter proximal Pol II pausing in progenitor cells which is dramatically reduced upon differentiation (Fig. 1e, f). Importantly, the estimated cumulative distribution function (ECDF) TR of the 694 genes significantly increased (p value = 1.296e−10) upon differentiation suggesting that ~30% (694/2307) of the induced epidermal differentiation genes are highly to moderately paused in progenitor cells, which is then released into productive elongation upon differentiation (Fig. 1g).
A closer examination of individual differentiation genes such as GRHL1, ABCA12, and OVOL1 showed high levels of promoter proximal Pol II pausing with no or minimal levels of Pol II in the gene bodies of progenitor cells (Fig. 1h, i and Supplementary Fig. 1k). Upon differentiation, there is a dramatic increase in Pol II onto the gene bodies of these genes suggesting regulation on the level of promoter pausing/transcription elongation (Fig. 1h, i and Supplementary Fig. 1k). RNA-Seq tracks show these genes are only expressed upon differentiation (Fig. 1h, i and Supplementary Fig. 1k). The timecourse data for these genes also show paused Pol II in undifferentiated cells which led to progressively greater levels of elongation as differentiation proceeds (Supplementary Fig. 1i, j). GRHL1 and OVOL1 are both transcription factors that are critical for the differentiation process14,16. ABCA12 is necessary for the transport of lipids to form the barrier of the skin with mutations in the gene causing harlequin ichthyosis43. While ~37% (850/2307) of the induced epidermal differentiation genes already had Pol II binding in progenitor cells, the remaining ~63% (1457/2307) of the genes did not have binding (Fig. 1d). These differentiation genes include KRTDAP, KLK7, and S100A8 which had no Pol II binding “or mRNA expression” in progenitor cells but gained it upon differentiation (Supplementary Fig. 1l–n). These genes are regulated in the “traditional” manner in that Pol II is likely recruited to these genes only upon the induction of differentiation by transcription factors. To test the possibility that the paused differentiation genes were enriched for transcription factors that could potentially recruit Pol II to the remaining ~63% of induced differentiation genes, we took three approaches. First, we performed GO on the 694 paused and 1457 (without Pol II loaded) differentiation genes. The paused differentiation genes were enriched for “regulators of transcription and gene expression”, while the other 1457 differentiation genes did not enrich for those terms (Supplementary Fig. 2a, b). Second, we manually looked for transcription factors previously published to promote epidermal differentiation in both gene sets. Interestingly, there were 14 transcription factors found in the highly and moderately paused genes (694 genes) including GRHL1, CEBPG, MAFB, POU2F3, RORA, POU3F1, KLF4, MAF, FOSL2, DLX3, DLX5, CEBPB, KLF5, and OVOL1 (Supplementary Fig. 2c). There were only four transcription factors found in the induced differentiation genes (1457 genes) that did not have Pol II binding in proliferative cells (Supplementary Fig. 2d). Thus, the paused genes had a 3.5-fold enrichment (14 versus 4) in transcription factors known to be important for epidermal differentiation even though it had 50% less overall genes (694 versus 1457 genes). Third, analysis of the 1457 genes using Enrichr showed that the GRHL family, OVOL1, and KLF5 are co-expressed with these genes44 (Supplementary Fig. 2e). Collectively, these results suggest that promoter proximal pausing/transcription elongation is an important regulatory step during epidermal differentiation and that transcription factors regulated by pausing may turn on the rest of the differentiation genes without paused Pol II.
SPT6 is necessary to promote human epidermal differentiation
Since Pol II promoter pausing occurs at a significant number of differentiation genes in progenitor cells, this suggests that elongation or pause release factors may play a role in the differentiation process. To test this, we performed a small RNAi screen targeting these factors (SPT5, SPT16, SSRP1, SPT6, ELL, ELL2, AFF1, and AFF4) in differentiated human epidermal cells and achieved >75% knockdown of each gene (Fig. 2a). Knockdown efficiency was also checked on the protein level for a couple of the genes such as SPT5 and ELL that showed good depletion (Fig. 2b). SPT6 was the only one that blocked expression of differentiation genes TGM1 and GRHL1 (Fig. 2c,d). These results were validated using two distinct siRNAs targeting SPT6 (Fig. 2a, c, d). To determine the impacts of SPT6 loss in a tissue setting that allows faithful representation of the stratification and gene expression program of human epidermis, control (CTLi) and SPT6 knockdown (SPT6i) primary human keratinocytes were seeded onto devitalized human dermis to regenerate human skin45,46. Depletion of SPT6 blocked skin stratification, complete absence of stratum corneum formation, and loss of expression of late and early differentiation genes/proteins FLG, LOR, K10, HOPX, ABCA12, DSG3, and OVOL1 (Fig. 2e–j). Many of these differentiation genes are also paused in progenitor cells including GRHL1, ABCA12, DSG3, and OVOL1. The proliferative capacity of the basal layer was also diminished as shown by the loss of ki67 positive cells which resulted in a hypoplastic tissue (Fig. 2f–i). RNA-Seq was also performed on CTLi and SPT6i tissue to determine the gene expression program that SPT6 regulates. In total, 4275 genes (FDR < 0.01 and ≥2-fold change) were upregulated upon SPT6 knockdown which were enriched in GO terms such as transcription elongation (Fig. 2k, l and Supplementary Data 4). In total, 1465 genes were reduced in expression upon SPT6 loss which were enriched in genes involved in skin development and keratinocyte differentiation (Fig. 2k, m and Supplementary Data 4). Close to 30% of the genes overlapped between the SPT6 and the differentiation gene expression signature (p < 7.510e−31) supporting the critical role of SPT6 in promoting differentiation (Fig. 2n). The correlation was even stronger comparing the overlap between the SPT6 signature with either the paused differentiation (~44% overlap: 303/694) or the non Pol II loaded differentiation genes (~38% overlap:559/1457) (Supplementary Fig. 2f, g). This suggests that this second group of genes (1457 non-paused) may be regulated by the differentiation promoting transcription factors that are pause regulated (Supplementary Fig. 2).
SPT6 binds to induced epidermal differentiation genes and promotes their transcription elongation
To ascertain whether SPT6 directly regulated the expression of epidermal differentiation genes, we performed ChIP-Seq on SPT6 in differentiated cells to determine where it bound in the genome. SPT6 bound to 20,782 peaks with 83% of the bound sites mapping to genic regions (5′UTR, promoter, intron, exon, TTS, 3′UTR) (Fig. 3a and Supplementary Data 5). The 20,782 peaks mapped back to 6307 genes that were enriched for keratinocyte differentiation and epidermis development GO terms (Fig. 3b and Supplementary Data 5). SPT6 binding to differentiation genes suggests that it may regulate elongation. Supporting this, RNA Pol II’s binding profile mimicked SPT6’s at SPT6 bound regions (Fig. 3c). Similarly, regions of higher Pol II density also correlated with increased SPT6 binding with a Pearson coefficient of 0.88 (Fig. 3d and Supplementary Fig. 3a). SPT6 bound to ~35% of the genes upregulated during differentiation, while also binding to ~27% of the downregulated genes (Supplementary Fig. 3b). However, SPT6 binding to upregulated differentiation genes was much stronger than downregulated genes (Fig. 3e). Close to 40% (573/1465) of genes downregulated upon SPT6 loss were bound by SPT6 suggesting direct regulation (Fig. 3f). These 573 genes were also enriched for GO terms such as skin development and keratinocyte differentiation (Supplementary Fig. 3c). SPT6 also bound to genes increased upon SPT6 depletion but binding was significantly less than dowregulated genes (Fig. 3f, g). To validate the SPT6 binding data, SPT6 ChIP-QPCR was performed in CTLi and SPT6i cells placed in differentiation conditions. Depletion of SPT6 led to loss of SPT6 binding to differentiation genes such as ABCA12, TGM1, GRHL1, DSG3, and HOPX (Supplementary Fig. 3d). These results suggest that high levels of SPT6 binding to upregulated differentiation genes may be critical for their expression. To determine how SPT6 impacts Pol II binding, RNA Pol II ChIP-Seq was performed in CTLi and SPT6i cells cultured in differentiation conditions (Supplementary Data 6a, b). At SPT6 bound genes, loss of SPT6 resulted in loss of Pol II from the gene body and accumulation at promoter proximal regions (Fig. 3h). At genes where SPT6 does not bind, there was minimal change to the distribution of Pol II upon SPT6 knockdown demonstrating that SPT6 specifically impacts Pol II distribution at SPT6 bound genes (Fig. 3i). SPT6 knockdown also caused depletion of RNA Pol II from the gene body of the 694 paused differentiation genes and significantly decreased the TR of those genes (Fig. 3j, k). Loss of SPT6 at SPT6 bound differentiation genes such as ABCA12, GRHL1, and DSG3 resulted in loss of Pol II from their gene bodies and accumulation at promoter proximal regions (Fig. 3l, m and Supplementary Fig. 3e). These data suggest that SPT6 is necessary for the transcription elongation of its bound epidermal differentiation genes.
PAF1 promotes epidermal differentiation through control of transcription elongation
Since SPT6 and the PAF complex have been identified as being part of the core activated elongation complex in vitro, we wanted to determine if PAF performed the same function as SPT6 in a tissue setting. To do this, we knocked down PAF1 and regenerated human skin. Similar to SPT6 loss, PAF1 depletion blocked early and late differentiation genes/proteins K1, K10, FLG, LOR, HOPX, KRT80, DSG3, and LCE3D (Fig. 4a–c). PAF1 also binds to the same differentiation genes as SPT6 and is required for their elongation (Fig. 4d, e). These results suggest that PAF1 and SPT6 are necessary for differentiation by promoting the elongation of differentiation genes and validates the in vitro model of elongation in a tissue setting.
SPT6 maintains open chromatin regions for P63 binding sites and blocks chromatin accessibility for transcription factors specifying other lineages
Next, we wanted to determine if loss of transcription elongation on differentiation genes due to SPT6 depletion led to altered chromatin accessibility at those genes. SPT6 has been shown to promote H3K27 demethylation and thus knockdown of SPT6 could result in increased repressive chromatin (H3K27me3)47. To do this, CTLi and SPT6i differentiated cells were subjected to ATAC-Seq. The vast majority (83.52%) of the genome did not change in chromatin accessibility (Fig. 5a). In total, 10.74% of the genomic regions became less accessible upon SPT6 knockdown which were primarily located in intergenic (42.29%) and intronic (44.78%) regions (Fig. 5b and Supplementary Data 7a). In total, 5.73% of the genomic regions became more accessible upon SPT6 loss which were also located in intergenic (44.21%) and intronic (45.87%) regions (Fig. 5c and Supplementary Data 7b). Plotting the ATAC-Seq signal on the 1465 genes that were downregulated upon SPT6 knockdown showed that there was a slight increase of chromatin accessibility around the TSS of those genes (Fig. 5d). This further supports the idea that the genes dependent on SPT6 for expression is mediated through transcription elongation rather than a closure of chromatin. Interestingly, a transcription factor motif search at the 10.74% of the less accessible genomic regions showed P63 as the most significantly enriched factor suggesting that P63 binding sites are lost upon SPT6 depletion (Fig. 5e). Notably, the less accessible genomic regions were also enriched for FOSL2 and KLF5 binding sites suggesting that SPT6 may directly regulate the expression of these paused regulated transcription factors that are necessary for epidermal differentiation (Fig. 5e and Supplementary Fig. 2c). Plotting of the ATAC-Seq data on the 4275 genes that were upregulated upon SPT6 knockdown showed a large increase in chromatin accessibility at the TSS of those genes (Fig. 5f). A transcription factor motif search at the 5.73% of the more accessible genomic regions showed enrichment for factors such as ASCL1 and FOXA2 which are lineage-specific transcription factors that regulate neuronal and intestinal development (Fig. 5g). A look at transcription factors and structural proteins important for endoderm/intestinal fate such as FOXA1, HNF1A, EPCAM showed increased chromatin accessibility upon SPT6 depletion at regions proximal to these genes (Fig. 5h–j). These results suggest that SPT6 may potentially be regulating the transcription elongation of P63 and the other pause regulated transcription factors such as FOSL2 and KLF5. Thus, without SPT6, P63, and the other epidermal transcription factor gene expression may go down which would then lead to closure of their bound sites. The increase in chromatin accessibility at non-epidermal lineages may be a secondary effect of SPT6 regulation. SPT6 may directly regulate factors such as P63 and these transcription factors may in turn regulate non-epidermal genes.
SPT6 suppresses an intestinal fate by promoting the expression of P63
Since there was increased chromatin accessibility near genes important for specifying non-epidermal lineages, we asked if SPT6 knockdown triggers a different tissue fate. To test this, genes upregulated ≥10 fold (472 genes) upon SPT6 depletion were used to query the Human Gene Atlas and ARCHS4 Tissues database through Enrichr to determine which tissue or cell type the 472 genes correlated with the most44,48,49. The Human Gene Atlas contains RNA-Seq samples from 37 different types of normal tissue and ARCHS4 is a compilation of 84,863 human RNA-Seq datasets covering a broad range of cell types and tissues. The only significant tissue that matched with the 472 upregulated genes was colon from the Human Gene Atlas (Fig. 6a). Similarly, gastric epithelial cells were the top match from the ARCHS4 Tissues database and matches to colon was also significant in the same database (Fig. 6b). A comparison between CTLi and SPT6i tissue showed that these intestinal genes (identified from the ARCHS4 database) were highly upregulated upon SPT6 loss and includes genes such as HNF1A, MUC2, LGR5, VIL1, FOXA1, CDX2, CLDN3, AGR2, TSPAN1, EPCAM, and CDHR2 (Fig. 6c and Supplementary Fig. 4a). We also wanted to determine if the entire SPT6 gene expression signature matched with that of colon. We performed Pearson correlations of the SPT6 knockdown RNA-Seq with that of previously published RNA-Seq data derived from human brain, skeletal muscle, adipocyte, trachea, and colon50–52. The SPT6 signature was most similar to colon with an average Pearson coefficient of 0.7 followed by skeletal muscle (0.305) and low correlations with brain (0.085), adipocyte (0.125), and trachea (0.07) (Fig. 6d). The control human regenerated skin samples showed low correlations (0.29) with colon and SPT6 knockdown samples (0.44) (Fig. 6d). This suggests that loss of SPT6 caused the tissue to resemble more of a colon than skin phenotype. Next, we wanted to elucidate the mechanism behind this altered cell fate. P63 is a master regulator of epidermal fate with p63 knockout mouse embryos developing intestine-like metaplasia53. SPT6 bound to P63 and is required for its transcription elongation and mRNA expression (Fig. 6e and Supplementary Fig. 3d and Supplementary Data 4). Measurement of nascent P63 mRNA by pulsing control and SPT6 depleted cells with 5-ethynyl uridine also showed a significant decrease of P63 mRNA from the gene body (exon 1, exon 6, intron 8, and the last exon) in SPT6 knockdown cells (Fig. 6f). Loss of P63 mRNA from the gene body was accompanied by a large increase of the message accumulating near the TSS region suggesting that SPT6 is necessary for elongation of P63 transcripts (Fig. 6f). This resulted in loss of P63 on the protein level using in vitro Ca2+ induced differentiation and regenerated human tissue (Fig. 6g and Supplementary Fig. 4b). Intersection of the upregulated genes due to SPT6 knockdown (RNA-Seq) with genes bound by P63 [(P63 ChIP-Seq in differentiated primary human keratinocytes54] showed that 45.9% of the genes overlapped (Supplementary Fig. 4c). This suggests that p63 may act as a repressor on those genes. Supporting this, the chromatin accessibility increased at P63 bound intestinal genes such as HNF1A, FOXA1, CDHR2, and EPCAM upon SPT6 loss (Supplementary Fig. 4d–g). Furthermore, knockdown of SPT6 caused decreased P63 binding at those genes (Supplementary Fig. 4h). Loss of P63 in regenerated human tissue also resulted in the upregulation of the same set of intestinal genes as SPT6i including HNF1A, MUC2, LGR5, VIL1, FOXA1, CDX2, CLDN3, AGR2, TSPAN1, EPCAM, and CDHR2 (Supplementary Fig. 4i). This suggests that SPT6 may be promoting the expression of P63 to suppress the intestinal genes. If this were true, then ectopic expression of P63 should be able to block induction of intestinal gene expression in SPT6 knockdown tissue. Control epidermis, generated from keratinocytes treated with control siRNAs and retroviral expression of LACZ (CTLi + LACZ), had robust P63 and no detectable FOXA1 expression (Fig. 6h). FOXA1 has been shown to be one of three transcription factors necessary to reprogram murine embryonic fibroblasts into an intestinal fate55. Expression of P63 was lost in SPT6 (SPT6i + LACZ) or P63 (P63i + LACZ) knockdown tissue which was replaced by robust expression of FOXA1 (Fig. 6h). Importantly, retroviral expression of P63 suppressed FOXA1 expression as well as HNF1A, MUC2, LGR5, VIL1, FOXA1, CDX2, CLDN3, AGR2, TSPAN1, EPCAM, and CDHR2 mRNA expression in SPT6 depleted tissue but did not restore epidermal differentiation gene expression (Fig. 6h–j). These results suggest that SPT6 directly regulates epidermal differentiation genes through elongation and indirectly prevents an intestinal fate by promoting the expression of P63 and possibly other paused transcription factors.
To explore whether loss of SPT6 can lead to formation of intestinal epithelial structures, we cultured control and SPT6 knockdown keratinocytes as single cells in matrigel to form 3D organoids. The cells were grown in proliferation (promotes intestinal stem cell expansion) or differentiation media (promotes budding into crypt-like structures via differentiation of stem cells into mature crypt cells) in the presence of Wnt3a, R-spondin, and noggin. These culture conditions have previously been shown to promote the growth of intestinal stem cells into large spherical structures containing a polarized epithelium around a centered lumen with budding crypt-like domains55–58. In the case of control cells, ~90% of the colonies grew as large solid spheroids (without a lumen) regardless of whether they were grown in proliferation or differentiation medium (Fig. 7a, e, f). In contrast, the proportion of solid spheres was decreased to ~60% in proliferation and ~45% in differentiation media in SPT6 depleted cells (Fig. 7e, f). This drop in solid spheres was replaced by a concomitant increase in spheroid with lumen (single or multiple: Fig. 7c, right panel), budding (Fig. 7b, g; ~8% in proliferation media and ~15% in differentiation media), and apoptotic events (Fig. 7c, left panel). These phenotypes mirrored that seen in healthy human colon organoids derived from biopsies obtained during routine colonoscopy (Fig. 7d). Notably, no budding structures were detected in control cells (Fig. 7g). SPT6 depleted organoids also expressed intestinal genes such as HNF1A, FOXA1, LGR5, KRT20, MUC2, and CDHR2 (Fig. 7h). These data suggest that SPT6 loss leads to a morphologic and gene expression transformation that resembles an intestinal fate.
Discussion
Promoter proximal pausing has been prominently studied in Drosophila embryos and ESCs in the context of regulation of developmental genes21–24. Pol II may be paused on developmental genes to poise critical genes for rapid induction upon the appropriate signal during development59. Surprisingly, there is not much known about this process in adult stem and progenitor cell fate decisions. Here, we have shown that ~30% of induced epidermal differentiation genes already have paused Pol II binding in stem and progenitor cells. Pol II accumulated at the promoter proximal regions. In total, 155 of these genes were categorized as highly paused with a TR ≤ 0.25, while 539 genes were moderately paused. Upon induction of differentiation, there was a dramatic decrease in Pol II accumulation/pausing at the promoter proximal regions in the 694 differentiation genes which also resulted in a significant increase in their TRs. These important genes include DSG3 and ABCA12 where loss of function results in skin diseases such as pemphigus vulgaris and harlequin ichthyosis60,61. Notably, the paused genes were also enriched for transcription factors critical for the differentiation process such as OVOL1, GRHL1, KLF4, KLF5, MAF, FOSL2, and CEBPB9,10,13,15,16,62,63. This suggests that these pause regulated transcription factors may recruit Pol II to turn on the non-pause regulated differentiation genes.
If transcriptional elongation regulates ~30% of epidermal differentiation genes, then factors that regulate pause release/transcription elongation should be critical for the differentiation process. Through an RNAi screen of these factors, we identified SPT6 to be a critical regulator of epidermal differentiation. It is interesting to note that none of the other factors had any impacts on differentiation. This is in line with observations in ESCs where specific elongation/pause release factors govern distinct ESC behavior. For example, Paf1C and Phf5a are important for the elongation of genes coding for pluripotency transcription factors, while Ell3 is necessary for establishing Pol II at developmental genes in ESCs64–66. In addition, NELF is necessary for regulating cell cycle genes in ESCs suggesting that different elongation/pause release factors have different targets as well as functions even in the same cell type67. Thus, it is possible that the factors we screened may have other functions in the skin such as in promoting self-renewal. This will be an interesting area for future investigation.
Loss of SPT6 blocked early (KRT1, KRT10) and late (FLG, LOR) differentiation as well as stratification of the epidermis. Global gene expression profiling of SPT6 knockdown tissue showed that 1465 genes were downregulated that had GO terms related to skin development and differentiation. ChIP-Seq of SPT6 showed that SPT6 bound to 6307 genes that were enriched in epidermis development and keratinocyte differentiation GO terms. However, it should be noted that SPT6 does not exclusively bind differentiation genes as it also binds and regulates the expression of basal layer specific genes such as KRT5, P63, and KRT14 (Supplementary Data 4 and 5). It is currently unclear how SPT6 binds with such specificity to epidermal genes. It is possible that SPT6 is recruited by an early epidermal differentiation transcription factor to promote elongation. Growth promoting transcription factors such as YAP and C-MYC have been found to promote Pol II pause release instead of Pol II recruitment to target genes, suggesting that a transcription factor may potentially recruit SPT6 to promote transcriptional elongation of epidermal genes21,68. SPT6 bound to 34.5% of upregulated differentiation genes and nearly 40% of the genes with decreased expression upon SPT6 depletion supporting the idea that SPT6 is important for epidermal gene expression. SPT6 binding to these genes resembled the profile of Pol II suggesting it may act as an elongation factor. To demonstrate this, Pol II ChIP-Seq was used to determine the binding profile of Pol II in the absence of SPT6. On SPT6 bound genes, loss of SPT6 increased promoter proximal pausing of Pol II and reduced Pol II binding throughout the gene suggesting SPT6 promotes transcription elongation. Pol II binding to the gene bodies of the 694 paused differentiation genes was also decreased that correlated with a significant decrease in their TRs. In contrast, SPT6 knockdown had minimal impact on the RNA Pol II binding profile on non-SPT6 bound genes. Our results suggest that SPT6 binds to and is necessary for the transcriptional elongation of epidermal genes. The current model of transcription elongation elucidated by cryo-EM suggests that the complex of pTEFb, SPT6, and PAF is required for transcription elongation activation in vitro30. However, it is unclear whether this same mode of regulation occurs in a tissue setting. We have shown here that PAF1 and SPT6 work in the same genetic pathway where they are both required for epidermal differentiation. Additionally, PAF1 binds to the same genes as SPT6 and loss of PAF1 results in Pol II stalling at the TSS regions of differentiation genes. Our data suggest that SPT6 promotes elongation in conjunction with PAF1 to promote differentiation in a tissue setting that is in line with the in vitro model of elongation. It will be interesting to determine whether this same group of proteins is necessary for elongation in other organs of the body. This is important in light of our small RNAi screen where other major known elongation factors had no apparent impact on epidermal differentiation. This suggests that the elongation factors that have been primarily characterized in vitro and thought to work together in a complex may have differing and even opposite functions in tissue.
The next question we asked was whether a loss of transcription elongation impacted chromatin accessibility through ATAC-Seq. The vast majority of the genome had no change in accessibility upon SPT6 depletion whereas 5.73% gained, while 10.74% lost accessibility. Interestingly, there was a slight increase in the chromatin accessibility of the 1465 genes decreased upon SPT6 loss suggesting that loss of transcription elongation does not lead to a loss of chromatin accessibility. An examination of the 4275 genes upregulated upon SPT6 depletion showed an even greater increase in chromatin accessibility in their TSS. Moreover, a motif search of the regions that gained chromatin accessibility was surprisingly enriched for transcription factors motifs that specify other lineages such as the brain and gut. This suggested the possibility that SPT6 loss resulted in an altered cell fate. To test this, we took the 472 genes that are ≥10 fold upregulated in expression upon SPT6 depletion and found that they were enriched for colon and gastric epithelial cell signatures. In addition, pearson correlations of the entire gene expression signature of SPT6i cells showed high correlation (avg. 0.7) with RNA-Seq signatures derived from human colon but not skeletal muscle or brain. Since SPT6 promotes transcriptional elongation of epidermal genes, how does its loss lead to an intestinal-like phenotype. A motif search of the regions that lost chromatin accessibility showed the transcription factor P63 as the most significant. P63 is a transcription factor necessary for epidermal fate determination and part of SPT6’s loss of function phenotype may be mediated through this factor. Supporting this, SPT6 knockdown resulted in Pol II depletion from the gene body of P63 which led to loss of P63 expression on the mRNA and protein level. Knockdown of P63 also resulted in upregulation of intestinal genes such as FOXA1, CDX2, VIL1, CLDN3, AGR2, EPCAM, LGR5, and CDHR2. These results are consistent with observations that p63 null mouse embryos develop intestine-like metaplasia53. Another group also showed that in embryonic epithelia, p63 knockout prevented stratification which resulted in a single layer epithelium with expression of mesodermal/muscle genes69. More recently, Qu et al. used keratinocytes derived from EEC patients to show that epidermal cell identity was lost in cells with p63 DNA binding mutations70. These mutant cells also had upregulated expression of neuronal and mesenchymal genes. It is notable that SPT6 loss had the highest Pearson correlation with colon and followed by skeletal muscle suggesting regulation of P63 may be partially responsible for some of its phenotype. In support of this, over 45% of the genes upregulated upon SPT6 knockdown had P63 binding suggesting that P63 may be acting as a repressor on these genes. SPT6 knockdown also resulted in increased chromatin accessibility at P63 bound intestinal genes including HNF1A, FOXA1, EPCAM, and CDHR2. Furthermore, ectopic expression of P63 prevented the induction of intestinal genes but did not restore epidermal differentiation gene expression in SPT6 depleted tissue. It is also possible that other pause regulated transcription factors contribute to the intestinal-like phenotype since SPT6 knockdown also causes loss of their expression. These other transcription factors may also contribute to epidermal identity as well as suppressing other cell fates.
We have also shown here that SPT6 knockdown cells grown in matrigel in the presence of Wnt3a, R-spondin, and noggin can form structures that resemble intestinal organoids with the presence of centered lumen and rudimentary budding structures. However, it is unclear how similar and whether these cells behave like real intestinal organoids. It will be interesting in the future to determine whether organoids derived from SPT6 depleted human keratinocytes can rescue mice in colon injury models.
Here, we have found a prominent role for SPT6 in promoting epidermal differentiation through elongation. SPT6 directly promotes the elongation of structural differentiation genes as well as transcription factors necessary to turn on the rest of the differentiation program (differentiation genes without Pol II loading in undifferentiated cells). Lastly, SPT6 indirectly suppresses an intestinal cell fate through master regulators of epidermal fate such as P63. In summary, we have elucidated a critical pathway where regulation of promoter pausing/transcriptional elongation through SPT6 plays a major role in somatic cell fate decisions.
Methods
Primers and siRNA sequences
All primers and siRNA sequences can be found in Supplementary Data 8.
Primary cells
Primary human epidermal keratinocytes were derived from neonatal foreskin and cultured in EpiLife medium (ThermoFisher: MEPI500CA) supplemented with human keratinocyte growth supplement (ThermoFisher: S1001K) and pen/strep. Proliferating, non-differentiated keratinocytes were cultured in subconfluent conditions. To induce epidermal differentiation, primary human keratinocytes were plated at full confluence in the presence of 1.2 mM calcium for 3 days. Phoenix cells were cultured in DMEM with 10% fetal calf serum.
Organotypic cultures of human epidermis
For the organotypic skin cultures, one million control or SPT6 knockdown cells were seeded onto devitalized human dermis to regenerate human epidermis45,71. Human dermis was purchased from the New York Firefighters Skin Bank. Dermis seeded cells were raised to the air liquid interface to promote differentiation and stratification. Tissue was harvested 6 days after initial seeding. Half of each sample was embedded into OCT for sectioning and the other half was collected for RNA extraction.
siRNA transfection
siRNAs targeting human SPT6 (final concentration 10 nM) or control siRNAs were transfected into keratinocytes using Lipofectamine RNAiMAX (Thermo: 13778-500) reagent according to the manual and incubated for 18 h.
RNA isolation and RT-QPCR
Total RNA from cells or tissue was extracted using the GeneJET RNA purification kit (Thermo Scientific: K0732) and quantified using a Nanodrop. In total, 1 μg of total RNA was reversed transcribed using the Maxima cDNA synthesis kit. Quantitative PCR was performed using the BioRad LFX96 real-time system. L32 was used as internal control for normalization.
Western blotting
Twenty microgram of the cell lysates were used for immunoblotting and resolved on 10% SDS-PAGE and transferred to PVDF membranes. Primary antibodies used include beta-actin (Santa Cruz: Sc-47778) at 1:5000, ELL (Cell Signaling Technology: 14468) at 1:500, SPT5 (Bethyl: A300-869A) at 1:500, p63 (Abcam: ab124762) at 1:1000, and SPT6 (Bethyl: A300-801A) at 1:1000. Secondary antibodies including IRDye 800CW (LI-COR: 926-32212) donkey anti-mouse and IRDye 680RD (LI-COR: 926-68073) donkey anti-rabbit were used at 1:10000.
Histology and immunofluorescence
Cultured cells or tissue were fixed in 4% paraformaldehyde for 11 min followed by blocking in PBS with 2.5% normal goat serum, 0.3% triton X-100, and 2% bovine serum albumin for 30 min Primary antibodies used were P63 (Rabbit, Abcam: ab124762) at 1:1000, P63 (Mouse, Abcam: Ab735) at 1:100, FOXA1 (Cell Signaling: 53528S) at 1:1000, Loricrin (Abcam: Ab198994) at 1:1000, Filaggrin (Abcam: Ab3137) at 1:200, MKi67 (Abcam: Ab16667) at 1:300, Keratin 10 (Abcam: Ab9025) at 1:500, Keratin 1 (Biolegend: 905204) at 1:500 for 1 h. The secondary antibodies used were Alexa 555 conjugated goat anti-mouse IgG (Thermo: A11029) or Alexa 488 conjugated donkey anti-rabbit IgG (Thermo: A21206) both at 1:500. Nuclear dye, Hoechst 33342 (Thermo:H3570) was used at 1:1000.
RNA-sequencing (RNA-seq) and library preparation
One million control or SPT6i cells were placed on devitalized human dermis and harvested 6 days later. Three biological triplicates were obtained for both CTLi and SPT6i and total RNA was isolated using the GeneJET RNA (Thermo: K0732) purification kit and quantified by Nanodrop. RNA-seq was performed using the Illumina HiSeq 4000 machine at the Institute of Genomic Medicine core facility at UCSD. RNA-seq libraries were prepared with TruSeq RNA Library Prep Kit then multiplexed and ~40 million reads per sample were obtained.
Chromatin immunoprecipitation sequencing (ChIP-seq), library preparation, and ChIP-QPCR
Ten million cells and 5 μg of antibody were used for each antibody pulldown experiment for ChIP46,72,73. ChIP was performed using the following antibodies: SPT6 (Bethyl: A300-801A), RNA Pol II (Active Motif: 91151), PAF1 (Bethyl: A300-172A), p63 (Abcam: Ab735), Rabbit IgG (Millipore: 12-370), and mouse IgG (Abcam: Ab18413). Cells for the RNA Pol II ChIP-QPCR or ChIP-Seq were fixed at a final concentration of 1% formaldehyde. Cells for the SPT6 ChIP-QPCR or ChIP-Seq were fixed in both formaldehyde (1% final concentration, Thermo: 28908) and disuccinimidyl glutarate (2 mM final concentration, Thermo: 20593). QPCR results are represented as a percentage of input DNA.
For ChIP-Seq, the ChIP DNA library was prepared using the TruSeq DNA sample prep kit (Illumina). Sequencing was done on the HiSeq 4000 System (Illumina) using single 1 × 75 reads at the Institute for Genomic Medicine Core, UCSD.
ATAC-sequencing (ATAC-seq) and library preparation
One million keratinocytes were transfected with control siRNA or SPT6 siRNA 1 day before seeded on a 12-well plate (Wuxi, CELL NEST, China, Cat# 712001). After inducing differentiation for 3 days, the cells were washed briefly with phosphate-buffered saline (PBS) two times. Permeabilized nuclei were obtained by resuspending cells in 250 µl Nuclear Permeabilization Buffer [(10 mM Tris-HCl (pH 7.4), 10 mM NaCl, 3 mM MgCl2, 0.1% Tween-20, 0.1% IGEPAL-CA630, 0.01% digitonin in Molecular biology water], and incubating for 5 min on a rotator at 4 °C. Nuclei were then pelleted by centrifugation for 5 min at 500 × g at 4 °C. The pellet was resuspended in 25 µl ice-cold Tagmentation Buffer [33 mM Tris-acetate (pH = 7.8), 66 mM K-acetate, 11 mM Mg-acetate, 16% DMF in Molecular biology water]. An aliquot was then taken and counted by hemocytometer to determine nuclei concentration. Approximately 50,000 nuclei were resuspended in 10 µl ice-cold Tagmentation Buffer, and incubated with 1 µl Tagmentation enzyme at 37 °C for 30 min with shaking 500 rpm. The tagmentated DNA was purified using MinElute PCR purification kit.
The libraries were amplified using NEBNext High-Fidelity 2X PCR Master Mix with primer extension at 72 °C for 5 min, denaturation at 98 °C for 30 s, followed by eight cycles of denaturation at 98 °C for 10 s, annealing at 63 °C for 30 s, and extension at 72 °C for 60 s. Amplified libraries were then purified using MinElute PCR purification kit, and two size selection steps were performed using SPRIselect bead at 0.55X and 1.5X bead-to-sample volume rations, respectively. Sequencing was performed at the UC San Diego IGM Genomics Center on an Illumina NovaSeq 6000 using the 100 bp paired-end protocol. The ATAC-seq was performed in CTLi and SPT6i cells in duplicates.
Nascent P63 mRNA measurement
Control and SPT6 knockdown cells were differentiated for 3 days and pulsed with 5-ethynyl uridine (EU) for 4 h. After the pulse, nascent RNA labeled with EU is used in a copper catalyzed click reaction with azide-modified biotin. The EU-biotin labeled RNA is then purified and selectively isolated using streptavidin magnetic beads according to manufacturer’s protocol (Click-iT Nascent RNA Capture Kit: Invitrogen C10365). RT-QPCR was used to measure levels of nascent P63 mRNA.
Intestinal organoid culture
Control and SPT6 knockdown keratinocytes were cultured under intestinal organoid conditions56. Cells were seeded in Matrigel (Corning, 354234) domes at 5 × 104 cells per well in a 24-well plate (USA Scientific, CC7682-7524). The plate was inverted and incubated at 37 °C to allow for Matrigel polymerization. After 10 min, 500 ul of proliferation media (50% conditioned media prepared from L-WRN cells (ATCC, CRL-3276) containing Wnt3a, R-spondin, and noggin) or differentiation media (5% conditioned media) was added to each well57. Cells were maintained in a humidified 37 °C/5% CO2 incubator for 9–10 days. Media changes were performed every 2–3 days. Cells were monitored under a light microscope for evidence of organoid formation and crypt budding. Every 3 days, organoid lumen structures (centered, solid, multi, incomplete, budding, and apoptotic) were imaged and counted. The colon organoids were isolated from the colonic tissue specimens of healthy adult human subjects undergoing routine colonoscopy56,74. The isolation and biobanking of organoids from these biopsies was carried out using an approved human research protocol (IRB# 190105: PI Ghosh and Das) that covers human subject research at the UC San Diego HUMANOID Center of Research Excellence.
Quantification and statistical analysis
RNA-seq data processing
Reads were aligned to the GENCODE v19 transcriptome hg19 using TopHat2 with default settings75. Differential expression among samples was calculated using ANOVA from the Partek Genomic Suite (Partek Incorporated). Analysis of the read count distribution indicated that a threshold of ten reads per gene generally separated expressed from unexpressed genes, so all genes with fewer than ten reads were excluded from ANOVA analysis. Gene lists for significantly upregulated or downregulated genes were created using FDR < 0.05 and ≥2-fold change. Enriched GO terms for RNA-seq differentially expressed gene sets were identified using Enrichr44,76. Heatmaps for the RNA-seq data were generated using Partek’s Genomic Suite (http://www.partek.com/partek-genomics-suite/).
ChIP-seq data processing
The ChIP-seq reads were processed by the ENCODE Transcription Factor and Histone ChIP-Seq processing pipeline (https://github.com/ENCODE-DCC/chip-seq-pipeline2) on our local workstation. The reads were first trimmed based on quality score before alignment to reference hg19; Upon alignment and deduplication, the peak calling was then carried out by MACS 2.2.4 with a cutoff q value of 0.0577,78. The heatmaps for the ChIP-Seq data were generated using ngs.plot79. Gene tracks were visualized using UCSC genome browser along with annotation tracks.
ATAC-seq data processing
We deployed the ENCODE ATAC-seq pipeline ATAC-seq pipeline v1.7.0 (https://github.com/ENCODE-DCC/atac-seq-pipeline) on our local workstation for ATAC-seq peak calling. The raw reads were trimmed with cutadapt2.5 before aligned to the GRCh37 (hg19) using bowtie2.380. The alignment files were then deduplicated using picard2.20 prior to being converted to tag files and subjected to peaking calling using MACS 2.2.4. The overlapped peaks from different samples were then merged into a consensus set of peaks prior to being divided into evenly spaced 500 bp chromatin regions over that the aligned reads could be recounted for differential enrichment analysis using DiffBind2.1481. DEseq2 was invoked to perform the differential analysis for accessible peaks82. FDR of less or equal to 0.05 was used to correct for the multiple tests.
Calculation of travel ratio from RNA Pol II ChIP-Seq data
To compute the TR for each gene, the following steps were implemented: (1) we defined the TSS region as 50 bp upstream to 300 bp downstream of the TSS, and the gene body as 300 bp downstream of the TSS to 1000 bp downstream of the TES; (2) the average enrichment score over a genomic region was computed using UCSC genome browser utility bigWigAverageOverBed with the Pol II ChIP-Seq fold change signal track; (3) the TR was then calculated as the ratio between the average enrichment score over the gene body region and the score over TSS region. As gene with multiple transcripts would bring about multiple options in defining the corresponding genomic regions, we opted to use the transcript with the highest average signal over the entire gene in the ChIP-seq control sample (input) for simplicity. The TSS reference was taken from UCSC genome browser (GRCh37/hg19 version). Genes with a length smaller than 1 kb and overlapping with or less than 2 kb away from other genes were excluded for calculation to avoid enrichment double counting. The empirical cumulative distribution function (ECDF) plot was generated using log2 transformed TRs (averaged from replicates or triplicates) and plotted using R. Wilcoxon rank sum test was used to evaluate the difference between groups.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work was supported by grants from the National Institutes of Health (NIH R01AR066530 and R01CA225463) to G.L.S. This publication includes data generated at the UC San Diego IGM Genomics Center utilizing an Illumina NovaSeq 6000 that was purchased with funding from a National Institutes of Health SIG grant (#S10 OD026929). This publication includes data generated at the UC San Diego Center for Epigenomics. Work at the Center for Epigenomics was supported in part by the UC San Diego School of Medicine. This work was also supported by grants from NIH/NCATS (UG3TR003355 and UG3TR002968) to P.G. and S.D.
Source data
Author contributions
J.L. and G.L.S. conceived of the project, designed the experiments, and wrote the paper. J.L., M.T., Y.C., and V.B. performed the experiments. J.L. and X.X. performed the bioinformatics analysis. P.G., S.D., M.F., and J.L. performed the intestinal organoid experiments. J.L., G.L.S., X.X., M.F., V.B., P.T., S.D., and P.G. contributed to the editing and reading of the manuscript.
Data availability
The datasets generated from this study including RNA-Seq, ChIP-Seq, and ATAC-Seq data has been deposited in GEO (GSE153129). Source data are provided with this paper.
Code availability
The script for calculating travel ratio can be found at github (https://github.com/maxabruzzi/travelRatio) and DOI of 10.5281/zenodo.4383281.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks Idan Cohen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-21067-w.
References
- 1.Blanpain C, Fuchs E. Epidermal homeostasis: a balancing act of stem cells in the skin. Nat. Rev. Mol. Cell Biol. 2009;10:207–217. doi: 10.1038/nrm2636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lopez-Pajares V, Yan K, Zarnegar BJ, Jameson KL, Khavari PA. Genetic pathways in disorders of epidermal differentiation. Trends Genet. 2013;29:31–40. doi: 10.1016/j.tig.2012.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Brunner HG, Hamel BC, Van Bokhoven H. The p63 gene in EEC and other syndromes. J. Med. Genet. 2002;39:377–381. doi: 10.1136/jmg.39.6.377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Truong AB, Kretz M, Ridky TW, Kimmel R, Khavari PA. p63 regulates proliferation and differentiation of developmentally mature keratinocytes. Genes Dev. 2006;20:3185–3197. doi: 10.1101/gad.1463206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Celli J, et al. Heterozygous germline mutations in the p53 homolog p63 are the cause of EEC syndrome. Cell. 1999;99:143–153. doi: 10.1016/S0092-8674(00)81646-3. [DOI] [PubMed] [Google Scholar]
- 6.Mills AA, et al. p63 is a p53 homologue required for limb and epidermal morphogenesis. Nature. 1999;398:708–713. doi: 10.1038/19531. [DOI] [PubMed] [Google Scholar]
- 7.Yang A, et al. p63 is essential for regenerative proliferation in limb, craniofacial and epithelial development. Nature. 1999;398:714–718. doi: 10.1038/19539. [DOI] [PubMed] [Google Scholar]
- 8.Sen GL, et al. ZNF750 is a p63 target gene that induces KLF4 to drive terminal epidermal differentiation. Dev. Cell. 2012;22:669–677. doi: 10.1016/j.devcel.2011.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lopez-Pajares V, et al. A LncRNA-MAF:MAFB transcription factor network regulates epidermal differentiation. Dev. Cell. 2015;32:693–706. doi: 10.1016/j.devcel.2015.01.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Segre JA, Bauer C, Fuchs E. Klf4 is a transcription factor required for establishing the barrier function of the skin. Nat. Genet. 1999;22:356–360. doi: 10.1038/11926. [DOI] [PubMed] [Google Scholar]
- 11.Ting SB, et al. A homolog of Drosophila grainy head is essential for epidermal integrity in mice. Science. 2005;308:411–413. doi: 10.1126/science.1107511. [DOI] [PubMed] [Google Scholar]
- 12.Rubin AJ, et al. Lineage-specific dynamic and pre-established enhancer-promoter contacts cooperate in terminal differentiation. Nat. Genet. 2017;49:1522–1528. doi: 10.1038/ng.3935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lopez RG, et al. C/EBPalpha and beta couple interfollicular keratinocyte proliferation arrest to commitment and terminal differentiation. Nat. Cell Biol. 2009;11:1181–1190. doi: 10.1038/ncb1960. [DOI] [PubMed] [Google Scholar]
- 14.Mlacki M, Darido C, Jane SM, Wilanowski T. Loss of Grainy head-like 1 is associated with disruption of the epidermal barrier and squamous cell carcinoma of the skin. PLoS ONE. 2014;9:e89247. doi: 10.1371/journal.pone.0089247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wilanowski T, et al. Perturbed desmosomal cadherin expression in grainy head-like 1-null mice. EMBO J. 2008;27:886–897. doi: 10.1038/emboj.2008.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lee B, et al. Transcriptional mechanisms link epithelial plasticity to adhesion and differentiation of epidermal progenitor cells. Dev. Cell. 2014;29:47–58. doi: 10.1016/j.devcel.2014.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Chen FX, Smith ER, Shilatifard A. Born to run: control of transcription elongation by RNA polymerase II. Nat. Rev. Mol. Cell Biol. 2018;19:464–478. doi: 10.1038/s41580-018-0010-5. [DOI] [PubMed] [Google Scholar]
- 18.Gilmour DS, Lis JT. RNA polymerase II interacts with the promoter region of the noninduced hsp70 gene in Drosophila melanogaster cells. Mol. Cell Biol. 1986;6:3984–3989. doi: 10.1128/MCB.6.11.3984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rougvie AE, Lis JT. The RNA polymerase II molecule at the 5’ end of the uninduced hsp70 gene of D. melanogaster is transcriptionally engaged. Cell. 1988;54:795–804. doi: 10.1016/S0092-8674(88)91087-2. [DOI] [PubMed] [Google Scholar]
- 20.Krumm A, Meulia T, Brunvand M, Groudine M. The block to transcriptional elongation within the human c-myc gene is determined in the promoter-proximal region. Genes Dev. 1992;6:2201–2213. doi: 10.1101/gad.6.11.2201. [DOI] [PubMed] [Google Scholar]
- 21.Rahl PB, et al. c-Myc regulates transcriptional pause release. Cell. 2010;141:432–445. doi: 10.1016/j.cell.2010.03.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Muse GW, et al. RNA polymerase is poised for activation across the genome. Nat. Genet. 2007;39:1507–1511. doi: 10.1038/ng.2007.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zeitlinger J, et al. RNA polymerase stalling at developmental control genes in the Drosophila melanogaster embryo. Nat. Genet. 2007;39:1512–1516. doi: 10.1038/ng.2007.26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Guenther MG, Levine SS, Boyer LA, Jaenisch R, Young RA. A chromatin landmark and transcription initiation at most promoters in human cells. Cell. 2007;130:77–88. doi: 10.1016/j.cell.2007.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Liu X, Kraus WL, Bai X. Ready, pause, go: regulation of RNA polymerase II pausing and release by cellular signaling pathways. Trends Biochem. Sci. 2015;40:516–525. doi: 10.1016/j.tibs.2015.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rasmussen EB, Lis JT. In vivo transcriptional pausing and cap formation on three Drosophila heat shock genes. Proc. Natl Acad. Sci. USA. 1993;90:7923–7927. doi: 10.1073/pnas.90.17.7923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Smith E, Shilatifard A. Transcriptional elongation checkpoint control in development and disease. Genes Dev. 2013;27:1079–1088. doi: 10.1101/gad.215137.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Vos SM, Farnung L, Urlaub H, Cramer P. Structure of paused transcription complex Pol II-DSIF-NELF. Nature. 2018;560:601–606. doi: 10.1038/s41586-018-0442-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Saunders A, Core LJ, Lis JT. Breaking barriers to transcription elongation. Nat. Rev. Mol. Cell Biol. 2006;7:557–567. doi: 10.1038/nrm1981. [DOI] [PubMed] [Google Scholar]
- 30.Vos SM, et al. Structure of activated transcription complex Pol II-DSIF-PAF-SPT6. Nature. 2018;560:607–612. doi: 10.1038/s41586-018-0440-4. [DOI] [PubMed] [Google Scholar]
- 31.Hsin JP, Manley JL. The RNA polymerase II CTD coordinates transcription and RNA processing. Genes Dev. 2012;26:2119–2137. doi: 10.1101/gad.200303.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hondele M, et al. Structural basis of histone H2A-H2B recognition by the essential chaperone FACT. Nature. 2013;499:111–114. doi: 10.1038/nature12242. [DOI] [PubMed] [Google Scholar]
- 33.Kemble DJ, McCullough LL, Whitby FG, Formosa T, Hill CP. FACT disrupts nucleosome structure by binding H2A-H2B with conserved peptide motifs. Mol. Cell. 2015;60:294–306. doi: 10.1016/j.molcel.2015.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Teves SS, Weber CM, Henikoff S. Transcribing through the nucleosome. Trends Biochem. Sci. 2014;39:577–586. doi: 10.1016/j.tibs.2014.10.004. [DOI] [PubMed] [Google Scholar]
- 35.Kwak H, Lis JT. Control of transcriptional elongation. Annu. Rev. Genet. 2013;47:483–508. doi: 10.1146/annurev-genet-110711-155440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bortvin A, Winston F. Evidence that Spt6p controls chromatin structure by a direct interaction with histones. Science. 1996;272:1473–1476. doi: 10.1126/science.272.5267.1473. [DOI] [PubMed] [Google Scholar]
- 37.Ardehali MB, et al. Spt6 enhances the elongation rate of RNA polymerase II in vivo. EMBO J. 2009;28:1067–1077. doi: 10.1038/emboj.2009.56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Endoh M, et al. Human Spt6 stimulates transcription elongation by RNA polymerase II in vitro. Mol. Cell Biol. 2004;24:3324–3336. doi: 10.1128/MCB.24.8.3324-3336.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Nojima T, et al. Deregulated expression of mammalian lncRNA through loss of SPT6 Induces R-Loop formation, replication stress, and cellular senescence. Mol. Cell. 2018;72:970–984 e977. doi: 10.1016/j.molcel.2018.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Wang AH, et al. The elongation factor Spt6 maintains ESC pluripotency by controlling super-enhancers and counteracting polycomb proteins. Mol. Cell. 2017;68:398–413 e396. doi: 10.1016/j.molcel.2017.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Day DS, et al. Comprehensive analysis of promoter-proximal RNA polymerase II pausing across mammalian cell types. Genome Biol. 2016;17:120. doi: 10.1186/s13059-016-0984-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kouwenhoven EN, et al. Transcription factor p63 bookmarks and regulates dynamic enhancers during epidermal differentiation. EMBO Rep. 2015;16:863–878. doi: 10.15252/embr.201439941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Akiyama M. ABCA12 mutations and autosomal recessive congenital ichthyosis: a review of genotype/phenotype correlations and of pathogenetic concepts. Hum. Mutat. 2010;31:1090–1096. doi: 10.1002/humu.21326. [DOI] [PubMed] [Google Scholar]
- 44.Kuleshov MV, et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44:W90–W97. doi: 10.1093/nar/gkw377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Wang Y, Arribas-Layton M, Chen Y, Lykke-Andersen J, Sen GL. DDX6 orchestrates mammalian progenitor function through the mRNA degradation and translation pathways. Mol. Cell. 2015;60:118–130. doi: 10.1016/j.molcel.2015.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Li J, et al. HNRNPK maintains epidermal progenitor function through transcription of proliferation genes and degrading differentiation promoting mRNAs. Nat. Commun. 2019;10:4198. doi: 10.1038/s41467-019-12238-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wang AH, et al. The histone chaperone Spt6 coordinates histone H3K27 demethylation and myogenesis. EMBO J. 2013;32:1075–1086. doi: 10.1038/emboj.2013.54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Uhlen, M. et al. A pathology atlas of the human cancer transcriptome. Science35710.1126/science.aan2507 (2017). [DOI] [PubMed]
- 49.Lachmann A, et al. Massive mining of publicly available RNA-seq data from human and mouse. Nat. Commun. 2018;9:1366. doi: 10.1038/s41467-018-03751-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Suntsova M, et al. Atlas of RNA sequencing profiles for normal human tissues. Sci. Data. 2019;6:36. doi: 10.1038/s41597-019-0043-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Consortium EP, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Divoux A, et al. Fat distribution in women is associated with depot-specific transcriptomic signatures and chromatin structure. J. Endocr. Soc. 2020;4:bvaa042. doi: 10.1210/jendso/bvaa042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wang X, et al. Residual embryonic cells as precursors of a Barrett’s-like metaplasia. Cell. 2011;145:1023–1035. doi: 10.1016/j.cell.2011.05.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Oti M, Kouwenhoven EN, Zhou H. Genome-wide p63-regulated gene expression in differentiating epidermal keratinocytes. Genom. Data. 2015;5:159–163. doi: 10.1016/j.gdata.2015.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Morris SA, et al. Dissecting engineered cell types and enhancing cell fate conversion via CellNet. Cell. 2014;158:889–902. doi: 10.1016/j.cell.2014.07.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Ghosh, P. et al. The stress polarity signaling (SPS) pathway serves as a marker and a target in the leaky gut barrier: implications in aging and cancer. Life Sci. Alliance310.26508/lsa.201900481 (2020). [DOI] [PMC free article] [PubMed]
- 57.Miyoshi H, Stappenbeck TS. In vitro expansion and genetic modification of gastrointestinal stem cells in spheroid culture. Nat. Protoc. 2013;8:2471–2482. doi: 10.1038/nprot.2013.153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Miura S, Suzuki A. Generation of mouse and human organoid-forming intestinal progenitor cells by direct lineage reprogramming. Cell Stem Cell. 2017;21:456–471 e455. doi: 10.1016/j.stem.2017.08.020. [DOI] [PubMed] [Google Scholar]
- 59.Levine M. Paused RNA polymerase II as a developmental checkpoint. Cell. 2011;145:502–511. doi: 10.1016/j.cell.2011.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Schmidt E, Kasperkiewicz M, Joly P. Pemphigus. Lancet. 2019;394:882–894. doi: 10.1016/S0140-6736(19)31778-7. [DOI] [PubMed] [Google Scholar]
- 61.Akiyama M, et al. Mutations in lipid transporter ABCA12 in harlequin ichthyosis and functional recovery by corrective gene transfer. J. Clin. Invest. 2005;115:1777–1784. doi: 10.1172/JCI24834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Sur I, Rozell B, Jaks V, Bergstrom A, Toftgard R. Epidermal and craniofacial defects in mice overexpressing Klf5 in the basal layer of the epidermis. J. Cell Sci. 2006;119:3593–3601. doi: 10.1242/jcs.03070. [DOI] [PubMed] [Google Scholar]
- 63.Wurm S, et al. Terminal epidermal differentiation is regulated by the interaction of Fra-2/AP-1 with Ezh2 and ERK1/2. Genes Dev. 2015;29:144–156. doi: 10.1101/gad.249748.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Ding L, et al. A genome-scale RNAi screen for Oct4 modulators defines a role of the Paf1 complex for embryonic stem cell identity. Cell Stem Cell. 2009;4:403–415. doi: 10.1016/j.stem.2009.03.009. [DOI] [PubMed] [Google Scholar]
- 65.Strikoudis A, et al. Regulation of transcriptional elongation in pluripotency and cell differentiation by the PHD-finger protein Phf5a. Nat. Cell Biol. 2016;18:1127–1138. doi: 10.1038/ncb3424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Lin C, Garruss AS, Luo Z, Guo F, Shilatifard A. The RNA Pol II elongation factor Ell3 marks enhancers in ES cells and primes future gene activation. Cell. 2013;152:144–156. doi: 10.1016/j.cell.2012.12.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Williams LH, et al. Pausing of RNA polymerase II regulates mammalian developmental potential through control of signaling networks. Mol. Cell. 2015;58:311–322. doi: 10.1016/j.molcel.2015.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Galli GG, et al. YAP drives growth by controlling transcriptional pause release from dynamic enhancers. Mol. Cell. 2015;60:328–337. doi: 10.1016/j.molcel.2015.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Shalom-Feuerstein R, et al. DeltaNp63 is an ectodermal gatekeeper of epidermal morphogenesis. Cell Death Differ. 2011;18:887–896. doi: 10.1038/cdd.2010.159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Qu J, et al. Mutant p63 affects epidermal cell identity through rewiring the enhancer landscape. Cell Rep. 2018;25:3490–3503 e3494. doi: 10.1016/j.celrep.2018.11.039. [DOI] [PubMed] [Google Scholar]
- 71.Li, J. & Sen, G. L. Generation of genetically modified organotypic skin cultures using devitalized human dermis. J. Vis. Exp., e53280 10.3791/53280 (2015). [DOI] [PMC free article] [PubMed]
- 72.Li, J. et al. TEAD1 and TEAD3 play redundant roles in the regulation of human epidermal proliferation. J. Invest. Dermatol. 10.1016/j.jid.2020.01.029 (2020). [DOI] [PMC free article] [PubMed]
- 73.Jones, J. et al. BRD4 is necessary for differentiation downstream of epidermal lineage-determining transcription factors. J. Invest. Dermatol.10.1016/j.jid.2020.01.030 (2020). [DOI] [PMC free article] [PubMed]
- 74.Sayed, I. M. et al. Host engulfment pathway controls inflammation in inflammatory bowel disease. FEBS J.10.1111/febs.15236 (2020). [DOI] [PMC free article] [PubMed]
- 75.Kim D, et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Chen EY, et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinforma. 2013;14:128. doi: 10.1186/1471-2105-14-128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Liu T. Use model-based Analysis of ChIP-Seq (MACS) to analyze short reads generated by sequencing protein-DNA interactions in embryonic stem cells. Methods Mol. Biol. 2014;1150:81–95. doi: 10.1007/978-1-4939-0512-6_4. [DOI] [PubMed] [Google Scholar]
- 78.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Shen L, Shao N, Liu X, Nestler E. ngs.plot: quick mining and visualization of next-generation sequencing data by integrating genomic databases. BMC Genom. 2014;15:284. doi: 10.1186/1471-2164-15-284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Ross-Innes CS, et al. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature. 2012;481:389–393. doi: 10.1038/nature10730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated from this study including RNA-Seq, ChIP-Seq, and ATAC-Seq data has been deposited in GEO (GSE153129). Source data are provided with this paper.
The script for calculating travel ratio can be found at github (https://github.com/maxabruzzi/travelRatio) and DOI of 10.5281/zenodo.4383281.