Abstract
Until now the essential transcription factor that determines the epithelial phenotype of breast cancer has not been identified and its role in epithelial-to-mesenchymal transition (EMT) and tumor progression remain unclear. Here, by analyzing large expression profiles of human breast cancer cells, we found an extraordinary correlation between the expression of Grainyhead transcription factor Grhl2 and epithelial marker E-cadherin. Knockdown of Grhl2 expression by shRNA in human mammary epithelial cell MCF10A leads to down-regulation of E-cadherin and EMT. Grhl2 is down-regulated in disseminated cancer cells that have undergone EMT, and over-expression of Grhl2 is sufficient to induce epithelial gene expression. Large clinical datasets reveal that expression of Grhl2 is significantly associated with poor relapse free survival and increased risk of metastasis in breast cancer patients. In mouse models, over-expression of Grhl2 significantly promotes tumor growth and metastasis. Further testing of several Grhl2 regulated genes leads to the same conclusions that the tumorigenic and metastatic potentials of tumor cells are linked to epithelial phenotype but not mesenchymal phenotype. In conclusion, our findings indicate that Grhl2 plays an essential role in the determination of epithelial phenotype of breast cancers, EMT and tumor progression.
Introduction
Epithelial-to-mesenchymal transition (EMT) has been demonstrated to play a critical role during tumor metastasis [1], which generates cells with migratory and invasive properties that are able to disseminate to distant organs [1], [2]. The molecular mechanisms of EMT during tumor progression are not fully understood. EMT is defined as a series of changes, including cell morphology transformation from cobble-stone like epithelial to spindle fibroblastic-like morphology, loss of epithelial markers (E-cadherin) with concurrent gaining of mesenchymal markers (vimentin and N-cadherin), disruption of cell-cell contacts and loss of epithelial integrity with acquisition of migratory and invasive ability [3]. During the EMT process, transcription factors play crucial roles. Many signaling pathways that trigger cancer cells to undergo EMT converge at a group of transcription factors, including Snai1, Snai2, Zeb1, Zeb2, and Twist1 [4], [5]. These transcription factors suppress E-cadherin and other epithelial specific genes directly or indirectly.
Here, we postulate that there is an epithelial specific transcription factor that determines epithelial phenotype of breast cancer cells and loss of its expression during tumor progression contributes to loss of the epithelial phenotype and leads to epithelial-to-mesenchymal transition. In our previous study [6], we found approximately 3 weeks after 4T1 cells were injected into mammary fat pads of syngeneic mice, the 4T1 cells could be recovered from the lung. These lung recovered 4T1 cells had undergone EMT. Among the genes down-regulated significantly in these disseminated 4T1 cells, Grainyhead transcription factor, Grhl2, became the most attractive candidate. Grhl2 belongs to the Grainyhead transcription factor family that plays an evolutionarily conserved role in regulating epithelial cell differentiation from Drosophila to mammals [7]–[9]. In this study, we demonstrate that Grhl2 is the epithelial specific transcription factor that determines the epithelial phenotype of breast cancers, and plays a critical role in tumor progression.
Results
Grhl2 is expressed specifically in epithelial cells and co-regulated tightly with E-cadherin in human breast cancers
4T1 breast tumor is a classic mouse model for studying the molecular mechanisms of metastasis. After injection into the mammary fat pad, 4T1 tumors spontaneously metastasize in a short period to distant organs, such as lung. 4T1 cancer cells can be recovered from lung about three weeks after injecting the tumor. These lung recovered 4T1 cells had a spindle-like mesenchymal morphology (Figure S1A). Further analyses revealed that these lung recovered cells had a loss of E-cadherin expression and other epithelial specific genes (Figure S1B) [6], indicating that these disseminated cancer cells had undergone epithelial-to-mesenchymal transition (EMT). Grhl2 was one of the genes that were down-regulated significantly in 4T1 cells that had undergone EMT (Figure S1C, S1D). Grhl2 plays an evolutionarily conserved role in regulating epithelial cell differentiation [9], [10], suggesting Grhl2 could be the epithelial specific transcription factor.
To investigate if Grhl2 is associated with the epithelial phenotype in human breast cancers, we analyzed two microarray datasets of human breast cancer cell lines. We used the PAM305 breast cancer subtype gene classifier [11] to identify molecular subtype. We used a molecular definition of epithelial cells (Cdh1, Cldn3, Cldn4, Cldn7, Tjp2, Jup and CD24) and mesenchymal cells (Cdh2, Vim and Zeb1) to distinguish epithelial cells and mesenchymal cells [3], [12], [13].
Dataset GSE12777 contained microarray data of 51 human breast cancer cells [14]. Hierarchical clustering (HCL) of cells based on transcription levels of the epithelial and mesenchymal markers readily separated these cells into two groups: an epithelial group and a mesenchymal group (Figure 1A). We performed two-class significance analysis of microarrays (SAM) to identify epithelial specific genes (false discovery rate <5%). Grhl2 was specifically expressed in epithelial cells with an extraordinarily low p-value (p = 1.27558E-07, two tailed student t tests), suggesting a strong association with epithelial phenotypes. In addition, we calculated standard Pearson correlation coefficients of all transcripts (54675 transcripts) on the microarray with the epithelial marker E-cadherin, or with the mesechymal marker vimentin. Grhl2 was one of a few genes that was over-expressed in epithelial cells and showed a strong correlation with E-cadherin (r = 0.53) (Figure 1B), and negative correlation with vimentin (r = −0.57) (Figure S2A). Furthermore, among the 3000 transcripts that were annotated as transcription factors, DNA binding proteins or RNA binding proteins, Grhl2 was one of few that displayed the strongest correlation with E-cahderin and the highest expression ratio between epithelial cells and mesenchymal cells (Figure S2B). Using the PAM305 breast cancer subtype gene classifier [11], these cell lines can be separated into three groups, a luminal group expressing luminal cell markers, a basal A group expressing myoepithelial cell markers, and a basal B group expressing mesenchymal markers. Grhl2 is expressed in both luminal group and basal A group, but not basal B group (Figure S2C). Dataset GSE16795 contains microarray data of 39 human breast cancer cell lines [15]. Results from Dataset GSE16795 also revealed that Grhl2 was specifically expressed in epithelial cells (p = 3.9E-8, two tailed student t tests) (Figure S2D). Above all, these data indicate that Grhl2 is specifically expressed in epithelial cells and tightly correlated with E-cadherin in human breast cancer cells.
To confirm the microarray data, RT-PCR was performed on some breast cancer cells. Results from RT-PCR clearly showed that transcripts of Grhl2 were abundant in epithelial cells, including MCF10A, MCF7, and T-47D, but was undetectable in mesenchymal breast cells, including MDA-MB-231 and Hs578T (Figure 1C). The RT-PCR data was further confirmed by western blot analysis (Figure 1D).
To further confirm the results, we analyzed the Grhl2 promoter activities in human epithelial and mesenchymal breast cell lines by using a luciferase reporter. Examination of the genome location of mouse Grhl2 in the vicinity of the transcription start site using the UCSC genome browser indicates conserved CpG islands lying within this region (Figure 1E, top panel), suggesting a promoter region. In addition, this region is highly conserved across human, mouse and rat. The DNA sequence of mouse in this region (−183/+320) shares 94% sequence identity with the human sequence, suggesting that regulation of Grhl2 in human and mouse is probably conserved also. We cloned the proximal promoter of mouse Grhl2 and placed it upstream of a firefly luciferase reporter gene in a pGL3 basic vector. Grhl2 promoter driving luciferase activities was detected in MCF10A cells but not in MDA-MB-231 cells (p = 0.000136, two-tailed student t tests) (Figure 1E, bottom panel). As a control, E-cadherin promoter driving luciferase activity was assayed in these two cell lines also. E-cadherin promoter driving luciferase activities were detected only in MCF10A cells, but not in MDA-MB-231 cells (p = 0.013, two-tailed student t tests) (Figure 1E, right bottom panel). In conclusion, our data generated by analyzing human breast cancer microarray data, RT-PCR, and promoter driving luciferase reporters indicate that Grhl2 is specifically expressed in epithelial cells and tightly co-regulated with E-cadherin, suggesting that Grhl2 is the essential transcription factor that determines the epithelial phenotype of breast cancer cells.
Grhl2 determines the epithelial phenotype of mammary epithelial cells
Next, we tested if Grhl2 is the essential transcription factor that determines the epithelial phenotype of mammary epithelial cells. shRNA vectors against firefly luciferase (as a control) and Grhl2 were introduced into the human mammary epithelial cell line MCF10A by retroviral transfection. Efficient knockdown of Grhl2 mRNA by shRNA was verified by quantitative real-time PCR (Figure 2A). In comparison with empty vector or the luciferase shRNA vector – infected cells, knockdown of Grhl2 in MCF10A cells led to significant cell morphology changes, with transformation of the cobble-stone like epithelial morphology to an elongated fibroblast-like mesenchymal morphology (Figure 2B). To confirm that the phenotype changes observed in these cells was due to loss of the epithelial phenotype, we performed immunofluorescent staining of epithelial and mesenchymal markers. The immunofluorescence images clearly showed drastically reduced endogenous E-cadherin protein levels, and more importantly disruption of cell-cell border E-cadherin staining in Grhl2 knockdown cells (Figure 2C, left panel), indicating a breakdown of cell-cell junctions. Furthermore, staining of the mesenchymal marker vimentin was increased significantly (Figure 2C, right panel), indicating that knockdown of Grhl2 led to MCF10A cells to undergo epithelial-to-mesenchymal transition (EMT). The EMT phenotype was further confirmed by expression of characteristic molecular markers using western blotting (Figure 2D) and realtime PCR (Figure 2E, 2F). In Grhl2 knockdown MCF10A cells, epithelial markers E-cadherin, Epcam, and CD24 were down-regulated, while the mesenchymal marker vimentin was up-regulated dramatically (Figure 2D, 2E). EMT inducers, including Snai1, Twist1, Zeb1 and Zeb2, were up-regulated in Grhl2 knockdown MCF10A cells while expression levels of primary microRNA of miR-200c∼141 cluster whose products are negative regulators of Zeb1 and Zeb2, was decreased (Figure 2F). Collectively, these data indicate that Grhl2 is essential for mammary epithelial cells to maintain the epithelial phenotype. Knockdown of Grhl2 leads to a loss of epithelial morphology, epithelial markers and induction of EMT. Our data showed that 4T1 cells recovered from lung did not express Grhl2 and had undergone EMT. These data suggest that loss of Grhl2 during tumor progression could contribute to loss of the epithelial phenotype and promote EMT.
Over-expression of Grhl2 induces epithelial gene expression and promotes tumor growth and metastasis
Our data showed that Grhl2 determines the epithelial phenotypes of breast cancers. Next, we tested if over-expression of Grhl2 was able to inhibit EMT during tumor progression and block tumor metastasis. We used the 4T1 model to answer this question, in which disseminated cancer cells had undergone EMT and did not express Grhl2 (Figure S1A, S1B, S1C).
4T1 cells were transfected with lentiviral constructs expressing Grhl2 or empty vector. A pool of sorted GFP+ cells was used for the in vivo studies, thus avoiding clonal selection bias. In Grhl2 over-expressing cells, the transcription level of Grhl2 was four times that of the control cells as measured by quantitative realtime PCR (Figure S3A). Over-expression of Grhl2 caused an up-regulation of primary transcripts of microRNA-200b∼200a∼429 cluster and Epcam, and down-regulation of Zeb2 (Figure S3A, S3B). Surprisingly, tumors formed by 4T1-Grhl2 cells were significantly larger (p = 0.005, two tailed student t tests) (Figure 3A), and more metastatic tumors were formed in the lungs than 4T1-control cells (Figure 3B) (18 mice in each group, four independent experiments).
Using RT-PCR, we confirmed high levels of Grhl2 transcripts in 4T1-Grhl2 cells recovered from primary tumors and lungs (Figure 3C, 3D). Consistent with our previous data, 4T1-control cells recovered from lungs had undergone EMT, and did not express Grhl2, E-cadherin and Epcam (Figure 3D, 3E, 3F, 3G). In 4T1-Grhl2 cells recovered from lungs, high levels of E-cadherin, Epcam and miR-200s expression were detected (Figure 3D, 3E, 3F, 3G). Immunofluorescent staining of fixed lung sections also confirmed E-cadherin staining in the metastatic tumors formed by 4T1-Grhl2 cells, but not in metastatic tumors originating from 4T1-control cells (Figure 3H). Together, these data indicate that stable expression of Grhl2 induces E-cadherin expression in 4T1 cells at metastatic sites. Surprisingly, these data also show that stable expression of Grhl2 promotes tumor growth and metastasis.
To identify downstream target genes of Grhl2, we performed microarray gene expression profiling. 4T1-control cells recovered from lung had undergone EMT and did not express Grhl2. Compare to 4T1-control cells recovered from primary tumors 260 transcripts were down-regulated and 33 transcripts were up-regulated (<2.0 fold) in the 4T1-control cells recovered from the lungs (>2.0 fold) (Table S1). Among the 260 transcripts that were down-regulated in 4T1-control cells from the lung, 133 transcripts were up-regulated (>2.0 fold) in 4T1-Grhl2 cells recovered from the lung. In addition, genes that were down-regulated in the 4T1-control cells from the lung that had the highest fold changes were also the genes that also were up-regulated in 4T1-Grhl2 cells with the highest fold changes (Figure 3I, Table S1), suggesting these genes are directly or indirectly regulated by Grhl2. Genes regulated by Grhl2 cover a broad range of epithelial characteristics, including cell adhesion (Cdh1), tight junction (Cldn7, Cldn4, Cldn3, Ocln and Tjp2), desmosomes (Dsp, Jup, Pkp2 and Dsg2), gap junction (Gjb3), cytokeratine (Krt7, Krt8, Krt14, Krt17 and Krt19), Integrin (Itgb6, Itgb4 and Itgb2), Wnt ligands (Wnt7A, Wnt7B) and epithelial specific splicing factor (Esrp1). Some of the data from microarrays were further confirmed by quantitative realtime PCR (Figure 3J). Collectively, these data reveal that 51.5% (133/260) of genes down-regulated in 4T1 cells that had undergone EMT can be reversed by Grhl2, indicating a critical role of Grhl2 in maintaining the epithelial phenotype.
Wnt7A is regulated by Grhl2, and over-expression of Wnt7A phenotypically mimics Grhl2 in promoting epithelial gene expression and tumor progression
Our data show that Grhl2 promotes epithelial gene expression and also promotes tumor growth and metastasis. Next, we tested whether Grhl2 mediated promotion of tumor growth and metastasis is through its regulated genes. Among them, Wnt7A is a very attractive candidate. Using RT-PCR, we confirmed that Wnt7A was down-regulated significantly in 4T1 cells that had undergone EMT (Figure 4A), and stable expression of Grhl2 up-regulated Wnt7A expression in 4T1 cells recovered from the lung (Figure 4B).
Using specific primers to amplify the full length open reading frame of mouse Wnt7A from a cDNA pool derived from 4T1 cells, we obtained two cDNA clones. Sequencing results showed that the longer cDNA shared 100% sequence identity with the reference sequence of mouse Wnt7A (Accession number: NM_009527.3), encoding a protein with 349 amino acids in length. This isoform is designated as Wnt7A isoform 1 or Wnt7A1 (Figure S4A). The shorter cDNA lacked 155 base pairs compared to the reference sequence of mouse Wnt7A, due to using an alternative 5′ splice site inside exon3, which results in introducing an in-frame stop codon immediately after the alternative splicing site. This isoform encodes a truncated Wnt7A protein 148 amino acids in length. This isoform is designated as Wnt7A isoform 2 or Wnt7A2 (Figure S4A, S4B). In 4T1 cells in the epithelial state, Wnt7A1 is the dominant isoform (Figure 4A, 4B). Stable expression of Wnt7A1 in 4T1 cells caused a slight up-regulation of E-cadherin, the primary transcript of miR-200s, and a down-regulation of Twist1 and vimentin, while stable expression of Wnt7A2 led to down-regulation of E-cadherin, Epcam, Grhl2, and primary transcripts of miR-200s (Figure 4C, S4C).
Next, these cells were transplanted into mammary fat pads of female BALB/c mice, and tumor progression was monitored every three days. Tumor growth was significantly delayed in the 4T1-Wnt7A2 group when compared to the other two groups (Figure 4D). At four weeks post tumor implantation, the average size of 4T1-Wnt7A2 tumors was 47% of the size of 4T1-Wnt7A1 tumors, and 62% of the size of 4T1-control tumors making them significantly smaller (p = 0.033, two-tailed student t test) (Figure 4D). Lung tumor metastatic lesions were observed in mice bearing 4T1-Wnt7A1 tumors and 4T1-control tumors, but not in mice bearing 4T1-Wnt7A2 tumors (Figure 4E). These results indicate that Wnt7A1 promotes tumor growth, while Wnt7A2 inhibits tumor growth and metastasis.
We then examined gene expression in cells recovered from primary tumors and lungs. 4T1-control cells recovered from primary tumors were mainly comprised of epithelial cells, and expressed E-cadherin, Grhl2 and miR-200, while 4T1-control cells recovered from lungs were characterized by mesenchymal morphology and did not express E-cadherin, Grhl2 and miR-200 (Figure 4F, 4G, 4H, 4I). 4T1-Wnt7A1 cells recovered from either primary tumors or lungs were characterized by a cobble-stone like epithelial morphology and expressed high levels of E-cadherin, Grhl2 and miR-200 (Figure 4F, 4G, 4H, 4I). In contrast, 4T1-Wnt7A2 cells recovered from either primary tumors or lungs had a spindle or round-like mesenchymal morphology and did not express E-cadherin, Grhl2 and miR-200 (Figure 4F, 4G, 4H, 4I).
We also obtained the gene expression profiles of 4T1-Wnt7A1 and 4T1-Wnt7A2 cells recovered from lungs. Compared to 4T1-control cells recovered from lung, 419 transcripts were up-regulated (>2.0 fold) in 4T1-Wnt7A1 cells recovered from lung (Table S2). Among them, 156 transcripts overlapped with genes up-regulated in 4T1-Grhl2 cells from the lung (>2.0 fold). Genes up-regulated in 4T1-Wnt7A1 cells with the highest fold changes were the same genes that were up-regulated by Grhl2, including Cdh1, Cldn4, Cldn7, Dsg2, Gjb3, Tjp2, Esrp1, Rab25, and Epcam (Figure 4J). These data indicate that Wnt7A1 signaling mediated induction and expression of Grhl2 and other epithelial genes; while Wnt7A2, which encodes a truncated Wnt7A protein, causes in 4T1 cells to undergo EMT probably by competing for the same receptor as Wnt7A1 or perhaps some other mechanism.
Esrp1 is regulated by Grhl2 and stable-expression of a variant isoform of Esrp1 (Esrp1-V1) leads to 4T1 cells to undergo EMT and inhibits tumor progression
Our data have shown that Grhl2 and Wnt7A1 promote epithelial gene expression and tumor progression while Wnt7A2 promotes EMT but inhibits tumor progression, suggesting that 4T1 tumor progression in vivo might be associated with the epithelial phenotype. Based on these data, we hypothesized that a gene that promotes EMT would inhibit tumor progression. To test this hypothesis, we tested another Grhl2 regulated gene, i.e., Esrp1 an epithelial cell specific splicing factor that controls epithelial specific splicing of its targeted genes [16]. Esrp1 was down-regulated in 4T1 cells recovered from the lung that had undergone EMT (Figure 5A), correlating with its targeted gene splicing switching from the epithelial to the mesenchymal isoform, including CD44, CTNND1, MAP3K7 and Enah (Figure S5A). Over-expression of Grhl2 up-regulates Esrp1 expression (Figure 3I, 4J, 5B).
We obtained two cDNA clones of mouse Esrp1 from a cDNA pool derived from 4T1 cells. Sequencing results showed that one clone lacked 12 in-frame base pairs compared to the reference sequence of mouse Esrp1 (NM_194055), due to using an alternative 5′ splicing site inside exon12 (Figure S5B). This isoform is designated as Esrp1 variant isoform 1 (Esrp1-V1). Searching EST database revealed that using this alternative splicing site was very common both in human and mouse cells (Figure S5C). Esrp1 contains three conserved RNA recognition domain (RRM) [16]. The four amino acids missing in Esrp1-V1 are not located in any of the RRM domains. Surprisingly, stable expression of Esrp1-V1 in 4T1 cells led to significant cell morphology changes with a transformation from cobble-stone like epithelial morphology to spindle like mesenchymal morphology (Figure 5C, 5D). Expression of epithelial markers, including E-cadherin, Epcam, Grhl2, as well as Wnt7A, were significantly down-regulated, while mesenchymal markers, including vimentin, Snai2 and Zeb2, were up-regulated in 4T1-Esrp1-V1 cells (Figure 5D, 5E, 5F). Esrp1-V1 also perturbed Esrp1 regulated alternative splicing, with switching of its targeted genes, including CD44, CTNND1, MAP3K7 and Enah, from the epithelial specific isoform to the mesenchymal specific isoform (Figure S5D). Together, these data indicate that the Esrp1 regulated splicing program is important for maintaining the epithelial phenotype. Over-expressing Esrp1-V1, a variant isoform of Esrp1, perturbs the Esrp1 regulated splicing program and leads to changes in gene expression and morphology that are associated with EMT.
We used Esrp1-V1 to test our hypothesis that if a gene promoted EMT it would inhibit tumor progression. Tumors formed by 4T1-Esrp1-V1 cells were significantly smaller than controls (p = 0.033, two-tailed student t test) (Figure 5G). At four weeks post tumor implantation, the average size of 4T1-Esrp1-V1 tumors was only 37% of the size of control tumors (Figure 5G). Lung metastatic tumor lesions were also significantly reduced in mice bearing 4T1-Esrp1-V1 tumors (11 lung metastatic tumor lesions in 4T1-control versus 1 lung metastatic tumor lesion in 4T1-Esrp1-V1) (Figure 5H). These data support our hypothesis that a gene promoting EMT would inhibit tumor growth and metastasis.
Over-expression of Grhl2 correlates with poor relapse-free survival and increased risk of metastasis in breast cancer patients
Data from mouse models show that over-expression of Grhl2 promotes epithelial gene expression and promotes tumor progression. Examination of expressions of Grhl2 and E-cadherin in human breast cancers reveals that these epithelial genes are not only expressed in primary tumors (Figure S6A, S6B) but also a majority of metastatic tumors (Figure S6C). To evaluate the clinical relevance of Grhl2 in breast cancer patients, we used a large public microarray database [17]. Clinical data revealed that higher expression levels of Grhl2 were significantly correlated with a shorter relapse-free survival interval (2898 patients, p = 5.5E-6) and increased risk of metastasis (1354 patients, p = 0.0028) (Figure 6A 6B). Consistently, higher expression levels of other epithelial markers were significantly associated with poor relapse free survival and poor distant metastasis free survival, including E-cadherin, Cldn3, Esrp1, and Epcam (Figure S6D S6E). In addition, the 4T1 series, including 4T1 and 4TO7, are near-isogenic mouse mammary tumor cell lines. 4TO7 cells are able to disseminate to lung but have a very low efficiency to form metastatic lesions. Only 4T1 cells are able to complete the whole metastatic process and successfully establish metastases [5]. We found Grhl2 was only expressed in highly metastatic 4T1 cells but not weakly metastatic 4TO7 cells (Figure S6F S6G). Above all, these data indicate that Grhl2 is associated with an unfavorable outcome in breast cancer patients.
Discussion
Grhl2 determines the epithelial phenotype of breast cancers
Using the 4T1 tumor model, we found Grhl2 to be significantly down-regulated in disseminated cancer cells that had undergone EMT. Extensive bioinformatics analyses of large breast cancer microarray datasets revealed a remarkable level of correlation between the expression of Grhl2 and E-cadherin. Knockdown of Grhl2 in the human mammary epithelial cell line MCF10A led to down-regulation of E-cadherin and induction of EMT. Multiple functional assays confirm that Grhl2 plays a dominant role in maintaining the epithelial phenotype, and changing Grhl2 levels in cancer cells leads to EMT/MET induction. Collectively, these data indicate that Grhl2 is the essential transcription factor that determines the epithelial phenotype of breast cancers.
Our studies reveal that Grhl2 regulates a broad range of epithelial genes, including epithelial specific splicing factor (Esrp1), cell adhesion and tight junction (E-cadherin, Tjp2, Cldn4 and Cldn7), Wnt ligands (Wnt7A, Wnt7B), and possibly microRNA (miR-200s). A previous study has reported that in Grhl2 mutant embryos, Esrp1 was significantly down-regulated [10] suggesting that the relationship between Grhl2 and Esrp1 was probably conserved across all types of epithelial cells. Recent studies suggest Wnt7A may play an important role in regulating epithelial cell differentiation in mice. Expression of Wnt7A is tightly co-regulated with epithelial genes during genital tubercle development [18], and in uterine Wnt7A is specifically expressed in epithelial cells [19], [20].
Members of the miR-200 family are epithelial specific microRNAs and play a critical role in EMT by targeting Zeb1 and Zeb2 [21], [22]. One study reported that expression of miR-200s is negatively regulated by Zeb1 and Zeb2 through the conserved E-box sites located within their promoter regions [23]. However, the transcription factor that drives the expressions of miR-200s in epithelial cells remains unidentified. In our study, we observed a strong correlation between the expression of Grhl2 and miR-200s. In Grhl2 knockdown MCF10A cells, the transcription level of miR-200c∼141 cluster was decreased. In 4T1 cells that had undergone EMT, Grhl2 was not expressed and transcription levels of miR-200s were very low. When Grhl2 was over-expressed, expression levels of the miR-200 family were up-regulated. Our data suggest that Grhl2 could be the transcription factor that drives the expression of miR-200 in breast cancer cells.
Several transcription factors, e.g. Gata3 and Klf4, have been identified that could induce E-cadherin expression through binding to E-cadherin promoter in human breast cancer cells [24], [25]. Gata3 plays a key role in determining luminal lineage cell differentiation [26], [27]. Microarray data clearly showed that Gata3 was specifically expressed in luminal subtype cancer cells (Figure S2C). Basal A subtype tumors expressed very low level of Gata3, but high levels of E-cadherin and Grhl2 (Figure S2C), indicating Gata3 was not the essential transcription factor that controls E-cadherin expression in human breast cancers. Klf4 also has been reported to regulate E-cadherin expression via binding to the E-cadherin promoter region [25]. However, examination of Klf4 expression in a panel of human breast cancer cells showed no correlation between Klf4 and E-cadherin (Pearson correlation coefficient r = −0.05879, dataset GSE12777, 51 human breast cancer cells), indicating that although Klf4 was able to restore E-cadherin in mesenchymal breast cancer cells [25], it is not the essential transcription factor that determines the epithelial phenotype of breast cancer cells. Klf17 has been identified as a negative regulator of EMT by targeting Id1 [28]. Knockdown of Klf17 led to EMT and promoted tumor metastasis [28]. Examination of Klf17 expression in a panel of human breast cancer cells revealed no correlation between Klf17 and E-cadherin (Pearson correlation coefficient r = −0.1407, dataset GSE12777). There was no correlation between Id1 and E-cadherin either (Pearson correlation coefficient r = −0.033). Our data showed that Grhl2 was the only transcription factor that had such an extraordinary correlation with E-cadherin expression, and multiple functional assays confirmed that changes of epithelial gene expressions were strongly associated with Grhl2 expression during tumor progression. Our study indicates that Grhl2 is the essential transcription factor that determines the epithelial phenotypes of breast cancers. Our data also indicate that down-regulation of Grhl2 expression during tumor progression contributes to loss of epithelial phenotype and epithelial-to-mesenchymal transition.
Grhl2 plays a critical role in epithelial-to-mesenchymal transition
During EMT, epithelial genes are down-regulated along with Grhl2, which is the essential transcription factor that controls epithelial gene expression. Down-regulation of Grhl2 could be a necessary step during EMT. In our studies, Grhl2 is significantly down-regulated in early disseminated 4T1 cells that have undergone EMT. Over-expression of either Wnt7A2 or Esrp1-V1 in 4T1 cells caused the cells to undergo EMT, and led to down-regulation of Grhl2. MCF10A cells were induced to undergo EMT by TGFβ (Figure S7A). During TGFβ induced EMT, Snai1 and Zeb1 are induced, but Grhl2 is down-regulated (Figure S7B). Further examination of publicly available microarray datasets also gave the same conclusion, which is that Grhl2 is always down-regulated in cells that have undergone EMT. Human mammary epithelial cells were induced to undergo EMT by Zeb1, Twist1 or Zeb2 combined with TGFβ [29], and Grhl2 was significantly down-regulated by these EMT inducers (Figure S7C). In a sub-population of spontaneously arising mesenchymal cells from immortalized human mammary epithelial cells (HMLE), expression level of Grhl2 is very low (GSE28681) [30]. In a sub-population of CD44+CD24− cells derived from MCF10A cells, expression levels of Grhl2 is very low compared to CD44−CD24+ sub-population (GSE15192) [31]. Breast cancer cells derived from MMTV-NEU mouse were induced to undergo EMT in vivo after implanted into syngeneic mice, and Grhl2 is dramatically down-regulated in the cells that have undergone EMT (GSE13259) [32]. Snai1, Zeb1 and Zeb2 suppress gene expression by binding to E-box sites located within the promoter region of their targeted genes [23], [33], [34]. Two conserved E-box sites are located within the proximal promoter region of Grhl2 (Figure 1E), suggesting that Grhl2 may be suppressed directly by Snai1, Zeb1 or Zeb2. Down-regulation of Grhl2 eventually leads to down-regulation of Grhl2 regulated genes, including E-cadherin, Epcam and Claudin4 [9], which contribute to inhibiting epithelial gene expression during the EMT process. Collectively, these data indicate that no matter what signaling pathway induces breast cancer cells to undergo EMT, Grhl2 is always down-regulated.
Grhl2 regulates a broad range of epithelial specific genes, and some of these genes can feedback to regulate the expression of Grhl2 and epithelial genes. Wnt7A is a Grhl2 regulated gene in 4T1 cells. Over-expression of Wnt7A leads to up-regulation of Grhl2 and E-cadherin. Microarray data reveal that genes that are up-regulated by Wnt7A with the highest fold changes are the same genes that are up-regulated by Grhl2, suggesting that Wnt7A mediated signaling could induce epithelial gene expression through induction of Grhl2. Interestingly, over-expressing a truncated Wnt7A isoform, Wnt7A2, which could potentially compete with the full length Wnt7A for binding to Wnt receptor or some other mechanism, leads to down-regulation of Grhl2 and E-cadherin. These data also suggest that Grhl2 and Wnt7A may form a positive feedback loop, and this signaling loop is important for maintaining epithelial status. Esrp1 is another Grhl2 regulated gene. Recent studies have identified Esrp1 as an epithelial specific splicing factor that controls the epithelial specific splicing of its targeted genes [16], [35]. Knockdown of Esrp1 in human mammary epithelial cells leads to epithelial cells undergoing some of the changes in gene expression and morphology that are associated with EMT [35], which suggests that the Esrp1 regulated splicing program is critical for maintaining the epithelial phenotype. In our study, we identified Esrp1 as a Grhl2 regulated gene. We confirmed that splicing of four Esrp1 regulated genes, including CTNND1 (P120 catenin), CD44, MAP3K7, and Enah, switched from the epithelial isoform to the mesenchymal isoform in 4T1 cells that had undergone EMT. Furthermore, we found that over-expressing a variant isoform of Esrp1, Esrp1-V1, perturbed the Esrp1 regulated splicing program and led to down-regulation of E-cadherin and Grhl2. Collectively, these data suggest that Grhl2 regulated genes also can feedback to regulate the expression of Grhl2 and epithelial phenotype.
Grhl2 promotes tumor progression
Our results showed that over-expression of Grhl2 significantly promoted tumor growth and metastasis. Further testing of several Grhl2 regulated genes, which either promoted EMT or inhibited EMT, always gave the same results, that the tumor growth and metastasis were linked to epithelial phenotype but not mesenchymal phenotype. In addition, we found Grhl2 was only expressed in highly metastatic 4T1 cells, but not weakly metastatic 4TO7 cells. Large clinical data also revealed that high levels of Grhl2 expression were significantly associated with poor relapse free survival and increased risk of metastasis in breast cancer patients.
It has been widely accepted that EMT plays a critical role during tumor metastasis [2]. Recently, several studies have reported conflicting roles of EMT in tumor growth and metastasis [36]–[38]. One study reported that over-expression of the miR-200s family was associated with increased risk of metastasis in breast cancer, and promoted metastatic colonization in mice models [36]. Clinical observations revealed that metastatic tumors were epithelial type [2], [13], [39]. Examination of large microarray datasets revealed that Grhl2 was not only expressed in primary tumors but also metastasis tumors. Tumor metastasis is a complex multistep process [40]. So far, very little is known about how the early disseminated cancer cells develop into metastasis tumors. Some researchers suggested that these early disseminated cancer cells might undergo mesenchymal-to-epithelial transition (MET) [2], which is based on observations that metastatic tumors were epithelial type. Given the essential role of Grhl2 in determining the epithelial phenotype, Grhl2 could play a pivotal role at the final step of tumor metastasis, and this possibility will require further studies.
Materials and Methods
Microarray hybridization and data analysis
RNAs isolated from 4T1-control cells recovered from primary tumors and lungs, 4T1-Grhl2, 4T1-Wnt7A1 and 4T1-Wnt7A2 cells recovered from lungs were processed for microarray hybridization using Affymetrix Murine GeneST Arrays according to the manufacturer's instructions (Affymetrix). Raw data were preprocessed by the robust multi-array analysis method as described previously [6].
Software MeV4.6 was used for microarray data analysis [41]. Dataset GSE12777 [14] and GSE16795 [15] were downloaded from the Gene Expression Omnibus (GEO) repository. According to the molecular definition of epithelial and mesenchymal breast cells [1], [12], [13], the following genes were set as epithelial markers: Cdh1, Cldn3, Cldn4, Cldn7, Tjp2, Jup and CD24; and these genes were set as markers for mesenchymal cells: Cdh2, Vim and Zeb1. Breast cancer cells were separated into an epithelial group and a mesenchymal group by Hierarchical clustering (HCL) analysis based on the transcription levels of epithelial markers and mesenchymal markers. A two-class Significance Analysis of Microarrays (SAM) was used to identify genes differentially expressed between the epithelial group and mesenchymal group (false discovery rate <5.00%). To search for epithelial-specific genes that co-regulated with E-cadherin in human breast cancer cells, all probes representing 54,675 transcripts were analyzed for differential expression in epithelial versus mesenchymal cells and Pearson correlation coefficient r was calculated between each transcript and E-cadherin across all microarrays. PAM305 breast cancer gene classifier [11] was used to identify molecular subtype.
The details of other methods used for this study, including cell culture, construction of lentiviral vectors, western blot analysis, RT-PCR and Real-time PCR, dual-luciferase assay, immunohistology staining, in vivo tumor growth and assays for metastasis have been published previously [6] and are described in Methods S1.
Supporting Information
Acknowledgments
The authors thank Dr. John Kappes for providing Lentiviral vector 2229 and Dr. Eric R. Fearon for the pGL2-E-cadherin promoter luciferase vector [34]. We thank Dr. Jerald Ainsworth for editorial assistance.
Funding Statement
This work was supported by grants from the National Institutes of Health (NIH) (RO1CA137037 and RO1CA116092); the Louisville Veterans Administration Medical Center (VAMC) Merit Review Grants (H.-G.Z.); and a grant from the Susan G. Komen Breast Cancer Foundation. The authors thank Dr. Jerald Ainsworth for editorial assistance. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Thiery JP, Acloque H, Huang RYJ, Nieto MA (2009) Epithelial-Mesenchymal Transitions in Development and Disease. Cell 139: 871–890. [DOI] [PubMed] [Google Scholar]
- 2. Polyak K, Weinberg RA (2009) Transitions between epithelial and mesenchymal states: acquisition of malignant and stem cell traits. Nat Rev Cancer 9: 265–273. [DOI] [PubMed] [Google Scholar]
- 3. Thiery JP, Sleeman JP (2006) Complex networks orchestrate epithelial-mesenchymal transitions. Nat Rev Mol Cell Biol 7: 131–142. [DOI] [PubMed] [Google Scholar]
- 4. Peinado H, Olmeda D, Cano A (2007) Snail, Zeb and bHLH factors in tumour progression: an alliance against the epithelial phenotype? Nat Rev Cancer 7: 415–428. [DOI] [PubMed] [Google Scholar]
- 5. Yang J, Mani SA, Donaher JL, Ramaswamy S, Itzykson RA, et al. (2004) Twist, a Master Regulator of Morphogenesis, Plays an Essential Role in Tumor Metastasis. cell 117: 927–939. [DOI] [PubMed] [Google Scholar]
- 6. Xiang X, Zhuang X, Ju S, Zhang S, Jiang H, et al. (2011) miR-155 promotes macroscopic tumor formation yet inhibits tumor dissemination from mammary fat pads to the lung by preventing EMT. Oncogene 30: 3440–3453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Boglev Y, Wilanowski T, Caddy J, Parekh V, Auden A, et al. (2010) The unique and cooperative roles of the Grainy head-like transcription factors in epidermal development reflect unexpected target gene specificity. Developmental Biology 349: 512–522. [DOI] [PubMed] [Google Scholar]
- 8. Yu Z, Bhandari A, Mannik J, Pham T, Xu X, et al. (2008) Grainyhead-like factor Get1/Grhl3 regulates formation of the epidermal leading edge during eyelid closure. Developmental Biology 319: 56–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Werth M, Walentin K, Aue A, Schönheit J, Wuebken A, et al. (2010) The transcription factor grainyhead-like 2 regulates the molecular composition of the epithelial apical junctional complex. Development 137: 3835–3845. [DOI] [PubMed] [Google Scholar]
- 10. Pyrgaki C, Liu A, Niswander L (2011) Grainyhead-like 2 regulates neural tube closure and adhesion molecule expression during neural fold fusion. Developmental Biology 353: 38–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Neve RM, Chin K, Fridlyand J, Yeh J, Baehner FL, et al. (2006) A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. Cancer cell 10: 515–527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Mani SA, Guo W, Liao M-J, Eaton EN, Ayyanan A, et al. (2008) The Epithelial-Mesenchymal Transition Generates Cells with Properties of Stem Cells. Cell 133: 704–715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Shipitsin M, Campbell LL, Argani P, Weremowicz S, Bloushtain-Qimron N, et al. (2007) Molecular Definition of Breast Tumor Heterogeneity. Cancer cell 11: 259–273. [DOI] [PubMed] [Google Scholar]
- 14. Hoeflich KP, O'Brien C, Boyd Z, Cavet G, Guerrero S, et al. (2009) In vivo Antitumor Activity of MEK and Phosphatidylinositol 3-Kinase Inhibitors in Basal-Like Breast Cancer Models. Clinical Cancer Research 15: 4649–4664. [DOI] [PubMed] [Google Scholar]
- 15. Hollestelle A, Nagel J, Smid M, Lam S, Elstrodt F, et al. (2009) Distinct gene mutation profiles among luminal-type and basal-type breast cancer cell lines. Breast Cancer Research and Treatment 121: 53–64. [DOI] [PubMed] [Google Scholar]
- 16. Warzecha CC, Sato TK, Nabet B, Hogenesch JB, Carstens RP (2009) ESRP1 and ESRP2 Are Epithelial Cell-Type-Specific Regulators of FGFR2 Splicing. Molecular cell 33: 591–601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Györffy B, Lanczky A, Eklund A, Denkert C, Budczies J, et al. (2010) An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Research and Treatment 123: 725–731. [DOI] [PubMed] [Google Scholar]
- 18. Chiu HS, Szucsik JC, Georgas KM, Jones JL, Rumballe BA, et al. (2010) Comparative gene expression analysis of genital tubercle development reveals a putative appendicular Wnt7 network for the epidermal differentiation. Developmental Biology 344: 1071–1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Winuthayanon W, Hewitt SC, Orvis GD, Behringer RR, Korach KS (2010) Uterine epithelial estrogen receptor α is dispensable for proliferation but essential for complete biological and biochemical responses. Proceedings of the National Academy of Sciences 107: 19272–19277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Filant J, Zhou H, Spencer TE (2012) Progesterone Inhibits Uterine Gland Development in the Neonatal Mouse Uterus. Biology of Reproduction 86: 146, 141–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Gibbons DL, Lin W, Creighton CJ, Rizvi ZH, Gregory PA, et al. (2009) Contextual extracellular cues promote tumor cell EMT and metastasis by regulating miR-200 family expression. Genes & Development 23: 2140–2151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Park S-M, Gaur AB, Lengyel E, Peter ME (2008) The miR-200 family determines the epithelial phenotype of cancer cells by targeting the E-cadherin repressors ZEB1 and ZEB2. Genes & Development 22: 894–907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Bracken CP, Gregory PA, Kolesnikoff N, Bert AG, Wang J, et al. (2008) A Double-Negative Feedback Loop between ZEB1-SIP1 and the microRNA-200 Family Regulates Epithelial-Mesenchymal Transition. Cancer Research 68: 7846–7854. [DOI] [PubMed] [Google Scholar]
- 24. Yan W, Cao QJ, Arenas RB, Bentley B, Shao R (2010) GATA3 Inhibits Breast Cancer Metastasis through the Reversal of Epithelial-Mesenchymal Transition. Journal of Biological Chemistry 285: 14042–14051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Yori JL, Johnson E, Zhou G, Jain MK, Keri RA (2010) Krüppel-like Factor 4 Inhibits Epithelial-to-Mesenchymal Transition through Regulation of E-cadherin Gene Expression. Journal of Biological Chemistry 285: 16854–16863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Visvader JE (2009) Keeping abreast of the mammary epithelial hierarchy and breast tumorigenesis. Genes & Development 23: 2563–2577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Kouros-Mehr H, Bechis SK, Slorach EM, Littlepage LE, Egeblad M, et al. (2008) GATA-3 Links Tumor Differentiation and Dissemination in a Luminal Breast Cancer Model. Cancer cell 13: 141–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Gumireddy K, Li A, Gimotty PA, Klein-Szanto AJ, Showe LC, et al. (2009) KLF17 is a negative regulator of epithelial-mesenchymal transition and metastasis in breast cancer. Nat Cell Biol 11: 1297–1304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Morel A-P, Hinkal GW, Thomas C, Fauvet F, Courtois-Cox S, et al. (2012) EMT Inducers Catalyze Malignant Transformation of Mammary Epithelial Cells and Drive Tumorigenesis towards Claudin-Low Tumors in Transgenic Mice. PLoS Genet 8: e1002723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Scheel C, Eaton Elinor N, Li Sophia H-J, Chaffer Christine L, Reinhardt F, et al. (2011) Paracrine and Autocrine Signals Induce and Maintain Mesenchymal and Stem Cell States in the Breast. Cell 145: 926–940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Bhat-Nakshatri P, Appaiah H, Ballas C, Pick-Franke P, Goulet R, et al. (2010) SLUG/SNAI2 and Tumor Necrosis Factor Generate Breast Cells With CD44+/CD24− Phenotype. BMC Cancer 10: 411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Santisteban M, Reiman JM, Asiedu MK, Behrens MD, Nassar A, et al. (2009) Immune-Induced Epithelial to Mesenchymal Transition In vivo Generates Breast Cancer Stem Cells. Cancer Research 69: 2887–2895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Comijn J, Berx G, Vermassen P, Verschueren K, van Grunsven L, et al. (2001) The Two-Handed E Box Binding Zinc Finger Protein SIP1 Downregulates E-Cadherin and Induces Invasion. Molecular cell 7: 1267–1278. [DOI] [PubMed] [Google Scholar]
- 34. Hajra KM, Chen DY-S, Fearon ER (2002) The SLUG Zinc-Finger Protein Represses E-Cadherin in Breast Cancer. Cancer Research 62: 1613–1618. [PubMed] [Google Scholar]
- 35. Warzecha CC, Jiang P, Amirikian K, Dittmar KA, Lu H, et al. (2010) An ESRP-regulated splicing programme is abrogated during the epithelial-mesenchymal transition. EMBO J 29: 3286–3300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Korpal M, Ell BJ, Buffa FM, Ibrahim T, Blanco MA, et al. (2011) Direct targeting of Sec23a by miR-200s influences cancer cell secretome and promotes metastatic colonization. Nat Med advance online publication [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Celià-Terrassa T, Meca-Cortés Ó, Mateo F, Martínez de Paz A, Rubio N, et al. (2012) Epithelial-mesenchymal transition can suppress major attributes of human epithelial tumor-initiating cells. The Journal of Clinical Investigation 122: 1849–1868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Dykxhoorn DM, Wu Y, Xie H, Yu F, Lal A, et al. (2009) miR-200 Enhances Mouse Breast Cancer Cell Colonization to Form Distant Metastases. PLoS ONE 4: e7181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Chaffer CL, Thompson EW, Williams ED (2007) Mesenchymal to Epithelial Transition in Development and Disease. Cells Tissues Organs 185: 7–19. [DOI] [PubMed] [Google Scholar]
- 40. Nguyen DX, Bos PD, Massague J (2009) Metastasis: from dissemination to organ-specific colonization. Nat Rev Cancer 9: 274–284. [DOI] [PubMed] [Google Scholar]
- 41.Saeed AI, Bhagabati NK, Braisted JC, Liang W, Sharov V, et al.. (2006) [9] TM4 Microarray Software Suite. Methods in Enzymology: Academic Press. pp. 134–193. [DOI] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.