Abstract
Integrated studies of accumulated data can be performed to obtain more reliable information and more feasible measures for investigating the potential diagnostic and prognostic biomarkers of breast cancer and exploring related molecular mechanisms. Our study aimed to explore the GATA family members involved in breast cancer by integrating data from The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO) and other online databases. We performed an integrated analysis of published studies from GEO and analyzed clinical data from TCGA and GTEx to evaluate the clinical significance and prognosis values of the GATA family in breast cancer. GATA3 was found to be upregulated and exhibited a favorable value in the diagnosis and prognosis of breast cancer. Through this study, we identified possible GATA3-correlated genes and core pathways that play an important role, which requires further investigation in breast cancer.
Introduction
GATA transcription factors are defined as a family of transcription factors characterized by their DNA-binding specificities to the “GATA” DNA sequence1. These transcription factors are highly conserved by amino acid sequence identity within the binding target site, conforming to the consensus motif WGATAR (where W = A or T and R = A or G)2. Currently, six members are in the GATA transcription factor family, named in order of when they were discovered, from “GATA1” to GATA6”.
GATA1 and GATA2 were first found to be involved in regulating the cell cycle and in proliferation during primitive hematopoietic development3. More recent studies have revealed that GATA1 and GATA2 also participate in the progression of breast cancer and prostate cancer via EMT processes4,5. GATA3 plays an integral role in luminal cell differentiation in mammary glands6. Consequently, GATA3 has recently been drawing scientists’ attention in breast cancer, but the supporting evidence is inconclusive7. In addition, the prognostic significance of GATA3 in breast cancer and other malignancies remains controversial according to differing research results8. GATA4, GATA5, and GATA6 were also classified as endodermal GATA factors. The altered expression of GATA4, GATA5, and GATA6 is associated with various malignancies broadly from the gastric tract, lungs, ovaries, and even the brain9. However, their role in cancer as an oncogene or a tumor suppressor gene is still uncertain.
Progressively, using gene regulation networks from large databases to develop an understanding of transcription factor functions has been widely accepted by the biology research field. Thus far, our current state of knowledge about GATA factors in the context of human cancers is still limited. Despite the distinctive role of individual GATA members in the development and progression of human cancers, the integrated functions and prognostic values of different GATA members in breast cancer are largely unexplored.
The present study aimed to systemically investigate the expression and prognostic values of GATA family members with potential gene functions in breast cancer by using integrated large databases. We explored the characterization of the GATA family member gene status of breast cancer patients from expression patterns to prognostic values and potential clinical pathology application to provide a comprehensive understanding of GATA family utilities in breast cancer.
Methods
GATA family Expression Data Pan-cancer analysis
To analyze the expression of the GATA family gene in a variety of malignancies, the TCGA, GTEx and Oncomine online databases were accessed for the visualization of gene expression. Oncomine is an online cancer microarray database used to facilitate and promote discoveries from genome-wide expression analyses. The pan-cancer studies in Oncomine were selected to compare the expression levels in tumors vs normal tissues. The selection criteria for the Oncomine studies were P < 0.05 as a threshold, 2-fold change and gene rank in the top 10%. The P-values, fold changes, and cancer subtypes were extracted. In addition, we compared GATA family mRNA RNA-Seq data from clinical specimens of pan-cancer including breast cancer tissue versus normal tissue from the data imported from TCGA and GTEx by Gene Expression Profiling Interactive Analysis (GEPIA)10. Moreover, the expression profile of the GATA family members in each breast cancer subtype from TCGA was visualized by Oncomine in a log2 median-centered ratio.
Breast Cancer Gene-Expression Miner v4.1 with survival meta-analysis
To analyze the association among the expression levels of each member of the GATA family and the clinicopathological features of breast cancer, the online database Breast Cancer Gene-Expression Miner v4.1 (BC-GEM, bcGenExMiner v4.1), which comprises 36 genomic datasets, was used (http://bcgenex.centregauducheau.fr/BC-GEM/GEM-requete.php)11,12. All validated GATA family members in the 36 datasets containing over 3000 cases from BC-GEM were pooled for a survival meta-analysis.
GOBO expression analysis
The mRNA expression levels of GATA3 in different clinical parameters were analyzed by the GOBO database v 1.0.2 (http://co.bmc.lu.se/gobo/gsa.pl). GOBO is an accessible tool that contains 1881 breast cancer tumor sample datasets, including clinical characteristics13. The GEO datasets GSE11121, GSE7390, GSE2034, GSE5327, GSE2603, GSE3494, GSE1456, GSE6532, GSE4922 and GSE12093 were combined to perform an analysis on the GATA3 expression differences within the breast cancer molecular subtypes, PAM50 subtypes, ER status and tumor grade. Additionally, the expression in the breast cancer stage was compared by one-way ANOVA.
The relapse-free survival Kaplan-Meier analysis
To analyze the prognostic values of GATA3 in breast cancer samples with different clinical conditions, the Kaplan-Meier plotter for breast cancer (www.kmplot.com)14 was used to display relapse-free survival (RFS) with different clinical parameters separated by median expression with the probe 209602_s_at. The log-rank P-value was calculated with <0.05 considered statistically significant.
Immunohistochemistry expression pattern
The Human Protein Atlas (HPA) is an open access program that maps all human proteins in cells and tissue samples (https://www.proteinatlas.org/)15. IHC data for clinical potential application were extracted from the Human Protein Atlas in both normal breast tissue and breast cancer. A high and low expression pattern was selected to validate the potential applicants for GATA3 in breast cancer prognosis prediction.
Gene expression omnibus data mining and identification of DEGs
Microarray profiles related to breast cancer were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo/)16. For the GATA3 overexpression profile in breast cancer, the GSE24249 dataset was included in our analysis. GSE24249 is an expression profile with overexpressed GATA3 in the MDA-MB231 breast cancer cell line in a GPL570 Affymetrix Human Genome U133 Plus 2.0 Array platform17. The raw microarray data were extracted and normalized. R software was used with the limma package to identify the differentially expressed genes (DEGs). A volcano plot was applied to show the DEGs in GSE24249. Additionally, numerous DEGs were plotted to present the top upregulated and downregulated differential genes.
Integrative bioinformatics analysis of the selected GATA family members
The achieved DEGs were pooled for Gene Ontology (GO) terms (biological process) using the GO Annotation database (released 2018-09-06) through the Gene Ontology Consortium (http://pantherdb.org)18, an online database that provides a comprehensive set of functional annotation tools for the interpretation of summed genes. The gene ratio and enrichment score with an FDR were calculated and plotted by R software with a GO enrichment19.
Kyoto encyclopedia of genes and genomes (KEGG) pathway enrichment analysis was performed by using the DAVID database (Version 6.7)20,21. All enriched pathways were plotted by using R software. FDR < 0.05 was set as a cutoff for significance.
The Search Tool for the Retrieval of Interacting Genes (STRING) database (version 10.5; http://string-db.org/)22, is used to explore protein-protein interaction information. To evaluate the interactive associations among overexpressed GATA-3-related DEGs, the interaction with a combined score >0.4 was selected and mapped. Then, the DEGs PPI network was constructed and visualized using Cytoscape software (version 3.5.1; www.cytoscape.org)23.
Gene set enrichment analysis (GSEA) was performed using the Molecular Signatures Database (MSigDB) software24,25. The overexpressed GATA3 group in the GSE24249 dataset was set as phenotype 1, and the vector group was set as phenotype 2. The goal of GSEA in our research was to identify the distribution of biological functions within the DEGs. An enrichment score (ES) was calculated. The gene ranking metric in the weighted ES was the 2-sided SNR, and the p-values were calculated using 1,000 permutations of the phenotype.
Results
Differential expression studies of the GATA family transcript in pan-cancer
The six GATA family members from GATA1 to GATA6 were explored in human cancers by the Oncomine online database. The top mRNA differences between cancer and normal tissues was analyzed by our selective criteria. As shown in Fig. 1, the Oncomine database contained a total of 404, 442, 447, 437, 286 and 450 unique studies involving the genes with GATA1 to GATA6, respectively. Interestingly, all GATA family members besides GATA3 were mostly downregulated in most kinds of cancers, with upregulated vs downregulated study numbers (GATA1 1:7; GATA2 6:39; GATA3 31:36; GATA4 5:18; GATA5 1:17; GATA6 16:37). In detail, the GATA3 mRNA expression level increased in 13 cases compared with one decrease in breast cancer. However, the other members did not show any significant validated study differences in breast cancer. In kidney cancer, GATA2, GATA3, GATA5 and GATA6 were mostly downregulated. In lung cancer, GATA2 and GATA6 were significantly downregulated genes in 17 and 15 studies, respectively. The RNA-Seq expression heatmap (Fig. 2A) showed that GATA3 was significantly increased in breast cancer tissues compared with normal tissues. Additionally, GATA3 expression in breast cancer was relatively higher than in any other normal tissue or cancer type in the pan-cancer profile.
GATA family expression in breast cancer
The boxplot of the RNA-Seq expression in 1085 breast cancer tissues vs 291 normal breast tissues demonstrated that GATA3 was significantly increased (Fig. 2B). For other GATA family members, no statistical significance was revealed in cancer tissues compared to normal tissues. These data indicated that the expression of GATA3 may have a major effect on breast cancers. In addition, we addressed the expression level of breast cancer subtypes from the TCGA database. Our results are also consistent with the RNA-Seq data in breast cancer in which, except for GATA3, all other GATA family members showed no significant difference compared with normal tissue control. GATA3 decreased only in apocrine, large cell neuroendocrine, metaplastic and pleomorphic breast cancer subtypes (Fig. 3C). These subtypes are known to have poor clinical outcomes, which indicates that GATA3 expression may be greatly involved in breast tumor malignancy. However, the case number of each rare subtype in the TCGA database is a limitation for accuracy estimation.
The GATA family prognosis analysis with a combined dataset
To date, little is known about the GATA family expression levels or their possible prognostic value, except for GATA3, in breast cancer. Therefore, we performed Kaplan-Meier survival analysis according to the median mRNA expression of GATA family members systematically by using bcGenExMiner v4.1. For GATA1, the median expression level for the pooled survival analysis could not be considered a significant marker for metastatic relapse-free survival with a total of 3875 breast cancer patients (HR = 1.07; 95% CI: 0.94–1.21, p-value = 0.2936, Fig. 4A). The median expression level of GATA2 (patients = 3630, HR = 1.10; 95% CI: 0.97–1.26, p-value = 0.1355, Fig. 4B), GATA4 (patients = 3952, HR = 1.12; 95% CI: 0.99–1.27, p-value = 0.0739, Fig. 4D), GATA5 (patients = 1862, HR = 1.14; 95% CI: 0.87–1.24, p-value = 0.6599, Fig. 4E) and GATA6 (patients = 3924, HR = 1.01; 95% CI: 0.90–1.15, p-value = 0.8389, Fig. 4F) also demonstrate the same negative results as GATA1. However, only GATA3 showed a positive effect in the survival analysis (patients = 4177, HR = 0.81; 95% CI: 0.72–0.92, p-value = 0.0007, Fig. 4C). The higher expression of GATA3 indicated a better survival time. A prognostic meta-analysis of the GATA family member results is shown in Fig. S1.
The differential expression of GATA3 in breast cancer molecular subtypes, PAM50 subtypes, ER status, tumor stage and grade
In breast cancer, only GATA3 could be considered a validated prognosis marker compared with the other GATA family members according to our results. Therefore, we further focused on the expression of GATA3 in different clinical parameters (Fig. 5A). Our results showed that GATA3 expression is lower in basal subtypes (n = 357) and higher in the luminal A subtype (n = 482). This expression difference is significant among the current molecular subtype classifications. For the PAM50 subtype analysis26, which is known to be a gene expression classifier of the intrinsic subtypes of breast cancer to assess both the prognostic and predictive values of adjuvant hormonal therapy in a study population of premenopausal women, we obtained the same results as the molecular subtypes. An ER-positive estrogen receptor status demonstrated a higher expression level of GATA3 (n = 1225). Interestingly, in tumor grade, grade 3 tumors showed a relatively lower expression compared with grade 2 and grade 1 tumors. However, there was no significant difference in GATA3 expression among clinical stages (Fig. 5B).
Increased GATA3 expression levels are significantly associated with longer RFS times in PR-positive breast cancer and lymph node metastasis
Our Kaplan-Meier analysis by KM-plotter showed that GATA3 expression is positively related to the RFS time if no clinical parameters are selected (Fig. 6A), which is consistent with our bcGenExMiner results. However, after we investigated the details of the patient group, we found that although GATA3 was higher in ER-positive breast cancer patients, GATA3 could not be considered an RFS prognosis marker for ER-positive patients (patients = 3951, HR = 0.94, 95% CI: 0.8–1.11, p-value = 0.46). Our results also demonstrated that an increased GATA3 mRNA expression was not significantly associated with a longer RFS time in the luminal A subtype (patients = 1933, HR = 0.9, 95% CI: 0.76–1.07, p = 0.23), luminal B subtype (patients = 1149, HR = 1.14, 95% CI: 0.94–1.38 p = 0.19) and basal subtype (patients = 618, HR = 0.85, 95% CI: 0.66–1.1, p = 0.22). Interestingly, the increased GATA3 expression level indicated a worse RFS survival outcome in the HER-2 group, although there was no significant difference (patients = 251, HR = 1.43, 95% CI: 0.97–2.1, p = 0.66). In particular, the subanalysis also revealed that an elevated GATA3 mRNA expression was related to a longer RFS time in the PR-positive patients (patients = 589, HR = 0.67, 95% CI: 0.47–0.95, p = 0.025, Fig. 6C) and the w/o lymph node metastasis group (Fig. 6H,I).
IHC expression pattern utilities for clinical outcome prediction
To validate the potential application of GATA3 in the clinic, we extracted the characterized IHC images from the Human Protein Atlas. Normal breast tissue staining of GATA3 showed uneven positive staining in the nucleus with a weak–strong mixed pattern (Fig. 7A). However, in breast cancer tissue, there were distinguishable patterns between the strong and weak expression in the nucleus (Fig. 7B,C).
Identification of the differentially expressed genes (DEGs) in GSE24249
R software with the limma package was applied to screen DEGs from the gene expression dataset GSE24249 between control vectors and overexpressed GATA3 genes in MDA-MB231 breast cancer cells. A total of 600 DEGs were identified from this dataset, 240 upregulated genes and 360 downregulated genes (Fig. 8B). The top 50 up/downregulated genes were plotted in a heatmap (Fig. 8A).
Biological classification and KEGG pathway enrichment analysis of DEGs
To identify the DEG functions, all DEGs were analyzed in the DAVID database. The top 10 significant biological process GO terms enriched by the regulated genes were the following: nitrobenzene metabolic process (GO:0018916); endothelial cell-cell adhesion (GO:0071603); xenobiotic catabolic process (GO:0042178); negative regulation of fibroblast growth factor receptor signaling pathway (GO:0040037); regulation of fibroblast growth factor receptor signaling pathway (GO:0040036); positive regulation of protein oligomerization (GO:0032461); response to fluid shear stress (GO:0034405); ventricular septum morphogenesis (GO:0060412); regulation of extracellular matrix organization (GO:1903053); and ureteric bud morphogenesis (GO:0060675) (Fig. 9A).
For KEGG pathway enrichment analysis (Fig. 9B), the DEGs were found to be primarily enriched in the following top 10 KEGG pathways: hsa04350: TGF-beta signaling pathway; hsa05200: pathways in cancer; hsa00980: metabolism of xenobiotics by cytochrome P450; hsa05219: bladder cancer; hsa04722: neurotrophin signaling pathway; hsa04360: axon guidance; hsa04210: apoptosis; hsa04514: cell adhesion molecules (CAMs); hsa04710: circadian rhythm; hsa05216: thyroid cancer; hsa03320: PPAR signaling pathway; hsa05014: amyotrophic lateral sclerosis (ALS); and hsa05412: arrhythmogenic right ventricular cardiomyopathy (ARVC). These significantly enriched GO terms and KEGG pathways could help us deeply understand the function of DEGs overexpressing GATA3, which is involved in the occurrence and development of breast cancer.
Gene set enrichment analysis (GSEA) of the GSE24249 dataset
Despite all DEGs, we further carried out GSEA analysis as GSEA considers experiments with genome-wide expression profiles from samples belonging to two classes (GATA3 vs Vector). The top six normalized enriched scores (NES) of GATA3 positive results were GO_GOLGI_ASSOCIATED_VESICLE_MEMBRANE, NES = 0.8148495; GO_CELL_CORTEX_REGION, NES = 1.36; GO_PRECATALYTIC_SPLICEOSOME, NES = 0.62; GO_NUCLEOLAR_PART, NES = 1.35; GO_MULTIVESICULAR_BODY, NES = 1.33; and GO_CATALYTIC_STEP_2_SPLICEOSOME, NES = 1.32 (Fig. 10A). In addition, the top six GATA3-negative GSEA results were GO_SMOOTH_ENDOPLASMIC_RETICULUM, NES = −1.46; GO_MYOSIN_II_COMPLEX, NES = −1.37; GO_MICROBODY_MEMBRANE, NES = −1.36; GO_ORGANELLE ENVELOPE_LUMEN, NES = −1.34; GO_MICROVILLUS, NES = −1.30; and GO_LAMELLIPODIUM, NES = −1.29 (Fig. 10B).
Deciphering the complex network of protein-protein interactions (PPI)
Protein-protein interactions help us explore molecular mechanisms. The interactions among the identified DEGs were analyzed by mapping with STRING with a combined score ≥0.4. The PPI network of the DEGs consisted of 531 nodes and 865 edges (Fig. 11A). Among these genes, TP53, SMAD3, CDH1, PPARG, and MAP3K5 showed the greatest degree in the PPI network (minimum required interaction score > 0.95), suggesting that GATA3 plays a key role in breast cancer malignancy transformation (Fig. 11B). The clusters by k-means show that OASL and OAS2 also participated in a small model involved in GATA3 functions. In addition, the following PPI functions were enriched: tissue development, FDR = 1.08e-07; regulation of locomotion, FDR = 1.72e-07; epithelium development, FDR = 1.72e-07; regulation of cell motility, FDR = 1.72e-07; regulation of cell migration, FDR = 1.84e-07; transcriptional misregulation in cancer, FDR = 0.00656; TNF signaling pathway, FDR = 0.0129; and pathways in cancer, FDR = 0.0242. These findings show that GATA3 is mainly involved in the development of breast cancer.
Discussion
In recent decades, scientists have made tremendous improvements in understanding breast cancer. The prognosis markers are also widely explored. Among the known indicators, the key molecular signatures were well characterized as ER, PR and HER-227. Based on these genes, breast cancer is classified into luminal A-, luminal B-, basal-like/triple negative- and Her-2-enriched subtypes28. Novel targets and signal pathways are continuing to emerge, such as Ezh229, ADAM830 and HMGA131. These findings significantly contribute to the pathogenesis and development of breast cancer. However, clinical validations are further required.
GATA family members are key modulatory proteins known to be transcription factors in controlling several pathways. Nevertheless, these GATA transcription factor-related pathways have not yet been fully elucidated. As an important transcription factor family, they are an ideal and attractive module for investigating novel therapies for breast cancer32. However, basic explorations have shown significant contradictions in the specific roles of different GATA family members in breast cancer biology.
In our present study, by integrating analyses through GEPIA, Oncomine, TCGA, bcGenExMiner, Kaplan-Meier plotter, and the Human Protein Atlas, we systemically depicted the expression profiles of each GATA family member in breast cancer, revealing that the GATA family has a remarkable mRNA expression difference between cancer and normal breast tissue. These results demonstrated that the GATA family played important roles in breast cancer development.
Thus far, GATA1 is believed to participate in acute megakaryocytic leukemia and Down syndrome33. Li Yan et al. reported that GATA1 could induce the EMT process by binding to the E-cadherin promoter, downregulating E-cadherin expression through PAK5 oncogenic signaling in breast cancer. GATA1 is also a promoter binder factor of peroxiredoxin 5 in human breast cancer cells through inhibiting apoptosis34. There is also a report that GATA1 was an independent poor prognosis marker in breast cancer35. The low expression of GATA1 and GATA2 is associated with the aggressiveness and poor outcome of clear cell renal cell carcinoma36. Additionally, GATA2 is identified as a poor prognosis marker in colorectal cancer, prostate cancer and hepatocellular cancer37–39. In breast cancer, GATA2 was found to be a key epigenetic regulator for G9a, which impacts breast cancer cell survival and tumorigenesis40. However, there is no report on the role of GATA2 in prognosis in breast cancer. Contractional reports on GATA3 for breast cancer are still available. An early report in 2008 with 3,119 cases showed that GATA3 was highly associated with ER expression but neither had an independent prognostic value nor was useful as a prediction marker for the effect of tamoxifen in ER-positive patients41. However, other studies demonstrated that GATA3 was an independent favorable prognosticator42,43. There was also a report that showed that high GATA3 expression was associated with an unfavorable prognosis in breast cancer patients7. Inconsistent investigations on the protective effect of GATA3 were also reported44–46. The potential explanation for this phenomenon may be that GATA3 acts differently in various breast cancer subtypes. Further investigations on the GATA3 mechanism are still warranted. Kiyoshi et al. reported that the status of GATA4, but not other GATA family members, including GATA3, was an independent prognostic factor for the disease-free and breast cancer-specific survival of invasive ductal carcinoma patients47. However, our results are not consistent with their conclusion. ERBB2 is a key regulator in breast cancer, and the strong negative feedback association between GATA4 and ERBB2 may contribute to the transcriptional dysregulation of ERBB2 gene expression in breast cancer48. The promoter hypermethylation of GATA5 is believed to participate in invasive breast cancer development49. Interestingly, the dual role of GATA5 in the activation of the progesterone receptor gene promoter contributes to the susceptibility of breast cancer50. However, GATA5 has not been widely investigated in breast cancer. GATA6 is reported to be overexpressed in breast cancer, consistent with our pan-cancer analysis. The epithelial-mesenchymal transition (EMT) process was under the regulation of GATA6 via upregulating Slug expression. However, there is only one report showing that GATA6 overexpression could be considered an independent prognostic marker with a favorable outcome in breast cancer51. Our research result for GATA6 is inconsistent with their report. Interestingly, in digestive system cancers, such as esophageal adenocarcinoma, pancreatic cancer, colorectal cancer or cholangiocarcinoma, GATA6 could be used as a prognostic tumor marker for different mechanisms52–55. These research findings reveal the important role of GATA6 in digestive system malignancy progression.
To clarify our research focus, we summaries our study approaches and workflow in Fig. S2. Our findings from the integrative databases and bioinformatics analysis of this study suggest that GATA3, but not the other GATA family members, might be a potential prognostic biomarker and target for new therapies for breast cancer. However, the different expressions of the GATA family members in pan-cancer analysis reveal that the GATA family plays an important role in malignancy transformation. In addition, GATA3 may mainly interact with TP53, SMAD3, and CDH1 genes to regulate endothelial cell-cell adhesion by the FGF and TGF pathways. However, more investigations need to be applied to fully reveal the role of GATA3 in breast cancer for further translational study.
Supplementary information
Acknowledgements
This project was supported by the National Natural Science Foundation of China (No.81572472 and No.81773161 to Mian Guo; No.81670459 to Maomao Zhang; No.81872135 to Baoliang Guo), the Research Project of Health Commission of Heilongjiang Province (Grant No.2016-046 to Chao Li) and the Heilongjiang Postdoctoral Grant (Grant No. LBH-Z16228 to Fei Ye). We also thank Professor Litao Sun, Department of Ultrasound, Second Affiliated Hospital of Harbin Medical University.
Author Contributions
S.Y., X.J., J.L. and C.L. contributed to the data acquisition, analysis and manuscript draft; M.G., F.Y. and M.Z. prepared the Figures; Y.J. and B.G. contributed to the study design. The final manuscript was reviewed and approved by all listed authors.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information accompanies this paper at 10.1038/s41598-019-40811-3.
References
- 1.Ko LJ, Engel JD. DNA-binding specificities of the GATA transcription factor family. Mol Cell Biol. 1993;13:4011–4022. doi: 10.1128/MCB.13.7.4011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Merika M, Orkin SH. DNA-binding specificity of GATA family transcription factors. Mol Cell Biol. 1993;13:3999–4010. doi: 10.1128/MCB.13.7.3999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Fujiwara Y, Chang AN, Williams AM, Orkin SH. Functional overlap of GATA-1 and GATA-2 in primitive hematopoietic development. Blood. 2004;103:583–585. doi: 10.1182/blood-2003-08-2870. [DOI] [PubMed] [Google Scholar]
- 4.Li Y, et al. GATA1 induces epithelial-mesenchymal transition in breast cancer cells through PAK5 oncogenic signaling. Oncotarget. 2015;6:4345–4356. doi: 10.18632/oncotarget.2999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rodriguez-Bravo V, et al. The role of GATA2 in lethal prostate cancer aggressiveness. Nat Rev Urol. 2017;14:38–48. doi: 10.1038/nrurol.2016.225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Cakir A, et al. GATA3 expression and its relationship with clinicopathological parameters in invasive breast carcinomas. Pathol Res Pract. 2017;213:227–234. doi: 10.1016/j.prp.2016.12.010. [DOI] [PubMed] [Google Scholar]
- 7.Guo Y, et al. Prognostic and clinicopathological value of GATA binding protein 3 in breast cancer: A systematic review and meta-analysis. PLoS One. 2017;12:e0174843. doi: 10.1371/journal.pone.0174843. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Klijanienko J, Caly M, Frenaux P, Klos J. GATA3 differential expression in neuroblastoma and nephroblastoma. Cancer Cytopathol. 2018;126:215–216. doi: 10.1002/cncy.21952. [DOI] [PubMed] [Google Scholar]
- 9.Zheng R, Blobel GA. GATA Transcription Factors and Cancer. Genes Cancer. 2010;1:1178–1188. doi: 10.1177/1947601911404223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tang Z, et al. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 2017;45:W98–W102. doi: 10.1093/nar/gkx247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jezequel P, et al. bc-GenExMiner: an easy-to-use online platform for gene prognostic analyses in breast cancer. Breast Cancer Res Treat. 2012;131:765–775. doi: 10.1007/s10549-011-1457-7. [DOI] [PubMed] [Google Scholar]
- 12.Jezequel P, et al. bc-GenExMiner 3.0: new mining module computes breast cancer gene expression correlation analyses. Database (Oxford) 2013;2013:bas060. doi: 10.1093/database/bas060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ringner M, Fredlund E, Hakkinen J, Borg A, Staaf J. GOBO: gene expression-based outcome for breast cancer online. PLoS One. 2011;6:e17911. doi: 10.1371/journal.pone.0017911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lanczky A, et al. miRpower: a web-tool to validate survival-associated miRNAs utilizing expression data from 2178 breast cancer patients. Breast Cancer Res Treat. 2016;160:439–446. doi: 10.1007/s10549-016-4013-7. [DOI] [PubMed] [Google Scholar]
- 15.Thul, P. J. et al. A subcellular map of the human proteome. Science356, 10.1126/science.aal3321 (2017). [DOI] [PubMed]
- 16.Barrett T, et al. NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Res. 2013;41:D991–995. doi: 10.1093/nar/gks1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Chu IM, et al. GATA3 inhibits lysyl oxidase-mediated metastases of human basal triple-negative breast cancer cells. Oncogene. 2012;31:2017–2027. doi: 10.1038/onc.2011.382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ashburner M, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mi H, et al. PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. 2017;45:D183–D189. doi: 10.1093/nar/gkw1138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
- 21.Huang da W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13. doi: 10.1093/nar/gkn923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Szklarczyk D, et al. STRINGv10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43:D447–452. doi: 10.1093/nar/gku1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Subramanian A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mootha VK, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34:267–273. doi: 10.1038/ng1180. [DOI] [PubMed] [Google Scholar]
- 26.Chia SK, et al. A 50-gene intrinsic subtype classifier for prognosis and prediction of benefit from adjuvant tamoxifen. Clin Cancer Res. 2012;18:4465–4472. doi: 10.1158/1078-0432.CCR-12-0286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dai X, Xiang L, Li T, Bai Z. Cancer Hallmarks, Biomarkers and Breast Cancer Molecular Subtypes. J Cancer. 2016;7:1281–1294. doi: 10.7150/jca.13141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dai X, et al. Breast cancer intrinsic subtype classification, clinical use and future trends. Am J Cancer Res. 2015;5:2929–2943. [PMC free article] [PubMed] [Google Scholar]
- 29.Pourakbar S, Pluard TJ, Accurso AD, Farassati F. Ezh2, a novel target in detection and therapy of breast cancer. Onco Targets Ther. 2017;10:2685–2687. doi: 10.2147/OTT.S138777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Conrad C, et al. ADAM8 expression in breast cancer derived brain metastases: Functional implications on MMP-9 expression and transendothelial migration in breast cancer cells. Int J Cancer. 2018;142:779–791. doi: 10.1002/ijc.31090. [DOI] [PubMed] [Google Scholar]
- 31.Mendez, O. et al. Extracellular HMGA1 Promotes Tumor Invasion and Metastasis in Triple-Negative Breast Cancer. Clin Cancer Res, 10.1158/1078-0432.CCR-18-0517 (2018). [DOI] [PubMed]
- 32.Lin HY, Zeng, Liang YK, Wei XL, Chen CF. GATA3 and TRPS1 are distinct biomarkers and prognostic factors in breast cancer: database mining for GATA family members in malignancies. Oncotarget. 2017;8:34750–34761. doi: 10.18632/oncotarget.16160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Caldwell JT, Ge Y, Taub JW. Prognosis and management of acute myeloid leukemia in patients with Down syndrome. Expert Rev Hematol. 2014;7:831–840. doi: 10.1586/17474086.2014.959923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Seo MJ, Liu X, Chang M, Park JH. GATA-binding protein 1 is a novel transcription regulator of peroxiredoxin 5 in human breast cancer cells. Int J Oncol. 2012;40:655–664. doi: 10.3892/ijo.2011.1236. [DOI] [PubMed] [Google Scholar]
- 35.Zhang Y, et al. The transcription factor GATA1 and the histone methyltransferase SET7 interact to promote VEGF-mediated angiogenesis and tumor growth and predict clinical outcome of breast cancer. Oncotarget. 2016;7:9859–9875. doi: 10.18632/oncotarget.7126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Peters I, et al. Decreased mRNA expression of GATA1 and GATA2 is associated with tumor aggressiveness and poor outcome in clear cell renal cell carcinoma. Target Oncol. 2015;10:267–275. doi: 10.1007/s11523-014-0335-8. [DOI] [PubMed] [Google Scholar]
- 37.Li YW, et al. Decreased expression of GATA2 promoted proliferation, migration and invasion of HepG2 in vitro and correlated with poor prognosis of hepatocellular carcinoma. PLoS One. 2014;9:e87505. doi: 10.1371/journal.pone.0087505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Xu K, et al. GATA binding protein 2 overexpression is associated with poor prognosis in KRAS mutant colorectal cancer. Oncol Rep. 2016;36:1672–1678. doi: 10.3892/or.2016.4961. [DOI] [PubMed] [Google Scholar]
- 39.Robinson JL, et al. GATA2 expression and biochemical recurrence following salvage radiation therapy for relapsing prostate cancer. Br J Radiol. 2017;90:20170174. doi: 10.1259/bjr.20170174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Casciello F, et al. G9a drives hypoxia-mediated gene repression for breast cancer cell survival and tumorigenesis. Proc Natl Acad Sci USA. 2017;114:7077–7082. doi: 10.1073/pnas.1618706114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Voduc D, Cheang M, Nielsen T. GATA-3 expression in breast cancer has a strong association with estrogen receptor but lacks independent prognostic value. Cancer Epidemiol Biomarkers Prev. 2008;17:365–373. doi: 10.1158/1055-9965.EPI-06-1090. [DOI] [PubMed] [Google Scholar]
- 42.Mehra R, et al. Identification of GATA3 as a breast cancer prognostic marker by global gene expression meta-analysis. Cancer Res. 2005;65:11259–11264. doi: 10.1158/0008-5472.CAN-05-2495. [DOI] [PubMed] [Google Scholar]
- 43.Yoon NK, et al. Higher levels of GATA3 predict better survival in women with breast cancer. Hum Pathol. 2010;41:1794–1801. doi: 10.1016/j.humpath.2010.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Albergaria A, et al. Expression of FOXA1 and GATA-3 in breast cancer: the prognostic significance in hormone receptor-negative tumours. Breast Cancer Res. 2009;11:R40. doi: 10.1186/bcr2327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ciocca V, Daskalakis C, Ciocca RM, Ruiz-Orrico A, Palazzo JP. The significance of GATA3 expression in breast cancer: a 10-year follow-up study. Hum Pathol. 2009;40:489–495. doi: 10.1016/j.humpath.2008.09.010. [DOI] [PubMed] [Google Scholar]
- 46.Fang SH, Chen Y, Weigel RJ. GATA-3 as a marker of hormone response in breast cancer. J Surg Res. 2009;157:290–295. doi: 10.1016/j.jss.2008.07.015. [DOI] [PubMed] [Google Scholar]
- 47.Takagi K, et al. GATA4 immunolocalization in breast carcinoma as a potent prognostic predictor. Cancer Sci. 2014;105:600–607. doi: 10.1111/cas.12382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hua G, et al. A negative feedback regulatory loop associates the tyrosine kinase receptor ERBB2 and the transcription factor GATA4 in breast cancer cells. Mol Cancer Res. 2009;7:402–414. doi: 10.1158/1541-7786.MCR-08-0175. [DOI] [PubMed] [Google Scholar]
- 49.Wang D, et al. Promoter hypermethylation may be an important mechanism of the transcriptional inactivation of ARRDC3, GATA5, and ELP3 in invasive ductal breast carcinoma. Mol Cell Biochem. 2014;396:67–77. doi: 10.1007/s11010-014-2143-y. [DOI] [PubMed] [Google Scholar]
- 50.Huggins GS, Wong JY, Hankinson SE, De Vivo I. GATA5 activation of the progesterone receptor gene promoter in breast cancer cells is influenced by the +331G/A polymorphism. Cancer Res. 2006;66:1384–1390. doi: 10.1158/0008-5472.CAN-05-2715. [DOI] [PubMed] [Google Scholar]
- 51.Song Y, et al. GATA6 is overexpressed in breast cancer and promotes breast cancer cell epithelial-mesenchymal transition by upregulating slug expression. Exp Mol Pathol. 2015;99:617–627. doi: 10.1016/j.yexmp.2015.10.005. [DOI] [PubMed] [Google Scholar]
- 52.Martinelli P, et al. GATA6 regulates EMT and tumour dissemination, and is a marker of response to adjuvant chemotherapy in pancreatic cancer. Gut. 2017;66:1665–1676. doi: 10.1136/gutjnl-2015-311256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Shen F, et al. GATA6 predicts prognosis and hepatic metastasis of colorectal cancer. Oncol Rep. 2013;30:1355–1361. doi: 10.3892/or.2013.2544. [DOI] [PubMed] [Google Scholar]
- 54.Pavlov K, et al. GATA6 expression in Barrett’s oesophagus and oesophageal adenocarcinoma. Dig Liver Dis. 2015;47:73–80. doi: 10.1016/j.dld.2014.09.014. [DOI] [PubMed] [Google Scholar]
- 55.Tian F, et al. Aberrant expression of GATA binding protein 6 correlates with poor prognosis and promotes metastasis in cholangiocarcinoma. Eur J Cancer. 2013;49:1771–1780. doi: 10.1016/j.ejca.2012.12.015. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.