Abstract
This study aimed to identify key genes involved in the progression of diabetic pancreatic ductal adenocarcinoma (PDAC). Two gene expression datasets (GSE74629 and GSE15932) were obtained from Gene Expression Omnibus. Then, differentially expressed genes (DEGs) between diabetic PDAC and non-diabetic PDAC were identified, followed by a functional analysis. Subsequently, gene modules related to DM were extracted by weighed gene co-expression network analysis. The protein-protein interaction (PPI) network for genes in significant modules was constructed and functional analyses were also performed. After that, the optimal feature genes were screened by support vector machine (SVM) recursive feature elimination and SVM classification model was built. Finally, survival analysis was conducted to identify prognostic genes. The correlations between prognostic genes and other clinical factors were also analyzed. Totally, 1546 DEGs with consistent change tendencies were identified and functional analyses showed they were strongly correlated with metabolic pathways. Furthermore, there were two significant gene modules, in which RPS27A and UBA52 were key genes. Functional analysis of genes in two gene modules revealed that these genes primarily participated in oxidative phosphorylation pathway. Additionally, 21 feature genes were closely related with diabetic PDAC and the corresponding SVM classifier markedly distinguished diabetic PDAC from non-diabetic PDAC patients. Finally, decreased KIF22 and PYGL levels had good survival outcomes for PDAC. Four genes (RPS27A, UBA52, KIF22 and PYGL) might be involved in the pathogenesis of diabetic PDAC. Furthermore, KIF22 and PYGL acted as prognostic biomarkers for diabetic PDAC.
Keywords: diabetes mellitus, pancreatic ductal adenocarcinoma, meta-analysis, support vector machine, survival analysis
Introduction
Pancreatic cancer (PC) is the third leading cause of cancer-associated mortality around the world. Compelling evidence has suggested that PC is dominated by pancreatic ductal adenocarcinoma (PDAC) which accounts for approximately 95% of PC, and PDAC is an aggressive tumor with high incidence and metastasis rates [1, 2]. It is reported that dietary factors are primary causes for carcinogenesis [3]. Moreover, numerous epidemiological and cohort studies have indicate that diabetes mellitus (DM) is a risk factor for PDAC progression [4, 5]. Huxley et al found that patients diagnosed with DM (<4 years) have 50% higher risk of PC than those who suffered from DM ≥ 5 years [6]. Kleeff et al evaluated the correlations between clinical factors and DM in patients with PC, and they observed that diabetic patients who received PC resection and adjuvant therapy had a larger tumor size and a higher death risk than non-diabetic patients [7]. Moreover, the new-onset DM is predominately correlated with early recurrence rate in PC patients undergoing resection, implying new-onset DM might be an important clinical manifestation for PC and new-onset DM detection might be helpful for early diagnosis for PC [8]. Additionally, many researchers have also argued that PDAC could cause DM, such as type 3C diabetes [9]. Therefore, the underlying association between PDAC and DM is complicated due to the presence of a bidirectional link.
Encouragingly, a growing number of studies have focused on exploring the underlying molecular mechanisms of diabetic PDAC. Sun et al noted that transgelin-2 encoded by TAGLN2 was significantly up-regulated in PDAC tissues and in a subgroup of PDAC patients suffering from DM, suggesting transgelin-2 was possibly implicated in the development of DM coexisting with PDAC [10]. Boursi et al constructed a clinical prediction model based on several risk factors for DM to evaluate PC risk among those individuals with new-onset diabetes [11]. Besides, an early research demonstrated that the expression levels of VNN1 and MMP9 were elevated in patients with PC-associated DM and these two genes could well discriminate PC-related DM from type 2 diabetes by using a microarray analysis [12]. Later on, investigators found that VNN1 overexpression in PC-associated new-onset DM aggravated paraneoplastic islet dysfunction by the increase of oxidative stress base on the laboratory research [13]. However, an integrated analysis for identifying the potential biomarkers involved in diabetic PDAC has not been performed.
Therefore, we conducted an integrated meta-analysis for gene expression profiles of diabetic PDAC to screen novel therapeutic targets for diabetic PDAC. Differentially expressed genes (DEGs) were firstly identified between diabetic PDAC and non-diabetic PDAC patients. Then, functional analyses were conducted to explore the underlying roles of DM-related genes on PDAC progression. Finally, prognosis-associated genes were further extracted by survival analysis.
Materials and Methods
Data Acquisition and Pre-processing
Gene expression datasets were searched with the keywords of “pancreatic adenocarcinoma,” “diabetes” and “homo sapiens”, and eligible datasets were downloaded from Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo/) database [14]. The selection criteria for microarray datasets were as follows: 1) whole genome expression data of peripheral blood samples; 2) all samples with relevant DM information; and 3) dataset with the sample size not less than 15. Consequently, there were two datasets (GSE74629 and GSE15932) available after dataset screening. The platform for GSE74629 was Illumina HumanHT-12 V4.0 expression beadchip, which was comprised of 36 samples from PDAC patients (14 patients with diabetes and 22 patients without diabetes). Raw TXT files of GSE74629 were obtained and probes were converted into gene symbols by using the platform annotation files. When multiple probes were mapped to the same gene symbol, average value of different probes was considered as the final gene expression level. For GSE15932, there were 16 samples from PDAC patients, among whom there were 8 PDAC patients complicated with diabetes and 8 PDAC patients not complicated with diabetes. GSE15932 was based on the Affymetrix Human Genome U133 Plus 2.0 Array platform and the original Affymetrix CEL files were downloaded. Raw data of these two datasets were pre-processed with oligo [16] package (Version 3.6; http://www.bioconductor.org/packages/release/bioc/html/oligo.html) in R 3.4.1, including imputing missing data with median values, background correction by using MicroArray Suite (MAS) method, and quantile normalization. Then, gene expression values were subjected to log2 transformation with Limma (Version 3.34.0; https://bioconductor.org/packages/release/bioc/html/limma.html) package [15] in R 3.4.1 to ensure normal distribution.
DEGs Identification and Functional Analyses
MetaDE package (https://cran.r-project.org/web/packages/MetaDE) in R 3.4.1 was utilized to eliminate statistical deflection for integrating two datasets from different sources [17]. Briefly, heterogeneity test of each gene expression value in different platform was firstly performed based on three parameters (tau2, Q value, and Q pval). Generally, subjects were considered as homogenous when tau2 was 0. Meanwhile, when Q value was subjected to chi-square test with K-1 freedom and Q pval value was greater than 0.05, the study subjects was also homogeneous without bias. Then, gene expression differences between DM and non-DM group in integrated dataset were estimated by p value, which was further adjusted into false discovery rate (FDR) by algorithm in MetaDE package. The tau2 = 0 and Q pval >0.05 were set as thresholds of homogeneous test, and genes with FDR <0.05 was regarded as significantly differentially expressed in inter-group comparison. According to the calculated log2fold change (FC), DEGs with consistent expression patterns in two datasets were remained for the following functional and pathway enrichment analyses. Database for Annotation Visualization and Integrated Discovery (DAVID) consists of an integrated biological knowledgebase and analytic tools that aimed at systematically extracting biological meaning from large gene/protein lists [18, 19]. Thus, DAVID (Version 6.8, https://david.ncifcrf.gov/) was employed to conduct Gene Ontology-biological process (GO-BP) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses, and p value <0.05 was used as the cutoff level of significant enrichment.
Co-Expression Module Analysis
Weighed gene co-expression network analysis (WGCNA) has been successfully applied in discovery of interest modules and identification of key genes in modules. GSE74629 with a larger sample size was acted as a training dataset and GSE15932 was considered as a verification dataset in this study. We used WGCNA (Version 1.61; https://cran.r-project.org/web/packages/WGCNA/index.html) to extract the significantly extracted stable gene modules related to DM [20]. The thresholds of gene modules screening were set as the number of genes in modules ≥80 and cutHeight = 0.995.
Protein-Protein Interaction (PPI) Network Analysis
Protein-protein interactions of genes in significant modules were revealed by Search Tool for the Retrieval of Interacting Genes (STRING) [21] database (Version 10.0; https://string-db.org/), which provides a critical assessment and integration of protein-protein interactions. The revealed protein-protein interactions were used to construct PPI network which was visualized with Cytoscape (version 3.6.1; http://www.cytoscape.org/) [22]. In addition, functional analyses (the GO-BP analysis and KEGG pathway analysis) of genes in constructed PPI network were carried out by using DAVID (Version 6.8, https://david.ncifcrf.gov/) [18, 19], with the cut-off threshold of p value <0.05.
Optimization of Feature Genes and SVM Classifier Construction
Support vector machine recursive feature elimination (SVM-RFE) is a feature-selection method by iteratively ranking features and removing the lowest features [23]. Herein, GSE74629 was used as the training dataset while GSE15932 served as the validation dataset. We employed the RFE algorithm of caret package (Version 6.0-76; https://cran.r-project.org/web/packages/caret) in R 3.4.1 to identify the optimal feature gene set which had the highest accuracy in 10-fold cross validation test [24]. To further extract the key genes involved in diabetic DM, a SVM-based classifier was built by SVM method of e1071 package (version 1.6-8; https://cran.r-project.org/web/packages/e1071) in R 3.4.1 with optimal feature genes based on core of sigmoid kernel and 100-fold cross validation [25]. Furthermore, performance evaluation of SVM classifier was performed in training and validation datasets, respectively. Receiver operating characteristic (ROC) curve was constructed and the area under ROC (AUROC) value was calculated. Here, the pROC package (version 1.12.1; https://cran.r-project.org/web/packages/pROC/index.html) in R 3.4.1 was used to calculate several indicators for ROC curve, including sensitivity (Sen), specificity (Spe), positive predictive value (PPV) and negative predictive value (NPV) \ Sensitivity = true positive/(true positive + false negative); Specificity = true negative/(false positive + true negative); PPV = true positive/(true positive + false positive); NPV = true negative/(true negative + false negative) [26].
Survival Analysis
Public gene expression profiles of PC were downloaded from The Cancer Genome Atlas (TCGA) database, and gene expression profiles of 150 patients with PC were obtained. Gene expression profiles were measured experimentally with the Illumina HiSeq 2000 RNA Sequencing platform by the University of North Carolina TCGA genome characterization center. Level 3 data was downloaded from TCGA data coordination center. This dataset shows the gene-level transcription estimates, as in log2(x+1) transformed RSEM normalized count. Meanwhile, the corresponding clinical information was also downloaded and obtained. There were 114 patients had clinical information about DM, including 33 diabetic PDAC patients and 81 non-diabetic PDAC patients. In order to identify prognostic genes, univariate cox regression analysis was performed between features gene in SVM classifier and clinical survival by survival (Version 2.41-1; http://bioconductor.org/packages/survivalr/) package in R 3.4.1 [27]. All the 114 samples were divided into high and low risk groups based on the expression levels of prognostic genes, with the cutoff threshold of median expression level calculated from all the 114 samples (high risk group: expression level > median expression level; low risk group: expression level < median expression level). Survival analysis was conducted and Kaplan-Meier (KM) survival curves were generated. Finally, univariate cox regression analyses of prognostic genes with other clinical parameters (age, gender, history of chronic pancreatitis, history of diabetes, alcohol history, neoplasm_histologic grade, pathologic M, pathologic N, pathologic T, and pathologic stage) were performed by using glm function in R software.
Results
DEGs Screening and Functional Analyses
After data pre-processing of two datasets, a total of 1546 DEGs with consistent change patterns between diabetic PDAC patients and non-diabetic PDAC patients were uncovered. Moreover, bidirectional hierarchical clustering analysis showed that these genes were dramatically differentially expressed and their differential expressions were consistent in each dataset, implying that they had similar expression pattern in two datasets (Supplementary Figure S1). Additionally, GO-BP analysis indicated that these DEGs were significantly enriched in 27 GO-BP terms, such as translational initiation, fibroblast growth factor receptor signaling pathway and regulation of glucose transport (Figure 1A, Supplementary Table S1). Simultaneously, there were 9 markedly enriched KEGG pathways for these DEGs and vast majority of these DEGs were mainly involved in metabolic pathways (Figure 1B, Supplementary Table S1).
Identification of Significant Gene Modules by WGCNA
In order to examine whether gene expression levels in each dataset had comparability, we performed consistency analyses for the expressions of overlapping genes in two datasets. Results suggested that gene expression correlation and network node connection correlation were remarkably positive for training and validation sets (Supplementary Figure S2A). Notably, the scale-free network distribution was a prerequisite for WGCNA algorithm. Therefore, we calculated the square of the correlation coefficient (log(k) and log(p(k)) for the weight parameter of adjacency matrix (power parameter) under different values (Supplementary Figure S2B). We observed that the average connectivity of genes was 1 when power parameter was 10, indicating it has scale-free network characteristics (Supplementary Figure S2B). Herein, we obtained 9 gene modules associated with DM status by a co-expression network analysis with training set GSE74629 (Figure 2A). Meanwhile, module division was also carried out in validation set (GSE15932) as displayed in Figure 2A. The correlation analysis between gene modules and DM status was showed in Figure 2B. Four gene modules exhibited negative correlations with DM status (correlation coefficient <0) while five modules had a positive correlation with DM status (correlation coefficient >0). Finally, we evaluated the stability of gene modules. In general, a higher value of preservation Z score represents better module stability. More specifically, the module was stable with 5 < Z score <10 while the module had a good robustness with Z score >10. Our results demonstrated that blue and turquoise modules showed a good stability with preservation Z score >5 and p value ≤0.05 (Table 1). Moreover, we found that blue module had a negative correlation with DM status while there was a positive correlation between turquoise module and DM status (Figure 2B). Thus, the genes in these two modules (206 genes in blue module and 275 genes in turquoise module) were used for the following analysis.
TABLE 1.
ID | Color | Module size | Preservation | ||
---|---|---|---|---|---|
Z-score | Cor | p value | |||
Module 1 | black | 93 | 1.792942 | 0.12 | 0.25 |
Module 2 | blue | 206 | 8.285475 | 0.31 | 5.80E-06 |
Module 3 | brown | 163 | 3.119733 | 0.08 | 0.31 |
Module 4 | green | 132 | 3.42939 | 0.26 | 0.0026 |
Module 5 | grey | 323 | 8.777289 | -0.53 | 8.60E-25 |
Module 6 | pink | 88 | 2.10382 | -0.06 | 0.58 |
Module 7 | red | 106 | 2.981962 | 0.16 | 0.1 |
Module 8 | turquoise | 275 | 12.063853 | 0.27 | 5.60E-06 |
Module 9 | yellow | 160 | 7.446963 | 0.071 | 0.37 |
In general, a higher value of preservation Z score represents better module stability. The module was considered as stable with 5 < Z score <10 while the module had a good robustness with Z score >10. Cor: gene expression correlation.
PPI Network
A PPI network of genes in blue and turquoise modules related to DM status was constructed based on STRING database. There were 214 gene nodes and 701 protein-protein interaction pairs (Figure 3A). Moreover, RPS27A (ribosomal protein S27a) and UBA52 (ubiquitin A-52 residue ribosomal protein fusion product 1) with a relatively higher degree were key genes in PPI network. Besides, functional analyses revealed that genes in PPI network were significantly related to 19 GO-BP terms, which were closely associated with translation-related terms, such as SRP-dependent cotranslational protein targeting to membrane and translational initiation process (Figure 3B, Supplementary Table S2). Meanwhile, three significant KEGG pathways were enriched for genes in PPI network, including ribosome, spliceosome, and oxidative phosphorylation OXPHOS pathways (Figure 3B, Supplementary Table S2).
SVM Classification Analysis
To further extract the optimal feature gene set, the number of feature genes in PPI network was reduced to 21 by RFE method with the max accuracy of 0.863 (Supplementary Figure S3 and Table 2). Notably, down-regulated RPS27A and UBA52 were also belonged to the key feature gene set. After that, a SVM-based classifier was constructed with 21 feature genes in training and validation datasets to identify DM status (Figure 4). The performance evaluation of SVM classifier was then undertaken. Results suggested that SVM-based classifier significantly differentiated diabetic PDAC patients from non-diabetic PDAC patients in training set GSE74629 based on several assessment indicators (AUC = 0.994, sensitivity = 0.923, specificity = 0.913, PPV = 0.857, NPV = 0.954; Figure 4A). Furthermore, this classifier also could effectively distinguish diabetic and non-diabetic PDAC patients in validation set GSE15932 according to multiple assessment indexes (AUC = 0.974, sensitivity = 0.857, specificity = 0.778, PPV = 0.750, NPV = 0.875; Figure 4B). Taken together, our findings revealed that 21 feature genes had good discrimination ability for DM status and they might participate in the pathogenesis of diabetic PDAC.
TABLE 2.
Gene | p value | FDR | GSE15932- | Log2 FC | GSE74629- | Log2 FC |
---|---|---|---|---|---|---|
TRAPPC4 | 1.93E-04 | 0.008 | down | −0.01614 | down | −0.00508 |
SLC17A6 | 2.96E-06 | 0.000 | down | −0.01824 | down | −0.02077 |
TRAPPC6A | 1.84E-04 | 0.007 | down | −0.03159 | down | −0.04112 |
KRTCAP2 | 2.23E-04 | 0.009 | down | −0.02173 | down | −0.00512 |
THOC5 | 4.43E-04 | 0.018 | down | −0.01558 | down | −0.05874 |
UBA52 | 1.58E-04 | 0.006 | down | −0.01686 | down | −0.03055 |
NUP35 | 1.12E-03 | 0.045 | down | −0.07168 | down | −0.01036 |
KIF22 | 3.05E-04 | 0.012 | down | −0.02743 | down | −0.03115 |
RPS27A | 1.24E-04 | 0.005 | down | −0.08765 | down | −0.01318 |
ATP6V0C | 1.16E-03 | 0.047 | down | −0.05715 | down | −0.02371 |
RPL26 | 6.81E-05 | 0.003 | down | −0.02249 | down | −0.02505 |
DPYSL2 | 1.62E-04 | 0.007 | up | 0.01931 | up | 0.01977 |
SLC39A5 | 2.03E-04 | 0.008 | up | 0.03015 | up | 0.03403 |
XAB2 | 3.08E-04 | 0.012 | up | 0.01209 | up | 0.05501 |
BZW2 | 1.63E-04 | 0.007 | up | 0.02933 | up | 0.00486 |
GPR27 | 1.06E-03 | 0.043 | up | 0.05109 | up | 0.00880 |
F11R | 6.16E-04 | 0.025 | up | 0.01936 | up | 0.00782 |
DNTTIP1 | 3.57E-05 | 0.001 | up | 0.04530 | up | 0.07930 |
PYGL | 1.19E-05 | 0.000 | up | 0.08198 | up | 0.02152 |
FBXO2 | 2.47E-04 | 0.010 | up | 0.01414 | up | 0.00816 |
SAP30L | 1.95E-04 | 0.008 | up | 0.01824 | up | 0.03573 |
FC: fold change; FDR: false discovery rate.
Survival Analysis
Gene expression data of 150 PC patients were obtained. Totally, 114 patients had DM clinical information, of which, 33 subjects were diabetic PC patients, and 81 patients exhibited non-diabetic PC. SVM classification was verified by gene expression data from 114 patients and results suggested that this classifier could discriminate the diabetic from non-diabetic patients with PC based on multiple indicators (AUC = 0.924, sensitivity = 0.848, specificity = 0.938, PPV = 0.848, and NPV = 0.938; Supplementary Figure S4). In addition, univariate cox regression showed that down-regulated KIF22 (kinesin family member 22) and up-regulated PYGL (glycogen phosphorylase L) were dramatically associated with prognosis of PC. Moreover, lower expression levels of KIF22 (p = 2.004e-02) and PYGL (p = 3.321e-03) were strongly correlated with favorable survival outcomes according to the KM curves (Figure 5). Finally, the correlations between these two genes and other clinical factors were also evaluated. We found that KIF22 was significantly associated with age, history of diabetes, alcohol history and neoplasm_histologic grade, while PYGL was markedly correlated with neoplasm_histologic grade (p < 0.05; Supplementary Table S3).
Discussion
Many efforts have been made toward molecular genetics of PDAC over recent decades and researches have currently demonstrated that OXPHOS plays a central role in cancer cell energy provision rather than glycolysis [28–30]. For example, Ashton et al argued that OXPHOS level was up-regulated in several cancers, such as PDAC [31]. Viale et al found that surviving PDAC cells driven by Ras heavily relied on OXPHOS according to a transcriptomic and metabolic analysis [32]. Moreover, Zhou et al pointed out that inhibition of OXPHOS by drug metformin could increase apoptosis and induce cell cycle arrest in PDAC cells [33]. Herein, we performed functional enrichment analyses for genes in two gene modules associated with DM and found that many genes were significantly enriched in mitochondrial OXPHOS pathway, implying that OXPHOS might be implicated with the pathological mechanism of diabetic PDAC. Notably, the progression of DM was predominately related to the accumulation of damaged mitochondria in pancreatic β cells which secreted sufficient amounts of insulin [34]. Recently, Haythorne et al also emphasized that DM could trigger metabolic changes in pancreatic β-cells, such as remarkable reduction of OXPHOS-correlated pathways [35]. Therefore, detailed roles of OXPHOS in the development of diabetic PDAC still need to be elaborated in the future.
In this study, we extracted 21 feature genes and established a SVM-based classifier which had a good discrimination ability between diabetic PDAC and non-diabetic PDAC patients in training and validation datasets. Moreover, this classifier also could also differentiate diabetic PDAC from non-diabetic PDAC patients in an external dataset extracted from TCGA database. These genes might participate in the progression of diabetic PDAC. Moreover, we noted that two down-regulated feature genes (RPS27A and UBA52) with a higher degree are key genes according to the constructed PPI network. RPS27A, a member of ribosomal protein S27AE family, is a component of riobosome 40S subunit and encodes the carboxy terminus of ubiquitin. Previous studies have indicated that RPS27A was up-regulated in several cancers, including colorectal and renal cancers [36, 37]. Moreover, RPS27A induced cells cycle arrest, enhanced cell proliferation and suppressed cell apoptosis possibly via multiple signaling pathways, such as p53 and BCL-2 signaling pathways [38]. Yang et al conducted a bioinformatics analysis by constructing a miRNA-Transcription Factor-mRNA network to identify important genes related to mesenchymal stem cells (MSCs) for diabetic nephropathy (DN) treatment [39]. They stated that RPS27A was regulated by EIF3M (eukaryotic translation initiation factor 3 subunit M) and there was a higher RPS27A level in monocytes after mesenchymal stem cells co-cultured, suggesting RPS27A might play a critical role in the treatment of MSCs for DN [39]. However, few reports illuminated the potential roles of RPS27A on diabetic PDAC progression. Interestingly, PPI analysis showed RPS27A was closely interacted with UBA52, which was a housekeeping gene and could encode an ubiquitin ribosomal fusion protein. Although overwhelming evidence has demonstrated that UBA52 was probably responsible for the pathogenesis of DN, the influence of UBA52 on diabetic PDAC has not been fully understood [40, 41].
Additionally, two feature genes (down-regulated KIF22 and up-regulated PYGL) exhibited close associations with prognosis of PC. Furthermore, patients with lower expression levels of KIF22 and PYGL had better survival outcomes for PC. KIF22, a member of kinesin-like DNA-binding family, could encode a microtubule-dependent molecular motor protein and was involved in cell mitosis process [42]. A previous research reported that KIF22 was up-regulated in breast cancer and its inhibition could significantly suppress cell proliferation [43]. Zhang et al argued that KIF22 mRNA and protein levels were over-expressed in prostate cancer, and KIF22 was not dramatically linked with clinical outcomes of prostate cancer [44]. Herein, we found that KIF22 was significantly associated with history of diabetes. Therefore, we inferred that this gene may be a key gene biomarker in the development of diabetic PDAC. Additionally, PYGL was identified as a metastasis-associated metabolic gene in prostate cancer [45]. Until now, the possible influences of PYGL on diabetic PDAC have not been investigated.
There were some limitations in this study. Firstly, our findings suggested that OXPHOS pathway was strongly involved in the development of diabetic PDAC. However, the precise mechanisms have not been clarified. Secondly, the number of available samples in our study is relatively low, and a comprehensive bioinformatics analysis based on a larger sample size and relevant experimental assays still need to be carried out to verify our findings. Thirdly, more clinical and physiological features need to be included for a more comprehensive survival analysis.
In summary, our results showed that OXPHOS pathway might participate in the pathogenesis of diabetic PDAC. Moreover, a SVM classifier based on 21 feature genes was built and this classification model dramatically distinguished diabetic and non-diabetic PDAC patients. Additionally, KIF22 and PYGL were potential prognostic genes for PDAC survival. However, more detailed bioinformatics analysis and corresponding experimental assays are still need to be undertaken in the future.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author Contributions
Conception and design: WZ, YW and SY. Administrative support: YJ and ZZ. Provision of study materials or patients: YX, XW and HG. Collection and assembly of data: YW and HY. Data analysis and interpretation: WZ and SY. Manuscript writing: YW and WZ. Final approval of manuscript: All authors.
Conflict of Interest
The authors declare that they have no conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.por-journal.com/articles/10.3389/pore.2021.604730/full#supplementary-material
References
- 1. Iozzo RV. The family of the small leucine-rich proteoglycans: key regulators of matrix assembly and cellular growth. Crit Rev Biochem Mol Biol (1997). 32 (2):141–74. 10.3109/10409239709108551 [DOI] [PubMed] [Google Scholar]
- 2. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2017. CA Cancer J Clin (2017). 67 (1):7–30. 10.3322/caac.21387 [DOI] [PubMed] [Google Scholar]
- 3. Hine RJ, Srivastava S, Milner JA, Ross SA. Nutritional links to plausible mechanisms underlying pancreatic cancer: a conference report. Pancreas (2003). 27 (4):356–66. 10.1097/00006676-200311000-00014 [DOI] [PubMed] [Google Scholar]
- 4. Andersen DK, Andren-Sandberg A, Duell EJ, Goggins M, Korc M, Petersen GM, et al. Pancreatitis-diabetes-pancreatic cancer. Pancreas (2013). 42 (8):1227–37. 10.1097/mpa.0b013e3182a9ad9d [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Maisonneuve P, Lowenfels AB. Risk factors for pancreatic cancer: a summary review of meta-analytical studies. Int J Epidemiol (2015). 44 (1):186–98. 10.1093/ije/dyu240 [DOI] [PubMed] [Google Scholar]
- 6. Huxley R, Ansary-Moghaddam A, Berrington de González A, Barzi F, Woodward M. Type-II diabetes and pancreatic cancer: a meta-analysis of 36 studies. Br J Cancer (2005). 92 (11):2076–83. 10.1038/sj.bjc.6602619 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Kleeff J, Costello E, Jackson R, Halloran C, Greenhalf W, Ghaneh P, et al. The impact of diabetes mellitus on survival following resection and adjuvant chemotherapy for pancreatic cancer. Br J Cancer (2016). 115 (7):887–94. 10.1038/bjc.2016.277 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Lee S, Hwang HK, Kang CM, Lee WJ. Adverse oncologic impact of new-onset diabetes mellitus on recurrence in resected pancreatic ductal adenocarcinoma: a comparison with long-standing and non-diabetes mellitus patients. Pancreas (2018). 47 (7):816–22. 10.1097/mpa.0000000000001099 [DOI] [PubMed] [Google Scholar]
- 9. Gao W, Zhou Y, Li Q, Zhou Q, Tan L, Song Y, et al. Analysis of global gene expression profiles suggests a role of acute inflammation in type 3C diabetes mellitus caused by pancreatic ductal adenocarcinoma. Diabetologia (2015). 58 (4):835–44. 10.1007/s00125-014-3481-8 [DOI] [PubMed] [Google Scholar]
- 10. Sun Y, He W, Luo M, Zhou Y, Chang G, Ren W, et al. Role of transgelin-2 in diabetes-associated pancreatic ductal adenocarcinoma. Oncotarget (2017). 8 (30):49592–604. 10.18632/oncotarget.17519 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Boursi B, Finkelman B, Giantonio BJ, Haynes K, Rustgi AK, Rhim AD, Mamtani R, et al. A clinical prediction model to assess risk for pancreatic cancer among patients with new-onset diabetes. Gastroenterology (2017). 152 (4):840–50. 10.1053/j.gastro.2016.11.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Huang H, Dong X, Kang MX, Xu B, Chen Y, Zhang B, et al. Novel blood biomarkers of pancreatic cancer-associated diabetes mellitus identified by peripheral blood-based gene expression profiles. Am J Gastroenterol (2010). 105 (7):1661–9. 10.1038/ajg.2010.32 [DOI] [PubMed] [Google Scholar]
- 13. Kang M, Qin W, Buya M, Dong X, Zheng W, Lu W, et al. VNN1, a potential biomarker for pancreatic cancer-associated new-onset diabetes, aggravates paraneoplastic islet dysfunction by increasing oxidative stress. Cancer Lett (2016). 373 (2):241–50. 10.1016/j.canlet.2015.12.031 [DOI] [PubMed] [Google Scholar]
- 14. Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, et al. NCBI GEO: mining tens of millions of expression profiles--database and tools update. Nucleic Acids Res (2007). 35:D760–D765. 10.1093/nar/gkl887 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res (2015). 43 (7):e47. 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Parrish RS, Spencer HJ. Effect of normalization on significance testing for oligonucleotide microarrays. J Biopharm Stat (2004). 14 (3):575–89. 10.1081/bip-200025650 [DOI] [PubMed] [Google Scholar]
- 17. Chang LC, Lin HM, Sibille E, Tseng GC. Meta-analysis methods for combining multiple expression profiles: comparisons, statistical characterization and an application guideline. BMC Bioinformatics (2013). 14:368. 10.1186/1471-2105-14-368 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nat Protoc (2009). 4 (1):44–57. 10.1038/nprot.2008.211 [DOI] [PubMed] [Google Scholar]
- 19. Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res (2009). 37 (1):1–13. 10.1093/nar/gkn923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics (2008). 9:559. 10.1186/1471-2105-9-559 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res (2015). 43:D447–D452. 10.1093/nar/gku1003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Shannon P, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res (2003). 13 (11):2498–504. 10.1101/gr.1239303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Lu X, Yang Y, Wu F, Gao M, Xu Y, Zhang Y, Yao Y, et al. Discriminative analysis of schizophrenia using support vector machine and recursive feature elimination on structural MRI images. Medicine (Baltimore) (2016). 95 (30):e397310. 10.1097/md.0000000000003973 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Deist TM, Dankers FJ, Valdes G, Wijsman R, Hsu IC, Oberije C, Lustberg T, et al. Machine learning algorithms for outcome prediction in (chemo)radiotherapy: an empirical comparison of classifiers. Med Phys (2018). 45 (7):3449–59. 10.1002/mp.12967 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Wang Q, Liu X. Screening of feature genes in distinguishing different types of breast cancer using support vector machine. Onco Targets Ther (2015). 8:2311–7. 10.2147/OTT.S85271 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, Müller M. (2011). pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12:77. 10.1186/1471-2105-12-77 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Wang P, Wang Y, Hang B, Zou X, Mao JH. A novel gene expression-based prognostic scoring system to predict survival in gastric cancer. Oncotarget (2016). 7 (34):55343–51. 10.18632/oncotarget.10533 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Weinberg SE, Chandel NS. Targeting mitochondria metabolism for cancer therapy. Nat Chem Biol (2015). 11 (1):9–15. 10.1038/nchembio.1712 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Moreno-Sanchez R, Sara Rodríguez-Enríquez S, Marin-Hernandez A, Saavedra E. Energy metabolism in tumor cells. FEBS J (2007). 274:1393–418. 10.1111/j.1742-4658.2007.05686.x [DOI] [PubMed] [Google Scholar]
- 30. Rademaker G, Costanza B, Anania S, Agirman F, Maloujahmoum N, Di Valentin E, Goval JJ, et al. Myoferlin contributes to the metastatic phenotype of pancreatic cancer cells by enhancing their migratory capacity through the control of oxidative phosphorylation. Cancers (2019). 11 (6): 853. 10.3390/cancers11060853 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Ashton TM, McKenna WG, Kunz-Schughart LA, Higgins GS. Oxidative phosphorylation as an emerging target in cancer therapy. Clin Cancer Res (2018). 24 (11):2482–90. 10.1158/1078-0432.ccr-17-3070 [DOI] [PubMed] [Google Scholar]
- 32. Viale A, Pettazzoni P, Lyssiotis CA, Ying H, Sánchez N, Marchesini M, Carugo A, et al. Oncogene ablation-resistant pancreatic cancer cells depend on mitochondrial function. Nature (2014). 514 (7524):628–32. 10.1038/nature13611 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Zhou C, Sun H, Zheng C, Gao J, Fu Q, Hu N, Shao X, et al. Oncogenic HSP60 regulates mitochondrial oxidative phosphorylation to support Erk1/2 activation during pancreatic cancer cell growth. Cell Death Dis (2018). 9 (2):161. 10.1038/s41419-017-0196-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Wada J, Nakatsuka A. Mitochondrial dynamics and mitochondrial dysfunction in diabetes. Acta Med Okayama (2016). 70 (3):151–8. 10.18926/AMO/54413 [DOI] [PubMed] [Google Scholar]
- 35. Haythorne E, Rohm M, van de Bunt M, Brereton MF, Tarasov AI, Blacker TS, et al. Diabetes causes marked inhibition of mitochondrial metabolism in pancreatic β-cells. Nat Commun (2019). 10 (1):2474. 10.1038/s41467-019-10189-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Wong JM, Mafune K, Yow H, Rivers EN, Ravikumar TS, Steele GD, Jr, Chen LB. (1993). Ubiquitin-ribosomal protein S27a gene overexpressed in human colorectal carcinoma is an early growth response gene. Cancer Res 53 (8):1916–20. [PubMed] [Google Scholar]
- 37. Kanayama H TK, Tanaka K, Aki M, Kagawa S, Miyaji H, Satoh M, Okada F, et al. Changes in expressions of proteasome and ubiquitin genes in human renal cancer cells. Cancer Res (1991). 51 (24):6677–85. [PubMed] [Google Scholar]
- 38. Wang H YJ, Zhang L, Xiong Y, Chen S, Xing H, Tian Z, et al. RPS27a promotes proliferation, regulates cell cycle progression and inhibits apoptosis of leukemia cells. Biochem Biophysical Res Commun (2014). 446 (4):1204–10. 10.1016/j.bbrc.2014.03.086 [DOI] [PubMed] [Google Scholar]
- 39. Yang H, Zhang X, Xin G. Investigation of mechanisms of mesenchymal stem cells for treatment of diabetic nephropathy via construction of a miRNA-TF-mRNA network. Ren Fail (2018). 40 (1):136–45. 10.1080/0886022x.2017.1421556 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Dihazi H, Muller GA, Lindner S, Meyer M, Asif AR, Oellerich M, Strutz F. Characterization of diabetic nephropathy by urinary proteomic analysis: identification of a processed ubiquitin form as a differentially excreted protein in diabetic nephropathy patients. Clin Chem (2007). 53 (9):1636–45. 10.1373/clinchem.2007.088260 [DOI] [PubMed] [Google Scholar]
- 41. Wada J, Sun L, Kanwar YS. Discovery of genes related to diabetic nephropathy in various animal models by current techniques. Contrib Nephrol (2011). 169:161–74. 10.1159/000313951 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Tokai N F-NA, Toyoshima Y, Yonemura S, Tsukita S, Inoue J, Yamamota T. Kid, a novel kinesin-like DNA binding protein, is localized to chromosomes and the mitotic spindle. The EMBO J (1996). 15 (3):457–67. 10.1002/j.1460-2075.1996.tb00378.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Yu Y, Wang XY, Sun L, Wang YL, Wan YF, Li XQ, Feng YM. Inhibition of KIF22 suppresses cancer cell proliferation by delaying mitotic exit through upregulating CDC25C expression. Carcinogenesis (2014). 35 (6):1416–25. 10.1093/carcin/bgu065 [DOI] [PubMed] [Google Scholar]
- 44. Zhang Z XH, Zhu S, Chen X, Yu J, Shen T, Li X, et al. High expression of KIF22/kinesin-like DNA binding protein (Kid) as a poor prognostic factor in prostate cancer patients. Med Sci Monit (2018). 24:8190–7. 10.12659/msm.912643 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Chen J CS, Situ B, Zhong J, Hu Y, Li S, Huang J, et al. Metabolic reprogramming-based characterization of circulating tumor cells in prostate cancer. J Exp Clin Cancer Res (2018). 37 (1):127. 10.1186/s13046-018-0789-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.