Skip to main content
Mediators of Inflammation logoLink to Mediators of Inflammation
. 2020 Apr 9;2020:5934821. doi: 10.1155/2020/5934821

Identification of Hub Genes Related to Carcinogenesis and Prognosis in Colorectal Cancer Based on Integrated Bioinformatics

Benjiao Gong 1, Yanlei Kao 2, Chenglin Zhang 1, Fudong Sun 3, Zhaohua Gong 4,, Jian Chen 1,4,
PMCID: PMC7171686  PMID: 32351322

Abstract

The high mortality of colorectal cancer (CRC) patients and the limitations of conventional tumor-node-metastasis (TNM) stage emphasized the necessity of exploring hub genes closely related to carcinogenesis and prognosis in CRC. The study is aimed at identifying hub genes associated with carcinogenesis and prognosis for CRC. We identified and validated 212 differentially expressed genes (DEGs) from six Gene Expression Omnibus (GEO) datasets and the Cancer Genome Atlas (TCGA) database. We investigated functional enrichment analysis for DEGs. The protein-protein interaction (PPI) network was constructed, and hub modules and genes in CRC carcinogenesis were extracted. A prognostic signature was developed and validated based on Cox proportional hazards regression analysis. The DEGs mainly regulated biological processes covering response to stimulus, metabolic process, and affected molecular functions containing protein binding and catalytic activity. The DEGs played important roles in CRC-related pathways involving in preneoplastic lesions, carcinogenesis, metastasis, and poor prognosis. Hub genes closely related to CRC carcinogenesis were extracted including six genes in model 1 (CXCL1, CXCL3, CXCL8, CXCL11, NMU, and PPBP) and two genes and Metallothioneins (MTs) in model 2 (SLC26A3 and SLC30A10). Among them, CXCL8 was also related to prognosis. An eight-gene signature was proposed comprising AMH, WBSCR28, SFTA2, MYH2, POU4F1, SIX4, PGPEP1L, and PAX5. The study identified hub genes in CRC carcinogenesis and proposed an eight-gene signature with good reproducibility and robustness at the molecular level for CRC, which might provide directive significance for treatment selection and survival prediction.

1. Introduction

Colorectal cancer (CRC) is diagnosed the second most cancer in females and the third most form in males, which has been a major global public health problem [1]. The number of cases diagnosed is forecast to rise from 1800 million now to 3093 million by 2040 through the World Health Organization [2]. Although modern medicine has made great advances, CRC is still the third leading cause for cancer-related mortality [3]. As we all know, early detection of CRC has some effect on reducing its mortality and the discovery of precursor lesion can even cut down the incidence [4]. Early diagnosis with better survival and later diagnosis with worse prognosis have no doubt. Tumor-node-metastasis (TNM) stage, identified by the American Joint Committee on Cancer according to pathologic and clinical factors, is not only the fundamental for treatment but also the gold standard for CRC prognosis [5, 6]. The 5-year survival rate at stage I is more than 90%, and the 5-year survival rate for stage IV is only 10% [7]. However, 20% of patients at stage II undergo cancer-specific death and some stage III patients confront better outcomes than some patients at stage II [8]. Hence, it is extremely necessary to identify novel prognostic biomarkers for early diagnostic detection and improving outcomes due to the limitation of TNM stage.

In recent decades, the research on the molecular and genetic mechanisms in CRC carcinogenesis and progression has accelerated the investigation of genetic prognostic markers for the TNM staging system supplement [9]. And the progress of microarray and high-throughput sequencing technology has also promoted to interpret epigenetic or critical genetic alternations in carcinogenesis and to decipher hopeful biomarkers for cancer diagnosis, treatment, and prognosis [10, 11]. Publicly available genome databases like the Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO) have provided more facilitated genome exploration on different cancers containing CRC for clinicians and bioinformatics, which was generally impossible in the past [1215]. Meanwhile, integrated bioinformatics methods have been applied to cancer research and large amounts of valuable information have been excavated, which were explored to overcome the restricted or discordant results because of the application of either a small sample size or different types of technological platforms [1619].

In this study, we identified and integrated differentially expressed genes (DEGs) from gene expression profile and RNA sequencing data for human CRC. The DEGs were further preformed functional enrichment analysis to investigate biological processes, molecular functions, and reactome pathways regulated by the DEGs. The protein-protein interaction (PPI) network reflecting the interactions among DEGs was constructed, and hub network modules were captured and deciphered, which embodied representative genes in CRC carcinogenesis. Finally, patients with overall survival data were randomly divided into two groups, the train group and the test group. The train group was used to reveal genes associated with survival and build a CRC gene signature for prognosis. The test group was employed to assess the prognosis model comprehensively.

2. Materials and Methods

2.1. DEG Identification by GEO

The gene expression profile data (GSE21510, GSE24514, GSE32323, GSE89076, GSE110225, and GSE113513) for colorectal cancer were extracted from the GEO database [2024]. All included datasets contained at least 10 samples. The normalization and log2 conversion were performed for the matrix data of each GEO dataset, and the DEGs between tumor and control tissues were filtered out via the Limma package in R [25]. Gene integration for the DEGs screened from the six datasets was executed using the RobustRankAggreg (RRA) package based on a robust rank aggregation method [26]. |log2FC | >1.5 and adjusted P value < 0.05 set the criteria to filter statistically significant DEGs.

2.2. DEG Validation by TCGA

The integrated significant DEGs from GEO datasets were validated by means of RNA sequencing data in TCGA COADREAD dataset. Raw RNA sequencing data including 647 COADREAD samples and 51 matched noncancerous samples were extracted from TCGA database, and the clinical information of patients was also downloaded. The Mann-Whitney test was employed to normalize and analyze the TCGA data. Genes with |log2FC | >2 and adjusted P value < 0.05 were considered to be significantly differentially expressed. Overlapping DEGs between GEO and TCGA database were reserved for following studies.

2.3. Functional Enrichment Analysis

The potential biological processes and molecular functions of the overlapping DEGs were evaluated using BINGO plug-in of Cytoscape 3.2.1 [27]. During this procedure, the significance level was set to 0.05, and organism was selected as Homo sapiens. The pathway enrichment analysis was performed utilizing Reactome FI plug-in of Cytoscape 3.2.1, and the threshold level was defined as FDR < 0.05 [28]. The top ten terms of the functional enrichment analysis were visualized using the Bubble package [29].

2.4. PPI Network and Module Analysis

The protein-protein interactions among overlapping DEGs were identified via STRING database, and genes with the combined score ≥ 0.4 were selected to construct the PPI network [30]. The PPI network was visualized and analyzed by Cytoscape 3.2.1. And the hub network modules were captured with the help of the Cytoscape plug-in Molecular Complex Detection (MCODE) with parameters degree cutoff = 2, Node Score Cutoff = 0.2, and K − Core = 2 [31]. Then, the topological parameters were also calculated, and survival analysis was performed using clinical information via the survival package for hub modules.

2.5. COX Model Construction and Verification

After eliminating patients without overall survival data, 617 patients' data were used for survival analysis. All patients were randomly divided into two groups with the help of the caret package, train group and test group [32]. The train group was used for constructing the COX prognostic signature, and the test group was used for validating the signature. The train group executed univariate Cox proportional hazards regression analysis to recognize candidate genes associated with survival. Then, the LASSO penalized regression model was employed to achieve shrinkage and variable selection simultaneously and to prevent the prognostic model overfitting. Subsequently, the multivariate Cox proportional hazards regression model was performed and corresponding coefficients were calculated in the train group. The predicted overall survival information with a risk score for each patient in two groups was assessed on the basis of the expression level of the prognostic gene and its corresponding coefficient in the train group. The patients in two groups were classified into low- or high-risk groups according to the median risk score of the train group. Survival curves were plotted utilizing the survival package to assess the differences in survival rate between high- and low-risk patients in two groups. Furthermore, the receiver operating characteristic (ROC) curve was constructed based on the survivalROC package and the area under the curve (AUC) was measured to evaluate the predictive ability of the prognostic signature for clinical outcomes. The risk score distribution, survival time, and gene expression patterns for patients in the train and test groups were visualized in R.

3. Results

3.1. DEG Identification and Validation

The detailed information for the six GEO datasets in this study is shown in Table 1. 254 DEGs in total including 80 upregulated genes and 174 downregulated genes were obtained through screening of the Limma package and integration of the RRA package for the six datasets (Table S1). The top 20 up- and downregulated genes after the integrated analysis are displayed in Figure 1(a). The DEGs extracted from TCGA database comprised 1386 upregulated and 2142 downregulated genes (Table S2). Finally, 212 overlapping DEGs containing 46 upregulated and 166 downregulated genes were identified (Figure 1(b) and Table S3). In addition, the clinical information of patients was also organized for survival analysis (Table S4).

Table 1.

Information for six GEO datasets in the study.

Dataset Platform Number of samples (tumor/control)
GSE21510 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array 148 (104/44)
GSE24514 [HG-U133A] Affymetrix Human Genome U133A Array 49 (34/15)
GSE32323 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array 44 (22/22)
GSE89076 Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray 039381 80 (41/39)
GSE110225 [HG-U133A] Affymetrix Human Genome U133A Array; [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array 60 (30/30)
GSE113513 [PrimeView] Affymetrix Human Gene Expression Array 28 (14/14)

Figure 1.

Figure 1

DEG identification from GEO and validation from TCGA. (a) The top 20 up- and downregulated genes in six GEO datasets based on a RRA package. (b) Overlapping DEGs between GEO and TCGA database.

3.2. Functional Enrichment Analysis

To explain the potential biological functions of the 212 overlapping DEGs, the biological process, molecular function, and reactome pathway enrichment analyses were executed. The biological processes were mainly involved in response to stimulus and metabolic process (Figure 2(a) and Table S5). The molecular functions were significantly enriched in protein binding and catalytic activity (Figure 2(b) and Table S6). According to the reactome pathway enrichment analysis, the upregulated genes were mainly associated with signaling by GPCR and extracellular matrix organization (Figure 2(c) and Table S7). And the downregulated genes participated in response to metal ions, metabolism, signal transduction, and transmembrane transport of small molecules (Figure 2(d) and Table S8).

Figure 2.

Figure 2

Functional enrichment analysis for DEGs. (a) The top 10 terms of biological process enrichment for up- and downregulated DEGs. (b) The top 10 terms of molecular function enrichment for up- and downregulated DEGs. (c) The top 10 terms of reactome pathway enrichment for upregulated DEGs. (d) The top 10 terms of reactome pathway enrichment for downregulated DEGs.

3.3. PPI Network and Module Analysis

The PPIs between 37 upregulated and 131 downregulated genes were excavated via STRING database with the combined score ≥ 0.4, and the PPI network was displayed containing 168 nodes and 417 interactions (Figure 3(a) and Table S9). To further investigate the hub network modules from the complex network, two hub modules with a score > 5 were extracted based on MCODE (Figures 3(b) and 3(c)). And three topological parameters covering degree, closeness centrality, and betweenness centrality were calculated to measure hub nodes in hub network modules (Tables S10 and S11). Hub genes with parameters greater than the mean of each group were considered to reflect key biological characteristics in the network module. However, all parameters were the same in model 1, but CXCL family genes accounted for a half. SLC26A3 and SLC30A10 were defined as hub genes in model 2. Then, the impact of the two modules on the pathways was also investigated. The genes in model 1 were significantly enriched in nine pathways, and the top five pathways coincided with the pathways that 46 upregulated genes mainly regulated, which might indicate that the upregulated genes in model 1 were dominant (Figure 3(d)). The genes in model 2 mainly gathered in six pathways, and the top five pathways were consistent with the pathways affected by 166 downregulated genes, which revealed that Metallothioneins (MTs) played an important role in model 2 (Figure 3(e)). Survival analysis of hub modules suggested CXCL8, CXCL13, and CLCA1 were associated with prognosis (P < 0.05), and the high expression group presented better prognosis (Figures 3(f)3(h)).

Figure 3.

Figure 3

Construction of PPI network and module analysis. (a) The PPI network with red nodes for upregulated genes and green nodes for downregulated genes. (b) Module 1 of PPI network. (c) Module 2 of PPI network. (d) Reactome pathway enrichment for module 1. (e) Reactome pathway enrichment for module 2. (f) Survival curve of CXCL8. (g) Survival curve of CXCL13. (h) Survival curve of CLCA1.

3.4. COX Model Construction and Verification

The 617 patients' data were randomly divided into two groups, the train group (309) and the test group (308). In all, 102 genes were captured through the univariate Cox proportional hazards regression model in the train group, which were significantly associated with survival time (P < 0.001) and all belonged to high-risk genes (HR > 1) (Table S12). Then, 16 representative genes were screened out through shrinkage and variable selection simultaneously of the LASSO penalized regression model in the train group (Figures 4(a) and 4(b) and Table S13). A prognostic gene signature involved in eight genes was developed using the multivariate Cox proportional hazards regression model, covering Muellerian-inhibiting factor (AMH), transmembrane protein 270 (WBSCR28), surfactant-associated protein 2 (SFTA2), myosin-2 (MYH2), POU domain, class 4, transcription factor 1 (POU4F1), homeobox protein SIX4 (SIX4), pyroglutamyl-peptidase 1-like protein (PGPEP1L), and paired box protein Pax-5 (PAX5) (Table 2). All the eight genes with HR > 1 were identified as risky prognostic genes, which implied that the patient's risk increased along with the rising of the gene expression. The risk scores were calculated based on the gene expression values and relevant coefficients, and all patients were divided to high- or low-risk groups based on the median risk score of the train group (Figures 5(a) and 5(b)). The survival time statistics in high- and low-risk groups are exhibited in Figures 5(c) and 5(d). Obviously, a significant difference in survival rate was represented between the high- and low-risk groups in the train group in Figure 5(e), and Figure 5(f) verifies the existence of the significant difference in the test group. The survival rates of the low-risk group were 94.3% (95% CI: 90.6%-98.2%), 88.6% (95% CI: 82.1%-95.6%), and 65.3% (95% CI: 49.3%-86.4%) for 1, 3, and 5 years, respectively, compared with 85.8% (95% CI: 80.2%-91.8%), 70.3% (95% CI: 62.0%-79.7%), and 50.4% (95% CI: 37.0%-68.5%) for the high-risk group in the train group. The accuracy of the prognostic gene signature in survival prediction was presented with AUC as 0.713 and 0.614, respectively, for the train group and the test group (Figures 5(g) and 5(h)). With the rising of the risk score, the distribution of the gene expression trend is revealed in Figures 5(i) and 5(j).

Figure 4.

Figure 4

LASSO regression analysis for the train group. (a) LASSO coefficient profiles of prognostic genes with P < 0.001. (b) Selection of the optimal value of lambda via 10-fold cross-validations.

Table 2.

Prognostic information for the eight genes in train group.

Gene symbol Univariate analysis Multivariate analysis
HR (95% CI) P value HR (95% CI) P value Coefficient
AMH 1.001 (1.000-1.02) 0.000297 1.001 (1.000-1.001) 0.011546 0.000842
WBSCR28 1.022 (1.010-1.033) 0.000139 1.012 (0.999-1.026) 0.080719 0.012188
SFTA2 1.001 (1.001-1.002) 1.61E-05 1.001 (1.001-1.002) 0.000137 0.001245
MYH2 1.061 (1.029-1.095) 0.000162 1.067 (1.027-1.108) 0.00076 0.064845
POU4F1 1.005 (1.003-1.008) 5.65E-05 1.004 (1.002-1.007) 0.002323 0.004278
SIX4 1.003 (1.002-1.004) 6.33E-07 1.003 (1.002-1.005) 1.79E-05 0.003124
PGPEP1L 1.061 (1.032-1.090) 2.46E-05 1.070 (1.038-1.103) 1.43E-05 0.067637
PAX5 1.001 (1.000-1.001) 1.53E-05 1.001 (1.000-1.001) 0.000106 0.000774

Figure 5.

Figure 5

The evaluation and confirmation of the eight-gene signature. (a) The risk score distribution for the train group. (b) The risk score distribution for the test group. (c) The survival time statistic for the train group. (d) The survival time statistic for the test group. (e) Survival curve for the train group. (f) Survival curve for the test group. (g) ROC curve for the train group. (h) ROC curve for the test group. (i) Gene expression pattern for the train group. (j) Gene expression pattern for the test group.

4. Discussion

At the moment, TNM stage is the principal guideline for treatment selection and prognosis prediction of CRC patients. In clinical practice, CRC patients with similar histopathological characteristics presented significantly different prognosis or diverse responses to treatment, which might be associated with the high molecular heterogeneity of CRC and could expose the TNM stage limitations towards precision medicine in CRC [3335]. Moreover, although increasing studies concerning biomarkers have been accumulated focusing on tumor diagnosis, treatment, and prognosis, there are scarce biomarkers utilized for early diagnosis, treatment selection, and predicting outcome in clinical. Thus, reliable prognostic biomarkers capable of differentiating patients' prognosis are still desperately required in CRC.

In this research, 254 DEGs containing 80 upregulated genes and 174 downregulated genes were screened and integrated from six GEO datasets and were mapped into RNA sequencing data from TCGA to extract 212 overlapping DEGs containing 46 upregulated and 166 downregulated genes. The biological process analysis suggested that the upregulated genes were mainly implicated in multiple metabolic processes including collagen catabolic process, multicellular organismal catabolic process, collagen metabolic process, multicellular organismal macromolecule metabolic process, and multicellular organismal metabolic process. The downregulated genes were primarily involved in various responses to stimulus, response to chemical stimulus like chemotaxis and response to nutrient, response to external stimulus like taxis and response to extracellular stimulus, and response to endogenous stimulus like response to glucocorticoid stimulus, response to corticosteroid stimulus, response to steroid hormone stimulus, and response to hormone stimulus. The molecular function analysis showed that the upregulated genes chiefly affected protein binding containing chemokine activity, chemokine receptor binding, cytokine activity, G-protein-coupled receptor binding, receptor binding, etc. The downregulated genes had much effect on catalytic activity such as lyase activity, oxidoreductase activity, transferase activity, and hydrolase activity. For the reactome pathway enrichment analysis, the upregulated genes mostly focused on regulation of the immune system and inflammation and cancer cell invasion and metastasis [36, 37]. The downregulated genes played important roles in CRC-related pathways involving in preneoplastic lesions, carcinogenesis, metastasis, and poor prognosis [3840].

Two hub modules were also identified, and topological parameters were calculated in the PPI network. Topological parameters of genes in module 1 were not significantly different, but the pathway enrichment results mainly accumulated in pathways regulated by 46 upregulated genes, which revealed the major status of CXCL1, CXCL3, CXCL8, CXCL11, NMU, and PPBP. Increased CXCL1 levels had positive relationships with tumor size, degree of invasion, advancing stage, metastasis, and poor prognosis [41, 42]. High expression of CXCL3 was detected in premalignant adenomas and CRC tissue, and CXCL3 significantly downregulated in liver metastasis compared with the primary tumor. And CXCL3 obviously presented high expression in patients with local relative to systemic disease [43]. On the contrary, overexpression of CXCL8 promoted proliferation, migration, and invasion of CRC cells, which was strongly correlated with CRC angiogenesis, metastasis, poor prognosis, and disease-free survival [44, 45]. However, high expression of CXCL8 could act as a protective barrier for liver metastasis of CRC and coincide with better prognosis [46, 47]. Objectively, the role of CXCL8 still remained in dispute. This study confirmed that CXCL8 was associated with prognosis and suggested that the high CXCL8 expression group had a better prognosis than the low expression group. Besides angiogenesis, CXCL11 was an important cytokine in the progression of inflammation to CRC and induced tumor-associated macrophages to infiltrate, which enhanced the proliferation and invasion of CRC cells and generated poor prognosis [4850]. NMU was capable of facilitating the proliferation, migration, and invasion of CRC cells [51]. PPBP, also known as CXCL7, was overexpressed in CRC and associated with poor prognosis and disease-free survival [52]. SLC26A3 and SLC30A10 were uncovered as hub genes in model 2, and the top 2 significant pathways hit on MT1M, MT1X, MT1F, MT1G, MT1H, and MT1E, which occupied the one-sided subnetwork of model 2. SLC26A3 downexpressed in CRC played a tumor suppressor role and was expected to be a candidate epithelial marker in CRC [53, 54]. SLC30A10 was acceptable to classify methylation epigenotypes and correlated with molecular genesis in CRC [55]. MTs, a protein family of low molecular weight and full of cysteine, contained at least 11 functional isoforms and implicated in zinc and redox metabolism. MTs were epigenetically downregulated in CRC early progression (especially MT1G) and tended to induce a worse prognosis [56]. MT overexpression represented a crucial early step in the development of ulcerative colitis-associated CRC [57]. MT expression was also a potential reminder affecting lymph node metastases, particularly in patients with synchronous liver metastases [40]. MT1G uncovered the capability of tumor suppressor via promoting CRC differentiation through zinc signaling [58]. Also, MT1G overexpression sensitized CRC cells to oxaliplatin and 5-fluorouracil via activating p53 and repressing NF-κB activity [59]. In addition, CXCL13 and CLCA1 in hub modules were downregulated, and high expression of that had a better prognosis. CXCL13 showed significantly lower expression in CRC, and patients with CXCL13 deletion had a significantly higher risk of relapse [60]. CLCA1 was also reported to be involved in the pathophysiology of CRC, and upregulation of CLCA1 was associated with a favorable prognosis [61, 62].

In the present study, we detected the association between gene expression and prognosis in CRC patients by recruiting RNA sequencing data for 3528 genes of 309 patients and identified 102 genes significantly associated with CRC patients' overall survival. After removing gene information highly correlated, an eight-gene signature was developed and risk scores were evaluated, which classified CRC patients into high- and low-risk groups with significantly different overall survival. The test group validated the prognostic value of the eight-gene signature capable of good reproducibility and robustness, which suggested that the eight-gene signature could improve prognostic prediction at the molecular level beyond the conventional TNM stage. The eight-gene signature also pushed the limitation of traditional TNM stage for prognostic prediction due to molecular heterogeneity in CRC. Currently, several gene signatures have been reported for prognostic prediction of CRC [6366]. Compared to the reported signatures, the uniqueness of this study was that LASSO regression analysis could execute feature selection and shrinkage and screen highly correlated genes, which determined the optimal genes to participate in subsequent signature building [66]. LASSO regression could prevent the gene signature overfitting and increase the accuracy of bioinformatics analysis [67]. We explored both ROC curve and test verification to assess the prognostic performance of the signature. In the future, the value of the eight-gene signature still needs to be examined in clinical guidelines. The eight-gene signature could delaminate the risk of CRC patients' survival before surgery selection, which implied patients' benefit from therapy with a good prognosis and avoiding unnecessary treatment with a poor prognosis.

Finally, the genes of the signature were more or less researched in human tumors. A monoclonal antibody targeting anti-mullerian-hormone-receptor II (AMHRII) acted through tumor-associated macrophage engagement in advanced/metastatic CRC and had been performed phase 2 study [68]. WBSCR28 had not been well studied in human tumor, but it was repressed by androgen receptor in prostate cancer [69]. SFTA2 was identified as a potential disease-free survival prognostic gene in colon cancer and one of the potential biomarkers for distinguishing between lung adenocarcinoma and squamous cell carcinoma [70, 71]. MYH2 was confirmed significantly changed in hepatocellular carcinoma and highly expressed in the origin of squamous cell carcinoma in the lungs of patients with previous head and neck malignancies [72, 73]. POU4F1 was upregulated and induced neuroendocrine phenotype in small cell lung cancer [74]. SIX4 promoted tumor angiogenesis and metastasis via activating AKT pathway in CRC [75, 76]. PGPEP1L was confirmed downregulated in CRC via Expression Atlas database and firstly proposed as an independent prognostic factor (Table S14). PAX5 was identified to be relevant to CRC with peritoneal metastasis [77].

5. Conclusion

In conclusion, we identified hub genes involved in the pathogenesis of CRC with the help of integrated bioinformatics analysis. We also proposed an eight-gene signature comprising AMH, WBSCR28, SFTA2, MYH2, POU4F1, SIX4, PGPEP1L, and PAX5, which would provide directive significance for prognostic prediction and treatment selection in CRC. However, the application of the eight-gene signature still needed to be assessed and validated in clinical.

Acknowledgments

This work was supported by the Shandong Province Key Research and Development Plan, China (No. 2019GSF107096).

Abbreviations

CRC:

Colorectal cancer

TNM:

Tumor-node-metastasis

DEGs:

Differentially expressed genes

GEO:

Gene Expression Omnibus

TCGA:

The Cancer Genome Atlas

PPI:

Protein-protein interaction

MTs:

Metallothioneins

RRA:

RobustRankAggreg

MCODE:

Molecular Complex Detection

ROC:

Receiver operating characteristic

AUC:

Area under the curve

AMH:

Muellerian-inhibiting factor; anti-mullerian hormone

WBSCR28:

Transmembrane protein 270

SFTA2:

Surfactant-associated protein 2

MYH2:

Myosin-2

POU4F1:

POU domain, class 4, transcription factor 1

SIX4:

Homeobox protein SIX4

PGPEP1L:

Pyroglutamyl-peptidase 1-like protein

PAX5:

Paired box protein Pax-5.

Contributor Information

Zhaohua Gong, Email: gongzhaohuayt@163.com.

Jian Chen, Email: chenjianyt@163.com.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Authors' Contributions

Benjiao Gong and Yanlei Kao contributed equally to this work.

Supplementary Materials

Supplementary 1

Table S1: 254 DEGs screened by Limma package and integrated by RRA package from six GEO datasets.

Supplementary 2

Table S2: DEGs extracted from TCGA database.

Supplementary 3

Table S3: overlapping DEGs identified between six GEO datasets and TCGA database.

Supplementary 4

Table S4: the clinical information of patients organized for survival analysis.

Supplementary 5

Table S5: the biological processes analyzed for overlapping DEGs.

Supplementary 6

Table S6: the molecular functions analyzed for overlapping DEGs.

Supplementary 7

Table S7: the reactome pathway enriched for upregulated overlapping DEGs.

Supplementary 8

Table S8: the reactome pathway enriched for downregulated overlapping DEGs.

Supplementary 9

Table S9: the protein-protein interactions among overlapping DEGs identified with combined score ≥ 0.4.

Supplementary 10

Table S10: three topological parameters calculated for module 1.

Supplementary 11

Table S11: three topological parameters calculated for module 2.

Supplementary 12

Table S12: the univariate Cox proportional hazards regression analysis for train group.

Supplementary 13

Table S13: the LASSO penalized regression performed for train group.

Supplementary 14

Table S14: differential expression of PGPEP1L in human.

References

  • 1.Jemal A., Bray F., Center M. M., Ferlay J., Ward E., Forman D. Global cancer statistics. CA: a Cancer Journal for Clinicians. 2011;61(2):69–90. doi: 10.3322/caac.20107. [DOI] [PubMed] [Google Scholar]
  • 2.Marcuello M., Vymetalkova V., Neves R. P. L., et al. Circulating biomarkers for early detection and clinical management of colorectal cancer. Molecular Aspects of Medicine. 2019;69:107–122. doi: 10.1016/j.mam.2019.06.002. [DOI] [PubMed] [Google Scholar]
  • 3.Siegel R. L., Miller K. D., Jemal A. Cancer statistics, 2018. CA: a Cancer Journal for Clinicians. 2018;68(1):7–30. doi: 10.3322/caac.21442. [DOI] [PubMed] [Google Scholar]
  • 4.Thorsteinsson M., Jess P. The clinical significance of circulating tumor cells in non-metastatic colorectal cancer – A review. European Journal of Surgical Oncology. 2011;37(6):459–465. doi: 10.1016/j.ejso.2011.01.025. [DOI] [PubMed] [Google Scholar]
  • 5.Li J., Guo B. C., Sun L. R., et al. TNM staging of colorectal cancer should be reconsidered by T stage weighting. World Journal of Gastroenterology. 2014;20(17):5104–5112. doi: 10.3748/wjg.v20.i17.5104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lugli A., Kirsch R., Ajioka Y., et al. Recommendations for reporting tumor budding in colorectal cancer based on the International Tumor Budding Consensus Conference (ITBCC) 2016. Modern Pathology. 2017;30(9):1299–1311. doi: 10.1038/modpathol.2017.46. [DOI] [PubMed] [Google Scholar]
  • 7.Zhang X., Sun X. F., Shen B., Zhang H. Potential applications of DNA, RNA and protein biomarkers in diagnosis, therapy and prognosis for colorectal cancer: a study from databases to AI-assisted verification. Cancers. 2019;11(2):p. 172. doi: 10.3390/cancers11020172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nitsche U., Maak M., Schuster T., et al. Prediction of prognosis is not improved by the seventh and latest edition of the TNM classification for colorectal cancer in a single-center collective. Annals of Surgery. 2011;254(5):793–801. doi: 10.1097/SLA.0b013e3182369101. [DOI] [PubMed] [Google Scholar]
  • 9.Lee J. H., Ahn J., Park W. S., et al. Colorectal cancer prognosis is not associated with BRAF and KRAS mutations-a STROBE compliant study. Journal of Clinical Medicine. 2019;8(1):p. 111. doi: 10.3390/jcm8010111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kulasingam V., Diamandis E. P. Strategies for discovering novel cancer biomarkers through utilization of emerging technologies. Nature Clinical Practice Oncology. 2008;5(10):588–599. doi: 10.1038/ncponc1187. [DOI] [PubMed] [Google Scholar]
  • 11.The Cancer Genome Atlas Research Network. Comprehensive molecular characterization of gastric adenocarcinoma. Nature. 2014;513(7517):202–209. doi: 10.1038/nature13480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Huang Z., Duan H., Li H. Identification of gene expression pattern related to breast cancer survival using integrated TCGA datasets and genomic tools. BioMed Research International. 2015;2015:10. doi: 10.1155/2015/878546.878546 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Agrawal N., Akbani R., Aksoy B. A., et al. Integrated genomic characterization of papillary thyroid carcinoma. Cell. 2014;159(3):676–690. doi: 10.1016/j.cell.2014.09.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Devarakonda S., Morgensztern D., Govindan R. Clinical applications of The Cancer Genome Atlas project (TCGA) for squamous cell lung carcinoma. Oncology (Williston Park) 2013;27(9):899–906. [PubMed] [Google Scholar]
  • 15.Lee H., Flaherty P., Ji H. P. Systematic genomic identification of colorectal cancer genes delineating advanced from early clinical stage and metastasis. BMC Medical Genomics. 2013;6(1):p. 54. doi: 10.1186/1755-8794-6-54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Yang J., Han S., Huang W., et al. A meta-analysis of microRNA expression in liver cancer. PLoS One. 2014;9(12, article e114533) doi: 10.1371/journal.pone.0114533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Song E., Song W., Ren M., et al. Identification of potential crucial genes associated with carcinogenesis of clear cell renal cell carcinoma. Journal of Cellular Biochemistry. 2018;119(7):5163–5174. doi: 10.1002/jcb.26543. [DOI] [PubMed] [Google Scholar]
  • 18.Sun M., Song H., Wang S., et al. Integrated analysis identifies microRNA-195 as a suppressor of Hippo-YAP pathway in colorectal cancer. Journal of Hematology & Oncology. 2017;10(1):p. 79. doi: 10.1186/s13045-017-0445-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wang Z., Chen G., Wang Q., Lu W., Xu M. Identification and validation of a prognostic 9-genes expression signature for gastric cancer. Oncotarget. 2017;8(43):73826–73836. doi: 10.18632/oncotarget.17764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Tsukamoto S., Ishikawa T., Iida S., et al. Clinical significance of osteoprotegerin expression in human colorectal cancer. Clinical Cancer Research. 2011;17(8):2444–2450. doi: 10.1158/1078-0432. [DOI] [PubMed] [Google Scholar]
  • 21.Alhopuro P., Sammalkorpi H., Niittymäki I., et al. Candidate driver genes in microsatellite-unstable colorectal cancer. International Journal of Cancer. 2012;130(7):1558–1566. doi: 10.1002/ijc.26167. [DOI] [PubMed] [Google Scholar]
  • 22.Khamas A., Ishikawa T., Shimokawa K., et al. Screening for epigenetically masked genes in colorectal cancer using 5-aza-2′-deoxycytidine, microarray and gene expression profile. Cancer Genomics Proteomics. 2012;9(2):67–75. [PubMed] [Google Scholar]
  • 23.Satoh K., Yachida S., Sugimoto M., et al. Global metabolic reprogramming of colorectal cancer occurs at adenoma stage and is induced by MYC. Proceedings of the National Academy of Sciences of the United States of America. 2017;114(37):E7697–E7706. doi: 10.1073/pnas.1710366114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Vlachavas E.-I., Pilalis E., Papadodima O., et al. Radiogenomic analysis of F-18-fluorodeoxyglucose positron emission tomography and gene expression data elucidates the epidemiological complexity of colorectal cancer landscape. Computational and Structural Biotechnology Journal. 2019;17:177–185. doi: 10.1016/j.csbj.2019.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ritchie M. E., Phipson B., Wu D., et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research. 2015;43(7, article e47) doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Kolde R., Laur S., Adler P., Vilo J. Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics. 2012;28(4):573–580. doi: 10.1093/bioinformatics/btr709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Maere S., Heymans K., Kuiper M. BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005;21(16):3448–3449. doi: 10.1093/bioinformatics/bti551. [DOI] [PubMed] [Google Scholar]
  • 28.Fabregat A., Jupe S., Matthews L., et al. The reactome pathway knowledgebase. Nucleic Acids Research. 2018;46(D1):D649–D655. doi: 10.1093/nar/gkx1132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fortino V., Alenius H., Greco D. BACA: bubble chArt to compare annotations. BMC Bioinformatics. 2015;16(1):p. 37. doi: 10.1186/s12859-015-0477-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Szklarczyk D., Morris J. H., Cook H., et al. The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Research. 2017;45(D1):D362–D368. doi: 10.1093/nar/gkw937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Bader G. D., Hogue C. W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2003;4(1):p. 2. doi: 10.1186/1471-2105-4-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kuhn M. Building predictive models in R using the caret package. Journal of Statistical Software. 2008;28(5) doi: 10.18637/jss.v028.i05. [DOI] [Google Scholar]
  • 33.Punt C. J., Koopman M., Vermeulen L. From tumour heterogeneity to advances in precision treatment of colorectal cancer. Nature Reviews Clinical Oncology. 2017;14(4):235–246. doi: 10.1038/nrclinonc.2016.171. [DOI] [PubMed] [Google Scholar]
  • 34.Vucic E. A., Thu K. L., Robison K., et al. Translating cancer 'omics' to improved outcomes. Genome Research. 2012;22(2):188–195. doi: 10.1101/gr.124354.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Jass J. R. Molecular heterogeneity of colorectal cancer: implications for cancer control. Surgical Oncology. 2007;16(Supplement 1):7–9. doi: 10.1016/j.suronc.2007.10.039. [DOI] [PubMed] [Google Scholar]
  • 36.Oldham W. M., Hamm H. E. Heterotrimeric G protein activation by G-protein-coupled receptors. Nature Reviews Molecular Cell Biology. 2008;9(1):60–71. doi: 10.1038/nrm2299. [DOI] [PubMed] [Google Scholar]
  • 37.Morrison C. J., Butler G. S., Rodriguez D., Overall C. M. Matrix metalloproteinase proteomics: substrates, targets, and therapy. Current Opinion in Cell Biology. 2009;21(5):645–653. doi: 10.1016/j.ceb.2009.06.006. [DOI] [PubMed] [Google Scholar]
  • 38.Christudoss P., Chacko G., Selvakumar R., Fleming J. J., Pugazhendhi S., Mathew G. Expression of metallothionein in dimethylhydrazine-induced colonic precancerous and cancerous model in rat. Journal of Cancer Research and Therapeutics. 2016;12(4):1307–1312. doi: 10.4103/0973-1482.179107. [DOI] [PubMed] [Google Scholar]
  • 39.Na H., Liu X., Li X., et al. Novel roles of DC-SIGNR in colon cancer cell adhesion, migration, invasion, and liver metastasis. Journal of Hematology & Oncology. 2017;10(1):p. 28. doi: 10.1186/s13045-016-0383-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Hishikawa Y., Kohno H., Ueda S., et al. Expression of metallothionein in colorectal cancers and synchronous liver metastases. Oncology (Williston Park) 2001;61(2):162–167. doi: 10.1159/000055368. [DOI] [PubMed] [Google Scholar]
  • 41.le Rolle A. F., Chiu T. K., Fara M., et al. The prognostic significance of CXCL1 hypersecretion by human colorectal cancer epithelia and myofibroblasts. Journal of Translational Medicine. 2015;13(1):p. 199. doi: 10.1186/s12967-015-0555-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Oladipo O., Conlon S., O'Grady A., et al. The expression and prognostic impact of CXC-chemokines in stage II and III colorectal cancer epithelial and stromal tissue. British Journal of Cancer. 2011;104(3):480–487. doi: 10.1038/sj.bjc.6606055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Doll D., Keller L., Maak M., et al. Differential expression of the chemokines GRO-2, GRO-3, and interleukin-8 in colon cancer and their impact on metastatic disease and survival. International Journal of Colorectal Disease. 2010;25(5):573–581. doi: 10.1007/S00384-010-0901-1. [DOI] [PubMed] [Google Scholar]
  • 44.Xiao Y. C., Yang Z. B., Cheng X. S., et al. CXCL8, overexpressed in colorectal cancer, enhances the resistance of colorectal cancer cells to anoikis. Cancer Letters. 2015;361(1):22–32. doi: 10.1016/j.canlet.2015.02.021. [DOI] [PubMed] [Google Scholar]
  • 45.Shen T., Yang Z., Cheng X., et al. CXCL8 induces epithelial-mesenchymal transition in colon cancer cells via the PI3K/Akt/NF-κB signaling pathway. Oncology Reports. 2017;37(4):2095–2100. doi: 10.3892/or.2017.5453. [DOI] [PubMed] [Google Scholar]
  • 46.Du J., He Y., Li P., Wu W., Chen Y., Ruan H. IL-8 regulates the doxorubicin resistance of colorectal cancer cells via modulation of multidrug resistance 1 (MDR1) Cancer Chemotherapy and Pharmacology. 2018;81(6):1111–1119. doi: 10.1007/s00280-018-3584-x. [DOI] [PubMed] [Google Scholar]
  • 47.Li J., Liu Q., Huang X., et al. Transcriptional profiling reveals the regulatory role of CXCL8 in promoting colorectal cancer. Frontiers in Genetics. 2020;10:p. 1360. doi: 10.3389/fgene.2019.01360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Rupertus K., Sinistra J., Scheuer C., et al. Interaction of the chemokines I-TAC (CXCL11) and SDF-1 (CXCL12) in the regulation of tumor angiogenesis of colorectal cancer. Clinical & Experimental Metastasis. 2014;31(4):447–459. doi: 10.1007/s10585-014-9639-4. [DOI] [PubMed] [Google Scholar]
  • 49.Zeng Y. J., Lai W., Wu H., et al. Neuroendocrine-like cells -derived CXCL10 and CXCL11 induce the infiltration of tumor-associated macrophage leading to the poor prognosis of colorectal cancer. Oncotarget. 2016;7(19):27394–27407. doi: 10.18632/oncotarget.8423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gao Y. J., Liu D. L., Li S., et al. Down-regulation of CXCL11 inhibits colorectal cancer cell growth and epithelial-mesenchymal transition. OncoTargets and Therapy. 2018;11:7333–7343. doi: 10.2147/OTT.S167872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wang X., Chen X., Zhou H., et al. The long noncoding RNA, LINC01555, promotes invasion and metastasis of colorectal cancer by activating the neuropeptide, neuromedin U. Medical Science Monitor. 2019;25:4014–4024. doi: 10.12659/MSM.916508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Desurmont T., Skrypek N., Duhamel A., et al. Overexpression of chemokine receptor CXCR2 and ligand CXCL7 in liver metastases from colon cancer is correlated to shorter disease-free and overall survival. Cancer Science. 2015;106(3):262–269. doi: 10.1111/cas.12603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Lauriola M., Ugolini G., Rosati G., et al. Identification by a Digital Gene Expression Displayer (DGED) and test by RT-PCR analysis of new mRNA candidate markers for colorectal cancer in peripheral blood. International Journal of Oncology. 2010;37(2):519–525. doi: 10.3892/ijo_00000701. [DOI] [PubMed] [Google Scholar]
  • 54.Mlakar V., Berginc G., Volavšek M., Štor Z., Rems M., Glavač D. Presence of activating KRAS mutations correlates significantly with expression of tumour suppressor genes DCN and TPM1 in colorectal cancer. BMC Cancer. 2009;9(1):p. 282. doi: 10.1186/1471-2407-9-282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Yagi K., Akagi K., Hayashi H., et al. Three DNA methylation epigenotypes in human colorectal cancer. Clinical Cancer Research. 2010;16(1):21–33. doi: 10.1158/1078-0432.CCR-09-2006. [DOI] [PubMed] [Google Scholar]
  • 56.Arriaga J. M., Levy E. M., Bravo A. I., et al. Metallothionein expression in colorectal cancer: relevance of different isoforms for tumor progression and patient survival. Human Pathology. 2012;43(2):197–208. doi: 10.1016/j.humpath.2011.04.015. [DOI] [PubMed] [Google Scholar]
  • 57.Bruewer M., Schmid K. W., Krieglstein C. F., Senninger N., Schuermann G. Metallothionein: early marker in the carcinogenesis of ulcerative colitis-associated colorectal carcinoma. World Journal of Surgery. 2002;26(6):726–731. doi: 10.1007/s00268-002-6266-z. [DOI] [PubMed] [Google Scholar]
  • 58.Arriaga J. M., Bravo A. I., Mordoh J., Bianchini M. Metallothionein 1G promotes the differentiation of HT-29 human colorectal cancer cells. Oncology Reports. 2017;37(5):2633–2651. doi: 10.3892/or.2017.5547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Arriaga J. M., Greco A., Mordoh J., Bianchini M. Metallothionein 1G and zinc sensitize human colorectal cancer cells to chemotherapy. Molecular Cancer Therapeutics. 2014;13(5):1369–1381. doi: 10.1158/1535-7163.MCT-13-0944. [DOI] [PubMed] [Google Scholar]
  • 60.Bindea G., Mlecnik B., Tosolini M., et al. Spatiotemporal dynamics of intratumoral immune cells reveal the immune landscape in human cancer. Immunity. 2013;39(4):782–795. doi: 10.1016/j.immuni.2013.10.003. [DOI] [PubMed] [Google Scholar]
  • 61.Hu D., Ansari D., Bauden M., Zhou Q., Andersson R. The emerging role of calcium-activated chloride channel regulator 1 in cancer. Anticancer Research. 2019;39(4):1661–1666. doi: 10.21873/anticanres.13271. [DOI] [PubMed] [Google Scholar]
  • 62.Pan X., Wang Q., Xu C., Yan L., Pang S., Gan J. Prognostic value of chloride channel accessory mRNA expression in colon cancer. Oncology Letters. 2019;18(3):2967–2976. doi: 10.3892/ol.2019.10615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Huang Z., Yang Q., Huang Z. Identification of critical genes and five prognostic biomarkers associated with colorectal cancer. Medical Science Monitor. 2018;24:4625–4633. doi: 10.12659/MSM.907224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Zuo S., Dai G., Ren X. Identification of a 6-gene signature predicting prognosis for colorectal cancer. Cancer Cell International. 2019;19(1):p. 6. doi: 10.1186/s12935-018-0724-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Chen L., Lu D., Sun K., et al. Identification of biomarkers associated with diagnosis and prognosis of colorectal cancer patients based on integrated bioinformatics analysis. Gene. 2019;692:119–125. doi: 10.1016/j.gene.2019.01.001. [DOI] [PubMed] [Google Scholar]
  • 66.Friedman J., Hastie T., Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software. 2010;33(1):1–22. [PMC free article] [PubMed] [Google Scholar]
  • 67.Meng Z., Ren D., Zhang K., Zhao J., Jin X., Wu H. Using ESTIMATE algorithm to establish an 8-mRNA signature prognosis prediction system and identify immunocyte infiltration-related genes in pancreatic adenocarcinoma. Aging (Albany NY) 2020;12 doi: 10.18632/aging.102931. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Van Cutsem E., Melichar B., Van den Eynde M., et al. Phase 2 study results of murlentamab, a monoclonal antibody targeting the anti-Mullerian-hormone-receptor II (AMHRII), acting through tumor-associated macrophage engagement in advanced/metastatic colorectal cancers. Annals of Oncology. 2019;30(Supplement 4):iv153–iv154. doi: 10.1093/annonc/mdz183.003. [DOI] [Google Scholar]
  • 69.Prescott J., Jariwala U., Jia L., et al. Androgen receptor-mediated repression of novel target genes. Prostate. 2007;67(13):1371–1383. doi: 10.1002/pros.20623. [DOI] [PubMed] [Google Scholar]
  • 70.Li C., Shen Z., Zhou Y., Yu W. Independent prognostic genes and mechanism investigation for colon cancer. Biological Research. 2018;51(1):p. 10. doi: 10.1186/s40659-018-0158-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Xiao J., Lu X., Chen X., et al. Eight potential biomarkers for distinguishing between lung adenocarcinoma and squamous cell carcinoma. Oncotarget. 2017;8(42, article 17606):71759–71771. doi: 10.18632/oncotarget.17606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Wu W., Li J., Liu Y., Zhang C., Meng X., Zhou Z. Comparative proteomic studies of serum from patients with hepatocellular carcinoma. Journal of Investigative Surgery. 2012;25(1):37–42. doi: 10.3109/08941939.2011.603816. [DOI] [PubMed] [Google Scholar]
  • 73.Vachani A., Nebozhyn M., Singhal S., et al. A 10-gene classifier for distinguishing head and neck squamous cell carcinoma and lung squamous cell carcinoma. Clinical Cancer Research. 2007;13(10):2905–2915. doi: 10.1158/1078-0432.CCR-06-1670. [DOI] [PubMed] [Google Scholar]
  • 74.Ishii J., Sato H., Yazawa T., et al. Class III/IV POU transcription factors expressed in small cell lung cancer cells are involved in proneural/neuroendocrine differentiation. Pathology International. 2014;64(9):415–422. doi: 10.1111/pin.12198. [DOI] [PubMed] [Google Scholar]
  • 75.Sun X., Hu F., Hou Z., et al. SIX4 activates Akt and promotes tumor angiogenesis. Experimental Cell Research. 2019;383(1, article 111495) doi: 10.1016/j.yexcr.2019.111495. [DOI] [PubMed] [Google Scholar]
  • 76.Li G., Hu F., Luo X., Hu J., Feng Y. SIX4 promotes metastasis via activation of the PI3K-AKT pathway in colorectal cancer. PeerJ. 2017;5, article e3394 doi: 10.7717/peerj.3394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Lee J. H., Ahn B. K., Baik S. S., Lee K. H. Comprehensive analysis of somatic mutations in colorectal cancer with peritoneal metastasis. In Vivo. 2019;33(2):447–452. doi: 10.21873/invivo.11493. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary 1

Table S1: 254 DEGs screened by Limma package and integrated by RRA package from six GEO datasets.

Supplementary 2

Table S2: DEGs extracted from TCGA database.

Supplementary 3

Table S3: overlapping DEGs identified between six GEO datasets and TCGA database.

Supplementary 4

Table S4: the clinical information of patients organized for survival analysis.

Supplementary 5

Table S5: the biological processes analyzed for overlapping DEGs.

Supplementary 6

Table S6: the molecular functions analyzed for overlapping DEGs.

Supplementary 7

Table S7: the reactome pathway enriched for upregulated overlapping DEGs.

Supplementary 8

Table S8: the reactome pathway enriched for downregulated overlapping DEGs.

Supplementary 9

Table S9: the protein-protein interactions among overlapping DEGs identified with combined score ≥ 0.4.

Supplementary 10

Table S10: three topological parameters calculated for module 1.

Supplementary 11

Table S11: three topological parameters calculated for module 2.

Supplementary 12

Table S12: the univariate Cox proportional hazards regression analysis for train group.

Supplementary 13

Table S13: the LASSO penalized regression performed for train group.

Supplementary 14

Table S14: differential expression of PGPEP1L in human.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.


Articles from Mediators of Inflammation are provided here courtesy of Wiley

RESOURCES