Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2021 Mar 25;11:6837. doi: 10.1038/s41598-021-86286-z

Construction and validation of a metabolic risk model predicting prognosis of colon cancer

Didi Zuo 1,#, Chao Li 2,#, Tao Liu 2, Meng Yue 2, Jiantao Zhang 2,, Guang Ning 1,3,
PMCID: PMC7994414  PMID: 33767290

Abstract

Metabolic genes have played a significant role in tumor development and prognosis. In this study, we constructed a metabolic risk model to predict the prognosis of colon cancer based on The Cancer Genome Atlas (TCGA) and validated the model by Gene Expression Omnibus (GEO). We extracted 753 metabolic genes and identified 139 differentially expressed genes (DEGs) from TCGA database. Then we conducted univariate cox regression analysis and Least Absolute Shrinkage and Selection Operator Cox regression analysis to identify prognosis-related genes and construct the metabolic risk model. An eleven-gene prognostic model was constructed after 1000 resamples. The gene signature has been proved to have an excellent ability to predict prognosis by Kaplan–Meier analysis, time-dependent receiver operating characteristic, risk score, univariate and multivariate cox regression analysis based on TCGA. Then we validated the model by Kaplan–Meier analysis and risk score based on GEO database. Finally, we performed a weighted gene co-expression network analysis and protein–protein interaction network on DEGs, and Kyoto Encyclopedia of Genes and Genomes pathways and Gene Ontology enrichment analyses were conducted. The results of functional analyses showed that most significantly enriched pathways focused on metabolism, especially glucose and lipid metabolism pathways.

Subject terms: Cancer, Cancer genomics, Cancer

Introduction

Colon cancer is a common malignant tumor which mainly occurs in the proximal colon. Colon cancer and rectal cancer are often grouped as “colorectal cancer” (CRC), which is the fourth most common cancer and the second leading cause of cancer-related death in America1. There are estimated to be 147,950 newly diagnosed CRC individuals, including 104,610 colon cancer patients in America in 20202. In addition, the colon cancer patients are rapidly shifting younger as a result of older individuals declining and younger individuals increasing3. However, colon cancer is potentially preventable because most of the cases and deaths are attributable to an unhealthy lifestyle, including high-fat and low-fiber diet, smoking and drinking, insufficient physical activity, and overweight4. Although morbidity and mortality can be mitigated through appropriate screening, including imaging techniques and colonoscopy5,6, approximately 80% of CRC patients show recurrence during the first 3 years. Thus, identifying reliable prognostic biomarkers to select high-risk colon cancer patients is important for improving the survival rate.

Recently, metabolic reprogramming has become a hot topic7. Studies had been shown that tumor metabolism played a vital role in tumor cells8,9. Metabolism plays a vital role in the progression and prognosis of colon cancer10. Energy metabolism is the basis of cell proliferation, and change of cell metabolism is the feature of tumor cells9,11. Tumor cells change metabolic processes to satisfy the increased energy and nutritional demands, for growth and invasion12. The pathogenesis, progression and prognosis of colon cancer is closely related to metabolic progress, including glucose metabolism, amino acid metabolism, and lipid metabolism1315. In this study, we constructed and validated a metabolic risk model of colon cancer based on the data downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database, to explore the potential role of metabolic genes and accurately predict prognosis of colon cancer.

Materials and methods

Data collection

We retrieved and downloaded gene expression profile and clinical data (age, gender, tumor grade, TMN stage) from TCGA (https://portal.gdc.cancer.gov/). We then obtained and combined the gene expression matrix of the GSE17536, GSE17537, and GSE29621 series from the GEO (http://www.ncbi.nlm.nih.gov/geo/) database. The data was extracted, annotated, and normalized by Strawberry Perl (version5.30.1.1). We downloaded the metabolic genes from Gene Set Enrichment Analysis (GSEA) platform (http://software.broadinstitute.org/gsea/downloads).

Differentially expressed metabolic genes identification

Metabolic genes were defined as genes enriched in metabolism pathways based on Kyoto Encyclopedia of Genes and Genomes (KEGG) database. We extracted and downloaded metabolic genes from TCGA. Then, we adjusted different mRNA expression levels by ‘sva’ package and selected candidate metabolic genes by R software (version 3.6.1). Differentially expressed genes (DEGs) were screened through ‘limma’ package (version 3.44.1) on R, with the screening criteria false discovery rate (FDR) < 0.05 and |log2-fold change| > 1. Heatmap and volcano were constructed through ‘pheatmap’ package in R.

Construction of metabolic signature

We used the dataset from TCGA as the training cohort. Univariate cox regression analysis was performed to identify prognostic metabolic genes by ‘survival’ and ‘survminer’ package, with the screening criteria P < 0.05. We regarded overall survival (OS) as the primary outcome and genes with HR < 1 were defined as better prognosis. The metabolic risk model was constructed after 1000 resamples by Least Absolute Shrinkage and Selection Operator (LASSO) Cox regression analysis through ‘glmnet’ and ‘survival’ package on R.

The performance of gene signature

We applied the metabolic risk model to TCGA datasets. Patients were grouped into high- and low-risk groups according to the median risk score. Kaplan–Meier curves were generated by survival analysis through the ‘survival’ package. Risk score curves were drawn using the ‘pheatmap’ package. Next, univariate and multivariate cox proportional hazards analysis were performed. Time-dependent receiver operating characteristic (ROC) curve and the area under the ROC curve (AUC) were used to evaluate the sensitivity and specificity of the model using the ‘survival ROC’ package. For evaluation of the performance of the model, we compared the prediction model with the existing model (TNM stage) by the time-dependent ROC curve16. The 1-, 3-, and 5-year ROC curves of the model were plotted and the values of AUC were calculated with packages ‘survival’, ‘survminer’, and ‘survival ROC’ in R. A nomogram was built to visualize the risk model and classic independent risk factors, including age, gender, tumor grade, and TMN stage, to calculate survival rate of cancer patients.

Validation of metabolic risk model

We investigated extensively in GEO datasets and collected all appropriate datasets to validate the predictive ability of the model. Patients were grouped into high- and low-risk groups and Kaplan–Meier curves were generated by survival analysis through the “survival” package. Risk score curves were drawn using the ‘pheatmap’ package.

Gene set enrichment analyses

GSEA software (version 4.0.3) for Windows was downloaded from website for functional analyses. KEGG and GO (Gene Ontology) enrichment analyses of the DEGs were performed to identify potential pathways and functions by GSEA.

WGCNA of DEGs

To assess the inter-correlation of the intensities of the DEGs and the relationship between them and feature data, a weighted gene co-expression network analysis (WGCNA) was performed using the WGCNA R package. Firstly, we conducted quality testing and preprocessing of the data17,18. Normalization was performed using ‘preprocessCore’ package, while outliers were removed and missing values were treated by using the ‘impute’ package. After the procedure of preprocessing, we got a reasonable and useful sample set. Secondly, a matrix of adjacencies was built according to the calculated absolute value of the Pearson’s correlation coefficients between each of the gene pairs. Thirdly, the matrix was applied to a network construction and module detection function by building a weighted matrix with a scaling factor-β. Then, the adjacencies were transformed into topological overlap matrix (TOM). Similar modules were merged to trim genes whose correlation with module eigengene was less than the defined threshold (min Module Size of 30 and merge Cut Height of 0.25). Fourthly, we calculated the correlation between clinical information and module eigengenes (MEs) to identify the clinically significant modules. Finally, we calculated the correlation between genes and clinical traits, as well as the correlation between genes and MEs. WGCNA determines highly inter-connected nodes as modules designated with different colors.

PPI network construction and functional analyses

Protein–protein interaction (PPI) network was constructed by Search Tool for the Retrieval of Interacting Genes (STRING, https://string-db.org/) and the CytoHubba plug-in in the Cytoscape software (version 3.7.1). Then we analyzed the relationships between each core genes and clinical characteristics. The ‘clusterProfiler’, ‘enrichplot’, and ‘ggplot2’ software packages in R were used to perform Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. GO has three independent branches: molecular function (MF), biological process (BP), and cellular component (CC). An FDR of < 0.05 was defined as statistically significant.

Consent for publication

All the authors listed have approved the submission and publication.

Results

Data extraction

We extracted 452 cases and 753 metabolic genes from TCGA database, including 361 up-regulated and 392 down-regulated genes (Supplementary Table 1). Then we downloaded and combined gene expression matrix of the GSE17536, GSE17537, and GSE29621 series and obtained 177 cases. After screening, we identified 139 differentially expressed metabolic genes, including 62 up-regulated and 77 down-regulated genes, which were shown in volcano plot (Fig. 1A) and heatmap (Fig. 1B).

Figure 1.

Figure 1

Volcano plot and Heatmap of differentially expressed metabolic genes. (A) Volcano plot of differentially expressed metabolic genes. The red points represent high expression genes, the green points represent low expression genes, the black points represent genes with no significant difference (FDR < 0.05, log FC > 1.0). (B) Heatmap of differentially expressed metabolic genes. Red indicates that the gene expression is relatively high, green indicates that the gene expression is relatively low, and black indicates no significant changes in gene expression (FDR < 0.05, absolute log FC > 1.0).

Construction of metabolic signature

A total of 15 prognostic genes were dug out by univariate cox regression analysis. Among them, NAT2, ENOPH1, ACAA2, UGT2A3, PAFAH1B3, SUCLG2, CPT2, and ACOX1 were associated with better overall survival outcomes, while PKM, GPX3, LPCAT1, ADCY5, ADH1B, SPHK1, and PTGDS were associated with worse overall survival outcomes (Fig. 2). Then an eleven-gene prognostic model was constructed after 1000 resamples by LASSO penalized Cox regression analysis. Risk score = 0.0011 × expression of PKM + 0.0076 × expression of GPX3 + 0.0052 × expression of LPCAT1 − 0.0060 × expression of ENOPH1 − 0.0044 × expression of ACAA2 − 0.0041 × expression of UGT2A3 + 0.0934 × expression of ADCY5 − 0.0119 × expression of PAFAH1B3 − 0.0578 × expression of CPT2 − 0.0461 × expression of ACOX1 + 0.0102 × expression of PTGDS.

Figure 2.

Figure 2

Forest plot of the univariate Cox regression analysis showing the prognostic metabolic genes. Genes with HR < 1 have better overall survival outcomes, while genes with HR > 1 have worse overall survival outcomes.

The performance of gene signature

Patients were grouped into high- and low-risk groups according to the median risk score. In the high, patients had a lower OS according to Kaplan–Meier curves (P < 0.05, Fig. 3A) and risk score distribution than in the low (Fig. 4A). The differential expression of metabolic genes in high- and low-risk groups were shown in heatmaps (Fig. 4C). Univariate Cox analysis showed that the risk of poor prognosis elevated with the risk score increasing (Fig. 5A). Multivariate Cox regression analysis demonstrated that the risk score could be an independent risk factor for OS (Fig. 5B). The AUC was 0.697 according to the ROC, which was higher than age (AUC = 0.561), gender (AUC = 0.439), and T stage (AUC = 0.672) (Fig. 6). We compared the model with TNM stage by the time-dependent ROC curve and the values of AUC. For our predict model, the AUC was 0.707 (1-year), 0.742 (3-year) and 0.775 (5-year), respectively (Fig. 7A). For TMN stage, the AUC was 0.782 (1-year), 0.766 (3-year) and 0.758 (5-year), respectively (Fig. 7B). Nomogram was formed of age, gender, tumor stage, TNM stage and the risk score as well (Fig. 8).

Figure 3.

Figure 3

Kaplan–Meier curves of Metabolic Signature. (A) Kaplan–Meier curves of gene signature based on TCGA. (B) Kaplan–Meier curves of gene signature based on GEO.

Figure 4.

Figure 4

Risk scores of Metabolic Signature. (A) Risk score distribution of the eleven -gene signatures based on TCGA database. (B) Risk score distribution of the eleven-gene signatures based on GEO database. (C) Heatmaps of the differential expression of metabolic genes in high-risk and low-risk groups based on TCGA database. (D) Heatmaps of the differential expression of metabolic genes in high-risk and low-risk groups based on GEO database.

Figure 5.

Figure 5

Univariate and Multivariate Cox analysis. (A) Prognostic value detection of the gene signature via univariate survival-related analysis in TCGA. (B) Prognostic value detection of the gene signature via multivariate survival-related analysis in TCGA.

Figure 6.

Figure 6

Time-dependent ROC analysis of Metabolic Signature in TCGA cohorts. ROC, receiver operating characteristic; AUC, area under the ROC curve.

Figure 7.

Figure 7

The comparison of this model with TNM stage by the time-dependent ROC curve and AUC of 1, 3, and 5-year. (A) For our predict model, the AUC was 0.707 (1-year), 0.742 (3-year) and 0.775 (5-year), respectively. (B) For TMN stage, the AUC was 0.782 (1-year), 0.766 (3-year) and 0.758 (5-year), respectively.

Figure 8.

Figure 8

Nomogram was built to predict overall survival for colon cancer patients.

Validation of metabolic signature

We adjusted and combined GSE17536, GSE17537, and GSE29621 as validation dataset to reanalyze. Patients were grouped into high- and low-risk groups according to the median risk score. In the high, patients had a lower OS according to Kaplan–Meier curves (P < 0.05, Fig. 3B) and risk score distribution than in the low (Fig. 4B). The differential expression of metabolic genes in high- and low-risk groups were shown in heatmaps (Fig. 4D).

GSEA

The results of GSEA showed that significantly metabolism-related pathways were enriched in the low-risk group. KEGG analysis demonstrated that most of the enriched pathways focused on fatty acids (FAs) metabolism, pyruvate metabolism, propanoate metabolism, and butanoate metabolism pathways (Fig. 9A). GO analysis showed that genes were mainly enriched in nucleoside bisphosphate metabolic process, and thioester metabolic process (Fig. 9B). In addition, there were several famous cancer-related pathways, such as cell adhesion molecules (CAMs), cytokine-cytokine receptor interaction, peroxisome, peroxisomal transport, peroxisome organization.

Figure 9.

Figure 9

KEGG pathways and GO enrichment analyses by GSEA. (A) Top 10 representative KEGG pathways. (B) Top 10 representative GO terms.

WGCNA of DEGs

Figure 10A shows the determination of β parameter based on the description in the WGCNA manual. We established two modules based on genes expression pattern via average linkage clustering (Fig. 10B). Modules in gene dendrogram are shown in different colors and based on dynamic branch cutting algorithm underneath row color assigns the modules membership. Figure 10C shows a highly significant correlation between gene significant (GS) versus module membership (MM) in the turquoise module with tumor, and Fig. 10D shows the most significant and correlated module with tumor was blue module.

Figure 10.

Figure 10

A weighted gene co-expression network analysis (WGCNA) of the DEGs. (A) A scaling factor beta determination based on the scale-free topology criterion. (B) Hierarchical clustering of genes in significant modules. The colors are assigned to each module by the Dynamic Tree Cut algorithm. (C) Scatterplot shows a highly significant correlation between gene significant (GS) versus module membership (MM) in the turquoise module with tumor. (D) Heatmap shows most significant and correlated module with tumor was blue module.

PPI network construction and functional analyses

We selected the genes with high GS and MM from blue module based on the co-expression network to constructed PPI networks and highlighted the high connectivity genes. The PPI network showed the top 10 central metabolic genes in colon cancer (Fig. 11). Of all the central genes, PYGM was correlated with gender, T stage, and N stage, while ACOX1, ADH1B, and MAOB were significantly correlated with N stage (Supplementary Fig. 1). GO enrichment results show the top 10 BP terms, 10 CC terms and 10 MF terms, The KEGG enrichment results show the 30 paths (Supplementary Fig. 2).

Figure 11.

Figure 11

PPI network of the top 10 interconnected genes in the blue modules.

Discussion

In this study, we established a metabolic risk model to predict the prognosis of colon cancer based on TCGA and verified by GEO. Firstly, we identified differentially expressed metabolic genes from TCGA and constructed a prognostic model by LASSO. The gene signature has been proved to have an excellent ability to predict prognosis by Kaplan–Meier analysis, time-dependent receiver operating characteristic (ROC), risk score, univariate and multivariate cox regression analysis based on TCGA. Then, we verified the model by Kaplan–Meier curves and risk score based on GEO. Finally, we performed WGCNA, PPI network and functional analyses of DEGs.

Colon cancer is a malignant tumor with poor prognosis, so novel prognostic biomarkers are urgently needed. In recent years, there have been many publications and studies on the risk prediction model for the prognosis of colon cancer, including circulating tumor DNA19, mRNA20, miRNA21, and clinical characters22. Energy metabolism is the basis of proliferation and invasion for tumor cell, and most tumor cells show deviation from the normal energy metabolism state. The prognostic models based on metabolic genes have been established in rectal cancer23, lung adenocarcinoma24, hepatocellular carcinoma25, and acute myelogenous leukemia26. But it has not been used in colon cancer. In this study, we identified eleven prognostic genes involved in metabolism, which revealed the metabolism alterations in colon cancer. Theses metabolic genes can be used as potential biomarkers and therapeutic target for colon cancer patients.

In this model, the expression of ENOPH1, ACAA2, UGT2A3, PAFAH1B3, CPT2, and ACOX1 predicted better prognosis, whereas the expression of PKM, GPX3, LPCAT1, ADCY5 and PTGDS predicted worse prognosis. Most of the genes have been proved to take part in pathogenesis, progression and prognosis of cancers. We compared this metabolic risk model with other metabolic signatures in previous studies and found that PAFAH1B3 is a good prognostic gene in myelogenous leukemia26 while LPCAT1 is a risky prognostic gene in hepatocellular carcinoma27, which has the same function in colon cancer. Platelet activating factor acetyl hydrolase 1B3 (PAFAH1B3) is involved in ether lipid metabolism, and was reported to play an important role in tumorigenesis and aggressiveness in many different cancers28. Blockers of PAFAH1B3 could heightened levels of tumor-suppressing lipids, then impairs pathogenicity of different cancers, including breast, ovarian, melanoma, and prostate cancer29. LPCAT1, an important subtype belongs to the LPCAT subtypes, is a cytosolic enzyme which converts lysophosphatidylcholine (LPC) to phosphatidylcholine (PC). To date, the over-expression of LPCAT1 has been reported in the contribution of the progression, metastasis, and recurrence of multiple types of cancers, including cell renal cell carcinoma30, gastric cancer31, breast cancer32, oral squamous cell carcinoma33, and hepatocellular carcinoma34.

Enolase-phosphatase 1 (ENOPH1), a bifunctional enolase-dephosphorylase enzyme35, is required for polyamine biosynthesis36. Previous studies showed that overexpression of ENOPH1 remarkably promoted cell migration and invasiveness, whereas the downregulation of ENOPH1 significantly impaired cell migration and invasiveness37. Acetyl-Coenzyme A acyltransferase 2 (ACAA2) exerts an effect on β-oxidation of fatty acid, which then provide energy38. Previous showed that ACAA2 could abolished the apoptosis in human hepatocellular carcinoma39, and played a vital role in the metabolism processes in gliomas. The higher expression of ACAA2 was associated with better prognosis of colorectal cancer40. Carnitine palmitoyl transferase 2 (CPT2), a rate-limiting enzyme for mitochondrial fatty acid transportation, had been proved to be a protective prognostic gene for colorectal cancer41, which is in accordance with our results. Peroxisomal Acyl-Coa Oxidase 1 (ACOX1) is the rate-limiting enzyme in fatty acid β-oxidation. The inhibitor of ACOX1 is SIRT1, which has been proved to prevent oxidative damage and is downregulated in liver cancerr42,43, suppresses colorectal cancer metastasis by transcriptional repression of miR-15b-5p44.

Pyruvate kinase M (PKM), a metabolic regulator, participated in both glycolytic and non-glycolytic pathways. PKM also acts as protein kinase, which shifts the glucose metabolism from the respiratory chain to aerobic glycolysis in tumor cells. PKM2 is upregulated in most cancer types, and contributes to tumorigenesis45, which suggested that it could act as a remarkable therapeutic target46. The glutathione peroxidases-3 (GPX3), a selenocysteine-containing redox enzyme, took part in reactive oxygen species signaling and immunomodulatory47. Previous studies suggested that GPX3 prevented the colitis-associated carcinoma by immunomodulation48. But the correlation between GPX and prognosis of colon cancer is not clear. The tumor suppressive effect of prostaglandin D2 (PGD2) on testicular cancer and gastric cancer has been confirmed49. However, the correlation between PGD2 and colon cancer is unclear.

The gene signature has been proved to have an excellent ability to predict prognosis by Kaplan–Meier analysis, ROC and AUC, univariate and multivariate cox regression analysis based on TCGA. In addition, we compared the model with TNM stage by the ROC curve and the values of AUC. The results showed that our predict model was more accurate in the prediction of the 5-year survival rate (AUC = 0.775 vs 0.758). Besides, genetic testing is more accurate and convenient, without the need for surgery, so the prediction model is much more advanced for clinicians and researchers.

It is acknowledged that diverse factors affect tumor development and prognosis, including clinic-pathological diagnosis, immunology regulations, gut microbiome, and metabolism of glucose and lipids50. The TNM classification system is the major pathological staging method, which is the most widely used staging system for clinical decision making51,52. Increasing evidence indicates the key role of systemic immunity in tumor progression and outcome5355. Recent studies highlight the role of the microbiota composition in intestinal inflammation in mice and humans, which has been linked to colorectal cancer56. There have been many publications and studies on the risk prediction model for the prognosis of colon cancer based on immune genes5762. The abnormal metabolism of tumor glucose and lipids is an important part of tumor metabolic reprogramming, which is closely related to tumor occurrence, development, and prognois50. In this study, the functional analyses revealed many metabolic pathways related to prognosis of colon cancer. In one hand, the results validated the close association between metabolic systems and colon cancer. GSEA analyses showed most significantly enriched pathways focused on fatty acid metabolism, butanoate metabolism, propanoate metabolism and pyruvate metabolism pathways. Most of the metabolic genes in the model are fatty metabolism related genes, including PAFAH1B3, LPCAT1, ACAA2, ACOX1, and CPT2.The top four significant genes in PPI network ACOX1, CPT2, ACAA2, and PKM were involved in lipid metabolism or glycolysis. Changes of glucose and lipid metabolism is one of the key features of cancer cells because cell proliferation requires increased lipid biosynthesis, and bioactive molecules produced by lipid catabolism will act as signal molecules to regulate cancer metastasis63,64. Increased glycolysis is the main source of energy supply in cancer cells that use this metabolic pathway for ATP generation. Fatty acids are the preferred substrates for energy storage, mainly in the form of triglyceride. Availability of fatty acids in the blood and peripheral tissues depends on lifestyle and diet and is likely to be altered in patients who are obese or have metabolic syndrome. It is important to keep a healthy diet, including less high saturated fatty acids and lose weight.

In the other hand, most of the metabolic pathways were enriched in the low-risk patients, while the non-metabolic pathways were enriched in the high-risk patients. As a result, metabolic therapy may be more suitable for low‐risk patients, while high‐risk patients might benefit more from nonmetabolic therapy. Previous clinical studies have shown the effectiveness of glycolytic therapy to suppress cancer progression6567. So far, the application of metabolism in malignant tumors includes diagnosis, identification of biomarkers and tumor metabolic processes. In the future, colon cancer patients might benefit from metabolic-related therapy and management.

However, there were some limitations in this study. Firstly, there was no functional experiment in the real world. Secondly, deep learning is widely used for intelligence medicine to assistant disease diagnosis and risk prediction, like imaging data, genomic or transcriptomic data68, but we adopted Lasso and multivariate Cox regression to build a risk model which may result in some limitations. Thirdly, we didn’t test disease-associated modules for preservation by use of an independent dataset, which may lead to reliability decreases17,18. Moreover, it is relatively weak to take only metabolic genes into prognostic model because much more complicated mechanisms contributed to colon cancer.

Conclusions

We constructed and validated a metabolic risk model to predict the prognosis of colon cancer based on TCGA and GEO database. Metabolic-related therapied and managements could be considered for colon cancer patients. However, validation and functional experiments of the metabolic risk model are indispensable.

Supplementary Information

Supplementary Legends. (12.3KB, docx)
Supplementary Tables. (34.2KB, docx)

Author contributions

Analyzed the data and wrote the manuscript: D.Z., C.L. Contributed to analysis tools and manuscript preparation: T.L., M.Y. Contributed to the conception and design of the study: J.Z. and G.N.

Data availability

The datasets and analyses during the current study are available in TCGA and GEO database.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Didi Zuo and Chao Li.

Contributor Information

Jiantao Zhang, Email: zhangjt1980@163.com.

Guang Ning, Email: gning@sibs.ac.cn.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-86286-z.

References

  • 1.Benson AB, Venook AP, Al-Hawary MM, et al. NCCN guidelines insights: Colon cancer, version 2.2018. J. Natl. Compr. Cancer Netw. 2018;16(4):359–369. doi: 10.6004/jnccn.2018.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Siegel RL, Miller KD, Goding SA, et al. Colorectal cancer statistics, 2020. CA Cancer J. Clin. 2020;70(3):145–164. doi: 10.3322/caac.21601. [DOI] [PubMed] [Google Scholar]
  • 3.Siegel RL, Fedewa SA, Anderson WF, et al. Colorectal cancer incidence patterns in the United States, 1974–2013. J. Natl. Cancer Inst. 2017;109(8):djw322. doi: 10.1093/jnci/djw322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Islami F, Goding SA, Miller KD, et al. Proportion and number of cancer cases and deaths attributable to potentially modifiable risk factors in the United States. CA Cancer J. Clin. 2018;68(1):31–54. doi: 10.3322/caac.21440. [DOI] [PubMed] [Google Scholar]
  • 5.Winawer SJ, Zauber AG. The advanced adenoma as the primary target of screening. Gastrointest. Endosc. Clin. N. Am. 2002;12(1):1–9. doi: 10.1016/S1052-5157(03)00053-9. [DOI] [PubMed] [Google Scholar]
  • 6.Edwards BK, Ward E, Kohler BA, et al. Annual report to the nation on the status of cancer, 1975–2006, featuring colorectal cancer trends and impact of interventions (risk factors, screening, and treatment) to reduce future rates. Cancer-Am. Cancer Soc. 2010;116(3):544–573. doi: 10.1002/cncr.24760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Porta C, Sica A, Riboldi E. Tumor-associated myeloid cells: New understandings on their metabolic regulation and their influence in cancer immunotherapy. FEBS J. 2018;285(4):717–733. doi: 10.1111/febs.14288. [DOI] [PubMed] [Google Scholar]
  • 8.Qu W, Oya S, Lieberman BP, et al. Preparation and characterization of L-[5-11C]-glutamine for metabolic imaging of tumors. J. Nucl. Med. 2012;53(1):98–105. doi: 10.2967/jnumed.111.093831. [DOI] [PubMed] [Google Scholar]
  • 9.Hanahan D, Weinberg RA. Hallmarks of cancer: The next generation. Cell. 2011;144(5):646–674. doi: 10.1016/j.cell.2011.02.013. [DOI] [PubMed] [Google Scholar]
  • 10.Demirkol Canlı S, Seza EG, Sheraj I, et al. Evaluation of an aldo-keto reductase gene signature with prognostic significance in colon cancer via activation of epithelial to mesenchymal transition and the p70S6K pathway. Carcinogenesis. 2020;41:1219–1228. doi: 10.1093/carcin/bgaa072. [DOI] [PubMed] [Google Scholar]
  • 11.DeBerardinis RJ, Lum JJ, Hatzivassiliou G, Thompson CB. The biology of cancer: Metabolic reprogramming fuels cell growth and proliferation. Cell Metab. 2008;7(1):11–20. doi: 10.1016/j.cmet.2007.10.002. [DOI] [PubMed] [Google Scholar]
  • 12.Tabe Y, Lorenzi PL, Konopleva M. Amino acid metabolism in hematologic malignancies and the era of targeted therapy. Blood. 2019;134(13):1014–1023. doi: 10.1182/blood.2019001034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chen W, Gao C, Liu Y, et al. Bioinformatics analysis of prognostic miRNA signature and potential critical genes in colon cancer. Front. Genet. 2020;11:478. doi: 10.3389/fgene.2020.00478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zhang Z, Zhang S, Yang J, et al. Integrated transcriptomic and metabolomic analyses to characterize the anti-cancer effects of (-)-epigallocatechin-3-gallate in human colon cancer cells. Toxicol. Appl. Pharm. 2020;401:115100. doi: 10.1016/j.taap.2020.115100. [DOI] [PubMed] [Google Scholar]
  • 15.Gharib E, Nasri Nasrabadi P, Reza ZM. miR-497-5p mediates starvation-induced death in colon cancer cells by targeting acyl-CoA synthetase-5 and modulation of lipid metabolism. J. Cell. Physiol. 2020;235(7–8):5570–5589. doi: 10.1002/jcp.29488. [DOI] [PubMed] [Google Scholar]
  • 16.Yu H, Pan R, Qi Y, et al. LEPR hypomethylation is significantly associated with gastric cancer in males. Exp. Mol. Pathol. 2020;116:104493. doi: 10.1016/j.yexmp.2020.104493. [DOI] [PubMed] [Google Scholar]
  • 17.Chen J, Zhao X, Cui L, et al. Genetic regulatory subnetworks and key regulating genes in rat hippocampus perturbed by prenatal malnutrition: Implications for major brain disorders. Aging. 2020;12(9):8434–8458. doi: 10.18632/aging.103150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Li H, Wang X, Lu X, et al. Co-expression network analysis identified hub genes critical to triglyceride and free fatty acid metabolism as key regulators of age-related vascular dysfunction in mice. Aging. 2019;11(18):7620–7638. doi: 10.18632/aging.102275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Luo H, Zhao Q, Wei W, et al. Circulating tumor DNA methylation profiles enable early diagnosis, prognosis prediction, and screening for colorectal cancer. Sci. Transl. Med. 2020;12(524):eaax7533. doi: 10.1126/scitranslmed.aax7533. [DOI] [PubMed] [Google Scholar]
  • 20.Zuo S, Dai G, Ren X. Identification of a 6-gene signature predicting prognosis for colorectal cancer. Cancer Cell Int. 2019;19:6. doi: 10.1186/s12935-018-0724-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Di Z, Di M, Fu W, et al. Integrated analysis identifies a nine-microRNA signature biomarker for diagnosis and prognosis in colorectal cancer. Front. Genet. 2020;11:192. doi: 10.3389/fgene.2020.00192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Boakye D, Jansen L, Schneider M, et al. Personalizing the prediction of colorectal cancer prognosis by incorporating comorbidities and functional status into prognostic nomograms. Cancers. 2019;11(10):1435. doi: 10.3390/cancers11101435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhang ZY, Yao QZ, Liu HY, et al. Metabolic reprogramming-associated genes predict overall survival for rectal cancer. J. Cell. Mol. Med. 2020;24(10):5842–5849. doi: 10.1111/jcmm.15254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zhang S, Lu Y, Liu Z, et al. Identification six metabolic genes as potential biomarkers for lung adenocarcinoma. J. Comput. Biol. 2020;27:1532–1543. doi: 10.1089/cmb.2019.0454. [DOI] [PubMed] [Google Scholar]
  • 25.Liu GM, Xie WX, Zhang CY, Xu JW. Identification of a four-gene metabolic signature predicting overall survival for hepatocellular carcinoma. J. Cell. Physiol. 2020;235(2):1624–1636. doi: 10.1002/jcp.29081. [DOI] [PubMed] [Google Scholar]
  • 26.Wang Y, Hu F, Li JY, et al. Systematic construction and validation of a metabolic risk model for prognostic prediction in acute myelogenous leukemia. Front. Oncol. 2020;10:540. doi: 10.3389/fonc.2020.00540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Zhu Z, Li L, Xu J, et al. Comprehensive analysis reveals a metabolic ten-gene signature in hepatocellular carcinoma. PeerJ. 2020;8:e9201. doi: 10.7717/peerj.9201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Xu J, Zang Y, Cao S, Lei D, Pan X. Aberrant expression of PAFAH1B3 associates with poor prognosis and affects proliferation and aggressiveness in hypopharyngeal squamous cell carcinoma. Oncotargets Ther. 2019;12:2799–2808. doi: 10.2147/OTT.S196324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kohnz RA, Mulvihill MM, Chang JW, et al. Activity-based protein profiling of oncogene-driven changes in metabolism reveals broad dysregulation of PAFAH1B2 and 1B3 in cancer. ACS Chem. Biol. 2015;10(7):1624–1630. doi: 10.1021/acschembio.5b00053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Du Y, Wang Q, Zhang X, et al. Lysophosphatidylcholine acyltransferase 1 upregulation and concomitant phospholipid alterations in clear cell renal cell carcinoma. J. Exp. Clin. Cancer Res. 2017;36(1):66. doi: 10.1186/s13046-017-0525-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Uehara T, Kikuchi H, Miyazaki S, et al. Overexpression of lysophosphatidylcholine acyltransferase 1 and concomitant lipid alterations in gastric cancer. Ann. Surg. Oncol. 2016;23(Suppl 2):S206–S213. doi: 10.1245/s10434-015-4459-6. [DOI] [PubMed] [Google Scholar]
  • 32.Abdelzaher E, Mostafa MF. Lysophosphatidylcholine acyltransferase 1 (LPCAT1) upregulation in breast carcinoma contributes to tumor progression and predicts early tumor recurrence. Tumour Biol. 2015;36(7):5473–5483. doi: 10.1007/s13277-015-3214-8. [DOI] [PubMed] [Google Scholar]
  • 33.Shida-Sakazume T, Endo-Sakamoto Y, Unozawa M, et al. Lysophosphatidylcholine acyltransferase1 overexpression promotes oral squamous cell carcinoma progression via enhanced biosynthesis of platelet-activating factor. PLoS ONE. 2015;10(3):e0120143. doi: 10.1371/journal.pone.0120143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Morita Y, Sakaguchi T, Ikegami K, et al. Lysophosphatidylcholine acyltransferase 1 altered phospholipid composition and regulated hepatoma progression. J. Hepatol. 2013;59(2):292–299. doi: 10.1016/j.jhep.2013.02.030. [DOI] [PubMed] [Google Scholar]
  • 35.Wang H, Pang H, Bartlam M, Rao Z. Crystal structure of human E1 enzyme and its complex with a substrate analog reveals the mechanism of its phosphatase/enolase activity. J. Mol. Biol. 2005;348(4):917–926. doi: 10.1016/j.jmb.2005.01.072. [DOI] [PubMed] [Google Scholar]
  • 36.Sauter M, Moffatt B, Saechao MC, Hell R, Wirtz M. Methionine salvage and S-adenosylmethionine: Essential links between sulfur, ethylene and polyamine biosynthesis. Biochem. J. 2013;451(2):145–154. doi: 10.1042/BJ20121744. [DOI] [PubMed] [Google Scholar]
  • 37.Zhuang H, Qiang Z, Shao X, et al. Integration of metabolomics and expression of enolase-phosphatase 1 links to hepatocellular carcinoma progression. Theranostics. 2019;9(12):3639–3652. doi: 10.7150/thno.31693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kuhajda FP. Fatty-acid synthase and human cancer: New perspectives on its role in tumor biology. Nutrition. 2000;16(3):202–208. doi: 10.1016/S0899-9007(99)00266-X. [DOI] [PubMed] [Google Scholar]
  • 39.Cao W, Liu N, Tang S, et al. Acetyl-Coenzyme A acyltransferase 2 attenuates the apoptotic effects of BNIP3 in two human cell lines. Biochim. Biophys. Acta. 2008;1780(6):873–880. doi: 10.1016/j.bbagen.2008.02.007. [DOI] [PubMed] [Google Scholar]
  • 40.Guo H, Zeng W, Feng L, et al. Integrated transcriptomic analysis of distance-related field cancerization in rectal cancer patients. Oncotarget. 2017;8(37):61107–61117. doi: 10.18632/oncotarget.17864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Liu X, Bing Z, Wu J, et al. Integrative gene expression profiling analysis to investigate potential prognostic biomarkers for colorectal cancer. Med. Sci. Monit. 2020;26:e918906. doi: 10.12659/MSM.918906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Huang J, Viswakarma N, Yu S, et al. Progressive endoplasmic reticulum stress contributes to hepatocarcinogenesis in fatty acyl-CoA oxidase 1-deficient mice. Am. J. Pathol. 2011;179(2):703–713. doi: 10.1016/j.ajpath.2011.04.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chen X, Tian M, Sun R, et al. SIRT5 inhibits peroxisomal ACOX1 to prevent oxidative damage and is downregulated in liver cancer. Embo Rep. 2018;19(5):e45124. doi: 10.15252/embr.201745124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Sun L, Zhi Z, Chen L, et al. SIRT1 suppresses colorectal cancer metastasis by transcriptional repression of miR-15b-5p. Cancer Lett. 2017;409:104–115. doi: 10.1016/j.canlet.2017.09.001. [DOI] [PubMed] [Google Scholar]
  • 45.Graziano F, Ruzzo A, Giacomini E. Glycolysis gene expression analysis and selective metabolic advantage in the clinical progression of colorectal cancer. Pharmacogenomics J. 2017;17(3):258–264. doi: 10.1038/tpj.2016.13. [DOI] [PubMed] [Google Scholar]
  • 46.Zahra K, Dey T, Ashish, Mishra SP, Pandey U. Pyruvate kinase M2 and cancer: The role of PKM2 in promoting tumorigenesis. Front. Oncol. 2020;10:159. doi: 10.3389/fonc.2020.00159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Olson GE, Whitin JC, Hill KE, et al. Extracellular glutathione peroxidase (Gpx3) binds specifically to basement membranes of mouse renal cortex tubule cells. Am. J. Physiol. Renal Physiol. 2010;298(5):F1244–F1253. doi: 10.1152/ajprenal.00662.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Barrett CW, Ning W, Chen X, et al. Tumor suppressor function of the plasma glutathione peroxidase gpx3 in colitis-associated carcinoma. Cancer Res. 2013;73(3):1245–1255. doi: 10.1158/0008-5472.CAN-12-3150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Zhang B, Bie Q, Wu P, et al. PGD2/PTGDR2 signaling restricts the self-renewal and tumorigenesis of gastric cancer. Stem Cells (Dayton, Ohio) 2018;36(7):990–1003. doi: 10.1002/stem.2821. [DOI] [PubMed] [Google Scholar]
  • 50.Heske CM. Beyond energy metabolism: Exploiting the additional roles of NAMPT for cancer therapy. Front. Oncol. 2019;9:1514. doi: 10.3389/fonc.2019.01514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Balch CM, Soong SJ, Gershenwald JE, et al. Prognostic factors analysis of 17,600 melanoma patients: Validation of the American Joint Committee on Cancer melanoma staging system. J. Clin. Oncol. 2001;19(16):3622–3634. doi: 10.1200/JCO.2001.19.16.3622. [DOI] [PubMed] [Google Scholar]
  • 52.Edge SB, Compton CC. The American Joint Committee on Cancer: The 7th edition of the AJCC cancer staging manual and the future of TNM. Ann. Surg. Oncol. 2010;17(6):1471–1474. doi: 10.1245/s10434-010-0985-4. [DOI] [PubMed] [Google Scholar]
  • 53.Parvy JP, Yu Y, Dostalova A, et al. The antimicrobial peptide defensin cooperates with tumour necrosis factor to drive tumour cell death in Drosophila. Elife. 2019;8:e45061. doi: 10.7554/eLife.45061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Angelova M, Charoentong P, Hackl H, et al. Characterization of the immunophenotypes and antigenomes of colorectal cancers reveals distinct tumor escape mechanisms and novel targets for immunotherapy. Genome Biol. 2015;16:64. doi: 10.1186/s13059-015-0620-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Lee N, Zakka LR, Mihm MJ, Schatton T. Tumour-infiltrating lymphocytes in melanoma prognosis and cancer immunotherapy. Pathology. 2016;48(2):177–187. doi: 10.1016/j.pathol.2015.12.006. [DOI] [PubMed] [Google Scholar]
  • 56.Zheng S, Zhao T, Yuan S, et al. Immunodeficiency promotes adaptive alterations of host gut microbiome: An observational metagenomic study in mice. Front. Microbiol. 2019;10:2415. doi: 10.3389/fmicb.2019.02415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Zhou R, Zhang J, Zeng D, et al. Immune cell infiltration as a biomarker for the diagnosis and prognosis of stage I–III colon cancer. Cancer Immunol. Immunother. 2019;68(3):433–442. doi: 10.1007/s00262-018-2289-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Tian X, Zhu X, Meng W, et al. A 12-immune cell signature to predict relapse and guide chemotherapy for stage II colorectal cancer. Aging. 2020;12(18):18363–18383. doi: 10.18632/aging.103707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Dai S, Xu S, Ye Y, Ding K. Identification of an immune-related gene signature to improve prognosis prediction in colorectal cancer patients. Front. Genet. 2020;11:607009. doi: 10.3389/fgene.2020.607009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Chen H, Luo J, Guo J. Development and validation of a five-immune gene prognostic risk model in colon cancer. BMC Cancer. 2020;20(1):395. doi: 10.1186/s12885-020-06799-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Lu Y, Zhou X, Liu Z, et al. Assessment for risk status of colorectal cancer patients: A novel prediction model based on immune-related genes. DNA Cell Biol. 2020;39(6):958–964. doi: 10.1089/dna.2019.5195. [DOI] [PubMed] [Google Scholar]
  • 62.Wen S, He L, Zhong Z, Mi H, Liu F. Prognostic model of colorectal cancer constructed by eight immune-related genes. Front. Mol. Biosci. 2020;7:604252. doi: 10.3389/fmolb.2020.604252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Huang C, Freter C. Lipid metabolism, apoptosis and cancer therapy. Int. J. Mol. Sci. 2015;16(1):924–949. doi: 10.3390/ijms16010924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Santos CR, Schulze A. Lipid metabolism in cancer. FEBS J. 2012;279(15):2610–2623. doi: 10.1111/j.1742-4658.2012.08644.x. [DOI] [PubMed] [Google Scholar]
  • 65.Ganapathy-Kanniappan S, Geschwind JH. Tumor glycolysis as a target for cancer therapy: Progress and prospects. Mol. Cancer. 2013;12:152. doi: 10.1186/1476-4598-12-152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Gill KS, Fernandes P, O'Donovan TR, et al. Glycolysis inhibition as a cancer treatment and its role in an anti-tumour immune response. Biochem. Biophys. Acta. 2016;1866(1):87–105. doi: 10.1016/j.bbcan.2016.06.005. [DOI] [PubMed] [Google Scholar]
  • 67.Abdel-Wahab AF, Mahmoud W, Al-Harizy RM. Targeting glucose metabolism to suppress cancer progression: Prospective of anti-glycolytic cancer therapy. Pharmacol. Res. 2019;150:104511. doi: 10.1016/j.phrs.2019.104511. [DOI] [PubMed] [Google Scholar]
  • 68.Liu M, Li F, Yan H, et al. A multi-model deep convolutional neural network for automatic hippocampus segmentation and classification in Alzheimer's disease. Neuroimage. 2020;208:116459. doi: 10.1016/j.neuroimage.2019.116459. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Legends. (12.3KB, docx)
Supplementary Tables. (34.2KB, docx)

Data Availability Statement

The datasets and analyses during the current study are available in TCGA and GEO database.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES