Skip to main content
Discover Oncology logoLink to Discover Oncology
. 2024 Mar 5;15:63. doi: 10.1007/s12672-024-00916-2

Identification and validation of a novel six-gene signature based on mucinous adenocarcinoma-related gene molecular typing in colorectal cancer

Yuxin Man 1,#, Dao Xin 2,#, Yang Ji 3,#, Yang Liu 1, Lingna Kou 1, Lingxi Jiang 3,4,
PMCID: PMC10914658  PMID: 38443703

Abstract

Background and objectives

Colorectal mucinous adenocarcinoma (MAC) is a particular pathological type that has yet to be thoroughly studied. This study aims to investigate the characteristics of colorectal MAC-related genes in colorectal cancer (CRC), explore the role of MAC-related genes in accurately classifying CRC, and further construct a prognostic signature.

Methods

CRC samples were collected from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). MAC-related differentially expressed genes (DEGs) were analyzed in TCGA samples. Based on colorectal MAC-related genes, TCGA CRC samples were molecularly typed by the non-negative matrix factorization (NMF). According to the molecular subtype characteristics, the RiskScore signature was constructed through univariate Cox, the least absolute shrinkage and selection operator (LASSO), and multivariate Cox regression analyses. Clinical significance in CRC of the RiskScore signature was analyzed. A nomogram was further built based on the RiskScore signature.

Results

From the colorectal MAC-related genes, three distinct molecular subtypes were identified. A RiskScore signature composed of six CRC subtype-related genes (CALB1, MMP1, HOXC6, ZIC2, SFTA2, and HYAL1) was constructed. Patients with high-RiskScores had the worse prognoses. RiskScores led to differences in gene mutation characteristics, antitumor drug sensitivity, and tumor microenvironment of CRC. A nomogram based on the signature was developed to predict the one-, three-, and five-year survival of CRC patients.

Conclusion

MAC-related genes were able to classify CRC. A RiskScore signature based on the colorectal MAC-related molecular subtype was constructed, which had important clinical significance for guiding the accurate stratification of CRC patients.

Keywords: Colorectal cancer, Mucinous adenocarcinoma, Gene, Signature, Survival

Introduction

The global incidence and mortality of colorectal cancer (CRC) are among the top three malignant tumors [1]. Approximately 20% of CRC patients have metastases when they are diagnosed, and 50% of patients with localized disease will ultimately develop metastases [2]. However, the five-year survival rate of metastatic CRC is just 14% [3]. It is one of the most severe gastrointestinal tumor diseases that seriously threaten the lives and health of people.

The occurrence and development of CRC is a complex and heterogeneous process involving a variety of molecules and signal pathways [2]. The classification and refinement of CRC according to histological and molecular biological characteristics has been the embodiment of precision treatment [4, 5]. The histological type of CRC profoundly affects its clinical characteristics and outcomes [68]. However, the clinical practice remains challenging, and many patients are insensitive to treatment and have relatively poor prognoses.

Notably, mucinous adenocarcinoma (MAC) is a specific histopathological subtype of CRC, and it is characterized by abundant extracellular mucin in more than 50% of the tumor tissue [9, 10], with an incidence of 10–15% [11, 12]. Cancer cells in colorectal MAC have goblet cell characteristics [13] and produce large amounts of mucin glycoproteins [14]. In colorectal MAC, MUC1 and MUC13 are transmembrane mucins, while MUC2 and MUC5AC are secreted gell-forming mucins resistant to tumor cell death and form an immune infiltrating barrier [12]. In comparison to non-mucinous adenocarcinoma (NMAC), MAC is more common in women, and it has more advanced T or N stage, proximal location, worse differentiation, and a higher ratio of peritoneal implant, microsatellite instability-H (MSI-H) or BRAF mutation [1518]. Due to the inherent characteristics of MAC, it often responds poorly to conventional therapy [14, 19, 20]. MAC often lacks the attention of clinical practice guidelines and faces a more complex treatment situation.

In comparison with NMAC, the unique clinical and pathological features of MAC indicate that there are some characteristic genes in MAC that can be stratified for evaluating patients and guiding clinical treatment. However, the significance of the typical genes and the prognostic factors of colorectal MAC yet remain to be determined, and relatively few studies have been conducted in this area. A comprehensive analysis is essential for elucidating the potential mechanism of MAC for proper treatment.

In the era of big data, valuable features in data mining and analysis can improve the accuracy of decisions made by clinicians. Sequencing data can be obtained from multiple databases, and various bioinformatics analyses can be performed, which helps gain a more comprehensive and accurate understanding of MAC. In this study, differentially expressed genes (DEGs) were initially analyzed in colorectal MAC. Based on colorectal MAC-related DEGs, the molecular subtypes of CRC were classified using the non-negative matrix factorization (NMF) method. The clinical and prognostic characteristics of different subtypes were then analyzed. The gene expression differences were analyzed between the subtypes with the most significant differences in clinical characteristics. In combination with the genes closely related to the occurrence and development of CRC, univariate Cox, the least absolute shrinkage and selection operator (LASSO), and multivariate Cox regression were used for constructing the CRC molecular typing-related prognosis signature in CRC. Multiple datasets were used for verifying the signature, and the clinical characteristics and molecular differences between the high- and low-RiskScore groups were identified. A nomogram was ultimately drawn for predicting the survival of CRC patients. CRC molecular typing-related risk signature can serve as a potential genetic marker and prognostic classification basis for CRC. The workflow of this study can be seen in Fig. 1.

Fig. 1.

Fig. 1

Study flow chart

Materials and methods

Data acquisition and DEG analysis

TCGA-COAD and TCGA-READ datasets were downloaded using the TCGAbiolinks [21] package. CRC-related datasets, including GSE39582 [22], GSE17538 [23], GSE38832 [24], and GSE32323 [25], were downloaded from the GEO database using the GEOquery [26] package. The DESeq2 package [27] was used for analyzing DEGs in the RNA-seq data. The Limma [28] package was used to analyze the DEGs in the microarray data. DEGs in the RNA-seq data were defined as the adjusted P value < 0.05 and | log2foldchange |> 1. DEGs in the microarray data were defined as the P value < 0.05 and | log2foldchange |> 1. Maftools package [29] was used to visualize TCGA-COAD and TCGA-READ gene mutation data.

Molecular typing based on MAC-related genes

MAC-related genes were included for molecular typing analysis of TCGA CRC samples by the NMF [30] method. The optimal number of clusters was determined by cophenetic correlation, dispersion, and silhouette.

Construction of a prognostic signature and survival analysis

The P < 0.05 was used as the threshold, and the univariate Cox regression analysis was used for detecting the critical genes related to molecular typing of CRC in the training set. In order to narrow the gene range and maximize the accuracy, the LASSO regression was used to identify prognostic genes using the glmnet package [31]. Multivariate Cox regression analysis was used to establish the prognostic RiskScore signature, and the RiskScore was calculated.

RiskScore=h0texpj=1nCoefj×Xj

The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was used for quantifying the prediction ability of the signature through the R package timeROC [32]. Kaplan–Meier survival curve was used to describe differences in survival time among patients with different RiskScores.

Functional enrichment analysis

The clusterProfiler [33] package was used for gene ontology (GO) analysis [34] and Kyoto encyclopedia of genes and genomes (KEGG) [35] enrichment analysis. The “h.all.v7.4.symbols.gmt” gene sets were downloaded from the MSigDB database [36] for gene set enrichment analysis (GSEA), and the gene set variation analysis (GSVA) package was used for identifying the hallmark gene set score.

Estimation of tumor-infiltrating immune cells and analysis of IC50 values of common antitumor drugs

ImmuneScore, StromaScore, and ESTIMATEScore were analyzed using the estimate [37] package based on gene expression data. CIBERSORTx (https://cibersortx.stanford.edu/) [38] and ImmuneCellAI online tools (http://bioinfo.life.hust.edu.cn/ImmuCellAI#! /) [39] were used for predicting the degree of tumor-infiltrating immune cells in the tumor microenvironment of samples. The pRRophetic package [40] was used to predict the half-maximal inhibitory concentration (IC50) values of antitumor drugs based on the sample gene expression data.

Construction of the nomogram

A nomogram was constructed to predict CRC patients' overall survival (OS) rates at one, three, and five years. We also built calibration curves to evaluate the difference between the OS rate predicted by the nomogram and the observed OS rate. We used decision curve analysis (DCA) to assess the clinical practicability of the nomogram.

Statistical analysis

R v. 4.1.2 was used for statistical analysis. Student’s t-test or Wilcoxon test was used to establish the difference between the two groups. Multiple groups were compared using one-way ANOVA or the Kruskal–Wallis test. The chi-square test was used for comparison of frequencies. P < 0.05 was considered to be statistically significant.

Results

Analysis of MAC-related genes in CRC

In comparison to the small number of normal samples, more MAC samples for subsequent analysis were found in the TCGA database. We analyzed DEGs between 58 MAC samples and 328 adenocarcinoma samples in the TCGA-COAD dataset. DEGs were analyzed between 14 MAC samples and 118 adenocarcinoma samples in the TCGA-READ dataset. DEGs were presented in volcano plots (Fig. 2A and D). There were 1418 differentially up-regulated genes and 736 differentially down-regulated genes in TCGA-COAD MAC. A total of 606 genes were up-regulated and 372 down-regulated in TCGA-READ MAC. We further performed GO and KEGG analyses on MAC-related DEGs to explore the potential functional mechanism (Fig. 2B, C, E, and F). In CRC, according to the results of GO analysis, MAC-related DEGs were involved in receptor-ligand activity, signaling receptor activator activity, and other functional mechanisms. According to the results of KEGG analysis, MAC-related DEGs were involved in neuroactive ligand-receptor interaction, cytokine-cytokine receptor interaction, and IL-17 signaling pathway. Our results suggested that DEGs were closely related to the malignant biological behavior of colorectal MAC. GSEA of the hallmark gene set was performed on the MAC-related genes, which indicated that the epithelial-mesenchymal transition (EMT) pathway-related gene set was activated (Fig. 2G and H).

Fig. 2.

Fig. 2

Analysis of the MAC-related genes. A Volcano plot of MAC-related DEGs in the TCGA-COAD dataset. B GO analysis of MAC-related DEGs in the TCGA-COAD dataset. C KEGG analysis of MAC-related DEGs in the TCGA-COAD dataset. D Volcano plot of MAC-related DEGs in the TCGA-READ dataset. E GO analysis of MAC-related DEGs in the TCGA-READ dataset. F KEGG analysis of MAC-related DEGs in the TCGA-READ dataset. G GSEA of MAC-related genes in the TCGA-COAD dataset. H GSEA of MAC-related genes in the TCGA-READ dataset

Identification of molecular subtypes in CRC

The intersection of genes related to TCGA-COAD MAC and TCGA-READ MAC showed 264 genes to be up-regulated and 141 genes to be down-regulated simultaneously. A total of 405 genes that may have biological importance for colorectal MAC were identified (Fig. 3A). Survival analysis was performed on the critical genes grouped based on the optimal truncation value of patient survival. Genes with P < 0.05 were selected, and 192 MAC-related key genes were found to have a correlation with the OS of CRC patients in the TCGA dataset. These 192 MAC-related key genes were then used for the molecular typing of CRC by the NMF method. According to phenotype, dispersion, and silhouette, the optimal number of clusters was three (Fig. 3B). Three molecular subtypes were obtained: Cluster 1 (n = 280), Cluster 2 (n = 127), and Cluster 3 (n = 130). The PCA diagram (Fig. 3C) showed an apparent distinction among CRC's molecular subtypes, indicating that the three molecular subtypes were well differentiated. The consistency clustering heatmap can be seen in Fig. 3D. Survival analysis was performed for samples with survival data, and the Kaplan–Meier curves showed significant differences in the survival of patients with different molecular subtypes (Fig. 3E).

Fig. 3.

Fig. 3

Molecular subtyping analysis of MAC-related genes in CRC. A Venn diagram of the intersection of TCGA colorectal MAC-related genes. B Line charts of phenotypic correlation, dispersion, and silhouette at rank = 2–10. C PCA graph of different subtypes of TCGA CRC dataset. D Consensus graph of NMF clustering. E Survival analysis of three molecular subtypes. F GSVA heatmap of three molecular subtypes. G ESTIMATE analysis of three molecular subtypes. H ImmuCellAI analysis of three molecular subtypes. I CIBERSORTx analysis of three molecular subtypes. ****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05; ns: no significance

GSVA were then performed on the tumor-related gene sets “H.all.v7.4.symbols.gmt” of three molecular subtypes, and a heatmap was drawn for the GSVA scores (Fig. 3F). From the heatmap, it can be found that “EPITHELIAL_MESENCHYMAL_TRANSITION”, “INFLAMMATORY_RESPONSE” or “ANGIOGENESIS” got relatively higher scores in Cluster 2. In addition, the differences in immune characteristics among the three subtypes were analyzed. The molecular subtypes' ImmuneScore, StromalScore, and ESTIMATEScore were all calculated (Fig. 3G). Scores in Cluster 2 were relatively high, which indicated more immune and stromal components in the tumor microenvironment. Therefore, the degree of cell invasion in the tumor microenvironment of the three molecular subtypes was further explored. Significant differences in infiltrating cells were observed in ImmuCellAI (Fig. 3H) and CIBERSORTx (Fig. 3I) analyses. The degree of macrophage invasion in the tumor microenvironment of Cluster 2 was higher, and NK-T cell infiltration was considerably lower. These results suggested that the three molecular subtypes of CRC differentiated by MAC-related genes have significant differences in clinical prognosis and potential mechanism pathways, with apparent differentiation.

Identification of CRC molecular typing-related genes

Significant biological differences were observed in the three molecular subtypes distinguished by the key genes of MAC, from clinical features to mechanism. Therefore, DEGs between Cluster 2 and Cluster 1 and DEGs between Cluster 2 and Cluster 3 were analyzed. 3,733 genes were up-regulated, and 1,009 genes were down-regulated between Cluster 2 and Cluster 1. 2,674 genes were up-regulated, and 833 genes were down-regulated between Cluster 2 and Cluster 3.

After merging the two groups of DEGs, 4,457 were up-regulated, and 1,677 were down-regulated. Due to the large number of genes obtained by MAC-related molecular typing, we analyzed the DEGs of 17 paired CRC samples in the GSE32323 dataset from the GEO database to narrow the gene range. Then, we screened out genes which were closely associated with the occurrence and development of CRC. The prognostic signature could be constructed in the whole range of CRC, and the applicability of the prognostic signature could be expanded. In the GSE32323 dataset, 718 genes were up-regulated, and 719 genes were down-regulated. There were 139 genes in the intersection of up- and down-regulated genes (Fig. 4A). This gene set may play an essential role in CRC.

Fig. 4.

Fig. 4

Construction and validation of the signature. A Venn diagram of the intersection of key genes in CRC samples. B Ten-fold cross-validation of lambda selection by LASSO. C The LASSO coefficient spectrum in the TCGA training set. DF Survival analysis showed the difference in the prognoses of patients in the TCGA CRC training set, internal testing set, and entire set. GI The one-, three-, and five-year ROC curves of the TCGA CRC training set, internal testing set, and entire set. JL Kaplan–Meier curves in GSE17538, GSE38832, and GSE39582. MO The one-, three-, and five-year ROC curves in GSE17538, GSE38832, and GSE39582

Development of the RiskScore signature

The gene set that contained 139 genes was obtained using the aforementioned steps. After removing samples with missing survival data or the survival time of 0, the entire TCGA CRC set (512 samples) was randomly divided into the training set and the internal testing set (7:3). Based on the expression levels of 139 CRC molecular typing-related genes and survival information in the training set, we constructed a CRC prognostic signature. Firstly, P < 0.05 was chosen as the screening threshold, and univariate Cox analysis was performed on the 139 genes in the training set to screen out the genes with prognostic significance. A total of 12 genes met the screening threshold by univariate Cox analysis. Since the number of 12 genes was still relatively large, we further compressed these 12 genes using the LASSO regression (Fig. 4B and C). Ten-fold cross-validation was performed in the LASSO regression. When lambda = 0.0116, 12 genes were compressed to nine genes. Therefore, these nine genes were included in the multivariate Cox analysis, and the stepwise method was used to construct the signature.

The six genes found to be related to CRC molecular typing were ultimately used for constructing a prognostic signature of CRC patients: CALB1, MMP1, HOXC6, ZIC2, SFTA2, and HYAL1. The following formula was used for calculating the RiskScore (the coefficients were rounded to seven decimal places).

RiskScore=h0texp(0.1187114×CALB1-0.1208561×MMP1+0.3187824×HOXC6+0.0974445×ZIC2+0.2239488×SFTA2-0.2384973×HYAL1)

The samples were divided into high- and low-RiskScore groups based on the median RiskScore in the TCGA training set, TCGA internal testing set, and TCGA entire set. Kaplan–Meier analysis suggested that low-RiskScore patients had a significantly better survival outcome. ROC curve results showed the signature constructed in this study had the good predictive ability (five-year AUC = 0.720; three-year AUC = 0.692; one-year AUC = 0.704). In particular, the AUC values of the TCGA internal testing set and the entire set at one, three, and five years were all greater than 0.7, and the AUC value of the internal testing set at three years was 0.8. Therefore, it could be seen that this prognostic signature showed good discriminative and predictive ability (Fig. 4D-I). To determine the independence of the six-gene signature in clinical application, univariate and multivariate Cox regression analyses were conducted on clinical variables in TCGA CRC samples with complete clinical information (shown in Table 1). Univariate Cox regression analysis showed that stage III/IV vs. stage I/II, age, and the RiskScore were significantly correlated with survival. However, gender was not a prognostic factor. In multivariate Cox analysis, stage III/IV vs. stage I/II, age, and the RiskScore had a significant correlation with survival. These results suggested the potential for the six-gene signature to be used as an independent risk factor for predicting the prognosis of CRC patients.

Table 1.

Univariate and multivariate Cox analyses

Variables Univariable Cox analysis Multivariable Cox analysis
HR 95% CI of HR P HR 95% CI of HR P
lower upper lower upper
Age 1.04 1.02 1.06  < 0.01 1.04 1.02 1.06  < 0.01
Gender 0.92 0.60 1.41 0.70
Stage 3.30 2.09 5.21  < 0.01 3.35 2.10 5.33  < 0.01
RiskScore 1.45 1.31 1.61  < 0.01 1.33 1.19 1.48  < 0.01

Validation of the six-gene signature in three independent GEO CRC datasets

Three independent GEO datasets (GSE17538, GSE38832, and GSE39582) were used as external testing sets for verifying the prognostic signature. The samples were divided into high- and low-RiskScore groups based on the median RiskScore of each dataset. In the external testing sets, Kaplan–Meier curves showed the median OS of patients in the high-RiskScore group to be significantly shorter than in the low-RiskScore group (Fig. 4J, K, and L). The one-, three-, and five-year AUC values also suggested that the prognostic signature had the good predictive ability in the three independent datasets (Fig. 4M, N, and O).

Correlation between RiskScores and clinicopathological features

Based on the best cut-off P values, we first analyzed the prognostic value of these six genes in the TCGA CRC dataset. The results indicated that patients with high expression of CALB1, HOXC6, ZIC2, or SFTA2 had poor prognoses, while those with high expression of MMP1 or HYAL1 had good prognoses (Fig. 5A-F). It could be inferred that the six genes had a close association with the CRC prognosis. The relationship between the RiskScores and clinical parameters of CRC was further explored. Regardless of gender, age, tumor locations, or T/N/M stages, patients with low-RiskScores were found to have relatively better prognoses (Fig. 6A-F).

Fig. 5.

Fig. 5

The effect of signature genes on the prognosis of patients in the TCGA CRC dataset. A Effect of CALB1 on patient survival. B Effect of HOXC6 on patient survival. C Effect of HYAL1 on patient survival. D Effect of MMP1 on patient survival. E Effect of SFTA2 on patient survival. F Effect of ZIC2 on patient survival

Fig. 6.

Fig. 6

Survival analysis between RiskScores and different clinical parameters in the TCGA CRC dataset. A Effect of RiskScores on the prognosis of different gender. B Effect of RiskScores on the prognosis of different ages. C Effect of RiskScores on the prognosis of different primary tumor locations. D Effect of RiskScores on the prognosis of different T stages. E Effect of RiskScores on the prognosis of different N stages. F Effect of RiskScores on the prognosis of different M stages

Subsequently, we further analyzed the relationship between RiskScores and clinical characteristics in the aforementioned TCGA CRC entire set. The IC50 values of Erlotinib, BMS.754807, OSI.906, and SB.216763 in the low-RiskScore group were found to be lower. The corresponding IC50 values of Camptothecin, CI.1040, ABT.263, and A.770041 were all higher. No significance was found in the IC50 values of MS.275 and Cytarabine between the two groups (Fig. 7A). These results suggested that patients in the high-RiskScore group could choose drugs with lower corresponding IC50 values, potentially improving efficacy. The differences in gene mutations between high- and low-RiskScore groups were explored (Fig. 7B). The mutation rates in the high-RiskScore group of the first five were APC (75%), TP53 (63%), TTN (52%), KRAS (38%), and SYNE1 (32%). Mutation rates in the low-RiskScore group of the first five were APC (79%), TP53 (58%), TTN (51%), KRAS (44%), and PIK3CA (30%). Interestingly, we found a significant difference in the distribution of BRAF mutation between the two groups (P < 0.001), with 18% in the high-RiskScore group and 6% in the low-RiskScore group. The relationship between RiskScores and three Clusters formed by MAC-related genes was further analyzed (Fig. 7C). We were also surprised that Cluster 3 with the low RiskScore had the best prognosis, while Cluster 2 with the high RiskScore had the worst prognosis.

Fig. 7.

Fig. 7

Analysis of clinical features of the signature. A IC50 values of antitumor drugs in patients with different RiskScores. B Gene mutation waterfall diagram of TCGA CRC samples with different RiskScores. C Relationship between different Clusters and RiskScores. ****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05; ns: no significance

Evaluation of the tumor microenvironment in groups stratified by the RiskScore

ImmuCellAI and CIBERSORTx online tools were used for analyzing the immune infiltrating cells of patients in the high- and low-RiskScore groups. A significant difference was observed in some immune infiltrating cells between the high- and low-RiskScore groups. ImmuCellAI analysis (Fig. 8A) showed that the infiltration levels of macrophages and DC were higher in the high-RiskScore group. CIBERSORTx analysis showed that the infiltration levels of plasma cells and activated mast cells were higher in the low-RiskScore group (Fig. 8B). Through the above analyses, it was found that the RiskScore had an impact on the tumor microenvironment of CRC samples.

Fig. 8.

Fig. 8

A ImmuCellAI analysis of tumor microenvironment immune infiltrating cells in different RiskScore groups. B CIBERSORTx analysis of tumor microenvironment immune infiltrating cells in different RiskScore groups. ****P < 0.0001; ***P < 0.001; **P < 0.01; *P < 0.05; ns: no significance

Analysis of the differences in molecular mechanism caused by the RiskScore

Due to the significant differences in clinical characteristics caused by the RiskScore, DEGs in the high- and low-RiskScore groups in the aforementioned TCGA CRC entire dataset were further explored (Fig. 9A). GO analysis was conducted on DEGs (Fig. 9B). It was found that DEGs were mainly involved in the receptor ligand activity, signaling receptor activator activity, and some metabolic processes. KEGG analysis of the potential mechanisms conducted on DEGs (Fig. 9C) showed that DEGs caused by the RiskScore were engaged in tumor occurrence and development-related pathways and metabolic-related pathways. GSEA was then further performed on the two groups with different RiskScores, and the primary mechanism caused by the RiskScore was analyzed using hallmark gene sets (Fig. 9D). It could be seen that HALLMARK_APICAL_JUNCTION, HALLMARK_MYOGENESIS, and HALLMARK_KRAS_SIGNALING_DN were activated, while HALLMARK_INFLAMMATORY_RESPONSE and HALLMARK_IL6_JAK_STAT3_SIGNALING were inhibited.

Fig. 9.

Fig. 9

Functional enrichment analyses of the signature. A Volcano plot of DEGs caused by different RiskScores. B GO analysis of DEGs caused by different RiskScores. C KEGG analysis of DEGs caused by different RiskScores. D GSEA of RiskScore-related genes

Construction of a nomogram combined with the RiskScore and clinical features

A nomogram was constructed based on the prognostic signature related to colorectal MAC-related molecular subtypes. Multivariate Cox regression analysis on potential variables, which included the T/N/M stage, age, gender, and the RiskScore, was used to predict one-, three-, and five-year OS rates in CRC patients (Fig. 10A). The C-index of the nomogram was 0.783. It was observed that a higher RiskScore led to a worse prognosis. For example, a clinician could calculate a 61-year-old female CRC patient with clinical characteristics of T2N0M0 and the RiskScore for a score of the corresponding variables, resulting in a total score of -0.35. The probabilities of survival time longer than one, three, and five years were found to be 0.971, 0.933, and 0.861, respectively.

Fig. 10.

Fig. 10

Construction and validation of the nomogram. A OS rates of patients for one, three, and five years were predicted by the nomogram constructed with important clinical parameters. B DCA evaluated the nomogram compared with the T/N/M stage (one-year DCA). C The calibration curves. D C-index histogram

Calibration curves were also constructed for evaluating the consistency of the OS rates and OS rates predicted by the nomogram. The results showed that this method fit well for one-, three-, and five-year OS prediction (Fig. 10C). In comparison to the T/N/M stage, the results of one-year DCA proved the nomogram had reasonable clinical practicability for prognosis prediction (Fig. 10B). The constructed nomogram was compared with the nomograms of CRC previously published [4145] and it was found that the nomogram used in this study had a higher C-index (Fig. 10D).

Discussion

CRC still represents a significant threat to the health of many people. MAC is a unique histopathological subtype of CRC, and its mechanism of occurrence and development, clinical features, and impact on the prognosis of patients still lack understanding. It is urgent to understand the characteristic genes of colorectal MAC in order to guide the clinical precision treatment of CRC.

Previous studies have identified genes with important biological roles for MAC in CRC, such as RPS18, RPL30, CXCL9, and IDO1 [46, 47]. Machine learning has been applied extensively in biomedicine. Based on the transcriptome data of CRC, this study found significant differences in gene expression between MAC and adenocarcinoma. Functional enrichment analyses showed that MAC-related DEGs were closely related to tumor growth, invasion, or metastasis, which suggested different biological backgrounds between the two histopathological subtypes. Subsequently, three distinct molecular subtypes were distinguished by these MAC-related genes. Further studies were conducted to explore the characteristics of each subtype. There were significant differences in tumor microenvironment, mechanism, and prognosis among different subtypes. These results demonstrated heterogeneity in MAC-related molecular subtypes, and it also provided a direction for our subsequent construction of the risk signature. Based on the MAC-related molecular subtypes of CRC, the signature was constructed in this study, which could perform precise risk stratification of CRC patients.

The signature genes are the highlights of this study. Previous studies have confirmed the important roles of some genes in cancer. In non-small cell lung cancer, the high expression of CALB1 has a significant correlation with the lymph node metastasis stage [48]. In CRC, MMP1 derived from tumor-associated macrophages promotes the cell cycle transition from G0/G1 to S and G2/M phases [49], and the overexpression of HOXC6 induces EMT in colon cancer cells [50]. ZIC2 and SFTA2 may be identified as critical genes in CRC carcinogenesis [51, 52]. Furthermore, HYAL1 inhibits CRC metastasis by modulating MMPs/TIMPs balance and rearranging F-Actin distribution [53]. However, these genes have not been studied in depth in the basic research of colorectal MAC and may be the focus of the basic experimental research of colorectal MAC.

This study also analyzed the predictive power of the signature in CRC patients. The C-index of the signature in the TCGA training set, internal testing set, and entire set were 0.683, 0.709, and 0.688, respectively. Based on the signature, a nomogram was constructed by combining the clinical parameters of CRC. The C-index of the nomogram was 0.783. Previous studies established signatures from different perspectives for the prognosis of CRC patients, such as the anoikis-related predictive model [41], the pyroptosis-related predictive model [42], and the RNA-binding protein-related prognostic model [43]. In comparison to the currently published signatures related to CRC, this study built a RiskScore signature based on the MAC-related molecular subtypes. Multiple validations of external datasets were included to improve the reliability. These results suggest that the prognostic signature based on the MAC-related molecular subtypes may enhance the predictive ability of CRC.

The clinical significance and molecular mechanism of the signature were then comprehensively evaluated. It was found that the signature could predict the prognosis accurately and provide a reference for clinical medication. Furthermore, immunotherapy has been studied in full swing in various solid tumors, which brings new hope for CRC. Currently, only CRC patients with MSI-H can benefit from immunotherapy, so a better understanding of the immune microenvironment of CRC wound help to guide immunotherapy strategies. The extracellular mucus protein of MAC can be used as a physical immune infiltration barrier [12]. The complex tumor microenvironment has become a challenge that hinders the treatment of colorectal MAC and leads to the immune escape of CRC. Various tumor microenvironment components are closely related to the occurrence and development of CRC [54]. We were also surprised to find significant differences in infiltrating cells in the tumor microenvironment between different RiskScore groups. Following the further analysis of the molecular mechanism caused by RiskScores, it can be inferred that RiskScores may be engaged in a variety of biological processes leading to different clinical features.

This research still has some limitations. All our data was obtained from the TCGA and GEO databases, and the sample size was not large enough, which may result in a deviation in the results. The exploration of key genes needs to be more profound. Future research should analyze the molecular biological functions of six key genes.

Overall, we performed molecular typing of CRC by analyzing colorectal MAC-related genes. A prognostic signature and the nomogram were then constructed, which was significant for accurate stratification and prognosis evaluation of CRC.

Acknowledgements

None.

Abbreviations

AUC

Area under the curve

CI

Confidence interval

CRC

Colorectal cancer

DCA

Decision curve analysis

DEG

Differentially expressed gene

EMT

Epithelial-mesenchymal transition

GEO

Gene Expression Omnibus

GO

Gene ontology

GSEA

Gene set enrichment analysis

GSVA

Gene set variation analysis

HR

Hazard ratio

IC50

Half maximal inhibitory concentration

KEGG

Kyoto encyclopedia of genes and genomes

LASSO

Least absolute shrinkage and selection operator

MAC

Mucinous adenocarcinoma

MSI

Microsatellite instability

NMAC

Non-mucinous adenocarcinoma

NMF

Negative matrix factorization

OS

Overall survival

ROC

Receiver operating characteristic

TCGA

The Cancer Genome Atlas

Author contributions

YM, DX, and LJ conceived and designed the project. YM and DX collected data and performed bioinformatics analysis. YJ, LJ, YL, and LK reviewed the literature. YM, YJ, and LJ contributed to the figures and the table. LJ and DX drafted the manuscript. YM, YJ, YL, LK, and LJ modified the original manuscript. All authors reviewed and approved the submitted version.

Funding

Sichuan Science and Technology Program (No. 2022YFS0606); 2021 Open Project of Sichuan Provincial Key Laboratory of Human Disease Gene Research (2021kflx001); National funded postdoctoral researcher program of China (GZB20230670); Excellent Youth Fund of Sichuan Cancer Hospital (YB2024019).

Data availability

The datasets provided in this study can be downloaded from online repositories. If you have further inquiries, please contact the authors.

Declarations

Ethics approval and consent to participate

The datasets provided in this study can be downloaded from online repositories. This research did not require ethics approval and consent to participate.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Yuxin Man, Dao Xin and Yang Ji have contributed equally to this work and share the first authorship.

References

  • 1.Siegel RL, Miller KD, Wagle NS, Jemal A. Cancer statistics 2023. CA Cancer J Clin. 2023;73(1):17–48. doi: 10.3322/caac.21763. [DOI] [PubMed] [Google Scholar]
  • 2.Ciardiello F, Ciardiello D, Martini G, Napolitano S, Tabernero J, et al. Clinical management of metastatic colorectal cancer in the era of precision medicine. CA Cancer J Clin. 2022;72(4):372–401. doi: 10.3322/caac.21728. [DOI] [PubMed] [Google Scholar]
  • 3.Shin AE, Giancotti FG, Rustgi AK. Metastatic colorectal cancer: mechanisms and emerging therapeutics. Trends Pharmacol Sci. 2023;44(4):222–236. doi: 10.1016/j.tips.2023.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.de Assis JV, Coutinho LA, Oyeyemi IT, Oyeyemi OT, Grenfell R. Diagnostic and therapeutic biomarkers in colorectal cancer: a review. Am J Cancer Res. 2022;12(2):661–680. [PMC free article] [PubMed] [Google Scholar]
  • 5.Andrei P, Battuello P, Grasso G, Rovera E, Tesio N, et al. Integrated approaches for precision oncology in colorectal cancer: the more you know, the better. Semin Cancer Biol. 2022;84:199–213. doi: 10.1016/j.semcancer.2021.04.007. [DOI] [PubMed] [Google Scholar]
  • 6.Li C, Zheng H, Jia H, Huang D, Gu W, et al. Prognosis of three histological subtypes of colorectal adenocarcinoma: a retrospective analysis of 8005 Chinese patients. Cancer Med. 2019;8(7):3411–3419. doi: 10.1002/cam4.2234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Han L, Dai W, Mo S, Xiang W, Li Q, et al. Nomogram of conditional survival probability of long-term survival for metastatic colorectal cancer: a real-world data retrospective cohort study from seer database. Int J Surg. 2021;92:106013. doi: 10.1016/j.ijsu.2021.106013. [DOI] [PubMed] [Google Scholar]
  • 8.Reynolds IS, O'Connell E, Fichtner M, McNamara DA, Kay EW, et al. Mucinous adenocarcinoma is a pharmacogenomically distinct subtype of colorectal cancer. Pharmacogenomics J. 2020;20(3):524–532. doi: 10.1038/s41397-019-0137-6. [DOI] [PubMed] [Google Scholar]
  • 9.Huang A, Yang Y, Shi JY, Li YK, Xu JX, et al. Mucinous adenocarcinoma: a unique clinicopathological subtype in colorectal cancer. World J Gastrointest Surg. 2021;13(12):1567–1583. doi: 10.4240/wjgs.v13.i12.1567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Xu Y, Chen X, Chen Y, Wu X, Fang Q, et al. Colorectal mucinous adenocarcinoma indicates a meaningful subtype: a whole genome sequencing study. Clin Transl Med. 2023;13(4):e1246. doi: 10.1002/ctm2.1246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kwon M, Rubio G, Nolan N, Auteri P, Volmar JA, et al. Filip1l Loss is a driver of aggressive mucinous colorectal adenocarcinoma and mediates cytokinesis defects through Pfdn1. Can Res. 2021;81(21):5523–5539. doi: 10.1158/0008-5472.Can-21-0897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.O'Connell E, Reynolds IS, McNamara DA, Burke JP, Prehn JHM. Resistance to cell death in mucinous colorectal cancer-a review. Cancers. 2021 doi: 10.3390/cancers13061389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hu FJ, Li YJ, Zhang L, Ji DB, Liu XZ, et al. Single-Cell profiling reveals differences between human classical adenocarcinoma and mucinous adenocarcinoma. Commun Biol. 2023;6(1):85. doi: 10.1038/s42003-023-04441-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Gundamaraju R, Chong WC. Consequence of distinctive expression of Muc2 in colorectal cancers: how much is actually bad? Biochim Biophys Acta. 2021;1876(1):188579. doi: 10.1016/j.bbcan.2021.188579. [DOI] [PubMed] [Google Scholar]
  • 15.Yan C, Yang H, Chen L, Liu R, Shang W, et al. Clinical significance of mucinous component in colorectal adenocarcinoma: a propensity score-matched study. BMC Cancer. 2021;21(1):1286. doi: 10.1186/s12885-021-09031-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Luo C, Cen S, Ding G, Wu W. Mucinous colorectal adenocarcinoma: clinical pathology and treatment options. Cancer Commun. 2019;39(1):13. doi: 10.1186/s40880-019-0361-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Williams DS, Mouradov D, Newman MR, Amini E, Nickless DK, et al. Tumour infiltrating lymphocyte status is superior to histological grade, DNA mismatch repair and braf mutation for prognosis of colorectal adenocarcinomas with mucinous differentiation. Modern Pathol. 2020;33(7):1420–1432. doi: 10.1038/s41379-020-0496-1. [DOI] [PubMed] [Google Scholar]
  • 18.Reynolds IS, Furney SJ, Kay EW, McNamara DA, Prehn JHM, et al. Meta-analysis of the molecular associations of mucinous colorectal cancer. Br J Surg. 2019;106(6):682–691. doi: 10.1002/bjs.11142. [DOI] [PubMed] [Google Scholar]
  • 19.Zhang Y, Chen Y, Huang J, Wu X, Tang R, et al. Mucinous histology is associated with poor prognosis in locally advanced colorectal adenocarcinoma treated with postoperative first-line adjuvant chemotherapy: a systematic review and meta-analysis. Eur J Surg Oncol. 2022;48(10):2075–2081. doi: 10.1016/j.ejso.2022.06.024. [DOI] [PubMed] [Google Scholar]
  • 20.Bong JW, Gim JA, Ju Y, Cheong C, Lee SI, et al. Prognosis and sensitivity of adjuvant chemotherapy in mucinous colorectal adenocarcinoma without distant metastasis. Cancers. 2022 doi: 10.3390/cancers14051297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Mounir M, Lucchetta M, Silva TC, Olsen C, Bontempi G, et al. New functionalities in the Tcgabiolinks package for the study and integration of cancer data from Gdc and Gtex. PLoS Comput Biol. 2019;15(3):e1006701. doi: 10.1371/journal.pcbi.1006701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Marisa L, de Reyniès A, Duval A, Selves J, Gaub MP, et al. Gene expression classification of colon cancer into molecular subtypes: characterization, validation, and prognostic value. PLoS Med. 2013;10(5):e1001453. doi: 10.1371/journal.pmed.1001453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Smith JJ, Deane NG, Wu F, Merchant NB, Zhang B, et al. Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer. Gastroenterology. 2010;138(3):958–968. doi: 10.1053/j.gastro.2009.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Tripathi MK, Deane NG, Zhu J, An H, Mima S, et al. Nuclear factor of activated t-cell activity is associated with metastatic capacity in colon cancer. Can Res. 2014;74(23):6947–6957. doi: 10.1158/0008-5472.Can-14-1592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Khamas A, Ishikawa T, Shimokawa K, Mogushi K, Iida S, et al. Screening for epigenetically masked genes in colorectal cancer using 5-Aza-2'-deoxycytidine, microarray and gene expression profile. Cancer Genomics Proteomics. 2012;9(2):67–75. [PubMed] [Google Scholar]
  • 26.Davis S, Meltzer PS. Geoquery: a bridge between the gene expression omnibus (Geo) and bioconductor. Bioinformatics. 2007;23(14):1846–1847. doi: 10.1093/bioinformatics/btm254. [DOI] [PubMed] [Google Scholar]
  • 27.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for Rna-Seq data with Deseq2. Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, et al. Limma powers differential expression analyses for rna-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Mayakonda A, Lin DC, Assenov Y, Plass C, Koeffler HP. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 2018;28(11):1747–1756. doi: 10.1101/gr.239244.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gaujoux R, Seoighe C. A flexible R package for nonnegative matrix factorization. BMC Bioinformatics. 2010;11:367. doi: 10.1186/1471-2105-11-367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1–22. doi: 10.18637/jss.v033.i01. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Blanche P, Dartigues JF, Jacqmin-Gadda H. Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Statistics Med. 2013;32(30):5381–5397. doi: 10.1002/sim.5958. [DOI] [PubMed] [Google Scholar]
  • 33.Wu T, Hu E, Xu S, Chen M, Guo P, et al. Clusterprofiler 4.0: a universal enrichment tool for interpreting omics data. Innovation. 2021;2(3):100141. doi: 10.1016/j.xinn.2021.100141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Aleksander SA, Balhoff J, Carbon S, Cherry JM, Drabkin HJ, et al. The gene ontology knowledgebase in 2023. Genetics. 2023 doi: 10.1093/genetics/iyad031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. Kegg as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44(D1):D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. 2013;4:2612. doi: 10.1038/ncomms3612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat Biotechnol. 2019;37(7):773–782. doi: 10.1038/s41587-019-0114-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Miao YR, Zhang Q, Lei Q, Luo M, Xie GY, et al. Immucellai: a unique method for comprehensive T-cell subsets abundance prediction and its application in cancer immunotherapy. Adv Sci. 2020;7(7):1902880. doi: 10.1002/advs.201902880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Geeleher P, Cox NJ, Huang RS. Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines. Genome Biol. 2014;15(3):R47. doi: 10.1186/gb-2014-15-3-r47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Hu G, Li J, Zeng Y, Liu L, Yu Z, et al. The anoikis-related gene signature predicts survival accurately in colon adenocarcinoma. Sci Rep. 2023;13(1):13919. doi: 10.1038/s41598-023-40907-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Luo B, Lin J, Cai W, Wang M. Identification of the pyroptosis-related gene signature and risk score model for colon adenocarcinoma. Front Genet. 2021;12:771847. doi: 10.3389/fgene.2021.771847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Miao Y, Zhang H, Su B, Wang J, Quan W, et al. Construction and validation of an Rna-binding protein-associated prognostic model for colorectal cancer. PeerJ. 2021;9:e11219. doi: 10.7717/peerj.11219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chang K, Yuan C, Liu X. A new Rbps-related signature predicts the prognosis of colon adenocarcinoma patients. Front Oncol. 2021;11:627504. doi: 10.3389/fonc.2021.627504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zhou Z, Xie X, Wang X, Zhang X, Li W, et al. Correlations between tumor mutation burden and immunocyte infiltration and their prognostic value in colon cancer. Front Genet. 2021;12:623424. doi: 10.3389/fgene.2021.623424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Kim CW, Cha JM, Kwak MS. Identification of potential biomarkers and biological pathways for poor clinical outcome in mucinous colorectal adenocarcinoma. Cancers. 2021 doi: 10.3390/cancers13133280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Zhang X, Zuo J, Wang L, Han J, Feng L, et al. Identification of differentially expressed genes between mucinous adenocarcinoma and other adenocarcinoma of colorectal cancer using bioinformatics analysis. J Int Med Res. 2020;48(8):300060520949036. doi: 10.1177/0300060520949036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Jin C, Lin T, Shan L. Downregulation of calbindin 1 by Mir-454-3p suppresses cell proliferation in nonsmall cell lung cancer in vitro. Cancer Biother Radiopharm. 2019;34(2):119–127. doi: 10.1089/cbr.2018.2598. [DOI] [PubMed] [Google Scholar]
  • 49.Yu J, Xu Z, Guo J, Yang K, Zheng J, et al. Tumor-associated macrophages (Tams) depend on Mmp1 for their cancer-promoting role. Cell death discovery. 2021;7(1):343. doi: 10.1038/s41420-021-00730-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Qi L, Chen J, Zhou B, Xu K, Wang K, et al. Homeoboxc6 promotes metastasis by orchestrating the Dkk1/Wnt/Β-catenin axis in right-sided colon cancer. Cell Death Dis. 2021;12(4):337. doi: 10.1038/s41419-021-03630-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Liu F, Shi Z, Bao W, Zheng J, Chen K, et al. Zic2 Promotes colorectal cancer growth and metastasis through the Tgf-Β signaling pathway. Exp Cell Res. 2022;415(2):113118. doi: 10.1016/j.yexcr.2022.113118. [DOI] [PubMed] [Google Scholar]
  • 52.Gong B, Kao Y, Zhang C, Sun F, Gong Z, et al. Identification of hub genes related to carcinogenesis and prognosis in colorectal cancer based on integrated bioinformatics. Mediators Inflamm. 2020;2020:5934821. doi: 10.1155/2020/5934821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Jin Z, Zhang G, Liu Y, He Y, Yang C, et al. The suppressive role of Hyal1 and Hyal2 in the metastasis of colorectal cancer. J Gastroenterol Hepatol. 2019;34(10):1766–1776. doi: 10.1111/jgh.14660. [DOI] [PubMed] [Google Scholar]
  • 54.Plundrich D, Chikhladze S, Fichtner-Feigl S, Feuerstein R, Briquez PS. Molecular mechanisms of tumor immunomodulation in the microenvironment of colorectal cancer. Int J Mol Sci. 2022 doi: 10.3390/ijms23052782. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets provided in this study can be downloaded from online repositories. If you have further inquiries, please contact the authors.


Articles from Discover. Oncology are provided here courtesy of Springer

RESOURCES