Skip to main content
Heliyon logoLink to Heliyon
. 2024 Mar 7;10(6):e27587. doi: 10.1016/j.heliyon.2024.e27587

Metabolism-associated molecular classification and prognosis signature of head and neck squamous cell carcinoma

Mengxian Jiang a,1, Xiang Gu a,1, Yexing Xu b,, Jing Wang c,⁎⁎
PMCID: PMC10945276  PMID: 38501009

Abstract

Although the fundamental processes and chemical changes in metabolic programs have been elucidated in many cancers, the expression patterns of metabolism-related genes in head and neck squamous cell carcinoma (HNSCC) remain unclear. The mRNA expression profiles from the Cancer Genome Atlas included 502 tumour and 44 normal samples were extracted. We explored the biological functions and prognosis roles of metabolism-associated genes in patients with HNSCC. The results indicated that patients with HNSCC could be divided into three molecular subtypes (C1, C2 and C3) based on 249 metabolism-related genes. There were markedly different clinical characteristics, prognosis outcomes, and biological functions among the three subtypes. Different molecular subtypes also have different tumour microenvironments and immune infiltration levels. The established prognosis model with 17 signature genes could predict the prognosis of patients with HNSCC and was validated using an independent cohort dataset. An individual risk scoring tool was developed using the risk score and clinical parameters; the risk score was an independent prognostic factor for patients with HNSCC. Different risk stratifications have different clinical characteristics, biological features, tumour microenvironments and immune infiltration levels. Our study could be used for clinical risk management and to help conduct precision medicine for patients with HNSCC.

Keywords: Head and neck squamous cell carcinoma, Metabolism, Immune infiltration, Prognosis model

1. Introduction

According to the latest statistics, head and neck cancer ranks seventh among the cancer-related causes of death worldwide [1]. In 2018, approximately 700,000 new cases of head and neck cancer were reported globally, with approximately 350,000 deaths [2]. Head and neck cancers primarily refer to cancers that occur in the nasal cavity, sinuses, oral cavity, pharynx, and larynx, with more than 90% of cases being squamous cell carcinomas [3]. Despite advancements in comprehensive treatment techniques such as surgery, radiation, and chemotherapy in recent years, 30–40% of patients still experience distant metastasis within 5 years [4]. Research has found that approximately 58% of patients with head and neck squamous cell carcinoma (HNSCC) are diagnosed at advanced stages (stages III-IV) [5]. Therefore, conducting in-depth studies on the factors and mechanisms that affect the prognosis of HNSCC and identifying tumour markers that can be used for molecular diagnosis, prognosis prediction, and targeted therapy are highly significant.

Decomposition and synthesis are two crucial biological processes for tumour cells to maintain their energy supply and product synthesis. An important biological characteristic of tumour cells is the alteration or rewiring of genetically-regulated metabolic processes [6]. Research has shown that tumour cells can increase the synthesis of macromolecules and intermediate products to sustain abnormal cell proliferation, making them more prone than normal tissues to glucose uptake through the glycolysis pathway [7]. Moreover, abnormal metabolism and tumour development have a reciprocal cause-and-effect relationship because changes in the expression levels of certain proteases regulate alterations in metabolic products [8]. For example, some malignant tumours rely primarily on glycolysis for energy production, whereas others exhibit a metabolic phenotype that depends on oxidative phosphorylation [9]. Although the fundamental processes and chemical changes of metabolic programs have been elucidated in many cancers, the expression patterns of metabolism-related genes in HNSCC remain unclear. In this study, we explored the biological functions and prognostic roles of metabolism-associated genes in HNSCC. We explored the molecular subtypes of HNSCC based on metabolism-associated genes, and described their clinical and molecular features. We then developed a prognostic model based on metabolism-associated genes, built a risk evaluation tool, and compared clinical and molecular characteristics between different risk stratifications. This study provides novel insights into the treatment and management of HNSCC.

2. Materials and methods

2.1. Patients and samples

We obtained mRNA expression profiles from the Cancer Genome Atlas, which included 502 tumour and 44 normal samples. Clinical characteristics and follow-up data were also obtained. For tumour, samples without clinical or follow-up data were excluded. Copy number variation data were also downloaded. Raw counts were transformed into transcripts per kilobase million for further analysis. Additional processed microarray data from 270 patients with HNSCC from GSE65858 were used as the validation dataset. We identified the metabolism-related genes from a previously published dataset [10].

2.2. Identification of HNSCC subtypes

We first screened metabolism-related genes using clinical data and median absolute deviation (MAD), followed by univariate Cox regression. We overlapped the data, and the genes with MAD >0.5 and significant associations with prognosis were used for subtype identification. We identified the HNSCC subtypes using the non-negative matrix factorisation clustering [[11], [12]]. A class map and correlation rank helped us select the optimal values of K. Principal component analysis was used to validate the subtypes.

2.3. Gene set variation and enrichment analysis

Gene set variation analysis (GSVA) was performed using the GSVA R package. GSVA can be used to calculate the scores of metabolisms-related genes using nonparametric and unsupervised methods [13]. These pathways were divided into four types: amino acid, carbohydrate, lipid as well as other pathways. We explored the pathway differences among the subtypes. Gene Set enrichment analysis was performed using the GSEABase R package, which is a knowledge-based approach for interpreting genome-wide expression profiles [14]. We used GSEA to explore the pathway enrichment of certain gene sets.

2.4. GO and KEGG analysis

We first performed differential gene expression analysis and identified the differentially expressed genes (DEGs). The DEGs was then used for genes ontology (GO) and Kyoto Encyclopaedia of Genes and Genomes (KEGG) enrichment analyses, which were performed using the R “clusterProfiler” packages and org.Hs.eg.db dataset [[15], [16]].

2.5. Immune infiltration estimation

We used the CIBERSORT method to evaluate immune infiltration levels. CIBERSORT deconvolutes the expression matrix of human immune cell subtypes based on the linear support vector regression [17]. This algorithm uses microarray data to construct a feature matrix that describes the expression characteristics of 22 immune cell phenotypes, including various cell types and functional states of immune cells. In the CIBERSORT parameters setting, the relative modes were selected, quantile normalization was disabled, 22 immune cell phenotypes were used, and the permutations were 100.

We also used the Estimation of STromal and Immune cells in MAlignant Tumor tissues using Expression data (ESTIMATE; https://bioinformatics.mdanderson.org/estimate/index.html) tool to evaluate the tumour purity, levels of stromal cells, and infiltration levels of immune cells in tumour tissues based on expression data.

2.6. Establishment and validation of risk prediction model

We adopted the least absolute shrinkage and selection operator (LASSO) penalized Cox regression method to build a prognosis model. A penalty parameter (lambada) was used to construct an optimal prognostic gene set to prevent overfitting. When multiple lambadas appear, those with fewer variables are selected based on partial likelihood deviance. Using cox regression coefficient and gene expression levels, we can calculate the risk score using the following formula: risk score = β(1)*Gene(1)_expression+ … +β(n)*Gene(n)_expression, β is cox regression coefficient [18].Seventeen genes were included the final model, and risk score was calc −0.977*CACNA1A + 0.442*PIGP - 0.422*GLYCTK - 0.357*ABCB8 - 0.239*STAR+0.329*ELOVL6+0.421*HS3ST1+0.262*PNPLA4+0.235*PFKM+0.371*NAT10 + 0.501*ATP6V0E1-0.645*SLC25A45–0.275*CSGALNACT1+0.262*PNPLA4+0.309*HK1+0.297*DAD1. Patients were also divided into two groups based on the median risk score: high- and low-risk. The TCGA dataset was used as the training group, and the GEO dataset was used as the validation dataset (GSE65858; n = 270).

2.7. Clinical relevance and risk evaluation system

We further explored the clinical relevance of the risk score to the clinical characteristics. Univariate and multivariate cox regression were performed to investigate the independence of risk scores. Time-independent receiver operating characteristic curves were obtained to evaluate the predictive ability of the risk model. We also built a nomogram scoring system to evaluate the individual prognostic risk [19]. The nomogram included the risk score and clinical parameters (age, gender, grade, T: tumour N: node, M: metastasis). Calibration plots of the nomogram were used to depict the predictive values between the predicted and observed values, which can assess the stability of the scoring system.

2.8. Statistical analysis

Continuous variables with normal distributions were represented using the mean and standard deviation. Student's t-test was used for stromal, immune, ESTIMATE, between high- and low-risk groups, one-way analysis of variance (ANOVA) was used among multiple groups, and multiple testing was performed using the LSD method including comparisons of stromal, immune and ESTAMATE, risk score among C1, C2 and C3. The Mann-Whitney or Kruskal-Wallis H test was used for immune checkpoint genes expression among three components. Categorical variables were represented as counts and percentages, and chi-square tests were used to compare the two groups (gender, grade, stage, TNM). Kaplan-Meier survival curves were compared using log-rank tests. All analyses were performed using the R software version 4.0 [20]. Differences were considered statistically significant when P values were less than 0.05.

3. Results

3.1. Identification of metabolism-related subtypes in HNSCC

The flowchart of the study is presented in Fig. 1A. We obtained the expression profile data of 502 HNSCC and 44 normal control samples from the dataset, and 499 tumour samples were included in the analyses. We first performed DEGs analysis and identified 318 DEGs between HNSCC and normal samples (|log2 Fold change| (|Log2 FC|)>1, P < 0.05; Table S1 and Fig. S1A). The top 20 genes are presented in Fig. S1B. GO enrichment analysis indicated that these genes were enriched for amino acids, carbohydrate, lipid transport, biosynthetic processes, transporter and channel complex, and transmembrane transporter activities (Fig. S1C). KEGG pathway analyses indicated that DEGs were mainly enriched in glycerophospholipid, purine, carbon, amino, and o-glycan metabolism, and biosynthesis (Fig. S1D). Further details of the GO and KEGG analyses are presented in Tables S2 and S3.

Fig. 1.

Fig. 1

Flow of study analyses and identification of HNSCC molecular subtypes using NMF consensus clustering. A: The flow chart of data analysis. B: NMF clustering using 249 metabolism-associated genes. Cophenetic correlation coefficient for K = 2–5 is shown. C: Consensus matrix of three HNSCC subclasses. D: PCA showed the distribution of three HNSCC subclasses. E: Individuals CA of three HNSCC subclasses. F: Kaplan-Meier curves of overall survival among three subclasses (C1, C2 and C3).

Finally, we identified 249 metabolism-related genes from 318 DEGs Using MAD>0.5and P < 0.05 in the univariate Cox regression (Table S4). Non-negative matrix factorisation clustering was performed, and the cophenetic correlation coefficients were calculated to determine the optimal k value. The cophenetic correlation coefficient is the maximum when k = 2. However, we want to explore potential subtypes based on current sample size as many as we can, but also keep subtypes some differences. In our study, k = 3 was eventually chosen as optimal number of clusters based on current sample size instead of k = 2, includingC1, C2, C3 (Fig. 1B and C). PCA and individual PCA confirmed the presence of these three components (Fig. 1D and E). Furthermore, we performed K-M survival analyses and found significant differences among C1, C2, and C3. Patients from C3 had favorable prognosis (Fig. 1F).

3.2. Correlation of the HNSCC subtypes with metabolism-associated signatures

First, we performed DEGs analysis of the three subtypes. Using P < 0.05 and |log2 FC|>2, we identified 4744 DEGs in C1, 6613 DEGs in C2, and 7165 DEGs in C3 (Table S5). There were 1463 overlaps among the three subtypes (Fig. 2A and B). We detected the expression of several key metabolism-related genes among the three subtypes, and found that PCK2, ACLY, HK2, CPT1A, ACSS2, and IDH1 were highest expressed in C1. LDHA and GLS were the highest in C2, whereas IDH2 was the highest in C3 (Fig. 2C). GO functional enrichment and KEGG pathway analyses were performed. Cell -cell adhesion, junction, and binding were mainly enriched in C1. Signal transduction, immune response, ficolin-1-rich granule, and protease and phosphatase binding were enriched in C2. Immune-related signaling and function were mainly enriched in C3 (Fig. S2). KEGG pathway analyses indicated that focal adhesion, cell adhesion, and cell differentiation were enriched in C1, MAPK signaling pathway, cell cycle, apoptosis, p53 signaling pathways, and AGE-RAGE signaling pathways were enriched in C2, while the NF-kappa B, T cell receptor, and some immune-related signaling pathways were mainly enriched in C3 (Fig. S3).

Fig. 2.

Fig. 2

Distributions of 249 metabolism-associated genes and key metabolic genes expressions among three HNSCC subclasses. A: Heatmap of correlation of metabolism-associated genes expression among three subclasses and different clinical features. B: Venn diagram showed the overlapped number of DEGs among three subtypes. C: Expression differences of several key metabolic genes among three subtypes (***: adjusted P values).

We performed GSVA for the three subtypes by dividing these pathways into four categories: amino acid, carbohydrate, lipid, as well as other metabolic pathways. The top five differential metabolism pathways in C1 were cholesterol biosynthesis, glyoxylate and dicarboxylate metabolism, ubiquinone and other terpenoid-quinone biosynthesis, propanoate metabolism, and ketone biosynthesis and metabolism. The top five differentially metabolism pathways in the C2 group were glycosaminoglycan biosynthesis, n-glycan biosynthesis, other types of o-glycan biosynthesis, glycan degradation, and ADP-ribosylation. There were no amino acid metabolism-related signaling pathways. Tryptophan metabolism valine, leucine and isoleucine degradation, kynurenine metabolism; taurine and hypotaurine metabolism; and selenocompound metabolism were the main differential signaling pathways (Fig. 3 and Table S6).

Fig. 3.

Fig. 3

Heatmap of significantly differential metabolism-associated pathways among three HNSCC subclasses.

3.3. Correlation of HNSCC subtypes with immune infiltration

We then compared the immune infiltration levels among the three components (Fig. 4A and B). The results indicated that the C3 components had more B cells naïve, plasma cells, T cells CD8, T cells follicular helper, T cells regulatory (Tregs), mast cells resting than the C1 and C2 components. The C2 component had higher T cells CD4+memory resting, T cells CD4 memory activated, NK cells resting, macrophages M1 and M2 than C1 and C3. The C1 component had higher macrophages M0 than C2 and C3. The ESTIMATE results indicated that C2 and C3 had higher stromal score than C1 (Fig. 4C). The immune and ESTMATE scores increased with the C1, C2, and C3 (adjusted P values within groups<0.001, Fig. 4D and E). We also evaluated the expression levels of several key immune-related genes among the three components. Significant differential expression was observed among C1, C2 and C3 (Fig. 4F). C2 exhibited higher expression levels of CD276, IL6, CD274, PVR, and TGFB1 than C1 and C3. Expression levels of CCL2, CTLA4, LGALS9, CXCR4, LAG3, CD4, and PDCD1 were higher in C3 than C1 and C2.

Fig. 4.

Fig. 4

Comparisons of immune status among three subclasses. A: Heatmap showed the immune cells infiltration levels among three subclasses and different clinical features. B: Box plot comparing the immune cells scale of fraction among three subclasses (***: adjusted P values). C-E: Violin plot shown stromal, immune, and ESTIMATE scores among three subtypes (***: adjusted P values). F: Comparisons of expression levels of several immune checkpoint genes among three subtypes (***: adjusted P values).

3.4. Development and validation of metabolism-related prognosis signature

Next, LASSO regression was performed to select the model genes, and 17 prognosis-related genes were included in the final model (Fig. 5A and B). Among these genes, CACNA1A, PIGP, GLYCTK, ABCB8, STAR, ELOVL6, HS3ST1, PNPLA4, PFKM, NAT10, and ATP6V0E1 were risk factors for patients with HNSCC, whereas SLC25A45, CSGALNACT1, PNPLA4, HK1, and DAD1 were favorable factors for patients with HNSCC (Table S7). We then calculated the risk score of each patient and divided the patients into high- and low-risk groups using the median risk score. The Kaplan-Meier curve indicated that the high-risk group had a poorer prognosis than the low-risk group in training (Fig. 5C) and validation cohorts (Fig. 5D). Finally, we performed the univariate and multivariate Cox regression analyses and identified the risk score was an independent prognostic factor for HNSCC (univariate: HR = 1.360, 95%CI: 1.277–1.448, P < 0.001, Fig. 5E; Multivariate: HR = 1.383, 95%CI: 1.289–1.484, P < 0.001, Fig. 5F). We also validated the prognostic signatures of three subtypes. The risk scores of C1 and C2 were significantly higher than those of C3 (P < 0.001), and there was no significant difference between C1 and C2 (Fig. 6A). Kaplan-Meier analysis also indicated that the high-risk group had a poorer prognosis than the low-risk group for the three different subtypes of patients (Fig. 6B–D).

Fig. 5.

Fig. 5

Identification of metabolism-associated genes prognosis model for HNSCC. A: Elastic net solution path of LASSO regression. B: Determination of the number of prognosis-related genes via the LASSO regression. C and D: Overall survival Kaplan-Meier curves of high-risk group and low-risk group for HNSCC patients in TCGA training group and validation group; E and F: Forest plots of univariate and multivariate cox regression analysis exhibited the association between risk score and overall survival in HNSCC patients.

Fig. 6.

Fig. 6

Association between risk score and three metabolism components. A: Comparisons of risk score among three subtypes (***: adjusted P values). B-D: Kaplan-Meier analysis curve of high- and low-risk groups in three subtypes.

3.5. Evaluation of prognostic model and individual risk scoring system

ROC curves were constructed for the training and validation datasets. The AUCs at 1-, 3-, and 5-year were 0.758, 0.796, and 0.790 in the training dataset (Fig. 7A), and 0.510, 0.675, and 0.633 in the validation dataset, respectively (Fig. 7B). We evaluated the predictive ability of the prognostic model using time-independent ROC curves (Fig. 7C). The results indicated that the risk score had the highest predictive ability compared to the other clinical parameters (AUC:0.758). Using the signature genes and clinical parameters, we established an individual risk scoring system (Fig. 7D). Furthermore, we evaluated the association among risk stratification, clinical parameters, and signature gene expression. We found that the high-risk group tended to have more advanced T and N stages, and higher expression of risk genes (Fig. 7E).

Fig. 7.

Fig. 7

Evaluation of prognosis model and individual risk scoring system. A and B: ROC curves of prognostic signature in training dataset and validation dataset. C: ROC curves of risk score and other clinical parameters in HNSCC patients. D: Nomograph plot of predicted 1-,3-and 5-year overall survival probability based on metabolism-related genes signature. E: Heatmap showed prognosis signatures genes among clinical features.

3.6. Correlation of HNSCC risk stratification with immune infiltration and mutation

The high-risk group had lower stromal (Fig. 8A), immune (Fig. 8B), and ESTMATE scores (Fig. 8C). Next, we evaluated the immune infiltration levels between the high-risk and low-risk groups (Fig. 8D). The high-risk group had fewer naïve B cells, plasma cells, follicular helper, and T cells regulatory, and more T cells CD4 memory resting, and macrophages M1. Other immune cells showed no significant differences (Fig. 8E). We also compared the gene alterations between the high-risk group and low-risk groups (Fig. 9). Gene alteration levels in the high-risk group were higher than those in the low-risk group (96.73% vs. 88.66%; Fig. 9A and B). Variant classification, variant type, and SNV class showed similar trends. Among the top 10 mutated genes in both the high and low-risk groups, 8 of them are shared, including TP53, TTN, NOTCH1, FAT1, CDKN2A, LRP1B, MUC16, PIK3CA, and CSMD3 (Fig. 9C and D). The top five mutation genes in the high-risk group were TP53, TTN, NOTCH1, CDKN2A, and SYNE1, and those in the low-risk group were TP53, FAT1, NOTCH1, TTN, and PIK3CA (Fig. 9E and F). The Venn plot showed that there 11230 overlapped genes between the high- and low-risk groups (Fig. 9G).

Fig. 8.

Fig. 8

Comparisons of immune status between high- and low-risk groups. A-C: Violin plot comparing the stromal, immune, and ESTIMATE scores between high- and low-risk groups. D: Heatmap showed the immune cells distributions between high- and low-risk groups. E: Violin plot comparing the immune cells infiltration levels between high- and low-risk groups.

Fig. 9.

Fig. 9

Landscape of mutation profiles between high-and low-risk group in HNSCC patients. A and B: Waterfall plots of mutation information in each sample of high-and low-risk HNSCC patients; C and D: The variant classification and type, SNV class summary of in high-and low-risk groups of HNSCC patients; E and F: Gene cloud plots of mutations frequencies in high-and low-risk groups of HNSCC patients. G: Venn plot of overlapped mutation genes between high-risk and low-risk groups.

4. Discussion

HNSCC is among the most common human malignancies, ranking sixth in the global incidence, with approximately 600,000 new cases and 380,000 deaths annually [1]. At present, the main treatment method for most head and neck cancers is surgical resection combined with radiotherapy and chemotherapy, however, its curative effect is not ideal, many patients eventually experience relapse or distant metastasis, the median survival time of patients with incurable or metastatic recurrence is low, and the long-term survival and quality of life of patients are unsatisfactory [21]. The survival rate of the first 5 years is only about 60%, and only 20–30% for patients at advanced stages [22]. Therefore, it is urgent to improve the cure rate, optimise the treatment, and find a practical prognostic evaluation method for HNSCC.

The present study had the following findings: (1) Patients with HNSCC can be divided into three molecular subtypes (C1, C2, and C3) based on 249 metabolism-related genes. (2) There were markedly different clinical characteristics, prognosis outcomes and biological function among the three subtypes. (3) Different molecular subtypes also have different tumour microenvironments and immune infiltration levels. (4) The established prognosis model with 17 signature genes could predict the prognosis of patients with HNSCC, and this prognosis model was validated using an independent prognosis model. (5) An individual risk-scoring tool was built with the risk score and clinical parameters; the risk score was an independent prognostic factor for HNSCC. (6) Different risk stratification had different clinical characteristics, biological features, tumor microenvironment and immune infiltration. The present study may provide novel insights for the prognosic risk evaluation and clinical treatment of patients with HNSCC.

Currently, classification, grading, and staging are the three most important indicators for evaluating the biological behaviour and diagnosis of tumours, among the latter two are mainly used for evaluating the biological behaviour and prognosis of malignant tumours [23]. In recent decades, owing to breakthroughs in life science and medical technology, the detection of targets related to the clinical application of precision and personalized medicine, including targeted therapy, has not only greatly improved the detection rate of early tumours, but also significantly improved the prognosis of many tumours [24]. The clinical value and significance of traditional tumor classification, grading and staging have also changed to varying degrees. Owing to the great heterogeneity of tumours, with the development of personzlised medicine, traditional clinical classification can no longer meet these needs [[25], [26]]. The molecular classification of HNSCC is helpful for better clinical management.

The established prognosis model with 17 signature genes could predict the prognosis of patients with HNSCC, and this prognosis model was validated using an independent prognosis model. Patients with HNSCC patients were divided into high- and low-risk groups, and the high-risk group had a poorer prognosis. Multiple Cox regression analysis indicated that a high-risk score was an independent prognostic factor for HNSCC. The time-independent curve indicates that the predictive ability of the model is moderate. Previous studies also built some other models of gene sets such as pyroptosis‐related (AUC: 0.676) [27], stemness-related (AUC:0.732) [28], and immune-related gene signatures in HNSCC(AUC:0.626) [29]. Two studies built prognostic models for HNSCC using metabolism-related genes. Du et al. and Qiang et al. developed 7 and 12 metabolism-related gene models for HNSCC, respectively [[30], [31]]. There are some remarkable differences between our study and the previous studies. First, the number used metabolism-related genes differed. The two previous studies included 669 and 1454 metabolism-related genes, whereas our study included 2752 genes. Second, this study provided additional results. Previous studies have developed prognostic model, and analyzed the characteristics of high- and low-risk group. Our study not only developed a prognostic model but also established molecular subtypes based on metabolism-related genes, including amino acid, carbohydrate, and lipid metabolism as well as other metabolic pathways. Third, the developed prognostic model was completely different between our study (17 genes) and previous studies (7 and 12 genes). No overlapped genes were found among the three prognostic models. We also constructed a risk scoring-system for personal survival based on metabolism-related genes, which has not been used in previous studies. Compared with the previous prognostic model, our predictive ability was the highest (0.758 for 5-year survival) based on training set. The AUCs in previous studies based on metabolism-related genes were 0.670 and 0.660 using training set; However, this finding still has some significance. However, there are some similar points that the results showed the high-risk group had low stromal, immune and ESTAMTE scores. The number of immune cells was lower in the high-risk groups.

We also found that different molecular subtypes and risk stratifications had different tumour microenvironment and immune infiltration levels, which may help in carrying out immunotherapy. In fact, for immunotherapy of HNSCC, researchers have further developed compound system therapy based on single cytokine therapy and have achieved good efficacy [32]. Various tumor vaccines, including protein/polypeptide-dendritic cell vaccines, have entered clinical trials [33]. In addition, immunotherapy targeting immune checkpoints such as PD-1 has received considerable attention, and related inhibitory antibodies have been approved for treating recurrent or metastatic HNSCC [[34], [35]]. Immunotherapy is possible for HNSCC.

This study had several limitations. First, although a prognostic model was developed, the validation cohort had a small sample size and the statistical significance level is very close to 0.05. Therefore, validation datasets with larger sample size are required for this model. Second, its predictive ability for survival prognosis of HNSCC is moderate, and its clinical application is restricted. However, the present results provide some clues for evaluating prognosis. Third, in our study, k = 3 was eventually chosen as optimal number of clusters based on current sample size instead of k = 2 to explore potential subtypes based on current sample size as many as we can. However, some noticeable small mini clusters can be found when k = 3 was chosen and, will also need to be tested and re-evaluated in a future larger study. Finally, the biological functions and molecular mechanisms need to be validated in vivo and in vitro. Considering the rise of immunotherapy and its application in HNSCC, future studies should focus on the effect of metabolism on the immune response.

5. Conclusions

In conclusion, metabolism-related genes can be used to identify the molecular subtypes of HNSCC characterized by different clinical and prognostic outcomes, biological functions, and immune infiltration levels. A prognostic model established using metabolism-related genes could predict the prognosis of patients with HNSCC. Multiple functions and signaling pathways are involved in the development of HNSCC. Risk stratification could be associated with immune infiltration, tumor microenvironment, and gene alterations. Our study could be used for clinical risk management and to help conduct precision medicine for patients with HNSCC.

Ethics declarations

The ethics are not applicable because this study is based on publicly available data.

Funding

None.

Data availability statement

All data can be available without any restriction, and these data can be download from TCGA database (https://portal.gdc.cancer.gov/). We also would like to share all data and code details if they had any questions.

CRediT authorship contribution statement

Mengxian Jiang: Writing – original draft. Xiang Gu: Software, Investigation, Formal analysis, Data curation. Yexing Xu: Writing – review & editing, Project administration, Formal analysis, Conceptualization. Jing Wang: Writing – review & editing, Supervision, Formal analysis.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.heliyon.2024.e27587.

Contributor Information

Yexing Xu, Email: xuyexing1004@163.com.

Jing Wang, Email: 280827451@qq.com.

Appendix A. Supplementary data

The following are the Supplementary data to this article.

Multimedia component 1
mmc1.xlsx (919.7KB, xlsx)

figs1.

figs1

figs2.

figs2

figs3.

figs3

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.xlsx (919.7KB, xlsx)

Data Availability Statement

All data can be available without any restriction, and these data can be download from TCGA database (https://portal.gdc.cancer.gov/). We also would like to share all data and code details if they had any questions.


Articles from Heliyon are provided here courtesy of Elsevier

RESOURCES