Abstract
Purpose
Head and neck squamous cell carcinoma (HNSCC) is the sixth leading cancer globally, and radiotherapy plays a crucial part in its treatment. This study was designed to identify potential genes related to radiation resistance in HNSCC.
Method
We first used text mining to obtain common genes related to radiotherapy resistance and HNSCC in published articles. Functional enrichment analyses were conducted to identify the significantly enriched pathways and genes. Protein and protein interactions were performed, and the most significant gene modules were determined; then, genes in the gene modules were validated at transcriptional levels and overall survival. Gene set variation analysis (GSVA) score was calculated, and the association between GSVA score and survival/pathway was estimated. Immune cell infiltration, methylation, and genetic alteration analysis of these genes was conducted in HNSCC patients. Finally, potential sensitive anticancer drugs related to target genes were obtained.
Result
We identified 583 common genes through text mining. After further validation, a four-gene signature (EPHB2, SPP1, SERPINE1, and VEGFC) was constructed. The patients with higher GSVA scores have a worse prognosis than those with lower GSVA scores. Differences in methylation of these four genes in HNSCC tumor tissue and normal tissue were compared, with higher methylation levels of EBPH2 and SPP1 in normal tissue and higher methylation levels of SERPINE1 in the tumor. Immune cell infiltration revealed that the increased expression of these genes was closely related to the infiltration level of CD4+ T cell, neutrophil, macrophage, and dendritic cell. Thirty drugs, including 22 positively and eight negatively correlated drugs that most correlated with related genes, were available for treating HNSCC.
Conclusion
In this study, we identified four potential genes as well as corresponding drugs that might be related to radioresistance in HNSCC patients. These candidate genes may provide a promising avenue to further elevate radiotherapy efficacy.
1. Introduction
Head and neck squamous cell carcinoma (HNSCC) is the sixth most common type of cancer diagnosed in the world, with 890,000 new cases and 450,000 deaths annually [1, 2], and the incidence increases year by year [3]. HNSCC originates from mucosal epithelium cells in the oral cavity, oropharynx, hypopharynx, and larynx, accounting for 90% of all head and neck cancer [4]. The main risk factors of HNSCC include smoking, heavy drinking, exposure to environmental pollutants, and infection with human papillomavirus and EB virus.
The treatment of HNSCC differs according to the stage of disease, anatomical site, and surgical accessibility, including surgery, radiotherapy, systemic chemotherapy, targeting, and immunotherapy. As a noninvasive and function-preserving therapy, radiation therapy plays a crucial role in curative-intent treatments for head and neck cancers. It can be used as a single treatment for early tumors or combined with surgery or concurrent chemotherapy as a treatment for advanced tumors [5].
Although there has been tremendous progress in treatment in recent years, the survival rate of head and neck cancer has not been significantly improved [6, 7]. The efficacy of radiotherapy is affected by the sensitivity of radiotherapy, and radiotherapy resistance would lead to treatment failure. A mechanism for radiotherapy resistance is hypoxia. Adequate oxygen can maintain the sensitivity of tumor tissue to radiotherapy, and hypoxia can lead to radiotherapy resistance, thereby reducing the effect of radiotherapy. In addition to hypoxia, there are several molecular and biological mechanisms of radiation resistance, including alterations in intracellular pathways involved in DNA damage and repair, apoptosis, proliferation, and angiogenesis [8]. A couple of signaling pathways related to radiotherapy resistance have been intensively studied, and the three most clinically relevant pathways are EGFR, PI3K/Akt/mTOR, and P53 [9].
Identifying potential genes and pathways associated with radiotherapy resistance may facilitate proper treatment selection and improve outcomes. With the development of bioinformatics, text mining and data analysis have been applied to several aspects of cancer research, such as identifying potential key gene targets, signaling pathways, and predicting the outcomes [10, 11].
In this study, we first used text mining to obtain common genes, which represent the genes related to radiotherapy resistance and HNSCC in published articles. Secondly, a four-gene signature associated with radioresistance was identified, and the roles of the gene signature in the prognosis, pathway, methylation, immunity, and genetic alterations of HNSCC were explored. Finally, multiple potential targeted drugs associated with radioresistance were established for HNSCC patients. Figure 1 shows the workflow of this study.
Figure 1.

The workflow of the study. GSRR: gene set of radioresistance; GSHNSCC: gene set of head and neck squamous cell carcinoma.
2. Materials and Methods
2.1. Text Mining
The database pubmed2ensembl was used to perform text mining. pubmed2ensembl provides links between literature and genes, which can be used for data mining. When a query is executed, it extracts all genes related to the search keywords from available biological literature [12]. Firstly, we inputted the keywords “radioresistance,” “radiation resistance,” and “radiotherapy resistance” into the website, respectively, and then, the system extracted all related genes. After removing the duplicate genes, these gene sets made up the gene set of radioresistance (GSRR). Secondly, we performed a query using the keyword “head and neck squamous cell carcinoma,” and the gene sets of head and neck squamous cell carcinoma (GSHNSCC) were obtained. The intersection of GSRR and GSHNSCC was common genes and then used for further analysis.
2.2. Functional and Signal Pathway Enrichment Analysis
GO provides a controlled glossary of terms to clarify the characteristics of a gene product through their annotations. The GO terms reflect what is currently known about a gene in terms of biological process (BP), cellular component (CC), and molecular function (MF) [13, 14]. Furthermore, the Kyoto Encyclopedia of Genes and Genomes (KEGG) [15] provides data sources of known biological pathways to mark a gene or group of genes/proteins with their respective KEGG pathways. The GO and KEGG enrichment analysis of common genes was conducted using an online tool DAVID [16], a functional annotation and bioinformatics microarray analysis website. FDR (false discovery rate) < 0.05 was considered statistically significance.
Genes from the significant enrichment terms for BP, CC, MF, and KEGG signal pathways were downloaded from the website DAVID, extracted using R software, and used in the following analysis.
2.3. Protein-Protein Interaction Analysis and Gene Module Analysis
The database STRING was used to display the protein-protein interaction (PPI) network of the screened genes. There are seven ways in which proteins interact with each other built into the database, including text mining, experiments, databases, coexpression, cooccurrence, neighborhood, and gene fusion. The genes selected from the GO and pathway enrichment analysis were used as the input set. The minimum required interaction score was set to 0.90 (highest confidence) to further narrow the candidate gene field. The interaction file was downloaded in TSV format and processed using Cytoscape software. The plugin Molecular Complex Detection (MCODE) and STRING in Cytoscape were used to extract significant gene modules in the PPI network. The parameter settings were default values, except the K-core was set to 9. Then, the two most significant gene modules, including 50 genes, in the PPI network were selected for further analysis.
2.4. Verification of the Genes in TCGA/GTEx
To screen for specific genes, we verified the gene expression levels and OS of these 50 genes on the webserver GEPIA, which could deliver fast and customizable functionalities based on The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx) data [17]. As for gene expression validation involved in 519 HNSCC tumor samples and 44 normal samples, the threshold with ∣log2 FC | ≥1 and P value < 0.01 was considered statistically significant. For overall survival (OS) analysis of the HNSCC dataset, 362 patients with available OS time data were classified into the low-expression and high-expression groups according to the median transcripts per kilobase million (TPM), and log-rank P < 0.05 was considered to be significant.
2.5. Gene Set Variation Analysis
To further observe the performance of these four genes as a whole in various aspects, the gene set variation analysis (GSVA) [18] score, which represents the integrated level of the expression of a gene set, was calculated through the R package GSVA. Moreover, the association between GSVA score and survival/pathway/methylation in HNSCC was estimated using the online website GSCA [19].
2.6. Immune Cell Infiltration Analysis
The composition and abundance of immune cells in the tumor microenvironment occupy a crucial role in tumor progression and the efficacy of immunotherapy. The website Tumor Immune Estimation Resource [20] (TIMER, https://cistrome.shinyapps.io/timer/) was utilized to further explore the potential association of gene expression and immune infiltration levels of immune cells, including B cell, CD8+ T cell, CD4+ T cell, macrophage, neutrophil, and dendritic cell (DC). P < 0.05 was considered statistically significant.
2.7. Genetic Alteration Analysis
We further investigated the genetic alterations of these four genes at the DNA level using the cBioPortal. cBioPortal [21] (http://www.cbioportal.org) is a comprehensive online tool providing us with visual and multidimensional cancer genomics data based on TCGA database.
2.8. Drug Sensitivity and Gene Expression
To explore potential drugs that can regulate the sensitivity of radiotherapy, we used the online website GSCA to further seek sensitive drugs related to target genes. The database contains the IC50 of 265 small molecules in 860 cell lines and its corresponding mRNA gene expression from Genomics of Drug Sensitivity in Cancer (GDSC) [19]. These drug candidates that target genes associated with radiotherapy resistance may herald the arrival of new potential therapies. Materials and Methods should contain sufficient detail so that all procedures can be repeated. It may be divided into headed subsections if several methods are described.
3. Results
3.1. Text Mining
Based on the text mining strategy described in Methods, after deleting the duplicates, 1218 genes were found in GSRR, 1359 genes were found in GSHNSCC, and 583 genes were common to both lists.
3.2. Functional and Signal Pathway Enrichment Analysis
The common genes were uploaded to the DAVID website to identify the GO and pathway terms. Figures 2(a)–2(d) show the top 10 significant enrichment terms for BP, CC, MF, and KEGG signal pathways. In BP annotation, it was mainly involved in response to organic substance, positive regulation of metabolic process, and cellular response to chemical stimulus (Figure 2(a)). In CC annotation, it was significantly involved in the extracellular region, cytosol, and membrane-bounded vesicle (Figure 2(b)). In MF annotation, it was mainly enriched in enzyme binding, receptor binding, and carbohydrate derivative binding (Figure 2(c)). As for signal pathway enrichment, it was primarily involved in the pathways in cancer, PI3K-Akt signaling pathway, and HTLV-I infection (Figure 2(d)), respectively. We extracted the genes that were all significantly enriched in BP, CC, MF, and KEGG signaling pathways, and there were 353 common genes left for the subsequent analysis.
Figure 2.

Functional enrichment of common genes. (a–c) The top ten significant GO terms of common genes. (d) The top ten significant KEGG pathways of common genes.
3.3. Protein-Protein Interaction Analysis and Gene Module Analysis
The screened genes were entered into the STRING website, and the TSV file was obtained, then downloaded, and analyzed using Cytoscape software.
The obtained PPI networks consisted of 329 genes/nodes and 3068 edges (Figure 3(a)), and 24 genes were not involved in the PPI networks. Then, the two most significant gene modules were clustered via MCODE APP built-in Cytoscape. Module 1 consisted of 33 genes/nodes and 191 edges, while module 2 was made up of 17 genes/nodes and 95 edges (Figure 3(b)).
Figure 3.

The construction of protein-protein interaction (PPI) networks and significant gene module analysis. (a) The entire PPI networks of common genes. (b) The two most significant gene modules in the PPI networks.
3.4. Verification of the Genes in TCGA/GTEx
We verified the expression between normal and cancerous tissues and the survival between high and low expression of these 50 genes in GEPIA (Figures S1–S4). Finally, four genes were filtered (EPHB2, SPP1, SERPINE1, and VEGFC). Their expression in tumor tissues was significantly higher than in normal tissues (Figure 4(a)), and the OS of high expression is worse than low expression (Figure 4(b)). Moreover, the functional annotation of these four genes was conducted using DAVID. Enrichment analysis showed that these four candidate genes are significantly enriched in regulation of anatomical structure morphogenesis (BP, P = 2.51E − 04), platelet alpha granule lumen (CC, P = 0.011), and receptor binding (MF, P = 8.41E − 04), respectively (Figure 4(c), Table 1).
Figure 4.

Verification of the filtered four genes and their functional annotation. (a) Validation of the gene expression of EPHB2, SPP1, SERPINE1, and VEGFC in HNSC datasets. The cutoff: ∣log2 fold change (FC) | ≥1, and P < 0.01 (∗ indicates P < 0.01). (b) Overall survival analysis of EPHB2, SPP1, SERPINE1, and VEGFC in HNSC datasets. (c) Chord plot for functional enrichments of four genes.
Table 1.
The top six gene ontology enrichment terms of identified four genes (∗terms that do not meet the P value < 0.05 conditions are not shown).
| Category | Term | Description | Count | Gene | P value |
|---|---|---|---|---|---|
| GOTERM_BP_FAT | GO:0022603 | Regulation of anatomical structure morphogenesis | 4 | SERPINE1, SPP1, VEGFC, EPHB2 | 2.51E − 04 |
| GOTERM_BP_FAT | GO:0051240 | Positive regulation of multicellular organismal process | 4 | SERPINE1, SPP1, VEGFC, EPHB2 | 6.34E − 04 |
| GOTERM_BP_FAT | GO:2000026 | Regulation of multicellular organismal development | 4 | SERPINE1, SPP1, VEGFC, EPHB2 | 0.001102607 |
| GOTERM_BP_FAT | GO:0001525 | Angiogenesis | 3 | SERPINE1, VEGFC, EPHB2 | 0.001811168 |
| GOTERM_BP_FAT | GO:0009605 | Response to external stimulus | 4 | SERPINE1, SPP1, VEGFC, EPHB2 | 0.001927546 |
| GOTERM_BP_FAT | GO:0048514 | Blood vessel morphogenesis | 3 | SERPINE1, VEGFC, EPHB2 | 0.002552747 |
| GOTERM_CC_FAT | GO:0031093 | Platelet alpha granule lumen | 2 | SERPINE1, VEGFC | 0.011315988 |
| GOTERM_CC_FAT | GO:0031091 | Platelet alpha granule | 2 | SERPINE1, VEGFC | 0.01540963 |
| GOTERM_CC_FAT | GO:0034774 | Secretory granule lumen | 2 | SERPINE1, VEGFC | 0.017656309 |
| GOTERM_CC_FAT | GO:0060205 | Cytoplasmic membrane-bounded vesicle lumen | 2 | SERPINE1, VEGFC | 0.021528882 |
| GOTERM_CC_FAT | GO:0031983 | Vesicle lumen | 2 | SERPINE1, VEGFC | 0.021732419 |
| GOTERM_CC_FAT | GO:0005615 | Extracellular space | 3 | SERPINE1, SPP1, VEGFC | 0.027662649 |
| GOTERM_MF_FAT | GO:0005102 | Receptor binding | 4 | SERPINE1, SPP1, VEGFC, EPHB2 | 8.41E − 04 |
3.5. Gene Set Variation Analysis
GSVA score was calculated and compared between tumor and normal samples in HNSCC samples on the GSCA. The GSVA score of the tumor group was significantly higher than that of the normal tissue group (Figure 5(a)). Furthermore, the association between GSVA score and survival (includes overall survival (OS), progression-free survival (PFS), disease-specific survival (DSS), and disease-free interval (DFI)) was estimated, and it was found that the GSVA score had a significant relationship with OS, PFS, DSS, and DFI in HNSCC patients (Figures 5(b)–5(e)). These findings suggest that this 4-gene signature has a high potential for predicting poor prognosis in HNSCC patients by assessing their risk score based on related gene expression levels.
Figure 5.

GSVA and methylation analysis. (a) Boxplot showing the GSVA score of the tumor and normal tissue groups. (b–e) OS, PFS, DFS, and DFI analysis between higher and lower GSVA score in HSCC patients. (f) Overview of the relationship between GSVA score and activity of tumor-related pathways. (g) Correlation between GSVA score and EMT pathway activity. (h) Correlation between GSVA score and cell cycle pathway activity. (i–k) Methylation analysis of EPHB2, SPP1, and SERPINE1.
Figure 5(f) shows the correlation between GSVA score and activity of cancer-related pathways in HNSCC. These four genes were positively correlated with EMT pathway activity (P value = 0.00, FDR = 1.5e − 12, Figure 5(g)) and negatively associated with cell cycle pathway activity (P value = 0.00, FDR = 8.3e − 03, Figure 5(h)).
3.6. Methylation Analysis
Differences in methylation of these four genes in HNSCC tumor tissue and normal tissue were compared (Figures 5(i)–5(k)), with higher methylation levels of EBPH2 and SPP1 in normal tissue and higher methylation levels of SERPINE1 in tumor (P < 0.05). There is no difference in methylation levels of VEGFC between the two groups.
3.7. Immune Infiltration Cell Analysis
We used the TIMER database to explore the association between the expression of the selected genes and immune infiltrating cells, including B cell, CD8+ T cell, CD4+ T cell, macrophage, neutrophil, and dendritic cell (DC) in HNSCC. The result revealed that the expression of EPHB2 was positively correlated with all of these immune cells' expression levels. The expression of SPP1 was positively correlated with the expression levels of CD4+ T cell, macrophage, neutrophil, and DC. The expression of SERPINE1 was positively correlated with the expression levels of CD4+ T cell, neutrophil, and DC, while negatively associated with B cell and CD8+ T cell. The expression of VEGFC was positively correlated with CD4+ T cell, neutrophil, and DC, while negatively correlated with B cell and CD8+ T cell (Figures 6(a)–6(d)).
Figure 6.

Association of gene expression and immune infiltration levels of immune cells including CD4+ T cell, CD8+ T cell, macrophage, dendritic cell, neutrophil, and B cell.
3.8. Genetic Alteration Analysis
At the cBioPortal, we examined the genetic alterations of these four prognostic genes in 3 HNSCC studies from TCGA. Overall, we found that these genes were altered in 126 (9%) of all case, with 30 (10.75%) of 279 cases (TCGA, Nature 2015), 50 (9.43%) of 530 cases (TCGA, Firehose Legacy), and 46 (8.8%) of 523 cases (TCGA, PanCancer Atlas), respectively (Figure 7(a)). The amplification of SERPINE1 and the deep deletion of VEGFC were the most frequent copy number alterations (CNA) among these genes (Figure 7(b)).
Figure 7.

Genetic alteration analysis and exploration of targeted drugs. (a, b) Genetic alteration analysis of the four genes in three HNSCC datasets. (c) Correlation between GDSC drug sensitivity and gene expression.
3.9. Discovery of Gene-Related Sensitive Drugs
The four genes were used for drug-sensitivity analysis on the website GSCA. Figure 7(c) summarizes the top 30 drugs most associated with gene expression in pan-cancer. The drugs are ranked by the integrated level of correlation coefficient and FDR of searched genes, including 22 positively correlated drugs and eight negatively correlated drugs. Information on these sensitive drugs may provide new ideas for further alleviating radioresistance. Results and Discussion may be presented separately, or in one combined section, and may optionally be divided into headed subsections.
4. Discussion
HNSCC is one of the most common malignant tumors that seriously threaten human health, and radiotherapy plays a crucial part in its treatment. Radiotherapy resistance can reduce the local efficacy of HNSCC patients and lead to tumor recurrence and distant metastasis and is significantly associated with poor prognosis. Identifying new genes related to radiation resistance in HNSCC may further improve the prognosis of these patients. In this study, we adopted the approach of text mining and bioinformatics, and finally, four potential genes (EPHB2, SPP1, SERPINE1, and VEGFC) were found.
The erythropoietin-producing hepatocellular carcinoma receptor B2 (EPHB2), an essential member of the erythropoietin-producing hepatocellular carcinoma (EPH) receptor family, has been confirmed to be highly expressed in various tumors, such as breast cancer, meningioma, colorectal cancer, gastric cancer, and lung cancer [22]. High expression of EPHB2 is considered a proto-oncogene by promoting blood vessel formation by enhancing cancer cell motility, invasion, and metastasis. Thus, the EPHB2 gene is regarded as a pro-oncogene. EPHB2 is found to play a critical role in the progression from early-stage cutaneous squamous cell carcinoma to an advanced stage, and it is speculated to be a therapeutic target for invasive cutaneous squamous cell carcinoma [23]. Another study has demonstrated that EPHB2 expression regulates angiogenesis both in vitro and in vivo and that EPHB2 overexpression is associated with poor prognosis and tumor angiogenesis in HNSCC patients [24]. Bhatia et al. observed the expression of EPHB2 in the myeloblastoma cell line, and the cell line was more sensitive to radiotherapy after the EphB2 was knocked out. The radiosensitization effect was partly mediated by enhancing G2/M cell cycle arrest. It suggests that blocking the EPHB2 receptor may be a way to improve radiation sensitivity in medulloblastoma [25].
Secreted phosphoprotein 1 (SPP1), also known as osteopontin (OPN), is an extracellular matrix protein closely associated with malignant tumors [26]. It is highly expressed in breast cancer [27], non-small-cell lung cancer [28], prostate cancer [29], and liver cancer [30]. It is associated with tumor invasion and metastasis, apoptosis inhibition, angiogenesis, and chemotherapy resistance [31]. The level of SPP1 mRNA in HNSCC was higher than that in noncancerous tissues, and high SPP1 mRNA level was associated with poor overall survival of HNSCC patients [32]. Another study showed that the serum SPP1 level of HNSCC with positive cervical lymph nodes was significantly higher than that with negative cervical lymph nodes, and researchers concluded that SPP1 might play an essential role in developing cervical lymph nodes metastasis in NHSCC patients [33].
Serpin family E member 1 (SERPINE1), also named plasminogen activator inhibitor type 1 (PAI-1), is a fibrinolysis inhibitor found to be abnormally expressed in gastric cancer [34], colon cancer [35], diffuse lower-grade gliomas [36], and other cancers. It is a biomarker of the poor prognosis of the malignant tumor. Overexpression of SERPINE1 promotes tumor progression, lymph node metastasis, and lung metastasis of HNSCC and is associated with poor prognosis [37]. Yang et al. found that SERPINE1 was significantly overexpressed in HNSCC and correlated with the prognosis of HNSCC patients [38]. Wang et al. constructed a 6-gene signature (including SERPINE1) that can be used to predict the survival of HNSCC patients [39]. Similar to our findings, Lee et al. found that SERPINE1 was associated with radioresistance in HNSCC, and knockdown of SERPINE1 increased radiosensitivity in head and neck cancer tumor-initiating cells [40].
Vascular endothelial growth factor-C (VEGFC) is one of the most crucial lymphangiogenic growth factors, which can stimulate the formation of tumor lymphatics [41]. Recent evidence suggests that VEGFC is also an immunomodulator that modulates the immune system, making tumor cells more likely to evade immune surveillance [42]. Tacconi et al. demonstrated that VEGFC could accelerate tumor growth via fostering cancer immune escape [43]. High expression of VEGFC is significantly associated with poor prognosis in various malignancies, including lung cancer [44], gastric cancer [45], thyroid cancer [46], colorectal cancer [47], and HNSCC [48]. Furthermore, a study showed that silencing VEGFC can increase the radiosensitivity of nasopharyngeal carcinoma CNE-2 cells [49].
Previous studies on radiosensitivity in head and neck cancer have identified some related genes. Some are based on the gene expression levels of cell lines [50, 51], and some are based on gene expression levels in tumors and patient survival data [52, 53]. Foy et al. analyzed the data from the public database by data mining. They found that 13 genes were associated with in vitro and in vivo radioresistance for HPV-negative HNSCC [54]. Although these genes were related to disease-free survival, the detection of 13 genes makes it an inconvenience for clinical application. In contrast, our four-gene signature is more conducive to clinical use. Similar to our finding, Shen et al. adopted the method of bioinformatics to explore expression profile chip data from Gene Expression Omnibus (GEO) and the RNAseq tertiary dataset from TCGA database of HNSCC, three key genes (TGFBI, SPP1, and LAMB3) related to HNSCC prognosis were identified [55]. In another study, differentially expressed genes from TCGA were analyzed and significantly enriched in the GO and KEGG pathways of mitosis, cell cycle, Wnt, JAK/STAT, and TLR signaling pathways [56]. Liu et al. identified an eight-mRNA signature and a nomogram to predict the prognosis of HNSCC and also found that the EMT pathway plays a crucial role in their signature [57]. However, the three researches mentioned above were only based on TCGA (and GEO) dataset and have not been validated by other relevant evidence and experiments. In this study, the initial genes were obtained from a large number of published literature based on the text mining method, and then, the differences in expression and survival were verified in TCGA/GTEx database.
To further validate these candidate genes' expression and prognostic value as a whole, we conducted gene set variation analysis, showing that a high GSVA score predicts poor survival, which consists of the single gene survival analysis.
Immunotherapy plays a critical role in the treatment of HNCSS, especially in recurrent and metastatic diseases [58]. Tumor-infiltrating lymphocytes and immunological scores are linked to HNSCC prognosis and predicted radiation and chemotherapy effectiveness [59]. HNSCC is an immunosuppressive cancer that causes a broad range of immunosuppressive cells to be generated. Here, we examined the relationship between the candidate genes and immune cells and found that the infiltration of CD4+ T cell, neutrophil, macrophage, and dendritic cell was closely related to the high expression of genes. However, in HNSCC, the fundamental mechanism of immune infiltration against tumor response is still unknown. Jiang et al. argue that macrophages, T cells CD8, and T cells CD4 memory are the most commonly infiltrated subtypes of immune cells in HNSCC [60], while Wang et al. found a higher abundance of B cells, CD8+ T cells, neutrophils, and DCs in the low-risk group compared with the high-risk group, which predicts better prognosis [61]. We hypothesize that this is related to the heterogeneity of head and neck tumors and the fact that some immune cells have different subtypes that have reverse effects on prognosis.
To further validate these candidate genes' expression and prognostic value as a whole, we conducted gene set variation analysis, showing that a high GSVA score predicts poor survival, which consists of the single gene survival analysis. However, this study has certain limitations; firstly, the data on gene expression levels were obtained from public databases, and tumor specimens from patients in our hospital were not collected for verification. Secondly, the specific relationship and mechanism between these genes and radiotherapy resistance need to be further verified by experiments.
5. Conclusions
In this study, adopting the method of text mining and bioinformatics, we finally screened out four genes (EPHB2, SPP1, SERPINE1, and VEGFC) and corresponding signaling pathways that might be related to radiotherapy resistance in HNSCC patients. These genes are more highly expressed in tumors than in normal tissue and are associated with a poor prognosis of HNSCC. This study improves our understanding of the pathogenesis and underlying molecular mechanism of radiotherapy resistance and provides clues for further elevated radiotherapy efficacy in HNSCC patients.
Data Availability
The data used to support the findings of this study are partially included in the article. And others are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Authors' Contributions
Yongqian Zhang and Hongmin Wang contributed equally to this work.
Supplementary Materials
Figure S1: validation of the expression of genes in gene module 1 in HNSC datasets. The cutoff: ∣log2 fold change (FC) | ≥1, and P < 0.01 (∗ indicates P < 0.01). Figure S2: validation of the expression of genes in gene module 2 in HNSC datasets. The cutoff: ∣log2 fold change (FC) | ≥1, and P < 0.01 (∗ indicates P < 0.01). Figure S3: overall survival (OS) analysis of genes in gene module 1 in HNSC datasets. Figure S4: overall survival (OS) analysis of genes in gene module 2 in HNSC datasets; HNSC: head and neck squamous cell carcinoma; HR: hazard ratio; TPM: transcripts per kilobase million.
References
- 1.Ferlay J., Colombet M., Soerjomataram I., et al. Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods. International Journal of Cancer . 2019;144(8):1941–1953. doi: 10.1002/ijc.31937. [DOI] [PubMed] [Google Scholar]
- 2.Bray F., Ferlay J., Soerjomataram I., Siegel R. L., Torre L. A., Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: a Cancer Journal for Clinicians . 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
- 3.Jiang X., Ye J., Dong Z., Hu S., Xiao M. Novel genetic alterations and their impact on target therapy response in head and neck squamous cell carcinoma. Cancer Management and Research . 2019;Volume 11:1321–1336. doi: 10.2147/CMAR.S187780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Muzaffar J., Bari S., Kirtane K., Chung C. H. Recent advances and future directions in clinical management of head and neck squamous cell carcinoma. Cancers . 2021;13(2):p. 338. doi: 10.3390/cancers13020338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Alterio D., Marvaso G., Ferrari A., Volpe S., Orecchia R., Jereczek-Fossa B. A. Modern radiotherapy for head and neck cancer. Seminars in Oncology . 2019;46(3):233–245. doi: 10.1053/j.seminoncol.2019.07.002. [DOI] [PubMed] [Google Scholar]
- 6.Klein J., Livergant J., Ringash J. Health related quality of life in head and neck cancer treated with radiation therapy with or without chemotherapy: a systematic review. Oral Oncology . 2014;50(4):254–262. doi: 10.1016/j.oraloncology.2014.01.015. [DOI] [PubMed] [Google Scholar]
- 7.Higgins G. S., O’Cathail S. M., Muschel R. J., McKenna W. G. Drug radiotherapy combinations: review of previous failures and reasons for future optimism. Cancer Treatment Reviews . 2015;41(2):105–113. doi: 10.1016/j.ctrv.2014.12.012. [DOI] [PubMed] [Google Scholar]
- 8.Ahmad P., Sana J., Slavik M., Slampa P., Smilek P., Slaby O. MicroRNAs involvement in radioresistance of head and neck cancer. Disease Markers . 2017;2017:8. doi: 10.1155/2017/8245345.8245345 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Perri F., Pacelli R., Della Vittoria Scarpati G., et al. Radioresistance in head and neck squamous cell carcinoma: biological bases and therapeutic implications. Head & Neck . 2015;37(5):763–770. doi: 10.1002/hed.23837. [DOI] [PubMed] [Google Scholar]
- 10.Kirk J., Shah N., Noll B., et al. Text mining-based in silico drug discovery in oral mucositis caused by high-dose cancer therapy. Support Care Cancer . 2018;26(8):2695–2705. doi: 10.1007/s00520-018-4096-2. [DOI] [PubMed] [Google Scholar]
- 11.Zhang N., Xu W., Wang S., Qiao Y., Zhang X. Computational drug discovery in chemotherapy-induced alopecia via text mining and biomedical databases. Clinical Therapeutics . 2019;41(5, article e978):972–980.e8. doi: 10.1016/j.clinthera.2019.04.003. [DOI] [PubMed] [Google Scholar]
- 12.Baran J., Gerner M., Haeussler M., Nenadic G., Bergman C. M. pubmed2ensembl: a resource for mining the biological literature on genes. PLoS One . 2011;6(9, article e24716) doi: 10.1371/journal.pone.0024716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hale M. L., Thapa I., Ghersi D. FunSet: an open-source software and web server for performing and displaying Gene Ontology enrichment analysis. BMC Bioinformatics . 2019;20(1):p. 359. doi: 10.1186/s12859-019-2960-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gene Ontology Consortium. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Research . 2017;45(D1):D331–D338. doi: 10.1093/nar/gkw1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kanehisa M., Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research . 2000;28(1):27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sherman B. T., Hao M., Qiu J., et al. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update) Nucleic Acids Research . 2022;50(W1):W216–W221. doi: 10.1093/nar/gkac194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Tang Z., Li C., Kang B., Gao G., Li C., Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Research . 2017;45(W1):W98–W102. doi: 10.1093/nar/gkx247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hänzelmann S., Castelo R., Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics . 2013;14(1):p. 7. doi: 10.1186/1471-2105-14-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Liu C.-J., Hu F.-F., Xia M.-X., Han L., Zhang Q., Guo A. Y. GSCALite: a web server for gene set cancer analysis. Bioinformatics . 2018;34(21):3771–3772. doi: 10.1093/bioinformatics/bty411. [DOI] [PubMed] [Google Scholar]
- 20.Li T., Fan J., Wang B., et al. TIMER: a web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Research . 2017;77(21):e108–e110. doi: 10.1158/0008-5472.CAN-17-0307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gao J., Aksoy B. A., Dogrusoz U., et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Science Signaling . 2013;6(269):p. pl1. doi: 10.1126/scisignal.2004088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Liu W., Yu C., Li J., Fang J. The roles of EphB2 in cancer. Frontiers in Cell and Development Biology . 2022;10, article 788587 doi: 10.3389/fcell.2022.788587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Farshchian M., Nissinen L., Siljamäki E., et al. EphB2 promotes progression of cutaneous squamous cell carcinoma. The Journal of Investigative Dermatology . 2015;135(7):1882–1892. doi: 10.1038/jid.2015.104. [DOI] [PubMed] [Google Scholar]
- 24.Sato S., Vasaikar S., Eskaros A., et al. EPHB2 carried on small extracellular vesicles induces tumor angiogenesis via activation of ephrin reverse signaling. Insight . 2019;4(23) doi: 10.1172/jci.insight.132447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bhatia S., Hirsch K., Bukkapatnam S., et al. Combined EphB2 receptor knockdown with radiation decreases cell viability and invasion in medulloblastoma. Cancer Cell International . 2017;17(1):p. 41. doi: 10.1186/s12935-017-0409-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Del Prete A., Scutera S., Sozzani S., Musso T. Role of osteopontin in dendritic cell shaping of immune responses. Cytokine & Growth Factor Reviews . 2019;50:19–28. doi: 10.1016/j.cytogfr.2019.05.004. [DOI] [PubMed] [Google Scholar]
- 27.Behbahani R. G., Danyaei A., Teimoori A., Neisi N., Tahmasbi M. J. La radioresistance au cancer du sein peut etre surmontee par le _knock-out_ du g ene de l’osteopontine avec la technique CRISPR/Cas9. Cancer Radiothérapie . 2021;25(3):222–228. doi: 10.1016/j.canrad.2020.08.048. [DOI] [PubMed] [Google Scholar]
- 28.Shi L., Wang X. Role of osteopontin in lung cancer evolution and heterogeneity. Seminars in Cell & Developmental Biology . 2017;64:40–47. doi: 10.1016/j.semcdb.2016.08.032. [DOI] [PubMed] [Google Scholar]
- 29.Pang X., Gong K., Zhang X., Wu S., Cui Y., Qian B. Z. Osteopontin as a multifaceted driver of bone metastasis and drug resistance. Pharmacological Research . 2019;144:235–244. doi: 10.1016/j.phrs.2019.04.030. [DOI] [PubMed] [Google Scholar]
- 30.Bruha R., Vitek L., Smid V. Osteopontin - a potential biomarker of advanced liver disease. Annals of Hepatology . 2020;19(4):344–352. doi: 10.1016/j.aohep.2020.01.001. [DOI] [PubMed] [Google Scholar]
- 31.Bie T., Zhang X. Higher expression of SPP1 predicts poorer survival outcomes in head and neck cancer. Journal of Immunology Research . 2021;2021:11. doi: 10.1155/2021/8569575.8569575 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cai X., Zhang H., Li T. The role of SPP1 as a prognostic biomarker and therapeutic target in head and neck squamous cell carcinoma. International Journal of Oral and Maxillofacial Surgery . 2022;51(6):732–741. doi: 10.1016/j.ijom.2021.07.022. [DOI] [PubMed] [Google Scholar]
- 33.Maleš J., Mihalj H., Šestak A., Kralik K., Smolić M. Osteopontin levels in patients with squamous metastatic head and neck cancer. Medicina . 2021;57(2):p. 185. doi: 10.3390/medicina57020185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chen S., Li Y., Zhu Y., et al. SERPINE1 overexpression promotes malignant progression and poor prognosis of gastric cancer. Journal of Oncology . 2022;2022:17. doi: 10.1155/2022/2647825.2647825 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wang S., Pang L., Liu Z., Meng X. SERPINE1 associated with remodeling of the tumor microenvironment in colon cancer progression: a novel therapeutic target. BMC Cancer . 2021;21(1):p. 767. doi: 10.1186/s12885-021-08536-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Huang X., Zhang F., He D., et al. Immune-related gene SERPINE1 is a novel biomarker for diffuse lower-grade gliomas via large-scale analysis. Frontiers in Oncology . 2021;11, article 646060 doi: 10.3389/fonc.2021.646060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Arroyo-Solera I., Pavón M. Á., León X., et al. Effect of serpinE1 overexpression on the primary tumor and lymph node, and lung metastases in head and neck squamous cell carcinoma. Head & Neck . 2018;41(2):429–439. doi: 10.1002/hed.25437. [DOI] [PubMed] [Google Scholar]
- 38.Yang K., Zhang S., Zhang D., et al. Identification of SERPINE1, PLAU and ACTA1 as biomarkers of head and neck squamous cell carcinoma based on integrated bioinformatics analysis. International Journal of Clinical Oncology . 2019;24(9):1030–1041. doi: 10.1007/s10147-019-01435-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang J., Chen X., Tian Y., et al. Six-gene signature for predicting survival in patients with head and neck squamous cell carcinoma. Aging (Albany NY) . 2020;12(1):767–783. doi: 10.18632/aging.102655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lee Y. C., Yu C. C., Lan C., et al. Plasminogen activator inhibitor-1 as regulator of tumor-initiating cell properties in head and neck cancers. Head & Neck . 2016;38(Suppl 1):E895–E904. doi: 10.1002/hed.24124. [DOI] [PubMed] [Google Scholar]
- 41.Morfoisse F., de Toni F., Nigri J., et al. Coordinating effect of VEGFC and oleic acid participates to tumor lymphangiogenesis. Cancers . 2021;13(12):p. 2851. doi: 10.3390/cancers13122851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wang C. A., Tsai S. J. The non-canonical role of vascular endothelial growth factor-C axis in cancer progression. Experimental Biology and Medicine . 2015;240(6):718–724. doi: 10.1177/1535370215583802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Tacconi C., Ungaro F., Correale C., et al. Activation of the VEGFC/VEGFR3 pathway induces tumor immune escape in colorectal cancer. Cancer Research . 2019;79(16):4196–4210. doi: 10.1158/0008-5472.CAN-18-3657. [DOI] [PubMed] [Google Scholar]
- 44.Qin T., Xia J., Liu S., et al. Clinical importance of VEGFC and PD-L1 co-expression in lung adenocarcinoma patients. Thoracic cancer . 2020;11(5):1139–1148. doi: 10.1111/1759-7714.13354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ma C., Xie J., Luo C., et al. OxLDL promotes lymphangiogenesis and lymphatic metastasis in gastric cancer by upregulating VEGF‑C expression and secretion. International Journal of Oncology . 2019;54(2):572–584. doi: 10.3892/ijo.2018.4648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Ceric S., Ceric T., Pojskic N., Bilalovic N., Musanovic J., Kucukalic-Selimovic E. Immunohistochemical expression and prognostic significance of VEGF-C in well-differentiated thyroid cancer. Acta Endocrinologica (Bucharest) . 2020;16(4):409–416. doi: 10.4183/aeb.2020.409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhou J., Zou Y., Hu G., et al. Facilitating colorectal cancer cell metastasis by targeted binding of long non-coding RNA ENSG00000231881 with miR-133b via VEGFC signaling pathway. Biochemical and Biophysical Research Communications . 2019;509(1):1–7. doi: 10.1016/j.bbrc.2018.08.004. [DOI] [PubMed] [Google Scholar]
- 48.Kim H. Y., Rha K. S., Shim G., et al. Podoplanin is involved in the prognosis of head and neck squamous cell carcinoma through interaction with VEGF-C. Oncology Reports . 2015;34(2):833–842. doi: 10.3892/or.2015.4070. [DOI] [PubMed] [Google Scholar]
- 49.Wang F., Peng L., Wang Y., Liu X. Silencing vascular endothelial growth factor C increases the radiosensitivity in nasopharyngeal carcinoma CNE-2 cells. Journal of Cellular Biochemistry . 2020;121(2):1182–1191. doi: 10.1002/jcb.29352. [DOI] [PubMed] [Google Scholar]
- 50.Routila J., Suvila K., Grénman R., Leivo I., Westermarck J., Ventelä S. Cancer cell line microarray as a novel screening method for identification of radioresistance biomarkers in head and neck squamous cell carcinoma. BMC Cancer . 2021;21(1):p. 868. doi: 10.1186/s12885-021-08618-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kim H. S., Kim S. C., Kim S. J., et al. Identification of a radiosensitivity signature using integrative metaanalysis of published microarray data for NCI-60 cancer cells. BMC Genomics . 2012;13(1):p. 348. doi: 10.1186/1471-2164-13-348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Dai D., Guo Y., Shui Y., Li J., Jiang B., Wei Q. Combination of radiosensitivity gene signature and PD-L1 status predicts clinical outcome of patients with locally advanced head and neck squamous cell carcinoma: a study based on the cancer genome atlas dataset. Frontiers in Molecular Biosciences . 2021;8, article 775562 doi: 10.3389/fmolb.2021.775562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kim S. I., Kang J. W., Noh J. K., et al. Gene signature for prediction of radiosensitivity in human papillomavirus-negative head and neck squamous cell carcinoma. Radiation Oncology Journal . 2020;38(2):99–108. doi: 10.3857/roj.2020.00136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Foy J. P., Bazire L., Ortiz-Cuaran S., et al. A 13-gene expression-based radioresistance score highlights the heterogeneity in the response to radiation therapy across HPV-negative HNSCC molecular subtypes. BMC Medicine . 2017;15(1):p. 165. doi: 10.1186/s12916-017-0929-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Shen Y., Liu J., Zhang L., et al. Identification of potential biomarkers and survival analysis for head and neck squamous cell carcinoma using bioinformatics strategy: a study based on TCGA and GEO datasets. BioMed Research International . 2019;2019:14. doi: 10.1155/2019/7376034.7376034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Yan L., Zhan C., Wu J., Wang S. Expression profile analysis of head and neck squamous cell carcinomas using data from The Cancer Genome Atlas. Molecular Medicine Reports . 2016;13(5):4259–4265. doi: 10.3892/mmr.2016.5054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Liu B., Su Q., Ma J., et al. Prognostic value of eight-gene signature in head and neck squamous carcinoma. Frontiers in Oncology . 2021;11, article 657002 doi: 10.3389/fonc.2021.657002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Ling D. C., Bakkenist C. J., Ferris R. L., Clump D. A. Role of immunotherapy in head and neck cancer. Seminars in Radiation Oncology . 2018;28(1):12–16. doi: 10.1016/j.semradonc.2017.08.009. [DOI] [PubMed] [Google Scholar]
- 59.Balermpas P., Michel Y., Wagenblast J., et al. Tumour-infiltrating lymphocytes predict response to definitive chemoradiotherapy in head and neck cancer. British Journal of Cancer . 2014;110(2):501–509. doi: 10.1038/bjc.2013.640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Jiang A. M., Ren M. D., Liu N., et al. Tumor mutation burden, immune cell infiltration, and construction of immune-related genes prognostic model in head and neck cancer. International Journal of Medical Sciences . 2021;18(1):226–238. doi: 10.7150/ijms.51064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Wang Z., Yuan H., Huang J., et al. Prognostic value of immune-related genes and immune cell infiltration analysis in the tumor microenvironment of head and neck squamous cell carcinoma. Head & Neck . 2021;43(1):182–197. doi: 10.1002/hed.26474. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1: validation of the expression of genes in gene module 1 in HNSC datasets. The cutoff: ∣log2 fold change (FC) | ≥1, and P < 0.01 (∗ indicates P < 0.01). Figure S2: validation of the expression of genes in gene module 2 in HNSC datasets. The cutoff: ∣log2 fold change (FC) | ≥1, and P < 0.01 (∗ indicates P < 0.01). Figure S3: overall survival (OS) analysis of genes in gene module 1 in HNSC datasets. Figure S4: overall survival (OS) analysis of genes in gene module 2 in HNSC datasets; HNSC: head and neck squamous cell carcinoma; HR: hazard ratio; TPM: transcripts per kilobase million.
Data Availability Statement
The data used to support the findings of this study are partially included in the article. And others are available from the corresponding author upon request.
