Abstract
Considerable evidence indicates that autophagy plays a vital role in the biological processes of various cancers. The aim of this study is to evaluate the prognostic value of autophagy-related genes in patients with head and neck squamous cell carcinoma (HNSCC). Transcriptome expression profiles and clinical data acquired from The Cancer Genome Atlas (TCGA) database were analyzed by Cox proportional hazards model and Kaplan–Meier survival analysis to screen autophagy-related prognostic genes that were significantly correlated with HNSCC patients' overall survival. Functional enrichment analyses were performed to explore biological functions of differentially expressed autophagy-related genes (ARGs) identified in HNSCC patients. Six ARGs (EGFR, HSPB8, PRKN, CDKN2A, FADD, and ITGA3) identified with significantly prognostic values for HNSCC were used to construct a risk signature that could stratify patients into the high-risk and low-risk groups. This signature demonstrated great value in predicting prognosis for HNSCC patients and was indicated as an independent prognostic factor in terms of clinicopathological characteristics (sex, age, clinical stage, histological grade, anatomic subdivision, alcohol history, smoking status, HPV status, and mutational status of the samples). The prognostic signature was also validated by data from the Gene Expression Omnibus (GEO) database and International Cancer Genome Consortium (ICGC). In conclusion, this study provides a novel autophagy-related gene signature for predicting prognosis of HNSCC patients and gives molecular insights of autophagy in HNSCC.
1. Introduction
Head and neck squamous cell carcinomas include several types of malignancies from the oral cavity, pharynx, and larynx. Each year, over 600,000 cases are diagnosed worldwide, making HNSCC the sixth most prevalent cancer [1, 2]. Studies have shown the risk factors related to HNSCC are tobacco use, alcohol consumption, and human papillomavirus (HPV) infection. The typical treatments include surgery, radiotherapy, and chemotherapy. However, the mortality rate of HNSCC remains high, which results in 380,000 death annually mainly because of local recurrence and distant metastasis [3, 4]. The current plight is partly on account of the lack of accurate and reliable biomarkers for prognosis prediction in the early stage of this disease [5]. Therefore, it is quite essential to explore an effective and specific signature to help severity assessment and guide decision-making in clinical practice.
Autophagy is an important process which selects and degrades dysfunctional organelles, microbes, and proteins by lysosomes to maintain cellular homeostasis and sustain metabolism. Aberrant expression of autophagy-related genes (ARGs) is closely related to multiple diseases, especially neurodegenerative disorder, inflammatory abnormality, and cancer [6]. Autophagy plays a multifaceted role in tumor initiation and progression in respect of microenvironment stress, nutritional supplement, and immune status [7, 8]. Research suggests that autophagy-related mechanism is an optimistic target for future cancer therapy [9]; thus, exploring the valuable knowledge of autophagy is in urgent need and can offer huge potential in long term.
In this study, we comprehensively analyzed the correlation between expression profiles of ARGs and clinical features of HNSCC patients from The Cancer Genome Atlas (TCGA) database and constructed a 6-ARG risk signature. This model was identified to be an independent prognostic signature for HNSCC patients. Functional analysis was applied to reveal more information of ARGs. These findings could provide novel biomarkers for predicting the survival of HNSCC patients and give new insights in personalized therapy.
2. Materials and Methods
2.1. Data Acquisition and Preparation
RNA sequencing data of 545 HNSCC specimens, simple nucleotide variation data of 506 HNSCC specimens, and corresponding clinical information of 528 patients were downloaded from TCGA database (https://cancergenome.nih.gov/). A total of 232 ARGs were obtained from the Human Autophagy Database (HADb; http://autophagy.lu/clustering/index.html). We extracted autophagy-related gene expression data using Practical Extraction and Report Language (Perl; https://www.perl.org/). The clinical data was combined with corresponding gene expression profiles by unique ID of the patients. Samples without complete survival information or with overall survival time less than 7 days were excluded. Consequently, the data of 495 patients were retained for gene signature construction and further analyses. Three independent microarray HNSCC cohorts obtained from the GEO database (GSE41613, n = 97; GSE117973, n = 77) and ICGC (ORCA-IN, n = 40) were used as the testing groups. Gene expression data from the GSE117973 dataset was normalized using the zero-mean normalization method implemented in the Sklearn library with Python programming language [10].
2.2. Identification of Differentially Expressed ARGs and Functional Annotation
R programming language (version 3.6.2) was utilized as data analysis and plotting tool throughout this research. The R package limma was used to select differentially expressed ARGs between HNSCC and nontumor samples. After normalization by the formula log2(x + 1) transformed, 38 differentially expressed ARGs were identified with log fold change (FC) > 1 or <-1 and adjusted p value < 0.05. To investigate biological functions of the selected ARGs, gene ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were performed by R software and p < 0.05 was considered to be of statistically significant difference. The GOplot package was applied to visualize the enrichment terms.
2.3. Construction of the Prognostic Signature
Univariate Cox regression analysis was performed in the training set (TCGA cohort) to evaluate the association between differentially expressed ARGs and overall survival (OS) of HNSCC patients. ARGs with p value < 0.05 were considered to be statistically significant for OS and were identified as candidate genes. In order to minimize partial likelihood deviance and prevent overfitting of the model, we performed the least absolute shrinkage and selection operator (LASSO) Cox regression analysis and selected the most effective prognostic ARGs. The selected ARGs were subjected to the multivariate Cox regression analysis to further screen genes which were capable of predicting the prognosis independently. Subsequently, a risk model composed of several ARGs was established using glmnet and survival R packages.
2.4. Validation of ARG-Based Risk Model
The risk score of each patient was developed by a linear combination of the expression level of genes multiplied regression coefficients calculated by the Cox regression model. HNSCC patients were assigned into the high-risk or low-risk groups according to corresponding risk scores, using the median score as a cutting-off point. A Kaplan–Meier survival curve was plotted to estimate the difference between the two groups using the log-rank test. Meanwhile, the time-dependent receiver operating characteristic (ROC) analysis was employed to evaluate the efficiency of the prognostic signature using the survivalROC package, and the respective area under the ROC curve (AUC) was assessed for predicting accuracy.
To verify if the prognostic ARGs signature could be an independent indicator for predicting the OS of patients with HNSCC, the multivariate Cox regression analysis was performed. Patients were separated into different groups according to sex, age, clinical stage, histological grade, anatomic subdivision, alcohol history, smoking status, HPV status, and tumor mutation burden (TMB) value. These clinicopathological characteristics and the risk score served as covariates and ROC curves were also plotted to evaluate the value in predicting prognosis. Kaplan–Meier survival analysis was also performed in the testing sets from GEO and ICGC cohorts to validate the efficacy of our risk signature. P value < 0.05 was regarded as significant.
3. Results
3.1. Differentially Expressed Autophagy-Related Genes
A flowchart was provided to demonstrate the study design and analysis (Figure 1). The RNA-seq profiles obtained from public database TCGA consisted of 501 HNSCC tissue samples and 40 nontumor samples. After extracting the expression data of 232 autophagy-related genes in HNSCC patients,10 downregulated genes (NRG2, NRG3, MAP1LC3C, PRKN, HSPB8, CCL2, FOS, TP53INP2, PTK6, and NKX2-3) and 28 upregulated genes (EIF4EBP1, BAK1, RGS19, HIF1A, CXCR4, CTSL, VMP1, SPNS1, TNFSF10, TP63, BID, VEGFA, SPHK1, EGFR, SERPINA1, DDIT3, EIF2AK2, ITGB4, ITGA3, APOL1, IRGM, BIRC5, FADD, ITGA6, IFNG, NRG1, IL24, and CDKN2A) were determined. Scatter plots displayed the expression patterns of differentially expressed ARGs between tumor and nontumor tissues (Figure 2).
Figure 1.

A schematic flowchart of this study.
Figure 2.

Differentially expressed autophagy-related genes (ARGs). (a) The heat maps of the 38 differently expressed ARGs. The red color indicates high gene expression while the green color indicates low gene expression. N indicates nontumor tissues; T indicates tumor tissues. (b) The volcano plot for the 232 ARGs from TCGA database. Red indicates high expression, and green indicates low expression. Black indicates that those genes show no difference between HNSCC and paired nontumor tissues. (c) The boxplot of the differentially expressed ARGs.
3.2. Functional Enrichment Analysis of the Differentially Expressed ARGs
Functional enrichment analysis was conducted to provide an in-depth understanding of biological functions of the 38 differentially expressed ARGs. The GO term functional enrichment and the KEGG pathway enrichment of these genes are summarized in Figures 3 and 4. The top enriched GO terms for biological processes were autophagy, process utilizing autophagic mechanism, neuron death, and regulation of apoptotic signaling pathway. Cellular components had integrin complex, autophagosome, protein complex involved in cell adhesion, and autophagosome membrane. Based on molecular function, genes were mostly enriched in terms of receptor ligand activity, cytokine activity, cytokine receptor binding, ubiquitin protein ligase binding, and ubiquitin-like protein ligase binding (Figure 3). The KEGG pathway enrichment analysis revealed that these genes were notably associated with pathways in apoptosis, human cytomegalovirus infection, and human papillomavirus infection. Most of the Z-scores of enriched pathways were more than zero, indicating that most of the pathways were more likely to be enhanced (Figure 4).
Figure 3.

The barplot and heat map of gene ontology (GO) enrichment analysis. (a) The relationship between enriched GO terms and differentially expressed autophagy-related genes. BP indicates biological process; CC indicates cellular component; MF indicates molecular function. (b) The color of each block depends on the logFC value of the specified gene.
Figure 4.

The bubble plot and circle plot of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. (a) The relationship between KEGG pathways and differentially expressed autophagy-related genes. (b) The outer circle shows a scatter plot for each term of the logFC value of the specified gene. Red circles display upregulation, and blue means downregulation. The higher the Z-score indicates, the higher expression of the enriched pathway.
3.3. Identification of an Autophagy-Related Gene Signature for the Prognosis of HNSCC
A total of 495 eligible HNSCC patients based on TCGA database were involved in this study, and the baseline of clinical characteristics was presented (Table 1). To investigate differentially expressed ARGs for clinical prognosis, univariate Cox regression analysis was performed and 9 ARGs with p value < 0.05 were selected from the training set (Table 2). The candidate genes were passed on for LASSO regression. However, none of them were removed, suggesting that all 9 ARGs may provide specific information for prognosis (Figure S1). Subsequently, a multivariate Cox analysis was conducted, and six genes were identified to develop the prognostic signature (Table 3). The prognostic model was established based on 6 ARG expression levels and coefficients using the following formula: prognosis index (PI) = (0.0017∗EGFR) + (−0.0067∗HSPB8) + (0.6109∗PRKN) + (−0.0144∗CDKN2A) + (0.0075∗FADD) + (0.0046∗ITGA3).
Table 1.
Clinical characteristics of head and neck squamous cell carcinoma patients from TCGA in this study.
| Features | Alive | Dead | Total |
|---|---|---|---|
| (n = 281) | (n = 214) | (n = 495) | |
| Gender | |||
| Female | 64 | 68 | 132 |
| Male | 217 | 146 | 363 |
| Age | |||
| Mean (SD) | 59.44 (11.2) | 62.97 (12.4) | 60.97 (11.9) |
| Median [min, max] | 60 [19, 85] | 63 [24, 88] | 61 [19, 88] |
| Anatomic subd | |||
| Oral cavity | 179 | 149 | 328 |
| Other | 102 | 65 | 167 |
| Clinical stage | |||
| Stage I | 21 | 4 | 25 |
| Stage II | 50 | 30 | 80 |
| Stage III | 55 | 35 | 90 |
| Stage IVA | 150 | 134 | 284 |
| Stage IVB | 5 | 8 | 13 |
| Stage IVC | 0 | 3 | 3 |
| Grade | |||
| G1 | 39 | 22 | 61 |
| G2 | 166 | 131 | 297 |
| G3 | 63 | 53 | 116 |
| G4 | 2 | 0 | 2 |
| GX | 9 | 7 | 16 |
| NA | 2 | 1 | 3 |
| Alcohol history | |||
| Yes | 192 | 138 | 330 |
| No | 83 | 71 | 154 |
| NA | 6 | 5 | 11 |
| Smoking status | |||
| Never | 67 | 42 | 109 |
| Former/current | 212 | 164 | 376 |
| NA | 2 | 8 | 10 |
SD: standard deviation; sub: subdivision.
Table 2.
The results of univariate Cox analysis.
| Gene | HR | HR.95L | HR.95H | p value |
|---|---|---|---|---|
| CXCR4 | 0.990886 | 0.98339 | 0.998439 | 0.018111 |
| EGFR | 1.001677 | 1.000173 | 1.003184 | 0.028858 |
| HSPB8 | 0.993644 | 0.987676 | 0.999648 | 0.03803 |
| PRKN | 1.496081 | 1.114563 | 2.008194 | 0.007317 |
| NKX2-3 | 0.889201 | 0.808232 | 0.978281 | 0.01592 |
| CDKN2A | 0.982237 | 0.97227 | 0.992306 | 0.000573 |
| CTSL | 1.004316 | 1.000584 | 1.008062 | 0.023384 |
| FADD | 1.011174 | 1.004614 | 1.017777 | 0.000819 |
| ITGA3 | 1.004492 | 1.001521 | 1.007472 | 0.00302 |
HR: hazard ratio; L: low confidence interval; H: high confidence interval.
Table 3.
The results of multivariate Cox analysis.
| Gene | Coef | HR | HR.95L | HR.95H | p value |
|---|---|---|---|---|---|
| EGFR | 0.0017 | 1.0017 | 1.0001 | 1.0033 | 0.0397 |
| HSPB8 | -0.0067 | 0.9933 | 0.9876 | 0.9991 | 0.0241 |
| PRKN | 0.6109 | 1.8421 | 1.4074 | 2.4109 | 0.0214 |
| CDKN2A | -0.0144 | 0.9857 | 0.9754 | 0.9960 | 0.0066 |
| FADD | 0.0075 | 1.0075 | 1.0008 | 1.0142 | 0.0277 |
| ITGA3 | 0.0046 | 1.0047 | 1.0016 | 1.0077 | 0.0027 |
Coef: coefficient; HR: hazard ratio; L: low confidence interval; H, high confidence interval.
Based on the median expression value of PI, patients in the training cohort were stratified into the high-risk and low-risk groups with specific cut-off point at 1.023. Survival analysis indicated that patients in the high-risk group had a significantly worse prognosis (Figure 5(a)). The results of ROC analysis revealed that this PI model had good accuracy and efficiency in predicting the 1-, 3-, and 5-year survival with respective areas under the ROC curve (AUCs) which were 0.663, 0.686, and 0.622 (Figure 5(b)). With an increasing risk score came shorter overall survival and more death events (Figures 5(c) and 5(d)). The heat map was also performed to evaluate the relationship between risk score and respective gene expression level in this signature (Figure 5(e)). EGFR, FADD, and ITGA3 were upregulated in the high-risk group (Figures 6(a), 6(e), and 6(f)), which indicated they acted as risk factors, while HSPB8 and CDKN2A were discovered to be downregulated in the high-risk group (Figures 6(b) and 6(d)), indicating that they were protective factors for HNSCC. However, we did not observe a significant difference of PRKN expression between the high-risk and low-risk groups (Figure 6(c)). Kaplan–Meier survival analysis of the 6 ARGs demonstrated similar results. The high expression of EGFR, FADD, and ITGA3 was significantly correlated with inferior overall survival of HNSCC patients (Figures 7(a), 7(e), and 7(f)). In contrast, the downregulation of HSPB8 and CDKN2A indicated inferior OS (Figures 7(b) and 7(d)). Also, there was no significant difference in OS concerning PRKN expression level (Figure 7(c)). Since each gene in this risk model played a different role in predicting prognosis, there may not be of statistical difference for every single gene in above analysis.
Figure 5.

(a) The Kaplan–Meier plot of the overall survival (OS) for high-risk and low-risk patient cohorts divided by the 6-ARGs signature in the training set (N = 495). The OS differences are determined by the two-sided log-rank test. (b) Receiver operating characteristic analysis for the 6-ARG signature in predicting the patients of 1, 3, and 5 years OS in the training set. (c) The distribution of the risk score. Each dot represents a specific patient. The color red indicates high-risk level, and green indicates low-risk level. (d) The distribution of patients' survival status. Each dot represents a specific patient. The color red indicates dead, and green indicates alive. (e) The heat map displays the relationship between risk score and respective gene expression level of this signature in the training set.
Figure 6.

Different expression of the six key genes between the high-risk group and low-risk group.
Figure 7.

The correlation between the six genes included in the signature and HNSCC patients' overall survival. Kaplan–Meier plots demonstrate results from the analysis of correlation between each gene expression level and OS, all using the best separation.
3.4. Correlation between the Autophagy-Related Gene Signature and Clinicopathologic Characteristics in HNSCC Patients
Considering that clinical parameters may influence the performance of the risk signature, the patients were also grouped by sex, age, clinical stage, histological grade, anatomic subdivision, alcohol history, and smoking status and to explore their association with the 6-ARG signature. Patients with incomplete grade, alcohol history, and smoking information were removed, and 457 samples were retained for this procedure. The results suggested that the signature was significantly correlated with clinical stage (p = 0.037, Figure 8(a)). Besides, the 6 ARGs displayed different expression with respect to various clinicopathologic features. Different expression of CDKN2A was found across different sexes, ages, grades, and anatomic subdivisions (Figures 8(b)–8(e)). HSPB8 was expressed differentially with different anatomic sites and alcohol histories (Figures 8(f) and 8(g)). Higher PRKN expression was observed in male and stage III and IV HNSCC patients (Figures 8(h) and 8(i)), and higher FADD expression was found in patients with higher clinical stages and tobacco use (Figures 8(j) and 8(k)). ITGA3 also showed different expression across different sexes, anatomic subdivisions, and smoking statuses (Figures 8(l)–8(n)). Since HPV has been revealed to have some impacts on tumor progression in HNSCC [11], we also took HPV status into analysis. Because many patients in TCGA cohort were lack of HPV status information, only 90 samples were selected. As shown in Figure 9, higher expression levels of EGFR and ITGA3 were discovered in patients with HPV-negative HNSCC, while higher CDKN2A expression was found in patients with HPV-positive HNSCC.
Figure 8.

(a) The 6-ARG signature in the cohort stratified by clinical stages. (b–e) CDKN2A expression in the cohorts stratified by sexes, ages, grades, and anatomic subdivisions. (f, g) HSPB8 expression in the cohorts stratified by anatomic sites and alcohol histories. (h, i) PRKN expression in the cohorts stratified by sexes and clinical stages. (j, k) FADD expression in the cohorts stratified by clinical stages and smoking statuses. (l–n) ITGA3 expression in the cohorts stratified by sexes, anatomic subdivisions, and smoking statuses.
Figure 9.

(a) EGFR expression in the cohorts stratified by HPV status. (b) CDKN2A expression in the cohorts stratified by HPV status. (c) ITGA3 expression in the cohorts stratified by HPV status.
3.5. The Autophagy-Related Gene Signature Is an Independent Prognostic Indicator in Patients with HNSCC
To identify other possible contributors on overall survival, the patients in different clinical subgroups were used for univariate and multivariate Cox regression analyses. Tumor mutation burden (TMB) has been shown to correlate with cancer aggressiveness and immunotherapeutic response [12, 13] and thus was also included in this procedure. 450 HNSCC patients with available TMB information were stratified in the high/low TMB group based on median TMB value and were selected for further study. The results suggested that the autophagy-related gene signature could independently predict prognosis of HNSCC (Figures 10(a) and 10(b)). It was also noticed that age and clinical stage were also significantly associated with patient survival. However, ROC analysis demonstrated that age (AUC = 0.586) and stage (AUC = 0.554) alone were not reliable indicators while risk score signature (AUC = 0.665) kept a stable and reliable performance (Figure 10(c)). Likewise, we brought HPV status into Cox regression analysis and 88 samples were selected. The results also indicated that the risk signature was an independent prognostic factor for HNSCC patients with even better predicting accuracy (AUC = 0.709, Figure 11). Besides, we identified the most frequently mutated genes in patients with HNSCC (Figure 12(a)). Researches have indicated TP53 to be the most commonly mutated gene in HNSCC [14, 15], which was also confirmed in our study. Since mutational status of the samples might influence the performance of the risk signature, the patients were grouped by mutational status of TP53. The high-risk groups showed significantly shorter OS in both HNSCC patients with TP53 mutation and patients without TP53 mutation (Figures 12(b) and 12(c)), indicating the independence of the risk signature once again. In conclusion, the 6-ARG signature could be applied as an independent prognostic indicator for HNSCC in clinical practice.
Figure 10.

The forest plots of univariate (a) and multivariate (b) Cox analyses display the correlation of different indexes and overall survival of HNSCC patients. (c) Receiver operating characteristic analysis for risk signature and clinicopathological features in predicting HNSCC patients' OS in TCGA cohort.
Figure 11.

The forest plots of univariate (a) and multivariate (b) Cox analyses display the correlation of different indexes and overall survival of HNSCC patients with available HPV status information. (c) Receiver operating characteristic analysis for risk signature and clinicopathological features in predicting HNSCC patients' OS.
Figure 12.

(a) Top ten genes with the highest mutation rates in HNSCC patients from TCGA database. (b) The results of Kaplan–Meier analysis in HNSCC patients with TP53 mutation. (c) The results of Kaplan–Meier analysis in HNSCC patients without TP53 mutation. The OS differences are determined by the two-sided log-rank test.
3.6. Validation of the Autophagy-Related Gene Signature in Independent HNSCC Cohorts
Three independent datasets GSE41613, GSE117973, and ORCA-IN (ICGC) were used as external validation groups. The risk score for each patient was calculated using the same PI formula. The patients were stratified into the high-risk and low-risk groups based on the median risk score. We performed Kaplan–Meier survival analysis in the three testing groups, which confirmed the prognostic value of our gene signature. As expected, patients in the high-risk groups from GSE41613 and ORCA-IN datasets showed significantly inferior overall survival (Figures 13(a) and 13(c)). The survival information of GSE117973 dataset was progression-free survival (PFS). Similarly, the high-risk HNSCC patients showed inferior PFS (Figure 13(e)). A good prognostic ability was also observed based on the time-dependent ROC analysis for the GSE41613 cohort in 1-, 3-, and 5-year follow-ups with respective AUC values which were 0.613, 0.671, and 0.631 (Figure 13(b)) and GSE117973 cohort in 1-, 3-, and 5-year follow-ups with respective AUC values which were 0.654, 0.66, and 0.604 (Figure 13(f)). Since there were only a limited number of samples acquired from ICGC and the follow-up information of most patients was restricted within 2 years, we just demonstrated the ROC curve of the 1-year follow-up (AUC = 0.838, Figure 13(d)), which suggested great predicting ability of this signature for HNSCC patients' prognosis once again. Therefore, the 6-ARG-based risk signature was proved to show great competence in predicting prognosis of HNSCC.
Figure 13.

The Kaplan–Meier curves of the overall survival (OS) for high-risk and low-risk patient cohorts divided by the 6-ARG signature in the GEO dataset GSE41613 (a) and ICGC dataset ORCA-IN (c). (e) The Kaplan–Meier curves of the progression-free survival (PFS) for high-risk and low-risk patient cohorts divided by the 6-ARG signature in GSE117973. Receiver operating characteristic analysis in the GEO datasets GSE41613 (b) and GSE117973 (f) and the ICGC dataset ORCA-IN (d).
4. Discussion
HNSCC is one of the most prevailing and life-threatening cancers worldwide. Despite considerable improvements in diagnosis and treatment, the survival rate of HNSCC remains low. Besides, treatment is accompanied by significant long-term toxicity and morbidity [16]. Numerous studies have shown that autophagy plays a dual role in occurrence and progression of tumors. On one hand, autophagy promotes genomic stability and anticancer immunosurveillance in tumor suppression. On the other hand, it boosts high metabolic activity to sustain tumor cell survival and executives cytoprotective function under environmental stress in tumor progression [17]. Recent studies have indicated that various autophagy-related genes and corresponding protein expression profiles are related to clinicopathological features or prognosis of OSCC, including LC3, BECN1, ATG16L1, ATG9A, SQSTM1, and ATG5 [18]. However, most research mainly focuses on one particular gene related to autophagy.
The large-scale databases, such as TCGA and GEO, provide us with abundant information and effective measures to explore gene signatures. In this study, we screened autophagy-related genes and identified six key prognostic genes, which could be potential biomarkers and therapeutic targets. The 6-ARG signature was capable of distinguishing patients of different risk and prognosis, which was validated in multiple datasets and was proved to own great prognostic ability, thus providing us with a novel insight in HNSCC. The GO and KEGG analyses were performed to exploit the molecular and biological information of in-depth ARGs. The results of both functional analyses revealed that the top enriched terms were closely related to autophagy. It was also reported that autophagy contributed to membrane trafficking and signaling pathways in a diverse way [6]. Moreover, there was an increase of enriched KEGG pathways in human cytomegalovirus infection (HCMV) and human papillomavirus infection (HPV), indicating the interplay between autophagy and the immune microenvironment. Previous observations have indicated that HPV-associated HNSCC patients tended to demonstrate improved survival compared to patients with HPV-negative HNSCC. Integration of the HPV genome into the host's genome could generate remarkable downstream consequences in immune response, especially an enhanced infiltration of immune cells and inflammatory cytokines in the HPV-positive tumor microenvironment [19, 20]. This was consistent with our results, because the risk factors in the gene signature EGFR and ITGA3 were downregulated while the protective factor CDKN2A was upregulated in HPV-positive HNSCC patients. However, our study did not observe a significant correlation between HPV and prognosis of HNSCC patients. The reasons may be that HNSCC were a heterogeneous group of malignancies and HPV was detected in 25.9% of all HNSCC, mainly the oropharyngeal squamous cell cancer subtypes [21, 22].What is more, there were only a limited number of HNSCC patients equipped with available HPV status information from TCGA database that may also affect the results. HCMV was regarded as a factor for tumorigenesis and has been revealed to have connection with several cancers including malignant glioma, cervical carcinoma, Kaposi's sarcoma, and breast cancer [23]. Some of the HCMV gene products and proteins were proved to accelerate cancer progression via certain pathways, including suppression of the local immune response against tumors and promotion of cell apoptosis [24].
The Cox regression survival analysis helped us to identify six key prognostic ARGs and construct a prognostic signature, which could be an independent indicator for HNSCC patients' OS. We observed a decreasing expression level of HSPB8 and PRKN and an increasing expression level of EGFR, CDKN2A, FADD, and ITGA3 in tumor samples. Conversely and interestingly, PRKN was upregulated while CDKN2A was downregulated in the high-risk group. According to the results, we speculate that the performance of autophagy is not immutable and autophagy plays a complicated role in various kinds of tumors [25].
Indeed, four genes involved in this prognostic signature have been shown to be significantly correlated with carcinogenesis of HNSCC. The epidermal growth factor receptor (EGFR) is a cell surface receptor member of the ErbB family, which has been extensively studied in HNSCC. Activation of EGFR leads to the promotion of proliferation, differentiation, antiapoptotic signaling, angiogenesis, and metastatic spread and thus is highly correlated with tumor progression [26, 27]. EGFR is reported to be overexpressed in more than 90% of HNSCC and is an independent prognostic indicator which is associated with increased tumor size, shorter progression-free survival, and decreased overall survival [28–30]. This has made EGFR a promising therapeutic target and led to the development of monoclonal antibodies and tyrosine kinase inhibitors in anti-EGFR therapy [31]. Currently, the EGFR monoclonal antibody cetuximab is the only target drug that has been approved by the US FDA in the treatment for HNSCC [32].CDKN2A, a tumor-suppressor gene which encodes the cell cycle regulator p16INK4a, has been extensively investigated to be associated with HNSCC. Global genomic analyses suggest CDKN2A being one of the most commonly altered genes in HNSCC and noncoding mutations of CDKN2A correlates with worse overall survival in HNSCC patients [33, 34]. The results of immunohistochemistry suggest that oropharyngeal tumors with p16INK4a protein expression demonstrate better prognosis than p16-negative oropharyngeal tumors [35]. Meanwhile, HPV-positive patients with p16INK4a expression have better response to chemotherapy and radiotherapy compared with HPV-negative patients, during the standard multimodality treatments [36]. FADD is originally described as an adapter molecule mediating apoptosis, which has been newly revealed to engage in cell survival and tumor development [37, 38]. High FADD expression has been identified frequently in HNSCC (>30%) and is related to a higher incidence of lymph node metastasis and shorter distant metastasis-free internal [39]. Moreover, pharmacologic modulation of FADD is found to be effective and promising in HNSCC patients [40]. ITGA3 is a member of the integrin family and serves as a cell surface adhesion protein [41]. Research has demonstrated that knockdown of gene ITGA3 inhibits HNSCC cancer cell migration and invasion. Furthermore, high expression of ITGA3 indicates poorer survival of patients with HNSCC [42, 43].
So far, existing research also has revealed some mechanism of the other two identified ARGs, though their interaction with HNSCC is rudimentary and inconclusive. PRKN, also named as PARK2 or PARKIN, is an E3 ubiquitin ligase that functions in mitophagy and xenophagy. Biological studies demonstrate that PRKN possesses both prosurvival and growth suppressive capacities and its function as a tumor promoter or suppressor is highly dependent on cancer subtype and context [44, 45]. PRKN has been found to be affected in tumor microenvironmental signaling networks with likely loss of expression in HNSCC tumor samples [46]. Decreased expression of PRKN has also been identified in oropharyngeal squamous cell carcinoma (OPSCC) samples compared with paired normal tissues [47], which is consistent with our findings. But how PRKN behaves in tumor progression especially in HNSCC is still indefinite.
HSPB8 belongs to the ubiquitous small heat shock protein (sHSP) family, which cooperates with BAG3 and stimulate autophagic flux to seek out damaged cellular components for autophagy. Although there have been no reports concerning HNSCC and HSPB8, this gene plays an important role in some diseases. It is reported that mutation of HSPB8 is linked to neurodegenerative disorders [48]. In addition, the decreased expression of HSPB8 has been shown to lead to an increased number of cells resting in the G0/G1 phase and reduce the migratory ability of MCF-7 cells and thus is engaged in regulating cell cycle and cell migration in MCF-7 cells of breast cancer [49]. Hence, the relationship between these prognostic ARGs and HNSCC needs further research in the future.
However, there are some limitations in this study. First, this is a retrospective study, which emphasizes on data mining and data analysis. Further experiments are needed to validate these findings. Second, there is currently no other research that investigates autophagy-related gene signature for HNSCC, suggesting that we are unable to validate our findings in another independent study. Therefore, we encourage multicenter data to confirm our results.
5. Conclusions
In summary, we identify a six-autophagy-related gene signature that can independently predict prognosis of HNSCC and help to distinguish high-risk patients. The six identified genes provide new insights into underlying molecular mechanisms of HNSCC and can be utilized as promising therapeutic targets. Further studies are expected to verify the clinical application and explore optimal treatment strategies for HNSCC.
Acknowledgments
This study was funded by the National Natural Science Foundation of China (31670951 and 31971240) and the Project of the Science and Technology Department in Sichuan Province (2017JY0228).
Data Availability
The data used to support the findings of this study are available from TCGA database (https://cancergenome.nih.gov/), the Gene Expression Omnibus database (https://www.ncbi.nlm.nih.gov/gds/), International Cancer Genome Consortium (https://dcc.icgc.org/releases/current/Projects/ORCA-IN), and the Human Autophagy Database (HADb; http://autophagy.lu/clustering/index.html).
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Supplementary Materials
Figure S1: the results of the least absolute shrinkage and selection operator regression. (a) The tuning parameter (lambda) selection in the least absolute shrinkage and selection operator (LASSO) model using 10-fold cross-validation via minimum criteria. Dotted vertical lines are drawn at the optimal values using the minimum criteria and the 1 standard error of the minimum criteria (the 1-SE criteria). (b) LASSO coefficient profiles of the 9-survival-related autophagy-related genes.
References
- 1.Global Burden of Disease Cancer Collaboration, Fitzmaurice C., Abate D., et al. Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 29 cancer groups, 1990 to 2017: a systematic analysis for the global burden of disease study. JAMA Oncology. 2019;5(12):1749–1768. doi: 10.1001/jamaoncol.2019.2996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Siegel R. L., Miller K. D., Jemal A. Cancer statistics, 2016. CA: a Cancer Journal for Clinicians. 2016;66(1):7–30. doi: 10.3322/caac.21332. [DOI] [PubMed] [Google Scholar]
- 3.Polanska H., Raudenska M., Gumulec J., et al. Clinical significance of head and neck squamous cell cancer biomarkers. Oral Oncology. 2014;50(3):168–177. doi: 10.1016/j.oraloncology.2013.12.008. [DOI] [PubMed] [Google Scholar]
- 4.Pulte D., Brenner H. Changes in survival in head and neck cancers in the late 20th and early 21st century: a period analysis. The Oncologist. 2010;15(9):994–1001. doi: 10.1634/theoncologist.2009-0289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Argiris A., Karamouzis M. V., Raben D., Ferris R. L. Head and neck cancer. The Lancet. 2008;371(9625):1695–1709. doi: 10.1016/S0140-6736(08)60728-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Levine B., Kroemer G. Biological functions of autophagy genes: a disease perspective. Cell. 2019;176(1-2):11–42. doi: 10.1016/j.cell.2018.09.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Amaravadi R., Kimmelman A. C., White E. Recent insights into the function of autophagy in cancer. Genes & Development. 2016;30(17):1913–1930. doi: 10.1101/gad.287524.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dower C. M., Wills C. A., Frisch S. M., Wang H. G. Mechanisms and context underlying the role of autophagy in cancer metastasis. Autophagy. 2018;14(7):1110–1128. doi: 10.1080/15548627.2018.1450020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Peixoto P., Grandvallet C., Feugeas J.-P., Guittaut M., Hervouet E. Epigenetic control of autophagy in cancer cells: a key process for cancer-related phenotypes. Cells. 2019;8(12):p. 1656. doi: 10.3390/cells8121656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Pedregosa F., Varoquaux G., Gramfort A., et al. Scikit-learn: machine learning in python. Journal of Machine Learning Research. 2011;12:2825–2830. [Google Scholar]
- 11.D'Souza G., Dempsey A. The role of HPV in head and neck cancer and review of the HPV vaccine. Preventive Medicine. 2011;53(Supplement 1):S5–S11. doi: 10.1016/j.ypmed.2011.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Goodman A. M., Kato S., Bazhenova L., et al. Tumor mutational burden as an independent predictor of response to immunotherapy in diverse cancers. Molecular Cancer Therapeutics. 2017;16(11):2598–2608. doi: 10.1158/1535-7163.MCT-17-0386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Klebanov N., Artomov M., Goggins W. B., Daly E., Daly M. J., Tsao H. Burden of unique and low prevalence somatic mutations correlates with cancer survival. Scientific Reports. 2019;9(1):p. 4848. doi: 10.1038/s41598-019-41015-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Stransky N., Egloff A. M., Tward A. D., et al. The mutational landscape of head and neck squamous cell carcinoma. Science. 2011;333(6046):1157–1160. doi: 10.1126/science.1208130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhou G., Liu Z., Myers J. N. TP53 mutations in head and neck squamous cell carcinoma and their impact on disease progression and treatment response. Journal of Cellular Biochemistry. 2016;117(12):2682–2692. doi: 10.1002/jcb.25592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Miyauchi S., Kim S. S., Pang J., et al. Immune modulation of head and neck squamous cell carcinoma and the tumor microenvironment by conventional therapeutics. Clinical Cancer Research. 2019;25(14):4211–4223. doi: 10.1158/1078-0432.CCR-18-0871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Galluzzi L., Pietrocola F., Bravo-San Pedro J. M., et al. Autophagy in malignant transformation and cancer progression. The EMBO Journal. 2015;34(7):856–880. doi: 10.15252/embj.201490784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Tan Y.-Q., Zhang J., Zhou G. Autophagy and its implication in human oral diseases. Autophagy. 2017;13(2):225–236. doi: 10.1080/15548627.2016.1234563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ang K. K., Harris J., Wheeler R., et al. Human papillomavirus and survival of patients with oropharyngeal cancer. The New England Journal of Medicine. 2010;363(1):24–35. doi: 10.1056/NEJMoa0912217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Koneva L. A., Zhang Y., Virani S., et al. HPV integration in HNSCC correlates with survival outcomes, immune response signatures, and candidate drivers. Molecular Cancer Research. 2018;16(1):90–102. doi: 10.1158/1541-7786.MCR-17-0153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kreimer A. R., Clifford G. M., Boyle P., Franceschi S. Human papillomavirus types in head and neck squamous cell carcinomas worldwide: a systematic review. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2005;14(2):467–475. doi: 10.1158/1055-9965.EPI-04-0551. [DOI] [PubMed] [Google Scholar]
- 22.von Witzleben A., Wang C., Laban S., Savelyeva N., Ottensmeier C. H. HNSCC: tumour antigens and their targeting by immunotherapy. Cells. 2020;9(9):p. 2103. doi: 10.3390/cells9092103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bai B., Wang X., Chen E., Zhu H. Human cytomegalovirus infection and colorectal cancer risk: a meta-analysis. Oncotarget. 2016;7(47):76735–76742. doi: 10.18632/oncotarget.12523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Teo W. H., Chen H.-P., Huang J. C., Chan Y. J. Human cytomegalovirus infection enhances cell proliferation, migration and upregulation of EMT markers in colorectal cancer-derived stem cell-like cells. International Journal of Oncology. 2017;51(5):1415–1426. doi: 10.3892/ijo.2017.4135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Koustas E., Sarantis P., Kyriakopoulou G., Papavassiliou A. G., Karamouzis M. V. The interplay of autophagy and tumor microenvironment in colorectal cancer—ways of enhancing immunotherapy action. Cancers. 2019;11(4):p. 533. doi: 10.3390/cancers11040533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Bossi P., Resteghini C., Paielli N., Licitra L., Pilotti S., Perrone F. Prognostic and predictive value of EGFR in head and neck squamous cell carcinoma. Oncotarget. 2016;7(45):74362–74379. doi: 10.18632/oncotarget.11413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Normanno N., de Luca A., Bianco C., et al. Epidermal growth factor receptor (EGFR) signaling in cancer. Gene. 2006;366(1):2–16. doi: 10.1016/j.gene.2005.10.018. [DOI] [PubMed] [Google Scholar]
- 28.Wheeler S., Siwak D. R., Chai R., et al. Tumor epidermal growth factor receptor and EGFR PY1068 are independent prognostic indicators for head and neck squamous cell carcinoma. Clinical Cancer Research. 2012;18(8):2278–2289. doi: 10.1158/1078-0432.CCR-11-1593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Molinolo A. A., Amornphimoltham P., Squarize C. H., Castilho R. M., Patel V., Gutkind J. S. Dysregulated molecular networks in head and neck carcinogenesis. Oral Oncology. 2009;45(4-5):324–334. doi: 10.1016/j.oraloncology.2008.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Boeckx C., Baay M., Wouters A., et al. Anti-epidermal growth factor receptor therapy in head and neck squamous cell carcinoma: focus on potential molecular mechanisms of drug resistance. The Oncologist. 2013;18(7):850–864. doi: 10.1634/theoncologist.2013-0013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sacco A. G., Worden F. P. Molecularly targeted therapy for the treatment of head and neck cancer: a review of the ErbB family inhibitors. Oncotargets and Therapy. 2016;9:1927–1943. doi: 10.2147/OTT.S93720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cohen E. E. W., Soulières D., le Tourneau C., et al. Pembrolizumab versus methotrexate, docetaxel, or cetuximab for recurrent or metastatic head-and-neck squamous cell carcinoma (KEYNOTE-040): a randomised, open-label, phase 3 study. The Lancet. 2019;393(10167):156–167. doi: 10.1016/S0140-6736(18)31999-8. [DOI] [PubMed] [Google Scholar]
- 33.The Cancer Genome Atlas Network. Comprehensive genomic characterization of head and neck squamous cell carcinomas. Nature. 2015;517(7536):576–582. doi: 10.1038/nature14129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Rothenberg S. M., Ellisen L. W. The molecular pathogenesis of head and neck squamous cell carcinoma. The Journal of Clinical Investigation. 2012;122(6):1951–1957. doi: 10.1172/JCI59889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Salazar C. R., Anayannis N., Smith R. V., et al. Combined P16 and human papillomavirus testing predicts head and neck cancer survival. International Journal of Cancer. 2014;135(10):2404–2412. doi: 10.1002/ijc.28876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Jiang X., Ye J., Dong Z., Hu S., Xiao M. Novel genetic alterations and their impact on target therapy response in head and neck squamous cell carcinoma. Cancer Management and Research. 2019;11:1321–1336. doi: 10.2147/CMAR.S187780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Tourneur L., Chiocchia G. FADD: a regulator of life and death. Trends in Immunology. 2010;31(7):260–269. doi: 10.1016/j.it.2010.05.005. [DOI] [PubMed] [Google Scholar]
- 38.Reddy R. B., Bhat A. R., James B. L., et al. Meta-analyses of microarray datasets identifies ANO1 and FADD as prognostic markers of head and neck cancer. PLoS One. 2016;11(1, article e0147409) doi: 10.1371/journal.pone.0147409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Pattje W. J., Melchers L. J., Slagter-Menkema L., et al. FADD expression is associated with regional and distant metastasis in squamous cell carcinoma of the head and neck. Histopathology. 2013;63(2):263–270. doi: 10.1111/his.12174. [DOI] [PubMed] [Google Scholar]
- 40.Eytan D. F., Snow G. E., Carlson S., et al. SMAC mimetic Birinapant plus radiation eradicates human head and neck cancers with genomic amplifications of cell death genes FADD and BIRC2. Cancer Research. 2016;76(18):5442–5454. doi: 10.1158/0008-5472.CAN-15-3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Pinon P., Wehrle-Haller B. Integrins: versatile receptors controlling melanocyte adhesion, migration and proliferation. Pigment Cell & Melanoma Research. 2011;24(2):282–294. doi: 10.1111/j.1755-148X.2010.00806.x. [DOI] [PubMed] [Google Scholar]
- 42.Koshizuka K., Hanazawa T., Kikkawa N., et al. Regulation ofITGA3by the anti‐tumormiR‐199family inhibits cancer cell migration and invasion in head and neck cancer. Cancer Science. 2017;108(8):1681–1692. doi: 10.1111/cas.13298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Koshizuka K., Nohata N., Hanazawa T., et al. Deep sequencing-based microRNA expression signatures in head and neck squamous cell carcinoma: dual strands of pre-miR-150 as antitumor miRNAs. Oncotarget. 2017;8(18):30288–30304. doi: 10.18632/oncotarget.16327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Bernardini J. P., Lazarou M., Dewson G. Parkin and mitophagy in cancer. Oncogene. 2017;36(10):1315–1327. doi: 10.1038/onc.2016.302. [DOI] [PubMed] [Google Scholar]
- 45.Xu L., Lin D. C., Yin D., Koeffler H. P. An emerging role of PARK2 in cancer. Journal of Molecular Medicine (Berlin, Germany) 2014;92(1):31–42. doi: 10.1007/s00109-013-1107-0. [DOI] [PubMed] [Google Scholar]
- 46.Bebek G., Orloff M., Eng C. Microenvironmental genomic alterations reveal signaling networks for head and neck squamous cell carcinoma. Journal of Clinical Bioinformatics. 2011;1(1):p. 21. doi: 10.1186/2043-9113-1-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gao G., Kasperbauer J. L., Tombers N. M., Wang V., Mayer K., Smith D. I. A selected group of large common fragile site genes have decreased expression in oropharyngeal squamous cell carcinomas. Genes, Chromosomes & Cancer. 2014;53(5):392–401. doi: 10.1002/gcc.22150. [DOI] [PubMed] [Google Scholar]
- 48.Li F., Xiao H., Hu Z., Zhou F., Yang B. Exploring the multifaceted roles of heat shock protein B8 (HSPB8) in diseases. European Journal of Cell Biology. 2018;97(3):216–229. doi: 10.1016/j.ejcb.2018.03.003. [DOI] [PubMed] [Google Scholar]
- 49.Piccolella M., Crippa V., Cristofani R., et al. The small heat shock protein B8 (HSPB8) modulates proliferation and migration of breast cancer cells. Oncotarget. 2017;8(6):10400–10415. doi: 10.18632/oncotarget.14422. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1: the results of the least absolute shrinkage and selection operator regression. (a) The tuning parameter (lambda) selection in the least absolute shrinkage and selection operator (LASSO) model using 10-fold cross-validation via minimum criteria. Dotted vertical lines are drawn at the optimal values using the minimum criteria and the 1 standard error of the minimum criteria (the 1-SE criteria). (b) LASSO coefficient profiles of the 9-survival-related autophagy-related genes.
Data Availability Statement
The data used to support the findings of this study are available from TCGA database (https://cancergenome.nih.gov/), the Gene Expression Omnibus database (https://www.ncbi.nlm.nih.gov/gds/), International Cancer Genome Consortium (https://dcc.icgc.org/releases/current/Projects/ORCA-IN), and the Human Autophagy Database (HADb; http://autophagy.lu/clustering/index.html).
