Skip to main content
Clinical and Translational Medicine logoLink to Clinical and Translational Medicine
letter
. 2020 Sep 27;10(5):e187. doi: 10.1002/ctm2.187

Tumor microenvironment characterization in head and neck squamous carcinoma reveals distinct genomic alterations and clinical outcomes

Run Shi 1, Xuanwen Bao 2, Jing Sun 1, Shun Lu 3, Claus Belka 1, Minglun Li 1,
PMCID: PMC7520081  PMID: 32997410

Dear Editor,

Head and neck squamous carcinoma (HNSC) is one of the most common malignancies worldwide. Despite recent advances in HNSC treatment, the prognosis remains unfavorable. 1 Given the poor outcomes after standard treatment in HNSC, immunotherapy, such as anti‐PD‐1 therapy, is a promising alternative. 2 However, due to the tumor heterogeneity and complexity of tumor microenvironment (TME), 3 , 4 immunotherapy benefits only a subset of HNSC patients, hence, the issue of patient selection becomes a critical challenge. Thus, a comprehensive understanding of HNSC needs to focus not only on tumor cells but also on TME. In this study, we aimed to characterize different TME landscapes by analyzing the infiltrating patterns of various TME cells, and to develop an individualized TME‐related scoring tool to predict cancer‐specific survival (CSS) in HNSC.

Normalized RNA‐seq data, somatic mutation data, and CSS information were obtained from The Cancer Genome Atlas (TCGA) portal (https://portal.gdc.cancer.gov/). xCell 5 was used to quantify the abundance of immune and stroma cells involved in TME, and the immune, stroma, and TME scores of each sample were computed, respectively. The non‐negative matrix factorization (NMF) consensus clustering was used to identify different TME clusters. Principal coordinates analysis (PCoA) was used to visualize dissimilarity of different TME clusters based on their Bray–Curtis distance. Different immune responses were quantified using a single‐sample gene set enrichment analysis (ssGSEA) 6 algorithm based on the transcriptome profiling data and corresponding gene sets retrieved from Molecular Signatures Database (MSigDB), 7 and depicted by a radar chart with Z‐score normalization. ImmuCellAI 8 was used to predict the response to immune checkpoint blockade (ICB) therapy. Using R package “maftools,” an oncoplot was generated to display the somatic mutation landscape in different TME clusters, and tumor mutational burden (TMB) was computed for each sample. The weighted gene coexpression network analysis (WGCNA) 9 was used to identify a TME‐related gene module. Metascape 10 was used to visualize the results of enrichment analysis. Subsequently, univariate and least absolute shrinkage and selection operator (LASSO) Cox regression analyses were step wisely performed to screen for the most robust prognostic genes. Finally, a formula which calculates TME‐related risk score (TMErs) was established to quantify the risk of cancer‐specific death: TMErs=iLASSOCoxCoefficient(mRNAi)×Expression(mRNAi). A total of 171 HNSC raw CEL files with CSS information produced from a same chip platform (Affymetrix HG‐U133 Plus 2.0) were downloaded from two Gene Expression Omnibus (GEO) datasets (GSE41613& GSE42743),and integrated to a singular cohort using a robust multiarray average (RMA) method, with Combat algorithm eliminating the batch effects. The prognostic capacity of TMErs was further validated in the independent GEO cohort. The Kaplan–Meier method was used to draw survival curves, and the log‐rank test was used to evaluate survival difference. The X‐tile software was used to select the most optimal cut points with the maximum of log‐rank statistics. 11 Student's t‐test, one‐way ANOVA, or chi‐square test was used to evaluate the significance, and P value less than .05 was considered statistically significant.

Based on the abundance matrix of 36 cell types involved in TME, a fan phylogram was generated to show their similarity and distance (Figure 1A), and NMF consensus clustering was performed to divide 520 TCGA HNSC samples into three clusters (C1‐3) with an optimal k value of 3 (Figure 1B). A stacked barplot depicts the distinct patterns of the relative proportion of TME cells in the three identified clusters (Figure 1C). PCoA demonstrated that 520 samples were clearly separated into three distinct TME clusters (Figure 1D). Immune, stroma, and TME scores varied markedly among the three clusters (all, < .001; Figure 1E‐G). Moreover, a radar chart demonstrated that the performances of different immune responses, such as humoral, adaptive, and innate immune responses, shrink progressively from C1 to C3 (Figure 1H). Accordingly, important inhibitory immune receptors and ligands, including PD‐1, PD‐L1, CTLA4, and TIGIT, were progressively downregulated from C1 to C3 (all, < .001; Figure 1I). Furthermore, potential responses to ICB therapy differed among the three clusters (= .002; Figure 1J). The Kaplan–Meier analysis showed that C1 exhibited best CSS, while C3 exhibited worst CSS (= .008; Figure 1K). Landscape of somatic mutations was depicted in the three clusters (Figure 1L). Summarization of top mutated genes in each cluster showed the frequency of TP53 mutation in C1 is significantly lower than C2 and C3 (Figure 1M). In addition, significant difference of TMB was observed among the three clusters (= .001; Figure 1N).

FIGURE 1.

FIGURE 1

Different genomic alterations and clinical outcomes were observed in three NMF‐identified TME clusters. (A) A fan phylogram was generated to show the similarity and distance among the 36 cell types involved in TME. (B) NMF consensus clustering was performed to divide 520 TCGA HNSC samples into three clusters. (C) A stacked barplot depicts distinct TME patterns in the three clusters. (D) PCoA analysis demonstrated the three TME clusters were clearly separated. (E‐G) Immune, stroma, and TME scores varied markedly among the three TME clusters (all, < .001). (H) A radar chart depicts the performances of different immune responses from C1 to C3. (I) Important inhibitory immune receptors and ligands, including PD‐1, PD‐L1, CTLA4, and TIGIT, were progressively downregulated from C1 to C3 (all, < .001). (J) Potential responses to ICB therapy differed significantly among the three clusters (= .002). (K) Different CSS was observed among the three clusters (= .008). (L) Landscape of somatic mutations in the three clusters. (M) Summarization of 10 top mutated genes in each cluster. (N) Significant difference of TMB was observed in the three clusters (= .001)

To quantify the risk assessment, we developed a TME‐related signature for CSS. The flowchart was presented in Figure 2A. With a power of 8 as the optimal soft threshold (Supporting information Figure S1), a total of 34 modules were generated using WGCNA, and the blue module was considered as “TME‐related module” due to its highest correlation with the TME score (r = .85, = 3e‐140; Figure 2B).Gene Ontology enrichment analysis confirmed the blue module is involved in TME‐related biological processes (Figure 2C). Volcano plot shortlisted 141 promising candidates with a threshold P of univariate Cox regression analysis less than .001 (Figure 2D). Subsequently, LASSO Cox algorithm was used to identify the most robust prognostic genes. Tenfold cross‐validation was applied to overcome overfitting effect (Supporting information Figure S2), and an optimal λ of 0.0318 was selected (Figure 2E). Finally, nine genes remained, and the distribution of their LASSO Cox coefficients iss shown in Figure 2F. TMErs was calculated for each sample, and the entire cohort was divided into three parts: TMErs‐low, TMErs‐intermediate, and TMErs‐high groups. In the TCGA training cohort, HNSC patients in different TMErs groups exhibited significant difference in CSS (< .001; Figure 2G), and the similar result was also observed in the GEO validation cohort (< .001; Figure 2H). The results of log‐rank test between different TMErs groups in TCGA and GEO cohorts are summarized in Figure 2I.

FIGURE 2.

FIGURE 2

An individualized TME‐related prognostic signature was established and validated. (A) The flowchart. (B) A total of 34 modules were generated, and the blue module was selected due to its highest correlation with the TME score (r = 0.85, = 3e‐140). (C) Gene Ontology enrichment analysis confirmed the blue module is involved in TME‐related biological processes. (D) Volcano plot shortlisted 141 promising candidates with a threshold P less than .001. (E) An optimal λ of 0.0318 was selected in the LASSO Cox algorithm. (F) LASSO Cox coefficients of the nine remaining genes. (G) In the TCGA training cohort, HNSC patients in different TMErs groups exhibited significant difference in CSS (< .001), and (H) the similar result was also observed in the GEO validation cohort (< .001). (I) Results of log‐rank tests between different TMErs groups in TCGA and GEO cohorts

In this study, we systematically evaluated the infiltrating abundance of various TME cells and compared clinical outcomes and genomic alterations in different TME clusters. Using NMF consensus clustering, 520 HNSC samples were divided into three distinct TME clusters. As well as TME score, potential response to ICB, CSS, and somatic mutation profile differed markedly among the three clusters. These findings demonstrated that the TME landscape could serve as a promising biomarker to discriminate high‐risk subset and guide patient selection for immunotherapy. To quantify the risk assessment, an individualized TME‐related prognostic signature was established, and further validated in an independent GEO HNSC cohort. The novelty of this study is the integration of TME gene coexpression network into the establishment of prognostic signature, which might increase the robustness of the present TMErs model.

The limitations should be acknowledged. First, this is a retrospective study based on public databases, thus, the sampling bias could not be completely excluded. Moreover, the sample size is relatively small. In summary, the clinical usefulness of TME should be further validated in a large prospective study.

CONFLICT OF INTEREST

The authors declare no potential conflict of interest.

Supporting information

Figure S1: A power of β = 8 was chosen as the optimal soft threshold to ensure a scale‐free co‐expression network in WGCNA.

Figure S2: In the LASSO Cox regression analysis, 10‐fold cross‐validation was applied to overcome over‐fitting effect.

ACKNOWLEDGMENTS

We greatly thank the China Scholarship Council (CSC) for supporting the work of Run Shi, Xuanwen Bao, and Jing Sun.

Run Shi and Xuanwen Bao contributed equally to this work.

REFERENCES

  • 1. Laura QM, Chow MD. Head and neck cancer. N Engl J Med. 2020;382(1):60‐72. 10.1056/NEJMra1715715. [DOI] [PubMed] [Google Scholar]
  • 2. Chow LQM, Haddad R, Gupta S, et al. Antitumor activity of pembrolizumab in biomarker‐unselected patients with recurrent and/or metastatic head and neck squamous cell carcinoma: results from the Phase IB KEYNOTE‐012 expansion cohort. J Clin Oncol. 2016;34(32):3838‐3845. 10.1200/JCO.2016.68.1478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Curry JM, Sprandio J, Cognetti D, et al. Tumor microenvironment in head and neck squamous cell carcinoma. Semin Oncol. 2014;41(2):217‐234. 10.1053/j.seminoncol.2014.03.003. [DOI] [PubMed] [Google Scholar]
  • 4. Flemming A. Tumour heterogeneity determines immune response. Nat Rev Immunol. 2019;19(11):662‐663. 10.1038/s41577-019-0230-8. [DOI] [PubMed] [Google Scholar]
  • 5. Aran D, Hu Z, Butte AJ. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 2017;18(1):220 10.1186/s13059-017-1349-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Barbie DA, Tamayo P, Boehm JS, et al. Systematic RNA interference reveals that oncogenic KRAS‐driven cancers require TBK1. Nature. 2009;462(7269):108‐112. 10.1038/nature08460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Liberzon A, Birger C, Thorvaldsdottir H, Ghandi M, Mesirov JP, Tamayo P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 2015;1(6):417‐425. 10.1016/j.cels.2015.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Miao YR, Zhang Q, Lei Q, et al. ImmuCellAI: a unique method for comprehensive T‐cell subsets abundance prediction and its application in cancer immunotherapy. Adv Sci (Weinh). 2020;7(7):1902880 10.1002/advs.201902880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Zhou Y, Zhou B, Pache L, et al. Metascape provides a biologist‐oriented resource for the analysis of systems‐level datasets. Nat Commun. 2019;10(1):1523 10.1038/s41467-019-09234-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Camp RL, Dolled‐Filhart M, Rimm DL. X‐tile: a new bio‐informatics tool for biomarker assessment and outcome‐based cut‐point optimization. Clin Cancer Res. 2004;10(21):7252‐7259. 10.1158/1078-0432.CCR-04-0713. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1: A power of β = 8 was chosen as the optimal soft threshold to ensure a scale‐free co‐expression network in WGCNA.

Figure S2: In the LASSO Cox regression analysis, 10‐fold cross‐validation was applied to overcome over‐fitting effect.


Articles from Clinical and Translational Medicine are provided here courtesy of John Wiley & Sons Australia, Ltd on behalf of Shanghai Institute of Clinical Bioinformatics

RESOURCES