Abstract
Background
Immune checkpoint inhibitor (ICI) therapy has revolutionized the treatment of many cancers. However, the limited population that benefits from ICI therapy makes it necessary to screen predictive biomarkers for stratifying patients. Currently, many biomarkers, such as tumor mutational burden (TMB), have been used in the clinic as indicative biomarkers. However, some high-TMB patients with mutations in genes that are closely related to immunotherapeutic resistance are not sensitive to ICI therapy. Thus, there is a need to move beyond TMB and identify specific genetic determinants of the response to ICI therapy. In this study, we established a comprehensive mutation-based gene set across different tumor types to predict the efficacy of ICI therapy.
Methods
We constructed and validated a mutational signature to predict the prognosis of patients treated with ICI therapy. Then, the underlying immune response landscapes of different subtypes were investigated with multidimensional data.
Results
This study included genomic and clinical data for 12,647 patients. An eleven-gene mutation-based gene set was generated to divide patients into a high-risk group and a low-risk group in a training cohort (1572 patients with 9 types of cancers who were treated with ICI therapy). Validation was performed in a validation cohort (932 patients with 5 types of cancers who were treated with ICI therapy). Mutations in these 11 genes were associated with a better response to ICI therapy. In addition, the mutation-based gene set was demonstrated to be an independent prognostic factor after ICI therapy. We further explored the role of the immune context in determining the benefits of immunotherapy in 10,143 patients with 33 types of cancers and found distinct immune landscapes for the high- and low-risk groups.
Conclusions
The mutation-based gene set developed in this study can be used to reliably predict survival benefit across cancers in patients receiving ICI therapy. The close interplay between the extrinsic and intrinsic immune landscapes in the identified patient subgroups and the subgroups’ differing responses to ICI therapy could guide immunotherapy treatment decisions for cancer patients.
Supplementary Information
The online version contains supplementary material available at 10.1186/s13073-022-01024-y.
Keywords: Tumor microenvironment, Biomarker, Immune response, Immune checkpoint inhibitor, Immunotherapy
Background
The treatment landscapes of different cancer types have changed based on developments in the field of immuno-oncology. Immune checkpoint inhibitor (ICI) therapy, which includes antibodies targeting cytotoxic T lymphocyte-associated protein-4 (CTLA-4), programmed death-1 (PD-1), and programmed death ligand-1 (PD-L1), offers significant clinical benefits for patients with many types of cancer [1–3]. In the PD-1/PD-L1 pathway, binding of the PD-1 receptor on T cells to PD-L1 on antigen-presenting cells and tumor cells limits or halts the T cell response by downregulating cytokine production, effector function, and T cell proliferation. However, only 20–40% of patients treated with PD-1/PD-L1 blockade therapy show a response, whereas most do not, and the determinants of the response remain elusive. In addition, treatment discontinuation in nonresponders is often delayed due to difficulty in interpreting imaging results [4]. Therefore, the selection of patients is vital, and it is a substantial challenge to identify reliable markers to rapidly predict a sustainable response.
Tumor-infiltrating immune cells play a crucial role in patient prognosis and cancer treatment efficacy [5–8]. In the tumor microenvironment, the composition of immune cells is related to cancer heterogeneity and creates complexity that is interesting but challenging when studying the dynamic interactions between cancer and immune cells [8]. IFN-γ is a crucial cytokine produced by natural killer (NK) cells and activated T cells [9], and loss of sensitivity to IFN-γ induction can result in resistance to immunotherapy [6, 10]. Many excellent models and targets for predicting the response to ICI therapy have been developed recently. For example, Jiang et al. developed a tumor immune dysfunction and exclusion (TIDE) score, a method that uses gene expression profiles for calculations to predict the response to ICI therapy. TIDE evaluates two different tumor immune escape mechanisms: the prevention of T cell infiltration in cancers with low cytotoxic T lymphocyte (CTL) levels and the induction of T cell dysfunction in cancers with high CTL infiltration [10, 11]. This method can predict the outcomes of cancer patients treated with ICI therapy more accurately than other biomarkers, such as mutational load and PD-L1 levels. Moreover, Shi et al. discovered that MAN2A1 loss renders cancer cells more susceptible to T cell-mediated killing and that inhibition of MAN2A1 enhances the immune response to anti-PD-L1 [12]. In addition to transcriptomics data, other types of omics data can be assessed to predict the efficacy of ICI therapy. Kumar et al. found that suppression of CARM1, an epigenetic enzyme and cotranscriptional activator, facilitates immunotherapy for resistant tumors through dual effects on cancer cells and cytotoxic T cells [13]. Genomic profiling also represents an emerging approach for predicting the response to immunotherapy. 
Based on exome analysis of tumors from pembrolizumab-treated patients, the best responses to PD-1 blockade occurred in tumors with a high tumor mutational burden (TMB) [14]. Indeed, TMB has been shown to be a strong marker of the response to front-line treatment with nivolumab together with ipilimumab in patients with advanced non-small-cell lung cancer (NSCLC) [15]. However, due to the limited number of high-quality DNA samples, the availability of tissue samples, the need for bioinformatics analyses, the lack of a standardized panel and cutoff values, and the high cost, it is difficult to implement whole-exome sequencing or next-generation sequencing panels in routine clinical practice.
Furthermore, tumors with a comparably high TMB show variable responses, indicating that additional factors may contribute to the response to ICIs [16]. Mutations in genes involved in antigen presentation and in interferon-receptor signaling pathways, such as B2M and JAK1/2, have been shown to be related to acquired resistance to ICIs, and JAK1/2 mutations have also been found to result in primary anti-PD-1 resistance [17]. Although the TMB may be high, patients carrying these mutations usually have a poor response to ICIs. There is thus a need to move beyond the TMB and identify specific genetic determinants of the response to PD-1 inhibitors [18].
POLE and POLD1 mutations have been proposed as biomarkers for immunotherapy outcomes across multiple cancer types [19]. However, there has not yet been a comprehensive exploration of factors related to prognosis after immunotherapy at the genomic level. In this study, we conducted a pancancer genomic analysis to identify a powerful signature for predicting the clinical benefit for ICI-treated patients. We further investigated the role of the immune context in determining the benefit of immunotherapy.
Methods
Study population
Mutation data and clinical information for the training and validation cohorts were obtained from the cBioPortal database (https://www.cbioportal.org) and the literature [20–28]. The predictive model was first constructed based on the training cohort, which consisted of 1572 patients with 9 types of cancers who received ICI treatment (Additional file 2: Fig. S1) and was then validated in the independent validation cohort consisting of 932 patients with 5 types of cancers who received ICI treatment (Additional file 2: Fig. S1) [20, 23]. Additional file 2: Fig. S1 summarizes the sample selection process. Specifically, in the training cohort from Samstein et al. [20], both mutation profiles and clinical data were available for 1661 patients. Next, cancer types with only one case (n = 1) and cancer of unknown primary type (n = 88) were excluded; 1572 cases remained. In the validation cohort, both mutation profiles and clinical data were available for 144 patients from the cohort of Liu et al. [21], 274 patients from the IMvigor210 cohort reported by Mariathasan et al. [22], 249 patients from the cohort of Miao et al. [23], 35 patients from the cohort of Miao et al. [24], 68 patients from the cohort of Riaz et al. [25], 38 patients from the cohort of Hugo et al. [26], 110 patients from the cohort of Van Allen et al. [27], and 64 patients from the cohort of Snyder et al. [29]. Notably, in the Miao cohort (n = 249) [23], cancer types with only one case (n = 3) were excluded, with only 246 cases remaining. In the Hugo cohort (n = 38) [26], one patient was excluded because of a lack of overall survival data; 37 cases remained. In addition, 46 cases from the Snyder cohort (n = 64) and Miao cohort (n = 249) were duplicates, and 46 cases from the Snyder cohort were excluded [23, 29]. The clinical data for each sample used in the analysis are shown in Additional file 1: Tables S1-S3.
Samples from the training cohort were sequenced using the Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT) panel, which was designed for targeted sequencing of 468 tumor-suppressor genes, oncogenes, and members of pathways considered actionable for targeted therapies and authorized by the US FDA [20]. Samples from the validation cohort were sequenced using WES, except for the IMvigor210 cohort reported by Mariathasan et al., which was sequenced with the FoundationOne panel, a US FDA-authorized panel [22]. This study included all nonsynonymous mutations, including missense, frame-shift, nonsense, nonstop, splice site, and translation start site mutations [19]. The primary clinical outcomes were overall survival (OS) and clinical benefit, which was categorized as durable clinical benefit (DCB) (complete response [CR]/partial response [PR] or stable disease [SD] that lasted > 6 months) or no durable benefit (NDB) (progression of disease [PD] or SD that lasted ≤ 6 months) [30]. In the training cohort, no response data were provided by Samstein et al., and we extracted the response data for some of those patients from Janjigian et al. [31] and Rizvi et al. [30]. In the validation cohort, data on the response to ICI therapy were obtained from Hugo et al. [26], Liu et al. [21], Mariathasan et al. [22], Miao et al. [23, 24], Riaz et al. [25], and Van Allen et al. [27]. In the training and validation cohorts, OS was defined as the time from the date of the first ICI therapy to the time of the last follow-up or death. For samples sequenced by WES, the TMB was defined as the total number of nonsynonymous mutations divided by the exome size (38 Mb was utilized as the exome size). For samples sequenced with the MSK-IMPACT panel or FoundationOne panel, the TMB was obtained from the respective studies.
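The two definitions above (WES-based TMB and the DCB/NDB grouping) can be illustrated with a minimal sketch. The function names are hypothetical, and the sketch assumes that the 6-month duration criterion applies only to stable disease; it is not the authors' code.

```python
from typing import Optional

EXOME_SIZE_MB = 38.0  # exome size stated in the text for WES samples


def tmb_from_wes(n_nonsynonymous: int, exome_size_mb: float = EXOME_SIZE_MB) -> float:
    """TMB as nonsynonymous mutations per megabase (hypothetical helper)."""
    if n_nonsynonymous < 0:
        raise ValueError("mutation count must be non-negative")
    return n_nonsynonymous / exome_size_mb


def clinical_benefit(best_response: str, sd_months: Optional[float] = None) -> str:
    """Map a best response to DCB or NDB.

    Assumption: CR and PR count as DCB regardless of duration, and only
    SD is split at 6 months (> 6 months is DCB, <= 6 months is NDB).
    """
    response = best_response.upper()
    if response in {"CR", "PR"}:
        return "DCB"
    if response == "PD":
        return "NDB"
    if response == "SD":
        if sd_months is None:
            raise ValueError("SD requires a duration in months")
        return "DCB" if sd_months > 6 else "NDB"
    raise ValueError(f"unknown response category: {best_response}")
```

For example, a WES sample with 380 nonsynonymous mutations would have a TMB of 10 mutations/Mb under this definition.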
In the cohort from TCGA, mutation profiles (sequenced by WES), copy number variation (CNV) data, and mRNA expression profiles for 10143 patients with 33 cancer types, as acquired from the PanCancer Atlas consortium (https://gdc.cancer.gov/about-data/publications/pancanatlas), were employed to explore differences in genomic patterns between the identified subtypes [32].
Propensity score matching (PSM) weighting algorithm
PSM is a statistical method used to adjust for confounding factors in observational studies and is widely applied in the social sciences, economics, and clinical practice [33]. In contrast to pair matching, PSM weighting can improve balance and estimation efficiency and enables the inclusion of all subjects by weighting them such that each contributes to the estimation [34]. We used the PSM method in this study to balance potentially confounding factors, including age, drug type, and cancer type, between the mutant and wild-type status of each gene in the MSK-IMPACT panel. Briefly, we first calculated the propensity score using logistic regression, with the mutation status of a given gene as the dependent variable, and we then used the PSM weighting scheme to assign a continuous weight to each sample based on its propensity score to achieve balance [34]. When the standardized difference of the weighted propensity scores between the mutant and wild-type groups was less than 10%, we considered the clinical characteristics to be balanced between the propensity score-weighted samples. We then compared survival between the mutant and wild-type samples by supplying the weights to a multivariate Cox regression. Genes with a P value < 0.05 and adjusted P < 0.1 were considered to have a significant effect on prognosis and were selected for further analysis. Statistical significance was confirmed by randomly shuffling the mutation labels of the samples and repeating the above process 100 times [34]; the number of significant features obtained from the permuted data was then compared with that obtained from the observed data.
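The weighting and balance check described above can be sketched as follows. The text does not give the exact weight formula, so this sketch assumes the matching-weight form min(e, 1 − e) / (z·e + (1 − z)(1 − e)), where e is the propensity score and z is the mutation status; the function names and the toy data are illustrative only.

```python
import math


def matching_weight(propensity, is_mutant):
    """Matching weight for one sample (assumed form, see lead-in)."""
    if not 0.0 < propensity < 1.0:
        raise ValueError("propensity score must lie strictly between 0 and 1")
    own_group_prob = propensity if is_mutant else 1.0 - propensity
    return min(propensity, 1.0 - propensity) / own_group_prob


def _weighted_mean(values, weights):
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def _weighted_variance(values, weights):
    mean = _weighted_mean(values, weights)
    return _weighted_mean([(v - mean) ** 2 for v in values], weights)


def weighted_standardized_difference(values, weights, is_mutant):
    """Absolute standardized difference between mutant and wild-type groups."""
    mut = [(v, w) for v, w, z in zip(values, weights, is_mutant) if z]
    wt = [(v, w) for v, w, z in zip(values, weights, is_mutant) if not z]
    mut_v, mut_w = zip(*mut)
    wt_v, wt_w = zip(*wt)
    diff = _weighted_mean(mut_v, mut_w) - _weighted_mean(wt_v, wt_w)
    pooled_sd = math.sqrt(
        (_weighted_variance(mut_v, mut_w) + _weighted_variance(wt_v, wt_w)) / 2.0
    )
    return 0.0 if pooled_sd == 0.0 else abs(diff) / pooled_sd


# Toy data: propensity scores and mutation status for seven samples.
scores = [0.2, 0.2, 0.2, 0.2, 0.2, 0.5, 0.5]
mutant = [True, False, False, False, False, True, False]
weights = [matching_weight(e, z) for e, z in zip(scores, mutant)]

# Balance criterion from the text: standardized difference below 10%.
balanced = weighted_standardized_difference(scores, weights, mutant) < 0.10
```

In this toy example the unweighted groups differ markedly in their propensity scores, whereas the weighted groups satisfy the 10% criterion.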
Generation and validation of the mutation-based gene set
In the training cohort, PSM analysis, Lasso-penalized Cox regression analysis, and multivariate Cox regression analysis were employed to screen prognostic genes and construct a mutation-based gene set. First, a gene was considered significant when the P value was < 0.05 in the PSM analysis, which was performed as described above. Second, we applied Lasso-penalized Cox regression using the “glmnet” R package (version: 4.0-2) to avoid overfitting, reduce multicollinearity, and further select the key prognostic genes [35, 36]; prognostic genes were subselected by shrinking the regression coefficients through a penalty proportional to their size [37]. Tenfold cross-validation was performed to define the optimal value of the lambda penalty parameter; as a result, the weights of most potential prognostic genes shrank to zero, and a relatively small number of prognostic genes with nonzero weights remained. For the Lasso-penalized Cox regression analysis, we resampled the dataset with replacement 1000 times and selected the prognostic genes that had nonzero coefficients in more than 990 of the 1000 resamples [38]. Third, multivariate Cox regression analysis was used to construct a mutation-based gene set with the “survival” R package (version: 3.2-3). The risk score can be estimated from the Cox model as follows:
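$$\text{Risk score} = \exp\left(\sum_{i=1}^{p} b_i \left(X_i - \bar{X}_i\right)\right)$$
The coefficients ($b_1, b_2, \dots, b_p$) measure the impact (i.e., the effect size) of covariates.
$X_i$ is the value of the ith covariate from the subjects.
$\bar{X}_i$ is the mean value of the ith covariate.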
X-tile 3.6.1 software was used to determine the best cutoff for classifying patients into low- and high-risk score groups [39]. The cutoff was defined as the risk score that generated the largest value of χ2 in the Mantel–Cox test [40]. Finally, the same formula and cutoff were applied for the validation and TCGA cohorts.
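As a minimal sketch of how the risk score and grouping could be computed, the snippet below exponentiates the centered linear predictor of a Cox model and applies a cutoff. The gene names, coefficients, and mean values are hypothetical placeholders, not the fitted values from this study. The default cutoff of 1.07 is the value reported in the Results, and treating scores strictly above the cutoff as high risk is an assumption.

```python
import math


def risk_score(status, coefficients, means):
    """exp(sum_i b_i * (X_i - mean_i)) for one patient.

    status: gene -> 1 (mutant) or 0 (wild-type)
    coefficients: gene -> Cox coefficient b_i
    means: gene -> mean mutation status in the training cohort
    """
    linear_predictor = sum(
        coefficients[gene] * (status[gene] - means[gene]) for gene in coefficients
    )
    return math.exp(linear_predictor)


def risk_group(score, cutoff=1.07):
    """Assumption: scores strictly above the cutoff are high risk."""
    return "high" if score > cutoff else "low"


# Hypothetical two-gene model, for illustration only.
coefs = {"GENE_A": 0.5, "GENE_B": -0.4}
means = {"GENE_A": 0.1, "GENE_B": 0.2}

wild_type_patient = {"GENE_A": 0, "GENE_B": 0}
gene_a_mutant_patient = {"GENE_A": 1, "GENE_B": 0}

group_wt = risk_group(risk_score(wild_type_patient, coefs, means))
group_mut = risk_group(risk_score(gene_a_mutant_patient, coefs, means))
```

Because the covariates are centered on the training-cohort means, the same coefficients, means, and cutoff would need to be reused unchanged when scoring other cohorts.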
Generation and validation of the nomogram
As convenient and reliable tools, nomograms are widely used to predict specific outcomes in clinical oncology; they quantitatively predict prognosis for individual patients using known critical predictive factors and provide the estimated probability of a clinical outcome such as survival [41]. A calibration curve was used to evaluate the agreement between the actual and predicted survival probabilities [42].
Evaluation of immune infiltration with CIBERSORT
CIBERSORT is a deconvolution algorithm that is based on gene expression and applies support vector regression to infer cell type proportions in data from bulk cancer samples of mixed cell types [43]. The proportions of 22 types of infiltrating immune cells were estimated via the CIBERSORT method based on normalized gene expression data. CIBERSORT immune infiltration proportions were obtained from the pancancer immune landscape project conducted by Thorsson et al. [44].
TIL fraction, leukocyte fraction and lymphocyte fraction analyses
In the cohort from TCGA, the levels of TILs from genomics evaluation and those of TILs from H&E-stained image evaluation were evaluated by analyzing the data from Thorsson et al. and Saltz et al., respectively [44, 45]. Saltz et al. presented global mappings of TILs for over 5000 H&E-stained diagnostic whole-slide images from TCGA by using deep learning-based lymphocyte classification with convolutional neural networks (CNNs), representing a benchmark for TIL analysis. Genomics evaluation of the TIL fraction was carried out by multiplying an aggregated proportion of the lymphocyte fraction in the immune compartment assessed by the CIBERSORT approach with the leukocyte fraction derived from DNA methylation. The lymphocyte fraction is an aggregation of CIBERSORT estimates of T regulatory cells, follicular helper T cells, naïve, resting and activated memory CD4 T cells, naïve and memory B cells, plasma cells, activated and resting NK cells, CD8 T cells, and gamma-delta T cells.
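The genomics-based TIL fraction described above is a simple product, which the following sketch illustrates. The dictionary keys are hypothetical labels for the twelve CIBERSORT lymphocyte populations listed in the text and may not match the column names in the source data.

```python
# Hypothetical keys for the lymphocyte populations listed in the text.
LYMPHOCYTE_TYPES = (
    "t_regulatory",
    "t_follicular_helper",
    "cd4_naive",
    "cd4_memory_resting",
    "cd4_memory_activated",
    "b_naive",
    "b_memory",
    "plasma_cells",
    "nk_activated",
    "nk_resting",
    "cd8_t",
    "gamma_delta_t",
)


def lymphocyte_fraction(cibersort):
    """Sum of the lymphocyte proportions within the immune compartment."""
    return sum(cibersort.get(cell, 0.0) for cell in LYMPHOCYTE_TYPES)


def til_fraction(cibersort, leukocyte_fraction):
    """Lymphocyte share of the immune compartment times the leukocyte fraction."""
    if not 0.0 <= leukocyte_fraction <= 1.0:
        raise ValueError("leukocyte fraction must be between 0 and 1")
    return lymphocyte_fraction(cibersort) * leukocyte_fraction


# Toy sample: half of the immune compartment is lymphocytes,
# and 40% of the tumor sample is leukocytes.
sample = {"cd8_t": 0.3, "b_naive": 0.2, "macrophages_m2": 0.5}
til = til_fraction(sample, leukocyte_fraction=0.4)
```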
The immune infiltration scores from Danaher et al.
The immune infiltration scores were extracted from a previous TCGA pancancer study conducted by Danaher et al. [46]. The immune cell scores were estimated from the expression levels of 60 specific marker genes that classify 14 immune cell populations: total TILs, B cells, DCs, macrophages, exhausted CD8 T cells, CD8 T cells, neutrophils, cytotoxic cells, Tregs, NK CD56dim cells, mast cells, NK cells, and Th1 cells. These results were highly reproducible and concordant with those obtained by immunohistochemistry and flow cytometry.
Immune signature evaluation
Twenty-nine classical immune signatures were acquired from He et al. (Additional file 1: Table S4) [47]. We used the “GSVA” R package (version: 1.34.0) based on the single-sample gene set enrichment analysis (ssGSEA) method to quantify the enrichment levels of the twenty-nine immune signatures in each sample [48].
Immunogenomic indicator calculation
Immunogenomic indicators were obtained from the pancancer immune landscape project conducted by Thorsson et al. [44]. In brief, the intratumoral heterogeneity (ITH) score was defined as the subclonal genome fraction (which measures the fraction of the tumor genome that is not part of the “plurality” clone), as determined by ABSOLUTE, which models tumor copy number alterations and mutations as mixtures of subclonal and clonal components of varying ploidy. The copy number burden scores n_segs and frac_altered (“number of segments” and “fraction altered”, respectively) represent the total number of segments in each sample’s copy number profile and the fraction of bases that deviate from the baseline ploidy, respectively. Aneuploidy scores were defined as the sum total of the amplified or deleted (collectively, “altered”) arms. TCR diversity scores (Shannon entropy and richness) and BCR diversity scores (Shannon entropy and richness) were inferred from cancer RNA-seq data.
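The two diversity measures named above can be illustrated from a list of clonotype read counts. This is a generic sketch: the use of the natural logarithm and the input format are assumptions, and the scores used in this study were taken directly from Thorsson et al.

```python
import math


def richness(clonotype_counts):
    """Number of distinct clonotypes observed at least once."""
    return sum(1 for count in clonotype_counts if count > 0)


def shannon_entropy(clonotype_counts):
    """Shannon entropy H = -sum(p * ln p) over clonotype frequencies."""
    observed = [count for count in clonotype_counts if count > 0]
    total = sum(observed)
    if total == 0:
        raise ValueError("at least one clonotype must have a positive count")
    return -sum((c / total) * math.log(c / total) for c in observed)
```

A repertoire dominated by one clonotype has an entropy near zero, whereas two equally abundant clonotypes give an entropy of ln 2.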
Cytolytic activity score
The cytolytic activity score (CYT) was defined as the geometric mean of granzyme A (GZMA) and perforin 1 (PRF1) expression [49].
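As a worked illustration of this definition, the geometric mean of two expression values is the square root of their product. The function name is hypothetical, and the expression unit (for example, TPM) is an assumption not stated here.

```python
import math


def cytolytic_activity(gzma_expression, prf1_expression):
    """Geometric mean of GZMA and PRF1 expression for one sample."""
    if gzma_expression < 0 or prf1_expression < 0:
        raise ValueError("expression values must be non-negative")
    return math.sqrt(gzma_expression * prf1_expression)
```

For instance, expression values of 4 and 9 give a CYT of 6.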
Deciphering mutational signatures in the genome
The “MutationalPatterns” R package (version: 1.12.0) was applied to perform nonnegative matrix factorization (NMF) analysis of mutations stratified by 96 trinucleotide contexts in pancancer specimens from TCGA. The extracted mutational portrait was compared against the Catalogue of Somatic Mutations in Cancer (COSMIC) by cosine similarity.
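The comparison step can be illustrated with a plain cosine similarity between two mutation-count vectors. In practice each vector would have 96 entries, one per trinucleotide context; this sketch accepts any pair of equal-length vectors and is not the MutationalPatterns implementation.

```python
import math


def cosine_similarity(profile_a, profile_b):
    """Cosine of the angle between two mutational profiles."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must have the same number of contexts")
    dot = sum(a * b for a, b in zip(profile_a, profile_b))
    norm_a = math.sqrt(sum(a * a for a in profile_a))
    norm_b = math.sqrt(sum(b * b for b in profile_b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("profiles must contain at least one nonzero entry")
    return dot / (norm_a * norm_b)
```

Because the measure depends only on the shape of the profile, an extracted signature and a reference signature that differ only in scale have a similarity of 1.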
Enrichment scores of oncogenic pathways
Ten canonical oncogenic pathways containing 187 oncogenes were obtained from the study conducted by Sanchez-Vega et al. [50]. Enrichment scores for each pathway in each sample were determined by the ssGSEA approach applying the “GSVA” R package [48].
Copy number variation analysis
Significant deletion or amplification events in regions of the genome were investigated with GISTIC 2.0, a revised computational program that identifies somatic copy number alterations by investigating the amplitude and frequency of observed events [51].
Functional enrichment analysis
Functional enrichment analysis and clustering of the identified biological processes were conducted using the “clusterProfiler” R package (version: 3.14.3) [52].
Statistical analysis
Associations between the mutation-based gene set and OS were analyzed via the Kaplan–Meier method; survival curves were compared via the log-rank test. C-indexes were determined to compare the accuracy of the mutation-based gene set with that of the risk factors [53]. Statistical analysis for comparisons between two groups was conducted using the Wilcoxon test. R software (version 3.6.3) was applied to perform all statistical analyses, and P values were two-tailed. A P value < 0.05 was considered to indicate significance.
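For readers unfamiliar with the C-index, the following sketch computes Harrell's concordance index for right-censored data. It is a simplified illustration, not the R routine used in the study: pairs with tied survival times are skipped, and tied risk scores count as half concordant.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C-index.

    times: follow-up times
    events: 1 if death was observed, 0 if censored
    risk_scores: higher values should indicate shorter survival
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when the patient with the shorter
            # follow-up time had an observed event.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering of patients by risk.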
Results
Identification of a mutation-based gene set for predicting immunotherapy outcomes
Samples from the training cohort were sequenced using an MSK-IMPACT panel including 468 genes [20]. To remove confounding effects (including effects of age, drug type and cancer type), a PSM weighting algorithm was adopted to study survival differences between carriers of mutant and wild-type variants of these 468 genes (Fig. 1A). Additional file 2: Fig. S2 summarizes the analysis process used for this study. We calculated the propensity scores, “reweighted” the samples in the training cohort, and compared the survival differences between mutant and wild-type status for the 468 genes. As a result, 98 gene mutations were found to be significantly related to OS (P < 0.05 and adjusted P < 0.1). Lasso-penalized Cox regression analysis was used to further select important genes. Eleven genes with a nonzero occurrence frequency of more than 990 times of a total of 1000 repetitions were obtained (Additional file 1: Table S5) [38]. Finally, we quantified a risk score for each patient on the basis of the eleven-gene mutation-based gene set through multivariate Cox regression analysis:
In this formula, exp denotes the exponential function, mutant gene status equals 1, and wild-type gene status equals 0. X-tile software was used to generate an optimal cutoff value (1.07) to divide patients into groups with high and low risk scores [40]. X-tile identified this cutoff automatically as the risk score that generated the largest value of χ2 in the Mantel–Cox test. In addition, we rescored patients using a 9-gene set (the 11-gene set excluding ROS1 and PTPRT) and found that the cutoff automatically identified by X-tile for the 9-gene set was 0.74. Thus, the 1.07 cutoff used in our study is specific to the 11-gene set.
Patients in the group with high risk scores had a shorter OS than those in the group with low risk scores (P < 0.001; HR, 2.394; 95% CI, 2.035–2.817) (Fig. 1B). The AUC of the mutation-based gene set in the training cohort was 0.751 at 3 years and 0.831 at 5 years (Additional file 2: Fig. S3A). The AUC of each cancer type in the training cohort was also calculated (Additional file 2: Fig. S3C). We investigated whether the mutation-based gene set is restricted to specific groups or applicable to different populations. Subgroup analyses indicated that the mutation-based gene set was significantly associated with OS in patients treated with ICI therapy, regardless of age (Fig. 1C), drug type (Fig. 1D), or cancer type (Fig. 1E). The results of the subgroup analysis are in good agreement with those of PSM. Considering that TMB is a good marker for predicting the efficacy of immunotherapy, we performed a stratified analysis of TMB [54, 55]. We found that the mutation-based gene set predicted prognosis well in both the TMB-high and TMB-low groups (Additional file 2: Fig. S4A and B). In the training cohort, the survival time of patients with BRAF mutation (median OS: 47.0 months) was significantly longer than that of those with wild-type BRAF (median OS: 17.0 months) (P < 0.001) (Additional file 2: Fig. S4C). In melanoma, patients with mutated BRAF (median OS: 49.0 months) showed a trend toward longer survival than those with wild-type BRAF (median OS: 33.0 months) (Additional file 2: Fig. S4D). We calculated the mutation rates of the ROS1 (Additional file 2: Fig. S4E) and PTPRT (Additional file 2: Fig. S4F) genes for each cancer type and found that neither was high.
Validation of the mutation-based gene set for predicting immunotherapy outcomes
To further confirm the value of the mutation-based gene set for predicting immunotherapy outcomes, we evaluated the mutation-based gene set in the validation cohort. When using the same formula and the same cutoff obtained from the training cohort, in the validation cohort, patients in the low-risk group exhibited an increased OS compared with those in the high-risk group (P < 0.001; HR, 1.792, 95% CI, 1.499–2.143) (Fig. 1F). The AUC of the mutation-based gene set in the validation cohort was 0.674 at 3 years and 0.732 at 5 years (Additional file 2: Fig. S3B); the AUC for each cancer type in the validation cohort was also assessed (Additional file 2: Fig. S3D). Considering the dependence of TMB measurement on the sequence panels used [54, 55], we separately evaluated the robustness of the model across different panels used in the clinic. In the Snyder et al. cohort [29], an advanced melanoma anti-CTLA-4-treated cohort (Additional file 2: Fig. S5A), and in the Mariathasan et al. cohort [22], a metastatic urothelial cancer anti-PD-L1-treated cohort (Additional file 2: Fig. S5B), the survival time of patients in the low-risk group was significantly longer than that of patients in the high-risk group (P < 0.05), which was consistent with the results obtained for the training cohort.
We also systematically compared the performance of our mutation-based gene set to that of the existing mutation-based signature of ICI response in the training cohort, including frameshift insertion/deletion (indel) mutation burden [56], tobacco mutation signature [57], UV signature [58], APOBEC signature [59], and DNA damage response pathway mutation [60]. Genes of the DNA damage response pathway were extracted from Conway et al., including MSH2, MSH6, PMS2, POLE, and BRCA2 [60]. We defined the sample in which all genes in the DNA damage response pathway were wild-type as “DNA damage response pathway unaltered” and the sample in which at least one gene in the DNA damage response pathway was mutated as “DNA damage response pathway altered.” The C-index is one of the most commonly used performance measures for survival models: the higher the value of the C-index is, the better the predictive ability of the model [61]. We found that the predictive power of the mutation-based gene set (C-index = 0.716) was greater than that of the frameshift insertion/deletion (indel) mutation burden (C-index = 0.526), tobacco mutation signature (C-index = 0.515), UV signature (C-index = 0.592), APOBEC signature (C-index = 0.531), and DNA damage response pathway mutation (C-index = 0.607) (Additional file 2: Fig. S5C).
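The altered/unaltered definition for the DNA damage response pathway can be sketched as a simple membership check. The gene list is the one given above; the function name and the input format (a set of mutated gene symbols per sample) are assumptions made for illustration.

```python
# DNA damage response genes listed in the text (from Conway et al.).
DDR_GENES = frozenset({"MSH2", "MSH6", "PMS2", "POLE", "BRCA2"})


def ddr_pathway_status(mutated_genes):
    """Return 'altered' if at least one DDR gene is mutated, else 'unaltered'."""
    if DDR_GENES & set(mutated_genes):
        return "altered"
    return "unaltered"
```

A sample carrying only a TP53 mutation would therefore be labeled unaltered, whereas a sample with any POLE mutation would be labeled altered.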
As many studies have associated individual gene mutation status with ICI benefit, we used the C-index to compare the performance of mutation-based gene sets to that of those genes, including B2M [62], JAK1, JAK2 [63], KRAS, TP53 [64], PTEN [65], STK11 [66], and BAP1 [67]. We found that the predictive power of the mutation-based gene set (C-index = 0.716) was greater than that of B2M mutation (C-index = 0.538), JAK1 mutation (C-index = 0.614), JAK2 mutation (C-index = 0.615), KRAS mutation (C-index = 0.526), TP53 mutation (C-index = 0.600), PTEN mutation (C-index = 0.513), STK11 mutation (C-index = 0.653), and BAP1 mutation (C-index = 0.606) (Additional file 2: Fig. S5D).
We investigated whether the mutation-based gene set is able to predict the response to ICI therapy in the training and validation cohorts. In the training cohort, DCB from ICI therapy was significantly more frequent in the low-risk group than in the high-risk group (Fig. 1G). Patients with low risk scores were also more likely to respond to ICI therapy (Fig. 1H). This result was confirmed in the validation cohort (Fig. 1I and J). Additionally, we examined the proportions of risk score-predicted high-risk and low-risk patients for each cancer type and found that renal cell carcinoma and melanoma accounted for a higher proportion of samples in the low-risk group (Additional file 2: Fig. S6A-C). This may be due to the higher response rates of renal cell carcinoma and melanoma to immunotherapy compared with other tumors [68].
The mutation-based gene set is an independent predictor of prognosis after immunotherapy
We next verified whether the mutation-based gene set is an independent predictor of the response to immunotherapy. In both the training and validation cohorts, univariate Cox regression analysis showed that the mutation-based gene set correlated with OS (Fig. 2A, B). After adjusting for drug type, cancer type, and TMB, the mutation-based gene set remained an independent predictive factor based on multivariate Cox regression analysis, confirming its robustness for independently predicting ICI prognosis (Fig. 2A, B).
In both the training and validation cohorts, multivariate Cox regression analysis showed drug type, TMB, and the mutation-based gene set to be independent predictive factors for identifying patients who will benefit from ICI treatment (Fig. 2A, B). To identify which factor has the best predictive performance, the C-index was utilized to compare performance between the mutation-based gene set and the TMB and drug type in both the training and validation cohorts. In the former, the C-index results showed that the mutation-based gene set predicted prognosis more accurately than TMB (P < 0.001) and drug type (P = 0.006) (Fig. 2C), a result that was validated in the validation cohort (all P < 0.001) (Fig. 2D).
The mutation-based gene set, disease stage, CTL, and 6-gene IFN-γ signature can be combined to predict the clinical benefit of ICI therapy
Using the Riaz cohort, for which both DNA sequencing and RNA sequencing data were available [25], we compared the 2-gene cytolytic score, 6-gene IFN-γ signature score, and 18-gene IFN-γ signature score between the low-risk group and the high-risk group. The genes of the 2-gene cytolytic score included GZMA and GZMB. The genes of the 6-gene IFN-γ signature were extracted from Ayers et al.: IDO1, CXCL10, CXCL9, HLA-DRA, STAT1, and IFNG [9]. The genes of the 18-gene IFN-γ signature were extracted from Ayers et al.: CD3D, IDO1, CIITA, CD3E, CCL5, GZMK, CD2, HLA-DRA, CXCL13, IL2RG, NKG7, HLA-E, CXCR6, LAG3, TAGAP, CXCL10, STAT1, and GZMB [9]. The 2-gene cytolytic score, 6-gene IFN-γ signature score, and 18-gene IFN-γ signature score were estimated by the ssGSEA method. We found that compared with the high-risk group, the low-risk group showed a higher 2-gene cytolytic score (P = 0.071), 6-gene IFN-γ signature score (P < 0.05), and 18-gene IFN-γ signature score (P < 0.05) (Additional file 2: Fig. S7A-C).
Given that disease stage, TILs, and the 6-gene IFN-γ signature have been shown to be highly predictive of the response to ICI therapy [9, 11, 69], we speculated that they might function as synergistic factors in predicting the response to immunotherapy. The genes of TILs (represented by CTLs) were extracted from Jiang et al. and included CD8A, CD8B, GZMA, GZMB, and PRF1 [11]; the CTL score was estimated by the ssGSEA method. We therefore developed a nomogram combining the mutation-based gene set with disease stage, the CTL score, and the 6-gene IFN-γ signature to offer clinicians a quantitative approach for predicting OS in ICI-treated patients. The nomogram was constructed in the Riaz cohort (Fig. 2E), and its calibration curve showed good agreement between the observations and the predictions (Fig. 2F), suggesting that the mutation-based gene set, disease stage, CTL score, and 6-gene IFN-γ signature can be integrated into a predictive nomogram for ICI therapy.
Underlying extrinsic immune landscapes of the high- and low-risk groups
To further explore the relationship between the immune system and mutation-based gene sets, we performed multiomics analysis of the cohort from The Cancer Genome Atlas (TCGA). Using the same formula and cutoff obtained from the training cohort, the cohort from TCGA was classified into high-risk and low-risk groups (Fig. 3A). Comparison at the genomic level revealed larger leukocyte, lymphocyte, and TIL fractions in the low-risk group than in the high-risk group (P < 0.001) (Fig. 3B–D). In addition, we used the TIL fraction data according to Saltz et al., who applied deep learning methods to estimate TILs on hematoxylin and eosin-stained (H&E-stained) slides [45]. Strikingly consistent results for the H&E estimates of the TIL fraction were obtained (P < 0.001) (Fig. 3E). In detail, the proportion of immune-stimulatory cells (such as CD8 T cells) was significantly increased in the low-risk group compared with the high-risk group (P < 0.001) (Fig. 3F). To further examine the above results using different methods of evaluating immune cells, we analyzed their distribution between the high- and low-risk groups according to the immune infiltration scores from Danaher et al. (Fig. 3G) and immune signature scores (Fig. 3H). The low-risk group was characterized by a greater abundance of immune cells, such as TILs and CD8 T cells (P < 0.05) (Fig. 3G, H). TCGA cohort patients were then clustered on the basis of immune signature scores using unsupervised clustering to assess whether the high-risk and low-risk groups correctly corresponded to the low-immune infiltration and high-immune infiltration groups, and unsupervised clustering revealed two distinct immune patterns with high and low levels of immune infiltration (Fig. 3I). Interestingly, the high immune infiltration group was significantly enriched in cases from the low-risk group (Fig. 3J). 
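The enrichment of low-risk cases in the high immune infiltration cluster can be assessed with a test on a 2 × 2 contingency table. The sketch below uses a one-sided Fisher's exact test as an assumed choice; the test actually applied for Fig. 3J is not stated in this section, and the counts in the example are invented.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher's exact test for over-representation in cell `a`.

    Table layout (assumed for illustration):
                    high infiltration   low infiltration
        low-risk           a                   b
        high-risk          c                   d
    Returns P(X >= a) under the hypergeometric null distribution.
    """
    row1 = a + b          # number of low-risk cases
    row2 = c + d          # number of high-risk cases
    col1 = a + c          # number of high-infiltration cases
    total = row1 + row2
    denom = comb(total, col1)
    p = 0.0
    for k in range(a, min(row1, col1) + 1):
        p += comb(row1, k) * comb(row2, col1 - k) / denom
    return p


# Invented counts: 3 of 4 low-risk cases fall in the high-infiltration cluster.
p_value = fisher_one_sided(3, 1, 1, 3)
```

With cohort-sized counts the same function applies unchanged, because Python integers do not overflow when computing the binomial coefficients.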
In addition, in the low-risk group, the immune signature scores at the tumor site were markedly greater than those at the normal site; conversely, the immune signature scores of the high-risk group at the tumor site were markedly lower than those at the normal site (Fig. 4A). Furthermore, the correlation among immune activities in the low-risk group was significantly higher than that in the high-risk group (Fig. 4B, C). GSEA showed significant enrichment in 13 pathways in the low-risk group, including 6 immune-related pathways, such as “natural killer cell mediated cytotoxicity” (P < 0.05) (Additional file 1: Table S6) (Fig. 4D). In contrast, no enrichment in any immune-related pathway was observed for the high-risk group (Additional file 1: Table S7). Low-risk tumors were associated with significantly higher CYT scores (P < 0.001) (Fig. 4E), and a significantly larger number of fibroblasts was found in the high-risk group (P < 0.01) (Fig. 4F). According to these results, the low-risk group showed abundant immune cells at the tumor site, which may underlie its response to ICI therapy, whereas fibroblasts may contribute to extrinsic immune escape in the high-risk group.
Furthermore, we found higher expression of chemokines in the low-risk group (Fig. 4G), which was compatible with the higher infiltration of immune cells in this group (Fig. 3). To provide a fair comparison normalized to immune cell density in tissues, we divided the expression of these genes by the immune cell fraction and compared them again (Fig. 4H); the results obtained after normalization were generally consistent with those obtained before normalization. Therefore, we infer that enrichment of chemokines may invoke an immune response in the low-risk group.
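The normalization described above amounts to dividing each chemokine's expression by the sample's immune cell fraction. A minimal sketch follows; the function name, the guard against near-zero fractions, and the example values are assumptions for illustration and are not taken from the study's code.

```python
def normalize_by_immune_fraction(expression, immune_fraction, min_fraction=1e-6):
    """Divide gene expression values by the sample's immune cell fraction.

    expression:      dict gene -> expression value for one sample.
    immune_fraction: estimated fraction of immune cells in that sample.
    min_fraction:    assumed guard; samples below it are skipped (None)
                     to avoid dividing by a value close to zero.
    """
    if immune_fraction < min_fraction:
        return None
    return {gene: value / immune_fraction for gene, value in expression.items()}


# Invented example values for two chemokine genes.
normalized = normalize_by_immune_fraction({"CXCL9": 8.0, "CXCL10": 4.0}, 0.5)
```

Comparing groups on these normalized values asks whether chemokine expression per unit of immune infiltrate differs, rather than whether it merely tracks the amount of infiltrate.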
Underlying intrinsic immune landscapes of the high- and low-risk groups
We first compared some underlying factors determining tumor immunogenicity between the two groups. The low-risk group showed a higher mutation rate and neoantigen load than the high-risk group (all P < 0.001) (Fig. 5A), as well as significantly higher TCR diversity and BCR diversity (P < 0.001) (Fig. 5A). Compared with the low-risk group, the high-risk group exhibited a higher CNV burden and aneuploidy (all P < 0.001) (Fig. 5A). This result is consistent with the previous discovery that tumor aneuploidy is related to a reduced response to immunotherapy and to markers of immune evasion [70]. In terms of intertumoral heterogeneity, patients in the high-risk group displayed higher intertumoral heterogeneity than those in the low-risk group (P < 0.001) (Fig. 5A). This result further supports the concept that, with lower cytolytic activity and fewer actively infiltrating immune cells, the tumor is allowed to clonally evolve, promoting the development of heterogeneity. Hence, we conclude that high immunogenicity may cause an extrinsic immune response in the low-risk group.
To further understand the mutational processes in the high-risk and low-risk groups, we delineated the mutational signatures based on somatic mutation data and identified four distinct patterns of mutagenesis in the cohort from TCGA (Fig. 5B). Signature 10 (5.58%), which contains a predominance of C>A mutations at TCT (31.2%) sites and C>T mutations at TCG (21.2%) sites, has been previously related to altered activity of the error-prone polymerase POL ε (POLE) as a consequence of mutations in the gene (Fig. 5B). Signature 7 (24.75%) contains an extremely strong transcriptional strand bias for C>T mutations in the CpTpN context, possibly due to ultraviolet light exposure (Fig. 5B). Signature 4 accounts for 31.63% of all point mutations and is characterized by C>A mutations; it may be associated with smoking (Fig. 5B). Signature 6 (38.03%), which is the most prevalent signature, is characterized by C>T mutations and thought to be associated with defective DNA mismatch repair (MMR); this signature has been detected in microsatellite unstable tumors (Fig. 5B). The four signatures were found at obviously higher frequencies in the low-risk group than in the high-risk group (all P < 0.001) (Fig. 5C). Smoking signatures and MMR signatures have been reported to be associated with immune response [14, 71]. We then calculated enrichment scores for oncogenes in 10 common oncogenic pathways in the low- and high-risk groups [50]. The cell cycle, Hippo, NRF2, PI3K, and TP53 pathways had higher scores in the low-risk group, whereas the MYC and Wnt pathways were enriched in the high-risk group (all P < 0.001) (Fig. 5D). The Wnt pathway has been shown to be related to immune exclusion [72].
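Mutational signatures such as those above are defined over substitution classes in their trinucleotide context, reported with the mutated base as a pyrimidine (for example, "C>A at TCT"). The sketch below shows this standard normalization step and the resulting context counts; the function names and the example mutations are illustrative assumptions, and the signature extraction itself is not reproduced here.

```python
from collections import Counter

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def normalize_snv(context, alt):
    """Express an SNV on the pyrimidine strand.

    context: 3-base reference context with the mutated base in the middle.
    alt:     alternate base observed at the middle position.
    Returns (substitution, context), e.g. ("C>A", "TCT").
    """
    if len(context) != 3 or alt not in "ACGT" or context[1] == alt:
        raise ValueError("expected a trinucleotide context and a different alt base")
    ref = context[1]
    if ref in "CT":
        return f"{ref}>{alt}", context
    # Purine reference base: report the reverse-complement strand instead.
    rc_context = context.translate(_COMPLEMENT)[::-1]
    rc_alt = alt.translate(_COMPLEMENT)
    return f"{rc_context[1]}>{rc_alt}", rc_context


def count_contexts(snvs):
    """Count normalized (substitution, context) pairs for a list of SNVs."""
    return Counter(normalize_snv(context, alt) for context, alt in snvs)


# Invented example mutations; two are given on the purine strand.
example_snvs = [("TCT", "A"), ("AGA", "T"), ("TCG", "T"), ("CGA", "A")]
context_counts = count_contexts(example_snvs)
```

The resulting per-sample count vectors over all 96 possible classes are the usual input to signature decomposition methods such as NMF.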
Compared to the low-risk group, the high-risk group expressed smaller amounts of MHC I- and II-related antigen-presenting molecules (all P < 0.001), which may result in intrinsic immune escape (Fig. 5E). In contrast, the low-risk group had higher expression of most MHC genes, which is indicative of stronger immunogenicity. We also found immune checkpoint molecules (such as PD-1, PD-L1, and CTLA4) and costimulatory molecules to be more highly expressed in the low-risk group than in the high-risk group (most P < 0.001) (Fig. 5E). Therefore, we infer that the higher expression of these immune checkpoint molecules may contribute to the response to ICI therapy in the low-risk group.
The above research was based on the mutation-based gene set as a whole to study the potential mechanisms of immune response and escape; thus, we further characterized the presumed mechanism by which each gene is related to the response to immunotherapy. We first compared the nonsilent mutation rate between the mutant and wild-type status of each gene in the cohort from TCGA. The nonsilent mutation rate was significantly higher in tumors with mutant genes than in those with wild-type genes (Additional file 2: Fig. S8A), indicating that mutations are related to enhanced tumor immunogenicity.
In addition, based on molecular estimates, TILs were more abundant in mutant-gene tumors than in wild-type-gene tumors (Additional file 2: Fig. S8B), which was validated using the H&E estimate data (Additional file 2: Fig. S8C). Next, we focused on T cells and found significantly higher TCR richness in tumors harboring mutant genes than in tumors with wild-type genes (Additional file 2: Fig. S9A). Based on CIBERSORT data, CD8 T cells were more abundant in mutant-gene tumors than in wild-type-gene tumors (Additional file 2: Fig. S9B), and these results were validated using immune signature score (Additional file 2: Fig. S9C). To better characterize the immune profile, differences in the expression pattern of immune checkpoint genes between mutant- and wild-type-gene tumors were explored. In line with the data for TILs, PD-1, PD-L1, and CTLA4 were upregulated in tumors with mutant genes (Additional file 2: Fig. S10). These results suggest that mutation of these 11 genes is strongly related to a hot immune microenvironment and enhanced tumor immunogenicity, which firmly supports the predictive abilities of these mutations for ICI therapy.
We also compared the lymphocyte score, immune activation score, and mutation signature 7 (ultraviolet radiation, UVR) score between the low-risk group and the high-risk group in the breast invasive carcinoma (BRCA) and skin cutaneous melanoma (SKCM) cohorts (Additional file 2: Fig. S7D-I); lymphocyte genes (represented by CTLs) were extracted from Jiang et al., including CD8A, CD8B, GZMA, GZMB, and PRF1 [11], and immune activation genes (represented by T and NK cell activity markers) were extracted from Wan et al., including GZMA, GZMB, IFNG, and NKG7 [50]. The lymphocyte score and immune activation score for each sample were estimated by the ssGSEA method. Compared with the high-risk group, the low-risk group showed a higher lymphocyte score, a higher immune activation score, and a higher mutation signature 7 (UVR) score in both cohorts (all P < 0.05, Additional file 2: Fig. S7D-I).
Furthermore, to balance the bias of the number of high- and low-risk groups among different cancer types, we selected 50 high-risk cases and 50 low-risk cases from each cancer type that had at least these numbers of cases, including bladder urothelial carcinoma (BLCA), BRCA, cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), colon adenocarcinoma (COAD), head and neck squamous cell carcinoma (HNSC), kidney renal clear cell carcinoma (KIRC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), SKCM, stomach adenocarcinoma (STAD), thyroid carcinoma (THCA), and uterine corpus endometrial carcinoma (UCEC) (Additional file 2: Fig. S7J). We conducted 1000 random samplings and compared the lymphocyte score, immune activation score and mutation signature 7 (UVR) between the low-risk and high-risk groups and found higher scores in the low-risk group (all P < 0.001, Additional file 2: Fig. S7K-M).
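The balanced resampling procedure described above can be sketched as follows. The data layout, the function name, and the use of a difference in medians as the per-iteration summary are assumptions made for illustration; the statistical test applied to each resampled set is not specified in this section, and the example data are synthetic.

```python
import random
from statistics import median


def balanced_resampling(cases, n_per_group=50, n_iter=1000, seed=0):
    """Repeatedly draw equal numbers of low- and high-risk cases per cancer type.

    cases: iterable of (cancer_type, risk_group, score), risk_group in {"low", "high"}.
    Returns (eligible cancer types, list of median(low) - median(high) per iteration).
    """
    rng = random.Random(seed)
    by_type = {}
    for cancer_type, group, score in cases:
        by_type.setdefault(cancer_type, {"low": [], "high": []})[group].append(score)

    # Keep only cancer types with enough cases in both risk groups.
    eligible = {t: g for t, g in by_type.items()
                if len(g["low"]) >= n_per_group and len(g["high"]) >= n_per_group}
    if not eligible:
        raise ValueError("no cancer type has enough cases in both groups")

    differences = []
    for _ in range(n_iter):
        low, high = [], []
        for groups in eligible.values():
            low += rng.sample(groups["low"], n_per_group)
            high += rng.sample(groups["high"], n_per_group)
        differences.append(median(low) - median(high))
    return sorted(eligible), differences


# Synthetic data: two cancer types with 60 cases per group, one with only 10.
toy_cases = []
for cancer_type, n in (("SKCM", 60), ("BRCA", 60), ("UVM", 10)):
    for i in range(n):
        toy_cases.append((cancer_type, "low", 1.0 + 0.01 * i))
        toy_cases.append((cancer_type, "high", 0.0 + 0.01 * i))

eligible_types, median_differences = balanced_resampling(toy_cases)
```

Because every eligible cancer type contributes the same number of cases to each group in every draw, no single tumor type can dominate the comparison.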
Copy number features of the high- and low-risk groups
Significant differences in chromosomal aberrations were detected between the high-risk and low-risk groups (Fig. 6A). Compared with the high-risk group (Fig. 6B), focal amplification peaks were observed for well-characterized immune genes, such as PD-L1 (9p24.1) and PD-L2 (9p24.1), in the low-risk group (Fig. 6C). Venn diagrams revealed 692 shared genes in the chromosome regions with copy number amplification in both groups, with 310 and 1218 genes specifically amplified in the high-risk and low-risk groups, respectively (Fig. 6D). We annotated these specific amplified genes through biological processes in Gene Ontology (Additional file 1: Tables S8 and S9) and then clustered the top 10 biological processes (Fig. 6E). The low-risk group was significantly enriched in 2 immune-related biological processes, “lymphocyte costimulation” and “T cell costimulation” (Fig. 6E). In contrast, the high-risk group was significantly enriched in “positive regulation of fibroblast proliferation” but not any immune-related biological process (Fig. 6E). This result was surprisingly consistent with previous results; that is, there were more immune cells in the low-risk group (Fig. 3B–D) and more fibroblasts in the high-risk group (Fig. 4F). Notably, PD-L1 and PD-L2 (located in the low-risk group-specific amplification peak 9p24.1) were annotated in both immune-related biological processes, indicating that PD-L1 and PD-L2 may play important roles in regulating immune status in the low-risk group (Fig. 6F). At the level of mRNA expression in the cohort from TCGA, we found significantly higher mRNA expression of PD-L1 and PD-L2 in the low-risk group (Fig. 6G), consistent with the CNV data. This finding indicates that CNVs in tumors contribute to observed differences in immune infiltration.
Discussion
Predictive biomarkers may help members of the medical community offer accurate guidance for ICI-treated patients, aid in cost management, and accelerate clinical trials and FDA approvals. Several biomarkers have been investigated, and some have been used to predict treatment outcomes. Indeed, recent studies have shown a robust association between TMB and the response to ICIs [73]. However, some patients with a high TMB may carry decisive mutations (in B2M, JAK1/2, etc.) that are closely associated with immunotherapy resistance, leading to a lack of response to ICIs and indicating that the TMB is insufficient for prognosis prediction [17]. Therefore, it is necessary to identify alternative markers of responsiveness. Based on a cohort of 2504 patients with different types of cancer, we established and validated a mutation-based gene set including 11 genes to predict survival benefits in patients undergoing ICI therapy. To the best of our knowledge, the current study is the first to investigate a comprehensive mutation-based gene set across different tumor types using independent cohorts.
Different types of tumors were included in our study, and different types of tumors have different prognoses. Therefore, we focused on eliminating bias among different types of tumors in the establishment and evaluation of the mutation-based gene set. First, we used the PSM adjustment method to adjust for bias among different types of tumors. The PSM algorithm is an important statistical tool to control confounding in observational studies, and it has been widely used in clinical research and pancancer genomic studies to reweight potential confounding effects in a multivariate manner [33, 34, 74–77]. In addition, the performance of methods that correct the confounder effect by balancing the propensity score was reported to be superior to that of other methods, including the t test, analysis of variance (ANOVA) and general linear model (GLM) [33]. Therefore, to identify gene mutations associated with prognosis, we employed a propensity score algorithm to reduce potential confounding effects among different types of tumors. Second, after the mutation-based gene set was established, we investigated whether the mutation-based gene set was restricted to specific groups or applicable to different populations. Stratification analyses indicated that the mutation-based gene set was significantly associated with OS in patients treated with ICI therapy, regardless of whether the cancer type was advanced lung cancer or colorectal cancer. The results of the subgroup analysis were consistent with the results of PSM adjustment. Third, we performed multivariate Cox regression analysis and found that our mutation-based gene set was independent of tumor type in predicting prognosis. In summary, we tested the application performance of the mutation-based gene set across different types of tumors using the PSM algorithm, stratified analysis, and multivariate Cox regression analysis, and based on the results, we believe the mutation-based gene set to be reliable.
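To make the matching step of PSM concrete, the sketch below performs greedy 1:1 nearest-neighbor matching on propensity scores that are assumed to have been estimated already (for example, from tumor type and other covariates). The caliper value, the function name, and the example scores are illustrative assumptions; the exact PSM configuration used in this study is not described in this section.

```python
def match_nearest(mutant_scores, wild_type_scores, caliper=0.05):
    """Greedy 1:1 nearest-neighbor matching on propensity scores.

    mutant_scores / wild_type_scores: dict patient_id -> propensity score.
    caliper: maximum allowed score difference for a valid match (assumed value).
    Returns a list of (mutant_id, wild_type_id) pairs; each wild-type
    patient is used at most once (matching without replacement).
    """
    available = dict(wild_type_scores)
    pairs = []
    # Process mutant patients in order of increasing propensity score.
    for mutant_id, score in sorted(mutant_scores.items(), key=lambda kv: kv[1]):
        if not available:
            break
        nearest = min(available, key=lambda pid: abs(available[pid] - score))
        if abs(available[nearest] - score) <= caliper:
            pairs.append((mutant_id, nearest))
            del available[nearest]
    return pairs


# Invented propensity scores for illustration.
mutant = {"m1": 0.30, "m2": 0.70, "m3": 0.95}
wild_type = {"w1": 0.32, "w2": 0.69, "w3": 0.10}
matched_pairs = match_nearest(mutant, wild_type)
```

Patients without a comparable counterpart within the caliper (here "m3") are left unmatched, which is how matching removes comparisons that would otherwise be confounded by differences such as tumor type.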
Furthermore, we employed the multidimensional TCGA dataset to analyze how cancers respond to immunotherapy. We found that the low-risk group featured an inflammatory pattern of immune activities, such as high levels of CD8 T cell infiltration determined by the ESTIMATE approach, and stronger immunogenicity, such as a higher TMB. When we utilized the immune infiltration scores from Danaher et al. and the ssGSEA approach to calculate overall immune cell infiltration levels for cancers, the immune score was significantly higher in the low-risk group than in the high-risk group, which again confirmed the stronger antitumor immune activity in the former group. Many studies have shown that the density of TILs is positively associated with the immune response in patients with various kinds of cancers [78]. In addition to a high level of cytotoxic T cell infiltration, the low-risk group was characterized by overexpression of immune checkpoints, such as PD-L1, PD-1, and CTLA-4, compared with the high-risk group. Therefore, activated antitumor immunity, high PD-L1, PD-1, and CTLA-4 expression, and enhanced tumor immunogenicity might explain why the low-risk group was found to be more likely than the high-risk group to benefit from ICI therapy.
Our research has the following innovations and practical applications. First, our study investigated different types of tumors (such as NSCLC, melanoma, and renal cell carcinoma), which represent the most common types of cancers treated with ICI therapy [79–81]. Several mRNA-based signatures, such as the T cell-inflamed gene-expression profile (GEP), an 18-gene assay, have been developed to predict clinical efficacy in patients undergoing ICI therapy [82]. To the best of our knowledge, the current study is the first to investigate a comprehensive mutation-based gene set across different tumor types using independent cohorts. Second, the application of multibiomarker predictive models requires an understanding of the factors that influence the accuracy and precision of high-throughput-based assays in clinical practice. Principal among these factors is the variability of biomarker measurements, which can be classified into preanalytical (intrinsic to the sample) and technical (intrinsic to the platform) sources of variation. Tissue-specific variability influences mRNA expression and is controlled by introducing several reference genes; relative quantitation is adopted to assess mRNA expression by normalization to reference genes. The risk score formulas and threshold values of these mRNA signatures are therefore not suitable for validation using other types of measurement data. In the current study, we developed a mutation-based gene set to predict the clinical efficacy of ICI therapy. The composition of the above mutations is neither affected by the tissue type nor adjusted for by any other biomarker. Moreover, the risk score formula as well as the threshold value for the mutation-based gene set can be validated by other tumor analysis methods, such as DNA sequencing and single-nucleotide polymorphism microarray analysis. Hence, the mutation-based gene set is not affected by technical sources of variation, even when different platforms are used at different centers.
Third, in practice, the mutation-based gene set avoids exposing patients to potential immune-related adverse effects if they are unlikely to respond and enables matching of a patient to a potentially more effective treatment sooner. In addition, given that the treatment course typically costs more than $120 000 on average [73], the application of biomarker strategies that improve diagnostic accuracy may help avoid considerable costs for what is anticipated to be a substantially reduced benefit. Overall, a mutation-based gene set incorporating these alterations should be assessed due to the greater ease of obtaining tumor specimens from patients on the basis of targeted NGS of these genes rather than assessing the TMB, which is complicated and expensive in routine practice. Fourth, we compared prediction performance between the mutation-based gene set and other factors that can predict immunotherapy, including the frameshift indel mutation burden, tobacco mutation signature, UV signature, APOBEC signature, DNA damage response pathway mutations, B2M mutation, JAK1 mutation, JAK2 mutation, KRAS mutation, TP53 mutation, PTEN mutation, STK11 mutation, and BAP1 mutation. We found that the prediction performance of the mutation-based gene set was superior to that of all of those factors.
Several limitations of this study should be considered. First, as some mutations may be enriched in some tumor types, the original goal of this study was to create a panel rather than identify a single gene (such as BRAF), as the former can include more genes to predict prognosis across different types of tumors. In addition, we explored all pancancer articles and evaluated how other researchers eliminated the biases associated with different types of tumors. Because we found that the PSM adjustment method is well recognized [33, 34, 74–77], we used PSM adjustment in this study to eliminate such bias. We also included different types of tumors as much as possible to eliminate these biases. To the best of our knowledge, this study is the largest to date to explore prognosis prediction for mutation-based pancancer immunotherapy. Of course, as large sample sizes of immunotherapy cohort clinical trials and better algorithms continue to be published, we will update our mutation-based gene set accordingly in the future to make it more comprehensive. Second, although we explored the immune landscape of each of the 11 genes in the mutation-based gene set, we still need to elucidate the molecular mechanism underlying the influence of each gene on immunotherapy in in vivo and in vitro functional experiments. Third, the enrichment scores of oncogenic pathways and expression patterns of immune checkpoints should also be examined by immunohistochemistry.
Conclusions
To use high-throughput methodologies in clinical practice, a marker must be validated by utilizing widely available tissues, such as formalin-fixed and paraffin-embedded tumor tissues. Once this major step has been achieved, we will enter a new era of truly tailored and precision medicine, likely with higher cure rates. Our mutation-based gene set meets the above requirement and is the first systematically identified comprehensive genomic marker for assessing the effect of ICI therapy across a broad spectrum of cancers. This study also represents the largest prognostic model discovery project for cancer patients who received ICI treatment (either as monotherapy or as a combination of anti-PD-1 and anti-CTLA-4). The nomogram combining the mutation-based gene set with the TMB and drug type can help clinicians select patients who have a strong likelihood of responding to ICI therapies. In addition, our study revealed distinct immune landscapes for the high- and low-risk groups. Specific genomic alterations might drive the formation of these microenvironment phenotypes. Overall, this work proposes a new tumor classification system with the potential to guide ICI treatment decisions.
Supplementary Information
Acknowledgements
Not applicable.
Abbreviations
- ACC
Adrenocortical carcinoma
- aDCs
Activated dendritic cells
- APCs
Antigen-presenting cells
- AUC
Area under the curve
- BCR
B cell receptor
- BLCA
Bladder urothelial carcinoma
- BRCA
Breast invasive carcinoma
- CCR
Cytokine and cytokine receptor
- CESC
Cervical squamous cell carcinoma and endocervical adenocarcinoma
- CHOL
Cholangiocarcinoma
- CNV
Copy number variation
- COAD
Colon adenocarcinoma
- COSMIC
Catalogue of somatic mutations in cancer
- CTLA-4
Cytotoxic T lymphocyte-associated protein-4
- CYT
Cytolytic activity score
- DCs
Dendritic cells
- DLBC
Lymphoid neoplasm diffuse large B cell lymphoma
- ES
Enrichment score
- ESCA
Esophageal carcinoma
- ESTIMATE
Estimation of stromal and immune cells in malignant tumors using expression data
- FDR q-val
False discovery rate q value
- FWER P-val
Family-wise-error rate P value
- GBM
Glioblastoma multiforme
- GEP
Gene expression profile
- GO
Gene Ontology
- GSEA
Gene set enrichment analysis
- HLA
Human leukocyte antigen
- HNSC
Head and neck squamous cell carcinoma
- ICI
Immune-checkpoint inhibitor
- iDCs
Immature dendritic cells
- IDI
Integrated discrimination improvement
- IFN
Interferon
- Indel
Insertion-deletion
- ITH
Intertumoral heterogeneity
- KEGG
Kyoto Encyclopedia of Genes and Genomes
- KICH
Kidney chromophobe
- KIRC
Kidney renal clear cell carcinoma
- KIRP
Kidney renal papillary cell carcinoma
- LAML
Acute myeloid leukemia
- LASSO
Least absolute shrinkage and selection operator
- LGG
Low-grade glioma
- LIHC
Liver hepatocellular carcinoma
- LUAD
Lung adenocarcinoma
- LUSC
Lung squamous cell carcinoma
- MESO
Mesothelioma
- MHC
Major histocompatibility complex
- MMR
Mismatch repair
- MSI
Microsatellite instability
- NES
Normalized enrichment score
- NGS
Next-generation sequencing
- NK
Natural killer
- NMF
Nonnegative matrix factorization
- NOM p-val
Nominal P value
- NRI
Net reclassification improvement
- NSCLC
Non-small-cell lung cancer
- OS
Overall survival
- OV
Ovarian serous cystadenocarcinoma
- PAAD
Pancreatic adenocarcinoma
- PCPG
Pheochromocytoma and paraganglioma
- PD-1
Programmed death-1
- pDCs
Plasmacytoid dendritic cells
- PD-L1
Programmed death ligand-1
- PRAD
Prostate adenocarcinoma
- READ
Rectum adenocarcinoma
- ROC
Receiver operating characteristic
- SARC
Sarcoma
- SCNA
Somatic copy number alteration
- SKCM
Skin cutaneous melanoma
- SNV
Single-nucleotide variant
- ssGSEA
Single-sample gene set enrichment analysis
- STAD
Stomach adenocarcinoma
- TCGA
The Cancer Genome Atlas
- TCR
T cell receptor
- Tfh cells
Follicular helper T cells
- TGCT
Testicular germ cell tumors
- Th17 cells
T helper 17 cells
- THCA
Thyroid carcinoma
- THYM
Thymoma
- TILs
Tumor-infiltrating lymphocytes
- TMB
Tumor mutation burden
- TME
Tumor microenvironment
- Tregs
Regulatory T cells
- UCEC
Uterine corpus endometrial carcinoma
- UCS
Uterine carcinosarcoma
- UVM
Uveal melanoma
Authors’ contributions
Haitao Zhao led the entire project, and all authors participated in the discussion and interpretation of the data and results. Junyu Long, Dongxu Wang, Anqiang Wang, and Peipei Chen performed the bioinformatics analysis of the sequencing data, wrote the paper, and were involved in planning the project. Yu Lin, Jin Bian, Xu Yang, Mingjun Zheng, Haohai Zhang, Yongchang Zheng, and Xinting Sang generated the figures and tables. All authors read and approved the final manuscript.
Funding
This work was supported by the Beijing Municipal Natural Science Foundation Project (7222130), the CHEN XIAO-PING Foundation for the development of science and technology of HUBEI province (CXPJJH1200008-10), the Fundamental Research Funds for the Central Universities (3332020084 and 3332018032), the CAMS Innovation Fund for Medical Sciences (CIFMS) (2021-I2M-1-061, 2021-1-I2M-003, 2018-I2M-3-001, and 2020-I2M-C&T-B-019), the CAMS Clinical and Translational Medicine Research Funds (2019XK320006), the Beijing Natural Science Foundation (7192158), the CSCO-hengrui Cancer Research Fund (Y-HR2019-0239), the National Ten-thousand Talent Program, the Shenzhen Science and Technology Plan (CKCY20180323174659823), the Project funded by China Postdoctoral Science Foundation (2020TQ0051 and 2021M690463), and the National Science Foundation for Young Scientists of China (81802735 and 82100380).
Availability of data and materials
All data used in this study are from public datasets and can be accessed without restriction. The web links or unique identifiers for public datasets are described in the paper. The codes used for data analysis in this manuscript have been deposited in GitHub (https://github.com/longjunyu/Pancan-ICI) [83].
Declarations
Ethics approval and consent to participate
The patient data analyzed in this work were acquired from publicly available datasets.
Consent for publication
Not applicable.
Competing interests
Yu Lin is affiliated with Shenzhen Withsum Technology Limited. The remaining authors declare no competing interests.
Footnotes
Junyu Long, Dongxu Wang, Anqiang Wang and Peipei Chen contributed equally to this work.
Contributor Information
Junyu Long, Email: lancet_junyu@163.com.
Dongxu Wang, Email: drwangdongxu@163.com.
Anqiang Wang, Email: wanganqiang0902@163.com.
Peipei Chen, Email: peipeich@163.com.
Yu Lin, Email: linyunet@163.com.
Jin Bian, Email: bianjinps@163.com.
Xu Yang, Email: yangxulcyx@163.com.
Mingjun Zheng, Email: mingjun.zheng@campus.lmu.de.
Haohai Zhang, Email: hzhang3@bidmc.harvard.edu.
Yongchang Zheng, Email: zhengyongchang@pumch.cn.
Xinting Sang, Email: sangxt@pumch.cn.
Haitao Zhao, Email: zhaoht@pumch.cn.
References
- 1.Daud AI, Wolchok JD, Robert C, Hwu WJ, Weber JS, Ribas A, Hodi FS, Joshua AM, Kefford R, Hersey P, et al. Programmed death-ligand 1 expression and response to the anti-programmed death 1 antibody pembrolizumab in melanoma. J Clin Oncol. 2016;34:4102–4109. doi: 10.1200/JCO.2016.67.2477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lin J, Yang X, Long J, Zhao S, Mao J, Wang D, Bai Y, Bian J, Zhang L, Yang X, et al. Pembrolizumab combined with lenvatinib as non-first-line therapy in patients with refractory biliary tract carcinoma. Hepatobiliary Surg Nutr. 2020;9:414–424. doi: 10.21037/hbsn-20-338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Yang X, Xu H, Zuo B, Yang X, Bian J, Long J, Wang D, Zhang J, Ning C, Wang Y, et al. Downstaging and resection of hepatocellular carcinoma in patients with extrahepatic metastases after stereotactic therapy. Hepatobiliary Surg Nutr. 2021;10:434–442. doi: 10.21037/hbsn-21-188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sun R, Limkin EJ, Vakalopoulou M, Dercle L, Champiat S, Han SR, Verlingue L, Brandao D, Lancia A, Ammari S, et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. Lancet Oncol. 2018;19:1180–1191. doi: 10.1016/S1470-2045(18)30413-3. [DOI] [PubMed] [Google Scholar]
- 5. Song L, Cohen D, Ouyang Z, Cao Y, Hu X, Liu XS. TRUST4: immune repertoire reconstruction from bulk and single-cell RNA-seq data. Nat Methods. 2021;18:627–630. doi: 10.1038/s41592-021-01142-2.
- 6. Gu SS, Wang X, Hu X, Jiang P, Li Z, Traugh N, Bu X, Tang Q, Wang C, Zeng Z, et al. Clonal tracing reveals diverse patterns of response to immune checkpoint blockade. Genome Biol. 2020;21:263. doi: 10.1186/s13059-020-02166-1.
- 7. Hu X, Zhang J, Wang J, Fu J, Li T, Zheng X, Wang B, Gu S, Jiang P, Fan J, et al. Landscape of B cell immunity and related immune evasion in human cancers. Nat Genet. 2019;51:560–567. doi: 10.1038/s41588-018-0339-x.
- 8. Li T, Fu J, Zeng Z, Cohen D, Li J, Chen Q, Li B, Liu XS. TIMER2.0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res. 2020;48:W509–W514. doi: 10.1093/nar/gkaa407.
- 9. Ayers M, Lunceford J, Nebozhyn M, Murphy E, Loboda A, Kaufman DR, Albright A, Cheng JD, Kang SP, Shankaran V, et al. IFN-γ-related mRNA profile predicts clinical response to PD-1 blockade. J Clin Invest. 2017;127:2930–2940. doi: 10.1172/JCI91190.
- 10. Fu J, Li K, Zhang W, Wan C, Zhang J, Jiang P, Liu XS. Large-scale public data reuse to model immunotherapy response and resistance. Genome Med. 2020;12:21. doi: 10.1186/s13073-020-0721-z.
- 11. Jiang P, Gu S, Pan D, Fu J, Sahu A, Hu X, Li Z, Traugh N, Bu X, Li B, et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat Med. 2018;24:1550–1558. doi: 10.1038/s41591-018-0136-1.
- 12. Shi S, Gu S, Han T, Zhang W, Huang L, Li Z, Pan D, Fu J, Ge J, Brown M, et al. Inhibition of MAN2A1 enhances the immune response to anti-PD-L1 in human tumors. Clin Cancer Res. 2020;26:5990–6002. doi: 10.1158/1078-0432.CCR-20-0778.
- 13. Kumar S, Zeng Z, Bagati A, Tay RE, Sanz LA, Hartono SR, Ito Y, Abderazzaq F, Hatchi E, Jiang P, et al. CARM1 inhibition enables immunotherapy of resistant tumors by dual action on tumor cells and T cells. Cancer Discov. 2021;11:2050–2071. doi: 10.1158/2159-8290.CD-20-1144.
- 14. Rizvi NA, Hellmann MD, Snyder A, Kvistborg P, Makarov V, Havel JJ, Lee W, Yuan J, Wong P, Ho TS, et al. Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science. 2015;348:124–128. doi: 10.1126/science.aaa1348.
- 15. Hellmann MD, Ciuleanu TE, Pluzanski A, Lee JS, Otterson GA, Audigier-Valette C, Minenza E, Linardou H, Burgers S, Salman P, et al. Nivolumab plus ipilimumab in lung cancer with a high tumor mutational burden. N Engl J Med. 2018;378:2093–2104. doi: 10.1056/NEJMoa1801946.
- 16. Li R, Han D, Shi J, Han Y, Tan P, Zhang R, Li J. Choosing tumor mutational burden wisely for immunotherapy: a hard road to explore. Biochim Biophys Acta Rev Cancer. 2020;1874:188420. doi: 10.1016/j.bbcan.2020.188420.
- 17. Zaretsky JM, Garcia-Diaz A, Shin DS, Escuin-Ordinas H, Hugo W, Hu-Lieskovan S, Torrejon DY, Abril-Rodriguez G, Sandoval S, Barthly L, et al. Mutations associated with acquired resistance to PD-1 blockade in melanoma. N Engl J Med. 2016;375:819–829. doi: 10.1056/NEJMoa1604958.
- 18. Chen DS, Mellman I. Elements of cancer immunity and the cancer-immune set point. Nature. 2017;541:321–330. doi: 10.1038/nature21349.
- 19. Wang F, Zhao Q, Wang YN, Jin Y, He MM, Liu ZX, Xu RH. Evaluation of POLE and POLD1 mutations as biomarkers for immunotherapy outcomes across multiple cancer types. JAMA Oncol. 2019;5(10):1504–1506. doi: 10.1001/jamaoncol.2019.2963.
- 20. Samstein RM, Lee CH, Shoushtari AN, Hellmann MD, Shen R, Janjigian YY, Barron DA, Zehir A, Jordan EJ, Omuro A, et al. Tumor mutational load predicts survival after immunotherapy across multiple cancer types. Nat Genet. 2019;51:202–206. doi: 10.1038/s41588-018-0312-8.
- 21. Liu D, Schilling B, Liu D, Sucker A, Livingstone E, Jerby-Arnon L, Zimmer L, Gutzmer R, Satzger I, Loquai C, et al. Integrative molecular and clinical modeling of clinical outcomes to PD1 blockade in patients with metastatic melanoma. Nat Med. 2019;25:1916–1927. doi: 10.1038/s41591-019-0654-5.
- 22. Mariathasan S, Turley SJ, Nickles D, Castiglioni A, Yuen K, Wang Y, Kadel EE III, Koeppen H, Astarita JL, Cubas R, et al. TGFβ attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature. 2018;554:544–548. doi: 10.1038/nature25501.
- 23. Miao D, Margolis CA, Vokes NI, Liu D, Taylor-Weiner A, Wankowicz SM, Adeegbe D, Keliher D, Schilling B, Tracy A, et al. Genomic correlates of response to immune checkpoint blockade in microsatellite-stable solid tumors. Nat Genet. 2018;50:1271–1281. doi: 10.1038/s41588-018-0200-2.
- 24. Miao D, Margolis CA, Gao W, Voss MH, Li W, Martini DJ, Norton C, Bossé D, Wankowicz SM, Cullen D, et al. Genomic correlates of response to immune checkpoint therapies in clear cell renal cell carcinoma. Science. 2018;359:801–806. doi: 10.1126/science.aan5951.
- 25. Riaz N, Havel JJ, Makarov V, Desrichard A, Urba WJ, Sims JS, Hodi FS, Martín-Algarra S, Mandal R, Sharfman WH, et al. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell. 2017;171:934–949.e16. doi: 10.1016/j.cell.2017.09.028.
- 26. Hugo W, Zaretsky JM, Sun L, Song C, Moreno BH, Hu-Lieskovan S, Berent-Maoz B, Pang J, Chmielowski B, Cherry G, et al. Genomic and transcriptomic features of response to anti-PD-1 therapy in metastatic melanoma. Cell. 2016;165:35–44. doi: 10.1016/j.cell.2016.02.065.
- 27. Van Allen EM, Miao D, Schilling B, Shukla SA, Blank C, Zimmer L, Sucker A, Hillen U, Foppen MHG, Goldinger SM, et al. Genomic correlates of response to CTLA-4 blockade in metastatic melanoma. Science. 2015;350:207–211. doi: 10.1126/science.aad0095.
- 28. Zehir A, Benayed R, Shah RH, Syed A, Middha S, Kim HR, Srinivasan P, Gao J, Chakravarty D, Devlin SM, et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat Med. 2017;23:703–713. doi: 10.1038/nm.4333.
- 29. Snyder A, Makarov V, Merghoub T, Yuan J, Zaretsky JM, Desrichard A, Walsh LA, Postow MA, Wong P, Ho TS, et al. Genetic basis for clinical response to CTLA-4 blockade in melanoma. N Engl J Med. 2014;371:2189–2199. doi: 10.1056/NEJMoa1406498.
- 30. Rizvi H, Sanchez-Vega F, La K, Chatila W, Jonsson P, Halpenny D, Plodkowski A, Long N, Sauter JL, Rekhtman N, et al. Molecular determinants of response to anti-programmed cell death (PD)-1 and anti-programmed death-ligand 1 (PD-L1) blockade in patients with non-small-cell lung cancer profiled with targeted next-generation sequencing. J Clin Oncol. 2018;36:633–641. doi: 10.1200/JCO.2017.75.3384.
- 31. Janjigian YY, Sanchez-Vega F, Jonsson P, Chatila WK, Hechtman JF, Ku GY, Riches JC, Tuvy Y, Kundra R, Bouvier N, et al. Genetic predictors of response to systemic therapy in esophagogastric cancer. Cancer Discov. 2018;8:49–58. doi: 10.1158/2159-8290.CD-17-0787.
- 32. Weinstein JN, Collisson EA, Mills GB, Shaw KR, Ozenberger BA, Ellrott K, Shmulevich I, Sander C, Stuart JM. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013;45:1113–1120. doi: 10.1038/ng.2764.
- 33. Yuan Y, Liu L, Chen H, Wang Y, Xu Y, Mao H, Li J, Mills GB, Shu Y, Li L, Liang H. Comprehensive characterization of molecular differences in cancer between male and female patients. Cancer Cell. 2016;29:711–722. doi: 10.1016/j.ccell.2016.04.001.
- 35. Gui J, Li H. Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data. Bioinformatics. 2005;21:3001–3008. doi: 10.1093/bioinformatics/bti422.
- 36. Meehan AJ, Latham RM, Arseneault L, Stahl D, Fisher HL, Danese A. Developing an individualized risk calculator for psychopathology among young people victimized during childhood: a population-representative cohort study. J Affect Disord. 2020;262:90–98. doi: 10.1016/j.jad.2019.10.034.
- 37. Wu TT, Chen YF, Hastie T, Sobel E, Lange K. Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics. 2009;25:714–721. doi: 10.1093/bioinformatics/btp041.
- 38. Xu RH, Wei W, Krawczyk M, Wang W, Luo H, Flagg K, Yi S, Shi W, Quan Q, Li K, et al. Circulating tumour DNA methylation markers for diagnosis and prognosis of hepatocellular carcinoma. Nat Mater. 2017;16:1155–1161. doi: 10.1038/nmat4997.
- 39. Camp RL, Dolled-Filhart M, Rimm DL. X-tile: a new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin Cancer Res. 2004;10:7252–7259. doi: 10.1158/1078-0432.CCR-04-0713.
- 40. Tang XR, Li YQ, Liang SB, Jiang W, Liu F, Ge WX, Tang LL, Mao YP, He QM, Yang XJ, et al. Development and validation of a gene expression-based signature to predict distant metastasis in locoregionally advanced nasopharyngeal carcinoma: a retrospective, multicentre, cohort study. Lancet Oncol. 2018;19:382–393. doi: 10.1016/S1470-2045(18)30080-9.
- 41. Iasonos A, Schrag D, Raj GV, Panageas KS. How to build and interpret a nomogram for cancer prognosis. J Clin Oncol. 2008;26:1364–1370. doi: 10.1200/JCO.2007.12.9791.
- 42. Coutant C, Olivier C, Lambaudie E, Fondrinier E, Marchal F, Guillemin F, Seince N, Thomas V, Leveque J, Barranger E, et al. Comparison of models to predict nonsentinel lymph node status in breast cancer patients with metastatic sentinel lymph nodes: a prospective multicenter study. J Clin Oncol. 2009;27:2800–2808. doi: 10.1200/JCO.2008.19.7418.
- 43. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, Hoang CD, Diehn M, Alizadeh AA. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015;12:453–457. doi: 10.1038/nmeth.3337.
- 44. Thorsson V, Gibbs DL, Brown SD, Wolf D, Bortone DS, Ou Yang TH, Porta-Pardo E, Gao GF, Plaisier CL, Eddy JA, et al. The immune landscape of cancer. Immunity. 2018;48:812–830.e14. doi: 10.1016/j.immuni.2018.03.023.
- 45. Saltz J, Gupta R, Hou L, Kurc T, Singh P, Nguyen V, Samaras D, Shroyer KR, Zhao T, Batiste R, et al. Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell Rep. 2018;23:181–193.e7. doi: 10.1016/j.celrep.2018.03.086.
- 46. Danaher P, Warren S, Dennis L, D'Amico L, White A, Disis ML, Geller MA, Odunsi K, Beechem J, Fling SP. Gene expression markers of tumor infiltrating leukocytes. J Immunother Cancer. 2017;5:18. doi: 10.1186/s40425-017-0215-8.
- 47. He Y, Jiang Z, Chen C, Wang X. Classification of triple-negative breast cancers based on immunogenomic profiling. J Exp Clin Cancer Res. 2018;37:327. doi: 10.1186/s13046-018-1002-1.
- 48. Hanzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7. doi: 10.1186/1471-2105-14-7.
- 49. Rooney MS, Shukla SA, Wu CJ, Getz G, Hacohen N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell. 2015;160:48–61. doi: 10.1016/j.cell.2014.12.033.
- 50. Sanchez-Vega F, Mina M, Armenia J, Chatila WK, Luna A, La KC, Dimitriadoy S, Liu DL, Kantheti HS, Saghafinia S, et al. Oncogenic signaling pathways in The Cancer Genome Atlas. Cell. 2018;173:321–337.e10. doi: 10.1016/j.cell.2018.03.035.
- 51. Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 2011;12:R41. doi: 10.1186/gb-2011-12-4-r41.
- 52. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118.
- 53. Pencina MJ, D'Agostino RB Sr, D'Agostino RB Jr, Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat Med. 2008;27:157–172. doi: 10.1002/sim.2929.
- 54. Heydt C, Rehker J, Pappesch R, Buhl T, Ball M, Siebolts U, Haak A, Lohneis P, Büttner R, Hillmer AM, Merkelbach-Bruse S. Analysis of tumor mutational burden: correlation of five large gene panels with whole exome sequencing. Sci Rep. 2020;10:11387. doi: 10.1038/s41598-020-68394-4.
- 55. Litchfield K, Reading JL, Lim EL, Xu H, Liu P, Al-Bakir M, Wong YNS, Rowan A, Funt SA, Merghoub T, et al. Escape from nonsense-mediated decay associates with anti-tumor immunogenicity. Nat Commun. 2020;11:3800. doi: 10.1038/s41467-020-17526-5.
- 56. Turajlic S, Litchfield K, Xu H, Rosenthal R, McGranahan N, Reading JL, Wong YNS, Rowan A, Kanu N, Al Bakir M, et al. Insertion-and-deletion-derived tumour-specific neoantigens and the immunogenic phenotype: a pan-cancer analysis. Lancet Oncol. 2017;18:1009–1021. doi: 10.1016/S1470-2045(17)30516-8.
- 57. Anagnostou V, Niknafs N, Marrone K, Bruhm DC, White JR, Naidoo J, Hummelink K, Monkhorst K, Lalezari F, Lanis M, et al. Multimodal genomic features predict outcome of immune checkpoint blockade in non-small-cell lung cancer. Nat Cancer. 2020;1:99–111. doi: 10.1038/s43018-019-0008-8.
- 58. Knepper TC, Montesion M, Russell JS, Sokol ES, Frampton GM, Miller VA, Albacker LA, McLeod HL, Eroglu Z, Khushalani NI, et al. The genomic landscape of Merkel cell carcinoma and clinicogenomic biomarkers of response to immune checkpoint inhibitor therapy. Clin Cancer Res. 2019;25:5961–5971. doi: 10.1158/1078-0432.CCR-18-4159.
- 59. Chapuy B, Stewart C, Dunford AJ, Kim J, Wienand K, Kamburov A, Griffin GK, Chen PH, Lako A, Redd RA, et al. Genomic analyses of PMBL reveal new drivers and mechanisms of sensitivity to PD-1 blockade. Blood. 2019;134:2369–2382. doi: 10.1182/blood.2019002067.
- 60. Conway JR, Kofman E, Mo SS, Elmarakeby H, Van Allen E. Genomics of response to immune checkpoint therapies for cancer: implications for precision medicine. Genome Med. 2018;10:93. doi: 10.1186/s13073-018-0605-7.
- 61. Long J, Wang A, Bai Y, Lin J, Yang X, Wang D, Yang X, Jiang Y, Zhao H. Development and validation of a TP53-associated immune prognostic model for hepatocellular carcinoma. EBioMedicine. 2019;42:363–374. doi: 10.1016/j.ebiom.2019.03.022.
- 62. Gettinger S, Choi J, Hastings K, Truini A, Datar I, Sowell R, Wurtz A, Dong W, Cai G, Melnick MA, et al. Impaired HLA class I antigen processing and presentation as a mechanism of acquired resistance to immune checkpoint inhibitors in lung cancer. Cancer Discov. 2017;7:1420–1435. doi: 10.1158/2159-8290.CD-17-0593.
- 63. Shin DS, Zaretsky JM, Escuin-Ordinas H, Garcia-Diaz A, Hu-Lieskovan S, Kalbasi A, Grasso CS, Hugo W, Sandoval S, Torrejon DY, et al. Primary resistance to PD-1 blockade mediated by JAK1/2 mutations. Cancer Discov. 2017;7:188–201. doi: 10.1158/2159-8290.CD-16-1223.
- 64. La Fleur L, Falk-Sörqvist E, Smeds P, Berglund A, Sundström M, Mattsson JS, Brandén E, Koyi H, Isaksson J, Brunnström H, et al. Mutation patterns in a population-based non-small cell lung cancer cohort and prognostic impact of concomitant mutations in KRAS and TP53 or STK11. Lung Cancer. 2019;130:50–58. doi: 10.1016/j.lungcan.2019.01.003.
- 65. Peng W, Chen JQ, Liu C, Malu S, Creasy C, Tetzlaff MT, Xu C, McKenzie JA, Zhang C, Liang X, et al. Loss of PTEN promotes resistance to T cell-mediated immunotherapy. Cancer Discov. 2016;6:202–216. doi: 10.1158/2159-8290.CD-15-0283.
- 66. Aredo JV, Padda SK, Kunder CA, Han SS, Neal JW, Shrager JB, Wakelee HA. Impact of KRAS mutation subtype and concurrent pathogenic mutations on non-small cell lung cancer outcomes. Lung Cancer. 2019;133:144–150. doi: 10.1016/j.lungcan.2019.05.015.
- 67. Shrestha R, Nabavi N, Lin YY, Mo F, Anderson S, Volik S, Adomat HH, Lin D, Xue H, Dong X, et al. BAP1 haploinsufficiency predicts a distinct immunogenic class of malignant peritoneal mesothelioma. Genome Med. 2019;11:8. doi: 10.1186/s13073-019-0620-3.
- 68. Yarchoan M, Hopkins A, Jaffee EM. Tumor mutational burden and response rate to PD-1 inhibition. N Engl J Med. 2017;377:2500–2501. doi: 10.1056/NEJMc1713444.
- 69. Wan C, Keany MP, Dong H, Al-Alem LF, Pandya UM, Lazo S, Boehnke K, Lynch KN, Xu R, Zarrella DT, et al. Enhanced efficacy of simultaneous PD-1 and PD-L1 immune checkpoint blockade in high-grade serous ovarian cancer. Cancer Res. 2021;81:158–173. doi: 10.1158/0008-5472.CAN-20-1674.
- 70. Davoli T, Uno H, Wooten EC, Elledge SJ. Tumor aneuploidy correlates with markers of immune evasion and with reduced response to immunotherapy. Science. 2017;355(6322):eaaf8399. doi: 10.1126/science.aaf8399.
- 71. Le DT, Durham JN, Smith KN, Wang H, Bartlett BR, Aulakh LK, Lu S, Kemberling H, Wilt C, Luber BS, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science. 2017;357:409–413. doi: 10.1126/science.aan6733.
- 72. Luke JJ, Bao R, Sweis RF, Spranger S, Gajewski TF. WNT/β-catenin pathway activation correlates with immune exclusion across human cancers. Clin Cancer Res. 2019;25:3074–3083. doi: 10.1158/1078-0432.CCR-18-1942.
- 73. Lu S, Stein JE, Rimm DL, Wang DW, Bell JM, Johnson DB, Sosman JA, Schalper KA, Anders RA, Wang H, et al. Comparison of biomarker modalities for predicting response to PD-1/PD-L1 checkpoint blockade: a systematic review and meta-analysis. JAMA Oncol. 2019;5(8):1195–1204. doi: 10.1001/jamaoncol.2019.1549.
- 74. Li L, Greene T. A weighting analogue to pair matching in propensity score analysis. Int J Biostat. 2013;9:215–234. doi: 10.1515/ijb-2012-0030.
- 75. Ye Y, Jing Y, Li L, Mills GB, Diao L, Liu H, Han L. Sex-associated molecular differences for cancer immunotherapy. Nat Commun. 2020;11:1779. doi: 10.1038/s41467-020-15679-x.
- 76. Deng J, Chen H, Zhou D, Zhang J, Chen Y, Liu Q, Ai D, Zhu H, Chu L, Ren W, et al. Comparative genomic analysis of esophageal squamous cell carcinoma between Asian and Caucasian patient populations. Nat Commun. 2017;8:1533. doi: 10.1038/s41467-017-01730-x.
- 77. Ye Y, Hu Q, Chen H, Liang K, Yuan Y, Xiang Y, Ruan H, Zhang Z, Song A, Zhang H, et al. Characterization of hypoxia-associated molecular features to aid hypoxia-targeted therapy. Nat Metab. 2019;1:431–444. doi: 10.1038/s42255-019-0045-8.
- 78. Long J, Lin J, Wang A, Wu L, Zheng Y, Yang X, Wan X, Xu H, Chen S, Zhao H. PD-1/PD-L blockade in gastrointestinal cancers: lessons learned and the road toward precision immunotherapy. J Hematol Oncol. 2017;10:146. doi: 10.1186/s13045-017-0511-2.
- 79. Ready N, Hellmann MD, Awad MM, Otterson GA, Gutierrez M, Gainor JF, Borghaei H, Jolivet J, Horn L, Mates M, et al. First-line nivolumab plus ipilimumab in advanced non-small-cell lung cancer (CheckMate 568): outcomes by programmed death ligand 1 and tumor mutational burden as biomarkers. J Clin Oncol. 2019;37:992–1000. doi: 10.1200/JCO.18.01042.
- 80. Lebbe C, Meyer N, Mortier L, Marquez-Rodas I, Robert C, Rutkowski P, Menzies AM, Eigentler T, Ascierto PA, Smylie M, et al. Evaluation of two dosing regimens for nivolumab in combination with ipilimumab in patients with advanced melanoma: results from the phase IIIb/IV CheckMate 511 trial. J Clin Oncol. 2019;37:867–875. doi: 10.1200/JCO.18.01998.
- 81. McKay RR, Bosse D, Choueiri TK. Evolving systemic treatment landscape for patients with advanced renal cell carcinoma. J Clin Oncol. 2018;36(36):JCO2018790253. doi: 10.1200/JCO.2018.79.0253.
- 82. Ott PA, Bang YJ, Piha-Paul SA, Razak ARA, Bennouna J, Soria JC, Rugo HS, Cohen RB, O'Neil BH, Mehnert JM, et al. T-cell-inflamed gene-expression profile, programmed death ligand 1 expression, and tumor mutational burden predict efficacy in patients treated with pembrolizumab across 20 cancers: KEYNOTE-028. J Clin Oncol. 2019;37:318–327. doi: 10.1200/JCO.2018.78.2276.
- 83. Long J, Wang D, Wang A, Chen P, Lin Y, Bian J, Yang X, Zheng M, Zhang H, Zheng Y, et al. A mutation-based gene set predicts survival benefit after immunotherapy across multiple cancers and reveals the immune response landscape. GitHub repository; 2022. https://github.com/longjunyu/Pancan-ICI.
Data Availability Statement
All data used in this study are from public datasets and can be accessed without restriction. The web links or unique identifiers for the public datasets are given in the paper. The code used for data analysis in this manuscript has been deposited in GitHub (https://github.com/longjunyu/Pancan-ICI) [83].