Abstract
Defects in pathways governing genomic fidelity have been linked to improved response to immune checkpoint blockade therapy (ICB). Pathogenic POLE/POLD1 mutations can cause hypermutation, yet how diverse mutations in POLE/POLD1 influence anti-tumor immunity following ICB is unclear. Here, we comprehensively determined the effect of POLE/POLD1 mutations in ICB and elucidated the mechanistic impact of these mutations on tumor immunity. Murine syngeneic tumors harboring Pole/Pold1 functional mutations displayed enhanced anti-tumor immunity and were sensitive to ICB. Patients with POLE/POLD1 mutated tumors harboring telltale mutational signatures respond better to ICB than patients harboring wild-type or signature-negative tumors. A mutant POLE/D1 function-associated signature-based model out-performed several traditional approaches for identifying POLE/POLD1 mutated patients that benefit from ICB. Strikingly, the spectrum of mutational signatures correlates with the biochemical features of neoantigens. Alterations that cause POLE/POLD1 function-associated signatures generate TCR-contact residues with increased hydrophobicity, potentially facilitating T-cell recognition. Altogether, the functional landscapes of POLE/POLD1 mutations shape immunotherapy efficacy.
Although immune checkpoint blockade (ICB) therapy is effective in multiple cancer types, durable response to ICB is still limited to a minority of patients1,2. Recent studies from us and others have revealed that pathogenic alterations in DNA damage repair pathways (DDR) are associated with improved response to ICB3-7. For instance, mismatch repair deficiency (MMRd), which results in microsatellite instability (MSI), is an FDA-approved indication for anti-PD1 therapy regardless of cancer type8. Mechanistically, MMRd increases tumour mutational burden (TMB) and indel load which contributes to an elevated level of neoantigens, promoting anti-tumor immunity9.
Polymerase epsilon and delta (POLE and POLD1, hereby called POLE/D1) are DDR genes that are genetically altered in nearly 4% of tumors across all cancer types, with 10-13% of these cases due to germline variants10. These proteins are catalytic subunits of DNA polymerases that are responsible for DNA synthesis during cell division and DNA damage repair11. Certain pathogenic mutations occurring both within and outside of the exonuclease domains of POLE/D1 can result in a hypermutator phenotype12-14. Recent studies reported that tumors harboring POLE/D1 pathogenic mutations are highly infiltrated with immune cells, suggesting these mutations may lead to improved immune recognition which potentially correlated with favorable response to ICB15-19; however, the functional significance of most of these mutations, including their effects on inducing genomic alterations, role in anti-tumor immunity, and modulation of immunotherapy response remains unclear12,20-22.
Well-known functional mutations (i.e. mutations that perturb function) in the exonuclease domains of POLE and POLD1 can produce telltale mutational signatures12,23,24. The COSMIC single-base substitutions (SBS) signature SBS10a/b are associated with proof-reading defects of polymerases in samples that have intact mismatch repair machinery, while SBS14/20 are associated with concurrent POLE/D1 mutations and MMRd. There is an opportunity to identify and characterize other pathogenic POLE/D1 mutations and their impact on tumor immune surveillance and immunotherapy outcomes based on POLE/D1 functional mutations associated mutational signatures25.
Here, we present a comprehensive evaluation of the functional implications of the mutational landscape associated with POLE/D1 alterations, including their role in inducing specific mutational patterns, altering the immune microenvironment, and shaping ICB response.
Results
Pole/Pold1 functional alterations directly sensitize tumors to ICB
Human POLE/POLD1 and mouse Pole/Pold1 are highly homologous with 91% and 90% amino acid similarity respectively (Supplementary fig. 1&2). We first introduced a well-established PoleP286R hotspot functional mutation22,26,27 into the murine B16F10 melanoma cell line, using the Clustered Regularly Interspaced Short Palindromic Repeats-Homology-directed Repair (CRISPR-HDR) technique (Extended data fig. 1a; Supplementary fig. 1&3)28. We detected a 4.7-fold increment of de novo single nucleotide variants (SNVs; Extended data fig. 1a&b & Methods) in the mutant cell lines compared to parental cell lines after 8-weeks of in vitro passaging (P=1.7e-5; Fig. 1a, Extended data fig. 1c). New indels (P=0.82) , somatic copy number variations (SCNVs, P=0.74) and the MSIsensor score29 (P=0.68), remained similar (Fig. 1b). These results are consistent with the observation in patients 30,31.
The growth of the B16F10 PoleP286R tumors, was dramatically reduced by either mono- or combination ICB in the C5BL/6J host (anti-CTLA4, P=0.002; anti-PD1, P=0.003; Combo, P= 0.0009, Fig. 1c-e, Methods), and this was validated in a separate B16F10 PoleP286R clone (anti-CTLA4, P=0.0003; anti-PD1, P=0.0002; Combo, P= 0.0001; Fig. 1e, Extended data fig. 1d). Conversely, only combination treatment moderately delayed the growth of the parental tumors (P=0.007, Fig. 1d). All three types of ICB improved the overall survival of the mice bearing the PoleP286R tumors, while only combination therapy modestly extended the survival of mice bearing parental tumors (Extended data fig. 1e). Immunofluorescence staining of ICB-treated tumors showed at least 3-fold increment of Cd3+ T cell infiltration in PoleP286R tumors (Fig. 1f-g), and even in the IgG treatment arm, suggesting that the mutant tumors are more inflamed even before therapy and poised to respond to ICB. A similar response to ICB, and immune infiltration pattern were observed in CT-26 mouse colorectal tumors harboring the PoleP286R mutation and B16F10 tumors harboring the PoleV411L hotspot functional mutation12 (Fig. 1h-l; Extended data fig. 1f-I; Supplementary fig. 3&4; Supplementary note 1), suggesting that functional mutations in Pole likely induce sensitivity to ICB.
We investigated whether Pold1 pathogenic mutations also directly drive response to ICB. We generated B16F10 mutant cell lines harboring Pold1L472P functional mutation32 or Pold1E372K variant of unknown significance (hereby referred as VUS), which is associated with ICB response as well as POLD1-dependent mutational signature in patients6 (Supplementary fig. 2&5, Supplementary note 2). Tumors harboring these two Pold1 mutations are also more sensitive to anti-PD1 therapy (P=1.4e-4; P=0.0051, Fig. 1m). These results indicate that Pole/Pold1 functional mutations or VUSes that generate mutational signatures associated with POLE/POLD1 activity (SBS10a, SBS10b, SBS14, and/or SBS20; hereafter called function-associated signatures) influence sensitivity to ICB.
Pole functional mutations enhance tumor immunogenicity
To elucidate how POLE functional mutations influence the immune microenvironment of tumors prior to ICB, we implanted B16F10 PoleP286R mutant and parental cell lines into nude and C57BL/6J recipient mice subcutaneously. Mutant and parental tumors showed similar growth kinetics in immunodeficient nude mice, but showed growth delay in the immuno-competent C57BL/6J recipients, consistent with a stronger effect of immunosurveillance against mutant tumors (Fig. 2a-b). RNA-seq analysis of treatment-naive parental and mutant tumors harvested at 14 days post-implantation revealed distinct gene expression profiles (Extended data fig. 2a). Gene set enrichment analysis (GSEA) indicated that gene sets related to T-cell infiltration, natural killer (NK) cell infiltration, adaptive immune response, and inflammation are highly enriched in mutant tumors versus the parental tumors (Fig. 2c, Extended data fig. 2a-b).
We further performed flow cytometry analysis on the single cell suspension extracted from mutant and parental tumors 14 days after the initial implantation into B6 recipient mice. PoleP286R mutant tumors showed ~2.5-fold increase of Cd45+ immune cell influx over parental tumors (P=0.002; Fig. 2d) and this was validated by the expression of Ptprc from RNA-seq data (P=0.0008, Extended data fig. 2d). Compared to parental tumors, mutant tumors showed a ~5-fold higher percentage of Cd8+ T-cell infiltration (P=0.002) and reduced proportions of immuno-suppressive regulatory T-cells (Treg; P=0.0087, Fig. 2e). Cd8+ T cells in the mutant tumors had increased proliferation (Ki67+, P=0.018) and were potentially activated, as assessed by the cytotoxic marker Gzmb (P=0.015, Fig. 2e). There was also a higher fraction of Pd1+ Cd8+ but not Lag3+ or Tigit+ Cd8+ T cells in the mutant tumors (P=0.026, Fig. Extended data fig. 2e). Additionally, mutant tumors also showed significant differences in the composition of the innate immune compartment when compared to the parental tumors, with increased infiltration of Natural killer cells (NK, P=0.0053), reduced fractions of tumor associated macrophages (TAM, P=0.0056) and myeloid-derived suppressor cells (MDSCs, P=0.0022, Fig. 2f-g). TAMs in mutant tumors demonstrated lower expression of Cd204 (P=0.043) and Cd206 (P=0.0087), markers for M2 polarization, indicating a more inflammatory microenvironment (Extended data fig. 2f). Along with the increased Pd1+ Cd8+ T cell population, TAMs in mutant tumors also expressed higher levels of Pd-l1 (P=0.045, Extended data fig. 2f).
We next explored how Pole functional mutations influence the immune microenvironment of tumors after ICB. Gene expression profile analysis showed that immune related pathways were upregulated in post-ICB PoleP286R tumors (Extended data fig. 3a-e, Supplementary note 3). We inferred the presence of adaptive and innate immune cell types in post-ICB samples via single sample GSEA based cell type specific gene signatures enrichment33 and performed principal component (PC) analysis (Extended data fig. 4a&b). The first two PCs explained 46.49% and 15.59% of the variation observed (Fig. 2h, Extended data fig. 4b). Based on these two PCs, significant separation between the PoleP286R and parental tumors was observed in the permutational multivariate analysis of variance (PERMANOVA test; P<0.001; Fig. 2h), contributed by both adaptive and innate immune cell types (Fig. 2i-j, Extended data fig. 4d&e; Supplementary note 3). Analysis of the TCR-beta CDR3 clonotypes also showed higher levels of clonal expansion, richness and reduced evenness in post-ICB PoleP286R tumors (Extended data fig. 4f-h, Supplementary note 3). Altogether, these results indicated that Pole functional mutations alter both the adaptive and innate immune compartments and contribute to more inflamed tumor immune microenvironments before and after ICB.
PoleP286R show similar mutational signatures to human POLE tumors
Pathogenic POLE/D1 mutations generate somatic mutations following specific patterns, e.g., C>A transversions in a TCT trinucleotide context, C>T transition in a TCG trinucleotide context or T>G in a TTT trinucleotide context23,24. As we found that isogenic PoleP286R cell lines can accumulate SNVs during in vitro culturing (Fig. 1a), we wanted to explore these mutation patterns. As anticipated, the six SNV category profiles of the parental and mutant cell lines before in vitro culturing were similar (‘baseline mutations’; Fig. 3a; Extended data fig. 5a). In contrast, de novo mutations in the mutant cell lines showed a higher percentage of ‘C>T’ (1.2-fold, P=0.005) and ‘T>G’ (2.4-fold, P=0.007) mutations, indicating that pathogenic Pole mutations can induce a distinct mutational pattern (Fig. 3a; Extended data fig. 5a)
We next generated de novo mutational signatures using the non-negative matrix factorization (NMF) of the 96-trinucleotide-context of baseline and de novo SNVs from 0-week and 8-week passaged cell lines (Extended data fig. 5a, Method)34. Three de novo mutational signatures were identified; designated mSigA, mSigB and mSigC (Fig. 3b). mSig. B, predominantly harboring C>T transitions in a TCG context, is strongly detected in de novo mutations from the mutant cell lines, and closely resembles the COSMIC SBS10b (cosine similarity of 0.82), one of the four characteristic single-base substitution (SBS) signatures observed in human tumors with POLE/D1 pathogenic mutations (Fig. 3b-c)24. mSig. A&C are signatures present in baseline or de novo mutations identified from the parental tumors. They showed distinct profiles and are potentially associated with other mutational processes (Fig. 3b-c; Supplementary note 4). With additional validation (Extended data fig. 5b-e; Supplementary note 5), these results indicated that PoleP286R mutation generated a similar de novo SNV profile in murine cell lines, as observed in patient samples harboring POLE functional mutations.
Signature-based model identifies tumors with functional POLE/D1 mutations
Next, we wanted to further systemically investigate the association of POLE/D1 functional mutations with response to immunotherapy. We reasoned that truly functional mutations will generate specific mutational signatures, as we observed in the murine cell lines, and thus sought to develop a logistic regression model to predict whether a tumor sample harbors POLE/D1 functional mutations based on the four POLE/POLD1-associated COSMIC SBS signatures (hereafter function-associated signatures).
We trained our model on a training set containing samples with known somatic POLE/D1 functional mutations with functional validation from previous literature and the OncoKB database35 (N=74 samples; Methods; Supplementary table 1), or POLE/D1 wild-type tumor samples with at least 20 SNVs (N=8757 samples) detected from WES sequencing from the Cancer Genome Atlas Program (TCGA) pan-cancer cohort (Fig. 4a; Extended data fig. 6a; Methods)12,35,36. The logistic regression model (WES-model) showed a high AUC (0.9870; 95%CI: 0.9697-1) on the training set (Fig. 4b). We selected an optimal cutoff for balance between sensitivity and specificity based on the Youden Index (Cutoff=0.6274, Extended data fig. 6b)37 and reached an overall accuracy of 0.9852 (95%CI: 0.9824-0.9876) with high sensitivity and specificity (Sensitivity=0.9459, 95%CI: 0.8673-0.9851; Specificity=0.9855, 95%CI: 0.9828-0.9879; Fig. 4b). We then tested our model on a test set composed of additional tumor samples from the International Cancer Genome Consortium (ICGC) and the Cancer Cell Line Encyclopedia (CCLE) datasets with defined functional POLE/D1 mutation status from somatic mutation calling and harboring at least 20 SNVs (functional N=16 samples, wild-type N=7822 samples; Extended data fig. 6c; Methods)38,39. The WES-model led to an AUC of 0.8929 (95%CI: 0.7806-1) and an overall accuracy of 0.9908 (95%CI: 0.9884-0.9928; Fig. 4c). We found that the false-negative predictions are associated with earlier MMRd events or other dominant mutation processes in tumors, while the false-positive prediction may be associated with low SNV load caused by genetic or environmental factors distinct from POLE/D1 functional mutations (Extended data fig. 6d-m; Supplementary note 6).
Panel sequencing is more widely used in clinical practice. Reduced numbers of mutations covered, together with distinct trinucleotide-context, lead to shifts of mutational signature readouts and sub-optimal prediction of our model on the MSK-IMPACT panel data (Extended data fig. 7. a-b; Supplemental note 7)40. Therefore, we built a separate model on the new training set containing pan-cancer samples with MSK-IMPACT data and at least 10 SNVs (sample with somatic functional mutations =137, wild-type sample=7649, Extended data fig. 7c-d; Methods). The IMPACT-model showed an AUC of 0.9560 and with a cutoff that delivered an accuracy of 0.9409 (cutoff=0.4935, accuracy 95%CI: 0.9355-0.9461; sensitivity=0.8978, 95%CI: 0.8344-0.9429; specificity=0.9416, 95%CI:0.9362-0.9468; Fig. 4d, Extended data fig. 7e). All of the 14 false-negative predictions showed either MMRd or ultraviolet radiation (UV) associated SBS signatures (SBS7a/b, Extended data fig. 7f). The IMPACT-model also demonstrated a high AUC of 0.9470 (95%CI: 0.9319-0.9801) and an overall accuracy of 0.9340 (accuracy 95%CI: 0.9231-0.9438; sensitivity=0.8888, 95%CI: 0.7927-0.9507; specificity=0.9354, 95%CI: 0.9244-0.9453) on the test set generated from the TCGA (Fig. 4e; Extended data fig. 7g; Methods).
We applied these two models and identified 85 samples that were POLE/D1 function-associated signature-positive out of the 2607 POLE/D1 VUS samples from the TCGA/ICGC/CCLE and MSK-IMPACT cohorts (Fig. 4f). To assess the functional ambiguity of these VUS samples with function-associated signatures, we expanded our baseline functional mutation list from a comprehensive literature review (Supplementary note 8) and found that 25% of these VUS samples are associated with functional mutations found in clinical studies or validated in other model systems, and showed similar genomic features to the samples harboring known POLE/D1 functional mutations (Fig. 4g, Supplementary table 4, Supplementary note 8).
We compared the immune landscape of samples that harbored either known functional mutations or VUSes and are function-associated signature-positive (hereby referred to as “functional mutation/signature-positive”) with two other groups: 1. samples which did not harbor any known POLE/D1 functional mutation and also were predicted as passenger mutation samples by our signature-based model (hereby referred to as “functional mutation/signature-negative”), and 2. POLE/D1 wild-type samples which were false positive (FP) predictions from the TCGA endometrial cohort (hereby referred as “false-positive prediction wild-type samples”). We also performed comparative analyses with the baseline mouse PoleP286R tumors. The functional mutation/signature-positive tumors are more immune active and share immune features with the mouse PoleP286R tumors (Fig. 5a-f; Extended data fig. 7h-j, Extended data fig. 8 a-c; Supplementary note 9).
POLE/D1 function-associated signature-based model predicts patient outcome
POLE/D1 functional mutations can be observed concurrently with MMRd, a well-known genetic feature which leads to the MSI-H phenotype and promotes response to ICB9. To determine the independent effect of POLE/D1 functional mutations on ICB outcome, we applied our model to identify patients with POLE/D1 function-associated signatures in a pan-cancer cohort containing 2,700 MSI-Stable/Low cancer patients that received anti-PD-1/PD-L1 therapy and MSK-IMPACT sequencing from 2015 (Methods). Within this cohort, a group of patients that harbored at least one somatic POLE/D1 mutation (N=172) showed modest improvement in overall survival (OS) with ICB when corrected for cancer types (HR=0.74, P=0.0046, 95% CI=0.60-0.91; Fig. 6a; Supplementary table 5). From these patients, we identified a subgroup of 24 patients that harbored known POLE/D1 somatic functional mutations (N=12), or harbored POLE/D1 somatic VUSes but were predicted to be POLE/D1 function-associated signature-positive samples (N=12) based on our logistic regression model (Methods, Extended data fig. 8d, Supplementary table 6). Ten out of the twelve known POLE/D1 functional mutation-positive samples were also predicted as POLE/D1 function-associated signature-positive, while the remaining two have shown dominant UV (SBS7a/b) or MMRd signatures (Extended data fig. 8d). The tumors from these POLE/D1 functional mutation/signature-positive patients showed higher levels of TMB and lower levels of copy number variants than the tumors from the rest of the POLE/D1 mutated patients that are POLE/D1 function-associated signature-negative (P=2.2e-5; P=0.0021), or the tumors from the POLE/D1 wild-type patients (P<2.2e-16; P=0.0095; Extended data fig. 8e-f). We found that the POLE/D1 functional mutation/signature-positive patients showed significantly better overall survival than the POLE/D1 wild-type patients, even after correcting for cancer type in a multivariable coxph regression (HR=0.25, P=0.0072, 95% CI: 0.11-0.56, Fig. 6b), or when limiting the cohort to POLE/D1 functional mutation/signature-positive tumors and their histology-matched wild-type controls (HR=0.25, P=0.0008, 95% CI: 0.11-0.56, Extended data fig. 8g). Importantly, this survival benefit is not driven by a specific histology (Extended data fig. 8h; Supplementary note 10).
As response rate and progression-free survival (PFS) are also key measurements of efficacy of immunotherapy, we analyzed the immunotherapy-specific clinical response data via manual review of clinical records from the POLE/D1 functional mutation/signature-positive patients41 (Methods). Among the 24 POLE/D1 functional mutation/signature-positive patients, 18 patients experienced durable clinical benefit for at least 6 months, including four patients with complete response (CR), while only six patients experienced progression of disease (PD; Fig. 6c-d). Of the four complete responders, two harbored known POLE/D1 functional mutations while the other two harbored POLE/D1 VUSes and are function-associated signature positive patients. Of these complete responders, one harbored the classical POLEP286R mutation (Extended data fig. 8i; Supplementary note 11); the other harbored the POLEE277Q VUS (Fig. 6e; Supplementary note 11), indicating that ICB can induce long-lasting complete clinical and biochemical response in patients with either known POLE/D1 functional mutations or VUSes with function-associated signatures. Compared to the 30.0% clinical benefit rate of the 1038 POLE/D1 wild-type patients with response data available, the overall 75.0% overall clinical benefit rate of the 24 POLE/D1 functional mutation/signature-positive patients is significantly higher (OR=7.0, P<0.0001, Fig. 6f). This difference was also observed in the histology-matched setting (OR=6.6, P<0.0001, Extended data fig. 9a) or when limiting the comparison to individual histology category (Fig. 6f).
Consistent with the OS data, the overall POLE/D1 mutant patient group only experienced modest PFS benefit when compared to the wild-type patients (HR=0.50, P=1e-07, 95% CI: 0.38-0.65, Extended data fig. 9b), while the patients with the POLE/D1 functional mutation/signature-positive tumors showed a much more robust PFS benefit (HR=0.2, P=6.9e-07, 95% CI: 0.11-0.38, Fig. 6g), even when considering histology of the tumors (HR=0.2, P=7.4e-07, 95% CI:0.11-0.38, Extended data fig. 9c, Fig. 6h). Interestingly, the POLE/D1 functional mutation/signature-positive patients have improved response and survival even when compared to all other POLE/D1 mutated patients that are negative for POLE/D1 functional mutations and function-associated signatures, indicating that the function-associated signature-based model enriched for a subgroup of patients that might benefit more from immunotherapy (PFS: HR=0.16, P=0.0002, 95% CI:0.062-0.42; OS, HR=0.25, P=0.0023, 95% CI:0.10-0.61; Response OR= 3.4, P=0.03; Fig. 6i; Extended data fig. 9d-e). In contrast, false-positive prediction wild type patients (i.e., the wild-type false positive predictions by the logistic regression model) did not show improved ICB outcome when compared to the true negative prediction wild type patients (i.e., the wild-type true negative predictions by the MSK-IMPACT model, OS, N=49, 2479, HR=1.1, P=0.32, 95%CI:0.92-1.3; PFS, N=23, 1016, HR=0.93, P=0.55, 95%CI:0.75-1.2; Response OR=0.82, P=0.82; Extended data fig. 9f-h), indicating that POLE/D1 function-associated signatures do not predict better survival or response to ICB without the presence of POLE/D1 functional mutations. Altogether, the POLE/D1 functional mutation/signature-positive patients show better outcome with immunotherapy than the other POLE/D1 mutant patients or wild-type patients.
Signature-based model outperforms traditional approaches
As we observed that the POLE/D1 functional mutation/signature-positive patients have improved survival compared to other POLE/D1 mutated patients (Fig. 6i, Extended data fig. 9d-e), we compared our prediction approach with other methods to enrich for tumors with POLE/D1 functional mutations. We first considered two of such methods: (i) POLE/D1 mutated tumors with hypermutation phenotype (50 Mutations/Mb) (hypermutated, N=36)12 and (ii) tumors with at least one POLE/D1 exonuclease domain mutation (N=37) (Fig. 7a). There are 23 out of 36 hypermutated tumors and 21 out of 37 exonuclease domain mutation-positive samples are neither positive for known POLE/D1 functional mutations nor positive for POLE/D1 function-associated signatures, indicating that these two approaches are identifying relatively distinct patient populations from the POLE/D1 functional mutation/signature-positive patients (Fig. 7a). To assess the independent value of our signature-associated model against these other approaches, we generated multivariable Cox proportional hazard models (Coxph) and found that only the POLE/D1 functional mutations/ function-associated signatures, and hypermutation are independent predictive beneficial factors for both OS and PFS (Fig. 7b; Extended data fig. 9i). Exonuclease domain mutations of POLE/D1 have been used as a criterion to select POLE/D1 functional mutations for a long time32. Indeed, patients with POLE/D1 exonuclease domain mutations showed slightly better OS and PFS after immunotherapy (OS, HR=0.57, P=0.039, 95% CI: 0.36-0.97; PFS, HR=0.26, P=0.0030, 95% CI: 0.11-0.64; Extended data fig. 9j-k). However, when excluding the small portion of the POLE/D1 functional mutation/signature-positive patients from the POLE/D1 exonuclease domain mutation-positive population, we did not see survival benefit for the rest of the POLE/D1 exonuclease domain mutation-positive patients, compared to POLE/POLD1 wild-type patients (OS, HR=0.71, P=0.24, 95% CI: 0.40-1.26; PFS, HR=0.80, P=0.55, 95% CI: 0.39-1.64; Fig. 7c-d), revealing that the survival benefit of POLE/D1 exonuclease domain mutations is largely explained by the POLE/D1 exonuclease domain mutation-positive samples that are also predicted to be POLE/D1 functional mutation/signature-positive. In contrast, the POLE/D1 functional mutation/signature-positive patients whose mutations are out of the exonuclease domains of POLE/POLD1, or the POLE/D1 functional mutation/signature-positive patients whose tumor did not show hypermutator phenotype still showed better OS and PFS, in comparison with the POLE/D1 wild-type patients, suggesting that the predictive effect of POLE/D1 functional mutations/signatures is partially independent of POLE/D1 exonuclease domain mutations or hypermutator phenotype (Fig. 7e-f; Extended data fig. 9l-m).
We further compared our model approach with ‘known POLE/D1 functional mutation-positive’, ‘POLE/D1 exonuclease domain mutation-positive’, hypermutation and in silico function prediction-based algorithms on ICB outcomes, the results indicated that the function-associated signature-based model significantly out-perform most of these methods (Fig. 7g-h; Extended data fig. 10a-c; Supplementary note 12). Importantly, ‘POLE/D1 functional mutations/signatures’ showed independent predictive power for OS and PFS from TMB in multivariable coxph models (Fig. 7i, Extended data fig. 10d), or in ICB patient subsets that are TMBhi (i.e., with at least 10 mutations per Mb exome) or with balanced TMB distributions (Fig. 7j, Extended data fig. 10f-g, Supplementary note 13), indicating other biological mechanisms other than increased TMB also contribute to the improved ICB outcome of the POLE/D1 functional mutation/signature-positive patients.
Taken together, these results indicated that function-associated signature-generating POLE/D1 mutations, even those that are not classic functional mutations, predict outcome after anti-PD1/PD-L1 therapy.
Mutational signatures affect immunogenicity of neoantigens
Given that we observed a TMB-independent predictive effect of POLE/D1 function-associated signatures on ICB outcome, we wanted to further explore the potential underlying mechanism. While mutational signatures were associated to different probability of generating different SNV classes, the POLE/D1 function-associated mutational signatures do not differ from other COSMIC SBS signatures in their ability to generate missense SNV mutations, which could alter amino acid codons and generate immunogenic neo-peptides (Extended data fig. 10h, Supplementary table 7, Supplementary note 8). Further, the SNVs in the POLE/D1 functional mutation/signature-positive samples are also not more likely to generate Human leukocyte antigen class-I (HLA-I) binding peptides (Fig. 8a-b; Supplementary note 15).
Increased hydrophobicity and decreased polarity are highly associated with enhanced immunogenicity of HLA-I binding peptides42-44. We found that the change-in-hydrophobicity (Δhydrophobicity) of the neo-peptides versus the corresponding wild-type peptides from the POLE/D1 functional mutation/signature-positive samples in the TCGA cohort is higher than that of the neo-peptides derived from the POLE/D1 functional mutation/signature-negative samples (P=0.0075, Fig. 8c). As the Δhydrophobicity of the neo-peptides is the consequence of amino acid alterations, we further looked at the Δhydrophobicity at each residue position in the predicted HLA-I binding neo-peptides (all 8-11 mers). The Δhydrophobicity of the amino acid alterations at the TCR-contact residues was significantly higher in the POLE/D1 functional mutation/signature-positive samples but not the false-positive prediction wild type samples, or that in the POLE/D1 functional mutation/signature-negative samples (Fig. 8d-e), which could be associated with higher immunogenicity42,45. To understand the basis of this difference, we examined the missense SNVs which generate specific amino acid alterations and found that mutational signatures are associated with the presence of different amino acid alterations (i.e., wild-type and mutant amino acid pairs) (Extended data fig. 10i; Supplementary table 8). For instance, the APOBEC-associated SBS2 signature is strongly associated with E-to-K alterations, while POLE/D1-related SBS10a is more associated with D/S-to-Y and L/R-to-I alterations. Strikingly, we found that two of the classical POLE/D1 function-associated signatures (SBS10a/b) are among the mutational signatures that are most likely to generate amino acid alterations which increase the hydrophobicity and decrease the polarity of the resultant neo-peptides (Fig 8. f-g, Supplementary table 9). Altogether, these results indicated that POLE/D1 function-associated signatures likely generate amino acid alterations that increase the hydrophobicity of the neo-peptides, enhancing immunogenicity (see Supplementary note 15-16).
Discussion
Here, we showed that POLE/D1 functional mutations are sufficient to induce anti-tumor immunity and impart sensitivity to ICB in both human and mouse. The introduction of POLE/D1 functional mutations into murine syngeneic tumors led to enhanced baseline activity of both adaptive and innate immune compartment, which imparted sensitivity to ICB. This effect appeared to be independent of cancer type in our model systems. These observations are consistent with previous studies in human15, and transgenic mice46.
Previous studies showed the value of using mutational signatures to identify pathogenic mutations in DDR pathways47. Here, our models trained on POLE/D1 function-associated signatures can distinguish samples harboring function POLE/D1 mutations from wild-type samples with high accuracy using both WES and targeted panel sequencing. Furthermore, our function-associated signature-based approach was superior in accurately classifying POLE/D1 mutated tumors that respond to ICB and out-performed most of the other traditional computational approaches for identifying likely pathogenic variants.
We also found that the spectrums of the SBS signatures are associated with different amino acid alterations. Classical POLE/D1 function-associated signatures SBS10a/b are more likely to be associated with amino acid alterations that increase the hydrophobicity of neo-peptides, potentially leading to higher immunogenicity42,48, and may contribute to the exceptional ICB outcome of the POLE/D1 functional tumor patients. Consistently, recent studies have indicated that specific mutational patterns associated with smoking and UV can generate more immunogenic neoantigens, highlighting the importance of utilizing mutational signatures to understand patients’ response to ICB49,50.
One limitation of this study pertains to the syngeneic models we used. These cell lines have relatively high baseline SNV burdens but provided easy isogenic-based comparisons. Additionally, our function-associated signature-based model generates false predictions and can be further improved. Future validation of the POLE/D1 function-associated signature-positive VUSes in yeast, mouse or human cell lines will determine whether they are bona fide functional mutations.
In summary, our study has significant utility for the interpretation of POLE and POLD1 mutations in the setting of ICB treatment. We provide an extensive category of immunologically relevant mutations and a definitive framework for using mutational signatures to understand the functional consequences of these mutations on ICB response and tumor immunology.
Methods
The use of the patient data was approved by the MSKCC Institutional Review Board (IRB). All patients provided informed consent to a Memorial Sloan Kettering IRB-approved protocol. The animal experiments were approved by the institutional animal care and use committee (IACUC) of MSKCC (Protocol# 07-09-015) and the Lerner Research Institute, the Cleveland Clinic Foundation (Protocol# 002467).
Generation of isogenic murine cell lines harboring Pole/Pold1 mutations
B16F10 (CRL-6475), a murine melanoma cell line derived from a spontaneous melanoma tumor which is widely used in immuno-oncology studies51-53, and CT-26 (CRL-2638) cell lines, an N-nitroso-N-methylurethane-(NNMU) induced murine colon carcinoma cell line that is a well-accepted model for testing efficacy of immunotherapy52-54, were obtained directly from ATCC prior to experimentation and cultured in RPMI-1640 complete medium with 10% FBS and antibiotics. 6-7 weeks old female C57BL6J, Nude or Balbc/J mice (Jackson Laboratories) were used for animal experiments. CRISPR-HDR editing was performed according to published methods28,55. CRISPR-Cas9 guide RNAs were designed using design tool provided by Benchling (Biology Software, 2019). Single-stranded oligo deoxynucleotides (ssODNs) aligned to the non-targeting strands with 40nt overhangs at each side of mutated regions were designed to introduce desired non-synonymous mutations, and to introduce synonymous mutations to disrupt potential binding between guide RNA and HDR production as well as to generate new restriction enzyme sites for downstream screening. As multiple guides were tested to generate the mutation cell lines, each ssODN may contain multiple synonymous mutations corresponding to PAM sites of multiple guides. The ssODNs were then synthesized by IDT with 5’ and 3’ phospho-modification on the first and last nucleotides. For generation of B16F10 mutant cell lines, briefly, optimized guide RNAs were cloned into the pSpCas9(BB)-2A-GFP (PX458), gift from Dr. Feng Zhang (Addgene). Transient transfection of 2ug plasmids and 1ul of 40uM ssODNs containing desired mutations and restriction enzyme sites was performed using Genejet-B16F10 (SignaGen) reagent according to manufacturer’s instruction. GFP positive live single cells were sorted into single wells containing culture medium 48-72 hours after transfection. For CT-26 mutant cell lines, Alt-R® CRISPR-Cas9 crRNAs containing the optimized guide sequences were ordered from IDT, then annealed together with Alt-R® CRISPR-Cas9 tracrRNA, ATTO™ 550 (IDT) and Alt-R® S.p. Cas9 Nuclease V3 (IDT) to form RNP complex then co-transfected with 1ul of 10uM ssODN template containing desired mutation and restriction enzyme site using RNAiMAX transfection reagent (Invitrogen). ATTO 550 positive live single cells were sorted into single wells containing culture medium 48 hours after the transfection. Single cell derived colonies were expanded and screened for incorporation of restriction enzyme cutting sites by direct digestion of PCR products (TseI for PoleP286Ralleles, AcuI for PoleV411Lalleles, HindIII for Pold1L472P Alleles, and NdeI for Pold1E372K alleles). Digestion products from clones that showed the desired digestion pattern were further sent for Amplicon sequencing (EZ-Amplicon, Genewiz) to validate the incorporation of the functional mutations using Crispresso v2 package56. Parental cell lines and validated mutant cell lines were cryopreserved as passage 0 week (P0w) and continuously cultured for 8 weeks in vitro and cryopreserved again at passage 8 weeks (P8w). All DNA oligo sequences will be available in Supplementary table 10.
Animal experiments
For mouse syngeneic tumor models, 2 x 105 B16F10 or CT-26 cells in 100 μl of PBS were injected into the left flanks subcutaneously in 6-7 weeks old female C57BL6J, Nude or Balbc/J mice (Jackson Laboratories). Mice with palpable tumors (20-100mm3) were randomized into indicated treatment arms. IgG (2A3, Bioxcel, 100μg; polyclonal Syrian hamster IgG, 250μg), anti-PD-1 (RMP1-14, Bioxcel 100μg), anti-CTLA4 (9H10, Bioxcel 250μg), or combination of anti-PD-1 and anti-CTLA4 antibodies were administered intraperitoneally in 100 μl of PBS twice weekly (every 3-4 days)7. Mice with ulcerative tumors or intramuscular tumors were excluded from analysis. Tumor volumes were measured twice weekly using calipers and calculated by the formula: ((Length) x (Width)^2)/2. All mouse experiments were repeated at least twice to ensure reproducibility. Mice were euthanized by carbon dioxide prior to necropsy. For survival analysis, end point of tumor bearing animals were determined by either death, severe health conditions or tumor volume of at least 1.7cm3, according to the IACUC approved animal protocol. The maximal tumor size limit of 2cm3 was followed, except in the cases where the tumors reached the limit between 2 contiguous days in which measurements were made. Every effort was made to abide by the state limit.
DNA/RNA extraction of murine tumors
Mouse tumors were trimmed, dissected, and then chopped to small pieces with blade in Petri dishes. Tumor pieces were further mixed and snap frozen with dry ice to reduce tumor regional heterogeneity. The frozen tissue was used for isolation of DNA and RNA and submitted to the Integrated Genomic Core for library preparation and sequencing. Three replicates per condition were submitted for sequencing.
Flow cytometry analysis
Murine tumors harvested at 14 days post-tumor implantation were dissociated into single cell suspensions using a gentle MACS tissue dissociator and the Miltenyi Mouse Tumor Dissociation Kit according to manufacturer instructions (Miltenyi Biotech). Cell suspensions were stained using the Live-Dead aqua fixable dye (Thermo) followed by surface and intracellular antibodies. Stained single cell suspensions were analyzed using the Fortessa (BD) flow cytometric analyzer. Quantitative data analysis was performed using Flowjo software (Treestar). All experiments were repeated at least twice. Information of the antibodies used for flow cytometry is available in Supplementary table 11. Gating strategy of flow cytometry is available in Supplemental fig 6.
Immunofluorescence and quantification of Cd3+ T cells
Harvested murine tumors were fixed in formalin at 4 °C for 48 hours and transferred to 70% ethanol at 4 °C. Tumors were embedded in paraffin and sectioned onto glass slides. Staining was performed by the Molecular Cytology Core Facility (MCCF) with Ventana Ultra stainer (Roche Diagnostics). Briefly, following 32 min of heat and CC1 (Cell Conditioning 1, Ventana 950-500) retrieval, the tissue sections were blocked first for 30 min in Background Blocking reagent (Innovex NB306). Anti-mouse Cd45 incubation (0.5ug/ml, 5h, BD 550539) was followed by incubation with biotinylated rabbit anti-rat IgG (5.75ug/ml, 1h, Vector BA-4000). Blocker D, Streptavidin- HRP and TSA A488 (Life Tech B40932) prepared according to manufacturer instruction in 1:100 dilution for 16 min. Then, anti-mouse Cd3 antibody incubation (1.2g/ml, 6h, Dako A0452) was followed by incubation with biotinylated goat anti-rabbit IgG (5.75ug /ml, 1h, Vector labs PK6101). Streptavidin- HRP and TSA CF594 (Biotium 92174) were prepared according to manufacturer instruction in 1:2000 dilution for 16 min. After that, anti-mouse Cd8a antibody incubation (4.8ug /ml, 6h, Cell Signaling 98941) followed by incubation with biotinylated goat anti-rabbit IgG (5.75ug /ml, 1h, Vector labs PK6101). Blocker D, Streptavidin- HRP and TSA Alexa 647 (Life Tech B40958) is prepared according to manufacturer instruction in 1:100 dilution for 16 min. All slides were counterstained in 5ug /ml DAPI [dihydrochloride [2-(4-amidinophenyl)-6-indolecarbamidine dihydrochloride], Sigma D9542, for 5 minutes at room temperature, mounted with anti-fade mounting medium Mowiol [Mowiol 4-88 (CALBIOCHEM 475904)] and coverslipped. After staining, slides were subjected to Pannoramic Scanner (3DHistech) at MCCF at 40x. For T-cell quantification, at least 5 tumors per groups and at least 3 random fields per tumor were generated by CaseViewer v2.4 (3DHistech) and then analyzed in ImageJ v1.5.3.
Whole exome sequencing of murine cell lines
Whole exome sequencing was performed with Novaseq (Illumina) with an average coverage of 250x. Three replicate cell lines were sequenced. Raw sequencing data are aligned to the GRCm38/mm10 genome build using the Burrows-Wheeler Aligner (BWA) v0.7.15 57. Further indel realignment, base-quality score recalibration and duplicate-read removal were performed using the Genome Analysis Toolkit (GATK) v4.1.4.158. SNV and indel callers include MuTect v1.1.459, VarScan v1.160, Strelka v2.9.1061, Mutect2 (part of GATK 4.1.4.1) and somaticSniper v1.0.562, annotations and filters were described in a previous study9. Briefly, we use Ensembl Variant Effect Predictor63 v102 to determine effect of called variants. Annotated VCF files are converted to Multiple Alignment Forma (MAF) format with Vcf2Maf v1.6.19 (doi:10.5281/zenodo.593251).
RNA-sequencing, GSEA, and TCR analysis
Three biological replicates per condition were submitted for sequencing depending on RNA quality. RNAseq raw read sequences were aligned against mouse genome assembly mm10 (Dec.2011/GRCm38, https://genome.ucsc.edu/cgi-bin/hgGateway?db=mm10) by STAR 2-pass alignment64. RNAseq gene level count values were computed by using the R package GenomicAlignments 65 over aligned reads with UCSC KnownGene66 in mm10 as the base gene model. The Union counting mode was used and only mapped paired reads were considered. Fragments per kilobase million (FPKM) values were then computed from gene level counts by using fpkm function from the R package DESeq2 v1.30.167.GSEA analysis was performed on DESeq2 outputs using pre-ranked GSEA module with default setting in GSEA v4.1.0 for Windows or on Genepattern server (Broad Institute)68. Pathway enrichment from the consensus DEGs were performed using R package clusterProfiler v3.18.169. Immune cell type signature enrichment were performed using xCell v1.133. PCA analysis were performed with R package PCAtools v2.2.0 with 10% variable with lowest variation removed from the analysis. PERMANOVA analysis were performed using R package vegan v2.5.7. TCR-beta CDR3 was extracted with MixCR70 v3.0.18 with default setting70. Calculation of Chao1 index, richness, evenness and clonality were performed as previously described71.
Mutational signature analysis on whole exome sequencing and MSK-IMPACT panel sequencing data
Mutation data were converted to trinucleotide context matrices using SigProfilerMatrixGeneratorR v0.1.072 and was limited to exome regions. De novo NMF prediction of mutational signature and transcriptional strand-based mutational signature analysis were preformed using the R package MutationalPatterns v1.8.073. NNLS analysis was performed using R package deconstructsig v1.9.074. SNVs detected in parental and mutant cell lines before 8 weeks of in vitro culturing were defined as baseline SNV mutations. SNVs detected only after 8 weeks of in vitro culturing were defined as de novo mutations. For MSK-IMPACT panel and mouse mutational signature analysis, a trinucleotide context matrix was first normalized based on the abundance of each trinucleotide context category in MSK-IMPACT regions, mouse and human exome regions. The normalize matrix were then used for downstream NMF and NNLS analysis.
Logistic regression model training on TCGA and MSK-IMPACT data
A priori, we classified samples as having a known functional mutation using a baseline list of reported known functionals from prior work classifying mutations based on the ultramutator phenotype12 and an FDA-approved database of functional mutations, the OncoKB data base35, which contains functional mutations curated from the literature. We then used this list to determine whether these samples harbored at least one POLE/D1 functional mutation. Logistic regression was applied using the R glm function with equation ‘functional status ~ SBS10a + SBS10b + SBS14 + SBS20’. For the WES dataset, only samples with at least 20 SNVs were used for generating the WES-model. For the MSK-IMPACT dataset, only samples with at least 10 SNVs were used for generating the IMPACT-model. Weight of functional and wild-type sample classes were calculated based on the percentage of functional samples and wild-type samples in the dataset. Predictions from the logistic regression model were further analyzed. Cutoff points were obtained using ROCit v2.1.1 package. Confusion matrix, accuracy and ROC curves were obtained using caret v6.0-86, PRROC v1.6.0 and pROC (v1.17.01) packages.
MSK-IMPACT immunotherapy cohort, survival and response analysis
The MSK-IMPACT immunotherapy cohort was assembled using a similar approach as described previously75,76. Briefly, after informed patient consent and approval by the MSK Institutional Review Board (IRB), patients were identified who received their anti-PD1/PD-L1 and underwent MSK-IMPACT targeted panel sequencing at MSK. In addition, to minimize the impact of potential survival bias due to left truncation (defined as a type of selection bias that results from only studying patients who have survived long enough on ICB therapy to receive MSK-IMPACT sequencing, in this circumstance) by limiting the cohort to patients received their first dosage of immunotherapy reagents between January 2015 to July 2018 when MSK-IMPACT was already routinely performed. Patients with more than one cancer type and cancer types with fewer than 10 patients were excluded. MSI-H patients, determined by having an MSIsensor score >= 10, were also excluded. Survival data was last updated July 2020 and overall survival calculated as time between first anti-PD-1/anti-PD-L1 therapy start and death, or censored at last contact. Kaplan–Meier survival analysis was performed with the R survival package v3.2.10, and hazard ratios (HR) and log-rank p values were calculated using the coxph analysis with cancer type correction; results were reported and presented using the R survminer package v-.4.9. Multi-variable coxph analysis was also performed using the survminer package. Known POLE/D1 functional mutation-positive patients were determined as patients harboring at least one functional mutation from the baseline functional list used for generating the logistic prediction model. POLE/D1 function-associated signature-positive VUS patients were identified as patients 1) harboring at least one somatic POLE/D1 mutation; 2) harboring at least 10 SNV for relative reliable mutation signature analysis; and 3) classified as functional samples by the IMPACT logistic regression model. TMB was calculated as MSK-IMPACT filtered non-synonymous mutation counts divided by the length of the corresponding panel version77. FGA (fraction genome altered; aka, fraction copy-number altered) was calculated as the length of FACETS78,79 segments with ∣cnlr.median.clust∣ >= 0.2 (i.e., segments with log2 CNA value > 0.2) divided by the total length of all segments. For the single cancer type specific coxph model, non-small cell lung cancer (NSCLC) was defined as the union of lung adenocarcinoma, lung squamous cell carcinoma, poorly differentiated non-small cell lung cancer and other non-small cell lung cancer cases. The response to immunotherapy was categorized based on RECIST v1.1 criteria80. When formal RECIST reads were not available, physician notes and imaging studies were reviewed to categorize overall best response for each patient using the same criteria based on change in the sum of diameters of target lesions. Patients with CR, PR or SD for at least 6 months were classified as clinical benefit; SD less than 6 months and PD were classified as no clinical benefit. PFS was calculated from ICB first infusion to disease progression or death of any cause; patients without progression were censored at last attended appointment at MSKCC with any clinician. For estimating the CR patient tumor sizes, tumor length and width were retrieved from clinical data and tumor volume is estimated using formula: ((Length) x (Width)^2)/2. In silicon mutation effect prediction result were calculated from dbNSFP4.2a81. C-index calculation and associated statistical analysis are performed as described previously82 using using survcomp v1.40.0. Heatmap is illustrated with complexheatmap package v2.6.283.
SBS signature SNV classes/AA alteration classes association, hydrophobicity and polarity calculation
To calculate the association between a mutational signature and SNV classes or amino acid alteration classes, we first calculated the association of each the SNVs in this class with this mutational signature, based on the mutational signature profile extracted from the samples the SNVs belongs to, and the probability that this mutational signature generates mutations with the same trinucleotide context of this SNV (which is the SBS trinucleotide context matrix). We then assigned each SNVs into different SNV classes or amino acid alteration classes, and summed up the probability of all SNVs in each class. Finally, we normalized to the total probability of all classes for this mutational signature to obtain the proportion of probability of each SNV or amino acid alteration class. For neoantigen related analysis, only samples with at least 1 known binder neoantigen were included into analysis. Hydrophobicity changes were calculated using the Kyte–Doolittle numeric hydrophobicity scale84. Polarity changes were calculated using the Grantham polarity index85.
Statistics & Reproducibility
All the statistical details of experiments including the statistical tests used, number of samples, definition of center, dispersion, precision measures and how statistical significance is determined can be found in figure legends. For mouse tumor experiments, sample size is based on our previous publication7,9. No statistical method was used to predetermine sample size. Mice with severely necrotic and intramuscular tumors are euthanized based on our approved IACUC protocol and thus excluded from analysis. No other data were excluded from the analyses. No statistical method was used to predetermine sample size for other experiments. Randomization was performed in mouse experiments between treatment groups. Where possible, investigators were blinded to allocation during objective outcome assessment including tumor size measurement, immunofluorescence quantification, and clinical immunotherapy response assessment. For generating patient sub-cohorts with matched N, median and mean TMB, wild-type patients were randomly selected in R 4.0.3 with the random seed 12345. For mouse tumor and flow cytometry experiments, statistical analyses were performed using the Graphpad Prism v7.00. For all the rest experiments and analyses were performed in RStudio v1.3.1093 with R v4.0.3.
Extended Data
Supplementary Material
Acknowledgements
We would like to thank colleagues at the MSK core facilities, including Integrated Genomics Operation (IGO), Flow Cytometry Core Facility (FCCF) and Molecular Cytology Core Facility (MCCF) for processing our samples and providing important suggestions. We would like to thank colleagues at the Molecular Diagnostics Service in the Department of Pathology, and the Marie-Josee and Henry R. Kravis Center for Molecular Oncology Marie-Josee of MSK for establishing and generating the MSK-IMPACT data. We would like to thank all the members and alumni of the Chan lab at MSK and CCF, as well as HOPP of MSK for their generous help and support of this study. The results presented here are in part based upon data generated or collected by the TCGA Research Network, Broad CCLE, ICGC. We acknowledge funding sources including NIH R01 CA205426 (TAC), NIH R35 CA232097 (TAC), the STARR Cancer Consortium (TAC) and the NIH/NCI Cancer Center Support Grant P30 CA008748 (MSKCC) .
Footnotes
Competing Interests Statement
T.A.C. is a co-founder of Gritstone Oncology and holds equity. T.A.C. holds equity in An2H. T.A.C. acknowledges grant funding from Bristol-Myers Squibb, AstraZeneca, Illumina, Pfizer, An2H, and Eisai. T.A.C. has served as an advisor for Bristol-Myers, MedImmune, Squibb, Illumina, Eisai, AstraZeneca, and An2H. T.A.C., L.G.T.M, and D.C. are inventors on intellectual property held by MSKCC on using tumor mutation burden to predict immunotherapy response, with pending patent, which has been licensed to PGDx. C.V. acknowledges research grant funding from Fundación Alfonso Martín Escudero. D.Z. received consulting fees from Agenus, Hookipa Biotech, Targovax, Astra Zeneca, Synthekine, Mana Therapeutics, Xencor, Crown Biosciences, and Memgen. D.Z. receives grant/research support from Astra Zeneca, Roche, and Plexxikon. D.Z. holds stock options for Immunos Therapeutics, Calidi Biotherapeutics, Mana Therapeutics and Accurius. D.Z. has a patent related to use of Newcastle Disease Virus for cancer therapy with royalties paid from Merck. R.Y. has served as an advisor for Natera, Array BioPharma/Pfizer, and Mirati Therapeutics and has received research support to her institution from Array BioPharma/Pfizer, Boehringer Ingelheim, and Mirati Therapeutics. The remaining authors declare no competing interests.
Data Availability Statement
Mutation data from the TCGA pan-cancer study (mc3.v0.2.8.PUBLIC.maf.gz) were downloaded from NCI genomic data commons (https://api.gdc.cancer.gov/data/1c8cfe5f-e52d-41ba-94da-f15ea1337efc). Mutation data of ICGC and CCLE were download from ICGC data portal (https://dcc.icgc.org/api/v1/download?fn=/current/Summary/simple_somatic_mutation.aggregated.vcf.gz) and the Broad Institute website (https://ndownloader.figshare.com/files/34008434), accordingly. To generate the ICGC/CCLE test set, TCGA samples in ICGC cohort were manually removed. BED files of MSK-IMPACT sequencing regions are acquired from AACR-Genie project database (https://www.synapse.org/#!Synapse:syn7222066/wiki/405659). COSMIC SBS exome signature v3 and COSMIC SBS exome TSB signature v3 were downloaded from the Synapse database (https://www.synapse.org/#!Synapse:syn11967914). Mouse GRCm38 genome were download from UCSC genome browser (https://hgdownload.soe.ucsc.edu/goldenPath/mm10/bigZips/mm10.fa.gz). A baseline list of known POLE/D1 functional mutations were generated from previous publications and the OncoKB database (https://www.oncokb.org/) 12,35. Additional lists of clinical associated POLE/D1 mutations and POLE/D1 mutator alleles in other species were manually curated from literatures and were available as Supplemental Table 2&3. Gene expression profile and xCell based cell type deconvolution of TCGA endometrial cohort were downloaded from TIMER database (https://timer.cistrome.org/infiltration_estimation_for_tcga.csv.gz) 86 87. TCRb reportorie data were downloaded from NCI genomic data commons (TCGA_mitcr_cdr3_result_161008.tsv, from https://gdc.cancer.gov/about-data/publications/panimmune) 88.TCGA predicted neoantigen information were acquired from TSNAdb (http://biopharm.zju.edu.cn/tsnadb/download/)89. Mouse WES and RNA-seq data were deposited to SRA (PRJNA701709). All the other data are deposited to Synapse (as Synapse project syn29404148, https://www.synapse.org/#!Synapse:syn29404148/wiki/617361) with public access: All the samples underwent evalutation for model building (Synapse ID: syn29477489.1), the mutational signature matrix for training and test the WES (Synapse ID: syn29478035, Synapse ID: syn29478033) and MSK-IMPACT models (Synapse ID: syn29478110, Synapse ID: syn29478151), the genomic, response and survival data of the MSK-IMPACT immunotherapy cohorts (Synapse ID: syn29478036)
Code Availability Statement
All customized code including code for generate the models (Synapse ID: syn29479495)90 ,the analysis of MSK-IMPACT cohort (Synapse ID: syn29479497)91 and other associated code (Synapse ID: syn30137113)92 are deposited to Synapse (Synapse project ID: syn29404148, https://www.synapse.org/#!Synapse:syn29404148/wiki/617361) with public access. Code for processing WES and RNAseq were from published tools and is available from the authors of the tools as described in the above Methods sections.
Reference
- 1.Borcoman E et al. Novel patterns of response under immunotherapy. Ann Oncol 30, 385–396 (2019). [DOI] [PubMed] [Google Scholar]
- 2.Hegde PS & Chen DS Top 10 Challenges in Cancer Immunotherapy. Immunity 52, 17–35 (2020). [DOI] [PubMed] [Google Scholar]
- 3.Mouw KW, Goldberg MS, Konstantinopoulos PA & D'Andrea AD DNA Damage and Repair Biomarkers of Immunotherapy Response. Cancer Discov 7, 675–693 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bever KM & Le DT DNA repair defects and implications for immunotherapy. J Clin Invest 128, 4236–4242 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hsiehchen D et al. DNA Repair Gene Mutations as Predictors of Immune Checkpoint Inhibitor Response beyond Tumor Mutation Burden. Cell Rep Med 1(2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rizvi NA et al. Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124–8 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Samstein RM et al. Mutations in BRCA1 and BRCA2 differentially affect the tumor microenvironment and response to checkpoint blockade immunotherapy. Nature Cancer (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Le DT et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science 357, 409–413 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mandal R et al. Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science 364, 485–491 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ying J et al. Additive effects of variants of unknown significance in replication repair-associated DNA polymerase genes on mutational burden and prognosis across diverse cancers. J Immunother Cancer 9(2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lujan SA, Williams JS & Kunkel TA DNA Polymerases Divide the Labor of Genome Replication. Trends Cell Biol 26, 640–654 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Campbell BB et al. Comprehensive Analysis of Hypermutation in Human Cancer. Cell 171, 1042–1056 e10 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kunkel TA Evolving views of DNA replication (in)fidelity. Cold Spring Harb Symp Quant Biol 74, 91–101 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Esteban-Jurado C et al. POLE and POLD1 screening in 155 patients with multiple polyps and early-onset colorectal cancer. Oncotarget 8, 26732–26743 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Mehnert JM et al. Immune activation and response to pembrolizumab in POLE-mutant endometrial cancer. J Clin Invest 126, 2334–40 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.van Gool IC et al. POLE Proofreading Mutations Elicit an Antitumor Immune Response in Endometrial Cancer. Clin Cancer Res 21, 3347–3355 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Domingo E et al. Somatic POLE proofreading domain mutation, immune response, and prognosis in colorectal cancer: a retrospective, pooled biomarker study. Lancet Gastroenterol Hepatol 1, 207–216 (2016). [DOI] [PubMed] [Google Scholar]
- 18.Wang F et al. Evaluation of POLE and POLD1 Mutations as Biomarkers for Immunotherapy Outcomes Across Multiple Cancer Types. JAMA Oncol 5, 1504–1506 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.He J et al. Distinctive genomic characteristics in POLE/POLD1-mutant cancers can potentially predict beneficial clinical outcomes in patients who receive immune checkpoint inhibitor. Ann Transl Med 9, 129 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Barbari SR & Shcherbakova PV Replicative DNA polymerase defects in human cancers: Consequences, mechanisms, and implications for therapy. DNA Repair (Amst) 56, 16–25 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Silberman R et al. Complete and Prolonged Response to Immune Checkpoint Blockade in POLE-Mutated Colorectal Cancer. JCO Precision Oncology, 1–5 (2019). [DOI] [PubMed] [Google Scholar]
- 22.Rayner E et al. A panoply of errors: polymerase proofreading domain mutations in cancer. Nat Rev Cancer 16, 71–81 (2016). [DOI] [PubMed] [Google Scholar]
- 23.Alexandrov LB et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Petljak M et al. Characterizing Mutational Signatures in Human Cancer Cell Lines Reveals Episodic APOBEC Mutagenesis. Cell 176, 1282–1294 e20 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Koh G, Degasperi A, Zou X, Momen S & Nik-Zainal S Mutational signatures: emerging concepts, caveats and clinical applications. Nat Rev Cancer 21, 619–637 (2021). [DOI] [PubMed] [Google Scholar]
- 26.Albertson TM et al. DNA polymerase epsilon and delta proofreading suppress discrete mutator and cancer phenotypes in mice. Proc Natl Acad Sci U S A 106, 17101–4 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li HD et al. Polymerase-mediated ultramutagenesis in mice produces diverse cancers with high mutational load. J Clin Invest 128, 4179–4191 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cong L et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–23 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Niu B et al. MSIsensor: microsatellite instability detection using paired tumor-normal sequence data. Bioinformatics 30, 1015–6 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Knijnenburg TA et al. Genomic and Molecular Landscape of DNA Damage Repair Deficiency across The Cancer Genome Atlas. Cell Rep 23, 239–254 e6 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Leon-Castillo A et al. Interpretation of somatic POLE mutations in endometrial carcinoma. J Pathol 250, 323–335 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bellido F et al. POLE and POLD1 mutations in 529 kindred with familial colorectal cancer and/or polyposis: review of reported cases and recommendations for genetic testing and surveillance. Genet Med 18, 325–32 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Aran D, Hu Z & Butte AJ xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol 18, 220 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Alexandrov LB et al. Signatures of mutational processes in human cancer. Nature 500, 415–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chakravarty D et al. OncoKB: A Precision Oncology Knowledge Base. JCO Precis Oncol 2017(2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hoadley KA et al. Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer. Cell 173, 291–304 e6 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Youden WJ Index for rating diagnostic tests. Cancer 3, 32–5 (1950). [DOI] [PubMed] [Google Scholar]
- 38.International Cancer Genome, C. et al. International network of cancer genome projects. Nature 464, 993–8 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ghandi M et al. Next-generation characterization of the Cancer Cell Line Encyclopedia. Nature 569, 503–508 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Zehir A et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat Med 23, 703–713 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chowell D et al. Improved prediction of immune checkpoint blockade efficacy across multiple cancer types. Nat Biotechnol 40, 499–506 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Chowell D et al. TCR contact residue hydrophobicity is a hallmark of immunogenic CD8+ T cell epitopes. Proc Natl Acad Sci U S A 112, E1754–62 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Richman LP, Vonderheide RH & Rech AJ Neoantigen Dissimilarity to the Self-Proteome Predicts Immunogenicity and Response to Immune Checkpoint Blockade. Cell Syst 9, 375–382 e4 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Riley TP et al. Structure Based Prediction of Neoantigen Immunogenicity. Front Immunol 10, 2047 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Calis JJ et al. Properties of MHC class I presented peptides that enhance immunogenicity. PLoS Comput Biol 9, e1003266 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Li HD et al. A PoleP286R mouse model of endometrial cancer recapitulates high mutational burden and immunotherapy response. JCI Insight 5(2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Davies H et al. HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures. Nat Med 23, 517–525 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Lee JS & Ruppin E Multiomics Prediction of Response Rates to Therapies to Inhibit Programmed Cell Death 1 and Programmed Cell Death 1 Ligand 1. JAMA Oncol (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Cummings AL et al. Mutational landscape influences immunotherapy outcomes among patients with non-small-cell lung cancer with human leukocyte antigen supertype B44. Nature Cancer 1, 1167–1175 (2020). [DOI] [PubMed] [Google Scholar]
- 50.Litchfield K et al. Meta-analysis of tumor- and T cell-intrinsic mechanisms of sensitization to checkpoint inhibition. Cell 184, 596–614 e14 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
Methods-only References
- 7.Samstein RM et al. Mutations in BRCA1 and BRCA2 differentially affect the tumor microenvironment and response to checkpoint blockade immunotherapy. Nature Cancer (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mandal R et al. Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science 364, 485–491 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Campbell BB et al. Comprehensive Analysis of Hypermutation in Human Cancer. Cell 171, 1042–1056 e10 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cong L et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–23 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Aran D, Hu Z & Butte AJ xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol 18, 220 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chakravarty D et al. OncoKB: A Precision Oncology Knowledge Base. JCO Precis Oncol 2017(2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.van Elsas A, Hurwitz AA & Allison JP Combination immunotherapy of B16 melanoma using anti-cytotoxic T lymphocyte-associated antigen 4 (CTLA-4) and granulocyte/macrophage colony-stimulating factor (GM-CSF)-producing vaccines induces rejection of subcutaneous and metastatic tumors accompanied by autoimmune depigmentation. J Exp Med 190, 355–66 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zitvogel L, Pitt JM, Daillere R, Smyth MJ & Kroemer G Mouse models in oncoimmunology. Nat Rev Cancer 16, 759–773 (2016). [DOI] [PubMed] [Google Scholar]
- 53.Zeng Z et al. TISMO: syngeneic mouse tumor database to model tumor immunity and immunotherapy response. Nucleic Acids Res 50, D1391–D1397 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Griswold DP & Corbett TH A colon tumor model for anticancer agent evaluation. Cancer 36, 2441–4 (1975). [DOI] [PubMed] [Google Scholar]
- 55.Ran FA et al. Genome engineering using the CRISPR-Cas9 system. Nat Protoc 8, 2281–2308 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Clement K et al. CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nat Biotechnol 37, 224–226 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Li H & Durbin R Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–60 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.McKenna A et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–303 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Cibulskis K et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol 31, 213–9 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Koboldt DC et al. VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25, 2283–5 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Kim S et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat Methods 15, 591–594 (2018). [DOI] [PubMed] [Google Scholar]
- 62.Larson DE et al. SomaticSniper: identification of somatic point mutations in whole genome sequencing data. Bioinformatics 28, 311–7 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.McLaren W et al. The Ensembl Variant Effect Predictor. Genome Biol 17, 122 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Dobin A et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lawrence M et al. Software for computing and annotating genomic ranges. PLoS Comput Biol 9, e1003118 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Rosenbloom KR et al. The UCSC Genome Browser database: 2015 update. Nucleic Acids Res 43, D670–81 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Love MI, Huber W & Anders S Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Reich M et al. GenePattern 2.0. Nat Genet 38, 500–1 (2006). [DOI] [PubMed] [Google Scholar]
- 69.Yu G, Wang LG, Han Y & He QY clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–7 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bolotin DA et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat Methods 12, 380–1 (2015). [DOI] [PubMed] [Google Scholar]
- 71.Riaz N et al. Tumor and Microenvironment Evolution during Immunotherapy with Nivolumab. Cell 171, 934–949 e16 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Bergstrom EN et al. SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events. BMC Genomics 20, 685 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Blokzijl F, Janssen R, van Boxtel R & Cuppen E MutationalPatterns: comprehensive genome-wide analysis of mutational processes. Genome Med 10, 33 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Rosenthal R, McGranahan N, Herrero J, Taylor BS & Swanton C DeconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome Biol 17, 31 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Valero C et al. The association between tumor mutational burden and prognosis is dependent on treatment context. Nat Genet 53, 11–15 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Valero C et al. Pretreatment neutrophil-to-lymphocyte ratio and mutational burden as biomarkers of tumor response to immune checkpoint inhibitors. Nat Commun 12, 729 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Samstein RM et al. Tumor mutational load predicts survival after immunotherapy across multiple cancer types. Nat Genet 51, 202–206 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Zhou J et al. Analysis of Tumor Genomic Pathway Alterations Using Broad-Panel Next-Generation Sequencing in Surgically Resected Lung Adenocarcinoma. Clin Cancer Res 25, 7475–7484 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Shen R & Seshan VE FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing. Nucleic Acids Res 44, e131 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Schwartz LH et al. RECIST 1.1-Update and clarification: From the RECIST committee. Eur J Cancer 62, 132–7 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Liu X, Li C, Mou C, Dong Y & Tu Y dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs. Genome Med 12, 103 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Chowell D et al. Improved prediction of immune checkpoint blockade efficacy across multiple cancer types. Nat Biotechnol (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Gu Z, Eils R & Schlesner M Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–9 (2016). [DOI] [PubMed] [Google Scholar]
- 84.Kyte J & Doolittle RF A simple method for displaying the hydropathic character of a protein. J Mol Biol 157, 105–32 (1982). [DOI] [PubMed] [Google Scholar]
- 85.Grantham R Amino acid difference formula to help explain protein evolution. Science 185, 862–4 (1974). [DOI] [PubMed] [Google Scholar]
- 86.Li B et al. Comprehensive analyses of tumor immunity: implications for cancer immunotherapy. Genome Biol 17, 174 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Sturm G et al. Comprehensive evaluation of transcriptome-based cell-type quantification methods for immuno-oncology. Bioinformatics 35, i436–i445 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Thorsson V et al. The Immune Landscape of Cancer. Immunity 48, 812–830 e14 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Wu J et al. TSNAdb: A Database for Tumor-specific Neoantigens from Immunogenomics Data Analysis. Genomics Proteomics Bioinformatics 16, 276–282 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Ma X et al. (2022) NG_WES_IMPACT_Model_training_testing_script.R [R code]. Synapse. 10.7303/syn29479497.1 [DOI] [Google Scholar]
- 91.Ma X et al. (2022) NG_ICB_Cohort_script.R [R code]. Synapse. 10.7303/syn29479495 [DOI] [Google Scholar]
- 92.Ma X et al. (2022) NG-A57433 other associated codes.R [R code]. Synapse. 10.7303/syn30137113 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Mutation data from the TCGA pan-cancer study (mc3.v0.2.8.PUBLIC.maf.gz) were downloaded from NCI genomic data commons (https://api.gdc.cancer.gov/data/1c8cfe5f-e52d-41ba-94da-f15ea1337efc). Mutation data of ICGC and CCLE were download from ICGC data portal (https://dcc.icgc.org/api/v1/download?fn=/current/Summary/simple_somatic_mutation.aggregated.vcf.gz) and the Broad Institute website (https://ndownloader.figshare.com/files/34008434), accordingly. To generate the ICGC/CCLE test set, TCGA samples in ICGC cohort were manually removed. BED files of MSK-IMPACT sequencing regions are acquired from AACR-Genie project database (https://www.synapse.org/#!Synapse:syn7222066/wiki/405659). COSMIC SBS exome signature v3 and COSMIC SBS exome TSB signature v3 were downloaded from the Synapse database (https://www.synapse.org/#!Synapse:syn11967914). Mouse GRCm38 genome were download from UCSC genome browser (https://hgdownload.soe.ucsc.edu/goldenPath/mm10/bigZips/mm10.fa.gz). A baseline list of known POLE/D1 functional mutations were generated from previous publications and the OncoKB database (https://www.oncokb.org/) 12,35. Additional lists of clinical associated POLE/D1 mutations and POLE/D1 mutator alleles in other species were manually curated from literatures and were available as Supplemental Table 2&3. Gene expression profile and xCell based cell type deconvolution of TCGA endometrial cohort were downloaded from TIMER database (https://timer.cistrome.org/infiltration_estimation_for_tcga.csv.gz) 86 87. TCRb reportorie data were downloaded from NCI genomic data commons (TCGA_mitcr_cdr3_result_161008.tsv, from https://gdc.cancer.gov/about-data/publications/panimmune) 88.TCGA predicted neoantigen information were acquired from TSNAdb (http://biopharm.zju.edu.cn/tsnadb/download/)89. Mouse WES and RNA-seq data were deposited to SRA (PRJNA701709). All the other data are deposited to Synapse (as Synapse project syn29404148, https://www.synapse.org/#!Synapse:syn29404148/wiki/617361) with public access: All the samples underwent evalutation for model building (Synapse ID: syn29477489.1), the mutational signature matrix for training and test the WES (Synapse ID: syn29478035, Synapse ID: syn29478033) and MSK-IMPACT models (Synapse ID: syn29478110, Synapse ID: syn29478151), the genomic, response and survival data of the MSK-IMPACT immunotherapy cohorts (Synapse ID: syn29478036)
All customized code including code for generate the models (Synapse ID: syn29479495)90 ,the analysis of MSK-IMPACT cohort (Synapse ID: syn29479497)91 and other associated code (Synapse ID: syn30137113)92 are deposited to Synapse (Synapse project ID: syn29404148, https://www.synapse.org/#!Synapse:syn29404148/wiki/617361) with public access. Code for processing WES and RNAseq were from published tools and is available from the authors of the tools as described in the above Methods sections.