Abstract
DNA mismatch repair deficiency (MMRd) is associated with a high tumor mutational burden (TMB) and sensitivity to immune checkpoint blockade (ICB) therapy. Nevertheless, most MMRd tumors do not durably respond to ICB and critical questions remain about immunosurveillance and TMB in these tumors. In the present study, we developed autochthonous mouse models of MMRd lung and colon cancer. Surprisingly, these models did not display increased T cell infiltration or ICB response, which we showed to be the result of substantial intratumor heterogeneity of mutations. Furthermore, we found that immunosurveillance shapes the clonal architecture but not the overall burden of neoantigens, and T cell responses against subclonal neoantigens are blunted. Finally, we showed that clonal, but not subclonal, neoantigen burden predicts ICB response in clinical trials of MMRd gastric and colorectal cancer. These results provide important context for understanding immune evasion in cancers with a high TMB and have major implications for therapies aimed at increasing TMB.
Subject terms: Immunosurveillance, Colon cancer, Tumour immunology, Immunosurveillance, Colon cancer
Mouse models of lung and colorectal cancer with sporadic DNA mismatch repair deficiency clarify that the intratumor heterogeneity and clonal architecture rather than tumor mutational burden are powerful determinants of immunotherapy response.
Main
Immunotherapy has revolutionized the treatment landscape of many cancers, particularly those with a high tumor mutational burden (TMB)1–5. Somatic mutations can generate neoantigens capable of eliciting tumor-specific T cell responses6,7, and it is widely believed that increased TMB renders tumors susceptible to immune attack after ICB treatment. Indeed, multiple pan-cancer meta-analyses have shown that TMB is one of the strongest predictors of ICB response8–10, leading to approval by the US Food and Drug Administration (FDA) of pembrolizumab (anti-PD-1) for all tumors based on high TMB alone. In particular, MMRd is associated with some of the highest TMBs observed in cancer (hypermutation)11–15 and remarkable response rates to pembrolizumab2,5. However, more than half the patients with MMRd tumors do not durably respond and TMB does not stratify responders2,5. This underscores a critical need to understand what factors beyond TMB mediate efficacy. There are also conflicting studies suggesting that TMB is an imperfect biomarker of immunotherapy response16 and argue that FDA approval of pembrolizumab based on TMB alone may be too broad17. Specifically, TMB provides limited to no additional predictive value within specific subsets of cancer known to have high rates of response to ICB, like those with MMRd2,5.
One factor that may dilute the power of TMB as a biomarker of ICB response is intratumor heterogeneity (ITH) of mutations or, simply defined, the fraction of mutations that are subclonal—present in a minority of tumor cells. Subclonal neoantigens could be deleted with minimal impact on tumor fitness18 or fail to elicit productive T cell responses19. Indeed, it has been observed in human cancer that ITH is associated with decreased T cell infiltration20,21 and poor survival22,23, whereas clonal neoantigens are predictive of response to ICB23–25. This concept has been exemplified in an experimental setting of ultraviolet light B (UVB)-induced hypermutation of melanoma cell lines26. It is reasonable to hypothesize that similar mechanisms are operating in MMRd cancers given their constitutive mutational instability. Although previous studies showed that MMRd mutagenesis in vitro renders cell lines immunogenic27,28, it is unclear what the impact is of MMR loss in vivo in the presence of immunosurveillance, a process that exquisitely shapes tumor immunogenicity6,29,30. To address these questions, we developed autochthonous mouse models of sporadic MMRd lung and colon cancer via targeted ablation of key genes in the MMR complex, MutL homolog 1 (Mlh1) and MutS homologs 2, 3 and 6 (Msh2, Msh3 and Msh6), and performed preclinical trials to determine ICB sensitivity.
Modeling sporadic MMRd in cancer
We adapted the autochthonous mouse model of lung cancer developed in our laboratory31 by breeding in a Cas9-expressing allele (KrasLSL-G12D; Trp53flox/flox; R26LSL-Cas9 (KPC)) or conditional Msh2 knockout allele32 (KrasLSL-G12D;Trp53flox/flox;Msh2flox/flox (KPM)). Intratracheal delivery of lentivirus-expressing Cre and an Msh2-targeting single guide (sg)RNA (sgMsh2) into the former or alveolar type II, cell-specific, adenovirus-expressing Cre into the latter induced lung adenocarcinomas with efficient MSH2 knockout (Fig. 1a–c and Extended Data Fig. 1a–e). We also adapted an endoscope-guided submucosal injection technique33 to deliver lentivirus with sgRNAs targeting the colon tumor suppressor, Apc, in tandem with Msh2, Mlh1, Msh3 or Msh6, into the distal colon of mice with constitutive Cas9 expression (Fig. 1b). This efficiently induced focal colon adenomas with MMR gene knockout (Fig. 1d and Extended Data Fig. 1f–i).
To confirm mutation of MMR genes and investigate the degree of TMB, we performed whole-exome sequencing (WES) on micro-dissected tumors at 16–20 weeks post-initiation, including 26 sgMsh2- and -3- control (sgCtl)-targeted KPC lung tumors, 15 and 6 SPC-Cre-targeted KPM and KP (Msh2 wild-type) lung tumors and 5 sgMsh2-, 6 sgMlh1-, 2 sgMsh6-, 6 sgMsh3- and 5 sgCtl-targeted colon tumors (Supplementary Table 1). Targeted sequencing of MMR genes in these tumors confirmed a preponderance of frame-shifting insertions/deletions (indels) (Supplementary Table 1). KPM, sgMsh2-, sgMlh1- and sgMsh6-targeted lung and colon tumors showed increased burden of somatic single-nucleotide variants (SNVs), indels and microsatellite instability (MSI) scores (MSIsensor34), whereas sgMsh3-targeted colon tumors showed elevated levels of indels only (Fig. 1e,f and Extended Data Fig. 1j–m). Indels across all MMRd tumors were predominantly single nucleotide and enriched at homopolymer repeat microsatellites (Fig. 1f and Extended Data Fig. 1n). Mutational patterns were highly consistent with those observed in human MMRd colon cancer35 (cosine similarity of 0.9 and 0.8), as determined by decomposition of mutational spectra into individual signatures from the Catalogue of Somatic Mutations in Cancer (COSMIC)36,37 (Fig. 1g,h and Extended Data Fig. 1n–q). The MMRd signature (SBS-MMRd) comprised 92%, 78% and 75% of all SNVs in the mouse MMRd colon and lung tumor models and human MMRd colon cancer, respectively (Fig. 1f). It is interesting that one lung tumor had a Ser415Arg mutation in the exonuclease domain of DNA polymerase epsilon (Pole) and, consequently, a much higher TMB (Extended Data Fig. 1j).
We also performed WES on cell lines derived from sgMsh2-targeted lung tumors. Unexpectedly, these showed on average a tenfold greater TMB than sgMsh2-targeted lung tumors (Fig. 1e and Extended Data Fig. 1k), suggesting low tumor purity (unlikely, because Msh2KO lung tumors were on average 72% pure), mutagenesis in culture or substantial ITH reduced by clonal selection on plastic. Arguing against extensive mutagenesis in vitro, WES of a single-cell clone revealed more than double the TMB of the parental line, despite a much smaller increase in TMB after 20 passages (Extended Data Fig. 1r). Consistent with high ITH, more sequencing reads supporting mutations private to the single clone were found in the sequencing data of the parental compared with the unrelated control line (1,685 versus 451), albeit at variant allele frequencies (VAFs) below the threshold of mutation calling (Extended Data Fig. 1s,t), as previously observed in hypermutated gliomas38.
Altogether, these results establish the utility of our models to recapitulate fundamental mutational processes underlying hypermutation in MMRd human cancer. The lower clonal TMB that we observed is probably due to neutral evolution in the absence of major selective bottlenecks, a feature of ablating MMR concomitantly with tumor initiation in our models. Although these models cannot capture the accumulation of mutations or clonal evolution over decades in human cancer, they are uniquely suited to study the role of ITH in immune dysfunction of cancers with high and low prevalence of MMRd alike. Importantly, ITH is associated with aggressive disease and decreased ICB response in humans22–25, but it remains unclear how ITH impacts the immune response in MMRd cancers specifically.
Sporadic MMRd in the mouse is not immunogenic
Next, we sought to determine the effects of sporadic MMRd on tumorigenesis in our models. Neither KPC nor KPM Msh2KO models showed a significant difference in overall tumor burden or grade at either timepoint (Fig. 2a–d and Extended Data Fig. 2b,c). Notably, there was no difference in tumor infiltration by T cells (CD3+) in the sgMsh2-targeted model (Extended Data Fig. 2d,e) or infiltration by cytotoxic (CD8+), helper (CD4+) or regulatory (CD4+FOXP3+) T cells within tumors (Fig. 2e–g) or whole lungs (Extended Data Fig. 2g,h) in the KPM model at either timepoint. Similar to the lung, tumors in the colon model showed no difference in growth kinetics with Msh2 targeting (Extended Data Fig. 2i).
To determine the sensitivity of these models to immunotherapy, we first performed preclinical trials with ICB (anti-CTLA-4/anti-PD-1) in the KPM model. We followed similar dosing as established in seminal preclinical studies that preceded the first human clinical trials of anti-CTLA-4 (ref. 39) and anti-PD-1 (ref. 40), although we continued treatment for longer (4 weeks). In addition, we included treatment with the chemotherapeutic combination of oxaliplatin and low-dose cyclophosphamide (Oxa/Cyc) alone and in combination with ICB (Fig. 2h–i), because Oxa/Cyc has been shown to synergize with ICB41,42. To our surprise, no significant differences were observed between KPM and KP mice across all treatment arms, in both longitudinal change (Fig. 2j) and final tumor burden at necropsy (Fig. 2k and Extended Data Fig. 2j). Consistent with a lack of immunogenicity, we observed no differences in tumor grade or burden at 16 weeks between KPM and KP mice treated continuously with CD4+ and CD8+ T cell-depleting antibodies (αCD4/8) (Extended Data Fig. 2k–p). These results were not unique to the lung, because ICB treatment failed to induce any responses in sgMsh2-targeted colon tumors (Fig. 2l,m and Extended Data Fig. 2q,r). Likewise, there was no significant difference in endpoint size of sgCtl- versus sgMsh2-targeted colon tumors after ICB or continuous αCD4/8 treatment (Fig. 2n). Altogether, these data demonstrate that MMRd in these models is not sufficient to increase immunogenicity or sensitivity of tumors to ICB, alone or in combination with chemotherapy, in stark contrast to previous reports in cell-line transplant models27,28.
MMRd drives mutational heterogeneity
To assess the clonal composition of mutations in our models, we estimated cancer cell fractions (CCFs)43. TMB was predominantly subclonal and most mutations were present in less than a quarter of cells in MMRd tumors from lung and colon (Fig. 3a,b and Extended Data Fig. 3a–c). These mutations also adhered perfectly to a theoretical model of neutral evolution of subclonal mutations in cancer44 (Fig. 3c,d), consistent with the absence of selective events after tumor initiation in our models. To investigate ITH more deeply, we performed WES on eight single-cell clones derived from an sgMsh2-targeted lung tumor cell line (09-2). Importantly, before subcloning, we restored MMR by re-expressing Msh2 on a bicistronic lentivirus conferring puromycin resistance (Fig. 3e). Of note, these clones (M1–8) maintained MSH2 expression and showed stable mutational and clonal architecture after 20 passages in the presence of puromycin (Extended Data Fig. 3d–f). Nevertheless, significantly more somatic mutations were called in all clones than in the parental line (Fig. 3f) and these mutations were not broadly shared across clones (Extended Data Fig. 3g), supporting the notion that ITH is underestimated by bulk sequencing methods38. Phylogenetic analysis of clones further confirmed a considerable level of ITH (Fig. 3g).
Given that ITH in our models arose in vivo, we sought to understand the role of immunoediting6,29,30 in shaping this process. We performed WES on micro-dissected Msh2KO lung tumors (weeks 16–20) from animals continuously depleted of T cells (n = 34) or treated with ICB (n = 12). Continuous T cell depletion had no significant effect on overall TMB or tumor neoantigen burden (TNB), consistent with its lack of effect on tumor progression (Extended Data Fig. 2k–p). However, stratifying mutations by CCF revealed a significant increase in clonal (CCF ≥ 0.75), but not subclonal (CCF ≤ 0.5), TMB and TNB (Fig. 3h,i and Extended Data Fig. 3h,i). T cell-depleted tumors also showed a striking difference in CCF distribution and median CCF of expressed neoantigens, with significant enrichment of clonal neoantigens (Fig. 3j,k and Extended Data Fig. 3j). We observed similar results in the colon Msh2KO model (Extended Data Fig. 3k–n). It is interesting that tumors from ICB-treated animals showed significantly lower subclonal, but not clonal, TMB and TNB, suggesting possible simultaneous loss of subclonal populations and selective expansion of others in this context of ICB nonresponse (Fig. 3h,i and Extended Data Fig. 3h,i). Altogether, these results argue that neoantigens with high, but not low, clonal fraction are negatively selected by the adaptive immune system during tumor progression. By extension, immunosurveillance in mutationally unstable tumors promotes ITH by selectively pruning clonal neoantigens and thereby increasing the relative fraction of subclonal neoantigens. ICB treatment may lower the CCF threshold for T cell-mediated elimination, but probably also drives expansion of less immunogenic subclones that maintain an overall state of ITH in nonresponders.
Intratumor heterogeneity enables immune evasion
To evaluate the impact of ITH, specifically tumor cell clonality, on immunogenicity in our models, we assessed survival of animals after orthotopic transplantation of the lung tumor cell lines and clones described above. The parental MSH2 knockout lines and low TMB control line were similarly nonimmunogenic, showing no difference in disease progression with and without continuous T cell depletion (Fig. 4a). This is consistent with the lack of immunogenicity of tumors induced by chemical carcinogens in immunocompetent animals and the principle of immunoediting6,29. Unlike MMRd in the autochthonous model, however, mice transplanted with the parental Msh2KO line (09-2) or an equal mixture of 09-2-derived Msh2KO clones (M1–8) and treated with ICB showed 20–30% durable responses and reduced hazard ratios (HRs) (0.34–0.50) over 30 weeks (Fig. 4b). Mice transplanted individually with five otherwise nonimmunogenic Msh2KO clones (M1, M2, M4, M5 and M6) and treated with ICB showed 30–75% durable responses and even further reduction in HRs (0.05–0.32) over 30 weeks (Fig. 4b). It is interesting that the other three Msh2KO clones (M3, M7 and M8) were strongly immunogenic even without ICB treatment, efficiently driving disease only with T cell depletion (Fig. 4b). These differences were not the result of tumor intrinsic growth rates or loss of antigen presentation, because all clones showed similar in vitro growth kinetics and readily expressed major histocompatibility complex class I (MHC-I) (H-2Kb, H-2Db) and programmed death-ligand 1 (PD-L1) on stimulation with interferon-γ (IFN-γ; Extended Data Fig. 4a–e). Although M3, M7 and M8 did not show increased TMB/TNB relative to the other clones, they did express on average higher H-2Kb/H-2Db with IFN-γ stimulation, particularly the most immunogenic clone, M3, although this was not significant (Extended Data Fig. 4b,c). It is therefore possible that the higher baseline immunogenicity of these clones is due in part to greater surface presentation of neoantigens. Consistent with MMRd-derived neoantigens underlying the immunogenicity and ICB responsiveness of the clones, mice transplanted individually with single-cell clones derived from the low TMB control line (13-1) were uniformly and completely unresponsive to ICB (Extended Data Fig. 4f). Altogether, these results establish a model of MMRd wherein increasing mutational clonality (autochthonous tumors < cell lines < mixture of clones < clones) correlates with immunogenicity and ICB response.
Given that the parental cell line 09-2 arose in an immunocompetent host yet contained highly immunogenic subclones (M3, M7 and M8), we reasoned that tumors evade immunosurveillance not only by selective outgrowth of nonimmunogenic subclones but passively through the failure of immunity to eliminate otherwise immunogenic tumor cells present at a low clonal fraction. To test this hypothesis, we collected lung tumors and metastases from mice orthotopically transplanted with an equal mixture of clones M1–8 and reconstructed their clonal makeup by ultradeep, targeted amplicon sequencing of unique clone-defining SNVs (Methods, Fig. 4c and Extended Data Fig. 4g,h). Despite the immunogenicity of M3, M7 and M8, these clones were detected in 12 of 20 tumors analyzed from immunocompetent animals at clonal fractions ≥1%. Of these 12 animals 5 also developed liver metastases, 3 of which were clonal outgrowths of an immunogenic clone (M8) (Extended Data Fig. 4h). In contrast, none of the immunocompetent animals transplanted with M3, M7 or M8 alone formed metastases. This pattern was not substantially different to that of transplants in continuously T cell-depleted or ICB-treated mice. However, tumors from T cell-depleted mice on average showed significantly lower clonal diversity with greater dominance by individual clones (Fig. 4c,d). In addition, tumors from immunocompetent animals harboring one or more immunogenic clones (M3, M7 or M8) at ≥1% fraction were significantly more heterogeneous than tumors without these clones present (Fig. 4e). These results are in agreement with experiments in the autochthonous models (Fig. 3h–k and Extended Data Fig. 3h–n) and further support a revised understanding of immunosurveillance as a process that strongly selects against clonal, but not subclonal, neoantigens.
Neoantigen clonality tunes the T cell response
To gain further mechanistic understanding of how T cells shape clonal architecture, we first profiled the MHC-I immunopeptidome45 of M1–8 clones (Supplementary Table 2). Using quantitative tandem mass spectrometry (MS–MS), we identified seven neoepitopes with relative abundance patterns consistent with the associated mutations in the clones. Of these, five neoepitopes (71%) exhibited immunogenicity in normal mice after dendritic cell prime/boost/boost vaccination, by IFN-γ enzyme-linked immune adsorbent spot (ELISpot) and/or MHC-I:epitope tetramer staining of splenocytes (Extended Data Fig. 5a–e). One neoepitope unique to the M5 clone, QAYAFLQHL, elicited a higher-magnitude, antigen-specific CD8+ T cell response than the highly immunogenic model neoantigen SIINFEKL, from chicken ovalbumin46 (Extended Data Fig. 5d). Although MS immunopeptidomics is far from exhaustive47, our identification of five bona fide immunogenic neoepitopes supports the notion that neoantigens underlie the enhanced immunogenicity of Msh2KO clones (Fig. 4b and Extended Data Fig. 4f).
Next, we transplanted the M5 clone into the lungs of syngeneic mice at different clonal fractions, diluted with the other M1–8 clones, and analyzed the M5-specific CD8+ T cell response using flow cytometry with MHC-I:QAYAFLQHL dextramers (Fig. 5a). Consistent with sensitivity of the M5 clone to ICB (Fig. 4b), a robust QAYAFLQHL-specific T cell response was induced in lungs and mediastinal draining lymph nodes (mLNs) of clonally transplanted mice after 2 weeks of ICB treatment. As the clonal fraction and total number of M5 cells were decreased, however, the magnitude of this response also decreased (Fig. 5b–d and Extended Data Fig. 5f–h). Similar results were obtained in analogous experiments with the M2 clone using MHC-I:AALQNAVTF tetramers to analyze M2-specific CD8+ T cells (Extended Data Fig. 5i–k). Surprisingly, the quality of the QAYAFLQHL-specific T cell response also decreased with lower clonal fraction, with a significantly smaller percentage of QAYAFLQHL-specific T cells expressing the major effector protease of cytotoxic T cells, granzyme B (GZMB) and a significantly greater percentage expressing TCF1 (Fig. 5e–i and Extended Data Fig. 5l–p). Expressed highly in naive and early activated T cells, the transcription factor TCF1 is lost during effector differentiation48. We have previously shown that TCF1+GZMB− tumor-specific T cells are enriched and characteristic of tolerogenic dysfunction in a model of microsatellite stable colon cancer49. It is interesting that transplantation of an equal number of M5 cells (100,000) at clonality versus CCF of 0.5 (with 100,000 M1–4,6–8 cells) resulted in no significant difference in magnitude, GZMB+ or TCF+ percentage of the T cell response (Fig. 5d,g,i and Extended Data Fig. 5h,n,p), suggesting that total mass of neoantigen-expressing tumor cells is the primary determinant of quality of the neoantigen-specific T cell response. Altogether, these results provide a compelling rationale for why immunosurveillance fails to delete tumor cells bearing subclonal neoantigens (Fig. 5j).
Clonal TNB predicts ICB response in human MMRd cancer
To explore the translational relevance of our findings, we reanalyzed sequencing data from two clinical trials of anti-PD-1 treatment in advanced MMRd colorectal cancer (CRC; Bortolomeazzi et al.50) and gastric cancer (Kwon et al.35), including 16 and 13 patients, respectively. Predicting neoantigens and the associated CCFs (Supplementary Table 3), we asked whether clonal TNB and ITH are associated with the response to anti-PD-1. Clonal (CCF ≥ 0.75), but not subclonal (CCF ≤ 0.5), TNB was significantly associated with objective response (OR), whereas high ITH index (subclonal to clonal neoantigen ratio) was significantly associated with nonresponse (NR) (Fig. 6a,b and Extended Data Fig. 6a–d). Total TNB was also significantly associated with OR (Extended Data Fig. 6e–g), probably because tumors in these studies had generally more clonal than subclonal neoantigens. Although clonal TNB was generally correlated with total TNB, two notable outliers, both nonresponders, had high total TNB but very low clonal TNB (Extended Data Fig. 6h).
Clonal, but not subclonal, TNB was also significantly associated with longer progression-free survival (PFS) in combined analysis of the trials, whereas the ITH index was associated with shorter PFS (Fig. 6c–e). Total TNB also correlated with longer PFS, but did not reach the same level of significance as clonal TNB (Extended Data Fig. 6i). Importantly, there was no significant difference in PFS between the two trials (Extended Data Fig. 6j), justifying their combination in these analyses. Overall, these results support the major conclusions from our mouse models and suggest that clonal TNB is a more accurate predictor of ICB response than overall TMB in MMRd cancers. Although the trials that we reanalyzed in the present study are limited by small sample size and will require validation in larger prospective investigations in MMRd cancers, which are currently lacking, our results align with a growing body of literature that supports the superior predictive power of clonal versus total TMB/TNB as a biomarker of ICB response across other cancers20–26.
Discussion
In this Article, we report sporadic MMRd mouse models that recapitulate critical features of human lung and colon cancer, including genetics, histopathology and in situ initiation in the relevant tissue microenvironment. Unlike previous studies that demonstrated a role for MMRd and TMB in immunotherapy response, in which mutagenesis occurred in vitro26–28, our models enable study of mutations continuously acquired in vivo from tumor initiation through advanced disease. Other studies showed immunotherapy efficacy in autochthonous cancer models employing tissue-specific knockout of Msh2 or activation of mutant Pole (PoleP286R) during embryogenesis51,52. These previous models recapitulate familial cancers like Lynch syndrome in the accumulation of mutations in normal parenchyma preceding transformation (<5% of colorectal cancers (CRCs)), but not sporadic loss of MMR in established tumors (10–15% of CRCs)14,15. In contrast, our models, which more closely resemble sporadic MMRd, followed a model of neutral evolutionary dynamics44 and did not display increased baseline immunogenicity or response to ICB, probably owing to timing of MMR inactivation and the resulting patterns of clonal expansion53. We induced MMRd concomitantly with tumor initiation, resulting in mutation accumulation during exponential cellular expansion that is reminiscent of so-called ‘born to be bad’ colon tumors that follow an early explosive growth trajectory54. It is interesting that it was recently shown that MMRd occurring either de novo or induced by temozolomide treatment in advanced glioma led to extensive ITH and poor ICB response38.
Tumors in our models developed extensive ITH and a high burden of subclonal mutations that was not detected by bulk sequencing analysis, highlighting the importance of standardization of clinical pipelines to estimate TMB. Strategies to robustly assay ITH, such as multi-region or single-cell DNA-sequencing, may enhance the predictive utility of TMB. Overall, the results from our models strongly support a potential role of ITH in the failure of ICB in some patients with MMRd cancer. However, studies in the mouse cannot be generalized to humans and these results will require clinical validation. Our reanalysis of MMRd cancer clinical trials35,50 showed significant association of clonal neoantigen burden and ITH index with ICB response and lends credibility to our models, but is limited by small sample size and not powered to differentiate any predictive value of subclonal neoantigen burden or ITH index beyond their association with clonal neoantigen burden. Larger prospective clinical studies will be required to definitively establish the role of ITH and its potential utility as a biomarker of ICB response in human MMRd cancers5,6.
Similar to other mechanistic studies of ITH19,26, we found that experimental reconstitution of ITH potentiated immune evasion. Recently, a preclinical study showed that genetic or pharmacological enrichment of MMRd in the context of mixed MMRd/MMR-proficient (MMRp) cell-line transplants potentiated rejection of the MMRp fraction55. This must be interpreted with care, however, because both MMRd and MMRp fractions were derived from the same carcinogen (N-nitroso-N-methylurethane)-induced colon carcinoma line, CT26, and probably share a high burden of clonal neoantigens that may underlie rejection of the MMRp fraction. What distinguishes our study from these prior cell line-based studies is that mutagenesis occurred spontaneously entirely in vivo. That some of the subclones we isolated were highly immunogenic on re-transplantation at clonal, but not subclonal, fraction suggests that they were protected from deletion by high ITH in the original tumor. This probably occurs passively due to low cellularity precluding efficient crosspriming and driving early T cell dysfunction or ignorance19,49. Indeed, our high-resolution analysis of neoantigen-specific T cells after ICB treatment showed that the magnitude and effector potential of the response are attenuated with decreasing neoantigen clonality. This is in agreement with an orthogonal study that used retroviral neoantigen libraries to manipulate clonal fraction19. However, although that study concluded that neoantigen clonal fraction, not total cellularity, is the major determinant of the response, our experiments suggest the converse. Additional studies are needed to resolve these and other complexities, including the interplay of ITH with T cell interclonal dynamics, where the immune response may be deployed against a limited subset of ‘dominant’ neoantigens56, at the expense of recognition of lower affinity, poorly expressed or subclonal neoantigens.
Paradoxically, we found that immunosurveillance may exacerbate ITH by shaping the clonal architecture of tumors while failing to delete most neoantigens. Therefore, we conclude that ITH is shaped by the interplay of tumor intrinsic, positively selective and immunogenic, negatively selective evolutionary pressures. These results provide nuance to our understanding of immunoediting and highlight the power of our models to capture a hallmark of human cancer that is lacking in carcinogen-induced and genetically engineered models alike57—the gradual accumulation of mutations over time36.
Our results raise important questions related to therapies aimed at deliberately increasing TMB to enhance tumor immunogenicity27,58. These strategies will probably fail to elicit meaningful immune engagement. More concerning, collateral mutagenesis may drive more aggressive cancer, therapy resistance or secondary malignancies. Future studies with models that enable temporal control of cooperating tumorigenic events will be helpful in determining the impact of cancer clonal selection on immunosurveillance and immunotherapy response.
Methods
Mice
All animal use was approved by the Department of Comparative Medicine at the Massachusetts Institute of Technology (MIT) and the Institutional Animal Care and Use Committee under protocol no. 0714-076-17. Mice were housed with a 12-h light/12-h dark cycle with temperatures in the range 20–22 °C and 30–70% humidity. KrasLSL-G12D (ref. 59); Trp53flox/flox (ref. 60); R26LSL-Cas9 (ref. 61) (KP; R26LSL-Cas9) mice were maintained on an F1 (C57BL/6 × 129/SvJ) background. KrasLSL-G12D; Trp53flox/flox; Msh2flox/flox (ref. 32) (JAX, stock no. 016231) and R26Cas9 (ref. 62) (JAX, stock no. 028555) mice were maintained on a pure C57BL/6 background. Lung cell lines were isolated from tumors induced in albino C57BL/6 hosts chimeric for tissue derived from blastocyst injection of a KP; R26LSL-Cas9 embryonic stem (ES) cell line (12A2) of mixed C57BL/6 and 129/SvJ background and male sex, as previously described61. In orthotopic lung studies, cell lines were transplanted into male chimeras generated from the same 12A2 ES cell line at 10–16 weeks of age. These chimeras are tolerized to C57BL/6 and 129/SvJ tissues, potential antigens in the R26LSL-Cas9 allele and PuroR introduced into cell lines with Msh2 re-expression (unrecombined KrasLSL-G12D expresses PuroR). Autochthonous tumors in lung and colon were induced in approximately equal numbers of male and female mice at 8–16 weeks of age.
Tumor models
Tumor burden, where measurable, was not allowed to exceed 1 cm2 and animals showing discomfort or distress were humanely euthanized following the recommendations of the American Veterinary Medical Association. Autochthonous lung tumors in KrasLSL-G12D; Trp53flox/flox; R26LSL-Cas9 and KrasLSL-G12D; Trp53flox/flox; Msh2flox/flox mice were induced by intratracheal instillation of 2 × 104 transduction units (TU) of lentivirus and 2 × 108 plaque-forming units of adenovirus-expressing Cre driven by the alveolar type II cell-specific surfactant protein C promoter (SPC-Cre), respectively, as previously described31. Autochthonous colon tumors in R26Cas9 mice were induced by endoscope-guided submucosal injection in the distal colon, as previously described33,63. Two injections at 1.5 × 106 TU of lentivirus in 50 μl of Opti-MEM were delivered per mouse. Lentivirus was produced in HEK293 cells (American Type Culture Collection) and concentrated as previously described31, and functional titers (Cre activity, mScarlet fluorescence) measured as previously described64. Cell lines were orthotopically transplanted by intratracheal instillation of 1 × 105 cells in 50 μl of Spinner Modification of Minimal essential Eagle’s Medium (SMEM)/5 mM EDTA, followed by a 30-μl rinse with the same medium. Cell lines were established from autochthonous lung tumors by microdissection and mechanical mincing in digestion buffer (Hanks’ balanced salt solution with 1 M Hepes, 125 units ml−1 of collagenase type IV (Worthington) and 20 μg ml−1 of DNase (Sigma-Aldrich)), followed by incubation at 37 °C with gentle agitation for 30 min and plating in RPMI + 10% fetal bovine serum (FBS). Lines were plated into 50:50 RPMI/Dulbecco’s modified Eagle’s medium (DMEM) + 10% FBS at first passage and DMEM + 10% FBS at second passage and thereafter. Cells were taken for WES at the third passage. Msh2-expressing lentivirus was produced as above. Cells were incubated with lentiviral supernatant and 3 d later selected and maintained thereafter on medium with 6 μg ml−1 of puromycin (Thermo Fisher Scientific).
WES and mutation calling
Whole-exome libraries were generated using the SureSelect XT Mouse All Exon (Agilent) target enrichment kit; 100-bp paired-end sequencing of samples was performed on the Illumina HiSeq 4000 platform, with the exception of M1–8 passage of 20 single-cell clones, which were 150-bp paired-end sequenced on the Illumina NovaSeq 6000 S4 platform. Library preparation and sequencing to 100× on-target coverage were performed by Psomagen. Raw sequencing reads were mapped to the GRCm38 build of the mouse reference genome using BWA-MEM v.0.7.17-r1188 (ref. 65). Aligned reads in BAM format were processed following the Genome Analysis Toolkit (GATK) v.4.1.8.0 Best Practices workflow to remove duplicates and recalibrate base quality scores66. Median coverage was 98 (25th quartile = 86; 75th quartile = 112), 90 (25th quartile = 82; 75th quartile = 108) and 103 (25th quartile = 100; 75th quartile = 107) for normal tails, autochthonous tumors and cell lines, respectively.
Somatic SNVs and indels were detected using Mutect2, MuSE v.1.0rc67, VarDict v.1.8.2 (ref. 68) and Strelka2 v.2.9.2 (ref. 69) against matched normal tails. Mutect2 was run using a panel of normals compiled from the 18 tails analyzed in the present study. Each caller was run independently on each tumor-normal pair and calls were integrated using SomaticCombiner v.1.03 (ref. 70). For colon tumors, a panel of four normal tails was used to generate the matched normal control for all samples, because these mice were of pure background. We considered SNV mapping to only exonic regions that were detected by Mutect2 and at least one of the other algorithms. To increase accuracy of indel detection, only indels detected by at least two algorithms were considered. Variants mapping to dbSNP (build ID 150) positions were discarded. Mutations identified in tumors from two or more animals or in at least 50% of tumors from the same animal were discarded. No VAF filter was applied. Microsatellites were annotated using SciRoKo v.3.4 (ref. 71), with minimum score = 8, seed length = 8, repeats = 2 and mismatch penalty = 1.
CCF estimation
Somatic copy-number aberrations were detected by integrating output of GATK and FreeBayes v.1.3 (ref. 72) using PureCN v.1.16.0 (ref. 73). Briefly, GATK4 Somatic CNV workflow was utilized for normalization of read counts and genome segmentation using the panel of normals from all tails. FreeBayes was used to obtain B-allele frequencies for dbSNP variant sites. PureCN was used to integrate output from GATK and FreeBayes to estimate allele-specific consensus copy-number profile, purity and ploidy of each sample. The ploidy of cell lines was determined experimentally by metaphase spreads and input into PureCN. Finally, the CCF for each SNV and indel was computed using the R (v.4.0.2) package cDriver43.
Mutational signature and MSI analyses
Mutational signatures were extracted with the R (v.4.0.2) package MutationalPatterns74 (v.3.2.0) using the COSMIC Mutational Signatures catalog v.3 (ref. 75). We used the function fit_to_signatures with default parameters and included only those mutational processes known to operate in human colon and/or lung cancer (excluding tobacco smoking)75: SBS1, SBS5, SBS6, SBS10a, SBS10b, SBS14, SBS15, SBS17a, SBS17b, SBS18, SBS21, SBS26, SBS28, SBS37, SBS40 and SBS44. For visualization, we collapsed signatures of MMRd (SBS6, SBS14, SBS15, SBS21, SBS26 and SBS44; labeled as MMRd) and POLE deficiency (SBS10a, SBS10b, SBS28 and SBS17b; labeled as POLE). Goodness of fit was determined by computing cosine similarity between observed and reconstructed mutational spectra using estimated signature contributions. To estimate the contribution of each mutational process to the human MSI CRC mutational spectrum, we analyzed somatic mutations in the Kwon et al. cohort (non-formalin-fixed paraffin-embedded samples)35 following the methodology described above. Sample MSI score was calculated using MSIsensor-pro (v.1.2.0)34 against matched normal tails.
Clonal deconvolution by targeted amplicon sequencing
To identify private somatic SNVs for distinguishing individual clones in the M1–8 mixed clone tumors (Fig. 4c and Extended Data Fig. 4g,h), we compiled all clonal SNVs in copy-number-neutral regions (four copies, as all lines were tetraploid by metaphase spreads). We then checked the BAM files across all other samples for the complete absence of reads supporting the alternative allele (base quality >20, mapping quality >30) using an in-house Python script relying on the Pysam library. Four private SNVs for each clone and four common SNVs were validated by PCR amplification and Sanger sequencing before proceeding. The 200- to 250-bp regions spanning these SNVs were either individually PCR amplified from samples, gel purified and combined, or amplified in parallel using a multiplexed PCR panel with primers carrying unique molecular indices (CleanPlex UMI Custom Panel, Paragon Genomics). Amplicon libraries were sequenced on the Illumina NovaSeq 6000 S4 platform with150-bp paired-end chemistry.
Reads were aligned to a fasta reference file of all targets (±250 nt upstream/downstream of SNV, GRCm38) using BWA-MEM (v.0.7.17-r1188)65, following the GATK Best Practices workflow. Pileups were generated using the mpileup function of bcftools v.1.10.2 (ref. 76) with --min-BQ 30 and a bed file of all SNV coordinates. For CleanPlex UMI libraries, the following functions in fgbio (v.2.0.1) (https://github.com/fulcrumgenomics/fgbio) were called to extract unique molecular identifiers (UMIs) and call consensus reads: ExtractUmisFromBam, GroupReadsByUmi and CallMolecularConsensusReads. Using a customized R (v.4.0.2) script, total and SNV-specific depths at all locations were extracted. All SNVs were supported by more reads than other alternative alleles in the M1–8 clone-equal mixture control, except for M6_2, which was excluded from subsequent analysis. Background PCR/sequencing error for each SNV was estimated using the median observed frequencies of SNVs in all metastases of different clones, which represented truly clonal controls. SNV frequencies were adjusted by subtracting background values. Clonal percentages in ex vivo tumors were estimated by taking the median of private SNV frequencies, multiplying by 4 (SNVs are 1/4n) and dividing by tumor purity—estimated as the median observed/expected ratio of frequencies of the four common SNVs (present in all clones).
Neoantigen prediction and expression
Variant consequence was annotated using Ensembl Variant Effect Predictor (VEP) v.99 (ref. 77) with Wildtype and Downstream plugins, the VEP cache and reference genome for GRCm38, and the following parameters: --symbol, --terms=SO, --cache, --offline, --transcript_version and --pick. The --pick parameter was reordered from default to report the transcript with most extreme consequence for each variant: rank, canonical, appris, tsl, biotype, ccds, length and mane. Neoepitopes were predicted with C57BL/6 mouse MHC-I alleles, H2-K1 (H-2Kb) and H2-D1 (H-2Db) and variant effect predictions using pVACtools v.1.5.7 (ref. 78). Mutant peptides were generated for peptides that were 8–11 amino acids and MHC:peptide binding affinity was predicted for all peptide:MHC allele pairs with NetMHC-4.0, NetMHCpan-4.0, SMM v.1.0 and SMMPMBEC v.1.0 (refs. 79–82). The median value across all affinity predictions was taken as the final measure of binding affinity. Neoantigens were subset to those with median predicted H-2Kb/H-2Db affinity ≤500 nM. Where multiple neoantigens were predicted for the same SNV, only that with highest predicted affinity was retained.
To assess allele-specific expression of neoantigens, RNA-sequencing (RNA-seq) was performed on autochthonous lung tumors (10 sgMsh2 and 10 sgMsh2 with αCD4/8 treatment) and M1–8 clones. Complementary DNA libraries were prepared using Kapa mRNA Hyperprep and 150-bp paired-end sequencing was performed on the Illumina NextSeq platform. Reads were aligned to the reference genome (GRCm38) using STAR v.2.7.1a83 with outFilterMultimapNmax = 20, alignSJoverhangMin = 8, alignSJDBoverhangMin = 1, outFilterMismatchNmax = 999, outFilterMismatchNoverLmax = 0.1, alignIntronMin = 20, alignIntronMax = 1,000,000, alignMatesGapMax = 1,000,000, outFilterScoreMinOverLread = 0.33, outFilterMatchNminOverLread = 0.33 and limitSjdbInsertNsj = 1,200,000. PCR duplicates were removed using Picard v.2.23.4 (ref. 84). Considering somatic variants identified by WES, we used a customized Python script to interrogate the presence of these variants in the RNA-seq BAM files. Only nonduplicate reads with mapping quality ≥255 and bases with base quality ≥20 were considered to compute VAFs.
Histology and immunohistochemistry
Quantification of lung tumor burden by grade was performed on scans of hematoxylin and eosin (H&E)-stained sections by automated convolutional neural network (CNN)—developed in collaboration with Aiforia Technologies Oy in consultation with veterinarian pathologist R. Bronson. Using semantic multi-class segmentation, the CNN was trained to classify lung parenchyma and adenocarcinoma grades 1–4. For supervised training, selected areas from 93 slides were chosen. The algorithm performed consistently and with high correlation with human graders across multiple validation datasets independent of the training dataset. Algorithm v.NSCLC_v25 was used. Triple staining (CD8a, CD4 and FOXP3) immunohistochemistry (IHC) and CNN quantification (Aiforia) were performed as previously described49. CD3 infiltration in single-stain slide scans was measured as percentage of pixels positive for stain (diaminobenzidine) in Aperio ImageScope. The area of positive and negative MSH2 staining was quantified by manual annotation in QuPath v.0.1.2 (ref. 85).
Western blotting
Cells were lysed in radioimmunoprecipitation (RIPA) buffer (Thermo Fisher Scientific), protein concentration determined using BCA Protein Assay (Thermo Fisher Scientific) and equal protein quantities (20–40 μg) run on NuPage 4–12% Bis–Tris gradient gels (Thermo Fisher Scientific) by sodium dodecylsulfate–polyacrylamide gel electrophoresis and transferred to poly(vinylidene fluoride) membranes. Western blotting was performed against MSH2 (catalog no. D24B5, Cell Signaling Technology) at 1:1,000, MLH1 (catalog no. ab92312, Abcam) at 1:1,000, glyceraldehyde 3-phosphate dehydrogenase (catalog no. 6C5, Santa Cruz) at 1:5,000 and β-actin (catalog no. 13E5, Cell Signaling Technology) at 1:5,000. Blots were stained with horseradish peroxidase (HRP) anti-rabbit immunoglobulin G and developed with Western Lightning Plus-ECL (Perkin Elmer) on X-ray film.
In vivo antibody and chemotherapy dosing
Antibodies were delivered intraperitoneally in 100 μl of phosphate-buffered saline (PBS). αCD4 (catalog no. GK1.5, BioXCell) and αCD8 (catalog no. 2.43, BioXCell) were administered at 200 μg every 4 d. αPD-1 (catalog no. 29F.1A12, BioXCell) was administered at 200 μg 3× a week. αCTLA (catalog no. 9H10, BioXCell) was administered at an initial dose of 200 μg, with subsequent doses at 100 μg, 3× a week. Oxaliplatin (Sigma-Aldrich) and cyclophosphamide (Sigma-Aldrich) (Oxa/Cyc) were co-delivered intraperitoneally in 100 μl of PBS at 2.5 mg per kg body weight and 50 mg per kg body weight, respectively, once a week for 3 weeks, as previously described42.
In vivo tumor imaging and quantification
Lung tumor progression was monitored longitudinally by X-ray microcomputed tomography (μCT) using a GE eXplore CT 120 system, as previously described86. Solid lung volume (tumor burden) was quantified using a customized MATLAB (MathWorks) script, as previously described86. Colon tumor progression was monitored longitudinally using a Karl Storz colonoscopy system with white light and red fluorescent protein fluorescence and biopsy forceps serving as a landmark for objective positioning, as previously described49.
Lentiviral constructs
The U6::sgRNA-EFS::Cre (pUSEC) lentiviral construct86 was digested with BsmBI and sgRNAs cloned as previously described87. H1::sgApc-U6::sgRNA-EFS::mScarlet was generated by Gibson assembly using an H1::sgApc-scaffold gBlock synthetic gene fragment (IDT), PCR amplicons of U6::BsmBI-filler-BsmBI-scaffold, elongation factor-1 (EFS) promoter and mScarlet88, and a lentiviral backbone from the Trono laboratory (Addgene). This was then digested with BsmBI and a second sgRNA cloned as above. The sgRNA sequences, including previously published sgApc33 and sgCtl (mScarlet targeting)64 are provided in Supplementary Table 4. The sgRNA controls against Olfr102 and mScarlet were used interchangeably with no observed differences in tumorigenesis. PGK::Msh2-EFS::PuroR was generated by Gibson assembly using multiple gBlocks spanning murine Msh2 (C57BL/6), PCR amplicons of PGK (3-phosphoglycerate kinase) promoter, EF-1 Alpha Short (EFS) promoter and PuroR, and the Trono lentiviral backbone. All primers were ordered from Sigma-Aldrich.
Validation of CRISPR–Cas9 editing and estimation of tumor purity
To validate efficiency of gene editing by clustered regularly interspaced short palindromic repeats (CRISPR)–Cas9, 200- to 250-bp regions spanning sgRNA sites in the genome were amplified and deep sequenced (Massachusetts General Hospital DNA Core). Colon tumor purity was estimated using a non-wild-type allele fraction at the sgRNA-targeted site in Apc. Loss of Apc is prerequisite for tumorigenesis in the model and thus an assumption was made that all tumor cells harbor loss-of-function edits at this locus. Tumor purity in sgMsh2-targeted lung tumors was estimated using WES BAM coverage spanning exons of the Trp53flox allele and flanking genes (Wrap53, Atp1b2), which were retrieved using the bedcov function of SAMtools v.1.10. The ratio of median coverage in flanking exons (Wrap53 exons 0–9, Trp53 exon 11 and Atp1b2 exons 0–6) versus Trp53 exons flanked by Cre loxP sites (exons 2–10) was calculated in tumors and normal tails. This ratio in tumors, representing the extent of Trp53flox recombination, was then normalized to the median ratio across matched normal tails to estimate tumor purity, with the assumption that all tumor, but not normal, cells underwent complete recombination of Trp53flox alleles. Efficiency of Msh2 knockout in KP; Msh2flox/flox lung tumors was similarly estimated by taking the ratio of reads at the exon flanked by loxP sites (exon 12) and surrounding exons, and adjusting this by tumor purity.
In vitro cell-line assays
Serial live cell imaging of cell lines grown in 96-well plates (Corning) and quantification of confluence were performed with an IncuCyte S3 (Sartorius). Eight replicate wells were seeded with 100 cells and imaged every 3 h for ~6 d. Murine IFN-γ (PeproTech) was used for in vitro stimulation of cell lines for 24 h, followed by live/dead staining (ghost ef780 (Corning), 1:500) in PBS and surface staining in 1 mM EDTA, 25 mM Hepes, 0.5% heat-inactivated FBS in PBS with anti-H-2Kb allophycocyanin (APC) (catalog no. AF6-88.5.5.3, Thermo Fisher Scientific, 1:200), anti-H-2Db FITC (catalog no. 28-14-8, Thermo Fisher Scientific, 1:200) and anti-PD-L1 phycoerythrin (PE)-Cy7 (catalog no. 10F.9G2, BioLegend, 1:200). Samples were run on a BD LSRFortessa using BD FACSDiva v.8.0 software. Results were analyzed in FlowJo v.10.4.2, excluding dead (ef780-positive) cells.
Phylogenetic tree analysis
All somatic SNVs and indels called by the WES analysis pipeline in M1–8 clones and the 09-2 parental cell line were considered in constructing a phylogenetic tree. The R (v.4.0.2) Bioconductor package phangorn (v.2.7.0) was used to construct a tree from a binary presence/absence matrix of mutations across clones and 09-2_par. Specifically, the function prachet was used to calculate the tree using the parsimonious ratchet method and the function acctran was used to calculate branch lengths.
MHC-I immunopeptidomics
MHC-I (H-2Kb and H-2Db) peptide isolation was performed on 108 cells per triplicate for each M1–8 clone as we have previously described49. Cells were grown to confluence before stimulation with 10 ng ml−1 of murine IFN-γ (PeproTech) for 18 h before collection. Pulldowns were performed with 40 μl (bed volume) of rProtein A Sepharose beads (GE Healthcare) preloaded with 1 mg of anti-H-2Kb antibody (Y3, BioXCell) and 1 mg of anti-H-2Db antibody (catalog no. 28-14-8S, purified from HB-27 hybridoma), performed sequentially. Peptides were eluted in 500 μl of 10% acetic acid and purified with 10-kDa MWCO spin filters (PALL Life Science).
MS–MS was performed on eluted peptides as we have previously described49. Tandem mass spectra were searched with Sequest (Thermo Fisher Scientific, v.IseNode in Proteome Discoverer 2.5.0.400). Sequest was set to search the mouse Uniprot database (3 July 2020 version) with 55,650 entries, including common contaminants and green fluorescent protein, Cas9, puromycin and P2A (present in the cell lines) assuming no digestion enzyme, with fragment and parent ion mass tolerances of 0.02 Da and 10.0 p.p.m., respectively. TMTpro was added as a fixed modification on the carboxy and amino termini of peptides. Oxidation of methionine was specified in Sequest as a variable modification. The resulting peptides were filtered to exclude peptides with an isolation interference >30% and p.p.m. error >±3 of the median p.p.m. error of all peptide-spectrum matches. Peptides were further prioritized based on concordance of relative abundance across clones with presence/absence of the associated somatic mutation.
Dendritic cell vaccination, ELISpot and MHC-I multimer staining
Bone marrow-derived dendritic cells were prepared, activated, loaded with putative neoepitopes and injected intradermally at the base of the tail of healthy C57BL/6 mice, followed by two heterologous boosts, as previously described45. A week after the second boost, spleen and peripheral blood were collected for IFN-γ ELISpot and MHC-I:epitope tetramer flow cytometric assays. Red blood cells were first lysed with ACK lysis buffer. IFN-γ ELISpot was performed following the manufacturer’s recommendations (ImmunoSpot, Cellular Technology Limited) using 750,000 splenocytes per well. H-2Kb tetramers were custom generated as previously described45 and used at 1:200 dilution. H-2Db tetramers were generated using UV-labile monomers (UVX Flex-T, BioLegend) following the manufacturer’s recommendations and used at 1:50 dilution. H-2Kb:QAYAFLQHL dextramers were generated using the U-Load Dextramer Kit (Immudex) following the manufacturer’s recommendations and used at 1:10 dilution.
Tissue preparation and flow cytometry
Then 2 min before euthanasia, mice were injected retro-orbitally with anti-CD45 APC-eFluor786 (catalog no. 30-F11, BioLegend, 1:50) to stain intravascular immune cells. Mediastinal lymph nodes and whole tumor-bearing lungs were collected and mechanically dissociated in RPMI-1640 (Corning) with 5% heat-inactivated (HI)-FBS (collection medium). Lungs were placed in digestion buffer containing 500 U ml−1 of collagenase type IV and 20 μg ml−1 of DNase (Sigma-Aldrich) in collection medium, lightly minced and digested at 37 °C for 30 min with gentle agitation, and further dissociated with a gentleMACS Octo Dissociator (Miltenyi Biotec) on the tumor_imp1.1 setting and passed through a 100-μm filter. Live/dead staining (Ghost Dye Red 780, Corning, 1:500 dilution) was performed in PBS and surface stains in FACS buffer (1 mM EDTA, 25 mM Hepes and 0.5% HI-FBS in PBS). For assessment of T cell depletion (Extended Data Fig. 2m), the following antibodies were used for surface staining: CD45 BV785 (catalog no. 30-F11, BioLegend, 1:200), CD3 BV421 (catalog no. 17A2, BioLegend, 1:400), CD8a BUV395 (catalog no. 53-6.7, BioLegend, 1:400) and CD4 AF647 (catalog no. RM4-5, BioLegend, 1:400). For analysis of neoantigen-specific T cells (Extended Data Fig. 5e), the following antibodies were used for surface staining: CD8a BUV395 (as above), CD4 BV711 (catalog no. RM4-5, BioLegend, 1:200), CD44 BV785 (catalog no. IM7, BioLegend, 1:200) and GZMB PE-CF594 (catalog no. GB11, BD Biosciences, 1:250); and intracellular staining: TCF1 AF647 (catalog no. C63D9, CST, 1:200). Cells were fixed for 1 h at room temperature in Fixation/Permeabilization Concentrate (Thermo Fisher Scientific) diluted 1:3 in Fixation/Permeabilization diluent (Thermo Fisher Scientific) and washed in permeabilization buffer (Thermo Fisher Scientific). Intracellular staining was performed in permeabilization buffer overnight at 4 °C. Cells were washed and resuspended in FACS buffer for analysis on a BD LSRFortessa four-laser flow cytometer running BD FACSDiva v.8.0 software. Results were analyzed in FlowJo v.10.4.2. Single lymphocytes were gated first on forward versus side scatter (FSC-A versus SSC-A) and then FSC-A versus FSC-H. Then, live CD8+ T cells were gated on positive CD8α and negative Ghost Red Dye 780 staining. Antigen-specific CD8+ T cells were further gated on CD44 positivity and tetramer/dextramer positivity in two channels (PE/APC). Expression of additional markers was analyzed specifically in this neoantigen-specific CD8+ T cell population.
Analysis of human MMRd cancer clinical trials
Raw WES reads from Bortolomeazzi et al.50 and Kwon et al.35 trials (ClinicalTrial.gov identifiers: NCT02563002 and NCT02589496) were mapped to the reference human genome (GRCh38) using BWA-MEM v.0.7.17-r1188 (ref. 65). Aligned reads were processed as BAMs following the GATK v.4.1.8.0 Best Practices workflow to remove duplicates and recalibrate base quality scores66. Somatic SNVs and indels were detected using the same pipeline and callers described above for mouse tumors. CCF values were estimated as described above.
Raw RNA-seq reads were mapped to the human reference genome (GRCh38) using STAR v.2.7.1a83. STAR was run using the same parameters as described above in the mouse analysis. The function Htseq-count from the Python library HTSeq (v.2.0.1)89 was used to compute read counts for each gene (Ensembl release GRCh38.90), which were normalized to transcripts per million. Neoantigens were predicted and prioritized as described above in the mouse analysis. Clonal and subclonal neoantigens were classified as CCF ≥ 0.75 and <0.5, respectively.
Clinical responses were binned into two groups: OR, including partial and complete responders and NR, including patients with stable and progressive disease. PFS analysis was performed on a combination of both trials35,50 with the trial study as a covariate. Importantly, there was no significant difference in PFS between patients from the two trials. Cox’s regression was performed in R (v.4.0.2) using the package survival (v.3.4-0)90 with comparisons of patients in the upper versus lower quartiles of each variable tested.
Statistics and reproducibility
Statistical analyses and plotting were performed in R (v.4.2.1) using built-in functions and ggplot2 (v.3.4.1), beeswarm (v.0.4.0), corrplot (v.0.88), eulerr (v.6.1.0), gplots (v.3.1.3), survival (v.3.4.0), survminer (v.0.4.9) and RColorBrewer (v.1.1.3). To assess statistical significance, Fisher’s exact 2 × 2 test was used on categorical variables and two-tailed Wilcoxon’s rank-sum test or Student’s t-test (where the assumption of normality was met) was used on continuous variables. HRs were calculated and compared using Cox’s proportional hazards regression. Multiple-comparison corrections were performed using Holm’s method. No statistical method was used to predetermine sample size. In preclinical trials of lung and colon models, only those animals with apparent tumors by μCT or colonoscopy were recruited. No other data were excluded from analyses. Preclinical trials were randomized and investigators blinded to allocation during dosing, imaging and quantification. No experiments failed to replicate.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41588-023-01499-4.
Supplementary information
Source data
Acknowledgements
This work was supported by the National Cancer Institute’s Cancer Center Support Grant (nos. P30-CA14051 and R01 CA233983 to T.J.) and the Howard Hughes Medical Institute (to T.J.) P.M.K.W. was supported by a Damon Runyon Fellowship Award. The funders had no role in the study design, data collection and analysis, decision to publish or preparation of the paper. We thank K. Yee, J. Teixeira, K. Anderson and M. Magendantz for administrative support, and our colleagues in the Jacks laboratory and the broader community at the Koch Institute of MIT for thoughtful discussions and technical advice. We thank the Koch Institute Swanson Biotechnology Center for core support from the BioMicro Center and Histology, Proteomics, and Flow Cytometry facilities, with particular thanks to S. Levine for high-throughput sequencing support, K. Cormier and C. Condon for histology support, A. Koller for help with design and analysis of MS experiments, and M. Griffin, M. Jennings and G. Paradis for flow cytometry support. F.M. and I.C.C. thank EMBL for funding. We are also grateful for collaboration with T. Westerling and Aiforia in developing automated CNNs for lung tumor grade and IHC quantification.
Extended data
Author contributions
P.M.K.W. and T.J. conceived the study. P.M.K.W., T.J. and I.C.C. supervised the study. P.M.K.W., O.C.S., H.H., N.J.S., A.M.J., W.M.R., D.A.C., G.C.J. and S.L.S. carried out all aspects of animal care and experimentation. F.M. and I.C.C designed sequencing analysis pipelines and carried out analysis with P.M.K.W. Z.A.E. designed and executed the neoantigen prediction pipeline. All other data analysis was carried out by P.M.K.W. A.B. provided conceptual and technical guidance in sequencing analysis. S.N. designed dual sgRNA lentiviral constructs. D.Z., M.C.B., A.J., J.J.P. and A.M.C. produced and validated critical reagents. R.T.B. provided validation of ground truth for training the Aiforia lung tumor grading CNN. P.M.K.W., I.C.C. and T.J. wrote the paper, with feedback from all authors.
Peer review
Peer review information
Nature Genetics thanks Charles Swanton and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
FundingInformation
Open access funding provided by European Molecular Biology Laboratory (EMBL).
Data availability
Raw exome sequencing and RNA-seq data from Bortolomeazzi et al.50 are available through controlled-access application via the European Genome-Phenome Archive (hosted by the EMBL-EBI and the CRG) under accession no. EGAD00001006165. Raw sequencing data from Kwon et al.35 were downloaded from the European Nucleotide Archive (ENA) database under primary accession no. PRJEB40416. The sequencing data generated in the present study are available at ENA under primary accession no. PRJEB56609. Raw MS data generated in the present study are available at MassIVE under accession no. MSV000092096. Source data are provided with this paper.
Code availability
Customized scripts used in the analysis of sequencing data in the present study are available at: https://github.com/cortes-ciriano-lab/MMRd_immunogenicity.
Competing interests
T.J. is a member of the Board of Directors of Amgen and Thermo Fisher Scientific, and a co-founder of Dragonfly Therapeutics and T2 Biosystems. He serves on the Scientific Advisory Board of Dragonfly Therapeutics, SQZ Biotech and Skyhawk Therapeutics, and is the President of Break Through Cancer. None of these affiliations represents a conflict of interest with respect to the design or execution of the present study or interpretation of data presented in this manuscript. The Jacks laboratory also currently receives funding from the Johnson & Johnson Lung Cancer Initiative and the Lustgarten Foundation for Pancreatic Cancer Research, but this did not support the research described in the present study. All other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Haley Hauck, Olivia C. Smith.
Contributor Information
Peter M. K. Westcott, Email: westcott@cshl.edu
Isidro Cortes-Ciriano, Email: icortes@ebi.ac.uk.
Tyler Jacks, Email: tjacks@mit.edu.
Extended data
is available for this paper at 10.1038/s41588-023-01499-4.
Supplementary information
The online version contains supplementary material available at 10.1038/s41588-023-01499-4.
References
- 1.Larkin J, et al. Five-year survival with combined nivolumab and ipilimumab in advanced melanoma. N. Engl. J. Med. 2019;381:1535–1546. doi: 10.1056/NEJMoa1910836. [DOI] [PubMed] [Google Scholar]
- 2.Le DT, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science. 2017;357:409–413. doi: 10.1126/science.aan6733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Fradet Y, et al. Randomized phase III KEYNOTE-045 trial of pembrolizumab versus paclitaxel, docetaxel, or vinflunine in recurrent advanced urothelial cancer: results of >2 years of follow-up. Ann. Oncol. 2019;30:970–976. doi: 10.1093/annonc/mdz127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Reck M, et al. Pembrolizumab versus chemotherapy for PD-L1–positive non–small-cell lung cancer. N. Engl. J. Med. 2016;375:1823–1833. doi: 10.1056/NEJMoa1606774. [DOI] [PubMed] [Google Scholar]
- 5.Diaz LA, et al. Pembrolizumab versus chemotherapy for microsatellite instability-high or mismatch repair-deficient metastatic colorectal cancer (KEYNOTE-177): final analysis of a randomised, open-label, phase 3 study. Lancet Oncol. 2022;23:659–670. doi: 10.1016/S1470-2045(22)00197-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Matsushita H, et al. Cancer exome analysis reveals a T-cell-dependent mechanism of cancer immunoediting. Nature. 2012;482:400–404. doi: 10.1038/nature10755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Blass, E. & Ott, P. A. Advances in the development of personalized neoantigen-based therapeutic cancer vaccines. Nat. Rev. Clin. Oncol.18, 215–229 (2021). [DOI] [PMC free article] [PubMed]
- 8.Samstein RM, et al. Tumor mutational load predicts survival after immunotherapy across multiple cancer types. Nat. Genet. 2019;51:202–206. doi: 10.1038/s41588-018-0312-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Marabelle A, et al. Association of tumour mutational burden with outcomes in patients with advanced solid tumours treated with pembrolizumab: prospective biomarker analysis of the multicohort, open-label, phase 2 KEYNOTE-158 study. Lancet Oncol. 2020;21:1353–1365. doi: 10.1016/S1470-2045(20)30445-9. [DOI] [PubMed] [Google Scholar]
- 10.Hellmann MD, et al. Tumor mutational burden and efficacy of nivolumab monotherapy and in combination with ipilimumab in small-cell lung cancer. Cancer Cell. 2018;33:853–861.e4. doi: 10.1016/j.ccell.2018.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.The Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature487, 330–337 (2012). [DOI] [PMC free article] [PubMed]
- 12.Kim T-M, Laird PW, Park PJ. The landscape of microsatellite instability in colorectal and endometrial cancer genomes. Cell. 2013;155:858–868. doi: 10.1016/j.cell.2013.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chalmers ZR, et al. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome Med. 2017;9:34. doi: 10.1186/s13073-017-0424-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cortes-Ciriano I, Lee S, Park W-Y, Kim T-M, Park PJ. A molecular portrait of microsatellite instability across multiple cancers. Nat. Commun. 2017;8:59. doi: 10.1038/ncomms15180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hause RJ, Pritchard CC, Shendure J, Salipante SJ. Classification and characterization of microsatellite instability across 18 cancer types. Nat. Med. 2016;22:1342–1350. doi: 10.1038/nm.4191. [DOI] [PubMed] [Google Scholar]
- 16.Gurjao, C., Tsukrov, D., Imakaev, M., Luquette, L. J. & Mirny, L. A. Limited evidence of tumour mutational burden as a biomarker of response to immunotherapy. Preprint at bioRxiv10.1101/2020.09.03.260265 (2020).
- 17.Rousseau B, et al. The spectrum of benefit from checkpoint blockade in hypermutated tumors. N. Engl. J. Med. 2021;384:1168–1170. doi: 10.1056/NEJMc2031965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.McGranahan, N. & Swanton, C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell168, 613–628 (2017). [DOI] [PubMed]
- 19.Gejman RS, et al. Rejection of immunogenic tumor clones is limited by clonal fraction. eLife. 2018;7:e41090. doi: 10.7554/eLife.41090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Morris LGT, et al. Pan-cancer analysis of intratumor heterogeneity as a prognostic determinant of survival. Oncotarget. 2016;7:10051–10063. doi: 10.18632/oncotarget.7067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lin Z, et al. Intratumor heterogeneity correlates with reduced immune activity and worse survival in melanoma patients. Front. Oncol. 2020;10:596493. doi: 10.3389/fonc.2020.596493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Andor N, et al. Pan-cancer analysis of the extent and consequences of intratumor heterogeneity. Nat. Med. 2016;22:105–113. doi: 10.1038/nm.3984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Miao D, et al. Genomic correlates of response to immune checkpoint blockade in microsatellite-stable solid tumors. Nat. Genet. 2018;50:1271–1281. doi: 10.1038/s41588-018-0200-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.McGranahan N, et al. Clonal neoantigens elicit T cell immunoreactivity and sensitivity to immune checkpoint blockade. Science. 2016;351:1463–1469. doi: 10.1126/science.aaf1490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Litchfield K, et al. Meta-analysis of tumor- and T cell-intrinsic mechanisms of sensitization to checkpoint inhibition. Cell. 2021;184:596–614.e14. doi: 10.1016/j.cell.2021.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wolf Y, et al. UVB-induced tumor heterogeneity diminishes immune response in melanoma. Cell. 2019;179:219–235.e21. doi: 10.1016/j.cell.2019.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Germano G, et al. Inactivation of DNA repair triggers neoantigen generation and impairs tumour growth. Nature. 2017;552:116–120. doi: 10.1038/nature24673. [DOI] [PubMed] [Google Scholar]
- 28.Mandal R, et al. Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science. 2019;364:485–491. doi: 10.1126/science.aau0447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Shankaran V, et al. IFNgamma and lymphocytes prevent primary tumour development and shape tumour immunogenicity. Nature. 2001;410:1107–1111. doi: 10.1038/35074122. [DOI] [PubMed] [Google Scholar]
- 30.DuPage M, Mazumdar C, Schmidt LM, Cheung AF, Jacks T. Expression of tumour-specific antigens underlies cancer immunoediting. Nature. 2012;482:405–409. doi: 10.1038/nature10803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.DuPage M, Dooley AL, Jacks T. Conditional mouse lung cancer models using adenoviral or lentiviral delivery of Cre recombinase. Nat. Protoc. 2009;4:1064–1072. doi: 10.1038/nprot.2009.95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kucherlapati MH, et al. An Msh2 conditional knockout mouse for studying intestinal cancer and testing anticancer agents. Gastroenterology. 2010;138:993–1002.e1. doi: 10.1053/j.gastro.2009.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Roper J, et al. In vivo genome editing and organoid transplantation models of colorectal cancer and metastasis. Nat. Biotechnol. 2017;35:569–576. doi: 10.1038/nbt.3836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Jia P, et al. MSIsensor-pro: fast, accurate, and matched-normal-sample-free detection of microsatellite instability. Genom. Proteom. Bioinform. 2020;18:65–71. doi: 10.1016/j.gpb.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kwon M, et al. Determinants of response and intrinsic resistance to pd-1 blockade in microsatellite instability–high gastric cancer. Cancer Discov. 2021;11:2168–2185. doi: 10.1158/2159-8290.CD-21-0219. [DOI] [PubMed] [Google Scholar]
- 36.Alexandrov LB, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Tate JG, et al. COSMIC: the catalogue of somatic mutations In cancer. Nucleic Acids Res. 2019;47:D941–D947. doi: 10.1093/nar/gky1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Touat M, et al. Mechanisms and therapeutic implications of hypermutation in gliomas. Nature. 2020;580:517–523. doi: 10.1038/s41586-020-2209-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Leach DR, Krummel MF, Allison JP. Enhancement of antitumor immunity by CTLA-4 blockade. Science. 1996;271:1734–1736. doi: 10.1126/science.271.5256.1734. [DOI] [PubMed] [Google Scholar]
- 40.Iwai Y, et al. Involvement of PD-L1 on tumor cells in the escape from host immune system and tumor immunotherapy by PD-L1 blockade. Proc. Natl Acad. Sci. USA. 2002;99:12293–12297. doi: 10.1073/pnas.192461099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.DuPage M, et al. Endogenous T cell responses to antigens expressed in lung adenocarcinomas delay malignant tumor progression. Cancer Cell. 2011;19:72–85. doi: 10.1016/j.ccr.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Pfirschke C, et al. Immunogenic chemotherapy sensitizes tumors to checkpoint blockade therapy. Immunity. 2016;44:343–354. doi: 10.1016/j.immuni.2015.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Zapata L, et al. Signatures of positive selection reveal a universal role of chromatin modifiers as cancer driver genes. Sci. Rep. 2017;7:13124. doi: 10.1038/s41598-017-12888-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Williams MJ, Werner B, Barnes CP, Graham TA, Sottoriva A. Identification of neutral tumor evolution across cancer types. Nat. Genet. 2016;48:238–244. doi: 10.1038/ng.3489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jaeger AM, et al. Deciphering the immunopeptidome in vivo reveals new tumour antigens. Nature. 2022;607:149–155. doi: 10.1038/s41586-022-04839-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hogquist KA, et al. T cell receptor antagonist peptides induce positive selection. Cell. 1994;76:17–27. doi: 10.1016/0092-8674(94)90169-4. [DOI] [PubMed] [Google Scholar]
- 47.Ghosh, M. et al. Guidance document: Validation of a high-performance liquid chromatography-tandem mass spectrometry immunopeptidomics assay for the identification of HLA class I ligands suitable for pharmaceutical therapies. Mol. Cell. Proteom.19, 432–443 (2020). [DOI] [PMC free article] [PubMed]
- 48.Lin W-HW, et al. CD8+ T lymphocyte self-renewal during effector cell determination. Cell Rep. 2016;17:1773–1782. doi: 10.1016/j.celrep.2016.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Westcott PMK, et al. Low neoantigen expression and poor T-cell priming underlie early immune escape in colorectal cancer. Nat. Cancer. 2021;2:1071–1085. doi: 10.1038/s43018-021-00247-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Bortolomeazzi M, et al. Immunogenomics of colorectal cancer response to checkpoint blockade: analysis of the KEYNOTE 177 trial and validation cohorts. Gastroenterology. 2021;161:1179–1193. doi: 10.1053/j.gastro.2021.06.064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gebert J, et al. Recurrent frameshift neoantigen vaccine elicits protective immunity with reduced tumor burden and improved overall survival in a lynch syndrome mouse model. Gastroenterology. 2021 doi: 10.1053/J.GASTRO.2021.06.073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Li H-D, et al. A PoleP286R mouse model of endometrial cancer recapitulates high mutational burden and immunotherapy response. JCI Insight. 2020;5:e138829. doi: 10.1172/jci.insight.138829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Lee BCH, et al. Mutational landscape of normal epithelial cells in Lynch syndrome patients. Nat. Commun. 2022;13:2710. doi: 10.1038/s41467-022-29920-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Sottoriva A, et al. A big bang model of human colorectal tumor growth. Nat. Genet. 2015;47:209–216. doi: 10.1038/ng.3214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Amodio V, et al. Genetic and pharmacological modulation of DNA mismatch repair heterogeneous tumors promotes immune surveillance. Cancer Cell. 2023;41:196–209.e5. doi: 10.1016/j.ccell.2022.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Burger ML, et al. Antigen dominance hierarchies shape TCF1+ progenitor CD8 T cell phenotypes in tumors. Cell. 2021;184:4996–5014.e26. doi: 10.1016/j.cell.2021.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Westcott PMK, et al. The mutational landscapes of genetic and chemical models of Kras-driven lung cancer. Nature. 2015;517:489–492. doi: 10.1038/nature13898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Helleday T. Making immunotherapy ‘cold’ tumours ‘hot’ by chemotherapy-induced mutations—a misconception. Ann. Oncol. 2019;30:360–361. doi: 10.1093/annonc/mdz013. [DOI] [PubMed] [Google Scholar]
- 59.Johnson L, et al. Somatic activation of the K-ras oncogene causes early onset lung cancer in mice. Nature. 2001;410:1111–1116. doi: 10.1038/35074129. [DOI] [PubMed] [Google Scholar]
- 60.Marino S, Vooijs M, van Der Gulden H, Jonkers J, Berns A. Induction of medulloblastomas in p53-null mutant mice by somatic inactivation of Rb in the external granular layer cells of the cerebellum. Genes Dev. 2000;14:994–1004. [PMC free article] [PubMed] [Google Scholar]
- 61.Li L, et al. Identification of DHODH as a therapeutic target in small cell lung cancer. Sci. Transl. Med. 2019;11:7852. doi: 10.1126/scitranslmed.aaw7852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Chu VT, et al. Efficient generation of Rosa26 knock-in mice using CRISPR/Cas9 in C57BL/6 zygotes. BMC Biotechnol. 2016;16:4. doi: 10.1186/s12896-016-0234-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Roper J, et al. Colonoscopy-based colorectal cancer modeling in mice with CRISPR-Cas9 genome editing and organoid transplantation. Nat. Protoc. 2018;13:217–234. doi: 10.1038/nprot.2017.136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Sánchez-Rivera FJ, et al. Rapid modelling of cooperating genetic events in cancer through somatic genome editing. Nature. 2014;516:428–431. doi: 10.1038/nature13906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at 10.48550/arXiv.1303.3997 (2013).
- 66.Van der Auwera, G. A. & O’Connor, B. D. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra (O’Reilly Media, Inc., 2020).
- 67.Fan Y, et al. MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data. Genome Biol. 2016;17:1–11. doi: 10.1186/s13059-016-1029-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Lai Z, et al. VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research. Nucleic Acids Res. 2016;44:e108–e108. doi: 10.1093/nar/gkw227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kim S, et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods. 2018;15:591–594. doi: 10.1038/s41592-018-0051-x. [DOI] [PubMed] [Google Scholar]
- 70.Wang M, et al. SomaticCombiner: improving the performance of somatic variant calling based on evaluation tests and a consensus approach. Sci. Rep. 2020;10:12898. doi: 10.1038/s41598-020-69772-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Kofler R, Schlötterer C, Lelley T. SciRoKo: a new tool for whole genome microsatellite search and investigation. Bioinformatics. 2007;23:1683–1685. doi: 10.1093/bioinformatics/btm157. [DOI] [PubMed] [Google Scholar]
- 72.Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. Preprint at 10.48550/arXiv.1207.3907 (2012).
- 73.Riester M, et al. PureCN: copy number calling and SNV classification using targeted short read sequencing. Source Code Biol. Med. 2016;11:13. doi: 10.1186/s13029-016-0060-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Blokzijl F, Janssen R, van Boxtel R, Cuppen E. MutationalPatterns: comprehensive genome-wide analysis of mutational processes. Genome Med. 2018;10:28. doi: 10.1186/s13073-018-0539-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Alexandrov LB, et al. The repertoire of mutational signatures in human cancer. Nature. 2020;578:94–101. doi: 10.1038/s41586-020-1943-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.McLaren W, et al. The Ensembl variant effect predictor. Genome Biol. 2016;17:122. doi: 10.1186/s13059-016-0974-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Hundal J, et al. PVACtools: A computational toolkit to identify and visualize cancer neoantigens. Cancer Immunol. Res. 2020;8:409–420. doi: 10.1158/2326-6066.CIR-19-0401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Jurtz V, et al. NetMHCpan-4.0: improved peptide–MHC class I interaction predictions integrating eluted ligand and peptide binding affinity data. J. Immunol. 2017;199:3360–3368. doi: 10.4049/jimmunol.1700893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Andreatta M, Nielsen M. Gapped sequence alignment using artificial neural networks: spplication to the MHC class I system. Bioinformatics. 2016;32:511–517. doi: 10.1093/bioinformatics/btv639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Peters B, Sette A. Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method. BMC Bioinform. 2005;6:132. doi: 10.1186/1471-2105-6-132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Kim Y, Sidney J, Pinilla C, Sette A, Peters B. Derivation of an amino acid similarity matrix for peptide:MHC binding and its application as a Bayesian prior. BMC Bioinform. 2009;10:394. doi: 10.1186/1471-2105-10-394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Picard Tools—By Broad Institute. GitHubhttp://broadinstitute.github.io/picard.
- 85.Bankhead P, et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 2017;7:16878. doi: 10.1038/s41598-017-17204-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Tammela T, et al. A Wnt-producing niche drives proliferative potential and progression in lung adenocarcinoma. Nature. 2017;545:355–359. doi: 10.1038/nature22334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Ran FA, et al. Genome engineering using the CRISPR–Cas9 system. Nat. Protoc. 2013;8:2281–2308. doi: 10.1038/nprot.2013.143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Bindels DS, et al. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nat. Methods. 2016;14:53–56. doi: 10.1038/nmeth.4074. [DOI] [PubMed] [Google Scholar]
- 89.Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Therneau, T. M. & Grambsch, P. M. Modeling Survival Data: Extending the Cox Model (Springer, 2000).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw exome sequencing and RNA-seq data from Bortolomeazzi et al.50 are available through controlled-access application via the European Genome-Phenome Archive (hosted by the EMBL-EBI and the CRG) under accession no. EGAD00001006165. Raw sequencing data from Kwon et al.35 were downloaded from the European Nucleotide Archive (ENA) database under primary accession no. PRJEB40416. The sequencing data generated in the present study are available at ENA under primary accession no. PRJEB56609. Raw MS data generated in the present study are available at MassIVE under accession no. MSV000092096. Source data are provided with this paper.
Customized scripts used in the analysis of sequencing data in the present study are available at: https://github.com/cortes-ciriano-lab/MMRd_immunogenicity.