Abstract
PD-1 blockade unleashes CD8 T cells1, including those specific for mutation-associated neoantigens (MANA), but factors in the tumour microenvironment can inhibit these T cell responses. Single-cell transcriptomics have revealed global T cell dysfunction programs in tumour-infiltrating lymphocytes (TIL). However, the majority of TIL do not recognize tumour antigens2, and little is known about transcriptional programs of MANA-specific TIL. Here, we identify MANA-specific T cell clones using the MANA functional expansion of specific T cells assay3 in neoadjuvant anti-PD-1-treated non-small cell lung cancers (NSCLC). We use their T cell receptors as a ‘barcode’ to track and analyse their transcriptional programs in the tumour microenvironment using coupled single-cell RNA sequencing and T cell receptor sequencing. We find both MANA- and virus-specific clones in TIL, regardless of response, and MANA-, influenza- and Epstein–Barr virus-specific TIL each have unique transcriptional programs. Despite exposure to cognate antigen, MANA-specific TIL express an incompletely activated cytolytic program. MANA-specific CD8 T cells have hallmark transcriptional programs of tissue-resident memory (TRM) cells, but low levels of interleukin-7 receptor (IL-7R) and are functionally less responsive to interleukin-7 (IL-7) compared with influenza-specific TRM cells. Compared with those from responding tumours, MANA-specific clones from non-responding tumours express T cell receptors with markedly lower ligand-dependent signalling, are largely confined to HOBIThigh TRM subsets, and coordinately upregulate checkpoints, killer inhibitory receptors and inhibitors of T cell activation. These findings provide important insights for overcoming resistance to PD-1 blockade.
Subject terms: Cellular immunity, Immunotherapy, CD8-positive T cells, T-cell receptor, Tumour immunology
Single-cell RNA sequencing and T cell receptor sequencing are combined to identify transcriptional programs specific to mutation-associated neoantigen-specific T cells in non-small cell lung cancers treated with anti-PD-1, providing insights into resistance to PD-1 blockade.
Main
The efficacy of PD-1- and PD-L1-blocking agents is predicated upon CD8 T cell-mediated anti-tumour immunity1. Early studies focused on tumour-associated antigens, whereas recent work has shifted attention to T cell recognition of mutation-associated neoantigens (MANA), owing to the large numbers of somatic mutations acquired by many cancers during their development4. The association of improved anti-PD-1 and anti-PD-L1 clinical responses with high tumour mutational burden5 strongly suggests that MANA are important targets of anti-tumour immunity induced by PD-1 blockade.
Despite the success of immune checkpoint blockade (ICB) in improving clinical outcomes, most cancers still do not respond6. Improving response rates to ICB will require an understanding of the functional state of tumour-specific T cells, particularly in the tumour microenvironment. However, a fundamental limitation in the current understanding of the T cell functional programs that underpin the response to ICB has been the absence of transcriptional profiling of true MANA-specific TIL. A related problem is the paucity of information regarding the differences between MANA-specific TIL in ICB-responsive versus resistant tumours. Indeed, MANA-specific T cells represent a small fraction of total TIL2,7, particularly in lung cancer, in which they have been shown to selectively upregulate CD39. This highlights the challenges confronting characterization of the cells responsible for the activity of T cell-targeting immunotherapies.
Global gene expression of NSCLC TIL
For this study, we used peripheral blood and tissue biospecimens obtained from the first-in-human clinical trial of neoadjuvant anti-PD-1 (nivolumab) in resectable non-small cell lung cancer8 (NSCLC; ClinicalTrials.gov identifier: NCT02259621; Fig. 1a, top) to study the transcriptional programs of MANA-specific TIL. Nine out of 20 patients with NSCLC (45%) treated in this trial had a major pathologic response (MPR) at the time of resection, defined as no more than 10% viable tumour at the time of surgery; previous studies have established an association between MPR and improved overall survival9–12. A schematic of the study design and experimental approach is shown in Fig. 1a, bottom. Combined single-cell RNA sequencing (scRNA-seq) and T cell receptor sequencing (TCR-seq) was performed on TIL (n = 15), paired adjacent normal lung (n = 12), tumour-draining lymph nodes (TDLN, n = 3) and a distant metastasis (Extended Data Fig. 1a, Supplementary Tables 1–3). In total, 560,916 T cells passed quality control (Fig. 1b, Supplementary Table 3) and were carried forward for analyses.
Uniform manifold approximation and projection (UMAP) analysis of cells from all samples on the basis of filtered and normalized transcript counts defined 15 T cell clusters (Fig. 1b, c, Extended Data Fig. 1b–e, Supplementary Data 1.1). Expression of subset-defining markers and T cell checkpoints was visualized in red scale on the UMAP (Fig. 1d). The two clusters designated as TRM had the highest expression of the canonical TRM genes, ZNF683 (also known as HOBIT) and ITGAE (also known as CD103), and the highest expression of a TRM gene set13 (Extended Data Fig. 1f, Supplementary Data 2.1). Principal component analysis (PCA) of samples based on concatenated cell-cluster-level pseudobulk profiles distinguished adjacent normal-lung T cells from TIL (Fig. 1e), but did not distinguish MPR from non-MPR TIL (Fig. 1f). We did not observe notable differentially expressed gene programs between MPR and non-MPR TIL (Supplementary Data 3), indicating that gene expression profiling of total TIL has limited sensitivity in distinguishing the pathologic response to PD-1 blockade.
Expression programs of MANA-specific TIL
We next performed the MANA functional expansion of specific T cells assay (MANAFEST)3 on 9 of the 16 individuals on whom scRNA-seq–TCR-seq was conducted. This assay detects in vivo antigen-experienced T cell responses and identifies the clonal identity of the T cell receptor (TCR) corresponding to these cells. Of these nine, four were classed as MPR and five were non-MPR (results from one individual have been previously described8). Putative MANA (Supplementary Tables 4–6), peptide pools representing influenza matrix and nucleoproteins, and a pool of major histocompatibility complex (MHC) class I-restricted cytomegalovirus (CMV), Epstein–Barr virus (EBV) and influenza virus epitopes (CEF) were queried for CD8+ T cell reactivity in parallel (Supplementary Tables 6, 7). From 7 (3 with MPR and 4 without MPR) of the 9 individuals, 72 total unique MANA-specific TCRs, 33 unique CEF-specific TCRs, and 52 unique influenza-specific TCRs were identified (Extended Data Fig. 2, Supplementary Tables 8, 9, Supplementary Data 4, 5). Out of 33 CEF-specific TCRs, 6 matched known public EBV-specific TCRs and 3 matched known public influenza-specific TCRs14. No CMV-reactive TCRs were mapped from our CEF-specific TCRs. Notably, 4 of the 41 MANA-specific TCRVβ complementarity-determining region 3 (CDR3) clonotypes identified in a patient without MPR (patient ID MD01-004) (Extended Data Fig. 2) were specific for a MANA (MD01-004-MANA12) derived from a p53 R248L hotspot mutation, and were found at appreciable frequency in the pre- and post-treatment tumour (Extended Data Fig. 3), despite the tumour not attaining MPR. Most MANA-specific clones were detected at very low frequency (median: 0.001%) in the peripheral blood across all available time points (Fig. 2a, Extended Data Fig. 3). Overall, pathologic response was not associated with the prevalence, frequency or intratumoral representation of MANA-specific T cells (Extended Data Figs. 2, 3 Supplementary Table 9). In fact, more MANA-specific TIL were observed among non-MPR TIL than among MPR TIL. No consistent pattern was observed for the frequency of viral-specific T cells in the tissue or peripheral blood (Extended Data Figs. 2, 3).
Ten MANA-specific clonotypes, for which the TCRα could be confidently identified from the single-cell analysis, were selected for validation of MANA recognition via TCR cloning and introduction into a Jurkat–NFAT luciferase reporter system15. Seventy per cent of tested clonotypes (representing 95.2% of total cells bearing TCRs identified by MANAFEST) were validated as MANA-specific (Extended Data Fig. 4a–c). Peptide–human leukocyte antigen (HLA) binding assays demonstrated that two MANA peptides—MD01-005-MANA7 and MD01-004-MANA12—displayed comparably high MHC class I affinity (measured dissociation constants (Kd) = 5.1 nM and 17.5 nM, respectively) and stability (Extended Data Fig. 4d, e).
We next evaluated the transcriptional programming of MANA- and viral-specific CD8+ T cells. Refined clustering of all CD8+ T cells (n = 235,851) identified 15 unique clusters (Fig. 2b, Extended Data Fig. 5a, Supplementary Data 1.2). Clusters were named on the basis of previously defined T cell states from single-cell transcriptomic studies16. Six clusters had gene expression programs consistent with TRM T cells, characterized by high expression of HOBIT, LINC02446, CD103 and a previously published TRM gene set (Extended Data Fig. 5b). Selective genes and linkage to the global CD3 T cell clusters shown in Fig. 1 were visualized (Extended Data Figs. 5c, d). The six TRM subsets were heterogenous in their expression of an exhaustion gene set described previously in NSCLC17 (Extended Data Fig. 5e, Supplementary Data 2.2). None of the most frequent tumour-infiltrating clonotypes were restricted to a single cluster (Extended Data Fig. 5f). Among all tested individuals, a total of 28 MANA-specific CD8 clonotypes (1,350 total cells from 3 patients with MPR and 3 patients without MPR) as identified by MANAFEST were detected in the single-cell data, of which 20 clonotypes (890 cells) were in the tumour (Fig. 2c, Supplementary Table 8). Of the viral-specific T cell clonotypes, 23 influenza-specific (866 cells) and 2 EBV-specific (281 cells) clones were found in the CD8 single-cell analysis.
Overlay of these clonotypes onto the CD8+ T cell UMAP demonstrated a marked distinction between the clonotypes with different antigen specificities (Fig. 2c, Extended Data Fig. 6a–c). EBV-reactive T cells primarily resided in effector T (Teff) cell clusters, whereas influenza- and MANA-specific T cells largely occupied distinct TRM clusters. Notably, because influenza is a respiratory virus, influenza-specific T cells may be considered the archetypal lung-resident memory T cells18. None of the patients in our study were symptomatic for influenza in the six weeks preceding surgery. It is therefore not surprising that influenza-specific CD8 cells were TRM rather than Teff cells. By contrast, EBV-specific T cells exclusively occupied Teff clusters, consistent with periodic acute stimulation upon latent EBV reactivation. Whereas influenza-specific cells were the most abundant in normal lung, MANA-specific CD8 cells were more numerous in the tumour (Extended Data Fig. 6d, e).
There was considerable shared expression of selective cytotoxic T lymphocyte (CTL) activation genes between MANA- and EBV-specific T cells, in particular genes encoding T cell activation and CTL activity, such as HLA-DRA, GZMH, IFNG and NKG7 (Fig. 2d, Supplementary Data 1.3). However, genes encoding certain canonical cytolytic molecules, such as GZMK, were expressed at low levels in MANA-specific TIL. Most notably, EOMES, which encodes a transcription factor that is critical for CTL activity19, was expressed in EBV-specific CD8 cells but was minimally expressed in most MANA-specific cells. Multiple checkpoints were significantly upregulated in MANA-specific TIL compared with EBV-specific TIL. Notably, MANA-specific cells expressed higher levels of PRDM1, which encodes BLIMP-1 and has been reported to participate in coordinated transcriptional activation of multiple checkpoint genes, including PD-1 (also known as PDCD1), LAG3, TIGIT and HAVCR220. TOX, which encodes a chromatin modifier important for exhaustion programs of chronic virus-specific and tumour-specific T cells in mouse models21,22, was only marginally increased in MANA-specific cells, whereas its homologue, TOX2, which has also been reported to drive T cell exhaustion23, showed much higher upregulation in MANA-specific versus EBV-specific CD8 TIL. HOBIT, which is selectively upregulated in TRM T cells24, was also upregulated in MANA-specific TIL, even relative to influenza-specific TRM (Fig. 2e). Indeed, MANA-specific T cells demonstrated the highest immune checkpoint and exhaustion signatures17 (Extended Data Fig. 6f). These findings demonstrate that MANA-specific CD8 T cells in the tumour have an unconventional hybrid transcriptional program characterized by incomplete activation of effector programs and significant upregulation of checkpoint molecules such as PD-1, CTLA-4, TIM3, TIGIT and CD39. Genes encoding each of these checkpoint molecules were more highly expressed among MANA-specific CD8 cells than either influenza- or EBV-specific CD8 cells, with CD39 being the most highly differentially expressed (Fig. 2d, e), congruent with previous flow cytometry findings on MANA-specific lung cancer TIL2.
Influenza-specific TRM were distinguished from MANA-specific TRM by low levels of both activation and effector CTL programs and had lower expression of multiple checkpoint molecules, but had the highest levels of genes associated with T memory stem cells, such as TCF7 and IL-7R (Fig. 2e, f). Indeed, IL-7R expression was 4.6-fold higher on influenza-specific TIL relative to MANA-specific TIL. In TIL obtained from patient without MPR MD01-004, culture with titrating concentrations of IL-7 in vitro induced much higher levels of IL-7R-regulated genes (Supplementary Data 2.3) in influenza-specific TIL than in MANA-specific TIL (Fig. 2g, Extended Data Fig. 7). Nonetheless, supraphysiological levels of IL-7 induced appreciable upregulation of IL-7R-induced genes in MANA-specific TIL. Given the distinct transcriptional programs of the identified MANA-specific CD8 cells, we hypothesized that other CD8 T cells in the same TRM cluster showing differential expression relative to influenza-specific T cells (Fig. 2g) may also recognize MANA that were not detected by the MANAFEST assay. We cloned seven TCRs corresponding to CD8+ T cells with highly differential gene expression relative to influenza-specific T cells. We screened each TCR with a library of candidate MANA (Supplementary Table 6) and confirmed MANA recognition in three of these TCRs, one TCR each from patients MD01-004, MD01-005 and MD043-011 (Extended Data Fig. 8a–d).
To next investigate the ligand-dependent TCR signalling capacity of antigen-specific T cells, we performed a dose–response curve with cognate peptides matched to the ten total Jurkat-validated MANA-specific TCRα–TCRβ pairs (Supplementary Table 10). Peptide dose–response curves of MPR-derived TCRs were comparable to those of EBV- and influenza-specific TCRs, suggesting that these TCRs were capable of strong ligand-dependent signalling (sometimes referred to as functional avidity). However, the peptide dose–response curves of TCRs derived from patients without MPR were markedly lower (approximately 2 log10 leftward shift in peptide dose–response curve) (Fig. 2h, Extended Data Fig. 8e). Together, our data show that despite similar measured MANA–HLA binding affinities (Extended Data Fig. 4c, d), TCR from expandable MANA-specific clones from the patient with MPR had significantly higher functional avidity than MANA-specific clones from patients without MPR.
MANA-specific TIL programs correlate with MPR
To explore determinants of ICB sensitivity, we examined differences in gene expression patterns between MPR and non-MPR MANA-specific TIL. The neoadjuvant clinical trial format enabled us to make this distinction through pathological analysis of surgically resected tissue. In total, we compared 45 MPR TIL transcriptomes (39 from MD01-005, 2 from MD043-003 and 4 from NY016-025) with 885 non-MPR TIL transcriptomes (782 from MD043-011, 62 from MD01-004 and 22 from NY016-014; Extended Data Fig. 9, Supplementary Table 8). We observed highly significant differences between pathologic MPR and non-MPR tumours (Fig. 3a, Supplementary Data 1.4). Significantly higher levels of genes associated with T cell dysfunction such as TOX2, CTLA4, HAVCR2 and ENTPD1 were observed for non-MPR MANA-specific T cells, whereas MPR MANA-specific T cells had higher expression of genes associated with memory (IL7R and TCF7) and effector function (GZMK) (Fig. 3a–c). Both the checkpoint score and exhaustion score were higher in MANA-specific TIL from patients without MPR (Fig. 3d, Extended Data Fig. 10a, b). Of note, CXCL13 is one of the genes most highly correlated with checkpoint-associated genes in non-MPR MANA-specific TIL, and was also found to be highly expressed in MANA-specific cells relative to virus-specific cells among CD8 TIL (Fig. 2d–f).
A number of genes encoding T cell inhibitory molecules were more highly correlated with a composite immune checkpoint score of MANA-specific TIL from patients without MPR than those from patients with MPR (Fig. 3e, Extended Data Fig. 10c). In two patients without MPR (MD01-004 and MD043-011) and one patient with MPR (MD01-005), we also detected MANA-specific cells upon single-cell profiling of CD8 T cells from TDLN (Extended Data Fig. 10d, e). Tracking the MANA-specific CD8 clonotypes from the primary tumour, we detected those clones among TIL from a brain metastasis resected from patient MD043-011 24 months after primary tumour resection (Extended Data Fig. 10f). Relative to the primary tumour, even-higher levels of three checkpoints—LAG3, TIGIT and HAVCR2—were expressed on MANA-specific TIL in the metastasis (Extended Data Fig. 10g, Supplementary Data 1.5).
Going back to overall TIL transcriptomic patterns, we hypothesized that MANA-specific T cells and/or a MANA-specific T cell-like signature might correlate with response to ICB, even though total TIL single-cell transcriptomic patterns did not (Fig. 1e). Among CD8 TIL from six MPR tumours and nine non-MPR tumours, the greatest correlation with pathologic response status was observed by combining four TIL clusters most highly enriched in MANA-specific cells, whereas the expression profile of total CD8 TIL did not distinguish MPR from non-MPR (Extended Data Fig. 11). These data suggest that additional T cells with this profile may contribute to the anti-tumour response.
Systemic reprogramming of MANA-specific T cells
We next performed scRNA-seq–TCR-seq of serial peripheral blood T cells from patient with MPR MD01-005 after enriching for expression of the TCR-Vβ genes corresponding to this patient’s MANA-specific TCRs using fluorescence-activated cell sorting (FACS) (Fig. 4a–c, Extended Data Fig. 12a). Nine out of ten MANA-specific clones mapped to a TRM-like cluster (Tmem(3); Tmem, memory T cell), with some transcriptional features of TRM, such as expression of HOBIT) two weeks after the initiation of anti-PD-1 treatment (Fig. 4d). By four weeks (time of tumour resection), a significant diversification of phenotype was observed (P ≤ 0.021; Methods). Half of the MANA-specific cells were in Teff clusters (Fig. 4e). By 11 weeks (7 weeks after tumour resection), the MANA-specific cells were below the limit of detection in the blood, consistent with known TRM patterns in the peripheral blood25. Using RNA velocity, we observed a clear bidirectional flow of TRM-like memory MANA-specific T cells in the Tmem(3) cluster towards either an activated effector (Teff(3)) or a Tmem(2) transcriptional profile (Fig. 4f). Genes associated with Teff cell function and activation, T cell homing and migration, and tissue retention were upregulated along the pseudotime from Tmem(3) to Teff(3), whereas there was a decrease in genes associated with resting memory T cells (Fig. 4g, h). Gene Ontology (GO) analysis revealed significant enrichment of an IFNγ-mediated signalling pathway along the differentiation trajectory (Extended Data Fig. 12b–f). Although all these tissue compartments were only available for one MPR, these findings are consistent with our hypothesis that, upon activation, functional effector MANA-specific T cells enter the blood and traffic into tissues, including normal lung, in search of micro-metastatic tumour26, and are compatible with a previous study showing that TRM cell plasticity can influence systemic memory T cell responses27.
Discussion
Here we describe the transcriptional programming of MANA-specific TIL after ICB in lung cancer, and further, differential gene programs between patients whose tumours show MPR versus those that do not. Using the MANAFEST platform, MANA-specific CD8 T cells in peripheral blood were detected in the majority of patients who were treated with anti-PD-1; these were also found among TIL in roughly a third of these individuals. Detection of these T cells was independent of tumour response, suggesting that factors in the tumour microenvironment affecting T cell function probably contribute to anti-tumour responsiveness. Indeed, the most frequent MANA-specific clonotype, representing 782 TIL, was observed in a patient with no MPR. This tumour had dual KRAS and STK11 oncogenic mutations, which are known to be highly associated with non-response to PD-1 blockade28. Consistent with an earlier study2, CD39 expression was a key difference between MANA-specific and viral-specific T cells. Among MANA-specific CD8 TIL, roughly 90% were TRM with high expression of HOBIT that also displayed a partial but incompletely activated Teff program, along with upregulation of several targetable checkpoints in non-MPR tumours. MANA-specific T cells also express far less IL-7R relative to influenza TRM, translating functionally into poor IL-7 responsiveness. These features may all contribute to their limited tumour-specific responsiveness in contrast to anti-viral responses. Future studies are warranted to assess the diminished functional capacity of MANA-specific T cells that was suggested by the transcriptomic profiles observed in our study.
One hypothesis for the lack of ICB response in some patients is that tumour-specific T cells exhibit low activity owing to poor avidity or affinity of their TCR for its cognate peptide MHC. Our finding comparing the ligand-induced TCR signalling of three MANA-specific TCRs from MPR TIL with seven from patients without MPR supports this notion, although additional studies of this type are necessary to definitively test the hypothesis. An overall limitation of these studies is the modest number of MANA-specific cells among TIL that we were able to detect, representing three responders and three non-responders. Indeed, identification of MANA-specific cells is experimentally challenging, and only a few studies have successfully identified these cells in NSCLC2,3,8,29,30, yet none of these profiled the transcriptome of MANA-specific T cells at single-cell resolution. Among the 930 MANA-specific transcriptomes that we identified in TIL, there was high consistency among cells from each response group in highly differential expression of key genes known to regulate T cell function. These findings inform on potential ICB combination therapies to overcome anti-PD-1 resistance that occurs even in the presence of potent MANA-specific T cells. For example, our data demonstrated reduced activation of transcriptional programs downstream of IL-7 ligation in MANA-specific TIL relative to influenza-specific TIL, but the MANA-specific TIL retain their ability to respond to supraphysiological levels of IL-7. Because IL-7 signalling is a requisite for maintenance of T cell homeostasis and long-lived memory, it is conceivable that targeting the IL-7 pathway could enhance ICB response. Our findings thus provide a platform for follow-up studies to more rigorously test the generalizability of our conclusions in the setting of resectable and metastatic NSCLC.
Methods
No statistical methods were used to predetermine sample size. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment.
Patients and biospecimens
This study was approved by the Institutional Review Boards (IRB) at Johns Hopkins University (JHU) and Memorial Sloan Kettering Cancer Center and was conducted in accordance with the Declaration of Helsinki and the International Conference on Harmonization Good Clinical Practice guidelines. The patients described in this study provided written informed consent. All biospecimens were obtained from patients with stage I-IIIA NSCLC who were enrolled to a phase II clinical trial evaluating the safety and feasibility of administering two doses of anti-PD-1 (nivolumab) before surgical resection. Pathological response assessments of primary tumours were reported previously8,31. Tumours with no more than 10% residual viable tumour cells were considered to have a MPR.
scRNA-seq–TCR-seq
Cryobanked T cells were thawed and washed twice with pre-warmed RPMI with 20% FBS and gentamicin. Cells were resuspended in PBS and stained with a viability marker (LIVE/DEAD Fixable Near-IR; ThermoFisher) for 15 min at room temperature in the dark. Cells were the incubated with Fc block for 15 min on ice and stained with antibody against CD3 (BV421, clone SK7) for 30 min on ice. After staining, highly viable CD3+ T cells were sorted into 0.04% BSA in PBS using a BD FACSAria II Cell Sorter. Sorted cells were manually counted using a hemocytometer and prepared at the desired cell concentration (1,000 cells per μl), when possible. The Single Cell 5′ V(D)J and 5′ DGE kits (10X Genomics) were used to capture immune repertoire information and gene expression from the same cell in an emulsion-based protocol at the single-cell level. Cells and barcoded gel beads were partitioned into nanolitre-scale droplets using the 10X Genomics Chromium platform to partition up to 10,000 cells per sample followed by RNA capture and cell-barcoded cDNA synthesis using the manufacturer’s standard protocols. Libraries were generated and sequenced on an Illumina NovaSeq instrument using 2 × 150-bp paired end sequencing. 5′ VDJ libraries were sequenced to a depth of ~5,000 reads per cell, for a total of 5 million to 25 million reads. The 5′ DGE libraries were sequenced to a target depth of ~50,000 reads per cell.
Whole-exome sequencing, mutation calling and neoantigen prediction
Genomic data for most individuals in our study were reported previously8, and whole-exome sequencing, variant calling and neoantigen predictions for individuals MD043-003 and NY016-025 were performed prospectively for the present study. Whole-exome sequencing was performed on pre-treatment tumours unless otherwise noted (Supplementary Table 4) and matched normal samples. DNA was extracted from tumours and matched peripheral blood using the Qiagen DNA kit (Qiagen). Fragmented genomic DNA from tumour and normal samples was used for Illumina TruSeq library construction (Illumina) and exonic regions were captured in solution using the Agilent SureSelect v.4 kit (Agilent,) according to the manufacturers’ instructions as previously described32. Paired-end sequencing, resulting in 100 bases from each end of the fragments for the exome libraries was performed using Illumina HiSeq 2000/2500 instrumentation (Illumina). The depth of total and distinct coverage is shown in Supplementary Table 4. Somatic mutations, consisting of point mutations, insertions, and deletions across the whole exome were identified using the VariantDx custom software for identifying mutations in matched tumour and normal samples as previously described32,33. Somatic mutations, consisting of nonsynonymous single base substitutions, insertions and deletions, were evaluated for putative MHC class I neoantigens using the ImmunoSelect-R pipeline (Personal Genome Diagnostics) as previously described30. Somatic sequence alterations are listed in Supplementary Table 5.
Identification of neoantigen-specific TCRVβ CDR3 clonotypes
We used the MANAFEST assay3 to evaluate T cell responsiveness to MANA and viral antigens. In brief, pools of MHC class I-restricted CMV, EBV and influenza peptide epitopes (CEFX, jpt Peptide Technologies), pools representing the matrix protein and nucleoprotein from H1N1 and H3N2 (jpt Peptide Technologies), and putative neoantigenic peptides defined by the ImmunoSelect-R pipeline (jpt Peptide Technologies; Supplementary Table 6) were each used to stimulate 250,000 T cells in vitro for 10 days as previously described3. The time point of peripheral blood collection used for each MANAFEST assay is described in Supplementary Tables 2, 7. In brief, on day 0, T cells were isolated from PBMC by negative selection (EasySep; STEMCELL Technologies). The T cell-negative fraction was co-cultured with an equal number of selected T cells in culture medium (IMDM/5% human serum with 50 μg ml−1 gentamicin) with 1 μg ml-1 relevant neoantigenic peptide, 1 μg ml−1 of an MHC class I-restricted CMV, EBV, and influenza peptide epitope pool (CEFX, jpt Peptide Technologies), 1 μg ml−1 of pools representing the matrix protein and nucleoprotein from H1N1 and H3N2 (jpt Peptide Technologies), or no peptide (to use as a reference for non-specific or background clonotypic expansion). On day 3, half the medium was replaced with fresh medium containing cytokines for a final concentration of 50 IU ml−1 IL-2 (Chiron), 25 ng ml−1 IL-7 (Miltenyi) and 25 ng ml−1 IL-15 (PeproTech). On day 7, half the medium was replaced with fresh culture medium containing cytokines for a final concentration of 100 IU ml−1 IL-2 and 25 ng ml−1 IL-7 and IL-15. On day 10, cells were harvested, washed twice with PBS, and the CD8+ fraction was isolated using a CD8+ negative enrichment kit (EasySep; STEMCELL Technologies). DNA was extracted from each CD8-enriched culture condition using the Qiamp micro-DNA kit according to the manufacturer’s instructions. TCR sequencing was performed on each individual peptide-stimulated T cell culture using survey-level sequencing (max depth ~60,000 reads) by Adaptive Biotechnologies using their established platform34 or by the Sidney Kimmel Comprehensive Cancer Center FEST and TCR Immunogenomics Core (FTIC) facility using the Oncomine TCR Beta short-read assay (Illumina) and sequenced on an Illumina iSeq 100 using unique dual indexes, for a maximum of ~40,000 reads per sample.
Data pre-processing was performed to eliminate non-productive TCR sequences and to align and trim the nucleotide sequences to obtain only the CDR3 region. Sequences not beginning with C or ending with F or W and having less than seven amino acids in the CDR3 were eliminated. TCR sequencing samples with less than 1,000 productive reads were excluded from downstream analysis. MD043-011-MANA_22 was the only such sample in the present study (see Supplementary Table 7). Resultant processed data files were uploaded to our publicly available MANAFEST analysis web app (http://www.stat-apps.onc.jhmi.edu/FEST) to bioinformatically identify antigen-specific T cell clonotypes.
Bioinformatic analysis of productive clones was performed to identify antigen-specific T cell clonotypes meeting the following criteria: (1) significant expansion (Fisher’s exact test with Benjamini–Hochberg correction for false discovery rate (FDR), P < 0.05) compared to T cells cultured without peptide, (2) significant expansion compared to every other peptide-stimulated culture (FDR <0.05) except for conditions stimulated with similar neoantigens derived from the same mutation, (3) an odds ratio >5 compared to the no peptide control, and (4) present in at least 10% of the cultured wells to ensure adequate distribution among culture wells. A lower read threshold of 300 was used for assays sequenced by the FTIC and a lower threshold of 30 was used for samples sequenced by Adaptive Biotechnologies. In MANAFEST assays testing less than 10 peptides or peptide pools, cultures were performed in triplicate and reactive clonotypes were defined as being significantly expanded relative to T cells cultured without peptide (FDR <0.05) in two out of three triplicates, and not significantly expanded in any other well tested. When available, TCRseq was also performed on DNA extracted from tumour, normal lung, and lymph node tissue obtained before treatment and at the time of surgical resection, as well as serial peripheral blood samples. The assays performed on each biospecimen are outlined in Supplementary Table 2.
Peptide affinity and stability measurements
Peptide affinity for cognate HLA molecules was assessed using a luminescent oxygen channeling immunoassay (LOCI; AlphaScreen, Perkin Elmer) as previously described35. This is a proximity-based system using a donor and acceptor bead, each conjugated with an epitope tag. When the donor bead is excited with light at 650 nm and can activate an acceptor bead, resulting in a signal at 520–620 nm, which can be quantified per second as a surrogate of affinity. A higher number of counts per second indicates higher affinity of the peptide:HLA pair. The stability of peptide loaded complexes was measured by refolding MHC with peptide and subsequently challenging complexes with a titration of urea. The denaturation of MHC was monitored by ELISA as described previously36.
TCR reconstruction and cloning
Ten MANAFEST+ TCR sequences for which the TCRα chain could be enumerated (>3 cells in single-cell data with the same TCRα–TCRβ pair) were selected for cloning. In addition, seven clones (from three individuals: MD01-004, MD01-005 and MD043-011) that have high composite signature (using the AddModuleScore function) consisting of differential gene programs of MANA-specific T cell relative to influenza-specific T cells in the TRM were selected for cloning. Relevant TCRs were analysed with the IMGT/V-Quest database (http://www.imgt.org). The database allows us to identify the TRAV and TRBV families with the highest likelihood to contain the identified segments which match the sequencing data. To generate the TCRs, the identified TCRA V-J region sequences were fused to the human TRA constant chain, and the TCRB V-D-J regions to the human TRB constant chain. The full-length TCRA and TCRB chains were then synthesized as individual gene blocks (IDT) and cloned into the pCI mammalian expression vector, containing a CMV promoter, and transformed into competent Escherichia coli cells according to the manufacturer’s instructions (NEBuilder HiFi DNA Assembly, NEB). Post transformation and plasmid miniprep, the plasmids were sent for Sanger sequencing to ensure no mutations were introduced (Genewiz).
T cell transfection, transient TCR expression and MANA-recognition assays
To generate a Jurkat reporter cell in which we could transfer our TCRs of interest, the endogenous TCR α- and β-chains were knocked out of a specific Jurkat line that contains a luciferase reporter driven by an NFAT response element (Promega) using the Alt-R CRISPR system (Integrated DNA Technologies, IDT). Two sequential rounds of CRISPR knockout were performed using crDNA targeting the TCRα constant region (AGAGTCTCTCAGCTGGTACA) and the TCRβ constant region (AGAAGGTGGCCGAGACCCTC). Limiting dilution was then used to acquire single cell clones and clones with both TCRα and TCRβ knocked out, as confirmed by Sanger sequencing and restoration of CD3 expression only by the co-transfection of TCRα or TCRβ chains, were chose. CD8α and CD8β chains were then transduced into the TCRα−TCRβ− Jurkat reporter cells using the MSCV retroviral expression system (Clontech). Jurkat reporter cells were then co-electroporated with the pCI vector encoding the TCRB and TCRA gene blocks, respectively, using ECM830 Square wave electroporation system (BTX) at 275 V for 10 ms in OptiMem media in a 4-mm cuvette. Post electroporation, cells were rested overnight by incubating in in RPMI 10% FBS at 37 °C, 5% CO2. TCR expression was confirmed by flow cytometric staining for CD3 on a BD FACSCelesta and 50,000 CD3+ T cells were plated in each well of a 96-well plate. Reactivity of the TCR-transduced Jurkat cells was assessed by co-culturing with 1 × 105 autologous EBV-transformed B cells, loaded with titrating concentrations of MANA peptides, viral peptide pools or negative controls. After overnight incubation, activation of the NFAT reporter gene was measured by the Bio-Glo Luciferase Assay per manufacturer’s instructions (Promega). Jurkat cells were routinely tested for mycoplasma contamination. No cell line authentication was performed.
COS-7 transfection with HLA allele and p53 plasmids
gBlocks (IDT) encoding HLA A*68:01, p53(R248L) and wild-type p53 were cloned into pcDNA3.4 vector (Thermo Fisher Scientific, A14697). COS-7 cells were transfected with plasmids at 70–80% confluency using Lipofectamine 3000 (Thermo Fisher Scientific, L3000015) and incubated at 37 °C overnight in T75 flasks. A total of 30 μg plasmid (1:1 ratio of HLA plasmid per target protein plasmid in co-transfections) was used. Post transfection, COS-7 cells were plated with TCRαβ-transfected JurkaT cells containing NFAT reporter gene at a 1:1 ratio. After overnight incubation, activation of the NFAT reporter gene was measured by the Bio-Glo Luciferase Assay per manufacturer’s instructions (Promega).
Single-cell data pre-processing and quality control
Cell Ranger v3.1.0 was used to demultiplex the FASTQ reads, align them to the GRCh38 human transcriptome, and extract their cell and unique molecular identifier (UMI) barcodes. The output of this pipeline is a digital gene expression (DGE) matrix for each sample, which records the number of UMIs for each gene that are associated with each cell barcode. The quality of cells was then assessed based on (1) the number of genes detected per cell and (2) the proportion of mitochondrial gene/ribosomal gene counts. Low-quality cells were filtered if the number of detected genes was below 250 or above 3× the median absolute deviation away from the median gene number of all cells. Cells were filtered out if the proportion of mitochondrial gene counts was higher than 10% or the proportion of ribosomal genes was less than 10%. For single-cell VDJ sequencing, only cells with full-length sequences were retained. Dissociation/stress associated genes37,38, mitochondrial genes (annotated with the prefix ‘MT-’), high abundance lincRNA genes, genes linked with poorly supported transcriptional models (annotated with the prefix ‘RP-’)39 and TCR (TR) genes (TRA/TRB/TRD/TRG, to avoid clonotype bias) were removed from further analysis. In addition, genes that were expressed in less than five cells were excluded.
Single-cell data integration and clustering
Seurat40 (3.1.5) was used to normalize the raw count data, identify highly variable features, scale features, and integrate samples. PCA was performed based on the 3,000 most variable features identified using the vst method implemented in Seurat. Gene features associated with type I Interferon (IFN) response, immunoglobulin genes and specific mitochondrial related genes were excluded from clustering to avoid cell subsets driven by the above genes39. Dimension reduction was done using the RunUMAP function. Cell markers were identified by using a two-sided Wilcoxon rank sum test. Genes with adjusted P <0.05 were retained. Clusters were labelled based on the expression of the top differential gene in each cluster as well as canonical immune cell markers. Global clustering on all CD3 T cells and refined clustering on CD8 T cells were performed using same procedure. To select for CD8+ T cells, SAVER41 was used to impute dropouts by borrowing information across similar genes and cells. A density curve was fitted to the log2-transformed SAVER imputed CD8A expression values (using the ‘density’ function in R) of all cells from all samples. A cut-off is determined as the trough of the bimodal density curve (that is, the first location where the first derivative is zero and the second derivative is positive). All cells with log2-transformed SAVER imputed CD8A expression larger than the cut-off are defined as CD8+ T cells. TRB amino acid sequences were used as a biological barcode to match MANA, EBV or influenza A-specific T cell clonotypes identified from the FEST assay with single-cell VDJ profile and were projected onto CD8+ T cell refined UMAP.
Single-cell subset pseudobulk gene expression analysis
PCA was performed on a standardized pseudobulk gene expression profile, where each feature was standardized to have a mean of zero and unit variance. In global CD3 and CD8 TIL PCA, for each cell cluster we first aggregated read counts across cells within the cluster to produce a pseudobulk expression profile for each sample and normalized these pseudobulk expression profiles across samples by library size. Combat function in the sva R package42,43 was applied to address potential batch effects on the normalized pseudobulk profile. Highly variable genes (HVGs) were selected for each cell cluster by fitting a locally weighted scatterplot smoothing (LOESS) regression of standard deviation against the mean for each gene and identifying genes with positive residuals. For each sample, all cell clusters were then concatenated by retaining each cluster’s HVGs to construct a concatenated gene expression vector consisting of all highly variable features identified from different cell clusters. Each element in this vector represents the pseudobulk expression of a HVG in a cell cluster. Samples were embedded into the PCA space based on these concatenated gene expression vectors. Canonical correlation44,45 between the first two PCs (that is, PC1 and PC2) and a covariate of interest (that is, tissue type or response status) was calculated. Permutation test was used to assess the significance by randomly permuting the sample labels 10,000 times. In the MANA-specific PCA (Extended Data Fig. 11), MANA-enriched cell clusters, defined by clusters with MANA-specific T cell frequency at least two fold higher than randomly expected, were aggregated as one combined cell cluster. Then, a similar procedure by first identifying HVGs, computing the first 2 PCs and then calculating the canonical correlation was repeated for the combined MANA-enriched cell cluster and each of the other CD8 clusters.
Differential analysis comparing MPR and non-MPR by total CD8 or CD4 TIL and by cell cluster
The gene expression read counts were adjusted by library size. SAVER41 was used to impute the dropouts, and further log2-transformed the imputed values after adding a pseudocount of 1. A linear mixed-effect model46 was constructed to identify genes that are significantly differential between MPR and non-MPR among total CD8/CD4 TIL and by each cell cluster, respectively. The B-H procedure47 was used to adjust the P values for multiple testing, and the statistical significance is determined using a cut-off of FDR <0.05.
Differential-expression tests and antigen-specific T cell marker genes
Differential-expression tests for antigen-specific T cells were performed using FindAllMarkers functions in Seurat with Wilcoxon rank-sum test on SAVER imputed expression values. Genes with >0.25 log2-fold changes, at least 25% expressed in tested groups, and Bonferroni-corrected p values <0.05 were regarded as significantly differentially expressed genes (DEGs). Antigen-specific (MANA versus influenza versus EBV) T cell marker genes were identified by applying the differential expression tests for upregulated genes between cells of one antigen specificity to all other antigen-specific T cells in the dataset. MANA-specific T cell genes associated with response to ICB were identified by applying the differential expression tests comparing MANA-specific T cells from MPR versus those from non-MPR. Top ranked DEGs (by log-fold changes) with a log2-fold changes >0.8 and DEGs relating to T cell function were extracted for further visualization in a heat map using pheatmap package. SAVER-imputed expression values of selective marker genes (transcriptional regulators, memory markers, tissue-resident markers, T cell checkpoints, effector and activation markers) were plotted using the RidgePlot function in Seurat.
In vitro short-term TIL stimulation with IL-7
Cryopreserved TIL from patient MD01-004 were thawed, counted, and stained with the LIVE/DEAD Fixable Aqua (ThermoFisher) viability marker and antibodies specific for CD3 (PE, clone SK1) and CD8 (BV786, clone RPA-T8). Thirty-thousand CD8+ T cells per condition were sorted on a BD FACSAria II Cell Sorter into a 96-well plate. Autologous T cell-depleted PBMC were added as antigen presenting cells (APC) at 1:1 ratio. The cells were stimulated with either influenza A or MD01-004-MANA 12 peptide and titrating concentrations of recombinant human IL-7 (Miltenyi) for 12 h in a round-bottomed 96-well plate.
Gene expression analysis of IL-7-stimulated MANA- and influenza-specific TIL
Following 12 h of antigen and IL-7 stimulation, cells were spun down, counted and re-suspended in 1% BSA at desired concentration. scRNA-seq and VDJ libraries were prepared using 10X Chromium single cell platform using 5′ DGE library preparation reagents and kits according to manufacturer’s protocols (10X Genomics) and as described above. MANA- or influenza-specific T cell clonotypes from the single-cell dataset were identified by using the TRB amino acid sequences as a biological barcode. SAVER imputed gene expression was scaled and centred using the ScaleData function in Seurat. A composite score for the IL-7-upregulated gene set48 (Supplementary Data 2.3) expression was computed using the AddModuleScore function and subsequently visualized using ridgeplot. Mean ± standard error was used to show the dose–response curve of the IL-7-upregulated gene-set score by antigen-specific T cells and peptide-stimulation groups.
Immune checkpoint and exhaustion score generation and highly correlated genes
To characterize dysfunctional CD8 MANA TIL, six best-characterized (and clinically targeted) checkpoints: CTLA4, PDCD1, LAG3, HAVCR2, TIGIT and ENTPD1, were used to compute the T cell checkpoint score, and a published gene list from exhausted T cells was used to compute the T cell exhaustion score, using AddModuleScore function in Seurat. Applying T cell checkpoint score as an anchor, genes that were maximally correlated to the score were identified using linear correlation in MANA-specific TIL from MPR and non-MPR, respectively. Top-30 genes (from HVG selected using FindVariableGenes function in Seurat and excluded the 6 genes included in immune checkpoint score generation) with the highest correlation coefficients were plotted as a bar plot. The difference of correlation coefficients of the above genes was additionally computed between MPR and non-MPR and visualized using waterfall plot.
Evaluation of peripheral MANA-specific T cell transcriptome changes during treatment
Peripheral blood T cells from patient MD01-005 were sorted based on expression of CD8 and TCRVβ2, followed by scRNA-seq–TCR-seq and clustering on conventional CD8+ T cells (MAIT cells excluded). To evaluate whether there was a statistically significant change in the cell types of MANA cells between week 2 (W2) and week 4 (W4) samples in Fig. 4d, e, we first conducted a Fisher’s exact test, which yields a P = 0.021, indicating a statistically significant phenotype change in MANA-specific cells (Fig. 4e). We also conducted a more sophisticated test that adjust for potential background differences in cell type abundance between W2 and W4 samples. In this test, we let mc,t denote the probability that a MANA-specific T cell collected at time point t (W2 or W4) comes from cell type c, and let pc,t denote the proportion of all cells in time point t that come from cell type c. We evaluated the ratio Rc,t = mc,t/pc,t, which characterizes the relative abundance of MANA-specific T cells in each cell type. We compared the null model where this ratio does not change over time (H0: Rc,W2 = Rc,W4 for all cell type c) versus the alternative model where W2 and W4 T cells have different ratios (H1: Rc,W2 ≠ Rc,W4). To do this, we computed the test statistic using the observed data and compared it to its null distribution obtained using Monte Carlo simulations. To construct the null distribution for , we pooled cells from W2 and W4 together and treated them as one sample to estimate the common ratio Rc,W2 = Rc,W4 = Rc shared by W2 and W4, and then derived the probability that a MANA-specific T cell collected at time point t comes from cell type c under the null model H0, which is proportional to pc,t Rc (that is, the product of the sample-specific background cell type proportion pc,t and the common MANA-abundance ratio Rc shared between samples). The MANA-specific T cells at time point t were then redistributed to different cell types randomly based on a multinomial distribution with this expected MANA-specific T cell type proportion (that is, the expected probability that a MANA-specific T cell at time point t comes from cell type c under H0 is ), while keeping the total number of MANA-specific T cells at each time point the same as the observed MANA-specific T cell number at that time point. The test statistic S was then computed using this simulated sample. We repeated this simulation 10,000 times to derive the null distribution of S. Comparing the observed S to its null distribution yields a P < 10−4.
RNA velocity-based differentiation-trajectory tracing
The RNA velocity analysis was performed by first recounting the spliced reads and unspliced reads based on aligned bam files of scRNA-seq data using the velocyto Python package. The calculation of RNA velocity values for each gene in each cell and embedding RNA velocity vector to low-dimension space were done using the SeuratWapper workflow for estimating RNA velocity using Seurat (https://github.com/satijalab/seurat-wrappers/blob/master/docs/velocity.md). The first two diffusion components from Diffusion map were used to construct the coordinates along with velocity. TSCAN (v.1.7.0) was used to reconstruct the cellular pseudotime on diffusion maps space for the PBMC T cells from three time points (samples) of one patient (MD01-005). Based on velocity analysis, the Tmem(3) cluster was specified as the starting cluster for the pseudotemporal trajectory which has branches. For each branch, log2-transformed and library size-normalized SAVER-imputed gene expression values were used for analysing gene expression dynamics along the pseudotime. 10,325 genes with normalized expression ≥ 0.01 in at least 1% of cells were retained. For each gene g, the gene expression along pseudotime t in each sample s was described as a function which was obtained by fitting B-spline regression to the gene’s normalized expression values in single cells. The red curves in Fig. 4h are the mean of the function of the three samples. In order to test whether the gene expression shows a significant change along pseudotime, we compared the above model with a null model in which is assumed to be a constant over time. The likelihood ratio statistic between the two models was computed. To determine the P value, the null distribution of the likelihood ratio statistic was constructed by permuting the pseudotime of cells in each sample, refitting the models and recomputing the likelihood ratio statistic. The P value was calculated as the number of permutations out of a total of 1,000 permutations that produce a likelihood ratio statistic larger than the observed one. The P values from all genes were converted to FDR by Benjamini–Hochberg procedure to adjust for multiple testing. Genes with FDR < 0.05 were considered as dynamic genes with statistical significance. k-Means clustering was applied to group genes with similar dynamic expression patterns into clusters. topGO (v.2.42.0) was used to identify the enriched Gene Ontology terms by comparing the genes in each cluster to all 10,325 genes as background.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this paper.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-021-03752-4.
Supplementary information
Acknowledgements
We thank the FEST and TCR Immunogenomics Core (FTIC) and the Experimental and Computational Genomics Core (ECGC) at the Sidney Kimmel Comprehensive Cancer Center; P. F. Robbins and R. Yossef for assistance with developing TCR cloning methods; G. Pereira and A. Curry from the Johns Hopkins Upper Aerodigestive Biorepository; the Melanoma DMT Tissue Repository Team and Memorial Sloan Kettering; I. Beadles and C. Barkley for clinical support; and K. Maly, L. Hartman, R. Carlson and our respective administrative teams. Graphical images were created by BioRender.com. This work was funded by grants from the Lung Cancer Foundation of America (K.N.S.), The Mark Foundation for Cancer Research (D.M.P., J.M.T., V.E.V. and V.A.), The SU2C/Mark Foundation Lung Cancer Dream Team Convergence Award (D.M.P.), the SU2C INTIME and SU2C-LUNGevity-American Lung Association Lung Cancer Interception Dream Team (J.R.B. and V.E.V.), Bristol-Myers Squibb, the SU2C DCS International Translational Cancer Research Dream Team (SU2C-AACR-DT1415, V.E.V.), The IASLC Foundation (K.N.S.), Swim Across America (K.N.S. and V.A.), The LUNGevity Foundation (K.N.S. and V.A.), The Commonwealth Foundation (K.N.S., S.Z., D.M.P. and S.Y.), The Banks Family Foundation (M. Brock), Subsidies for Current Expenditures to Private Institutions of Higher Education from the PMAC, through a sub-award from Juntendo University (M. Brock), the Dr. Miriam and Sheldon G. Adelson Medical Research Foundation (V.E.V.), The Virginia and D.K. Ludwig Fund for Cancer Research (B.V., K.W.K. and S.Z.), the Ludwig Center for Cancer Immunotherapy at Memorial Sloan Kettering (T.M.), the Cancer Research Institute (T.M.), the Parker Institute for Cancer Immunotherapy (T.M.), The Lustgarten Foundation for Pancreatic Cancer Research (B.V., K.W.K. and S.Z.), The Conquer Cancer Foundation of ASCO (K.A.M. and J.E.R.), Bloomberg Philanthropies, the Maryland Cigarette Restitution Fund (S.Y.), the V Foundation (V.A. and V.E.V.), the Allegheny Health Network–Johns Hopkins Research Fund (V.A. and V.E.V), the Damon Runyon Cancer Research Foundation (CI-98-18, M.D.H.), US NIH grants R37CA251447 (K.N.S.), R01HG010889 (H.J.), R01HG009518 (H.J.), R01CA121113 (V.A. and V.E.V.), R01CA217169 (J.E.C.), R01CA240472 (J.E.C.), CA62924 (B.V. and K.W.K.), T32 CA193145 (T.R.C.), T32 CA009110 (J.X.C.) and T32 GM136577 (B.V. and K.W.K.), and NIH Cancer Center Support Grants (P30 CA008748 and P30 CA006973). Stand Up To Cancer is a programme of the Entertainment Industry Foundation administered by the American Association for Cancer Research.
Extended data figures and tables
Author contributions
A.V. and Z.J. contributed equally to this work. J.X.C., J.Z., S.Y., H.J., D.M.P. and K.N.S. conceived the project and wrote the manuscript. J.X.C., A.V., R.B., A.T., H.G., H.Y.C., D.S., M.E.A., S.T., A.G.D., B.C., L.S.C., R.W., J.M., K.S., A.G. and A.S. designed and performed functional T cell and single-cell experiments. T.R.C., J.M.T., P.B.I., E.L.B., J.H., M. Brock, J.E.R., P.M.F., J.R.B., K.A.M., J.N., B.J.P., D.R.J., M.D.H., J.E.C. and M. Bott. provided pathologic assessments, response classifications and clinical annotation. E.H.-C.H., B.J.M., K.W.K., S.Z. and B.V. designed and optimized the CD8+ Jurkat cell line. V.A., V.E.V., M.L. and Z.B. performed whole-exome sequencing, mutation calls, neoantigen prediction and interpretation of genetic data. S.J. performed peptide–MHC binding and stability assays. J.Z., Z.J., B.Z., P.B., S.W., W.H., L.A., S.Y. and H.J. performed computational, bioinformatic and biostatistical analyses. T.M., D.M.P. and K.N.S. led analysis and interpretation of immunological experiments. All authors provided input on the manuscript. S.Y., H.J., D.M.P. and K.N.S. supervised all aspects of this work.
Data availability
Bulk TCRVβ sequencing data generated by Adaptive Biotechnologies are available in the Adaptive Biotechnologies ImmuneACCESS repository under DOI 10.21417/JC2021N, at https://clients.adaptivebiotech.com/pub/caushi-2021-n. Bulk TCRVβ raw and processed sequencing data generated by the Sidney Kimmel Comprehensive Cancer Center FTIC are available in the Gene Expression Omnibus with accession number GSE173351. Raw scRNA-seq–TCR-seq data reported in this paper are available in the European Genome-phenome Archive under controlled access with accession number EGAS00001005343. Owing to the personal, sensitive and inherently identifying nature of raw genomic data, access to raw RNA-seq–TCR-seq data is controlled and full instructions to apply for data access can be found at https://ega-archive.org/access/data-access. Approvals will be granted immediately upon confirmation that all requirements are met. Processed and de-identified single-cell data are available in the Gene Expression Omnibus with accession number GSE176022.
Code availability
Scripts to reproduce the analyses used in this study are available at https://github.com/BKI-immuno/neoantigen-specific-T-cells-NSCLC.
Competing interests
V.A. receives research funding from Bristol-Myers Squibb and Astra Zeneca. J.M.T. receives research funding from Bristol-Myers Squibb and serves a consulting/advisory role for Bristol-Myers Squibb, Merck, and Astra Zeneca. P.B.I. receives research funding from Bristol-Myers Squibb and Erbe Elektromedizin GmbH and serves a consulting/advisory role for AstraZeneca and Veran Medical Technologies. J.N. receives research funding from AstraZeneca, Bristol-Myers Squibb, and Merck, and serves a consulting/advisory role for AstraZeneca, Daiichi Sankyo, Bristol-Myers Squibb, Merck, and Roche/Genentech. K.A.M. is a consultant for Amgen and Astra Zeneca. D.R.J. is a consultant for More Health and AstraZeneca and a Steering Committee Member for Merck. B.J.P. is a consultant for AstraZeneca and Regeneron and has received honoraria from Intuitive Surgical. J.E.C. is a consultant for AstraZeneca, Genentech, Merck, Flame Bioscience, and Novartis. V.E.V. is a founder of Delfi Diagnostics and Personal Genome Diagnostics, serves on the Board of Directors and as a consultant for both organizations, and owns Delfi Diagnostics and Personal Genome Diagnostics stock, which are subject to certain restrictions under university policy. Additionally, Johns Hopkins University owns equity in Delfi Diagnostics and Personal Genome Diagnostics. V.E.V. is an advisor to Bristol-Myers Squibb, Genentech, Merck, and Takeda Pharmaceuticals. Within the last five years, V.E.V. has been an advisor to Daiichi Sankyo, Janssen Diagnostics, and Ignyta. M.D.H. receives research support from Bristol-Myers Squibb; has been a compensated consultant for Merck, Bristol-Myers Squibb, AstraZeneca, Genentech/Roche, Nektar, Syndax, Mirati, Shattuck Labs, Immunai, Blueprint Medicines, Achilles, Arcus, and Natera; received travel support/honoraria from AstraZeneca, Eli Lilly, and Bristol-Myers Squibb; has options from Shattuck Labs, Immunai, and Arcus; and has a patent filed by his institution related to the use of tumor mutation burden to predict response to immunotherapy (PCT/US2015/062208), which has received licensing fees from PGDx. The Johns Hopkins University is in the process of filing patent applications related to technologies described in this paper on which E.H.-C.H., B.V., K.W.K. and S.Z. are listed as inventors. B.V. and K.W.K. are founders of Thrive Earlier Detection. K.W.K. is a consultant to and was on the Board of Directors of Thrive Earlier Detection. B.V., K.W.K. and S.Z. own equity in Exact Sciences. B.V., K.W.K. and S.Z. are founders of, hold or may hold equity in, and serve or may serve as consultants to manaT Bio, and hold or may hold equity in manaT Holdings, LLC. B.V., K.W.K., and S.Z. are founders of, hold equity in, and serve as consultants to Personal Genome Diagnostics. S.Z. has a research agreement with BioMed Valley Discoveries. K.W.K. and B.V. are consultants to Sysmex, Eisai, and CAGE Pharma and hold equity in CAGE Pharma. B.V. is a consultant to and holds equity in Catalio. K.W.K., B.V. and S.Z. are consultants to and hold equity in NeoPhore. The companies named above, as well as other companies, have licensed previously described technologies related to the work from this lab at Johns Hopkins University. Licenses to these technologies are or will be associated with equity or royalty payments to the inventors as well as to Johns Hopkins University. T.M. is a cofounder and holds equity in IMVAQ Therapeutics, is a consultant for Immunos Therapeutics, ImmunoGenesis, and Pfizer, receives research funding from Bristol-Myers Squibb, Surface Oncology, Kyn Therapeutics, Infinity Pharmaceuticals, Inc., Peregrine Pharmaceuticals, Inc., Adaptive Biotechnologies, Leap Therapeutics, Inc., and Aprea, and holds patents on applications related to work on oncolytic viral therapy, alpha virus-based vaccines, neoantigen modeling, CD40, GITR, OX40, PD-1, and CTLA-4. J.R.B. serves an advisory/consulting role for Amgen, AstraZeneca, Bristol-Myers Squibb, Genentech/Roche, Eli Lilly, GlaxoSmithKline, Merck, Sanofi, and Regeneron, receives research funding from AstraZeneca, Bristol-Myers Squibb, Genentech/Roche, Merck, RAPT Therapeutics, Inc., and Revolution Medicines, and is on the Data and Safety Monitoring Board of GlaxoSmithKline, Janssen, and Sanofi. P.M.F. receives research support from AstraZeneca, Bristol-Myers Squibb, Novartis, and Kyowa, and has been a consultant for AstraZeneca, Amgen, Bristol-Myers Squibb, Daichii Sankyo, and Janssen and serves on a data safety and monitoring board for Polaris. S.Y. receives research funding from Bristol-Myers Squibb/Celgene, Janssen, and Cepheid, has served as a consultant for Cepheid, and owns founder’s equity in Astra Therapeutics and Digital Harmonic. K.N.S., D.M.P., B.V. and K.W.K. have filed for patent protection on the MANAFEST technology described herein (serial no. 16/341,862). K.N.S., D.M.P., J.E.C., B.V., E.H.-C.H. D.M.P., K.W.K. and S.Z. have filed for patent protection on the p53 R248L mutation-specific TCR described herein (serial no. 63/168,878). D.M.P. is a consultant for Compugen, Shattuck Labs, WindMIL, Tempest, Immunai, Bristol-Myers Squibb, Amgen, Janssen, Astellas, Rockspring Capital, Immunomic, Dracen and owns founder’s equity in manaT Holdings, LLC, WindMIL, Trex, Jounce, Anara, Tizona, Tieza, RAPT and receives research funding from Compugen, Bristol-Myers Squibb, and Anara. K.N.S. has received travel support/honoraria from Illumina, Inc., receives research funding from Bristol-Myers Squibb, Anara, and Astra Zeneca, and owns founder’s equity in manaT Holdings, LLC. The terms of all these arrangements are being managed by the respective institutions in accordance with their conflict-of-interest policies.
Footnotes
Peer review informationNature thanks Benny Chain, Aude Chapuis, Evan Newell and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Justina X. Caushi, Jiajia Zhang
Change history
10/4/2021
A Correction to this paper has been published: 10.1038/s41586-021-03893-6
Contributor Information
Srinivasan Yegnasubramanian, Email: syegnasu@jhmi.edu.
Hongkai Ji, Email: hji@jhu.edu.
Drew M. Pardoll, Email: dpardol1@jhmi.edu
Kellie N. Smith, Email: kellie@jhmi.edu
Extended data
is available for this paper at 10.1038/s41586-021-03752-4.
Supplementary information
The online version contains supplementary material available at 10.1038/s41586-021-03752-4.
References
- 1.Tumeh PC, et al. PD-1 blockade induces responses by inhibiting adaptive immune resistance. Nature. 2014;515:568–571. doi: 10.1038/nature13954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Simoni Y, et al. Bystander CD8+ T cells are abundant and phenotypically distinct in human tumour infiltrates. Nature. 2018;557:575–579. doi: 10.1038/s41586-018-0130-2. [DOI] [PubMed] [Google Scholar]
- 3.Danilova L, et al. The mutation-associated neoantigen functional expansion of specific T Cells (MANAFEST) assay: a sensitive platform for monitoring antitumor immunity. Cancer Immunol. Res. 2018;6:888–899. doi: 10.1158/2326-6066.CIR-18-0129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Vogelstein B, et al. Cancer genome landscapes. Science. 2013;339:1546–1558. doi: 10.1126/science.1235122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Yarchoan M, Hopkins A, Jaffee EM. Tumor mutational burden and response rate to PD-1 inhibition. N. Engl. J. Med. 2017;377:2500–2501. doi: 10.1056/NEJMc1713444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Yarchoan M, Johnson BA, III, Lutz ER, Laheru DA, Jaffee EM. Targeting neoantigens to augment antitumour immunity. Nat. Rev. Cancer. 2017;17:209–222. doi: 10.1038/nrc.2016.154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Scheper W, et al. Low and variable tumor reactivity of the intratumoral TCR repertoire in human cancers. Nat. Med. 2019;25:89–94. doi: 10.1038/s41591-018-0266-5. [DOI] [PubMed] [Google Scholar]
- 8.Forde PM, et al. Neoadjuvant PD-1 blockade in resectable lung cancer. N. Engl. J. Med. 2018;378:1976–1986. doi: 10.1056/NEJMoa1716078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Blank CU, et al. Neoadjuvant versus adjuvant ipilimumab plus nivolumab in macroscopic stage III melanoma. Nat. Med. 2018;24:1655–1661. doi: 10.1038/s41591-018-0198-0. [DOI] [PubMed] [Google Scholar]
- 10.Huang AC, et al. A single dose of neoadjuvant PD-1 blockade predicts clinical outcomes in resectable melanoma. Nat. Med. 2019;25:454–461. doi: 10.1038/s41591-019-0357-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kaunitz GJ, et al. Melanoma subtypes demonstrate distinct PD-L1 expression profiles. Lab. Invest. 2017;97:1063–1071. doi: 10.1038/labinvest.2017.64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Topalian SL, et al. Neoadjuvant nivolumab for patients with resectable Merkel cell carcinoma in the CheckMate 358 Trial. J. Clin. Oncol. 2020;38:2476–2487. doi: 10.1200/JCO.20.00201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Clarke J, et al. Single-cell transcriptomic analysis of tissue-resident memory T cells in human lung cancer. J. Exp. Med. 2019;216:2128–2149. doi: 10.1084/jem.20190249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bagaev, D. V. et al. VDJdb in 2019: database extension, new analysis infrastructure and a T-cell receptor motif compendium. Nucleic Acids Res. 48, D1057–D1062 (2020). [DOI] [PMC free article] [PubMed]
- 15.Dykema AG, et al. Functional characterization of CD4+ T cell receptors crossreactive for SARS-CoV-2 and endemic coronaviruses. J. Clin. Invest. 2021;131:146922. doi: 10.1172/JCI146922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.van der Leun AM, Thommen DS, Schumacher TN. CD8+ T cell states in human cancer: insights from single-cell analysis. Nat. Rev. Cancer. 2020;20:218–232. doi: 10.1038/s41568-019-0235-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Guo X, et al. Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing. Nat. Med. 2018;24:978–985. doi: 10.1038/s41591-018-0045-3. [DOI] [PubMed] [Google Scholar]
- 18.Pizzolla A, et al. Influenza-specific lung-resident memory T cells are proliferative and polyfunctional and maintain diverse TCR profiles. J. Clin. Invest. 2018;128:721–733. doi: 10.1172/JCI96957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Pearce EL, et al. Control of effector CD8+ T cell function by the transcription factor eomesodermin. Science. 2003;302:1041–1043. doi: 10.1126/science.1090148. [DOI] [PubMed] [Google Scholar]
- 20.Chihara N, et al. Induction and transcriptional regulation of the co-inhibitory gene module in T cells. Nature. 2018;558:454–459. doi: 10.1038/s41586-018-0206-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Scott AC, et al. TOX is a critical regulator of tumour-specific T cell differentiation. Nature. 2019;571:270–274. doi: 10.1038/s41586-019-1324-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yao C, et al. Single-cell RNA-seq reveals TOX as a key regulator of CD8+ T cell persistence in chronic infection. Nat. Immunol. 2019;20:890–901. doi: 10.1038/s41590-019-0403-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Seo H, et al. TOX and TOX2 transcription factors cooperate with NR4A transcription factors to impose CD8+ T cell exhaustion. Proc. Natl Acad. Sci. USA. 2019;116:12410–12415. doi: 10.1073/pnas.1905675116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Mackay LK, et al. Hobit and Blimp1 instruct a universal transcriptional program of tissue residency in lymphocytes. Science. 2016;352:459–463. doi: 10.1126/science.aad2035. [DOI] [PubMed] [Google Scholar]
- 25.Kok L, et al. A committed tissue-resident memory T cell precursor within the circulating CD8+ effector T cell pool. J. Exp. Med. 2020;217:e20191711. doi: 10.1084/jem.20191711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Topalian SL, Taube JM, Pardoll DM. Neoadjuvant checkpoint blockade for cancer immunotherapy. Science. 2020;367:eaax0182. doi: 10.1126/science.aax0182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Behr FM, et al. Tissue-resident memory CD8+ T cells shape local and systemic secondary T cell responses. Nat. Immunol. 2020;21:1070–1081. doi: 10.1038/s41590-020-0723-4. [DOI] [PubMed] [Google Scholar]
- 28.Skoulidis F, et al. STK11/LKB1 mutations and PD-1 inhibitor resistance in KRAS-mutant lung adenocarcinoma. Cancer Discov. 2018;8:822–835. doi: 10.1158/2159-8290.CD-18-0099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Smith KN, et al. Persistent mutant oncogene specific T cells in two patients benefitting from anti-PD-1. J. Immunother. Cancer. 2019;7:40. doi: 10.1186/s40425-018-0492-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Anagnostou V, et al. Evolution of neoantigen landscape during immune checkpoint blockade in non-small cell lung cancer. Cancer Discov. 2017;7:264–276. doi: 10.1158/2159-8290.CD-16-0828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Cottrell TR, et al. Pathologic features of response to neoadjuvant anti-PD-1 in resected non-small-cell lung carcinoma: a proposal for quantitative immune-related pathologic response criteria (irPRC) Ann. Oncol. 2018;29:1853–1860. doi: 10.1093/annonc/mdy218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Anagnostou V, et al. Multimodal genomic features predict outcome of immune checkpoint blockade in non-small-cell lung cancer. Nat. Cancer. 2020;1:99–111. doi: 10.1038/s43018-019-0008-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Jones S, et al. Personalized genomic analyses for cancer mutation discovery and interpretation. Sci. Transl. Med. 2015;7:283ra53. doi: 10.1126/scitranslmed.aaa7161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Carlson CS, et al. Using synthetic templates to design an unbiased multiplex PCR assay. Nat. Commun. 2013;4:2680. doi: 10.1038/ncomms3680. [DOI] [PubMed] [Google Scholar]
- 35.Harndahl M, et al. Peptide binding to HLA class I molecules: homogenous, high-throughput screening, and affinity assays. J. Biomol. Screen. 2009;14:173–180. doi: 10.1177/1087057108329453. [DOI] [PubMed] [Google Scholar]
- 36.Sylvester-Hvid C, et al. Establishment of a quantitative ELISA capable of determining peptide - MHC class I interaction. Tissue Antigens. 2002;59:251–258. doi: 10.1034/j.1399-0039.2002.590402.x. [DOI] [PubMed] [Google Scholar]
- 37.O’Flanagan CH, et al. Dissociation of solid tumor tissues with cold active protease for single-cell RNA-seq minimizes conserved collagenase-associated stress responses. Genome Biol. 2019;20:210. doi: 10.1186/s13059-019-1830-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.van den Brink SC, et al. Single-cell sequencing reveals dissociation-induced gene expression in tissue subpopulations. Nat. Methods. 2017;14:935–936. doi: 10.1038/nmeth.4437. [DOI] [PubMed] [Google Scholar]
- 39.Li H, et al. Dysfunctional CD8 T cells form a proliferative, dynamically regulated compartment within human melanoma. Cell. 2020;181:747. doi: 10.1016/j.cell.2020.04.017. [DOI] [PubMed] [Google Scholar]
- 40.Stuart T, et al. Comprehensive integration of single-cell data. Cell. 2019;177:747–1902. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Huang M, et al. SAVER: gene expression recovery for single-cell RNA sequencing. Nat. Methods. 2018;15:539–542. doi: 10.1038/s41592-018-0033-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–883. doi: 10.1093/bioinformatics/bts034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–127. doi: 10.1093/biostatistics/kxj037. [DOI] [PubMed] [Google Scholar]
- 44.Hotelling, H. Relations Between Two Sets of Variates Vol. 28, 321–377 (Biometrika, 1936).
- 45.Härdle, W. K. & Simar, L. Applied Multivariate Statistical Analysis 1st edn (Springer-Verlag, 2003).
- 46.McCulloch, C. E., Searle, S. R. & Neuhaus, J. M. Generalized, Linear and Mixed Models 2nd edn (Wiley, 2008).
- 47.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B. 1995;57:289–300. [Google Scholar]
- 48.Belarif L, et al. IL-7 receptor blockade blunts antigen-specific memory T cell responses and chronic inflammation in primates. Nat. Commun. 2018;9:4483. doi: 10.1038/s41467-018-06804-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Bulk TCRVβ sequencing data generated by Adaptive Biotechnologies are available in the Adaptive Biotechnologies ImmuneACCESS repository under DOI 10.21417/JC2021N, at https://clients.adaptivebiotech.com/pub/caushi-2021-n. Bulk TCRVβ raw and processed sequencing data generated by the Sidney Kimmel Comprehensive Cancer Center FTIC are available in the Gene Expression Omnibus with accession number GSE173351. Raw scRNA-seq–TCR-seq data reported in this paper are available in the European Genome-phenome Archive under controlled access with accession number EGAS00001005343. Owing to the personal, sensitive and inherently identifying nature of raw genomic data, access to raw RNA-seq–TCR-seq data is controlled and full instructions to apply for data access can be found at https://ega-archive.org/access/data-access. Approvals will be granted immediately upon confirmation that all requirements are met. Processed and de-identified single-cell data are available in the Gene Expression Omnibus with accession number GSE176022.
Scripts to reproduce the analyses used in this study are available at https://github.com/BKI-immuno/neoantigen-specific-T-cells-NSCLC.