Abstract
Objective
The immune response to invasive carcinoma has been the focus of published work, but little is known about the adaptive immune response to bronchial premalignant lesions (PMLs), precursors of lung squamous cell carcinoma. This study was designed to characterize the T cell receptor (TCR) repertoire in PMLs and its association with clinical, pathological, and molecular features.
Methods
Endobronchial biopsies (n=295) and brushings (n=137) from high-risk subjects (n=50), undergoing lung cancer screening at approximately 1-year intervals via autofluorescence bronchoscopy and CT, were profiled by RNA-seq. We applied the TCR Repertoire Utilities for Solid Tissue/Tumor tool to the RNA-seq data to identify TCR CDR3 sequences across all samples. In the biopsies, we measured the correlation of TCR diversity with previously derived immune-associated PML transcriptional signatures and PML outcome. We also quantified the spatial and temporal distribution of shared and clonally expanded TCRs. Using the biopsies and brushes, the ratio of private (ie, found in one patient only) and public (ie, found in two or more patients) TCRs was quantified, and the CDR3 sequences were compared with those found in curated databases with known antigen specificities.
Results
We detected 39,303 unique TCR sequences across all samples. In PML biopsies, TCR diversity was negatively associated with a transcriptional signature of T cell mediated immune activation (p=4e-4) associated with PML outcome. Additionally, in lesions of the proliferative molecular subtype, TCR diversity was decreased in regressive versus progressive/persistent PMLs (p=0.045). Within each patient, TCRs were more likely to be shared between biopsies sampled at the same timepoint than biopsies sampled at the same anatomic location at different times. Clonally expanded TCRs, within a biopsied lesion, were more likely to be expanded at future time points than non-expanded clones. The majority of TCR sequences were found in a single sample, with only 3396 (8.6%) found in more than one sample and 1057 (2.7%) found in two or more patients (ie, public); however, when compared with a public database of CDR3 sequences, 4543 (11.6%) of TCRs were identified as public. TCRs with known antigen specificities were enriched among public TCRs (p<0.001).
Conclusions
Decreased TCR diversity may reflect nascent immune responses that contribute to PML elimination. Further studies are needed to explore the potential for immunoprevention of PMLs.
Keywords: lung neoplasms, t-lymphocytes, translational medical research
Introduction
Lung cancer is the second most common cancer in the USA and accounts for approximately one in four cancer deaths.1 Patients with lung cancer have poor overall survival, in part because the disease is diagnosed after it has already spread to distant sites.1 Squamous cell lung cancer, the second most common form of lung cancer, is thought to arise through accumulation of genomic and epigenomic alterations to the airway epithelium. These alterations lead to a series of histological changes, ranging from hyperplasia, metaplasia, several grades of dysplasia, carcinoma in situ and to invasive carcinoma.2 3 Persistent bronchial dysplasia has been associated with increased risk of incident invasive carcinoma, both at the site of the dysplasia and at other locations in the lung.3 4 Since squamous cell lung cancer arises through a series of bronchial premalignant lesions (PMLs) that can be evaluated by bronchoscopy, it has been proposed as a model for cancer chemoprevention and immunoprevention.5–8
By transcriptionally profiling endobronchial biopsies of PMLs obtained from high-risk smokers undergoing lung cancer screening, we have identified distinct molecular subtypes of squamous PMLs.9 A proliferative molecular subtype of squamous PML is enriched with bronchial dysplasia. Proliferative subtype PMLs that progressed or persisted were depleted of innate and adaptive immune cells and demonstrated decreased expression of interferon signaling and antigen presentation pathways. Additional studies have shown that high-grade or progressive/persistent PMLs are associated with an immunosuppressive microenvironment.10–12 These findings support the hypothesis that the spontaneous adaptive immune response to premalignancy is a mechanism controlling PML progression and that this might be therapeutically exploited for immunoprevention of cancer.
T cells are able to recognize aberrant peptides, including neoepitopes that are presented by tumor cells as a result of mutational events acquired during cancer initiation and progression. This endogenous T cell response to malignancy can be harnessed to mediate tumor regression using adoptive cell therapies13 or through immune checkpoint inhibition.14 Studies of non-small cell lung cancer (NSCLC) have shown that the T cell receptor (TCR) repertoire of NSCLC associates with the somatic genomic landscape.15 The TCR repertoire of NSCLC has also been suggested as a biomarker for predicting the response to immune checkpoint blockade.16
Previous studies have demonstrated that the TCR repertoire can be partially assembled from bulk RNA-sequencing (RNA-seq) data.17 18 To enhance our understanding of the immune response to squamous PMLs and the interplay between the adaptive immune response and progression of PMLs, we characterized, using bulk RNA-seq data, the T cell repertoire of squamous cell PMLs and its association with transcriptomic and clinical features. Our results indicate that regressive proliferative lesions have increased TCR repertoire clonality compared with persistent/progression PMLs.
Methods
Subject population and sample collection and processing
Endobronchial biopsies and brushings from high-risk subjects, undergoing lung cancer screening at approximately 1-year intervals via auto fluorescence bronchoscopy and CT, were profiled by RNA-seq as previously described (n=295 biopsies, n=137 brushes, and n=50 patients).9 Briefly, endobronchial biopsies were obtained from abnormal fluorescing airway epithelium, and brushings were obtained from normal appearing mainstem bronchi. RNA was extracted and sequenced, and the data were processed as previously described9 to quantitate gene expression levels.
TCR sequence assembly
We applied the TCR Repertoire Utilities for Solid Tissue/Tumor (TRUST; V.3.0) tool17 to identify TCR CDR3 sequences in the RNA-seq data, applying it in single end mode to reads aligned to hg19 via STAR19 (V.3.0.0) (bam files from our previously published work9). TCR sequences with less than six amino acids were excluded from the analysis. We also compared the performance of TRUST to MiXCR20 (V.2.1.10, using FASTQ files from our prior work9) in assembling CDR3 sequences. TCR sequences were assembled from both biopsy and brush samples, although only biopsies were used for diversity and TCR sharing and clonal expansion analyses. Eleven samples underwent targeted RNA TCR sequencing (IRepertoire, two replicates/sample) of the TCR alpha, beta, gamma and delta chains to compare TCR identification and TCR diversity metrics with those derived using RNA-seq data via TRUST (online supplemental table 1).
jitc-2021-002647supp001.pdf (347.8KB, pdf)
TCR repertoire diversity in PML biopsies
In the bulk RNA-seq data, TCR diversity was measured as the number of unique clonotypes per 1000 TCR reads, previously described as clonotypes per kilo-reads (CPK).18 CPK was chosen over Shannon entropy because it was not strongly correlated with total TCR reads in the bulk RNA-seq data using a linear mixed effects model adjusting for patient. For the targeted TCR sequencing data, where we achieved much higher TCR-read depth, diversity was measured as Shannon entropy. Spearman correlation was used to compare total TCR reads, and TCR diversity between samples where TCR repertoires were measured by bulk RNA-seq and targeted TCR sequencing.
We used a linear mixed effects models adjusting for patient as the random effect to evaluate the association between CPK and the expression of an immune-associated gene set we previously identified and found to be associated with squamous PML regression.9 The association between CPK and proliferative subtype PML progression/persistence versus regression was evaluated with a two-sided Mann-Whitney U test. This test was used instead of a linear mixed effects model adjusting for patient as a random effect due to lack of repeated measures for all patients in this smaller subset of samples.
TCR repertoire sharing and clonal expansion in PML biopsies
We compared the overlap of TCR repertoires in biopsies: (1) between PMLs from different subjects, (2) within the same subject between PMLs acquired during the same procedure at different anatomic locations, and (3) within the same subject between PMLs acquired at the same anatomic location across different procedures. Fisher’s exact test was used to identify pairs of samples enriched in shared TCRs, comparing the number of CDR3 sequences shared and unique, against a background of all 39,303 TCRs. The mean odds ratios (ORs) of these Fisher’s exact tests grouped by shared location, time, and subject were compared using the Tukey test.
To identify TCRs undergoing clonal expansion, we normalized counts of all TCRs to counts per thousand reads within each sample and defined expanded TCRs as those with normalized counts more than three SD greater than the mean calculated across all samples. Samples with fewer than 250 reads total were removed from this analysis, as low total counts in a sample resulted in inflated normalized counts for individual TCRs. With a threshold of 250 reads, there was no longer a significant linear association between total reads and number of expanded clones (p>0.05 Pearson). In order to identify whether the same TCR sequences were repeatedly expanded more often than expected by chance, we conducted a permutation test. We shuffled ‘expanded’ versus ‘not expanded’ labels of TCRs present 1000 times, then compared how often the same TCR sequence was expanded at multiple timepoints at each location within each patient.
Identification of public and private TCRs
TCRs from both biopsies and brushes were defined as public if they were found in more than one patient or, for TCR beta, if they were also found in the peripheral blood of a previously described cohort of 666 healthy donors.21 They were otherwise defined as private. While most of this work focused on biopsy samples, brushes were included in this analysis in order to more accurately identify public sequences. We also quantified the proportion of private and public that were found in publicly available databases, McPAS-TCR22 and VDJdb23, which contain antigen-specific TCR sequences (both databases accessed August 8, 2019). In order to determine if more public versus private TCRs overlapped with TCR sequences found in the queried databases, we used Fisher’s exact test.
Results
TCRs can be accurately assembled from bulk RNA-seq data in airway biopsies and brushes
We detected a total of 39,303 unique TCR sequences of at least six amino acids in length using the TRUST tool and RNA-seq from 294 biopsies of PMLs (one biopsy did not generate TCR sequences) and 137 brushings of normal appearing airway epithelium from 50 subjects at high-risk for lung cancer. The TRUST TCR sequences were used for further analyses as MiXCR assembled fewer sequences (n=31,208 unique TCR sequences), the sequences from both methods were highly overlapping (n=17,751 (45%) TCRs were shared, p<1.5e-21 by hypergeometric test), and the methods produced similar results. Eleven representative biopsy samples underwent targeted RNA sequencing of the TCR loci, yielding an average of 585,589 TCR reads per sample (range 29,076–1,489,423) and 1618 unique CDR3 sequences per sample (range 462–2772). Both the total number of TCR reads identified and TCR diversity were significantly correlated across bulk and targeted sequencing results (rho=0.70, p=0.03 and rho=0.68, p=0.029, respectively (online supplemental figure 1). The TCR sequences from the bulk RNA-seq data and the targeted TCR-seq data were highly overlapping with an average of 21.9% of TCRs identified from bulk RNA-seq also being identified in targeted sequencing of the same sample, compared with 5.4% of bulk TCRs being identified in targeted sequencing from different samples (p=6.05e-6, Mann-Whitney U test). Additionally, the per cent of shared TCR sequences between the targeted sequencing and bulk samples was as least as high as the per cent of shared TCR sequences between replicates of targeted sequencing, regardless of the method used to assemble the TCR sequences (online supplemental figure 2). Based on these results, the TCR sequences assembled from the bulk RNA-seq data are representative of TCR sequences obtained from targeted TCR RNA sequencing.
TCR diversity is associated with progression/persistence in proliferative PML biopsies
We quantified TCR diversity using CPK, because it was least correlated with total TCR reads (p=0.002, linear mixed effects model) in comparison with other commonly used TCR diversity metrics (eg, Shannon entropy and Simpson diversity index) (p<=5e-10, linear mixed effects model) (online supplemental figure 3). In our prior work9, we identified modules of gene coexpression that we used to define PML molecular subtypes. Two of these nine modules are enriched for genes involved with immune functions. We found CPK to be negatively correlated with one of the immune modules that is associated with interferon signaling and antigen processing and presentation (p=4.0e-4, linear mixed effects model) (figure 1A, online supplemental figure 4A, C). In other words, decreased TCR diversity is associated with higher expression of genes in this interferon-related immune module across all biopsies. Interestingly, we previously reported that high expression of the genes in this module is associated with regression within PMLs in the previously defined proliferative molecular subtype that is enriched for bronchial dysplasia. Here, we find that TCR diversity (CPK) was significantly decreased in the proliferative PMLs that regressed versus PMLs that persisted/progressed (28.97 vs 35.57, p=0.045, two-sided Mann-Whitney U test, n=50 samples, n=23 patients, figure 1B, online supplemental figure 4B, D). These results indicate that increased clonality may be important in lesion regression.
TCR clone sharing and clonal expansion is increased within patient PMLs
On average across the subjects, endobronchial biopsies were obtained from different anatomic locations during multiple screening bronchoscopies. Out of the 34,251 CDR3 sequences identified in biopsy samples, 2494 (7.3%) were present in more than one biopsy in our cohort. In order to identify patterns in CDR3 sharing between biopsies, we identified enrichment of shared CDR3 sequences for all possible pairs of PML biopsies using Fisher’s exact test. Only 0.01% of pairs of samples from different patients showed significant enrichment (at False Discovery Rate (FDR) <0.01) of shared CDR3 sequences. Among samples from the same patients, we detected higher TCR clone sharing among samples found at different anatomic locations at identical time points (61.9% of pairs enriched at FDR <0.01, mean OR=26.8) and among samples taken from the same anatomic location at separate time points (55.0% enriched at FDR <0.01, mean OR=23.5) versus samples at different locations and time points (50% enriched at FDR <0.01, mean OR=20.3) (table 1). The mean ORs were significantly different between these groups (Analysis of variance with Tukey’s post hoc, FDR <0.01, figure 2).
Table 1.
Factors shared | Non-enriched pairs | Enriched pairs | Percent enriched (%) | Mean OR | ||
Patient | Location | Time | ||||
83,004 | 8 | 0.009 | 0.214 | |||
× | 736 | 746 | 50.3 | 3.69 | ||
× | × | 206 | 252 | 55.0 | 3.90 | |
× | × | 230 | 374 | 61.9 | 4.09 |
Percent of pairs of samples enriched for shared sequences, grouped by shared location, shared time, and shared patient. Enrichment was determined by a Fisher’s exact test, FDR cut-off of 0.01 and OR >1.
FDR, False Discovery Rate; OR, Odds Ratio.
Among the 74 lesions with at least 250 total TCR counts, we identified 344 TCRs likely to have undergone clonal expansion in PML biopsies based on increased normalized abundance three SD above the mean across all samples. Among TCRs that were only found in a single sample, 3.1% of TCRs were expanded, significantly less than in TCRs that were identified in multiple samples within the same patient (10.3%, p<0.001; OR=3.607 Fisher’s test). Furthermore, we identified 10 TCRs that were expanded at two time points in the same lesion at the same location, more than would be expected if all TCRs recurring in a patient were equally likely to be expanded (p<0.001) (online supplemental tables 2 and 3). The 10 TCRs identified were found in samples with mostly low-grade histology (72% metaplasia or lower), and in all cases, the histology at the second time point was the same or lower than in the first sample. The results indicate that TCRs can persist in an expanded state for years in the same lesion and may contribute to prevention of progression.
Public TCRs commonly target infectious antigens
Among all TCRs identified in both brushes and biopsies, 4543 (11.6%) were considered public, meaning shared between multiple patients in our cohort or, for TCR beta sequences, identified in a previously described cohort of 666 people.21 The proportion of public TCRs was higher among clones expanded in biopsies, at 26.1% (Fisher’s p=1.1e-10, OR 2.36) indicating that these may target an active infection. However, among expanded TCRs at multiple time points in the same lesion at the same location only 1 out of 10 (10%) was a public TCRs. Removal of public TCRs did not change the aforementioned results examining clone sharing between time points and location as well as the associations between TCR diversity and the expression of interferon-associated genes or lesion progression/regression status. The public TCRs were more commonly found in the VDJdb and McPAS-TCR databases than non-public (private) TCRs (8.69% and 1.53%, p<0.001, Fisher’s exact test, OR 6.12). The VDJdb and McPAS-TCR databases predominantly annotate TCRs targeting infectious antigens. TCRs targeting common viral pathogens, including predominantly influenza, cytomegalovirus and Epstein Barr Virus, but also yellow fever, hepatitis C and HIV, were detected. Few TCRs were annotated as recognizing bone marrow stromal antigen 2 (BST2) and melanoma antigen recognized by T cells 1 (MART-1). As expected, the identification of a public TCR is more likely to represent a TCR targeting an infectious antigen than a cancer-specific antigen, although our analysis is limited by the sequences annotated in the VDJdb and McPAS-TCR databases.
Discussion
Lung squamous cell carcinoma arises in the epithelial layer of the bronchial airways and is often preceded by the development of bronchial PMLs. Bronchial PMLs have variable outcome and biomarkers of PML progression, and therapies to intercept lung cancer development at this early stage are needed. We have previously reported the identification of four molecular subtypes of PMLs based on bulk RNA-seq profiling of endobronchial biopsies and brushes from high-risk smokers undergoing lung cancer screening.9 The proliferative subtype was enriched with bronchial dysplasia, and progressive/persistent PMLs in this subtype showed decreased expression of interferon signaling and antigen processing/presentation pathways compared with regressive lesions. In this study, we leverage the bulk RNA-seq data from this prior work to characterize the TCR repertoire of lung squamous PMLs and its association with PML progression.
Given that bulk RNA sequencing is a relatively insensitive method of characterizing the TCR repertoire, capturing only a fraction of the TCRs, we confirmed on a subset of samples that total TCR reads and TCR diversity was significantly correlated between TCRs derived from bulk RNA-seq versus targeted TCR-seq data. Using the TCRs derived from bulk RNA-seq and the gene expression data, we found that in the biopsies, a decrease in TCR diversity or an increase in TCR clonal selection is significantly associated with a metagene score for the gene signature associated with interferon and antigen processing and presentation pathways. Interestingly, we also found increased TCR clonal selection in regressive versus progressive/persistent lesions belonging to the proliferative subtype, although a larger sample size is needed to properly adjust for patient effects. These results suggest that regressive high-grade lesions are recognizing neoantigens, and further single T cell sequencing and functional studies are needed to evaluate the utility of these for immunoprevention of lung squamous cell carcinoma.
Since the biopsies in this study were serially collected at different anatomic locations in the lung, we were also able to quantify the amount of TCR clone sharing between or within patients as a function of space and time. We detected a high degree of clone sharing between biopsies from the same patients and the degree of sharing was higher for samples obtained from the same time point versus the same anatomic location at separate time points. Shared TCRs within a patient were more likely to be expanded than TCRs found in a single sample. We observed 10 TCRs that were expanded at multiple time points in the same lesion at the same location suggesting the potential existence of lesion-specific neoantigens. Additionally, our results show that TCR repertoires significantly overlap in different areas of the airway, suggesting a field immunity effect may exist concurrently with lesion-specific immunity. This finding parallels our prior work showing field-level transcriptomic changes associated with PML and lung cancer development in distant, normal-appearing tissue from the nasal turbinate and the mainstem bronchus.24–26 Currently, however, we do not have evidence that these shared or expanded TCR clones are targeting PML-related antigens and are not the result of subclinical infection. Additionally, our results are challenged by limited sequencing depth, non-uniform sampling of the lung and the small sample size of proliferative lesions. In the future, TCRs detected in regressing lesions that may mediate PML regression need to be functionally characterized. Additionally, further comprehensive characterization of ubiquitous and regional TCR repertoires in the context of lung preneoplastic lesions is needed as has been done for NSCLC.15 27
In order to further characterize the TCR repertoire, we classified TCRs as public (found in more than one patient) and private (found in only one patient). Public TCRs were more likely to be clonally expanded than TCR sequences found only in one patient. Additionally, they were more likely to have known infectious antigen specificity in our dataset, consistent with previous evidence of shared TCRs in response to infectious agents.21 The enrichment of TCRs with known antigenic specificities among the public TCR clones could be explained by the preponderance of infectious epitopes in the existing databases of TCR sequences and their antigenic specificity. TCRs targeting tumor neoantigens are less likely to be shared among patients because individual patients and different PMLs carry distinct mutations that can be presented in differing human leukocyte antigen (HLA) contexts. There are exceptions, however, as public TCRs that target a single neoantigen have been demonstrated in response to the common KRAS exon 12 G12D mutation in colorectal cancer and can be leveraged for adoptive T cell therapy.28 In future studies, it will be important to distinguish between immune responses to infections and cancer development and to identify neoantigen recognition in specific HLA contexts as this may be relevant for immunoprevention.29
Conclusion
Our study provides TCR characterization of bronchial lesions that are precursors of lung squamous cell carcinoma suggesting that TCR diversity measured via RNA may help predict the host immune response at the earliest stages of cancer development and may, in the future, aid in the development of immunoprevention strategies for lung cancer.
Acknowledgments
We would like to thank David L. Coleman MD, Gopal K. Yadavalli MD and Ashish Upadhyay MD from Boston University School of Medicine for their mentorship and support for AM.
Footnotes
AM and CM contributed equally.
Contributors: Study conception and design (JB, AM, and MEL); collection of clinical samples (MER); sample processing for bulk RNA sequencing (GL, SZ, and HL); sample and clinical data management (AP and ACG); bulk RNA sequencing pipeline to generate alignments and gene expression (YK and JDC); TCR read generation and data analyses (AM, CM, and JB); writing of the manuscript (JB, AM, and CM); editing of the manuscript (JB, AM, CM, JDC, SAM, MEL, and AS).
Funding: This study was supported by funding from a Janssen Research and Development sponsored research agreement (PIs: Jennifer Beane, Sarah A. Mazzilli, and Joshua D. Campbell) and by a Stand Up To Cancer-LUNGevity-American Lung Association Lung Cancer Interception Dream Team Translational Cancer Research Grant (grant number: SU2C-AACR-DT23-17 to Steven M. Dubinett and Avrum E. Spira). Stand Up To Cancer is a division of the Entertainment Industry Foundation. Research grants are administered by the American Association for Cancer Research, the scientific partner of SU2C.
Competing interests: JB, SAM, JDC, MER, AS and MEL received commercial research grants from Janssen Pharmaceuticals. MEL is a consultant to Veracyte. AS and CS are employed by Johnson and Johnson.
Provenance and peer review: Not commissioned; externally peer reviewed.
Supplemental material: This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
Data availability statement
Data are available in a public, open access repository. RNA sequencing data from human endobronchial biopsies and brushings is deposited in the NCBI Gene Expression Omnibus under accession code GSE109743 [http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109743]. RNA sequencing data from human endobronchial biopsies and brushings has been previously deposited in the NCBI Gene Expression Omnibus under accession code GSE109743 [http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109743].
Ethics statements
Patient consent for publication
Not required.
Ethics approval
The Institutional Review Boards at Boston University Medical Center and Roswell Comprehensive Cancer Center approved the study, and all subjects provided written informed consent.
References
- 1.Siegel RL, Miller KD, Fuchs HE, et al. Cancer statistics, 2021. CA Cancer J Clin 2021;71:7–33. 10.3322/caac.21654 [DOI] [PubMed] [Google Scholar]
- 2.Keith RL, Miller YE. Lung cancer chemoprevention: current status and future prospects. Nat Rev Clin Oncol 2013;10:334–43. 10.1038/nrclinonc.2013.64 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Merrick DT, Gao D, Miller YE, et al. Persistence of bronchial dysplasia is associated with development of invasive squamous cell carcinoma. Cancer Prev Res 2016;9:96–104. 10.1158/1940-6207.CAPR-15-0305 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.van Boerdonk RAA, Smesseim I, Heideman DAM, et al. Close surveillance with long-term follow-up of subjects with preinvasive endobronchial lesions. Am J Respir Crit Care Med 2015;192:1483–9. 10.1164/rccm.201504-0822OC [DOI] [PubMed] [Google Scholar]
- 5.Spira A, Yurgelun MB, Alexandrov L, et al. Precancer atlas to drive precision prevention trials. Cancer Res 2017;77:1510–41. 10.1158/0008-5472.CAN-16-2346 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Spira A, Disis ML, Schiller JT, et al. Leveraging premalignant biology for immune-based cancer prevention. Proc Natl Acad Sci U S A 2016;113:10750–8. 10.1073/pnas.1608077113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Campbell JD, Mazzilli SA, Reid ME, et al. The case for a pre-cancer genome atlas (PCGA). Cancer Prev Res 2016;9:119–24. 10.1158/1940-6207.CAPR-16-0024 [DOI] [PubMed] [Google Scholar]
- 8.Beane J, Campbell JD, Lel J, et al. Genomic approaches to accelerate cancer interception. Lancet Oncol 2017;18:e494–502. 10.1016/S1470-2045(17)30373-X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Beane JE, Mazzilli SA, Campbell JD, et al. Molecular subtyping reveals immune alterations associated with progression of bronchial premalignant lesions. Nat Commun 2019;10:1856. 10.1038/s41467-019-09834-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mascaux C, Angelova M, Vasaturo A, et al. Immune evasion before tumour invasion in early lung squamous carcinogenesis. Nature 2019;571:570–5. 10.1038/s41586-019-1330-0 [DOI] [PubMed] [Google Scholar]
- 11.Merrick DT, Edwards MG, Franklin WA, et al. Altered cell-cycle control, inflammation, and adhesion in high-risk persistent bronchial dysplasia. Cancer Res 2018;78:4971–83. 10.1158/0008-5472.CAN-17-3822 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pennycuick A, Teixeira VH, AbdulJabbar K, et al. Immune surveillance in clinical regression of preinvasive squamous cell lung cancer. Cancer Discov 2020;10:1489–99. 10.1158/2159-8290.CD-19-1366 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Restifo NP, Dudley ME, Rosenberg SA. Adoptive immunotherapy for cancer: harnessing the T cell response. Nat Rev Immunol 2012;12:269–81. 10.1038/nri3191 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wei SC, Duffy CR, Allison JP. Fundamental mechanisms of immune checkpoint blockade therapy. Cancer Discov 2018;8:1069–86. 10.1158/2159-8290.CD-18-0367 [DOI] [PubMed] [Google Scholar]
- 15.Joshi K, de Massy MR, Ismail M, et al. Spatial heterogeneity of the T cell receptor repertoire reflects the mutational landscape in lung cancer. Nat Med 2019;25:1549–59. 10.1038/s41591-019-0592-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gettinger SN, Choi J, Mani N, et al. A dormant TIL phenotype defines non-small cell lung carcinomas sensitive to immune checkpoint blockers. Nat Commun 2018;9:3196–15. 10.1038/s41467-018-05032-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Li B, Li T, Wang B, et al. Ultrasensitive detection of TCR hypervariable-region sequences in solid-tissue RNA-Seq data. Nat Genet 2017;49:482–3. 10.1038/ng.3820 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Li B, Li T, Pignon J-C, et al. Landscape of tumor-infiltrating T cell repertoire of human cancers. Nat Genet 2016;48:725–32. 10.1038/ng.3581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Dobin A, Davis CA, Schlesinger F, et al. Star: ultrafast universal RNA-seq aligner. Bioinformatics 2013;29:15–21. 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bolotin DA, Poslavsky S, Mitrophanov I, et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat Methods 2015;12:380–1. 10.1038/nmeth.3364 [DOI] [PubMed] [Google Scholar]
- 21.Emerson RO, DeWitt WS, Vignali M, et al. Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire. Nat Genet 2017;49:659–65. 10.1038/ng.3822 [DOI] [PubMed] [Google Scholar]
- 22.Tickotsky N, Sagiv T, Prilusky J, et al. McPAS-TCR: a manually curated Catalogue of pathology-associated T cell receptor sequences. Bioinformatics 2017;33:2924–9. 10.1093/bioinformatics/btx286 [DOI] [PubMed] [Google Scholar]
- 23.Shugay M, Bagaev DV, Zvyagin IV, et al. VDJdb: a curated database of T-cell receptor sequences with known antigen specificity. Nucleic Acids Res 2018;46:D419–27. 10.1093/nar/gkx760 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Spira A, Beane JE, Shah V, et al. Airway epithelial gene expression in the diagnostic evaluation of smokers with suspect lung cancer. Nat Med 2007;13:361–6. 10.1038/nm1556 [DOI] [PubMed] [Google Scholar]
- 25.Beane J, Mazzilli SA, Tassinari AM, et al. Detecting the presence and progression of premalignant lung lesions via airway gene expression. Clin Cancer Res 2017;23:5091–100. 10.1158/1078-0432.CCR-16-2540 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.AEGIS Study Team . Shared gene expression alterations in nasal and bronchial epithelium for lung cancer detection. J Natl Cancer Inst 2017;109. 10.1093/jnci/djw327. [Epub ahead of print: 01 Jul 2017]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Reuben A, Zhang J, Chiou S-H, et al. Comprehensive T cell repertoire characterization of non-small cell lung cancer. Nat Commun 2020;11:603–13. 10.1038/s41467-019-14273-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Rosenberg SA, Tran E, Robbins PF. T-Cell transfer therapy targeting mutant KRAS. N Engl J Med 2017;376:e11. [PMC free article] [PubMed] [Google Scholar]
- 29.Maoz A, Rennert G, Gruber SB. T-Cell transfer therapy targeting mutant KRAS. N Engl J Med 2017;376:e11. 10.1056/NEJMc1616637 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
jitc-2021-002647supp001.pdf (347.8KB, pdf)
Data Availability Statement
Data are available in a public, open access repository. RNA sequencing data from human endobronchial biopsies and brushings is deposited in the NCBI Gene Expression Omnibus under accession code GSE109743 [http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109743]. RNA sequencing data from human endobronchial biopsies and brushings has been previously deposited in the NCBI Gene Expression Omnibus under accession code GSE109743 [http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109743].