Abstract
Bronchial premalignant lesions (PMLs), precursors of lung squamous cell carcinoma, have distinct molecular subtypes. The proliferative subtype, enriched with bronchial dysplasia, had decreased expression of an antigen processing/presentation gene co-expression module in progressive/persistent versus regressive PMLs, suggesting a functional impact of these genes on immune evasion. Here, we performed miRNA sequencing, miRNA in situ hybridization (ISH), and spatial proteomics of bronchial biopsies from patients at high risk for lung cancer. A miRNA-gene network analysis identified hsa-miR-149–5p as a potential regulator of the antigen presentation gene module. Staining on adjacent biopsy tissue showed that hsa-miR-149–5p was predominantly expressed in the epithelium and up-regulated in progressive/persistent proliferative lesions. Targets of this miRNA, the transcriptional coactivator of MHC-I gene expression, NLRC5, and the genes it regulates were down-regulated in these lesions. Decreased NLRC5 expression reduced both IFN-γ induced MHC-I surface expression and CD8+ T cell cytotoxicity in lung squamous cancer cells. In PMLs, basal cells with high levels of NLRC5 were in close spatial proximity to CD8+ T cells, suggesting that these cells exhibit increased functional MHC-I gene expression in vivo. These findings indicate a functional role for hsa-miR-149–5p in PML progression/persistence and suggest this axis as a potential therapeutic target for PML immunomodulation.
INTRODUCTION
Lung squamous cell carcinoma (LUSC) arises in the bronchial airway and makes up approximately 25% of all lung cancers, the leading cause of cancer-related deaths in the United States and worldwide (1,2). LUSC is thought to progress from bronchial premalignant lesions (PMLs) that are characterized by aberrant epithelial architecture and abnormal expansion of airway basal cells. The pathologic grading of PMLs has been well documented (3) and describes progressive epithelial changes including hyperplasia, metaplasia, dysplasia (mild, moderate, and severe), and carcinoma in situ (CIS). As the normal bronchial epithelium is damaged, the differentiated pseudostratified columnar epithelium becomes hyperplastic with the proliferation of basal like cells, or metaplastic with the expansion of squamous cells, and worsens to dysplasia and carcinoma in situ with increased cytologic aberrations including nuclear pleomorphism, increased mitotic figures, and altered cellular differentiation (3–6). PMLs that progress from normal, hyperplasia or metaplasia to a high-grade PML (moderate/severe dysplasia or CIS) or high-grade PMLs that persists over time are associated with a higher risk for lung cancer development at the site of the PML or elsewhere in the lung (4,7,8). Only a small fraction of high-grade PMLs, however, progress to CIS or invasive cancer and more than half regress to a lower histologic grade spontaneously without any intervention (4,5). DNA and methylation profiling of CIS epithelium revealed increased chromosomal instability and somatic mutations in lesions that progress to LUSC, but that spontaneous regression of CIS can occur in cases with known cancer driver mutations (9). These results suggest that alterations in immune surveillance may be a key determinant of progression to invasive carcinoma.
To gain insight into progression mechanisms, our group and others have profiled the transcriptomes of PMLs biopsied from patients at high-risk for developing lung cancer over time and observed that PML progression is associated with suppression of immune surveillance via multiple mechanisms including impairment of antigen processing and presentation and interferon gamma signaling, increased expression of immune checkpoints and cytokines inhibiting T cell activation, and by promoting the recruitment of myeloid derived tumor suppressor cells (6,10–12). Our studies have identified a transcriptionally distinct group of PMLs (“proliferative subtype”), based on nine gene co-expression modules, that express high levels of cell cycle and proliferation genes, are enriched with bronchial dysplasia, and have high basal cell and low ciliated cell composition. Persistent or progressive lesions transcriptionally classified as proliferative subtype PMLs have reduced numbers of CD8 T cells and decreased expression of a co-expression module involved in interferon signaling and antigen processing and presentation, termed the antigen presentation gene module, compared with regressive PMLs. Despite the multiple reports of immune evasion in progressive high-grade lesions, little is known about what drives immune-related gene regulation.
In the absence of T cell exhaustion in our cohort, we hypothesized that microRNAs (miRNAs) may be regulating the expression of genes in the antigen presentation gene module associated with PML progression. miRNAs are short non-coding RNAs that suppress gene expression and facilitate transcript degradation through complementary base-pair binding to the 3’ untranslated region of gene transcripts (13). One study profiling 14 miRNAs observed hsa-miR-34c to be associated with PML histologic changes (14); however, miRNA expression alterations contributing to bronchial PML progression have not been reported. To address this question, we used sample matched mRNA and miRNA sequencing data from longitudinally collected PML biopsies to probe if miRNA-mediated gene regulation leads to PML progression associated with an immunosuppressive microenvironment. We identified hsa-miR-149–5p as a potential regulator of the antigen presentation gene module associated with proliferative subtype lesion progression. We found hsa-miR-149–5p to be highly expressed in the epithelium, up-regulated in progressing/persistent compared to regressing proliferative subtype PMLs, and a regulator of major histocompatibility complex class I (MHC-I) and antigen processing genes via NLRC5 that influences recruitment of CD8 T cells. Our results suggest that hsa-miR-149–5p may be a potential therapeutic target for preventing PML progression to lung cancer.
MATERIALS AND METHODS
Sample collection.
The patient enrollment and sample collection procedures were described previously (10). The study was conducted with approval from the Roswell Park Comprehensive Cancer Center and the Boston University School of Medicine Institutional Review Boards which adhere to the ethical principles and guidelines delineated in the Belmont Report. All subjects provided written informed consent. Briefly, endobronchial biopsies and bronchial brushings were collected via autofluorescence bronchoscopy from high-risk individuals in a low-dose computed tomography lung cancer screening program at approximately 1-year intervals. Biopsy outcome status was defined as previously described (10) where the outcome status of each biopsy (progressive/persistent, regressive, normal stable, or unknown) was based on the worst histology recorded at the same anatomic location in future procedures. During the bronchoscopy, two biopsy samples were obtained from each sampled lung anatomic location. One biopsy was formalin fixed and paraffin embedded (FFPE) and used for histological evaluation, miRNA in situ hybridization (miR-ISH), and image mass cytometry (IMC) assays, and the other biopsy was frozen and used for RNA isolation and mRNA and miRNA sequencing. Bronchial brushes were collected at each bronchoscopy from a normal appearing area of the right or left mainstem bronchus. In our prior study (10), patients were divided into a Discovery (n=197 biopsies, n=91 brushes, n=30 patients) and Validation Set (n=111 biopsies, n=49 brushes, n=20 patients) based on time of initial sample collection. In this study, Discovery cohort samples with sufficient miRNA underwent miRNA library preparation and sequencing (167 biopsies and 91 brushes, n=30 patients). Additional adjacent fixed samples (19 biopsies, n=9 patients) were subject to detailed H&E annotation, miR-ISH and IMC assays (Figure 1A - 1E).
Figure 1. Study overview.

(A) RNA isolated from 156 frozen endobronchial biopsies (n=28 patients) was profiled by miRNA sequencing. These samples also had paired mRNA sequencing data. Among the 156 biopsies, 19 adjacent FFPE biopsies, transcriptionally defined as the proliferative molecular subtype, were sectioned and stained to assess spatial localization of key findings. (B) The paired miRNA-mRNA sequencing data was used to construct a miRNA-gene network based on database evidence of miRNA target genes and significant negative correlation between miRNA and gene expression patterns (gray circles are miRNAs and red, green, and yellow circles are genes where the color represents their membership in previously described gene co-expression modules that define PML molecular subtype). Target information was obtained by combining a sequence-based prediction database and experimentally validated databases. Pearson correlation was calculated from expression residuals for each miRNA-mRNA pair, and those with significant negative Pearson correlation coefficients (FDR<=0.05) were selected. Next, we filtered edges to identify miRNAs that may have central gene expression regulatory roles within the miRNA-gene network in each gene module. For each gene module, we identified and retained only the connections from miRNAs where 1) there are more significantly negative correlated target genes in the module than in other modules (Fisher-exact test, odds ratio>1) and 2) the Fisher-Z transformed Pearson correlation coefficient densities with its targets within the module were more negative than its targets in other modules (t-test, t < 0). miRNAs with significant results from both tests (FDR <= 0.05) were considered module associated miRNAs and other miRNAs were removed from the network. (C, D and E) For each FFPE biopsy, three consecutive sections were stained for different purposes: (C) H&E to annotate the histologic grades present in the epithelial tissue, (D) chromogenic miRNA in situ hybridization (miRNA-ISH) to quantify hsa-miR-149–5p expression and localization, and (E) imaging mass cytometry (IMC) using 28 metal-tagged antibodies to identify epithelial and immune cell types and hsa-miR-149–5p target protein expression and localization. Single or multiple regions of interest (ROI) per sample were selected for IMC ablation in 14 samples. The epithelial histology in both miRNA-ISH and IMC images was annotated based on the corresponding adjacent H&E section. The schematic was constructed using data images from Figure 4. “Figure 1. Study overview” created in BioRender. Chiu, D. (https://BioRender.com/g78j632) is licensed under CC BY 4.0.
miRNA library preparation, sequencing, and data pre-processing.
miRNA was extracted from biopsies or brushing samples using the miRNeasy Mini kit (Qiagen) according to the manufacturer’s instructions (10). miRNA sequencing libraries were prepared using NEBNext Multiplex Small RNA Library Prep Set for Illumina (NEBioLabs). Sequencing adapters that target the 3’ hydroxyl group of small RNAs were ligated and the transcripts were reverse transcribed and PCR-amplified into single-stranded cDNA libraries. The libraries were pooled, and size selected for small RNA fragments on PAGE gel in groups of 6–10. The libraries were sequenced on Illumina® HiSeq 2500 to generate more than 10 million single-read 35–36 nucleotide (nt) reads per sample (15).
De-multiplexing and generation of FASTQ files were performed using Illumina CASAVA v1.8.2. FastQC v0.11.7 (RRID: SCR_014583) was used to examine the quality of raw reads and cutadapt v1.18 was used for trimming the sequencing adaptors (5’- AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC-3’, -trim-n) (16,17). Reads unlikely to be properly sequenced mature miRNA transcripts were discarded (reads shorter than 16 nt or longer than 25 nt with –m 16 –M 25 in cutadapt). Alignment and quantification of mature miRNAs based on miRBase v22 (RRID: SCR_003152) were performed with miRDeep2 v0.1.0 (18,19) with mapper.pl parameters –e –h –m –i –j and quantifier.pl parameter –k –j –d -P. For mature miRNAs associated with multiple precursor miRNAs, the max counts were used.
We performed miRNA-seq sample quality assessments and miRNA filtering separately for each sample type (endobronchial biopsies and bronchial brushes). We limited the analysis to the miRNA samples with previously analyzed (GSE109743, dbGaP:phs003185.v1.p1) matched mRNA expression profiles (n=160 biopsies, n=89 brushes) (10). The raw counts were converted into log2 counts per million of miRNA reads mapped (logCPM) and quantile normalized with voom (20). miRNAs with logCPM less than 1 in more than half of the samples were removed. We excluded samples (n=4 biopsies, n=2 brushes) if sequencing quality metrics were outside 2 standard deviations from the mean. The quality metrics we used included the first and the second principal components from the PCA using filtered miRNA expression, the mean Pearson expression correlation with all other samples across the filtered miRNAs, and the transcript integrity number (TIN) of the matched mRNA sequencing sample (calculated using RSeQC v3.0.0 (21), RRID: SCR_005275). Finally, 156 biopsies and 87 brush samples from 30 patients were retained. The gene filter and normalization described above were performed on the final high-quality samples with matching mRNA data (n=525 miRNAs in biopsies and n=489 miRNAs in brushes). The residual miRNA expressions adjusting for sequencing batch were computed with limma (22) (RRID: SCR_010943) for downstream analysis.
miRNA-gene network construction to identify miRNAs associated with gene co-expression modules that define PML molecular subtypes.
To capture the association of miRNAs with the previously described (10) 9 co-expression gene modules within biopsy samples, we first constructed a miRNA-gene regulatory network focused on the module genes (n=3936; Supplementary Table S1) (23). For this network, we utilized both the predicted gene targets for a miRNA and the correlation between a miRNA and its targets (Figure 1B). The predicted gene targets for each miRNA were defined as those identified by any of the three miRNA target databases, including TargetScan v7.2, starBase, and miRTarBase v7.0(24–26). Between each miRNA and its predicted target gene, the Pearson correlation coefficient was calculated using the residual expression values, of miRNA (described above) and gene (adjusted for median TIN and batch as in Beane et al (10)), and the significance level was adjusted using false discovery rate (FDR). Only the edges with significant and negative Pearson correlation coefficients (FDR<=0.05) between miRNA and predicted target genes were retained in the network.
After constructing the miRNA-gene regulatory network, we sought to retain only connections where miRNAs had regulatory relationships with a gene module that were stronger and more specific than with other gene modules (Figure 1B). For each miRNA, we first performed Fisher-Z transformation on the Pearson correlation coefficient density of a miRNA with its targets and compared to the density with its targets within the module to the density with targets in other modules using a two-sample or one-sample (if only one target is within a group) t-test to select miRNAs with strong negative relationships to modules. We also used Fisher-exact tests to determine whether a miRNA had more significantly negative-correlated target genes in the module of interest than in other modules (odds ratio > 1). The results of both tests were adjusted using FDR and miRNAs with significant results for the same gene module from both tests (FDR<=0.05) were identified as specifically regulating that module. Any connection between a miRNA and predicted target genes of a module that did not pass both tests was removed. The final network can be viewed as a filtered subset of the full miRNA-gene regulatory network that captures the potential regulatory relationship between miRNAs and gene modules (Figure 2A). Each miRNA in the network was assigned to its associated one or more modules (module-associated miRNA). Summary scores were computed based on miRNAs connected to each gene module (27) using Gene-set Variation Analysis (GSVA).
Figure 2. miRNA-gene network analysis identifies miRNAs associated with the antigen presentation gene module down-regulated in progressive proliferative subtype PMLs.

(A) miRNA-gene module network plot shows 321 miRNAs connected to 9 gene modules where grey circles are miRNAs and colored circles are the gene modules. Edges reflect a miRNA may have strong suppressive regulatory role over the predicted target genes within the connected gene module based on the two statistical test filters. (B) Pearson correlation densities between the module associated miRNAs connected to each gene module in the miRNA-gene module network and the corresponding gene module GSVA score. (C) miRNA-gene network plot of miRNAs connected to the antigen presentation gene module (Module 9, purple circle in (A)). Edges show the significant negative correlations between the miRNAs (grey circles) and the predicted target genes (purple circles). (D) Boxplots of residual expression levels of the 9 miRNAs specifically connected to module 9 in proliferative subtype biopsies stratified by their outcome status (progressive/persistent samples in dark green and regressive samples in light green). *p< 0.05, determined by linear mixed effect model.
Identification of miRNAs and genes associated with PML progression status.
The association between miRNAs that may regulate gene modules and PML progression status was determined based on whether the expression level of a miRNA is significantly different between the progressive/persistent and the regressive PMLs of the proliferative subtype (p<0.05). Within each model, miRNA residual expressions were used as the dependent variable, PML progression status was the main independent variable, and patient was included as a random effect using ‘duplicateCorrelation()’ function from limma v3.44.3 (22).
To identify the genes that are associated with PML progression status, we used the full gene expression data of the discovery and validation cohorts from our previous publication, including those samples without matched miRNA sequencing data (10). Similar linear mixed-effect models were performed with gene residual expressions (adjusted for sequencing batch and TIN) as the dependent variable, PML progression status as the independent variable while adjusting patient as a random effect. We further validated gene associations with PML progression status using an external microarray dataset (12) (GSE114489), in which linear models were used with probe intensities as the dependent variable and progression status as the main independent variable. The progressive/persistent group included the persistent bronchial dysplasia and progressive non-dysplasia samples based on the original annotation, and the regressive group included the regressive bronchial dysplasia group.
The association between hsa-miR-149–5p and the antigen presentation gene module was examined in three datasets with sample-matched miRNA and gene expression profiles, including the airway brushing samples from patience with PMLs, TCGA LUSC samples, and AEGIS samples. Gene set enrichment analysis (GSEA) (28) was performed with R package fgsea (29) (RRID: SCR_003199) with random seed of 1234 and nperm=10000, where genes were ranked by their Pearson correlation coefficient with hsa-miR-149–5p. The negative correlation between hsa-miR-149–5p and NLRC5 was also validated in these three datasets using Pearson correlation. To evaluate the expression levels of hsa-miR-149–5p and its target genes between tumor and adjacent benign tissue, we utilized miRNA (n=45 paired samples) and gene (n=51 paired samples) expression profiles of TCGA LUSC tumor tissue that had matched adjacent benign tissue. Linear models using limma and voom-transformed data (20,22) were performed to derive the gene differentially expressed between tumor and benign tissue using the tissue type as the main variable while controlling for patient.
Examination of the cell type-specific expression of hsa-miR-149–5p.
To examine whether a miRNA is potentially specifically expressed within a certain cell type, we examined the correlation between the expression level of the miRNA and cell-type marker genes. The method for calculating Pearson correlation coefficients has been described in the previous section. The following cell type marker genes were used: CD3G for T cells, CD19 for B cells, CD68 for macrophages, KRT5 for basal cells, FOXJ1 for ciliated cells, MUC5AC for goblet epithelial cells, SCGB1A1 for club cells, MUC5B for mucus secretory cells and CEACAM5 for peri-goblet cells (30). Also, miRNA expression data from functional annotation of the mammalian genome 5 (FANTOM5) project were used to evaluate whether the expression of a miRNA is enriched within a certain group of samples43. Classification of FANTOM5 samples to cell types were based on cell ontology annotation from FANTOM5. Samples belonging to the “epithelial cell” or “leukocyte” group were used as the epithelial and immune compartment. Samples were ranked by their normalized expression levels of hsa-miR-149–5p and GSEA (28) using R package fgsea (29) with seed 1234 and nperm=10000 was used to test whether samples with higher hsa-miR-149–5p expression were enriched for samples from a specific group of cell types. Gene expression data from FANTOM5 were also used to examine the correlation between hsa-miR-149–5p and NLRC5 among different cell-types. The visualization of the miRNA expression by samples was done in similar fashion as in fantom.gsc.riken.jp
In vitro perturbation of hsa-miR-149–5p and NLRC5 expression.
Human LUSC cells, SW900 (ATCC Cat#HTB-59, RRID: CVCL_1731), were grown in RPMI medium and transfected using Lipofectamine RNAiMAX (Thermofisher Cat#13778075) following the manufacturer’s protocol. Cells were transfected with 1) hsa-miR-149–5p (mirVana, Thermo Ref#4464066, ID: MC12788); 2) miR scramble (mock) control (MISSION, Sigma Cat#HMC0002); 3) siNLRC5 (NLRC5 84166 siRNA, SMARTpool, Dharmacon Cat#L-018267–020010). Cells were incubated in transfection mix for 48 hours, washed, and lysed in Qiazol (Qiagen Cat#79306) for 5 min at room temperature before being stored at −80°C until RNA isolation. Following RNA isolation, qRT-PCR (6 to 10 replicates per condition) was conducted using 1 ng of RNA per sample that was reverse transcribed using the Transcriptor First Strand cDNA Synthesis Kit (Roche Cat#04896866001) following the supplieŕs protocol. For miRNA quantification, 1 ng total RNA was used in cDNA synthesis via the TaqMan Advanced miRNA cDNA Synthesis Kit (Thermofisher #CatA28007). qRT-PCR was performed using the TaqManTM Fast Advanced Master Mix (Applied Biosystems Cat#4444557) and TaqMan probes (Thermo): PPIA (Hs04194521_s1), NLRC5 (Hs01072123_m1), HLA-A (Hs01058806_g1), HLA-B (Hs00818803_g1), HLA-C (Hs00740298_g1), PSMB9 (Hs00160610_m1), TAP1 (Hs00388675_m1), hsa-miR-149–5p (477917_mir), and hsa-miR-98–5p (478590_mir) (Supplementary Table S2). The 2−ΔΔCT method (31) was used for relative transcript quantification. PPIA was used as the reference gene for mRNA quantification. For miRNA quantification, miR98 served as an internal control. The average ΔCT of mock controls in each 96-well qPCR plate (representing experiment batch) was used as the reference sample. Gene expression differences between treatment conditions were determined by two-sided Wilcoxon test.
Quantification of MHC-I cell surface expression.
SW900 cells were transfected as described above with siNLRC5 or mock control. Following 24 hours of transfection, cells were stimulated with IFN-γ (Recombinant Human IFN-gamma Protein, R&D systems Cat#285-IF-100) at a concentration of 80ng/mL. Following 24 hours of stimulation, cells were harvested for qRT-PCR validation of the NLRC5 knockdown (~70,000 cells) and for flow cytometry (~500,000 cells) to measure MHC-I surface expression. Cells for flow cytometry were stained on ice for 30 minutes with anti-human HLA-A, B, C (pan-MHC-I, clone W6/32, PE, Biolegend Cat# 311406, RRID: AB_314875, 1:500) in PBS/4% FBS/2mM of EDTA after Fc block (Biolegend Cat# 422301, RRID: AB_2818986, 1:50, 5 min). A viability dye (Zombie Aqua, Biolegend Cat# 423101, 1:800) was included. Samples were washed and acquired by the Cytek Aurora flow cytometer using standard tube mode, followed by analysis using FlowJo v.10 software (RRID:SCR_008520). MCH-I levels in live cells were analyzed using a linear mixed model with group as a fixed factor (mock, mock/IFN-γ, siNLRC5, siNLRC5/IFN-γ) and sample as random effect. Significance was assessed with the contrast (mock/IFN-γ - mock) - (siNLRC5/IFN-γ - siNLRC5) to test whether INF-γ induced change in the controls exceeded that in the NLRC5 knockdown using lmerTest v3.1–3 (RRID:SCR_015656) (32) and emmeans v1.10.3 (RRID:SCR_018734) (33).
Preparation of T cells
T cells were isolated and expanded from PBMCs (StemCell Technologies, Cat#70025.1) by culturing cells in RMPI supplemented with 10% FBS (Gibco, Thermofisher Cat#A5670801, 1% Penicillin-streptomycin (Fisher Scientific Cat#15140–122, and 100 units/mL of Interleukin 2 (Fisher Scientific Cat#200–02-10UG) with the addition of anti-human CD3 (clone OKT3, Thermofisher Cat#16–0037-81, RRID: AB_468854, 1ug/mL in 2ml) and anti-human CD28 (clone CD28.2, Thermofisher Cat#16–0289-81, RRID: AB_468854, 5ug/mL) for 7 days, and then without the CD3 and CD28 for an additional 7 days. The expansion of T cells was confirmed using flow cytometry and staining cells as described above with anti-CD45 (clone HI30, Pacific Blue, Biolegend Cat#304022, RRID: AB_493655, 1:200) and anti-CD8 (clone SK1, PerCP, Biolegend Cat#344708, RRID: AB_1967149, 1:100).
Cytotoxic T lymphocyte killing (CTL) assay
SW900 cells were plated in 96-well plates at a density of 10,000 cells per well and transfected as described above with siNLRC5 or mock control. After 24 hours of transfection, cells were stimulated with IFN-γ (Recombinant Human IFN-γ Protein, R&D systems) at a concentration of 80ng/mL. Following 24 hours of stimulation, T cells expanded as indicated, were plated with the SW900 cells at a ratio of 1 (SW900): 5 (T cell). After 16 hours of incubation, the amount of LDH release was assessed using the CyQUANT LDH Cytotoxicity Assay (Thermofisher Cat#C20300) according to manufacturer protocol. Wells containing only SW900 cells were used as 1) low-level controls (LDH spontaneous activity) and 2) high-level controls with the addition of lysis buffer (LDH maximum release). The percentage of cell-mediated cytotoxicity was calculated based on the formula: (LDH absorbance value (AV) – LDH spontaneous activity AV)/(LDH maximum activity AV – LDH spontaneous activity AV) x 100. For each condition, siNLRC5 or mock control, the mean LDH spontaneous activity AV (n=6) and mean LDH maximum activity AV (n=12) were used in the formula. The percentage of cell-mediated cytotoxicity was compared between cells transfected with siNLRC5 (n=18) or mock control (n=18) via two-sided t-test.
Biopsy selection, sectioning, staining, and pathologist annotation of digitized images.
FFPE tissues sections from 19 proliferative biopsies from 9 patients (n=10 progressive/persistent and n=9 regressive) were selected from samples where adjacent biopsies were profiled by mRNA-seq and miRNA-seq (Supplementary Table S3). Three sequential tissue sections were stained using H&E for annotation of histopathology features, chromogenic miRNA in situ hybridization (miRNA-ISH using miRNAscope, ACDBio Cat#324530) with a hsa-miR-149–5p probe (ACDBio Cat#1058711-S1) to assess hsa-miR-149–5p expression, and imaging mass cytometry (IMC) using a cocktail of 28 antibodies targeted to epithelial and immune populations as well as NLRC5 and its downstream targets (for 14 out of the 19 biopsies). The H&Es were scanned at 40X using the Leica Aperio AT2 scanner and the miRNA-ISH slides were scanned under oil immersion at 63X using the Leica Aperio Versa Scanner. The digitized H&E image files were uploaded to the deepPath Pixelview software (https://www.deeppath.ai) and the epithelial tissue was annotated by the pathologist (E.J. B.) by creating masks for the following: normal epithelium, reserve cell hyperplasia, goblet cell hyperplasia, immature squamous metaplasia, squamous metaplasia, and mild, moderate, or severe squamous dysplasia. These annotations were manually transferred to digitized images of the miRNA-ISH and IMC using QuPath v0.3.2 software (RRID: SCR_018257) (34) and reviewed for accuracy by the pathologist.
Quantification of hsa-miR-149–5p expression in biopsies via miRNAscope.
For miRNA-ISH, 5 μm FFPE tissue sections were stained using the miRNAscope HD Assay Red (ACD Bio, Newark, CA, Cat# 324530) with probe SR-hsa-miR-149–5p-S1 (ACD Bio, Newark, CA, Cat# 1058711-S1). The staining was performed using the manufacturer’s protocol, however, treatment with Protease III was reduced to 10 min to conserve tissue morphology of the small biopsy samples.
A pixel classifier to detect the spots of hsa-miR-149–5p signal was trained on a representative region (250000 μm2) selected in each sample using the artificial neural network (ANN) model from QuPath v0.3.2 with default multiscale features of 3 channels – hematoxylin, probe stain, and residual channels (34). To validate the classifier performance, we selected a representative region (16726 +/− 4332 mm2) different from the training regions in each sample and annotated the hsa-miR-149–5p spots manually. The recall (calculated as true positive (TP) divided by TP plus false negative (FN)), the precision (calculated as TP divided by TP plus false positive), and the F1 score (calculated as the harmonic mean of recall and precision) were calculated per sample. The hsa-miR-149–5p density is defined either as hsa-miR-149–5p predicted spots per annotated area or as hsa-miR-149–5p predicted spots per cell. The cell number was estimated by nuclei segmentation using StarDist extension v0.3.2 (35) with the pre-trained brightfield model implemented via QuPath software in regions annotated as epithelium. The association of hsa-miR-149–5p density with progression status was modeled as a linear mixed-effect model, with the interaction of progression status and histology as fixed effect and sample ID as random effect using lmerTest v3.1–3 (32). To evaluate the association of hsa-miR-149–5p density with histology, linear mixed effect model with histology as fixed effect and sample ID as random effect was applied to the miRNA-ISH images with progressive/persistent lesions or regressive lesions, respectively.
Imaging mass cytometry (IMC) staining and ablation.
The IMC experiments assayed the levels of 28 proteins and DNA (Supplementary Table S4). Antibodies were pre-conjugated (Standard BioTools) or conjugated to lanthanide isotopes following the MaxPar Antibody Labeling kit manufacturers’ protocol. The IMC staining was done using the proximal level of the 14 FFPE tissue sample slides analyzed via miRNAscope™ assay (Supplementary Table S3). The slides were dewaxed in xylene for 10 min, followed by two more xylene washes of 5 min each. Tissue was rehydrated in sequential alcohol dilutions of 100% (three consecutive washes), 95% (two consecutive washes), 80% and 70% for 10 min each before washing in deionized water for 10 min. Heat induced epitope retrieval was performed in Tris-EDTA pH8.5 antigen retrieval buffer containing 5% glycerol and heating to 96°C in the microwave for 30 min. Slides were allowed to cool and washed twice in deionized water and two washes in PBS (10 min each). Samples were blocked with 3% BSA in PBS for 45 min at room temperature in a wet chamber. The antibody cocktail was prepared to a final concentration of 0.5% BSA and added to the samples for overnight incubation at 4°C. The stained samples were washed twice in 0.2% TritonX100/PBS followed by one wash in PBS for 8 min each. A 1:1000 dilution of Iridium intercalators (Standard BioTools) in PBS were used to stain DNA for 30 min at room temperature. Finally, slides were washed in PBS for 8 min and deionized water for 5 min before air drying. Regions of interest (ROIs) were laser ablated using the Hyperion Imaging Mass Cytometry™ system (RRID:SCR_023195 with a 200 Hz frequency at 1 μm2 resolution.
IMC data processing and quality control.
The IMC data was segmented into single cells and single cell data was extracted using the Steinbock (v0.14.1) toolkit (36). Briefly, the raw data (in MCD format) were converted to standardized OME-TIFF files and single cell masks were generated using the TissueNet model in Cellpose(37). DNA1 and DNA2 were used as the nuclei channel, and multiple markers, including KRT5, KRT8, E-Cadherin, SCGB1A1, MUC5AC, MUC5B, FOXJ1, CEACAM5, CD3, CD4, CD8A, CD20, CD68, HLA-DR, αSMA, and Vimentin (VIM), were used as the membrane/cytosol channel. Single cell features, including mean intensity of each marker and spatial coordinates, were extracted from the single-cell masks across all available channels. The extracted single cell data was converted to a SpatialExperiment object using the imcRtools v1.8.0 R package (RRID: SCR_027613) (36) for downstream analysis. Regions of interest (ROIs) less than 1000 cells were removed from this analysis (n=7 ROIs) and cells with an area less than 1st percentile (21 μm2) or larger than 98th percentile (198 μm2) were filtered out (n=2804 cells). A total of 87,401 cells from 24 ROIs were used for final analysis. Antibodies with poor signal or non-specific staining (HLA-ABC, Ki-67, CD25) were excluded from the analysis.
IMC data clustering and differential composition analysis.
The mean intensity of the single-cell raw measurement for each marker was log-transformed, and z-score normalized with respect to each ROI, referred to as log-Z normalization. The canonical phenotypic markers for epithelial cells (KRT5, KRT8, p40, E-Cadherin, MUC5AC, MUC5B, CEACAM5, SCGB1A1, FOXJ1), immune cells (CD20, CD3, CD4, CD8A, GZMB, PDL1, CD68, CD163, HLA-DR), and stromal cells (αSMA, VIM) were used for UMAP dimension reduction, fastMNN batch correction (RRID:SCR_017351) (38) and PhenoGraph clustering (RRID:SCR_022603) (39) of all the cells. Broad cell types, including epithelial, immune, and stromal cells, were annotated based on the expression of KRT5, E-Cadherin, CD20, CD3, HLA-DR, αSMA, and VIM. Given that the proportions of epithelial and non-epithelial regions are different across ROIs, we analyzed the epithelial and non-epithelial (including broad immune and stromal cells) regions separately. For each region type (epithelial or non-epithelial), signals were log-Z normalized within each ROI and used for tSNE dimension reduction and PhenoGraph clustering (39). In the non-epithelium compartment, a cluster was identified where more than half of the cells have epithelial histology annotations. 1260 cells with any epithelial histology annotation in this cluster, which are shown to localized in epithelium in IMC images, are reclassified as broad epithelial cells. Then, we repeated the clustering analysis as described above. Cell types in epithelial regions were identified based on epithelial and immune cell markers, while non-epithelial cell types were identified based on immune and stromal cell markers. The differential composition analysis of the cell types by histology, smoking, and progression status was performed using sccomp v.1.7.14 (40) within each region type.
Quantification of NLRC5 protein expression and its association with tissue and sample characteristics.
We observed that the NLRC5 levels were significantly higher for cells within epithelium compared to the non-epithelial regions. To compare the NLRC5 levels across ROIs, we resampled the cells in non-epithelial regions to equalize the proportions of cells between epithelial and non-epithelial regions in each ROI, followed by z-score normalization of the log-transformed NLRC5 intensities, referred to as log-Z stromal normalized NLRC5. Epithelial cells were dichotomized as NLRC5-high or NLRC5-low using the overall average of the log-Z stromal normalized NLRC5 across all epithelial cells as the cut off. The association of log-Z stromal normalized NLRC5 expression with progression status was modeled using a linear mixed-effect model, with the interaction of progression status and histology as fixed effect and ROI as random effect using lmerTest v3.1–3 (32). The estimated marginal means for the fitted model were computed by emmeans v1.10.3 (33).
IMC spatial cell-cell proximity analysis.
Cell neighbors for each cell were identified by distance-based expansion with a centroid distance threshold of 30 mm using buildSpatialGraph() function in imcRtools v1.8.0 (36). Immune cell type annotations from both epithelial and non-epithelial regions were merged for the same cell types. Broad epithelial or epithelial subtypes are stratified by the NLRC5 level as previously described. Spatial cell-cell proximity analysis for each ROI was performed using standard neighborhood permutation test (41) with 500 iterations using testInteractions() function in imcRtools v1.8.0 (36). The spatial proximity score was defined as the fraction of perturbations having proximity counts equal to or less than the observed counts for each pair of cell type entries. Difference in spatial proximity between NLRC5-high and NLRC5-low epithelial cells or epithelial subtypes with respect to immune subtypes were assessed by the two-sided paired Wilcoxon test.
Data availability.
Endobronchial Biopsy and Airway Brushing miRNA Expression Data.
Raw FASTQ files and the miRNA expression matrices and associated sample and clinical data are available on Gene Expression Omnibus accession number GSE313146 and the Human Tumor Atlas Network Data Portal (https://humantumoratlas.org/explore) using the identifications provided in Supplementary Table S5.
Endobronchial Biopsy Imaging Data.
H&E, miR-ISH, and IMC images are available on Zenodo (42) and the miR-ISH and IMC images are also available via the Human Tumor Atlas Network Data Portal (https://humantumoratlas.org/explore) using the identifications provided in Supplementary Table S5.
Endobronchial Biopsy and Airway Brushing Gene Expression Data.
Raw FASTQ files are available via dbGaP accession phs003185.v1.p1 and the gene expression count and residual matrices are available via Gene Expression Omnibus accession number GSE109743. Gene expression residual data from GSE109743 was used to construct miRNA-mRNA correlation network (10).
Additional Publicly Available Endobronchial Biopsy Data.
miRNA expression data from GSE93284 and gene expression from GSE66499 and GSE114489 were used to validate the miRNA-mRNA correlation and gene association with lesion outcome (12,43,44).
FANTOM5 Gene & miRNA Expression Data.
FANTOM5 miRNA expression profiles for 2595 mature miRNAs across 399 samples (directly downloaded from https://fantom.gsc.riken.jp/5/suppl/De_Rie_et_al_2017/ in November 2022) (45). The library size normalized (CPM) miRNA expression matrix was log2-transformed with pseudocount of 0.1. Matched gene expression profiles in transcripts per million (TPM; N=394) were downloaded from the same website and were used to examine the miRNA-mRNA correlation by cell type. and the tag count of the most supported peak (p1@NLRC5) was used for the expression of NLRC5 per sample. The TPM values were log2-transformed with pseudocount of 0.1 for correlation analysis.
TCGA-LUSC mRNA & miRNA Expression Data.
Sample matched gene and miRNA expression data for the TCGA LUSC (N=475) were also used to validate the miRNA-mRNA correlation. We also examined differences between has-miR-149–5p and NLRC5 in matched TCGA LUSC tumor tissue and adjacent benign tissue (n=45 paired samples for miRNA and n=51 paired samples for mRNA). Gene (legacy) and miRNA expression data of the primary tumor samples were obtained using TCGAbiolink (46) in October 2022. Counts associated with the same mature miRNA transcript were aggregated. miRNA counts normalization was performed as described above. The Pearson correlation coefficients were then calculated between miRNA and mRNA data on expression residuals adjusting for the plate as previously described (10).
Gene and miRNA expression from bronchial brushings from the Airway Epithelial Gene Expression in the Diagnosis of Lung Cancer (AEGIS) trials.
Gene expression and miRNA expression data from sample matched mainstem bronchus brushing samples (N=341) were obtained from AEGIS clinical trials. Gene-level expression data was obtained from GSE66499 (47) and was processed as previously described (48). miRNA sequencing data was obtained from GSE93284 (43).
miRNA Target Database.
Three target databases for miRNA target information were used. For TargetScan v7.2 (24), all conserved miRNA sites were downloaded from targetscan.org. miRNAs with an additional suffix (.1 or .2) compared to miRbase were both included. For starBase (25), all predictions for hg19 were downloaded from starbase.sysu.edu.cn on December 2022 using the API and were filtered for clipExpNum >= 5. For miRTarBase v7.0 (26), the HGNC symbols were translated to Ensembl IDs (GRCh37) using biomaRt (49).
Code availability.
All code is available via the following Github repository, https://github.com/jbeane0428/miR149.git.
RESULTS
miRNA-gene network analysis identifies miRNAs associated with gene co-expression modules that define PML molecular subtypes.
As miRNAs have been shown to be important in carcinogenesis (50,51), we sought to construct a miRNA-gene network from bronchial PMLs to elucidate how miRNAs may be negatively regulating target genes associated with PML progression. To construct a miRNA-gene network, we generated high quality miRNA sequencing data from n=156 endobronchial biopsies from 28 patients that had matching bulk RNA sequencing data (10) (Figure 1A). This subset was representative of the larger previously published cohort (10), based on the observed associations between lesion molecular subtype, smoking status, and histology (chi-square test p<0.05) (Table 1 and Supplementary Table S6). The paired miRNA and mRNA expression data was used to construct the network where nodes are miRNAs and genes and edges represent significant negative correlations between miRNAs and their predicted target genes. After removing miRNA/gene nodes without any connections, the miRNA-gene network consisted of 14744 edges between 464 miRNAs and 2296 genes. The median number of connected target genes for each miRNA was 18. Next, we filtered edges (Figure 1B; see Methods) to identify module associated miRNAs that may have central gene-expression regulatory roles within the miRNA-gene network in each of the nine previously described (10) gene modules that define the PML molecular subtypes (Supplementary Table S1). For each gene module, we identified and retained only the connections from miRNAs whose associations with the module target genes are stronger (more negative) than with non-module target genes and where the significant negatively correlated target genes are significantly enriched within the module. After excluding nodes without any connections and summarizing miRNA-gene connections to the gene module level, we obtained a bipartite directed network consisting of 321 miRNA nodes targeting genes in the nine gene modules (Figure 2A; Supplementary Table S7). The connectivity of the miRNA-gene module network did not show a dependence on the number of genes in a module and 94 miRNAs had connections to more than one gene module.
Table 1.
Clinical annotations for endobronchial biopsies that were profiled by miRNA sequencing stratified by PML molecular subtypes.
| Proliferative (n=46) |
Inflammatory (n=27) |
Secretory (n=46) |
Normal (n=37) |
||
|---|---|---|---|---|---|
| Dysplasia Grade | chi=38.50, p=0.0033 | ||||
| Normal | 3 (6.5) | 4 (14.8) | 12 (26.1) | 10 (27.0) | |
| Hyperplasia | 4 (8.7) | 8 (29.6) | 7 (15.2) | 6 (16.2) | |
| Metaplasia | 8 (17.4) | 8 (39.6) | 9 (19.6) | 12 (32.4) | |
| Mild Dysplasia | 5 (10.9) | 1 (3.7) | 8 (17.4) | 2 (5.4) | |
| Moderate Dysplasia | 19 (41.3) | 4 (14.8) | 7 (15.2) | 5 (13.5) | |
| Severe Dysplasia | 7 (15.2) | 1 (3.7) | 1 (2.2) | 1 (2.7) | |
| UNK | 0 (0.0) | 1 (3.7) | 2 (4.4) | 1 (2.7) | |
| Smoking Status (genomic prediction) | 40 (87.0) | 11 (40.7) | 35 (76.1) | 12 (32.4) |
chi=35.20,
p < 0.001 |
| Progression Status | chi=24.37, p=0.0038 | ||||
| Progression/persistent | 14 (30.4) | 4 (14.8) | 14 (30.4) | 5 (13.6) | |
| Regressing | 14 (30.4) | 3 (11.1) | 5 (10.9) | 3 (8.1) | |
| Normal/Stable | 7 (15.2) | 12 (44.4) | 8 (17.4) | 14 (37.8) | |
| UNK | 11 (23.9) | 8 (29.6) | 19 (41.3) | 15 (40.5) |
Statistical tests for categorical clinical variables (dysplasia grade, smoking status, progression status and batch) were conducted using Chi-square tests. Statistical tests for continuous variables (TIN) were compared using two-sided Student’s t-tests. Percentages are reported for categorical variables and mean/standard deviations are reported for the continuous variable.
To validate that our network captures biologically relevant miRNA-gene relationships, for each module, we confirmed that the correlation density between the expression levels of module associated miRNAs and a module metagene score computed via GSVA were left-shifted from zero (Figure 2B). We also observed a significant negative correlation pattern between the GSVA score of a gene module and the GSVA score calculated from the module associated miRNAs (i.e. negative correlation along the diagonal line in Supplementary Figure S1; FDR<0.05, Pearson correlation), except for Module 2. Moreover, the connections between miRNA and gene modules in the network reflect multifaceted functions of miRNAs demonstrated by previous studies (52,53). For example, miR-200a/b-3p are connected to gene modules with genes associated with the extracellular matrix and the immune activation (54,55) (module 1 and 8; Supplementary Table S7). These results suggest that our miRNA-gene module network reveals biologically meaningful regulatory relationships between miRNAs and the predicted target genes.
Genes in the PML progression-associated antigen presentation module are connected to hsa-miR-149–5p.
The antigen presentation gene module (module 9), enriched with genes associated with interferon signaling and antigen presentation and processing, has previously (10) been shown to be down-regulated among proliferative subtype progressive/persistent compared to the regressive PMLs. To identify miRNAs that potentially contribute to the PML progressive phenotype, we used the miRNA-gene module network to identify 9 miRNAs (hsa-let-7e-5p, hsa-miR-125a-5p, hsa-miR-145–5p, hsa-miR-149–5p, hsa-miR-152–3p, hsa-miR-199a-5p, hsa-miR-26a-5p, hsa-miR-296–3p and hsa-miR-664b-3p) that were significantly negatively correlated with 33 out of the 112 module 9 genes (Figure 2C). For each of these 9 miRNAs, the miRNA-gene correlations with predicted target genes in module 9 were significantly more negative than correlations with predicted target genes in the other gene modules, and the significant negatively correlated target genes were enriched within module 9 (FDR<0.05, t-test and Fisher-exact test; Supplementary Table S8). Next, we examined whether the expression levels of these miRNAs are associated with the progressive status of proliferative subtype PMLs. hsa-miR-149–5p was the only significantly up-regulated miRNA in progressive/persistent (n=14) versus regressive (n=14) proliferative subtype PMLs (p<0.05, linear mixed-effect model; Figure 2D and Supplementary Table S9), suggesting its higher expression is associated with worse lesion prognosis in concordance with its previously reported oncogenic role in different cancer settings (56).
hsa-miR-149–5p targets MHC-I genes by suppressing NLRC5 expression.
Next, we sought to elucidate how hsa-miR-149–5p up-regulation contributes to lesion progression by investigating the 7 genes in module 9 connected to the miRNA in the network analysis (Figure 2C). We compared the expression levels of these genes between the proliferative subtype progressive/persistent (n=15) versus the regressive (n=15) PMLs from the discovery cohort in our prior work (10) (GSE109473). All seven targets were down-regulated in the progressive/persistent PML samples, and the expression level of three target genes, BTN3A3, HLA-E, and NLRC5, are significantly lower in the progressive/persistent proliferative PMLs than in the regressive ones (p<0.05, linear mixed-effect model; Figure 3A and Supplementary Table S10). The lower expression of these genes among PMLs that progress is also observed in the proliferative PML samples from our validation cohort (n=7 progressive/persistent and n=12 regressive) and in an independent dataset from Merrick et. al. (n=32 progressive/persistent and n=15 regressive, GSE114489) (Supplementary Table S10).
Figure 3. Hsa-miR-149–5p and NLRC5 target genes are associated with PML progression and in vitro modulation of NLRC5 changes MHC-1 expression and CD8 T cell-mediated cytotoxicity.

Boxplots of residual expression levels of progressive/persistent (dark green) and regressive (light green) Proliferative subtype PMLs from the discovery cohort (GSE109473) for (A) hsa-miR-149–5p significant negatively correlated target genes in module 9 and (B) NLRC5 target genes. SW900 cells were transfected with mock control (red orange), hsa-miR-149–5p (green) or siNLRC5 (blue) for 48 hours. Relative gene expression detected by qRT-PCR (6 to 10 replicates per condition) of (C) NLRC5 and its downstream targets (HLA-A, HLA-B, HLA-C, TAP1, and PSMB9) and (D) hsa-miR-149–5p. PPIA was used as the reference gene to calculate relative fold change values. (E) Barplot showing the difference in mean fluorescent intensity (MFI) of surface MHC-I and (F) percent cytotoxicity between IFN- γ simulated and unstimulated conditions between mock and siNLRC5 treated SW900 cells. Data indicate median with IQR, and whiskers indicate minimum and maximum measurement. Error bars indicate standard errors. P values were determined using mixed effect linear models for (A), (B), and (E); two-sided Wilcoxon tests for (C) and (D); two-sided t-test for (F). * p<=0.05, **p<=0.01, ***p<=0.001; ns, no significance.
To further validate the association between hsa-miR-149–5p and module 9, we leveraged three airway and lung tumor datasets with sample-matched miRNA and gene expression profiles: airway brushing samples (n=87) obtained during the same bronchoscopy procedure as the endobronchial biopsies used to derive the miRNA-RNA network cohort, TCGA LUSC resection samples (46) (n=475), and airway brushings from the Airway Epithelial Gene Expression in the Diagnosis of Lung Cancer (AEGIS) trials(43,47) (n=341). Module 9 genes were significantly enriched among genes that were negatively correlated with hsa-miR-149–5p in all datasets (p<0.001, GSEA; Supplementary Figure S2A-S2D). Also, 47.7% of the genes in the leading edge driving the signal were overlapping across four studies, including four of the hsa-miR-149–5p predicted target genes in module 9 (BTN3A2, BTN3A3, HLA-E and NLRC5; Supplementary Figure S2E). These observations suggest hsa-miR-149–5p may promote the progressive PML phenotype by specifically suppressing target genes in the antigen presentation gene module.
Among hsa-miR-149–5p target genes that were significantly down-regulated in progressive/persistent lesions, NLRC5 was notable because prior studies have shown that it is a member of the NOD-like receptor (NLR) family and functions as a transcriptional activator of the MHC-I genes (57). NLRC5 is a predicted target for hsa-miR-149–5p in two out of the three miRNA target databases (24–26) used in our network construction, and was negatively correlated to hsa-miR-149–5p in the three airway and lung tissue datasets (p<= 0.05, Pearson correlation; Supplementary Figure S3A-S3C). Additionally, in TCGA LUSC tumor tissue with matched adjacent benign tissue, expression of hsa-miR-149–5p was up-regulated and expression of NLRC5 was down-regulated in tumor compared to adjacent benign tissue (p<0.001, linear mixed-effect model; Supplementary Figure S4A and S4B).
We previously reported that depletion of CD8 T cells is associated with PML progression (10), and hypothesized this could partially be due to hsa-miR-149–5p repressing the expression of the MHC-I genes by down-regulating NLRC5 expression. Seven genes (B2M, HLA-A, HLA-B, HLA-C, PSMB8, PSMB9, and TAP1) were identified to be directly regulated by NLRC5 based on literature reviews (58,59). NLRC5 promoter-binding at the mouse orthologues of these genes can be observed via ChIP-seq experiments from mice T cells (60). The 7 NLRC5 target genes are also within module 9, and the expression levels for all, except HLA-C, were significantly down-regulated within the proliferative subtype progressive/persistent versus regressive PMLs in the previously published discovery cohort (Figure 3B and Supplementary Table S11). We further validated the downregulation of NLRC5 target genes in progressive/persistent versus regressive PMLs in the proliferative subtype PMLs from our validation cohort and in the samples from Merrick et. al. (GSE114489) (Supplementary Table S11).
Quantitative real time-PCR (qRT-PCR) of SW900, lung squamous cancer cell line, transfected with hsa-miR-149–5p, siNLRC5, or a miR scramble control (mock) sequence, demonstrated that cells transfected with hsa-miR-149–5p significantly increased hsa-miR-149–5p and decreased NLRC5 and its target genes, HLA-B, TAP1, and PSMB9, compared to mock. Transfection with siNLRC5 significantly decreased NLRC5 and its target genes HLA-A/B/C, PSMB9, and TAP1 (p<0.05, two-sided Wilcoxon test; Figure 3C and 3D). The experiments also suggest a potential negative feedback loop between NLRC5 and hsa-miR-149–5p as NLRC5 was decreased more strongly by experiments with siNLRC5 than hsa-miR-149–5p alone and siNLRC5 significantly decreased hsa-miR-149–5p expression compared to mock (p< 0.01, two-sided Wilcoxon test; Figure 3C). The computational and qRT-PCR results suggest that hsa-miR-149–5p may promote lesion progression by suppressing NLRC5 and MHC-I gene expression.
To assess the functional consequences of NLRC5 knockdown, using flow cytometry we measured MHC-I surface expression in SW900 cells transfected with siNLRC5 or a mock sequence with and without IFN-γ stimulation. Decreased NLRC5 expression (Supplementary Figure S5A) resulted in a significant reduction in MHC-I surface expression upon stimulation with IFN-γ compared to mock control (p< 0.001, linear mixed-effect model; Figure 3E and Supplementary Figure S5B). Based on these results, we next examined whether SW900 cells with decreased NLRC5 expression had reduced T cell-mediated killing. Following the co-culture of T cells and SW900 cells, there was a significant reduction in cell-mediated cytotoxicity in IFN-γ stimulated siNLRC5 SW900s compared to mock control (p<0.01, two-sided t-test; Figure 3F). These data indicate that decreased NLRC5 expression reduces both MHC-I surface expression and CD8 T cell cytotoxicity, suggesting that decreased NLRC5 expression may facilitate immune evasion in progressive/persistent lesions.
hsa-miR-149–5p expression is enriched within epithelial cells.
Interpreting the potential role of hsa-miR-149–5p in lesion progression is dependent on the cell type specificity of its expression and its targets. To identify the cell type specificity of hsa-miR-149–5p expression, we examined the correlation pattern between hsa-miR-149–5p and canonical cell type markers within our lesion biopsy samples. We observed significant positive correlations between hsa-miR-149–5p and the basal cell marker, KRT5 (61) (FDR<0.001, Pearson correlation) and the peri-goblet cell marker, CEACAM5 (30) (FDR<0.01, Pearson correlation; Figure 4A). In comparison, hsa-miR-34b-5p and hsa-miR-449c-5p, which are expressed highly in mucociliary epithelia, were positively correlated with FOXJ1, and hematopoiesis essential hsa-miR-150–5p was positively correlated with B and T cell markers (62) (Figure 4A). We also utilized cell type-specific transcriptomic data from the FANTOM5 project (45) and ranked samples based on the normalized expression level of hsa-miR-149–5p and identified whether samples derived from epithelial or lymphoid cell types were enriched among samples with high hsa-miR-149–5p expression. Samples from epithelial cells (n=87) were positively enriched among samples with high hsa-miR-149–5p expression (p<0.001, GSEA), while samples from immune cells (n=37) were negatively enriched (p<0.01, GSEA; Figure 4B). Additionally, the expression of hsa-miR-149–5p and NLRC5 in the FANTOM5 data were significantly negatively correlated between samples from epithelial cells (p<0.01, Pearson Correlation; Figure 4C), but not in samples from immune cells (Supplementary Figure S6), suggesting that that the suppression of NLRC5 by hsa-miR-149–5p may be specific to epithelial cells. The highly enriched expression of hsa-miR-149–5p and negative regulation relationship between hsa-miR-149–5p and NLRC5 within the epithelial cells suggested that hsa-miR-149–5p may be involved in epithelial cell mediated immune evasion in early lung cancer.
Figure 4. hsa-miR-149–5p is highly expressed within the epithelium and associates with PML progression.

(A) Bubble plot showing the correlation between the residual expression level of hsa-miR-149–5p and cell-type marker genes, including CD3G (T cells), CD19 (B cells), CD68 (macrophages), KRT5 (basal cells), FOXJ1 (ciliated cells), MUC5AC (goblet cells), SCGB1A1 (club cells), MUC5B (secretory cells), and CEACAM5 (peri-goblet cells). Correlation results for hsa-miR-34b-5p, hsa-miR-449c-5p and hsa-miR-150–5p are shown as controls. (B) Enrichment of hsa-miR-149–5p across cell-type compartments in the FANTOM5 project. Bar chart (top) shows the normalized expression levels of hsa-miR-149–5p in the FANTOM5 samples (n = 399). Vertical bars (bottom) indicate the position of FANTOM5 samples derived from the epithelial (red) or immune (orange) cell compartments. (C) Scatter plot of the Pearson correlation between the normalized expression levels of hsa-miR-149–5p and NLRC5 within the FANTOM5 samples derived from the epithelial cell compartment (n = 87). The dashed red line represents the linear regression fit and the shaded region indicates the 95% confidence interval. (D-F) The hsa-miR-149–5p density was detected by miRNA-ISH from proliferative subtype PMLs (n = 19; 10 progressive/persistent, 9 regressive). (D) Boxplot showing the hsa-miR-149–5p density per area between the epithelium and non-epithelium regions. (E) hsa-miR-149–5p density per nucleus between the progressive/persistent and regressive PMLs within regions of normal epithelium, regions with hyperplasia or metaplasia (hyper-metaplasia), and regions of bronchial dysplasia. (F) Representative miRNA-ISH staining of hsa-miR-149–5p for progressive/persistent PMLs (top) and regressive PMLs (bottom). Arrows indicate stained hsa-miR-149–5p within epithelium. Boxplots indicate median with IQR and whiskers indicate minimum and maximum measurement in (D) and (E). P values were FDR (Benjamini & Hochberg) adjusted and determined by Pearson correlation coefficients for (A), the two-sided paired Wilcoxon test for (D), and a linear mixed effect model for (E). * p<=0.05, **p<=0.01, ***p<= 0.001, ****p<=0.0001; ns, no significance.
Spatial localization of hsa-miR-149–5p in PMLs showed higher expression in the epithelium that was associated with PML progression.
To localize and validate the computational findings, consecutive tissue sections from proliferative subtype endobronchial biopsies underwent staining with H&E to annotate the histologic grades present in the epithelium and chromogenic miRNA in situ hybridization (miRNA-ISH) with a hsa-miR-149–5p probe to measure hsa-miR-149–5p expression and localization (Figure 1A, Figure 1C-D, Supplementary Table S3). An expert pathologist annotated the histologic grades of the epithelium in each digitized whole slide H&E image, and these annotations were transferred to the miRNA-ISH and grouped as follows: 1) normal; 2) hyperplasia, immature squamous metaplasia, and squamous metaplasia (denoted as hyper-metaplasia); and 3) squamous dysplasia. The worst histologic grade observed in the H&E matched the clinical diagnosis in 81% of biopsies, suggesting that many features in the clinical specimens were retained in subsequent sections (Supplementary Table S3 and S12). A pixel classifier was trained to detect hsa-miR-149–5p signal spots in the miRNA-ISH data and its performance was validated on regions (16726 +/− 4332 mm2) containing manual spot annotations for each sample. The classifier showed high recall (0.84 +/− 0.06), precision (0.75 +/− 0.14), and F1 (0.78 +/− 0.08) measurement that was not associated with lesion outcome (Supplementary Figure S7A and S7B). The detected hsa-miR-149–5p spots per area were higher in the epithelium compared to non-epithelium (p < 0.001, two-sided paired Wilcoxon test; Figure 4D), supporting the computational results showing high hsa-miR-149–5p expression correlation with epithelial markers. We also observed higher hsa-miR-149–5p spot density in epithelial areas with squamous dysplasia in the progressive/persistent compared to regressive PMLs (p=0.05, linear mixed-effect model; Figure 4E and 4F). This difference was likely driven by changes in progressive/persistent PMLs as we observed that hsa-miR-149–5p spot density in dysplastic regions was increased compared to normal epithelium in the progressive/persistent lesions, but not in regressive lesions (p<0.05, linear mixed-effect model; Supplementary Figure S8A and S8B). These results confirm that hsa-miR-149–5p is expressed more highly in epithelial cells and that expression differences observed between progressive/persistent versus regressive PMLs may occur within regions of bronchial dysplasia.
IMC data shows differential cell composition associated with histology and correlated protein expression between NLRC5 and its downstream targets.
The transcriptomic and miRNA-ISH data suggest that hsa-miR-149–5p is more highly expressed in the epithelium, so we sought to confirm that the protein expression between NLRC5 and its downstream targets was correlated within epithelial cells via IMC. Using consecutive tissue sections from those used in the miRNA-ISH experiment, we ablated regions of interest (ROIs) containing epithelial and non-epithelial tissue, and cells from these regions were separated using expression of broad cell type markers (KRT5, E-Cadherin, CD20, CD3, HLA-DR, αSMA, and VIM) coupled with pathologist annotation described above (Figure 1E, Supplementary Figure S9A and S9B). The 41,147 high quality cells from the epithelial predominant tissue regions were used to identify 13 cell clusters (Figure 5A) containing epithelial and immune cell types based on expression of canonical cell markers (Figure 5B). Each cell cluster had cells from at least 13 samples (out of 14 samples tested). We did not, however, detect significant associations between cell cluster abundance and smoking status (FDR>0.05, sccomp differential composition test, Supplementary Figure S10A and S10B), potentially due to the underrepresentation of former smokers. The expected location of cell types in the IMC images helped confirm the cluster labels. For example, clusters labeled as basal cells were found close to the basement membrane while clusters labeled as fully differentiated cells (i.e., MUC5AC+ goblet cells), were found at the apical surface in both normal and hyper-metaplastic regions (Supplementary Figure S12A). Differential composition analysis showed that fully differentiated cells, including MUC5B+ secretory cells and FOXJ1+ ciliated cells, are decreased in regions of hyper-metaplasia and dysplasia, whereas intermediate state cells including CEACAM5+ peri-goblet cells and CEACAM5+SCGB1A1+ secretory cells, are increased (FDR<0.001, sccomp differential composition test, Figure 5C, Supplementary Figure S10C). The infiltration of immune cells, including CD8 T cells, CD4 T cells, NK cells, are significantly increased in dysplastic regions (FDR<0.001, sccomp differential composition test, Figure 5C, Supplementary Figure S10C).
Figure 5. Protein expression of NLRC5 and its targets are correlated and basal cells with increased NLRC5 expression are in close spatial proximity to CD8 T cells.

(A) t-distributed Stochastic Neighbor Embedding (tSNE) visualization of 8 epithelial and 4 immune cell populations within the epithelium (n = 41,147 cells) from IMC data (n= 14 samples, 7 progressive, 7 regressive). (B) Heatmap of the average log-Z normalized expression within the epithelium of canonical markers (left) and the average log-Z stromal normalized expression within the epithelium of NLRC5 and its downstream targets (right) in identified cell types from (A). Bar plot shows the relative proportion of each cell type. (C) Bubble plot shows the differential composition of cell types in hyper-metaplasia and dysplasia as compared to normal epithelium. Color represents the compositional estimate, and dot size represents the log FDR value computed by sccomp. (D) t-SNE visualization of epithelial cell populations colored by NLRC5-high (dark blue) versus NLRC5-low (light blue) expression. (E) Violin plot of the log-Z stromal normalized expression of NLRC5 target genes (TAP1, PSMB9, and B2M) stratified by the NLRC5-high and NLRC5-low labels. (F) tSNE visualization of 8 immune and 3 stromal cell population in the stroma region (n = 46,254 cells) from IMC data (n= 14 samples, 7 progressive, 7 regressive). (G) Heatmap of the average log-Z normalized expression within the stroma of canonical markers (left) and the average log-Z stromal normalized expression within the epithelium of NLRC5 and its downstream targets (right) in identified cell types from (F). Bar plot shows the relative proportion of each cell type. (H) Violin plot of log-Z stromal normalized expression of NLRC5 in the epithelium (red) versus the stroma or non-epithelial tissue (yellow). (I and J) Cellular neighbors for each cell were defined based on a centroid distance within 30 μm. Cell-cell proximity score within each ROI (n = 24) is computed by the permutation test in imcRtools. Cyan bar represents p value < 0.05 whereas orange bars are not significant. (I) Bar plot showing the change in spatial proximity of immune cell populations to NLRC5-high versus NLRC5-low epithelial cells, averaged across all ROIs. (J) Bar plot showing the change in spatial proximity of CD8 T cells to NLRC5-high versus NLRC5-low epithelial populations, averaged across all ROIs. P values were determined by sccomp differential composition test for (C), the two-sided unpaired Wilcoxon test for (E) and (H), the two-sided paired Wilcoxon test for (I) and (J). * p<=0.05, **p<= 0.01, ***p<= 0.001, ****p<= 0.0001.
The epithelial cells were dichotomized into cells with high or low NLRC5 expression based on the average of the log-z-score stromal normalized NLRC5 expression across ROIs (Figure 5D, Supplementary Figure S10D and S10E). The expression of NLRC5 downstream targets (B2M, PSMB9, and TAP1) were significantly lower in NLRC5-low versus NLRC5-high cells (p<0.001, two-sided Wilcoxon test, Figure 5E) and NLRC5 and its downstream targets were significantly correlated across epithelial cells (r>0.6, p<0.001, Supplementary Figure S10F). The proportion of NLRC5-low epithelial cells was enriched among KRT5-low basal cells, MUC5B low secretory cells and MUC5B+ secretory cells, while the proportion of NLRC5-high epithelial cells was enriched among basal cells, differentiating basal cells, CEACAM5+ peri-goblets cells CEACAM5+SCGB1A1+ secretory cells, and FOXJ1+ ciliated cells (FDR<0.05, sccomp differential composition test, Supplementary Figure S10G). These results suggest that protein expression of NLRC5 and 3 of its downstream targets are correlated in the epithelium and that NLRC5-high epithelial cells are enriched in intermediate state cell populations (e.g., CEACAM5+ peri-goblets cells CEACAM5+SCGB1A1+ secretory cells) that are increased in abundance in hyper-metaplastic and dysplastic regions.
Basal cells with high NLRC5 protein expression were in closer spatial proximity to CD8 T cells.
Next, we wanted to assess the spatial proximity of immune cell populations to epithelial cells with NLRC5-high versus low protein expression. Toward this goal, we clustered the 46,254 high quality cells in the non-epithelial tissue regions to identify 12 cell clusters (Figure 5F) based on expression of canonical cell markers (Figure 5G). Each cell cluster had cells from all samples, and we did not observe significant associations between cell cluster abundance and smoking status (Supplementary Figure S11A and S11B). NLRC5 expression was significantly lower in the non-epithelial tissue regions compared to the epithelial regions (p<0.001, Wilcoxon test, Figure 5H), suggesting that NLRC5 is more active within the epithelium. Given that some immune cells in non-epithelium regions are located near the epithelial basement membranes, we perform spatial proximity analysis using both epithelial and non-epithelial region immune cells. We tested the spatial cell-cell proximity of NLRC5-high and NLRC5-low epithelial cells to immune cells using the neighborhood permutation test (cell neighbors defined by a centroid distance=30 μm, n=500 permutations). We observed that NLRC5-high epithelial cells had closer spatial proximity to CD8 T cells whereas NLRC5-low epithelial cells had closer spatial proximity to B cells and macrophages (p=0.05, two-sided paired Wilcoxon test, Figure 5I, Supplementary Figure S12B and S12C). Among the epithelial cell types, NLRC5-high basal cells had significantly closer spatial proximity to CD8 T cells than NLRC5-low basal cells (p<0.05, two-sided paired Wilcoxon test, Figure 5J, Supplementary Figure S11C, S12B, and S12C). The observation that NLRC5-high basal cells are near CD8 T cells suggests that these cells may be important mediators of a CD8 T cell response to dysplastic lesion development.
Basal cells in progressive versus regressive PMLs have decreased NLRC5 expression in hyper-metaplasia and dysplasia regions.
Given that NLRC5-high basal cells are near CD8 T cells, we wanted to test whether there were cell type compositional differences between progressive/persistent and regressive lesions, as previously we have shown the reduction of CD8 T cells in progressive lesions (10). Differential cell composition analysis showed the proportion of CEACAM5+ peri-goblet cells are significantly higher in the epithelial tissue regions of progressive/persistent lesions compared with regressive lesions (FDR<0.01, sccomp differential composition test, Figure 6A, Supplementary Figure S10H). In the non-epithelial tissue regions, CD8 T cells, B cells and M2 (CD163+ only) macrophages are decreased, while VIM+ stromal cells and αSMA+VIM+ stromal cells are increased in progressive/persistent lesions compared to regressive lesions (FDR<0.05, sccomp differential composition test, Figure 6B, Supplementary Figure S11D). Basal cells were not decreased in abundance in progressive/persistent lesions, so we next investigated differences in NLRC5 expression. We found that NLRC5 protein levels are increased in epithelial regions of hyperplasia, metaplasia, and dysplasia compared to normal epithelium (p<0.001, linear mixed-effect model, Figure 6C) in regressive compared to progression/persistent lesions. As NLRC5 increases with increasing histologic severity, but increases less in progressive/persistent lesions, we wanted to test the interaction effect between histology and progression status within the epithelial cell types. We found that NLRC5 expression in the basal cells, basal/differentiating cells, K5 low basal cells, and MUC5B+ secretory cells is decreased in progressive/persistent lesions compared to regressive lesions in the hyper-metaplasia and dysplasia regions (p<0.05, linear mixed-effect model, Figure 6D, Supplementary Figure S12B and S12C). These analyses suggest that NLRC5 expression is decreased within basal cells in progressive/persistent compared with regressive PMLs and thus may be associated with the observed depletion of CD8 T cells.
Figure 6. Immune and stroma cell composition and epithelial NLRC5 expression is associated with PML progression.

Volcano plots showing the differential composition of progressive/persistent (n = 7) compared to regressive (n = 7) PMLs for (A) cell populations identified in epithelium and (B) cell populations identified in stroma regions. (C) Line plot showing the estimated marginal mean of NLRC5 expression with standard deviation in epithelial cells of progressive/persistent (dark green) and regressive (light green) PMLs, respectively, across the 3 histology regions. Although histology groups were modeled as categorical, the line plot is used to facilitate visualization of trends across histology grades. (D) Bubble plot showing the differential expression of NLRC5 between progressive/persistent and regressive PMLs in each epithelial cell population across the 3 histology contrasts. Color represents the estimate, and dot size represents the log p-value computed by linear mixed effect model. P values were determined by sccomp differential composition test for (A) and (B) and linear mixed effect models for (C) and (D). * p<=0.05, **p<= 0.01, ***p<= 0.001.
DISCUSSION
To move from treating to preventing lung cancer, a detailed understanding of the mechanisms that contribute to the development and progression of bronchial PMLs is needed. Our group and others have profiled the transcriptomes of bronchial PMLs, precursor lesions of LUSC, and showed that immune evasion is associated with increased histologic severity and lesion progression (6,10–12,63). We previously demonstrated bronchial PMLs can be classified into four distinct molecular subtypes based on nine conserved gene co-expression modules. The proliferative molecular subtype was significantly enriched with bronchial dysplasia, and within this subtype, decreased expression of genes involved in antigen processing and presentation (module 9) and decreased immune cell infiltration are associated with persistent/progressive versus regressive PMLs (10). In related prior work, progressive bronchial dysplasia was associated with decreases in T cells and HLA-DRA positive epithelial cells (12). In progressive CIS lesions, genetic and epigenetic alterations in antigen presentation-related genes including loss of heterozygosity in the HLA region have been observed (11). These findings support the hypothesis that impaired antigen processing and presentation contribute to lesion progression. However, the transcriptional regulators contributing to the dysregulation, especially in non-CIS PMLs, are still poorly understood. In this study, we measured miRNA expression profiles from longitudinally collected endobronchial biopsies previously profiled by mRNA sequencing and constructed a miRNA-mRNA module network to identify miRNAs associated with the antigen presentation gene co-expression module (module 9). We found that hsa-miR-149–5p, a miRNA highly expressed in the epithelium, was up-regulated among the persistent/progressive compared to regressive proliferative PMLs and repressed NLRC5, the transcriptional regulator of MHC-I genes. We found that decreased NLRC5 in a LUSC cell line blunted IFN-γ induced MHC-I surface expression and reduced CD8 T cell cytotoxicity. Furthermore, we used miR-ISH and imaging mass cytometry to characterize the spatial localization of hsa-miR-149–5p, NLRC5, and its downstream targets in the context of the lesion cellular architecture and the immune microenvironment in biopsy tissue sections. Our results demonstrate that NLRC5 and the MHC-I genes that it regulates are increased with increasing histologic severity in the epithelium. However, when comparing progressive/persistent versus regressive proliferative subtype PMLs within different histologic grades, there is a decrease in expression of NLRC5 and its downstream targets partially driven by increased hsa-miR-149–5p expression. Decreased NLRC5 in basal cells was associated with reduced spatial proximity to CD8 T cells and may contribute to the immune evasion phenotype.
Among the direct targets of hsa-miR-149–5p within Module 9 was NLRC5, a gene that regulates the expression of MHC-I (HLA-A, HLA-B, HLA-C, B2M) and immunoproteasome genes (PSMB8, PSMB9, TAP1). miR-ISH analysis indicated that hsa-miR-149–5p was expressed in epithelial cells of PMLs and in vitro experiments showed NLRC5 and several of its targets were decreased by hsa-miR-149–5p overexpression in a human lung epithelial cell line. Several studies, in different cancer contexts, support the oncogenic role of increased hsa-miR-149–5p (56). hsa-miR-149–5p is up-regulated in non-small cell lung tumors compared with adjacent tissues (64), lower expression of hsa-miR-149–5p inhibited cell growth in prostate cancer cells in vitro (65), and higher exosomal hsa-miR-149–5p was associated with metastatic melanoma (66). Notably, Srivastava et al. showed that IFN-γ activity may suppress hsa-miR-149–5p levels and drive an inflammatory response in keratinocytes (67). We observed a similar association where progressive/persistent proliferative PMLs have decreased expression of interferon signaling genes and high levels of hsa-miR-149–5p compared with regressive PMLs. Additionally, defects in antigen processing and presentation are hallmarks of cancer immune evasion (68). Many of the targets of hsa-miR-149–5p are inactivated by other oncogenic mechanisms to alter the antigen presentation activity in cancer cells, including the loss of heterozygosity of HLA and B2M genes (69,70), somatic mutation in HLA genes (71), epigenetic alterations (72,73), and transcriptional regulation (59). Less is known in bronchial PMLs, however, recent work showed that the loss of heterozygosity and hypermethylation in the HLA regions are associated with dysfunctional antigen processing and presentation in CIS lesions (11). Our findings suggest a post-transcriptional regulatory mechanism by which up-regulation of hsa-miR-149–5p in airway basal cells impairs antigen processing and presentation contributing to immune evasion.
Demonstrated by miR-ISH, hsa-miR-149–5p expression is increased in epithelial tissue regions compared to non-epithelial regions, and in regions of dysplasia, its expression is higher in progressive/persistent compared to regressive PMLs. The resolution of hsa-miR-149–5p expression to basal cells, however, was not definitive as we did not have a KRT5 counter stain. In the IMC data, we observed a significant decrease in NLRC5 expression in basal, basal/differentiating, KRT5 low basal, and MUC5B+ secretory cells in regions of hyper-metaplasia and dysplasia in progressive/persistent versus regressive PMLs. Since hsa-miR-149–5p expression is assayed across the whole tissue via miR-ISH, but the IMC results are based on regions of interest, direct integration of the results was not possible. The results, however, suggest that hsa-miR-149–5p and NLRC5 expression were anti-correlated in regions of dysplasia. Out of the epithelial cell types with decreased NLRC5 expression associated with progression, basal cells were the only cell type where the level of NLRC5 influenced its spatial proximity to CD8+ T cells. These observations were consistent with in vitro experiments that showed decreased NLRC5 expression results in reduced MHC-I surface expression and CD8 T cell-mediated cytotoxicity. Basal cells are the progenitor stem cell and suggested to be the cell of origin of LUSC in the proximal airway (74,75). Laughney et al. demonstrated an inverse association between expression levels of SOX2 (a transcription factor specifying lung cell fate) and MHC-I genes in primary and metastatic lung tumors (76). Similarly, the stem cell program in colon cancer has been associated with decreased levels of antigen presentation (77). These observations suggest that the cell states of basal cells may be an important determinant of the immune surveillance and evasion in bronchial PML progression. We also observed that CEACAM5+ peri-goblet cells, a smoking associated cell population (30), were enriched with increasing histologic severity and in progressive/persistent compared to regressive Proliferative PMLs. We did not, however, detect a change in NLRC5 expression in these cells associated with progression/regression status or that NLRC5 levels influence their spatial proximity to CD8+ T cells. Future investigations will be needed to further unravel the causality between epithelial cell state, expression of hsa-miR-149–5p and NLRC5, and the functional ability of cells to present antigen and recruit immune cells.
The IMC data also revealed that NLRC5 protein expression increases as the histology grade advances from normal epithelium to hyper-metaplasia and dysplasia accompanied by an increase in CD8 T cells, CD4 T cells and NK cells. In progressive/persistent compared to regressive proliferative PMLs there is a decrease in NLRC5 protein expression within hyper-metaplasia and dysplasia regions and CD8 T cells, B cells, and M2 macrophages are decreased. Spatial analysis demonstrated that NLRC5-high basal cells are in significantly closer proximity to CD8+ T cells compared to NLRC5-low basal cells. Levels of NLRC5 were also shown to decrease in basal cell populations in areas of hyper-metaplasia and dysplasia in progressive/persistent compared to regressive Proliferative PMLs. Similar relationships between NLRC5 regulated MHC-I gene expression and cancer immune evasion have been extensively studied in the context of cancer (58,59). Using the solid tumor data from TCGA, Yoshihama et al. demonstrated that lower NLRC5 expression is associated with deficient CD8+ T cell activation and poor prognosis (78). Furthermore, recent work demonstrated that impaired MHC-I processing and presentation pathway is related to resistance to immune therapy (79) and poor outcome (80) in lung cancer, and increased MHC-I gene expression can improve immunotherapy response (81), suggesting their critical roles and therapeutic potential in the context of lung cancer management.
In summary, utilizing miRNA-gene network with miR-ISH and imaging mass cytometry assays to identify and resolve the spatial and cellular localization of hsa-miR-149–5p, its downstream targets, and the immune microenvironment we have identified a post-transcriptional regulatory mechanism that may contributing to immune evasion and bronchial PML progression. Our results suggest cell type specific biomarkers of lesion progression and a potential epithelial-specific candidate therapeutic target to potentially reverse early immune evasion. Future experiments in model systems are needed to unravel regulatory mechanisms of hsa-miR-149–5p, to confirm the extent to which hsa-miR-149–5p regulates NLRC5 levels and the immune microenvironment, and to understand the association between MHC-I gene expression and recruitment of cytotoxic CD8 T cells. Additionally, testing anti-mir oligonucleotides (82,83) to target hsa-miR-149–5p in preclinical models such as the NTCU mouse (84,85) are needed to show its efficacy in preventing or delaying lesion progression. Uncovering potential regulators of early immune evasion holds promise to advance lung cancer chemoprevention in patients at high-risk for developing lung cancer.
Supplementary Material
SYNOPSIS.
Using integrative analysis of bronchial premalignant lesions, the authors identified hsa-miR-149–5p as a potential regulator of immune evasion in progressive/persistent lesions and a candidate therapeutic target for intercepting disease progression.
Funding information:
This work was supported by grants from the National Institutes of Health that include National Cancer Institute R21-CA253498, National Cancer Institute U2C-CA233238, National Cancer Institute U2C-CA271898, National Center for Advancing Translational Sciences through BU-CTSI Grant Number 1UL1TR001430, National Heart, Lung, and Blood Institute R01-HL124392. This work was also supported by sponsored research grants from Johnson and Johnson, the LUNGevity Foundation, the American Lung Association, Lung Cancer Research Foundation 1151770[RGEDPNLC], and a CDMRP Lung Cancer Research Program Idea Development Award LC230511.
Abbreviations list:
- CIS
Carcinoma In Situ
- H&E
Hematoxylin and Eosin (staining)
- IFN-γ
Interferon Gamma
- IMC
Imaging Mass Cytometry
- LUSC
Lung Squamous Cell Carcinoma
- MHC-I
Major Histocompatibility Complex Class I
- miRNA / miR
MicroRNA
- miRNA-ISH
MicroRNA In Situ Hybridization
- NLRC5
NOD-Like Receptor Family CARD Domain Containing 5
- PML
Bronchial Premalignant Lesion
- TCGA
The Cancer Genome Atlas
Footnotes
CONFLICTS OF INTEREST STATEMENT
B.N. and A.E.S. are employees of Johnson and Johnson. M.E.L., J.D.C., J.E.B., and S.A.M. received sponsored research agreements from Johnson and Johnson.
REFERENCES
- 1.Siegel RL, Miller KD, Wagle NS, Jemal A. Cancer statistics, 2023. CA Cancer J Clin. 2023;73:17–48. [DOI] [PubMed] [Google Scholar]
- 2.Zhang Y, Vaccarella S, Morgan E, Li M, Etxeberria J, Chokunonga E, et al. Global variations in lung cancer incidence by histological subtype in 2020: a population-based study. The Lancet Oncology. Elsevier; 2023;24:1206–18. [DOI] [PubMed] [Google Scholar]
- 3.Auerbach O, Stout AP, Hammond EC, Garfinkel L. Changes in Bronchial Epithelium in Relation to Cigarette Smoking and in Relation to Lung Cancer. New England Journal of Medicine. Massachusetts Medical Society; 1961;265:253–67. [DOI] [PubMed] [Google Scholar]
- 4.Breuer RH, Pasic A, Smit EF, van Vliet E, Vonk Noordegraaf A, Risse EJ, et al. The natural course of preneoplastic lesions in bronchial epithelium. Clin Cancer Res. 2005;11:537–43. [PubMed] [Google Scholar]
- 5.Ishizumi T, McWilliams A, MacAulay C, Gazdar A, Lam S. Natural history of bronchial preinvasive lesions. Cancer Metastasis Rev. 2010;29:5–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mascaux C, Angelova M, Vasaturo A, Beane J, Hijazi K, Anthoine G, et al. Immune evasion before tumour invasion in early lung squamous carcinogenesis. Nature. Nature Publishing Group; 2019;571:570–5. [DOI] [PubMed] [Google Scholar]
- 7.van Boerdonk RAA, Smesseim I, Heideman DAM, Coupé VMH, Tio D, Grünberg K, et al. Close Surveillance with Long-Term Follow-up of Subjects with Preinvasive Endobronchial Lesions. Am J Respir Crit Care Med. 2015;192:1483–9. [DOI] [PubMed] [Google Scholar]
- 8.Merrick DT, Gao D, Miller YE, Keith RL, Baron AE, Feser W, et al. Persistence of Bronchial Dysplasia Is Associated with Development of Invasive Squamous Cell Carcinoma. Cancer Prev Res (Phila). 2016;9:96–104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Teixeira VH, Pipinikas CP, Pennycuick A, Lee-Six H, Chandrasekharan D, Beane J, et al. Deciphering the genomic, epigenomic, and transcriptomic landscapes of pre-invasive lung cancer lesions. Nat Med. Nature Publishing Group; 2019;25:517–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Beane JE, Mazzilli SA, Campbell JD, Duclos G, Krysan K, Moy C, et al. Molecular subtyping reveals immune alterations associated with progression of bronchial premalignant lesions. Nat Commun. Nature Publishing Group; 2019;10:1856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Pennycuick A, Teixeira VH, AbdulJabbar K, Raza SEA, Lund T, Akarca AU, et al. Immune Surveillance in Clinical Regression of Preinvasive Squamous Cell Lung Cancer. Cancer Discovery. 2020;10:1489–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Merrick DT, Edwards MG, Franklin WA, Sugita M, Keith RL, Miller YE, et al. Altered Cell-Cycle Control, Inflammation, and Adhesion in High-Risk Persistent Bronchial Dysplasia. Cancer Res. 2018;78:4971–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136:215–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mascaux C, Feser WJ, Lewis MT, Barón AE, Coldren CD, Merrick DT, et al. Endobronchial miRNAs as biomarkers in lung cancer chemoprevention. Cancer Prev Res (Phila). 2013;6:100–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Campbell JD, Liu G, Luo L, Xiao J, Gerrein J, Juan-Guardela B, et al. Assessment of microRNA differential expression and detection in multiplexed small RNA sequencing data. RNA. 2015;21:164–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Andrews S FastQC A Quality Control tool for High Throughput Sequence Data [Internet]. [cited 2019 Jun 22]. Available from: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ [Google Scholar]
- 17.Martin M Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17:10–2. [Google Scholar]
- 18.Kozomara A, Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014;42:D68–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Friedländer MR, Mackowiak SD, Li N, Chen W, Rajewsky N. miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res. 2012;40:37–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Law CW, Chen Y, Shi W, Smyth GK. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15:R29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang L, Wang S, Li W. RSeQC: quality control of RNA-seq experiments. Bioinformatics. 2012;28:2184–5. [DOI] [PubMed] [Google Scholar]
- 22.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Friedman RC, Farh KK-H, Burge CB, Bartel DP. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 2009;19:92–105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Li J-H, Liu S, Zhou H, Qu L-H, Yang J-H. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42:D92–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Chou C-H, Shrestha S, Yang C-D, Chang N-W, Lin Y-L, Liao K-W, et al. miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic Acids Res. 2018;46:D296–302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-Seq data. BMC Bioinformatics. 2013;14:7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Korotkevich G, Sukhov V, Budin N, Shpak B, Artyomov MN, Sergushichev A. Fast gene set enrichment analysis [Internet]. bioRxiv; 2021. [cited 2025 Jan 22]. page 060012. Available from: https://www.biorxiv.org/content/10.1101/060012v3 [Google Scholar]
- 30.Duclos GE, Teixeira VH, Autissier P, Gesthalter YB, Reinders-Luinge MA, Terrano R, et al. Characterizing smoking-induced transcriptional heterogeneity in the human bronchial epithelium at single-cell resolution. Sci Adv. 2019;5:eaaw3413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001;25:402–8. [DOI] [PubMed] [Google Scholar]
- 32.Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest Package: Tests in Linear Mixed Effects Models. Journal of Statistical Software. 2017;82:1–26. [Google Scholar]
- 33.Lenth R emmeans: Estimated Marginal Means, aka Least-Squares Means [Internet]. 2024. [cited 2024 Mar 22]. Available from: https://rvlenth.github.io/emmeans/ [Google Scholar]
- 34.Bankhead P, Loughrey MB, Fernández JA, Dombrowski Y, McArt DG, Dunne PD, et al. QuPath: Open source software for digital pathology image analysis. Sci Rep. 2017;7:16878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Weigert M, Schmidt U. Nuclei Instance Segmentation and Classification in Histopathology Images with Stardist. 2022 IEEE International Symposium on Biomedical Imaging Challenges (ISBIC) [Internet]. 2022. [cited 2025 Jan 22]. page 1–4. Available from: https://ieeexplore.ieee.org/abstract/document/9854534 [Google Scholar]
- 36.Windhager J, Zanotelli VRT, Schulz D, Meyer L, Daniel M, Bodenmiller B, et al. An end-to-end workflow for multiplexed image processing and analysis. Nat Protoc. Nature Publishing Group; 2023;18:3565–613. [DOI] [PubMed] [Google Scholar]
- 37.Pachitariu M, Stringer C. Cellpose 2.0: how to train your own model. Nat Methods. Nature Publishing Group; 2022;19:1634–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Haghverdi L, Lun ATL, Morgan MD, Marioni JC. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat Biotechnol. Nature Publishing Group; 2018;36:421–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Levine JH, Simonds EF, Bendall SC, Davis KL, Amir ED, Tadmor MD, et al. Data-Driven Phenotypic Dissection of AML Reveals Progenitor-like Cells that Correlate with Prognosis. Cell. 2015;162:184–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mangiola S, Roth-Schulze AJ, Trussart M, Zozaya-Valdés E, Ma M, Gao Z, et al. sccomp: Robust differential composition and variability analysis for single-cell data. Proceedings of the National Academy of Sciences. Proceedings of the National Academy of Sciences; 2023;120:e2203828120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Schapiro D, Jackson HW, Raghuraman S, Fischer JR, Zanotelli VRT, Schulz D, et al. histoCAT: analysis of cell phenotypes and interactions in multiplex image cytometry data. Nat Methods. 2017;14:873–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Chiu D, Pfefferkorn R, Sultan L, Green E, Constant M, Burks E, et al. Image data for paper “Epithelial miR-149 up-regulation is associated with immunosuppression via MHC- I in progressive bronchial premalignant lesions.” 2025. [cited 2025 Dec 10]; Available from: https://zenodo.org/records/17842537 [Google Scholar]
- 43.Pavel AB, Campbell JD, Liu G, Elashoff D, Dubinett S, Smith K, et al. Alterations in Bronchial Airway miRNA Expression for Lung Cancer Detection. Cancer Prev Res (Phila). 2017;10:651–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Whitney DH, Elashoff MR, Porta-Smith K, Gower AC, Vachani A, Ferguson JS, et al. Derivation of a bronchial genomic classifier for lung cancer in a prospective study of patients undergoing diagnostic bronchoscopy. BMC Med Genomics. 2015;8:18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.de Rie D, Abugessaisa I, Alam T, Arner E, Arner P, Ashoor H, et al. An integrated expression atlas of miRNAs and their promoters in human and mouse. Nat Biotechnol. 2017;35:872–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2016;44:e71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Silvestri GA, Vachani A, Whitney D, Elashoff M, Porta Smith K, Ferguson JS, et al. A Bronchial Genomic Classifier for the Diagnostic Evaluation of Lung Cancer. N Engl J Med. 2015;373:243–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Xu K, Shi X, Husted C, Hong R, Wang Y, Ning B, et al. Smoking Modulates Different Secretory Subpopulations Expressing SARS-CoV-2 Entry Genes in the Nasal and Bronchial Airways. Res Sq. 2021;rs.3.rs-887718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc. 2009;4:1184–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Dragomir M, Mafra ACP, Dias SMG, Vasilescu C, Calin GA. Using microRNA Networks to Understand Cancer. Int J Mol Sci. 2018;19:1871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Bracken CP, Scott HS, Goodall GJ. A network-biology perspective of microRNA function and dysfunction in cancer. Nat Rev Genet. 2016;17:719–32. [DOI] [PubMed] [Google Scholar]
- 52.Corney DC, Flesken-Nikitin A, Godwin AK, Wang W, Nikitin AY. MicroRNA-34b and MicroRNA-34c are targets of p53 and cooperate in control of cell proliferation and adhesion-independent growth. Cancer Res. 2007;67:8433–8. [DOI] [PubMed] [Google Scholar]
- 53.Marcet B, Chevalier B, Luxardi G, Coraux C, Zaragosi L-E, Cibois M, et al. Control of vertebrate multiciliogenesis by miR-449 through direct repression of the Delta/Notch pathway. Nat Cell Biol. 2011;13:693–9. [DOI] [PubMed] [Google Scholar]
- 54.Guan T, Dominguez CX, Amezquita RA, Laidlaw BJ, Cheng J, Henao-Mejia J, et al. ZEB1, ZEB2, and the miR-200 family form a counterregulatory network to regulate CD8+ T cell fates. J Exp Med. 2018;215:1153–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Schliekelman MJ, Gibbons DL, Faca VM, Creighton CJ, Rizvi ZH, Zhang Q, et al. Targets of the tumor suppressor miR-200 in regulation of the epithelial-mesenchymal transition in cancer. Cancer Res. 2011;71:7670–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.He Y, Yu D, Zhu L, Zhong S, Zhao J, Tang J. miR-149 in Human Cancer: A Systemic Review. J Cancer. 2018;9:375–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Meissner TB, Li A, Biswas A, Lee K-H, Liu Y-J, Bayir E, et al. NLR family member NLRC5 is a transcriptional regulator of MHC class I genes. Proc Natl Acad Sci U S A. 2010;107:13794–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kobayashi KS, van den Elsen PJ. NLRC5: a key regulator of MHC class I-dependent immune responses. Nat Rev Immunol. Nature Publishing Group; 2012;12:813–20. [DOI] [PubMed] [Google Scholar]
- 59.Yoshihama S, Vijayan S, Sidiq T, Kobayashi KS. NLRC5/CITA: A Key Player in Cancer Immune Surveillance. Trends Cancer. 2017;3:28–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ludigs K, Seguín-Estévez Q, Lemeille S, Ferrero I, Rota G, Chelbi S, et al. NLRC5 exclusively transactivates MHC class I and related genes through a distinctive SXY module. PLoS Genet. 2015;11:e1005088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Fernanda de Mello Costa M, Weiner AI, Vaughan AE. Basal-like Progenitor Cells: A Review of Dysplastic Alveolar Regeneration and Remodeling in Lung Repair. Stem Cell Reports. 2020;15:1015–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.He Y, Jiang X, Chen J. The role of miR-150 in normal and malignant hematopoiesis. Oncogene. 2014;33:3887–93. [DOI] [PubMed] [Google Scholar]
- 63.Ning B, Tilston-Lunel AM, Simonetti J, Hicks-Berthet J, Matschulat A, Pfefferkorn R, et al. Convergence of YAP/TAZ, TEAD and TP63 activity is associated with bronchial premalignant severity and progression. J Exp Clin Cancer Res. 2023;42:116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Yang C, Sun C, Liang X, Xie S, Huang J, Li D. Integrative analysis of microRNA and mRNA expression profiles in non-small-cell lung cancer. Cancer Gene Ther. Nature Publishing Group; 2016;23:90–7. [DOI] [PubMed] [Google Scholar]
- 65.Fujii T, Shimada K, Tatsumi Y, Fujimoto K, Konishi N. Syndecan-1 responsive microRNA-126 and 149 regulate cell proliferation in prostate cancer. Biochem Biophys Res Commun. 2015;456:183–9. [DOI] [PubMed] [Google Scholar]
- 66.Pfeffer SR, Grossmann KF, Cassidy PB, Yang CH, Fan M, Kopelovich L, et al. Detection of Exosomal miRNAs in the Plasma of Melanoma Patients. J Clin Med. 2015;4:2012–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Srivastava A, Luo L, Lohcharoenkal W, Meisgen F, Pasquali L, Pivarcsi A, et al. Cross-talk between IFN-γ and TWEAK through miR-149 amplifies skin inflammation in psoriasis. J Allergy Clin Immunol. 2021;147:2225–35. [DOI] [PubMed] [Google Scholar]
- 68.Galassi C, Chan TA, Vitale I, Galluzzi L. The hallmarks of cancer immune evasion. Cancer Cell. 2024;42:1825–63. [DOI] [PubMed] [Google Scholar]
- 69.McGranahan N, Rosenthal R, Hiley CT, Rowan AJ, Watkins TBK, Wilson GA, et al. Allele-Specific HLA Loss and Immune Escape in Lung Cancer Evolution. Cell. 2017;171:1259–1271.e11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Sade-Feldman M, Jiao YJ, Chen JH, Rooney MS, Barzily-Rokni M, Eliane J-P, et al. Resistance to checkpoint blockade therapy through inactivation of antigen presentation. Nat Commun. 2017;8:1136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Lawrence MS, Stojanov P, Mermel CH, Robinson JT, Garraway LA, Golub TR, et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature. 2014;505:495–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Griffin GK, Wu J, Iracheta-Vellve A, Patti JC, Hsu J, Davis T, et al. Epigenetic silencing by SETDB1 suppresses tumour intrinsic immunogenicity. Nature. 2021;595:309–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Burr ML, Sparbier CE, Chan KL, Chan Y-C, Kersbergen A, Lam EYN, et al. An Evolutionarily Conserved Function of Polycomb Silences the MHC Class I Antigen Presentation Pathway and Enables Immune Evasion in Cancer. Cancer Cell. 2019;36:385–401.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Hynds RE, Janes SM. Airway Basal Cell Heterogeneity and Lung Squamous Cell Carcinoma. Cancer Prev Res (Phila). 2017;10:491–3. [DOI] [PubMed] [Google Scholar]
- 75.Rock JR, Hogan BLM. Epithelial progenitor cells in lung development, maintenance, repair, and disease. Annu Rev Cell Dev Biol. 2011;27:493–512. [DOI] [PubMed] [Google Scholar]
- 76.Laughney AM, Hu J, Campbell NR, Bakhoum SF, Setty M, Lavallée V-P, et al. Regenerative lineages and immune-mediated pruning in lung cancer metastasis. Nat Med. 2020;26:259–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Volonté A, Di Tomaso T, Spinelli M, Todaro M, Sanvito F, Albarello L, et al. Cancer-initiating cells from colorectal cancer patients escape from T cell-mediated immunosurveillance in vitro through membrane-bound IL-4. J Immunol. 2014;192:523–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Yoshihama S, Roszik J, Downs I, Meissner TB, Vijayan S, Chapuy B, et al. NLRC5/MHC class I transactivator is a target for immune evasion in cancer. Proc Natl Acad Sci U S A. 2016;113:5999–6004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Gettinger S, Choi J, Hastings K, Truini A, Datar I, Sowell R, et al. Impaired HLA Class I Antigen Processing and Presentation as a Mechanism of Acquired Resistance to Immune Checkpoint Inhibitors in Lung Cancer. Cancer Discov. 2017;7:1420–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Tripathi SC, Peters HL, Taguchi A, Katayama H, Wang H, Momin A, et al. Immunoproteasome deficiency is a feature of non-small cell lung cancer with a mesenchymal phenotype and is associated with a poor outcome. Proceedings of the National Academy of Sciences. 2016;113:E1555–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Gu SS, Zhang W, Wang X, Jiang P, Traugh N, Li Z, et al. Therapeutically Increasing MHC-I Expression Potentiates Immune Checkpoint Blockade. Cancer Discov. 2021;11:1524–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Rupaimoole R, Slack FJ. MicroRNA therapeutics: towards a new era for the management of cancer and other diseases. Nat Rev Drug Discov. 2017;16:203–22. [DOI] [PubMed] [Google Scholar]
- 83.Ma L, Reinhardt F, Pan E, Soutschek J, Bhat B, Marcusson EG, et al. Therapeutic silencing of miR-10b inhibits metastasis in a mouse mammary tumor model. Nat Biotechnol. 2010;28:341–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Wang Y, Zhang Z, Yan Y, Lemon WJ, LaRegina M, Morrison C, et al. A chemically induced model for squamous cell carcinoma of the lung in mice: histopathology and strain susceptibility. Cancer Res. 2004;64:1647–54. [DOI] [PubMed] [Google Scholar]
- 85.Sahu P, Donovan C, Paudel KR, Pickles S, Chimankar V, Kim RY, et al. Pre-clinical lung squamous cell carcinoma mouse models to identify novel biomarkers and therapeutic interventions. Front Oncol. 2023;13:1260411. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Endobronchial Biopsy and Airway Brushing miRNA Expression Data.
Raw FASTQ files and the miRNA expression matrices and associated sample and clinical data are available on Gene Expression Omnibus accession number GSE313146 and the Human Tumor Atlas Network Data Portal (https://humantumoratlas.org/explore) using the identifications provided in Supplementary Table S5.
Endobronchial Biopsy Imaging Data.
H&E, miR-ISH, and IMC images are available on Zenodo (42) and the miR-ISH and IMC images are also available via the Human Tumor Atlas Network Data Portal (https://humantumoratlas.org/explore) using the identifications provided in Supplementary Table S5.
Endobronchial Biopsy and Airway Brushing Gene Expression Data.
Raw FASTQ files are available via dbGaP accession phs003185.v1.p1 and the gene expression count and residual matrices are available via Gene Expression Omnibus accession number GSE109743. Gene expression residual data from GSE109743 was used to construct miRNA-mRNA correlation network (10).
Additional Publicly Available Endobronchial Biopsy Data.
miRNA expression data from GSE93284 and gene expression from GSE66499 and GSE114489 were used to validate the miRNA-mRNA correlation and gene association with lesion outcome (12,43,44).
FANTOM5 Gene & miRNA Expression Data.
FANTOM5 miRNA expression profiles for 2595 mature miRNAs across 399 samples (directly downloaded from https://fantom.gsc.riken.jp/5/suppl/De_Rie_et_al_2017/ in November 2022) (45). The library size normalized (CPM) miRNA expression matrix was log2-transformed with pseudocount of 0.1. Matched gene expression profiles in transcripts per million (TPM; N=394) were downloaded from the same website and were used to examine the miRNA-mRNA correlation by cell type. and the tag count of the most supported peak (p1@NLRC5) was used for the expression of NLRC5 per sample. The TPM values were log2-transformed with pseudocount of 0.1 for correlation analysis.
TCGA-LUSC mRNA & miRNA Expression Data.
Sample matched gene and miRNA expression data for the TCGA LUSC (N=475) were also used to validate the miRNA-mRNA correlation. We also examined differences between has-miR-149–5p and NLRC5 in matched TCGA LUSC tumor tissue and adjacent benign tissue (n=45 paired samples for miRNA and n=51 paired samples for mRNA). Gene (legacy) and miRNA expression data of the primary tumor samples were obtained using TCGAbiolink (46) in October 2022. Counts associated with the same mature miRNA transcript were aggregated. miRNA counts normalization was performed as described above. The Pearson correlation coefficients were then calculated between miRNA and mRNA data on expression residuals adjusting for the plate as previously described (10).
Gene and miRNA expression from bronchial brushings from the Airway Epithelial Gene Expression in the Diagnosis of Lung Cancer (AEGIS) trials.
Gene expression and miRNA expression data from sample matched mainstem bronchus brushing samples (N=341) were obtained from AEGIS clinical trials. Gene-level expression data was obtained from GSE66499 (47) and was processed as previously described (48). miRNA sequencing data was obtained from GSE93284 (43).
miRNA Target Database.
Three target databases for miRNA target information were used. For TargetScan v7.2 (24), all conserved miRNA sites were downloaded from targetscan.org. miRNAs with an additional suffix (.1 or .2) compared to miRbase were both included. For starBase (25), all predictions for hg19 were downloaded from starbase.sysu.edu.cn on December 2022 using the API and were filtered for clipExpNum >= 5. For miRTarBase v7.0 (26), the HGNC symbols were translated to Ensembl IDs (GRCh37) using biomaRt (49).
