Abstract
SWI/SNF complexes are major targets of mutations in cancer. Here, we combined multiple “-omics” methods to assess SWI/SNF composition and aberrations in LUAD. Mutations in lung SWI/SNF subunits were highly recurrent in our LUAD cohort (41.4%), and over 70% of the mutations were predicted to have functional impact. Furthermore, SWI/SNF expression in LUAD suffered an overall repression that could not be explained exclusively by genetic alterations. Finally, SWI/SNF mutations were associated with poorer overall survival in TCGA-LUAD. We propose SWI/SNF-mutant LUAD as a separate clinical subgroup with practical implications.
Supplementary Information
The online version contains supplementary material available at 10.1186/s13148-022-01261-3.
Keywords: Multi-omics, Lung cancer, Lung adenocarcinoma, SWI/SNF complex, Epigenetics, Prognosis
Introduction
Lung adenocarcinoma (LUAD) is the main histological subtype of lung cancer, which is currently the deadliest cancer worldwide [1]. The poor outcome of LUAD patients may be improved by an early diagnosis and a personalized clinical approach, both of which can be facilitated by next-generation sequencing (NGS).
NGS studies have identified that the multiprotein complex SWI/SNF (SWitch/Sucrose Non-Fermentable) is mutated in almost 25% of human neoplasias [2, 3]. Before the era of NGS, our group discovered that SMARCA4 is frequently inactivated by truncating mutations in LUAD [4–6]. Together with SMARCA4, other SWI/SNF subunits, such as ARID1A, are recurrently mutated in LUAD and considered as LUAD driver genes [7]. Recently, our group has shown that more than 76% of LUAD cell lines have at least one mutated SWI/SNF subunit [6]. However, the exact composition of the SWI/SNF complex in LUAD is currently unknown and we lack an integration of the genetic and transcriptional profile of this complex in order to facilitate a practical transfer to the clinic.
For all these reasons, we aimed to identify the proteins that form the SWI/SNF complexes in lung epithelial cells as well as their molecular alterations in LUAD primary tumors, and more importantly, the clinical application of this in-depth study.
Methods
Characteristics of lung adenocarcinoma patients
DNA and RNA from 70 lung adenocarcinoma (LUAD) tumors and their paired normal adjacent tissues were obtained from the Basque Biobank (www.biobancovasco.org) and were processed following standard operating procedures. Lung adenocarcinoma patients were diagnosed from August 2008 to January 2016. The main characteristics of these 70 patients are shown in Additional file 2: Table S1. More information about this patient cohort is detailed in Additional file 1: Supplementary Methods.
Cell culture
Normal bronchial epithelial cells, NL20, were grown under standard culture conditions (37ºC, 5% carbon dioxide) in Ham’s F12 medium with 4%FBS, 2.0 mM L-glutamine, 1.5 g/L sodium bicarbonate, 2.7 g/L glucose, 0.1 mM nonessential amino acids, 1 μg/mL transferrin, 5 μg/mL insulin, 10 ng/ml EGF, and 500 ng/mL hydrocortisone.
Gene capture and targeted sequencing
The baits for the gene capture were designed using the NimbleDesign software (Roche, v4.0). The baits were targeted against 20 SWI/SNF genes and the top 10 LUAD drivers identified by Bailey and colleagues [7] (Additional file 2: Table S2). We included the known LUAD drivers as positive controls (see Additional file 1: Supplementary Methods).
Deep sequencing data analysis
We aligned the raw reads to the hg38 human genome using BWA-MEM. Details on the pipelines, software versions and external data sources are discussed in Additional file 1: Supplementary Methods.
Real-time quantitative polymerase chain reaction
Real-time quantitative PCR (RT-qPCR) was optimized using the Applied Biosystems 7900HT Real-Time PCR System with cDNA prepared after a reverse transcription of 1 μg total RNA (RevertAid RT Kit, Thermo Scientific). All qPCR reactions followed the KAPA SYBR® FAST qPCR Master Mix recommendations. Relative expression was calculated using GAPDH as housekeeping gene and applying the DDCt method. Primers for each gene are shown in Additional file 2: Table S3. All experiments were carried out in duplicate or triplicate.
Immunoprecipitation
5 mg of protein from NL20 lysates was immunoprecipitated following the conditions detailed in Additional file 1: Supplementary Methods.
Mass spectrometry
LC–MS/MS was done by coupling an UltiMate 3000 HPLC system to a Q Exactive Plus mass spectrometer (Thermo Fisher Scientific) (see Additional file 1: Supplementary Methods).
Mass spectrometry data analysis
Raw files were processed with MaxQuant (v 1.6.2.6a) using the standard settings against a human protein database (UniProtKB/Swiss-Prot, 20,373 sequences) supplemented with contaminants (more information in Additional file 1: Supplementary Methods).
In silico analysis of the SWI/SNF complex in lung adenocarcinoma patients
We downloaded mutation, gene expression, and clinical data of LUAD patients from The Cancer Genome Atlas (TCGA-LUAD project, last updated October 1, 2019). Analyses are detailed in Additional file 1: Supplementary Methods.
Statistical analyses
Unless otherwise specified, all statistical analyses were performed using R (version 3.6.1). Normality of the data was assessed using quantile–quantile plots and data transformations and statistical tests were chosen accordingly. For more details about the statistical analyses, see Additional file 1: Supplementary Methods.
Results
Although tissue specificity is a widely known trait of the SWI/SNF complex, no studies have analyzed SWI/SNF composition in a lung epithelial cell model [8]. For this reason, first we aimed to identify which subunits constitute the SWI/SNF complex in lung epithelial cells. We performed an endogenous immunoprecipitation of SMARCA4 followed by liquid chromatography-tandem mass spectrometry/mass spectrometry (LC–MS/MS) in NL20, a non-tumorigenic bronchial epithelial cell line. Twenty SWI/SNF subunits were pulled down along with SMARCA4 (Additional file 1: Fig. S1, Additional file 2: Tables S4–S5). From now on, we will refer to the SWI/SNF subunits that were identified in the immunoprecipitation, plus SMARCA4 and SMARCA2, as “lung SWI/SNF subunits.”
To examine the mutational status of the lung SWI/SNF subunits, we performed targeted DNA sequencing in seventy LUAD primary tumors and twenty-seven of the matched normal adjacent samples. We used the information from a paired analysis on the twenty-seven matched tumor-normal pairs to optimize a pipeline for unpaired mutation calling in the seventy primary LUAD tumors (Additional file 1: Supplementary Methods and Supplementary Note). We analyzed the twenty lung SWI/SNF subunits that had good quality sequencing. In these lung SWI/SNF subunits, we found 38 point mutations and small indels in our LUAD patient cohort (N = 70). Twenty-nine (41.4%) of the primary tumors harbored at least one mutation in a lung SWI/SNF subunit (Fig. 1A). SMARCA4 was the most commonly mutated SWI/SNF gene (11.4% of samples), followed by ARID1A (8.6%), ARID2 (7.1%), ARID1B (4.3%), and PBRM1 (4.3%).
Next, to investigate the mutation frequencies in external LUAD cohorts, we examined publicly available data from TCGA-LUAD (last updated on October 1, 2019. N = 567). The distributions of clinical parameters were comparable between the two cohorts (Additional file 2: Table S1). Our cohort showed similar but slightly higher mutation frequencies in the lung SWI/SNF genes (Additional file 1: Fig. S2A). Overall, the total mutation frequency of the SWI/SNF complex was 41.4% in our cohort and 30.0% in TCGA-LUAD, possibly due to a greater coverage in our protocol or to differences in data analysis protocols. Furthermore, regardless of the cohort, SMARCA4, ARID1A, and ARID2 were the SWI/SNF subunits that accumulated the highest number of truncating mutations.
To predict the functional impact of missense mutations, we used the SIFT algorithm [9] (Additional file 1: Supplementary Methods and Fig. S2B). Based on SIFT predictions, more than half of the missense mutations in our cohort (64%, 16/25) and in the external data (65%, 103/159) were “deleterious.” Overall, considering the truncating mutations and the predicted deleterious missense mutations, more than 70% of the SWI/SNF mutations may have a functional impact.
To complement our mutational study, we analyzed the mRNA levels of the lung SWI/SNF subunits in our cohort. To measure expression accurately, we used RT-qPCR. We found that all lung SWI/SNF subunits were significantly downregulated in LUAD primary tumors compared to their matched normal adjacent samples (FDR-adjusted p < 0.05, Additional file 1: Fig. S3). We set a fold change threshold of + 2/− 2 between the tumor and the paired normal sample to consider a subunit to be up- or downregulated, respectively. Remarkably, most lung SWI/SNF subunits consistently showed lower expression in most tumors when compared to their paired normal tissues (Fig. 1B, Additional file 1: Fig. S3). We found 42 tumors (60%) that had more than 10 downregulated subunits. On average, each lung SWI/SNF subunit was downregulated in 57% of LUAD patients. The top downregulated SWI/SNF subunit was SMARCA2 (82% of the cases). Similar results have been observed in other tumors where SMARCA2 was found to be epigenetically repressed [10–12]. Moreover, none of the top 5 downregulated subunits (SMARCA2, DPF2, SMARCD3, PHF10 and SMARCD1) were among the top 5 most frequently mutated subunits. More generally, only 5/11 (45.5%) truncating mutations and 13/23 (56.5%) missense mutations were associated with more than a twofold decrease in expression. Overall, these findings suggest a profound silencing in the expression of the whole SWI/SNF machinery in LUAD and that genetic alterations are not the only cause of SWI/SNF inactivation.
In our mutational analysis, we also observed that SWI/SNF-mutant tumors from TCGA-LUAD showed a significantly higher Tumor Mutation Burden (TMB) than SWI/SNF-wild type tumors (p < 0.05) (Fig. 2A). Furthermore, we evaluated whether the mutational status of the lung SWI/SNF subunits was associated with LUAD overall survival in the TCGA-LUAD cohort. To select variables for a multivariate Cox analysis, we first performed univariate Cox analyses on each of the variables under study and we selected those with p < 0.2. We considered mutations in SWI/SNF and LUAD driver genes, TMB, and other clinically relevant covariates (Additional file 1: Supplementary Methods). In the univariate analysis, none of the individual SWI/SNF subunits were significantly associated with overall survival (OS), but SWI/SNF mutations altogether were significantly associated with poorer OS (HR = 1.42; 95% CI: 1.04–1.93; p = 2.5·10–2) (Fig. 2B and C). These observations led us to consider the SWI/SNF complex as a single functional unit. Mutations in none of the top 10 LUAD driver genes from Bailey et al. [7] were significantly associated with OS (Additional file 1: Fig. S4A–J). Next, all variables with p < 0.2 in the univariate analysis were used for a multivariate analysis. According to this analysis, the SWI/SNF mutational status is an independent prognostic factor associated to shorter OS in LUAD patients (HR = 1.45; 95% CI: 1.05—2.01; p = 2.56·10–2) (Fig. 2D). Therefore, the lung SWI/SNF mutational status distinguishes between two clinically different subgroups.
Discussion
In this work, we have analyzed the status of the SWI/SNF complex in LUAD by combining multiple “-omics” approaches. The distribution of SWI/SNF mutations in our cohort was comparable to that of TCGA-LUAD, and over 70% of the identified mutations were predicted to have functional impact. These results support the key role of SWI/SNF in cancer [2, 3, 13].
To date, although downregulation events have previously been described in certain SWI/SNF subunits [14, 15], our study is the first one that reveals a general downregulation of the whole SWI/SNF complex in LUAD. The functional implications that this observation could have in tumorigenesis are supported by many years of study of the SWI/SNF complex in several tumor types and biological contexts (reviewed in [2]).
Interestingly, we also observed that SWI/SNF-mutant tumors had a higher TMB than SWI/SNF-wild type tumors, supporting the function of the SWI/SNF complex in maintenance of genome integrity [16, 17]. Moreover, this observation highlights the potential use of the mutational status of the SWI/SNF complex as a biomarker of response to immune checkpoint inhibitors, which benefits from elevated tumor mutation burdens [18–20]. Importantly, we have shown for the first time that, when evaluated as a whole, SWI/SNF complex mutations correlate with poor prognosis in LUAD. On the other hand, mutations in well-known biomarkers such as EGFR and KRAS were not significantly associated with overall survival in the analyzed cohort. This reinforces the clinical relevance of analyzing SWI/SNF mutations in LUAD alongside other established prognostic factors. Indeed, the mutational status of the lung SWI/SNF complex was an independent prognostic factor when evaluated alongside the TMB and other clinical variables commonly associated with survival.
Overall, we propose the lung SWI/SNF as a functional unit whose recurrent mutations predict a worse clinical outcome. Moreover, there is a major downregulation of the SWI/SNF complex in LUAD that can only be partly attributed to mutations. Taken together, our findings highlight a major role of genetic and epigenetic alterations in the SWI/SNF complex in LUAD that can have clinical applications.
Supplementary Information
Acknowledgements
We thank the PhD program in Biochemistry and Molecular Biology of the University of Granada.
Abbreviations
- FDR
False Discovery Rate
- LC–MS/MS
Liquid Chromatography-tandem Mass Spectrometry/Mass Spectrometry
- LUAD
Lung adenocarcinoma
- IP
Immunoprecipitation
- RT-qPCR
Real-time quantitative polymerase chain reaction
- SWI/SNF
SWitch/Sucrose Non-Fermentable
- TCGA
The Cancer Genome Atlas
- TMB
Tumor mutation burden
Authors' contributions
PPM conceived the study and provided supervision, resources, expertise, and feedback. PP, MC, MIR, and IFC performed and validated the experimental data. CL and MS acquired clinical data of tumor patients. EZ and JM performed IP-Mass and analyzed the proteomic data. AA, DJG, MSBC, and CC performed all bioinformatic and statistical analysis. PP, AA, MC, and PPM wrote the draft. All authors read and approved the final manuscript.
Funding
PPM’s lab is funded by the Ministry of Economy of Spain (SAF2015-67919-R), Junta de Andalucía (P20-00688, PI-0135-2020, PIGE-0213-2020, PIGE-0440-2019, PI-0245-2017), University of Granada (B-CTS-480-UGR20), International Association for the Study of Lung Cancer (IASLC), and Spanish Association for Cancer Research (LAB-AECC-2018). PP is supported by a PhD “La Caixa Foundation” LCF/BQ/DE15/10360019 Fellowship. AA is supported by an FPU17/00067 fellowship. IFC was supported by a PhD FPI-fellowship (BES-2013-064596). DJG was supported by a “Fundación Benéfica Anticáncer Santa Cándida y San Francisco Javier” predoctoral fellowship. MSBC and CC’s work is supported by the project DPI2017-84439-R Ministry of Economy of Spain and FEDER and by the fellowship “Beca de Iniciación a la Investigación del Plan Propio de Investigación 2019” by University of Granada. MSBC is supported by an FPU19/00576 predoctoral fellowship. CNIO Proteomics Unit is a member of Proteored PRB3 and is supported by grant PT17/0019, of the PE I + D + i 2013–2016, funded by ISCIII and ERDF.
Availability of data and materials
Human DNA sequencing data have been uploaded to the European Genome-phenome Archive (EGA) under the accession EGAD00001005930. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD017397. The results published here are in part based upon data generated by The Cancer Genome Atlas (TCGA) managed by the NCI and NHGRI. Information about TCGA can be found at http://cancergenome.nih.gov. TCGA-LUAD can be accessed at the Genomic Data Commons Data Portal (https://portal.gdc.cancer.gov/).
Declarations
Ethics approval and consent to participate
The study was approved by the Research Ethics Committee of Granada (CEI Granada, Department of Health, Government of Andalucía) and from the Basque Foundation for Health Innovation and Research, Spain. Signed informed consent, following the procedures of the Declaration of Helsinki and institutional and national guidelines, was obtained from all participants.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Paola Peinado, Alvaro Andrades, Marta Cuadros, and Maria Isabel Rodriguez have contributed equally to this work
References
- 1.Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
- 2.Mittal P, Roberts CWM. The SWI/SNF complex in cancer—biology, biomarkers and therapy. Nat Rev Clin Oncol. 2020:1–14. [DOI] [PMC free article] [PubMed]
- 3.Kadoch C, Hargreaves DC, Hodges C, Elias L, Ho L, Ranish J, et al. Proteomic and bioinformatic analysis of mammalian SWI/SNF complexes identifies extensive roles in human malignancy. Nat Genet. 2013;45(6):592–601. doi: 10.1038/ng.2628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Medina PP, Carretero J, Fraga MF, Esteller M, Sidransky D, Sanchez-Cespedes M. Genetic and epigenetic screening for gene alterations of the chromatin-remodeling factor, SMARCA4/BRG1, in lung tumors. Genes Chromosomes Cancer. 2004;41(2):170–177. doi: 10.1002/gcc.20068. [DOI] [PubMed] [Google Scholar]
- 5.Medina PP, Romero OA, Kohno T, Montuenga LM, Pio R, Yokota J, et al. Frequent BRG1/SMARCA4-inactivating mutations in human lung cancer cell lines. Hum Mutat. 2008;29(5):617–622. doi: 10.1002/humu.20730. [DOI] [PubMed] [Google Scholar]
- 6.Peinado P, Andrades A, Cuadros M, Rodriguez MI, Coira IF, Garcia DJ, et al. Comprehensive analysis of SWI/SNF inactivation in lung adenocarcinoma cell models. Cancers. 2020;12(12):3712–3713. doi: 10.3390/cancers12123712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bailey MH, Tokheim C, Porta-Pardo E, Sengupta S, Bertrand D, Weerasinghe A, et al. Comprehensive characterization of cancer driver genes and mutations. Cell. 2018;173(2):371–6.e18. doi: 10.1016/j.cell.2018.02.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mashtalir N, D’Avino AR, Michel BC, Luo J, Pan J, Otto JE, et al. Modular organization and assembly of SWI/SNF family chromatin remodeling complexes. Cell. 2018;175(5):1272–1288. doi: 10.1016/j.cell.2018.09.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sim N-L, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Research. 2012;40(Web Server issue):W452–7. [DOI] [PMC free article] [PubMed]
- 10.Glaros S, Cirrincione GM, Muchardt C, Kleer CG, Michael CW, Reisman D. The reversible epigenetic silencing of BRM: implications for clinical targeted therapy. Oncogene. 2007;26(49):7058–7066. doi: 10.1038/sj.onc.1210514. [DOI] [PubMed] [Google Scholar]
- 11.Mizutani T, Ito T, Nishina M, Yamamichi N, Watanabe A, Iba H. Maintenance of integrated proviral gene expression requires Brm, a catalytic subunit of SWI/SNF complex. J Biol Chem. 2002;277(18):15859–15864. doi: 10.1074/jbc.M112421200. [DOI] [PubMed] [Google Scholar]
- 12.Yamamichi N, Yamamichi-Nishina M, Mizutani T, Watanabe H, Minoguchi S, Kobayashi N, et al. The Brm gene suppressed at the post-transcriptional level in various human cell lines is inducible by transient HDAC inhibitor treatment, which exhibits antioncogenic potential. Oncogene. 2005;24(35):5471–5481. doi: 10.1038/sj.onc.1208716. [DOI] [PubMed] [Google Scholar]
- 13.Shain AH, Pollack JR. The spectrum of SWI/SNF mutations, ubiquitous in human cancers. PLOS one. 2013;8(1):e55119. [DOI] [PMC free article] [PubMed]
- 14.Coira IF, Rufino-Palomares EE, Romero OA, Peinado P, Metheetrairut C, Boyero-Corral L, et al. Expression inactivation of SMARCA4 by microRNAs in lung tumors. Hum Mol Genet. 2015;24(5):1400–1409. doi: 10.1093/hmg/ddu554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Marquez SB, Thompson KW, Lu L, Reisman D. Beyond mutations: additional mechanisms and implications of SWI/SNF complex inactivation. Front Oncol. 2015;4:372-. [DOI] [PMC free article] [PubMed]
- 16.Bayona-Feliu A, Barroso S, Muñoz S, Aguilera A. The SWI/SNF chromatin remodeling complex helps resolve R-loop-mediated transcription–replication conflicts. Nat Genet. 2021;53(7):1050–1063. doi: 10.1038/s41588-021-00867-2. [DOI] [PubMed] [Google Scholar]
- 17.Ribeiro-Silva C, Vermeulen W, Lans H. SWI/SNF: Complex complexes in genome stability and cancer. DNA Repair. 2019;77:87–95. doi: 10.1016/j.dnarep.2019.03.007. [DOI] [PubMed] [Google Scholar]
- 18.Naito T, Udagawa H, Umemura S, Sakai T, Zenke Y, Kirita K, et al. Non-small cell lung cancer with loss of expression of the SWI/SNF complex is associated with aggressive clinicopathological features, PD-L1-positive status, and high tumor mutation burden. Lung cancer (Amsterdam, Netherlands) 2019;138:35–42. doi: 10.1016/j.lungcan.2019.10.009. [DOI] [PubMed] [Google Scholar]
- 19.Pan D, Kobayashi A, Jiang P, Ferrari de Andrade L, Tay RE, Luoma AM, et al. A major chromatin regulator determines resistance of tumor cells to T cell-mediated killing. Science. 2018;359(6377):770–5. [DOI] [PMC free article] [PubMed]
- 20.Shen J, Ju Z, Zhao W, Wang L, Peng Y, Ge Z, et al. ARID1A deficiency promotes mutability and potentiates therapeutic antitumor immunity unleashed by immune checkpoint blockade. Nat Med. 2018;24(5):556–562. doi: 10.1038/s41591-018-0012-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hoadley KA, Hinoue T, Wolf DM, Lazar AJ, Drill E, Shen R, et al. Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer. Cell. 2018;173(2):291–304.e6. doi: 10.1016/j.cell.2018.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Human DNA sequencing data have been uploaded to the European Genome-phenome Archive (EGA) under the accession EGAD00001005930. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD017397. The results published here are in part based upon data generated by The Cancer Genome Atlas (TCGA) managed by the NCI and NHGRI. Information about TCGA can be found at http://cancergenome.nih.gov. TCGA-LUAD can be accessed at the Genomic Data Commons Data Portal (https://portal.gdc.cancer.gov/).