Abstract
SF3B1 is the most commonly mutated RNA splicing factor in cancer1–4, but the mechanisms by which SF3B1 mutations promote malignancy are poorly understood. Here, we integrated pan-cancer splicing analyses with a positive enrichment CRISPR screen to prioritize splicing alterations that promote tumorigenesis. We report that diverse SF3B1 mutations converge on repression of BRD9, a core component of the recently described GLTSCR1/1L-containing non-canonical BAF (ncBAF) chromatin remodeling complex5–7. Mutant SF3B1 recognizes an aberrant deep intronic branchpoint within BRD9, thereby inducing inclusion of an endogenous retroviral element-derived poison exon and BRD9 mRNA degradation. BRD9 depletion causes loss of ncBAF at CTCF-associated loci and promotes melanomagenesis. BRD9 is a potent tumor suppressor in uveal melanoma (UVM), such that correcting BRD9 mis-splicing in SF3B1-mutant cells with antisense oligonucleotides (ASOs) or CRISPR-directed mutagenesis suppresses tumor growth. Our results implicate ncBAF disruption in the diverse cancers carrying SF3B1 mutations and suggest a mechanism-based therapeutic for these malignancies.
SF3B1 is subject to recurrent missense mutations at specific residues in myeloid1,2 and lymphoid3,8 leukemias as well as solid tumors, at rates of up to 14-29% (UVM9–12) and 65-83% (myelodysplastic syndromes with ring sideroblasts1,2). Consistent with SF3B1’s critical role in 3’ splice site (3’ss) recognition13, several studies reported that SF3B1 mutations induce widespread usage of abnormal 3’ss10,14,15. Although many mis-spliced genes have been identified in SF3B1-mutant samples, few have been functionally implicated in driving disease.
We hypothesized that effectors of SF3B1 mutations’ pro-tumorigenic effects might appear as pan-cancer targets of mutant SF3B1. We accordingly identified mis-spliced events shared between erythroleukemic (K562) and UVM (MEL270) cells expressing wild-type (WT) SF3B1 or the most common SF3B1 mutation (SF3B1K700E). A compact set of 40 events exhibited concordant splicing changes and was sufficient to infer SF3B1 mutational status across 249 chronic lymphocytic leukemia (CLL), MDS, and UVM samples (Fig. 1a, Extended Data Fig. 1a, Supplementary Tables 1–3).
We designed a single guide RNA (sgRNA) library targeting both pan-cancer and cancer type-specific targets of mutant SF3B1, focusing on genes for which SF3B1 mutations are predicted to cause mis-splicing that triggers nonsense-mediated RNA decay (NMD; Fig. 1b, Supplementary Table 4). We tested whether knockout of any such gene promoted transformation of Ba/F3 cells (a spliceosome-WT cell line whose requirement for IL-3 can be overcome by oncogenic lesions; Fig. 1c). In addition to the positive control Pten, our screen revealed that Brd9 loss promoted Ba/F3 transformation (Fig. 1d, Extended Data Fig. 1b–d, Supplementary Tables 5–6). Brd9 was a notable hit because BRD9 exhibited striking mis-splicing in all cancer cohorts (Fig. 1e). Brd9 knockout conferred cytokine independence to 32Dcl3 cells and growth advantage to spliceosome-WT UVM, cutaneous melanoma, and pancreatic cancer cells (Extended Data Fig. 1d–f). In contrast, MLL-rearranged acute myeloid leukemia (AML) cells required BRD9 for growth (Extended Data Fig. 1g), as previously reported16.
SF3B1 mutations cause exonization of a BRD9 intronic sequence, resulting in inclusion of a poison exon that interrupts BRD9’s open reading frame. This BRD9 poison exon is derived from a primate-specific endogenous retroviral element, explaining its absence from mice (Extended Data Fig. 1h–i). We confirmed that poison exon inclusion was induced by expression of endogenous or ectopic mutant SF3B1 in K562 and NALM-6 cells, while SF3B1 knockdown (KD) in SF3B1-WT cells had no effect (Extended Data Fig. 1j–m). The poison exon was included in an SF3B1 mutation-dependent manner in diverse cell lines and CLL, MDS, and UVM samples bearing 19 different SF3B1 mutations, but not healthy tissues (Extended Data Fig. 1m–p, Supplementary Table 7).
BRD9 poison exon inclusion triggered NMD and reduced BRD9 mRNA half-life and full-length BRD9 protein (Extended Data Fig. 1q–w). SF3B1-mutant patients exhibited reduced total BRD9 mRNA levels relative to WT patients (Extended Data Fig. 1x). We tested whether poison exon inclusion could result in production of C-terminally truncated BRD9 by knocking an N-terminal HA tag into the BRD9 locus in MEL270 and K562 cells transgenically expressing WT or mutant SF3B1 (Extended Data Fig. 2a–c). Mutant SF3B1 suppressed full-length BRD9 levels without generating a truncated BRD9 protein (Fig. 1f).
SF3B1 mutations promote cryptic 3’ss usage10,14,15, likely by altering SF3B1’s normal role in branchpoint recognition17. We therefore mapped BRD9 branchpoints used in K562, MEL270, and T47D (breast cancer) cells expressing mutant SF3B1 (Fig. 2a, Extended Data Fig. 2d–f). Poison exon inclusion was associated with an unusually close branchpoint (close branchpoints are rare and normally inefficiently recognized18). Mutating the aberrant branchpoint abolished poison exon recognition (Fig. 2b, Extended Data Fig. 2g). Consistent with the poison exon’s lack of an obvious polypyrimidine tract, neither U2AF1 nor U2AF2 KD compromised poison exon recognition, while introducing a poly(Y) tract resulted in robust poison exon inclusion even in WT cells (Fig. 2b, Extended Data Fig. 2h–j). Finally, we identified a putative exonic splicing enhancer (ESE) that was essential for poison exon inclusion (Fig. 2c, Extended Data Fig. 2k). We confirmed the essentiality of the aberrant branchpoint, lack of a polypyrimidine tract, and ESE for poison exon recognition in the context of SF3B1R625H, the most common mutation in UVM (Extended Data Fig. 2l–n). Disrupting the poison exon’s 3’ss and/or ESE with CRISPR-directed mutagenesis dramatically increased BRD9 protein in SF3B1-mutant UVM cells (Fig. 2d, Extended Data Fig. 2o, Supplementary Table 8), but had no effect on BRD9 splicing or expression in SF3B1-WT cells (Extended Data Fig. 2p–r).
Several studies recently described BRD9 as part of ncBAF, which is biochemically distinct from canonical BAF (cBAF) and polybromo-associated BAF (PBAF)5–7 (Fig. 3a). Although ncBAF is not recurrently mutated in cancer—unlike cBAF and PBAF (Extended Data Fig. 3a)—our data suggested that ncBAF is nonetheless frequently disrupted via SF3B1 mutations.
We investigated the consequences of BRD9 loss by SF3B1 mutations for ncBAF function. Immunoprecipitation and mass spectrometry to identify BRD9’s chromatin-associated interaction partners in K562 cells specifically recovered ncBAF components (Extended Data Fig. 3b–c, Supplementary Table 9). We confirmed these results by immunoblotting against shared and complex-specific components of cBAF, PBAF, and ncBAF in K562 and UVM cells (Fig. 3b, Extended Data Fig. 3d). Expression of mutant, but not WT, SF3B1 reduced BRD9 protein levels and abolished BRG1-GLTSCR1 interactions while leaving BRG1-BAF155 interactions intact, indicating that SF3B1 mutations specifically perturb ncBAF rather than disrupting all BAF complexes (Fig. 3c, Extended Data Fig. 3e). Chemical degradation of BRD9 (dBRD9)19 or BRD9 knockout similarly reduced BRG1-GLTSCR1 interaction (Fig. 3c, Extended Data Fig. 3f). We next identified BRD9 domains necessary for ncBAF formation by generating 3xFLAG-BRD9 deletion mutants and testing for interactions with GLTSCR1/1L. These experiments revealed that BRD9’s DUF3512 domain mediates its interaction with GLTSCR1/1L (Extended Data Fig. 3g–h).
We next determined how SF3B1 mutations altered ncBAF localization to chromatin. We mapped genome-wide binding of the pan-BAF component BRG1 and ncBAF-specific components BRD9 and GLTSCR1 in MEL270 cells expressing WT or mutant SF3B1. We additionally performed the same ChIP-seq experiments following treatment with DMSO or dBRD9 in order to identify BRD9-dependent effects. BRD9 and GLTSCR1 exhibited significant co-localization, consistent with their mutual requirement for ncBAF formation, and were found at a subset of the loci bound by BRG1 (Fig. 3d). BRD9 and GLTSCR1 bound promoters, gene bodies, and likely enhancers, with focal binding at promoters relative to BRG1 (Fig. 3e, Extended Data Fig. 4a). CTCF motifs exhibited striking co-localization with GLTSCR1, but only modest co-localization with BRG1 (Fig. 3f, Extended Data Fig. 4b).
We then tested how depletion of BRD9 (induced by SF3B1K700E or dBRD9) altered ncBAF localization. We defined the genomic loci bound by GLTSCR1 in all samples as constitutive sites. Conversely, we defined genomic loci bound by GLTSCR1 in both control (WT SF3B1 or DMSO) but not BRD9-depleted (mutant SF3B1 or dBRD9) samples as BRD9-sensitive sites (Extended Data Fig. 4c). GLTSCR1 peaks were more BRD9-sensitive than BRG1 peaks and CTCF motifs were uniquely enriched in BRD9-sensitive loci (p < 10−8) versus constitutive GLTSCR1-bound loci (Extended Data Fig. 4d). CTCF was similarly highly enriched at BRG1-bound loci that were BRD9-sensitive (p < 10−55; Extended Data Fig. 4e–f). We conclude that BRD9 loss induced by SF3B1 mutations causes specific loss of ncBAF at CTCF-associated loci.
We identified genes with BRD9-sensitive ncBAF binding in their promoters or enhancers to find that BRD9 loss in UVM preferentially affected genes involved in apoptosis and cell growth, adhesion, and migration (Extended Data Fig. 4g). To understand how BRD9 loss altered gene expression, we identified genes whose promoters exhibited BRD9-sensitive ncBAF binding and which were differentially expressed in SF3B1-mutant versus WT UVM patients. Loss of ncBAF binding was associated with promotion as well as repression of gene expression, suggesting that ncBAF, like other SWI/SNF complexes, plays both activating and repressive roles20 (Extended Data Fig. 4h–j).
Several recent studies reported that BRD9 is required for the survival of certain cancers, particularly those with mutations affecting PBAF and cBAF6,16,21. Because BRD9 loss conferred a proliferative advantage to cancer types with recurrent SF3B1 mutations (Fig. 1d, Extended Data Fig. 1), we wondered whether normalizing BRD9 levels might suppress growth of SF3B1-mutant cells.
As SF3B1 is recurrently mutated in uveal (Fig. 4a), mucosal, and cutaneous melanomas, we first tested whether BRD9 loss induced melanomagenesis in vivo. We transduced non-tumorigenic murine melanocytes (Melan-a cells), which require oncoprotein expression for sustained growth, with a non-targeting shRNA (Extended Data Fig. 5a), doxycycline (dox)-inducible shRNAs targeting Brd9 or Brg1, or a cDNA encoding the oncoprotein CYSLTR2 L129Q (positive control22). KD of either Brd9 or Brg1 resulted in potent tumor growth, augmented melanocyte pigmentation, and expression of melanocyte lineage-specific genes in vivo (Fig. 4b–c, Extended Data Fig. 5b–g).
We next tested whether Brd9 expression influenced metastasis. Brd9 KD significantly increased the number of pulmonary metastatic foci following intravenous injection of murine melanoma (B16) or human UVM (92.1) cells into mice (Extended Data Fig. 6a–f). In contrast, restoring Brd9 expression in established tumors in vivo by withdrawing doxycycline suppressed tumor growth (Extended Data Fig. 6g–h). Similarly, ectopic expression of full-length BRD9, but not the bromodomain or DUF3512 deletion mutants, suppressed growth of UVM cell lines and xenografts (Fig. 4d–e, Extended Data Fig. 6i–k). These data demonstrate that Brd9 loss promotes cell transformation, tumor maintenance, and metastatic progression, and that BRD9’s bromodomain and DUF3512 domain are essential for its anti-proliferative effects.
We sought to understand how BRD9 loss promotes melanoma tumorigenesis. We identified BRD9-bound genes that exhibited dysregulated expression in SF3B1-mutant versus WT UVM patient samples and isogenic UVM cells with or without mutant SF3B1 and with or without forced BRD9 loss. HTRA1, a known tumor suppressor in melanoma23,24, was the most down-regulated gene in UVM (Extended Data Fig. 7a–c). HTRA1 was suppressed by mutant SF3B1 expression and dBRD9 treatment of SF3B1-WT UVM cells, while mutagenesis of the BRD9 poison exon increased HTRA1 levels in SF3B1-mutant UVM cells (Extended Data Fig. 7d–e). HTRA1 is bound by ncBAF in UVM and this binding is reduced by mutant SF3B1 (Extended Data Fig. 7f). HTRA1 KD promoted growth of SF3B1-WT UVM, while ectopic HTRA1 expression suppressed growth of SF3B1-mutant UVM (Extended Data Fig. 7g–k). These data suggest that perturbation of ncBAF-dependent regulation of HTRA1 contributes to the pro-tumorigenic effects of BRD9 loss.
We next tested whether correcting BRD9 mis-splicing suppressed tumorigenesis. CRISPR-based mutagenesis of the poison exon markedly slowed the growth of SF3B1-mutant, but not WT, cells in vitro and in vivo (Fig. 2d, 4f–g, Extended Data Fig. 2o–r, 8). We then designed ASOs to block BRD9 poison exon inclusion (Fig. 4h, Extended Data Fig. 9a). We treated SF3B1-mutant cells with a non-targeting (control) or poison exon-targeting ASO and measured BRD9 splicing, BRD9 protein, and cell growth. Each targeting ASO prevented poison exon inclusion, increased BRD9 protein levels, and suppressed cell growth relative to the control ASO (Fig. 4h, Extended Data Fig. 9b). The relative abilities of each ASO to restore BRD9 protein and suppress cell growth were strongly correlated, consistent with on-target effects.
We next tested whether ASO treatment slowed tumor growth in vivo. We treated SF3B1-mutant (MEL202-derived) xenografts with each ASO via intratumoral injection for 16 days. Treatment with the poison exon-targeting, but not non-targeting, ASO corrected BRD9 mis-splicing, significantly reduced tumor growth, and induced tumor necrosis (Fig. 4i–j, Extended Data Fig. 9c–f). We observed similar ASO efficacy in a rectal melanoma patient-derived xenograft (PDX) with SF3B1R625C (Fig. 4k, Extended Data Fig. 9g–i, Supplementary Table 7). In contrast, when we performed an identical experiment with a UVM PDX lacking an SF3B1 mutation, treatment with the poison exon-targeting ASO had no effect (Extended Data Fig. 9j–l, Supplementary Table 7). We conclude that correcting BRD9 mis-splicing restores BRD9’s tumor suppressor activity in SF3B1-mutant cancers.
Although BRD9 poison exon recognition requires mutant SF3B1, BRD9 mis-splicing and ncBAF disruption may also play roles in SF3B1-WT cancers. We identified significant pan-cancer expression correlations between BRD9 and many genes encoding RNA-binding proteins, as well as six additional BRD9 NMD isoforms that are expressed in SF3B1-WT cancers and predictive of BRD9 expression (Supplementary Table 10, Extended Data Fig. 10). A recent study also identified a promoter polymorphism associated with decreased GLTSCR1 expression as a common risk allele for AML25.
As we observed BRD9 mis-splicing in diverse cancers carrying distinct SF3B1 mutations, targeting BRD9 mis-splicing could be a productive pan-cancer therapy. Although ASO treatment merely restored BRD9 mRNA and protein to normal levels, we nonetheless observed strong suppression of tumor growth. The functional impact of correcting BRD9 mis-splicing was particularly notable given that the UVM models used here contain hundreds of other mis-splicing events and multiple pro-tumorigenic mutations. Given the recent clinical successes with treating spinal muscular atrophy and other diseases with ASOs26, the tumor-suppressive effects of correcting BRD9 mis-splicing suggest that oligonucleotide-based therapy may prove similarly promising for treating cancers with spliceosomal mutations.
METHODS
No statistical methods were used to predetermine sample size. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment.
Cell lines and tissue culture.
All cell lines underwent short-tandem repeat (STR) testing (ATCC) and MSK IMPACT genetic analysis28 to evaluate for spliceosome gene mutational status and status of recurrently mutated genes in cancer. HEK293T cells were grown in DMEM/10% FCS. Ba/F3 cells and Melan-a cells (provided by Dorothy Bennett) were grown in RPMI/10% with 1 ng/ml IL-3 (PeproTech; 213-13) and 200 nM TPA (Sigma-Aldrich), respectively, unless noted otherwise. The K562 and NALM-6 isogenic cell lines (engineered to express SF3B1K700E, SF3B1K666N, or SF3B1K700K from the endogenous SF3B1 locus) were cultured in RPMI/10% FCS and their generation has been described previously29. MEL270, MEL285, RN2 cell lines were cultured in RPMI/10% FCS. MEL202, 92-1, and SK-MEL30 cells were grown in RPMI/10% FCS/1% GlutaMAX (Gibco). UPMD1 and UPMD2 cells were grown in Ham F-12/10% FCS. CFPAC1 cells were cultured in IMDM/10% FCS. KPC, Miapaca2, B16 cells were cultured in DMEM/10% FCS. Panc 05.04 cells were grown in RPMI/20% FCS/20 Units/ml human recombinant insulin. T47D cells were cultured in RPMI1640 supplemented with 10% fetal bovine serum (Corning), 100 μg/mL penicillin, 100 mg/mL streptomycin (Corning), and 4 mM glutamine. All cell culture media include penicillin (100 U/ml) and streptomycin (100 μg/ml).
Primary human samples and human PDX models.
Studies were approved by the Institutional Review Boards of Memorial Sloan Kettering Cancer Center (MSK), informed consent was obtained from all subjects (under MSK IRB protocol 06-107) and conducted in accordance to the Declaration of Helsinki protocol. Patients provided samples after their informed consent and primary human de-identified CLL samples derived from whole peripheral blood or BM mononuclear cells were utilized. PDX models were performed using tumor biopsies from de-identified patients under MSK IRB protocol 14-191. Genomic alterations in melanoma tumor biopsies and CLL cells were analyzed using MSK IMPACT28 assay or FoundationOne Heme30 assay, both as previously described. Patient samples were anonymized by the Hematologic Oncology Tissue Bank of MSK (for CLL samples) and the MSK Antitumor Assessment Core Facility (for PDX samples).
Animals.
All animals were housed at Memorial Sloan Kettering Cancer Center (MSKCC). All animal procedures were completed in accordance with the Guidelines for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committees at MSKCC. All mouse experiments were performed in accordance with a protocol approved by the MSKCC Institutional Animal Care and Use Committee (11-12-029). SCID animals (Jackson laboratories stock #001303) were used for all human cell line xenografts while NSG mice (Jackson laboratories stock #005557) were used for patient-derived xenografts (PDX). For all mouse experiments, the mice were monitored closely for signs of disease or morbidity daily and were sacrificed for visible tumor formation tumor volume >1 cm3, failure to thrive, weight loss>10% total body weight, open skin lesions, bleeding, or any signs of infection. In none of the experiments were these limits exceeded.
HA tag knockin into endogenous BRD9.
The following guide RNA sequence targeting BRD9 transcriptional start site was selected using the Optimized CRISPR Design tool (http://crispr.mit.edu): CGAGTGGCGCTCGTCCTACG. DNA oligonucleotides were purchased from IDT and cloned into px458-GFP vector. For homologous recombination, we purchased a custom IDT Ultramer 197bp repair template (single strand donor DNA) with the HA sequence (TACCCATACGATGTTCCAGATTACGCT) directly following the BRD9 start codon. This 197bp fragment contained two silent mutations, one to remove the PAM site (AGG>AAG) and another to introduce an XhoI restriction enzyme (CTCGGG >CTCGAG) site upstream of the HA tag. The 197bp fragment also contained 83bp of homology to BRD9 5’ UTR upstream of the HA tag and 87bp of homology to the BRD9 exon1 downstream of the start codon. Five ug of the targeting construct and 500nM repair template were nucleofected into K562 SF3B1K700E cells and MEL270 cells using the Lonza Nucleofector V kit and Program T-003 on the Nucleofector device. Nucleofected cells were single cell sorted based on GFP positivity 48-hours following nucleofection. Clones were screened for the presence of successful HA insertion by BRD9 exon 1 PCR and subsequent restriction enzyme digest with XhoI and direct Sanger sequencing. A single positive clone containing the HA coding sequencing was selected to carry out further studies. sgRNAs: CACCCGAGTGGCGCTCGTCCTACG (top), AAACCGTAGGACGAGCGCCACTCG (bottom) Single strand donor DNA: CCAGGGGGCGGTGGCGGCCAAGGTCCGACCGGGTGCCAGCTGTTCCCAGCCCCCGCCTCGAGCCCGCCGCCGGCGCCGCCATGTACCCATACGATGTTCCAGATTACGCTGGCAAGAAGCACAAGAAGCACAAGGCCGAGTGGCGCTCGTCCTACGAAGGTGAGGCGGCGGCGCTTTGTGACGCGCGGCGGGCGGGG PCR primers - Fwd: AGCGAGCTCGGCAACCTCG, Rev: CTTCAGGACTAGCTTTAGAGGC Sanger Sequence primers – Rev: TGCAGCCTCGAACCCAGAAC
Overexpression of SF3B1 cDNA in K562 cells.
2 μg of PiggyBac Transposase construct (CMV-PB-Transposase-IRES-TK-HSV), and 6 μg of SF3B1WT (ITR-CAG-Flag-SF3B1WT-IRES-Puro-ITR) or SF3B1K700E mutant (ITR-CAG-Flag-SF3B1K700E-IRES-Puro-ITR) cDNA constructs were electroporated into 2 x 106cells (in 200 μL volume) using the Amaxa Nucleofector Protocol (Program T-003) according to manufacturer instructions (Lonza). Puromycin selection (1 μg/mL) was initiated 4 days after electroporation to select for cells that successfully incorporated the constructs. Sanger sequencing was performed to confirm successful integration of the cDNA plasmid using the following primers – Fwd: TCCAATCAAAGATCTTCTTCCAA, Rev: GAGCAGGTTTCTGCAACGAT.
RT-PCR and quantitative RT-PCR (qRT-PCR).
Total RNA was isolated using RNeasy Mini or Micro kit (Qiagen). For cDNA synthesis, total RNA was reverse transcribed to cDNA with SuperScript VILO cDNA synthesis kit (Life Technologies). The resulting cDNA was diluted 10-20 fold prior to use. Quantitative RT-PCR (qRT-PCR) was performed in 10 μL reactions with either SYBR Green PCR Master Mix. All qRT-PCR analysis was performed on an Applied Biosystems QuantStudio 6 Flex Cycler (ThermoFisher Scientific). Relative gene expression levels were calculated using the comparative CT method. Primers used in RT-PCR reactions were:
BRD9 (human) – | Fwd: GCAATGACATACAATAGGCCAGA, Rev: GAGCTGCCTGTTTGCTCATCA |
Brd9 (mouse) – | Fwd: TTGGAGATGGAAGTCTGCTCT, Rev: GCAACTTGCTAGACAGTGAACT |
BRD9 poison exon (human) – | Fwd: AGCTCTGTTCTGGAGTTCATG, Rev: CTGAAGAAACTCATAGGGGTCGTG |
Brd9 poison exon (mouse) – | Fwd: GGCCCTGTTCTGGACTTCATG, Rev: CTGAAGGAATTCATAAGGGTCGTG |
BRD9 poison exon inclusion for siRNA experiment (human) – | Fwd: CAGCAGCTCTGTTCTGGAGT, Rev: CCTGAAAGAAACCAGAGAGCTG |
BRD9 poison exon exclusion for siRNA experiment (human) – | Fwd: CAGCAGCTCTGTTCTGGAGT, Rev: TCACCTTCCCCAGAGAGCTG |
EPB49 cassette exon inclusion (human) – | Fwd: GCCTGCAGAACGGAGAGG, Rev: ACCACTAGCATTTCATAGGGATAGATCT |
EPB49 cassette exon exclusion (human) – | Fwd: GCCTGCAGATCTATCCCTATGAAAT, Rev: CTCAAGCCGCATCCGATCC |
BRD9 poison exon for mRNA half-life experiment (human) – | Fwd: GTTGGGGACACCCTAGGAGA, Rev (exclusion specific): CTTCACCTTCCCCAGAGAGC, Rev (inclusion specific): CCCTGAAAGAAACCAGAGAGC |
18s rRNA (human) – | Fwd: CTACCACATCCAAGGAAGCA, Rev: TTTTTCGTCACTACCTCCCCG |
Mitf (mouse) – | Fwd: CCAACAGCCCTATGGCTATGC, Rev: CTGGGCACTCACTCTCTGC |
Dct (mouse) – | Fwd: GTCCTCCACTCTTTTACAGACG, Rev: ATTCGGTTGTGACCAATGGGT |
Pmel (mouse) – | Fwd: GAGCTTCCTTCCCGTGCTT, Rev: TGCCTGTTCCAGGTTTTAGTTAC |
Tyrp1 (mouse) – | Fwd: CCCCTAGCCTATATCTCCCTTTT, Rev: TACCATCGTGGGGATAATGGC |
GAPDH (human) – | Fwd: GGAGCGAGATCCCTCCAAAAT, Rev: GGCTGTTGTCATACTTCTCATGG |
Gapdh (mouse) – | Fwd: AGGTCGGTGTGAACGGATTTG, Rev: TGTAGACCATGTAGTTGAGGTCA |
mRNA stability assay.
For mRNA half-life measurement using qRT-PCR, anti-UPF1 shRNAs and control shRNA infected K562 and NALM-6 cells with isogenic SF3B1K700E mutations were treated with 2.5 μg/ml Actinomycin D (Life Technologies) and harvested at 0, 2, 4, 6, and 8 hr (using methods as described previously31). BRD9 poison exon inclusion/exclusion and 18s rRNA mRNA levels were measured by qRT-PCR.
Western blotting.
For western blotting, the following antibodies to the following proteins were used: BRD9 (Bethyl Laboratories; A303-781A and Active Motif; 61538), SF3B1/Sap-155 (MBL; D221-3), Flag-M2 (Sigma-Aldrich; F-1084), β-actin (Sigma-Aldrich; A-5441), GLTSCR1 (Santa Cruz Biotechnology; sc-515086), GLTSCR1L (Thermo Fisher Scientific; PA5-56126), BRM (Bethyl Laboratories; A303-015A), BRG1 (Santa Cruz Biotechnology; sc-17796), BAF155 (Santa Cruz Biotechnology; sc-48350), BAF60A (Santa Cruz Biotechnology; sc-135843), BAF47 (Santa Cruz Biotechnology; sc-166165), ARID1A (Santa Cruz Biotechnology; sc-373784), ARID2 (Santa Cruz Biotechnology; sc-166117), BRD7 (Thermo Fisher Scientific; PA5-49379), U2AF2 (Bethyl Laboratories; A303-665A), U2AF1 (Bethyl Laboratories; A302-080A), Histone H3 (Abcam; ab1791), HTRA1 (R&D systems; MAB2916-SP). All primary antibodies for western blotting were diluted to a final concentration of 1:500 to 1,000, in either 5% BSA (Sigma-Aldrich) in 0.05% TBS-Tween 20 (TBS-T) or 5% skim milk in 0.05% TBS-T. Nuclear extracts were quantified using BCA and 1 mg protein (1 mg ml−1 in IP buffer supplemented with protease inhibitors) was used per immunoprecipitation. Proteins were incubated for 3 hours with 2-5 μg of antibody or with Protein A/G PLUS-Agarose (Santa Cruz Biotechnology; sc-2003) with rotation at 4 °C. After washing 3 times with Pierce™ IP Lysis Buffer (Thermo Fisher Scientific; 87787), immunoprecipitated proteins were eluted with Pierce™ Lane Marker Reducing Sample Buffer (Thermo Fisher Scientific; 39000) and loaded onto 4-12% Bis-Tris NuPAGE Gels (Life Technologies).
Chromatin immunoprecipitation (ChIP).
For ChIP-seq studies in MEL270 cells, antibodies to endogenous BRG1 (Abcam EPNCIR111A, Lot # GR3208604-8), GLTSCR1 (Santa Cruz SC-240516, Lot # A2313), and BRD9 (Abcam, ab137245) were used and ChIP was performed as described in detail previously6. MEL270 cells transduced with empty vector, doxycycline inducible SF3B1 wild type cDNA, or doxycycline inducible SF3B1 K700E mutant cDNA in the backbone of pInducer20, were treated with doxycycline (1 μg/ml) plus dBRD919 (250nM) or DMSO for 72 hours before cross-linking.
Mass spectrometry.
For anti-FLAG-BRD9 ChIP followed by mass spectrometry, K562 cells transduced with empty vector or 3xFLAG tagged BRD9 were grown in RPMI/10% FCS. Ten million cells were crosslinked according to the manufacturer’s instruction (Active Motif) and as described previously.32 Cells were fixed with 1% methanol-free formaldehyde (Sigma, F-8775) for 8 min and quenched with 0.125 M glycine (Sigma, G-7403). Chromatin was isolated by the addition of lysis buffer, followed by disruption with a Dounce homogenizer. Lysates were sonicated and the DNA sheared to an average length of 300-500 bp. Genomic DNA (Input) was prepared by treating aliquots of chromatin with RNase, proteinase K and heat for reverse crosslinking, followed by ethanol precipitation. Pellets were resuspended and the resulting DNA was quantified on a NanoDrop spectrophotometer. Extrapolation to the original chromatin volume allowed quantitation of the total chromatin yield. An aliquot of chromatin (150 ug) was precleared with protein G agarose beads (Invitrogen). Proteins of interest were immunoprecipitated using 15 ug of antibody against FLAG and protein G magnetic beads. Protein complexes were washed and trypsin was used to remove the immunoprecipitate from beads and digest the protein sample. Protein digests were separated from the beads and purified using a C18 spin column (Harvard Apparatus). The peptides were vacuum dried using a SpeedVac. Digested peptides were analyzed by LC-MS/MS on a Thermo Scientific Q Exactive Orbitrap Mass spectrometer in conjunction with a Proxeon Easy-nLC II HPLC (Thermo Scientific) and Proxeon nanospray source.
Protein identifications were accepted if they contained at least one identified peptide. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony. Proteins sharing significant peptide evidence were grouped into clusters. Protein coverage percentage is observed in the raw data file. Final list generation was done by taking all proteins with a spectral count of five and above from each replicate reaction and comparing them in a venn diagram against IgG control replicates. Proteins unique to both experimental replicates were then applied to the PANTHER database for protein ontology results.
shRNA experiments.
Cells were transduced with a doxycycline-inducible LT3GEPIR lentiviral vector, T3G-GFP-mirE-PGK-Puro-IRES-rtTA333, expressing shRNAs for BRD9 or a non-targeting renilla. Short hairpins were induced with the addition of doxycycline (2.0 μg/mL; Sigma Aldrich). All shRNAs were designed using the SplashRNA algorithm (Pelossof et al., 2017). The short hairpin sequences are:
shBRD9-1 (human, shBRD9_352): TTTATTATCATTGAATATCCAG
shBRD9-2 (human, shBRD9_353): TTTTATTATCATTGAATATCCA
shBrd9-1 (mouse, shBrd9_511): TTTATTATCATTGAATACCCAG
shBrd9-2(mouse, shBrd9_512): TTTTATTATCATTGAATACCCA
shHTRA1-1 (human, shHTRA1_1192): TTTTTAATCTTATCAGATGGGA
shHTRA1-2 (human, shHTRA1_1669): TGAACAAACAAAATGGCAGTCA
shHTRA1-3 (human, shHTRA1_1898): TTCTATCTACGCATTGTATCGA
siRNA transfections.
K562 cells were transfected with a non-targeting control siRNA (Dharmacon, D-001810-01, target sequence: UGGUUUACAUGUCGACUAA), a siRNA pool against U2AF1 (Dharmacon ON-TARGETplus SMARTpool, L-012325-01), or a siRNA pool against U2AF2 (Dharmacon ON-TARGETplus SMARTpool, L-012380-02) using the Nucleofector II device from Lonza with the Cell Line Nucleofector Kit V (program T16). RNA and protein were extracted 48 hours post-transfection. cDNA was produced using 1 μg of RNA and the Superscript III first strand synthesis system (Thermo Fisher, 18080051).
In vitro competition assay.
For competition assay to evaluate the cellular effect of sgBRD9/Brd9, cell lines were transduced with LentiCas9-Blast (Addgene; #52962) and then single-cell sorted into 96-well plates. Among these clones, we used single clones with strong Cas9 expression, which was confirmed by Western blotting. Cas9 expressing cells were lentivirally transduced with iLenti-guide-GFP vectors subcloned with the gRNA sequences below in which sgRNA expression was linked to GFP expression. The percentage of GFP-expressing cells was then measured over time after infection using BD LSRFortessa. GFP positive rate in living cells at each point compared to that of day2 were calculated. Similarly, to evaluate the cellular effect of BRD9 fragment cDNA, HTRA1 cDNA, or shRNAs against BRD9 or HTRA1, GFP positive rates were followed up after transducing pMIGII-backbone plasmids (cDNA) and LT3GEPIR plasmids (shRNA), respectively, into melanoma cell lines.
sgBRD9-1 (human): ACTCCAGTTACTATGATGAC, sgBRD9-2 (human): AGAGAGGGAGCACTGTGACA, sgBRD9-3 (human): AGATACCGTGTACTACAAGT, sgBrd9-1 (mouse): ATTAACCGGTTTCTCCCGGG, sgBrd9-2 (mouse): GGAACACTGCGACTCAGAGG, sgBrd9-3 (mouse): ACTTGCTAGACAGTGAACTC, Control (Scrambled): ACGGAGGCTAAGCGTCGCAA
CRISPR enrichment screening for NMD targets.
First, Ba/F3 cells were transduced with LentiCas9-Blast (Addgene; #52962) and single-cell sorted into 96-well plates. Among these clones, we used a single clone with strong Cas9 expression. sgRNA library of NMD targets in SF3B1 mutant cells were amplified and packaged as lentivirus. The library includes 4 sgRNAs against each target gene (274 genes), 100 control sgRNAs, and positive control sgRNAs against Pten. Ba/F3 cells were transduced with lentivirus carrying sgRNA library produced by 293FT cells and puromycin selection (2 μg/ml) was performed in IL-3-containing media for 7 days. Then, we washed out IL-3 (Day0) and the surviving cells were harvested at 7 days after IL-3 depletion (Day 7). Cell pellets were lysed and genomic DNA was extracted (Qiagen) and quantified by Qubit (ThermoScientific). A quantity of gDNA covering 1,000X representation of gRNAs was PCR amplified using Q5 high-fidelity polymerase (NEB cat# M0491) to add Illumina adapters and multiplexing barcodes. Amplicons were quantified by Qubit and Bioanalyzer (Agilent) and sequenced on Illumina HiSeq 2500. Sequencing reads were aligned to the screened library; counts were computed for each gRNA; counts for each sgRNA were compared between days 0 and 7 after cytokine depletion. For the probe-level analysis, we fitted a negative binomial generalized log-linear model and performed a likelihood ratio test with glmFit and glmLRT in the R/Bioconductor edgeR package. For the gene-level analysis, we used the CAMERA test as implemented in edgeR34–36. FDRs were computed with the Benjamini-Hochberg method.
CRISPR-directed mutations.
Cas 9 expressing MEL202 and MEL270 cells were transduced with iLenti-guide-Puro vector targeting 5’ end of BRD9 poison exon. sgRNA used to induce mutations was: AAAATACTCAGTTTCTTTCA. Single cell sorting was performed into 96 well plate by BD FACSAria™ III cell sorter after puromycin selection (2.0 μg/mL for 7 days). The mutations caused by Cas9 and sgRNA were confirmed by PCR amplification followed by Sanger sequencing.
PCR primers - Fwd: TGTTGGGTCAGGAAGAGACTTG, Rev: CCATGGACTGAACGGATTCC
Sanger sequencing primer – Fwd: TGTTGGGTCAGGAAGAGACTTG
Colony forming assays.
Single-cell suspensions were prepared from MEL202 and MEL270 cells with or without CRISPR mediated indel and 3,000 cells from each were plated in triplicates in 6-well treated plates, and colonies were enumerated 10 days later. After 10 days, colonies were fixed with 3.7% paraformaldehyde for 5 minutes and stained in a solution of 0.05% crystal violet for 30 minutes at room temperature and washed in PBS and tap water.
In vitro cell viability assays.
Cells were seeded in white flat-well 96-well plates (Costar) at a density of 1000 cells per well. ATP luminescence readings were taken every 24h after seeding using Cell Titer Glo (Promega) according to the manufacturer’s instructions.
BRD9 minigene assay.
We identified putative ESEs with SpliceAid237. We used SF3B1 wild-type MEL270 and T47D cells transduced with FLAG-SF3B1 WT or FLAG-SF3B1K700E cDNA (in the backbone of pInducer20 (Addgene, #44012)). After drug selection with Neomycin (Thermo Fisher Scientific, 10131027), the selected cells were treated with 1 μg /ml doxycycline (Sigma, D9891). The BRD9 minigene construct was generated by inserting the DNA fragment containing the BRD9 genomic sequence from exon 14 to exon 15 in between the BamHI and AgeI restriction sites in the FRE5 plasmid (Addgene 62377) via Gibson assembly. BRD9 minigene mutagenesis was performed with the Agilent QuikChange II Site-directed mutagenesis kit with specific primers according to the manufacturer’s directions. For transient transfection experiments, cells were seeded into 24-well plate one day before transfection of BRD9 minigene constructs in the presence of X-tremeGENE HP DNA transfection reagent (Roche) according to the manufacturer’s directions. Forty-eight hours after transfection, cells were harvested and RNA was extracted using Qiagen RNeasy mini kit. Minigene-derived and endogenous BRD9 transcripts were analyzed by RT-PCR using specific primers. Primers and oligonucleotides used in RT-PCR reactions were:
Cloning – | Fwd: GGACCCAGTACCAGGATCCGTTGGGGACACCCTAGGAG, Rev: CTTGGAGGAGCGCACACCGGTCAGGTGGTGCTGGTCCCTC |
RT-PCR– | Fwd (minigene): GGATTACAAGGATGACGATGAC, Fwd (endogenous): CATGAAGCCTCCAGATGAAG, Rev (common): CTTCGTCGTCTCATCCAAGTTC |
Mutagenesis (AAAA to TTTT) – | Fwd: CAAAGGGATATATTTTGAGATTTTTTACTCAGTTTCTTTCAGG, Rev: CCTGAAAGAAACTGAGTAAAAAATCTCAAAATATATCCCTTTG |
Mutagenesis (AAAA to ATAT) – | Fwd: CAAAGGGATATATTTTGAGATATATTACTCAGTTTCTTTCAGG, Rev: CCTGAAAGAAACTGAGTAATATATCTCAAAATATATCCCTTTG |
Mutagenesis (GATAAAA to TTTTTTT) – | Fwd: GTCAAAGGGATATATTTTGATTTTTTTTACTCAGTTTCTTTCAGG, Rev: CCTGAAAGAAACTGAGTAAAAAAAATCAAAATATATCCCTTTGAC |
Mutagenesis (TTTCT to TTACA at exon 14a)– | Fwd: GAGATAAAATACTCAGTTACATTCAGGGCCTGCCATCTATC, Rev: GATAGATGGCAGGCCCTGAATGTAACTGAGTATTTTATCTC |
Mutagenesis (TTTCT to TTGCG at exon 14a)– | Fwd: GAGATAAAATACTCAGTTGCGTTCAGGGCCTGCCATCTATC, Rev: GATAGATGGCAGGCCCTGAACGCAACTGAGTATTTTATCTC |
Mutagenesis (TTTCT to TTTCA at exon 14a)– | Fwd: GAGATAAAATACTCAGTTTCATTCAGGGCCTGCCATCTATC, Rev: GATAGATGGCAGGCCCTGAATGAAACTGAGTATTTTATCTC |
Mutagenesis (TTTCT to TTTCG at exon 14a)– | Fwd: GAGATAAAATACTCAGTTTCGTTCAGGGCCTGCCATCTATC, Rev: GATAGATGGCAGGCCCTGAACGAAACTGAGTATTTTATCTC |
Mutagenesis (TTTCT to TTTAT at exon 14a)– | Fwd: GAGATAAAATACTCAGTTTATTTCAGGGCCTGCCATCTATC, Rev: GATAGATGGCAGGCCCTGAAATAAACTGAGTATTTTATCTC |
Mutagenesis (Branch point A to G)– | Fwd: GTCAAAGGGATATATTTTGGGATAAAATACTCAGTTTCTTTC, Rev: GAAAGAAACTGAGTATTTTATCCCAAAATATATCCCTTTGAC |
Lariat sequencing.
To map the branchpoints that were used when the BRD9 poison exon was included or excluded, RT-PCR was performed to amplify branchpoint-spanning fragments from lariat RNAs arising during normal (poison exon exclusion) or aberrant (poison exon inclusion) splicing of BRD9 pre-mRNA. Briefly, SuperScript III reverse transcriptase (Invitrogen) and a primer complementary to the intronic sequences downstream of the 5’ splice sites were used to generate cDNA from lariat RNAs. Branchpoint-spanning fragments were then amplified from lariat RNAs by nested PCR with pairs of outer primers (with the RT primer being the reverse primer) and inner primers. The forward primers were complementary to sequences about 200-300 nucleotides upstream of the 3’ splice sites and the reverse primers were complementary to sequences downstream of the 5’ splice sites (see Supplementary Table 11). The PCR products were cloned into the pGEM-T vector (Promega) and sequenced by Sanger sequencing.
Melanoma transplant model.
We resuspended stably-transduced Melan-a/MEL270/MEL202 cells (1M cells) with doxycycline-inducible shRNAs, cDNAs, or sgRNAs in 100 μl of a 1:1 mix of medium and Matrigel (BD Biosciences) and subcutaneously and bilaterally injected the mix into the flanks of 7-week-old female CB17-SCID mice (Taconic). For doxycycline-regulated shRNA induction, we used doxycycline-containing diets (625mg/kg diet, Envigo). To assess tumor growth, at least five mice per group were injected for a total of ten tumors per group. No randomization or blinding was used in the analysis of tumor growth. Tumors were measured with calipers every 7 days. Growth curves were visualized with Prism GraphPad 8.0. Tumor volume was calculated using the formula; Volume = π(length)(width)(height)/6.
In vitro morpholino transfection.
To deliver morpholinos into cultured cell lines, we followed the manufacturer’s instruction (GeneTools LLC). Briefly, we used 6uM Endo-Porter after adding morpholinos (final concentration: 10uM). RNA and proteins are collected 48 hours after delivery.
morpholino target sequence - #3 TAATGAGGCAAGTCCAGTCCCGCTT, #6 AAAGAGGGGATAATGAGGCAAGTCC, #7 GGGATAATGAGGCAAGTCCAGTCCC.
In vivo morpholino treatment.
Treatment with morpholinos was started when the tumor volume in mice reached 100-200 mm3. Cohorts were treated intratumorally with 12.5 mg/kg scrambled or poison exon-targeting Vivo-Morpholinos (AAAGAGGGGATAATGAGGCAAGTCC, GeneTools LLC) dissolved in 50 μl PBS, every 2 days for 8 doses total. The mice were dissected 24 hours after the final treatment. For the PDX model, patients’ derived rectal melanoma cells (SF3B1- R625C) and UVM cells (SF3B1 wild type) were serially transplanted into SCID mice and treated similarly.
In vivo metastasis model.
For lung experimental metastasis, B16 and 92.1 melanoma cells retrovirally transduced with shRenilla, shBRD9/Brd9#1 and shBRD9/Brd9#2 in MLS-E vector (sorted using GFP) were trypsinized, resuspended in PBS, and then 0.4M cells in 0.2 mL PBS were injected via the lateral tail vein using a 27-gauge needle. Mice were sacrificed 14 days after injection and tissues were isolated and fixed in 4% paraformaldehyde (Thermo Fisher Scientific). For evaluation of metastatic colonization of the lung using 92.1 human UVM cells, the burden of metastatic cells was evaluated using GFP expression by flow cytometry as well as anti-GFP immunohistochemistry 14 days following tail-vein injection of 0.4M cells into NOD-Scid IL2Rγ null mice.
Histologic analysis.
Tissues were fixed in 4% paraformaldehyde, processed routinely in alcohol and xylene, embedded in paraffin, sectioned at 5-micron thickness, and stained with hematoxylin-eosin (H&E). Immunohistochemistry (IHC) was performed on a Leica Bond RX automated stainer (Leica Biosystems, Buffalo Grove, IL). Following heat induced epitope retrieval at pH 6.0, the primary antibody against Ki67 (Vector VP-K451) were applied, followed by application of a polymer detection system (DS9800, Novocastra Bond Polymer Refine Detection, Leica Biosystems) in which the chromogen was 3,3 diaminobenzidine tetrachloride (DAB) and counterstain was hematoxylin. Photomicrograph examination of all H&E and IHC slides were performed using a Zeiss Axioskop imaging.
BRD9 expression correlates.
cor.test (in R) was used to calculate Spearman’s ρ and the associated p-value associated with the correlation of BRD9 expression with the expression of each coding gene across all samples within each TCGA cancer cohort. Analysis was restricted to coding genes not on the same chromosome arm as BRD9 (chromosome 5p) to remove potential confounding effects of local correlations. Coding genes with p < 0.01 in at least 10 cancer types were ranked by their absolute mean value of ρ (computed across all TCGA cohorts) and classified as RNA-binding if they were annotated with the “RNA-binding” Gene Ontology term (GO:0003723).
BRD9 alternative splicing.
Potential NMD-targeted isoforms of BRD9 were identified as follows: we queried the MISO v2.0 alternative splicing annotation38 for exon skipping and competing splice site events within the BRD9 gene locus, restricted to those events with evidence of alternative splicing based on spliced junction reads identified via the TCGA analysis described below, assigned open reading frames for the isoforms resulting from each alternative splicing event based on the BRD9 isoform with the longest open reading frame, and classified isoforms as predicted NMD substrates if they contained a termination codon > 50 nt upstream of a splice junction.
Robust linear modeling of BRD9 expression based on the identified alternatively spliced isoforms of BRD9 that are predicted NMD substrates was performed for each cancer cohort with the rlm function in the MASS package in R. Relative expression of each isoform in each sample was estimated from RNA-seq data across all TCGA cohorts as described below. A z-score normalization was performed across all samples for each isoform in each cohort prior to model fitting. The resulting coefficients from the fitted models were subsequently used to predict BRD9 expression from BRD9 NMD-targeted isoform expression.
RNA-seq library preparation.
RNA-seq libraries were prepared from TRIzol-isolated (Thermo Fisher catalog no. 15596026) RNA using the Illumina TruSeq RNA Library Prep Kit v2 (Illumina catalog no. RS-122-2001/2). K562 libraries were sequenced at MSKCC with 101 bp single-end reads. MEL270 libraries were sequenced by the FHCRC Genomics Shared Resource with 2x51 bp paired-end reads.
ChIP-seq library preparation.
ChIP-seq libraries were prepared and sequenced as previously described6 by the Molecular Biology Core Facilities at the Dana-Farber Cancer Institute with 75 bp single-end reads.
Genomic analysis of SWI/SNF complex members from TCGA.
Mutational analysis of genes encoding SWI/SNF complex members was performed as previously described39.
Genome annotation, RNA-seq read mapping, and gene and isoform expression estimation.
RNA-seq reads were processed for gene expression and isoform ratio quantification as previously described40. In brief, RNA-seq reads were aligned to the hg19/GRCh37 assembly of the human genome using a gene annotation created by merging the UCSC knownGene gene annotation41, Ensembl v71.1 gene annotation42, and MISO v2.0 isoform annotation38. Read alignment and expression estimation were performed with RSEM v1.2.443, Bowtie v1.0.044, and TopHat v2.1.145. Isoform ratios were quantified with MISO v2.038. Gene expression estimates were normalized by applying the trimmed mean of M values (TMM) method46 to coding genes. Statistical tests for differential gene and isoform expression were performed for single-sample comparisons with Wagenmakers’s Bayesian framework47 and for sample group comparisons with the Mann-Whitney U test. RNA-seq read coverage plots (e.g., Fig. 1e) represent reads normalized by the number of reads mapping to all coding genes in each sample (per million).
RNA-seq coverage plots.
RNA-seq coverage plots were made using the UCSC Genome Browser48 and/or the ggplot2 package in R49. Repetitive elements were annotated by RepeatMasker27.
Cluster analysis.
Unsupervised clustering of CLL, MDS, and UVM samples (Fig. 1a) was based on the 40 events that were differentially spliced in isogenic UVM (MEL270 cells) as well as myeloid leukemia (K562 cells) cells expressing SF3B1K700E versus WT SF3B1, restricted to the 30 such events with sufficient read coverage in all cohorts for clustering.
ChIP-seq data analysis.
ChIP-seq reads were mapped to the genome by calling Bowtie v1.0.044 with the arguments ‘-v 2 -k 1 -m 1 --best --strata’. Peaks were called using MACS2 v2.1.1.2016030950 against input control libraries with p < 10−5 and subsequently filtered to remove peaks contained within ENCODE blacklisted regions51 and the mitochondrial genome. Subsequent data analysis was performed with Bioconductor in the R programming environment52. Consensus peaks between samples were called using the soGGI package v1.14.053. Peaks were annotated using the ChIPseeker package v1.18.054. Potential transcription factor binding in a 300 nucleotide region around the center of consensus peaks was scored using the TFBSTools package v1.20.055, with models taken from the HOCOMOCO v11 human core collection56 and applied with a threshold of p < 10−4. The highest scores for each consensus peak region were collated for each transcription factor. A two-sided Mann-Whitney U test was used to assess the significance of the difference in scores between constitutive and sensitive peaks for each transcription factor.
DATA AVAILABILITY
RNA-seq and ChIP-seq data generated as part of this study were deposited in the Gene Expression Omnibus (accession number GSE124720). RNA-seq data from published studies were downloaded from CGHub (TCGA UVM57), EMBL-EBI ArrayExpress (Illumina Human BodyMap 2.0: E-MTAB-513), the Gene Expression Omnibus (accession numbers GSE72790, GSE114922) (CLL: Darman et al. 201515, MDS: Pellagati et al. 201858), or directly obtained from the authors (UVM: Alsafadi et al. 201610). Gel source data can be found in Supplementary Fig. 1. Other data that support this study’s findings are available from the authors upon reasonable request.
Extended Data
Supplementary Material
Acknowledgements.
We are grateful for the support of the MSK RNAi core facility for help with the CRISPR screens performed in the study, the MSK anti-tumor assessment core facility for help with patient-derived xenograft experiments, and the Genomics Shared Resource of the Fred Hutch/University of Washington Cancer Consortium (P30 CA015704). D.I., S.C.-W.L, A.Y., and O.A.-W. are supported by the Leukemia & Lymphoma Society. D.I. is supported by grants from Lydia O’Leary Memorial Pias Dermatological Foundation and Kobayashi Foundation for Cancer Research. A.Y. is supported by grants from the Aplastic Anemia and MDS International Foundation (AA&MDSIF) and the Lauri Strauss Leukemia Foundation. S.X.L. is supported by a Conquer Cancer Foundation and ASCO Young Investigator Award, an Aplastic Anemia & Myelodysplastic Syndrome International Foundation research award, as well as an AACR Lymphoma Research Fellowship. G.-L.C. is a Mahan Fellow. O.A.-W. is supported by the Pershing Square Sohn Cancer Research Alliance, the Henry & Marilyn Taub Foundation, and the Starr Cancer Consortium. R.K.B. is a Scholar of The Leukemia and Lymphoma Society (1344-18) and is supported in part by the US National Institutes of Health (R01 DK103854). O.A.-W. and R.K.B are supported by the Evans MDS Foundation, the US National Institutes of Health (R01 HL128239), and the Dept. of Defense Bone Marrow Failure Research Program (BM150092 and W81XWH-12-1-0041). The results shown here are in part based upon data generated by the TCGA Research Network: https://cancergenome.nih.gov/.
Competing interests. C.K. is a Scientific Founder, fiduciary Board of Directors member, Scientific Advisory Board member, consultant, and shareholder of Foghorn Therapeutics, Inc. (Cambridge, MA), none of which are related to the current manuscript. O.A.-W. has served as a consultant for H3 Biomedicine, Foundation Medicine Inc., Merck, and Janssen; O.A.-W. has received personal speaking fees from Daiichi Sankyo. O.A.-W. has received prior research funding from H3 Biomedicine unrelated to the current manuscript. D.I., O.A.-W., and R.K.B. are inventors on a provisional patent application submitted by Fred Hutchinson Cancer Research Center that covers BRD9 activation in cancer.
Footnotes
The authors declare competing financial interests: details are available in the online version of the paper. Readers are welcome to comment on the online version of the paper.
REFERENCES
- 1.Yoshida K, et al. Frequent pathway mutations of splicing machinery in myelodysplasia. Nature 478, 64–69 (2011). [DOI] [PubMed] [Google Scholar]
- 2.Papaemmanuil E, et al. Somatic SF3B1 mutation in myelodysplasia with ring sideroblasts. N Engl J Med 365, 1384–1395 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wang L, et al. SF3B1 and other novel cancer genes in chronic lymphocytic leukemia. The New England journal of medicine 365, 2497–2506 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Seiler M, et al. Somatic Mutational Landscape of Splicing Factor Genes and Their Functional Consequences across 33 Cancer Types. Cell reports 23, 282–296 e284 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Alpsoy A & Dykhuizen EC Glioma tumor suppressor candidate region gene 1 (GLTSCR1) and its paralog GLTSCR1-like form SWI/SNF chromatin remodeling subcomplexes. The Journal of biological chemistry 293, 3892–3903 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Michel BC, et al. A non-canonical SWI/SNF complex is a synthetic lethal target in cancers driven by BAF complex perturbation. Nat Cell Biol 20, 1410–1420 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gatchalian J, et al. A non-canonical BRD9-containing BAF chromatin remodeling complex regulates naive pluripotency in mouse embryonic stem cells. Nat Commun 9, 5139 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Quesada V, et al. Exome sequencing identifies recurrent mutations of the splicing factor SF3B1 gene in chronic lymphocytic leukemia. Nature genetics 44, 47–52 (2012). [DOI] [PubMed] [Google Scholar]
- 9.Furney SJ, et al. SF3B1 mutations are associated with alternative splicing in uveal melanoma. Cancer discovery 3, 1122–1129 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Alsafadi S, et al. Cancer-associated SF3B1 mutations affect alternative splicing by promoting alternative branchpoint usage. Nat Commun 7, 10615 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Harbour JW, et al. Recurrent mutations at codon 625 of the splicing factor SF3B1 in uveal melanoma. Nature genetics 45, 133–135 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Martin M, et al. Exome sequencing identifies recurrent somatic mutations in EIF1AX and SF3B1 in uveal melanoma with disomy 3. Nature genetics 45, 933–936 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cretu C, et al. Molecular Architecture of SF3b and Structural Consequences of Its Cancer-Related Mutations. Molecular cell 64, 307–319 (2016). [DOI] [PubMed] [Google Scholar]
- 14.DeBoever C, et al. Transcriptome sequencing reveals potential mechanism of cryptic 3’ splice site selection in SF3B1-mutated cancers. PLoS computational biology 11, e1004105 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Darman RB, et al. Cancer-Associated SF3B1 Hotspot Mutations Induce Cryptic 3? Splice Site Selection through Use of a Different Branch Point. Cell reports 13, 1033–1045 (2015). [DOI] [PubMed] [Google Scholar]
- 16.Hohmann AF, et al. Sensitivity and engineered resistance of myeloid leukemia cells to BRD9 inhibition. Nat Chem Biol 12, 672–679 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Gozani O, Potashkin J & Reed R A potential role for U2AF-SAP 155 interactions in recruiting U2 snRNP to the branch site. Molecular and cellular biology 18, 4752–4760 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pineda JMB & Bradley RK Most human introns are recognized via multiple and tissue-specific branchpoints. Genes Dev 32, 577–591 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Remillard D, et al. Degradation of the BAF Complex Factor BRD9 by Heterobifunctional Ligands. Angew Chem Int Ed Engl 56, 5738–5743 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Martens JA & Winston F Recent advances in understanding chromatin remodeling by Swi/Snf complexes. Curr Opin Genet Dev 13, 136–142 (2003). [DOI] [PubMed] [Google Scholar]
- 21.Brien GL, et al. Targeted degradation of BRD9 reverses oncogenic gene expression in synovial sarcoma. eLife 7(2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Moore AR, et al. Recurrent activating mutations of G-protein-coupled receptor CYSLTR2 in uveal melanoma. Nat Genet 48, 675–680 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Baldi A, et al. The HtrA1 serine protease is down-regulated during human melanoma progression and represses growth of metastatic melanoma cells. Oncogene 21, 6684–6688 (2002). [DOI] [PubMed] [Google Scholar]
- 24.Chien J, et al. Serine protease HtrA1 associates with microtubules and inhibits cell migration. Molecular and cellular biology 29, 4177–4187 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Walker CJ, et al. Genome-wide association study identifies an acute myeloid leukemia susceptibility locus near BICRA. Leukemia 33, 771–775 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Stein CA & Castanotto D FDA-Approved Oligonucleotide Therapies in 2017. Mol Ther 25, 1069–1075 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Smit A, Hubley R & Green P . RepeatMasker Open-4.0. (http://www.repeatmasker.org, 2013-2015. ).
- 28.Cheng DT, et al. Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A Hybridization Capture-Based Next-Generation Sequencing Clinical Assay for Solid Tumor Molecular Oncology. The Journal of molecular diagnostics : JMD 17, 251–264 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Seiler M, et al. H3B-8800, an orally available small-molecule splicing modulator, induces lethality in spliceosome-mutant cancers. Nature medicine (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Leeksma AC, et al. Clonal diversity predicts adverse outcome in chronic lymphocytic leukemia. Leukemia (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kim E, et al. SRSF2 Mutations Contribute to Myelodysplasia by Mutant-Specific Effects on Exon Recognition. Cancer cell 27, 617–630 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Mohammed H, et al. Endogenous purification reveals GREB1 as a key estrogen receptor regulatory factor. Cell reports 3, 342–349 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Fellmann C, et al. An optimized microRNA backbone for effective single-copy RNAi. Cell reports 5, 1704–1713 (2013). [DOI] [PubMed] [Google Scholar]
- 34.Robinson MD, McCarthy DJ & Smyth GK edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.McCarthy DJ, Chen Y & Smyth GK Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic acids research 40, 4288–4297 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wu D & Smyth GK Camera: a competitive gene set test accounting for inter-gene correlation. Nucleic acids research 40, e133 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Piva F, Giulietti M, Burini AB & Principato G SpliceAid 2: a database of human splicing factors expression data and RNA target motifs. Hum Mutat 33, 81–85 (2012). [DOI] [PubMed] [Google Scholar]
- 38.Katz Y, Wang ET, Airoldi EM & Burge CB Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nature methods 7, 1009–1015 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Mashtalir N, et al. Modular Organization and Assembly of SWI/SNF Family Chromatin Remodeling Complexes. Cell 175, 1272–1288 e1220 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dvinge H, et al. Sample processing obscures cancer-specific alterations in leukemic transcriptomes. Proc Natl Acad Sci U S A 111, 16802–16807 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Meyer LR, et al. The UCSC Genome Browser database: extensions and updates 2013. Nucleic acids research 41, D64–69 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Flicek P, et al. Ensembl 2013. Nucleic acids research 41, D48–55 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li B & Dewey CN RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC bioinformatics 12, 323 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Langmead B, Trapnell C, Pop M & Salzberg SL Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome biology 10, R25 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Trapnell C, Pachter L & Salzberg SL TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Robinson MD & Oshlack A A scaling normalization method for differential expression analysis of RNA-seq data. Genome biology 11, R25 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wagenmakers EJ, Lodewyckx T, Kuriyal H & Grasman R Bayesian hypothesis testing for psychologists: a tutorial on the Savage-Dickey method. Cognitive psychology 60, 158–189 (2010). [DOI] [PubMed] [Google Scholar]
- 48.Kent WJ, et al. The human genome browser at UCSC. Genome research 12, 996–1006 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wickham H ggplot2: Elegant Graphics for Data Analysis., (Springer-Verlag New York, 2016). [Google Scholar]
- 50.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS). Genome biology 9, R137 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Consortium EP An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Huber W, et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nature methods 12, 115–121 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Dharmalingam G, C. T soGGi: Visualise ChIP-seq, MNase-seq and motif occurrence as aggregate plots Summarised Over Grouped Genomic Intervals R package version 1.14.0. (2018). [Google Scholar]
- 54.Yu G, Wang LG & He QY ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015). [DOI] [PubMed] [Google Scholar]
- 55.Tan G & Lenhard B TFBSTools: an R/bioconductor package for transcription factor binding site analysis. Bioinformatics 32, 1555–1556 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kulakovskiy IV, et al. HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis. Nucleic acids research 46, D252–D259 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Robertson AG, et al. Integrative Analysis Identifies Four Molecular and Clinical Subsets in Uveal Melanoma. Cancer cell 32, 204–220 e215 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Pellagatti A, et al. Impact of spliceosome mutations on RNA splicing in myelodysplasia: dysregulated genes/pathways and clinical associations. Blood 132, 1225–1240 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.McLean CY, et al. GREAT improves functional interpretation of cis-regulatory regions. Nature biotechnology 28, 495–501 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Pollard KS, Hubisz MJ, Rosenbloom KR & Siepel A Detection of nonneutral substitution rates on mammalian phylogenies. Genome research 20, 110–121 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
RNA-seq and ChIP-seq data generated as part of this study were deposited in the Gene Expression Omnibus (accession number GSE124720). RNA-seq data from published studies were downloaded from CGHub (TCGA UVM57), EMBL-EBI ArrayExpress (Illumina Human BodyMap 2.0: E-MTAB-513), the Gene Expression Omnibus (accession numbers GSE72790, GSE114922) (CLL: Darman et al. 201515, MDS: Pellagati et al. 201858), or directly obtained from the authors (UVM: Alsafadi et al. 201610). Gel source data can be found in Supplementary Fig. 1. Other data that support this study’s findings are available from the authors upon reasonable request.