Abstract
Androgen receptor (AR) is a ligand-responsive transcription factor that drives terminal differentiation of the prostatic luminal epithelia. By contrast, in tumors originating from these cells, AR chromatin occupancy is extensively reprogrammed to activate malignant phenotypes, the molecular mechanisms of which remain unknown. Here, we show that tumor-specific AR enhancers are critically reliant on H3K36 dimethyltransferase activity of NSD2. NSD2 expression is abnormally induced in prostate cancer, where its inactivation impairs AR transactivation potential by disrupting over 65% of its cistrome. NSD2-dependent AR sites distinctively harbor the chimeric FOXA1:AR half-motif, which exclusively comprise tumor-specific AR enhancer circuitries defined from patient specimens. NSD2 inactivation also engenders increased dependency on the NSD1 paralog, and a dual NSD1/2 PROTAC degrader is preferentially cytotoxic in AR-dependent prostate cancer models. Altogether, we characterize NSD2 as an essential AR neo-enhanceosome subunit that enables its oncogenic activity, and position NSD1/2 as viable co-targets in advanced prostate cancer.
Subject terms: Prostate cancer, Epigenetics
CRISPR screen identifies coactivators of the androgen receptor (AR) complex, including NSD2. NSD2 contributes to AR cistrome reprogramming during prostate cancer progression, and its degradation via a novel PROTAC reduces prostate cancer cell viability in vitro.
Main
Prostate cancer (PCa) is the most commonly diagnosed malignancy in North American men, with over 95% of the primary disease expressing the androgen receptor (AR) protein1. AR is a transcription factor that dimerizes and shuttles into the nucleus upon binding to its ligand (that is, androgen), where it activates the expression of genes that drive terminal (that is, nonproliferative) differentiation of luminal epithelial cells. In concert with chromatin and epigenetic regulatory proteins, AR primarily binds at distal cis-regulatory sites (also known as enhancers) containing a canonical androgen response element (ARE) that comprises a 15-bp palindromic DNA sequence with two invertedly oriented hexameric 5′-AGAACA-3′ half-sites2, separately recognized by each half of the AR homodimer3.
In PCa cells, AR activity is extensively reprogrammed to enable and maintain malignant phenotypes4–6. Consequently, the androgen/AR axis is the primary target of all therapies following surgical resection or radiation of the organ-confined disease7. This acute dependency on AR activity is further reinforced in relapsed metastatic castration-resistant PCa (mCRPC) through activating mutations or copy amplification of AR or its cofactors8–12. Seminal studies profiling the AR cistrome in primary PCa uncovered de novo genesis of enhancers in the malignant state (that is, neo-enhancers), resulting in a two- to threefold expansion of the AR enhancer circuitry5,6,13–15. This process engenders an acute dependency on chromatin-binding AR cofactors, such as SWI/SNF, BRD4, MED1 and p300/CBP, all of which have been independently assessed for therapeutic druggability in mCRPC16–22. Yet, the molecular mechanisms underlying chromatin redistribution of AR upon transformation or distinctive subunits of the AR transcriptional complex that assembles at neo-enhancer elements (that is, the neo-enhanceosome) are poorly studied and, thus, unexplored for therapeutic targetability.
In this study, using an epigenetics-targeted functional CRISPR screen, we identified nuclear receptor binding SET domain protein 2 (NSD2, also known as MMSET, WHSC1) as a subunit of the AR enhanceosome complex in PCa cells. NSD2 is a histone 3 lysine 36 (H3K36) mono- and dimethyltransferase that activates gene expression by protecting the chromatin from accumulating repressive epigenetic marks, such as H3K27me3 (refs. 23–25). NSD2 is a bona fide oncogene in hematologic cancers and harbors recurrent activating alterations in over 15–20% of multiple myeloma26–28 and 10% of childhood acute lymphoblastic leukemia29–31.
In PCa, we found NSD2 to be exclusively expressed in the transformed cells—with no detectable expression in the normal epithelia—where it directly interacts with AR to enable its binding at chimeric AR half-motifs in concert with FOXA1 or other driver oncogenes. Inactivation of NSD2 entirely disrupted AR binding at over 65% of its tumor cistrome, importantly without affecting AR protein levels, and attenuated hallmark cancer phenotypes. NSD2 deficiency also engendered an increased dependency on NSD1, positioning the two paralogs as a digenic dependency. Concordantly, a dual NSD1/2 PROTAC degrader, called LLC0150, showed selective potency in AR-dependent as well as NSD2-altered human cancers. These findings mechanistically explain how AR gets reprogrammed, away from prodifferentiation physiological functions, to instead fuel PCa growth and survival, and offer NSD1 and NSD2 as therapeutic vulnerabilities in the advanced disease.
Results
Functional CRISPR screen reveals NSD2 as an AR coactivator
Conventional plasmid-based reporter systems fail to capture intricate epigenetic or chromatin-level regulation of gene expression as they lack the native histone composition or higher-order chromosomal structure. Thus, we engineered an endogenous AR reporter system by using the CRISPR/Cas9 and homologous recombination methodologies. We edited the KLK3 gene (also known as prostate-specific antigen, PSA) locus in AR-driven LNCaP cells to knock-in the mCherry coding sequence directly downstream of the endogenous promoter and fused in-frame via an endopeptidase sequence to the KLK3 gene (Fig. 1a and Extended Data Fig. 1a–c). In the monoclonal reporter cell line, akin to PSA, mCherry expression is directly regulated by the AR transcriptional complex (Extended Data Fig. 1d–f) and, most importantly, captures chromatin or epigenetic-level changes in AR transactivation potential. Like PSA, mCherry expression was attenuated upon pharmacologic inhibition of coactivators like BRD4 (ref. 16), SWI/SNF18 or P300/CBP19 while increasing upon inhibition of the repressive PRC2/EZH2 complex32,33 (Fig. 1b and Extended Data Fig. 1g). Using these endogenous AR reporter cell lines, we carried out a functional CRISPR screen, wherein we treated the cells with a custom single guide RNA (sgRNA) library targeting druggable transcriptional cofactors34 (Extended Data Fig. 1h) for 8 days, stimulated with DHT for 16 h and FACS-sorted into mCherryHIGH and mCherryLOW populations. Genomic sgRNAs were sequenced and the ratio of normalized counts in mCherryLOW to mCherryHIGH cell populations was used to rank individual sgRNAs. Here, ranked alongside BRD4 (ref. 16) and TRIM24 (refs. 35,36), we identified NSD2 as an AR coactivator (Fig. 1c and Extended Data Fig. 1i). In contrast, subunits of the PRC2 complex, namely EZH2 and JARID2, that repress AR activity32 were enriched in the mCherryHIGH cells. Validating the screening results, siRNA-mediated knockdown of NSD2 attenuated the expression of PSA/KLK3 in PCa cell lines (Fig. 1d).
The NSD2 gene templates two splice isoforms producing a long, catalytically active form (hereafter referred to as NSD2-L) as well as a truncated shorter isoform (called NSD2-S) containing only the reader and protein-protein interacting PWWP and HMG domains, respectively. We found both NSD2 isoforms to be robustly expressed in PCa cells (Fig. 1e and Extended Data Fig. 2a). Deletion of NSD2-L alone strongly attenuated the expression of AR target genes in LNCaP cells, which was comparable to complete loss of the NSD2 protein (Fig. 1f). The transcriptomic analysis further showed global AR activity to be significantly dampened in NSD2-deficient LNCaP cells with a parallel loss in hyperproliferative gene expression programs (Fig. 1g). AR and NSD2 transcriptional activities were also positively correlated in primary prostate tumors from the TCGA cohort (Extended Data Fig. 2b; R = 0.68, P = 2.2 × 10−16). Notably, there was no change in the abundance of AR transcript or protein itself in NSD2-deleted cells (Fig. 1f and Extended Data Fig. 2c), yet stimulation with DHT failed to significantly up-regulate the expression of AR target genes (Fig. 1h,i and Extended Data Fig. 2c,d).
To date, several studies have implicated NSD2 in PCa37–41; however, it is worth noting that these studies were focused on the AR-negative disease. Using tissue microarrays, these studies showed NSD2 protein to be elevated in cancer specimens, showing a stage-wise increase from primary to mCRPC or neuroendocrine PCa39,40. Building on these findings, in primary prostatectomy specimens we found NSD2 levels to be undetectable in the normal or adjacent benign foci with marked gain in expression in malignant cells (Fig. 1j,k). Consistent with this, in single-cell RNA-seq data from patient tumors, we found the NSD2 transcript to be exclusively expressed in the AR+ luminal epithelial cells (Extended Data Fig. 2e). Pseudo-bulk analyses confirmed NSD2 expression to be markedly elevated in the matched tumor vs the normal luminal compartment (n = 18), and NSD2 expression positively correlated with Gleason score of the primary disease (Extended Data Fig. 2f,g). Multiplex immunofluorescence in additional prostatectomy and patient tumor specimens further confirmed the KRT8+/AR+ normal epithelial cells to have no detectable expression of NSD2, which was robustly expressed in the transformed epithelial cells (Fig. 1l,m and Extended Data Fig. 3a–e). Altogether, these data suggest that NSD2 is abnormally expressed in the transformed prostate luminal epithelial cells, wherein its methyltransferase function is critical for maintaining transcriptional activity of the AR complex.
NSD2 activates neo-enhancers with chimeric AR half-motifs
Given that NSD2 loss had no impact on the abundance of the AR protein, we next profiled AR binding on chromatin. AR chromatin immunoprecipitation with sequencing (ChIP-seq) in NSD2-deficient LNCaP cells showed a dramatic and complete off-loading of the AR protein from over 40,000 genomic sites that comprise over 65% of the tumor cistrome (Fig. 2a). The majority of the lost sites (that is, NSD2-dependent) were within intronic or intergenic regions associated with cis-regulatory DNA elements (Fig. 2b). At these sites there was no change in the binding of FOXA1 upon NSD2 inactivation. Yet, disruption of AR binding was sufficient to trigger loss of the H3K27ac mark that demarcates active enhancers (Fig. 2c). In contrast, AR remains bound at over 20,000 genomic sites independent of NSD2, which also retained the H3K27ac active mark in the NSD2-null PCa cells (Fig. 2a–c). Next, ChIP-seq-based profiling of the chemical chromatin state showed NSD2-dependent AR sites to have higher abundance of H3K36me2 as well as active enhancer-associated H3K4me1/2 and H3K27ac modifications compared to the NSD2-independent elements (Extended Data Fig. 3d–f). Contrastingly, NSD2-independent AR sites had higher levels of the PRC2/EZH2 catalyzed repressive H3K27me3 mark. More importantly, NSD2 inactivation led to a significant decrease in H3K36me2 levels at the NSD2-dependent sites, reducing it to the levels at NSD2-independent sites in the wild-type cells, with a parallel increase in H3K27me3 (Extended Data Fig. 3g,h).
Motif analyses (HOMER42) of the NSD2-dependent AR sites identified a chimeric motif comprising a FOXA1 element juxtaposed to the AR half site (called FOXA1:AR half-motif) as the most significantly enriched DNA sequence (Fig. 2d), with 40% of these enhancers harboring this motif (Supplementary Tables 1 and 2). In contrast, NSD2-independent AR sites, a large fraction of which showed increased AR binding upon NSD2 inactivation, housed the palindromic ARE (Fig. 2d). These distinct modes of AR DNA interaction were evident within a Chr10 gene locus, wherein the loss of NSD2 completely disrupted AR binding at the FOXA1:AR half-motif, without affecting AR’s interaction with a canonical ARE element in cis-proximity (Fig. 2e). Furthermore, custom motif analyses showed enrichment of other transcription factor motifs, including HOXB13 and ETS, within 25 bp of AR half elements detected within the NSD2-dependent AR sites (Extended Data Fig. 4a). We also custom-assembled chimeric AR half-motifs with FOXA1 and HOXB13 elements (in both 5′ and 3′ confirmations, see Methods) and interrogated their recurrence in published AR ChIP-seq data derived from non-cancerous LHSAR cells5. Here, we found the overexpression of FOXA1 and HOXB13 alone, or in combination, to markedly shift the AR cistrome away from full AREs (normal-like) towards chimeric AR half elements in the tumor-like state (Fig. 2f and Extended Data Fig. 4b). Strikingly, motif analysis of the AR cistromes generated from patient specimens5,6 revealed the FOXA1:AR half-motif to be exclusively detected in the tumor-specific AR enhancer circuitries, with such chimeric motifs being essentially absent at normal AR sites (Fig. 2g and Extended Data Fig. 4c–f). In these analyses, we also found palindromic AREs to be depleted within cancer-specific enhancers (Fig. 2g). Concordantly, AR ChIP-seq signal at ARE sites was strongest in normal prostate tissues, whereas enhancers containing the chimeric FOXA1:AR half-motif had higher AR binding in PCa specimens (Fig. 2h). H3K27ac ChIP-seq signal from matched tumors showed similar redistribution, with FOXA1:AR half-sites being strongly activated in mCRPC tumors (Extended Data Fig. 4g).
In tumor cells, the aberrant expression of oncogenes is frequently amplified through dense clusters of closely spaced enhancers, often referred to as super-enhancers43. NSD2 inactivation resulted in the loss of over 75% of the AR-bound super-enhancers in PCa (Fig. 2i), including those that are hijacked by activating translocations44 (Extended Data Fig. 4h). The residual super-enhancers also showed a significant decrease in the enhancer strength in the NSD2-null relative to the wild-type LNCaP cells (Fig. 2j). Altogether, these data suggest that, upon ectopic expression, NSD2 assists oncogenic transcription factors (namely FOXA1 and HOXB13) in expanding the AR enhancer circuitry to include chimeric AR half-sites that constitute over two-thirds of PCa AR cistromes.
NSD1 and NSD2 independently enable oncogenic AR activity
Given NSD2 inactivation resulted in disruption of the cancer-specific AR cistrome, we set out to phenotypically characterize NSD2-deficient PCa cells. Here, siRNA/shRNA knockdown or CRISPR knockout of NSD2 significantly impaired hyperproliferative ability of AR-positive PCa cell lines (Fig. 3a and Extended Data Fig. 5a,b). NSD2-deficient cells also lost their ability to invade through Matrigel (Fig. 3b) and form colonies starting from single cells in clonogenic assays (Fig. 3c and Extended Data Fig. 5c). NSD2-null 22RV1 cells also lost their ability to graft when injected subcutaneously in NOD/SCID mice (Fig. 3d). Strikingly, exogenous reintroduction of NSD2-L restored the xenografting potential (green line, Fig. 3d), with resulting tumors growing at a rate comparable to those established with the parental wild-type cells (Extended Data Fig. 5d). NSD2-L re-expression also restored the invasive ability (Extended Data Fig. 5e), along with restoring the expression of KLK3 (Extended Data Fig. 5f). In the same experiment, re-expression of NSD2 variant lacking the SET domain (dSET) failed to rescue KLK3 (Extended Data Fig. 5g), whereas expression of the hyperactive NSD2-E1099K SET-domain mutant29 completely restored KLK3 levels (Extended Data Fig. 5h). Next, we engineered the NSD2-null 22RV1 cells to stably express dTAG-version of the NSD2-L protein fused to the FKBP12F36V tag45, which is rapidly degraded upon treatment with an FKBP12 degrader (Extended Data Fig. 5i). The 22RV1 NSD2-KO+ NSD2-L-FKBP12F36V cells successfully grafted and grew to form tumors in vivo; however, dosing of host animals with an FKBP12 degrader significantly diminished the growth of tumor xenografts (Fig. 3e and Extended Data Fig. 5j). Even at the molecular level, degradation of the exogenous NSD2-L-FKBP12F36V protein resulted in lower levels of KLK3 and chromatin-bound (that is, p-S81) AR without a decrease in total AR expression (top panel, Fig. 3f). Concordantly, chromatin fractionation in these cells showed a marked loss of AR binding with a parallel decrease in H3K36me2 upon NSD2-L-FKBP12F36V degradation (bottom panel, Fig. 3f). In LNCaP NSD2-dTAG models, NSD2 degradation led to downregulation of multiple AR target genes in a time-dependent manner (Extended Data Fig. 5k). Also, other genes encoded in cis-proximity of NSD2-dependent AR sites (see Methods) were similarly downregulated in the NSD2-deficient LNCaP cells (Extended Data Fig. 6a and Supplementary Table 4), and were enriched for oncogenic KRAS, angiogenesis, and G2M checkpoint pathways (Extended Data Fig. 6b). In contrast, genes associated with NSD2-independent AR sites were enriched for developmental pathways and AR/NKX3-1 signaling (Extended Data Fig. 6c). These findings position NSD2 as a molecular 'switch' that activates oncogenic AR cistrome and enables hallmark cancer properties.
Next, in size exclusion chromatography we found NSD2 to co-elute with higher-order AR transcriptional complexes (Extended Data Fig. 7a). NSD2 also co-precipitated with AR in several PCa cell lines (Extended Data Fig. 7b). As previously reported46, using fragment-based coimmunoprecipitation, we confirmed the high mobility group box (HMG-box) domain of NSD2 to interact with the DNA-binding domain (DBD) of AR (Fig. 3g and Extended Data Fig. 7c,d).Furthermore, alanine substitution of three highly conserved HMG-box residues (that is, F463/W491/Y502A) individually or together (triple mutant, Fig. 3g) disrupted its interaction with the AR-DBD in the ectopic HEK293T as well as cell-free purified wheatgerm extract systems (Fig. 3h). This finding suggests that NSD2 directly, and independent of DNA, interacts with the AR-DBD through its HMG-box domain, which is notably absent in other NSD family histone methyltransferases.
Despite a striking loss of neoplastic features, NSD2-deficient PCa cells remained viable. Thus, we speculated if NSD2 paralogs could sustain AR activity through alternative mechanisms. To test this, we knocked-out NSD1 or NSD3 individually in LNCaP and assessed its transcriptional impact. Unlike NSD3, NSD1 loss significantly attenuated the AR and MYC gene programs (Fig. 3i and Extended Data Fig. 7e,f), in addition to reducing the AR protein levels (Extended Data Fig. 7g). NSD1 inactivation also diminished hyperproliferative gene pathways (like E2F and G2M; Extended Data Fig. 7h) and had the strongest reduction in H3K36me2 levels upon a single-gene loss (Fig. 3j and Extended Data Fig. 7g), positioning NSD1 as the predominant H3K36 dimethyltransferase in PCa cells. NSD-catalyzed H3K36me2 mark was recently shown to sterically hinder loading of the H3K27 residue into catalytic pocket of the EZH2 enzyme47, and NSD1 was reported to primarily antagonize the repressive PRC2/EZH2 complex48. Consistently, we found NSD1 loss to trigger a marked increase in EZH2/PRC2 activity, with several repressed target gene signatures being significantly downregulated upon NSD1 knockdown in VCaP cells (top panel, Fig. 3k). This was also confirmed using a PCa-specific PRC2 gene signature (bottom panel, Fig. 3k). Concordantly, treatment of siNSD1 cells with EPZ-6438 had substantially higher residual levels of the H3K27me3 mark relative to the control as well as NSD2-inactivated VCaP cells (Fig. 3l). In these experiments, loss of NSD2 alone had little to no effect on EZH2/PRC2 activity (Fig. 3j–l). These results position NSD1 as the primary writer of the H3K36me2 histone mark that counterbalances the EZH2/PRC2 repressive complex in PCa cells to maintain the hyper-transcriptional AR and MYC gene programs.
Interestingly, the loss of NSD2 led to a marked increase in NSD1 levels in PCa cells (Fig. 3j,m), likely suggesting that NSD1 could sustain residual oncogenic AR activity in these cells. Parallel inactivation of NSD1 and NSD2 in PCa cells resulted in the strongest decrease in H3K36me2 levels and AR target gene expression (Fig. 3j,m), triggering an accumulation of apoptotic marker cleaved-PARP (Extended Data Fig. 7i). Consistently, combined NSD1 and NSD2 inhibition resulted in significant cytotoxicity in AR-positive PCa cells (Fig. 3n), whereas inactivation of either genes alone had little to no tumor-killing effect in prostatic cell lines (Extended Data Fig. 7j). Altogether, these data suggest that NSD1 and NSD2, through distinct mechanisms, promote a hypertranscriptional chromatin state or enable oncogenic AR activity, respectively, in PCa cells.
NSD1/2 dual PROTAC preferentially kills AR+ PCa
Following a medicinal chemistry campaign, we developed a proteolysis targeting chimera (PROTAC) compound, called LLC0150, which co-targets NSD1 and NSD2 (Fig. 4a and Supplementary Notes). LLC0150 links an NSD2 PWWP-domain binding warhead49 to a cereblon E3-ligase-recruiting moiety pomalidomide. Treatment with LLC0150 triggers degradation of NSD1 and NSD2, while sparing NSD3 (Fig. 4b), in a proteasome and cereblon-dependent manner (Extended Data Fig. 8a). LLC0150 had no effect on other PWWP-domain-containing proteins, but showed partial neo-substrate activity (Extended Data Fig. 8b). This first-generation PROTAC also had poor solubility and pharmacokinetic properties for in vivo use. In line with our genetic data, in PCa cells, NSD1/2 co-degradation with LLC0150 triggered a decrease in the expression of AR and MYC, as well as their downstream gene targets (Fig. 4c,d). Acute loss of NSD1 and NSD2 in LLC0150-treated LNCaP cells resulted in impaired AR and FOXA1 chromatin binding (Fig. 4e,f and Extended Data Fig. 8c), with a parallel loss of H3K27ac activation mark at shared AR/FOXA1 enhancer sites (Fig. 4f). Treatment with LLC0150 also diminished the chromatin-bound AR fraction and the H3K36me2 histone mark (Extended Data Fig. 8d). In absence of NSD1/2, DHT-induced expression of AR target genes was significantly weakened (Extended Data Fig. 8e). LLC0150 treatment also markedly disrupted the assembly and activity of AR super-enhancers in LNCaP cells (Extended Data Fig. 8f)
Global transcriptomic analyses of LNCaP and VCaP cells treated with LLC0150 further showed a significant attenuation of proliferative pathways with a parallel induction of apoptotic signaling (Extended Data Fig. 9a). This was confirmed via massive accumulation of cleaved-PARP in the LLC0150-treated AR-positive PCa cell lines (Fig. 4c and Extended Data Fig. 9b). AR-positive PCa cell lines were considerably more sensitive to treatment with LLC0150 relative to the AR-negative disease models, immortalized normal, as well as primary prostate epithelial cells (Extended Data Fig. 9c). Inactive epimer control of LLC0150 (labeled as LLC0150-dead) did not affect the NSD1/2 levels or the viability of PCa cells (Fig. 4b and Extended Data Fig. 9d). Notably, LLC0150 showed marked synergy with enzalutamide—an AR-antagonistic drug—in killing LNCaP and VCaP cells (Fig. 4g and Extended Data Fig. 9e, f). More impressively, LLC0150 also retained cytotoxicity in cell line models that had acquired resistance to enzalutamide (Fig. 4h and Extended Data Fig. 9g). Similarly, several models of AR-positive mCRPC organoids that robustly express NSD2 (Extended Data Fig. 10a), showed significant attenuation of growth upon treatment with LLC0150 in a dose-dependent manner (Extended Data Fig. 10b–d).
Next, we characterized the cytotoxic effect of LLC0150 in a panel of over 110 human-derived normal and cancer cell lines originating from 22 different lineages (Supplementary Table 3). As expected, hematologic cancers harboring activating NSD2 mutations emerged as the most sensitive to treatment with LLC0150 (IC50 ranging from 0.274 - 69.68 nM), which was immediately followed by AR-positive PCa cell lines (shown in red, Fig. 4i). Notably, AR-positive disease models showed preferential cytotoxicity to NSD1/2 combined loss relative to AR-negative disease models as well as a host of normal cell lines. As proof of concept, we next performed direct intratumoral injection of LLC0150 in mice bearing VCaP xenograft tumors (Extended Data Fig. 10e). LLC0150 triggered marked degradation of NSD1/2 in tumor xenografts with a parallel loss in proliferative and gain of apoptotic markers (Extended Data Fig. 10f,g). Altogether, this data suggests that combined loss of NSD1 and NSD2 leads to a dramatic, almost complete, loss of the H3K36me2 histone mark and disruption of the AR/FOXA1 neo-enhancer circuitry, resulting in apoptotic PCa cell death. This positions NSD1/2 paralogs as a targetable digenic dependency in AR-driven, therapy-resistant PCa.
Discussion
Most targeted therapies following surgical resection or radiation of primary PCa inhibit the androgen/AR signaling axis4. However, how the prodifferentiation AR pathway in normal physiology gets reprogrammed to serve as the central oncogene in PCa remains largely unknown. Global AR chromatin-binding profiles are markedly different between normal and transformed prostate epithelia5,6,13–15,50, and FOXA1 and HOXB13 have been implicated in driving AR’s reprogramming upon transformation5,14. However, both FOXA1 and HOXB13 are also expressed in the normal epithelial cells, raising the possibility for additional cofactors to underlie the recruitment of AR to PCa-specific enhancer elements. Here, in a functional CRISPR screen, we identify NSD2 as a coactivator of the AR/FOXA1 enhanceosome. NSD2 is exclusively expressed in PCa cells, wherein it enables functional binding of AR at chimeric AR half-motifs, which majorly comprise the AR neo-enhancer circuitries. Consequently, NSD2 inactivation abolishes hallmark cancer phenotypes, whereas its re-expression in deficient cells restores neoplastic features. This positions NSD2 as a neo-coactivator of AR that assists transcription factors, like FOXA1, HOXB13, and ETS, in redistributing AR on the chromatin, thereby unlocking its oncogenic gene programs.
Intriguingly, in motif analyses of the PCa-specific AR cistromes, we also found significant depletion of the canonical ARE elements. Despite magnitude-folds increase in AR abundance in mCRPC, its loading at cis-regulatory elements comprising only palindromic AREs was significantly diminished. Also, the full ARE-containing sites were particularly inactivated in mCRPC tumors as evidenced by the loss of H3K27ac. This raises an intriguing possibility for the AR transcriptional activity stemming from a subset of canonical elements to rather impede tumor formation and/or progression, which is consistent with the physiological role of AR as a prodifferentiation factor. In fact, hyper-stimulation of AR activity has anti-proliferative effects in PCa cells51, and bipolar androgen therapy involving cyclical inhibition and hyperactivation of AR is being currently tested in advanced patients52,53. These are exciting areas for further research.
We further found the loss of NSD2 in PCa cells to up-regulate NSD1, and co-inactivation of both NSD1/2 paralogs to be acutely cytotoxic. We uncovered that NSD1 and NSD2, through disparate mechanisms converge on wiring and maintaining the oncogenic AR gene program. Although NSD2 directly binds to AR and stabilizes the AR enhanceosome at de novo neo-enhancer elements, NSD1 functions as the primary writer enzyme for the H3K36me2 mark that antagonizes the PRC2/EZH2 repressive complex32,33. We envision the NSD2 function to evolve from enabling oncogenic AR activity in primary AR-dependent PCa to additionally supporting NSD1 in counter-balancing the canonical repressive PRC2 activity in the metastatic castration-resistant disease. Thus, the loss of NSD2 creates an increased dependency on NSD1 in AR-addicted PCa cells, positioning the NSD1/2 paralogs as targetable co-vulnerabilities in advanced disease. Here, we also characterized a dual PROTAC of NSD1 and NSD2 that confirmed co-degradation of these proteins to result in apoptotic cell death in AR-positive PCa. Notably both NSD1 and NSD2 are recurrently altered in hematological malignancies where they function as driver oncogenes (Supplementary Notes). Accordingly, we found LLC0150 to have the highest potency in NSD2-altered cancers. This finding highlights the potential application of this compound in studying and treating these tumors.
In summary, we identify and characterize NSD2 as an essential coactivator of the AR neo-enhanceosome that is exclusively expressed in PCa cells. NSD2 directly binds to AR and enables its loading at cis-regulatory elements harboring chimeric AR half-motifs, comprising over 65% of the malignant AR cistrome. We coalesce these mechanistic insights to propose that AR has two distinct modes of interacting with chromatin: 1) NSD2-independent binding at cis-elements harboring canonical full AREs that are predominantly found in the physiological enhancer circuitry, and 2) NSD2-dependent binding at cis-regulatory elements harboring chimeric AR half-motifs (like FOXA1:AR half) that distinctively constitute the cancer-specific enhancer circuitries of AR (Fig. 5). Furthermore, we uncover NSD1 and NSD2 as a digenic dependency in AR-positive PCa, and develop an NSD1/2 dual PROTAC degrader that shows preferential cytotoxicity in AR-positive PCa. Our findings warrant a focused development of new NSD-targeting therapeutics and evaluation of their efficacy and safety in preclinical and clinical studies.
Methods
Ethical statement
All experiments detailed in this paper were performed in compliance with the Institutional Review Board and the Institutional Animal Care and Use Committee at the University of Pennsylvania and the University of Michigan.
Animal procurement
Animal studies were approved by the Institutional Animal Care and Use Committee at the University of Pennsylvania and/or the University of Michigan. Animal use and care were in strict compliance with institutional guidelines, and all experiments conformed to the relevant regulatory standards by the universities. NOD SCID or NCI SCID/NCr athymic nude mice were obtained from the Jackson Laboratory (strain code: 005557) and Charles River (strain code: 561). All in vivo experiments were initiated with male mice aged 5–8 weeks. All mice were housed in a pathogen-free animal barrier facility and all in vivo experiments were initiated with male mice aged 5-8 weeks. All mice were maintained under the conditions of pathogen-free, 12 h light/12 h dark cycle, temperatures of 18–23 °C, and 40–60% humidity.
Statement on use of human specimens
Prostate tumor patient tissues were acquired from the University of Michigan pathology archives. These tissues were utilized for Immunohistochemistry and multiplex Immunofluorescence experiments to assess for Cytokeratin-8 and NSD2 expression in tumor or adjacent normal prostate cells. Formalin-fixed paraffin-embedded specimens from the archives were used upon approval by the University of Michigan Institutional Review Board and does not require patient consent.
Cell lines
Most cell lines were purchased from the American Type Culture Collection (ATCC) and were cultured following ATCC protocols. For all experiments, LNCaP and 22RV1 cells were grown in RPMI 1640 medium (Gibco) and VCaP cells in DMEM with Glutamax (Gibco) medium supplemented with 10% fetal bovine serum (FBS; Invitrogen). HEK293FT cells were grown in DMEM (Gibco) medium with 10% FBS. All cells were grown in a humidified 5% CO2 incubator at 37 °C. Mycoplasma and cell line genotyping were performed once a fortnight and every month respectively at the University of Michigan Sequencing Core using Profiler Plus (Applied Biosystems). Results from these were compared with corresponding short tandem repeat profiles in the ATCC database to authenticate their identity.
Antibodies
For immunoblotting, the following antibodies were used: NSD1 (NeuroMab: 75-280, 1:1000); NSD2 (Abcam:ab75359, 1:1,000); NSD3 (Cell Signaling Technologies: 92056 S, 1:1,000); KLK3/PSA (Dako:A0562, 1:1,000); FKBP5(Cell Signaling Technologies: 12210, 1:1,000); NKX3-1 (Cell Signaling Technologies:83700 S, 1:1,000); FOXA1 N-terminal (Cell Signaling Technologies: 58613 S; Sigma-Aldrich: SAB2100835, 1:1,000); FOXA1 C-terminal (ThermoFisher Scientific: PA5-27157, 1:1,000); AR (Millipore: 06-680, 1:1,000); AR (Abcam: ab133273, ab108341, 1:1,000); H3 (Cell Signaling Technologies: 3638 S, 1:2,500); GAPDH (Cell Signaling Technologies: 3683, 1:2,500); H3K27me3(Millipore: 07-449, 1:2500); H3K36me2 (Cell Signaling Technologies: 2901 S, Abcam: ab9049, 1:2500); H3K27ac (Active Motif, catalog no39336, catalog no39133, 1:2500); Phospho-AR (Ser-81) (Millipore, catalog no 07-1375-EMD, 1:1,000); HALO (ThermoFisher Scientific, catalog no G9281, 1:1,000); HA (Cell Signaling Technologies, catalog no 3724 S, 1:1000); His (Cell Signaling Technologies, catalog no2365 S, 1:1,000). ChIP-seq assays were performed using the following antibodies: FOXA1 (ThermoFisher Scientific: PA5-27157); AR (Millipore: 06-680); H3K4me1 (Abcam: ab8895); H3K4me2 (CST: C64G9); H3K36me2 (Abcam: ab9049); H3K27me3 (EMD: 07-449), and H3K27ac (Active Motif, catalog no39336).
Cell-free protein-protein interaction studies
In vitro protein expression was carried out by cloning the desired expression cassettes downstream of a Halo- or His-tag to produce fusion proteins. Briefly, AR-DBD was subcloned in pFN21K containing Halo-tag, and NSD2-HMGa was cloned in pcDNA4c containing His-tag. After cloning, the fusion proteins were expressed using the cell-free transcription and translation system (catalog no L4140, Promega) following the manufacturer’s protocol. For each reaction, protein expression was confirmed by Western blot.
A total of 10 μl cell-free reaction containing halo- and His-tag fusion proteins was incubated in PBST (0.1% tween) at 4 °C overnight. Ten microliter HaloLink beads (catalog noG931, Promega) were blocked in BSA at 4 °C for overnight. After washes with PBS, the beads were mixed with AR-NSD2-HMGa and TM mixture and incubated at room temperature for 1 h. Halolink beads were then washed with PBST for four times and eluted in SDS loading buffer. Proteins were separated on SDS gel and blotted with anti-His Ab (CST: catalog no2365 S).
Colony formation assays
For the colony formation assay, approximately 10,000 cells/well in six-well plates (n = 3) were seeded and treated with the required drugs/compounds or vehicle for 12–14 days. Media was replenished every 3-4 days. Colonies were fixed and stained using 0.5% (w/v) crystal violet (Sigma, C0775) in 20% (v/v) methanol for 30 min, washed with distilled deionized water, and air-dried. After scanning the plate, the stained wells were destained with 500 μl 10% acetic acid, and the absorbance was determined at 590 nm using a spectrophotometer (Synergy HT, BioTek Instruments).
Cellular protein fractionation assays
Chromatin-bound proteins were extracted following a protocol previously described17. In brief, 10 million cells were collected, washed with DPBS, and resuspended in 250 μl Buffer A (10 mM HEPES pH 7.9, 10 mM KCl, 1.5 mM MgCl2, 0.34 M sucrose, 10% glycerol and 1 mM DTT) supplemented with 0.1% TritonX-100. After incubation on ice for 10 min, the nuclear pellet was collected by centrifugation at 1,300g for 5 min at 4 °C, washed in Buffer A, and resuspended in Buffer B (3 mM EDTA, 0.2 mM EGTA and 1 mM DTT) with the same centrifugation settings, and incubated on ice for 30 min. The chromatin pellet was collected by centrifugation at 1,700g for 5 min at 4 °C, washed and resuspended in Buffer B with 150 mM NaCl, and incubated on ice for 20 min. After centrifugation at 1,700g for 5 min to remove proteins soluble in 150 mM salt concentrations, the pellet was then incubated in Buffer B with 300 mM NaCl on ice for 20 min and centrifuged again at 1,700g to obtain the final chromatin pellet. The chromatin pellet was dissolved in a sample buffer, sonicated for 15 s, and boiled at 95 °C for 10 min. Immunoblot analysis was conducted on samples as described above. All buffers were supplemented with Pierce protease inhibitor and Halt protease & phosphatase inhibitors.
RNA isolation and quantitative real-time PCR
Standard protocol from the miRNeasy Mini kit (Qiagen) was used to extract total RNA with the inclusion of on-column genomic DNA digestion step using the RNase-free DNase Kit (Qiagen). RNA concentration was estimated using the NanoDrop 2000 spectrophotometer (ThermoFisher Scientific), and 1 g total RNA was used for complementary DNA (cDNA) synthesis using the SuperScript III Reverse Transcriptase enzyme (ThermoFisher Scientific) following manufacturer’s instructions. 20 ng cDNA was used for each polymerase chain reaction (PCR) using the FAST SYBR Green Universal Master Mix (ThermoFisher Scientific), and every sample was quantified in triplicates. Gene expression was normalized and calculated relative to GAPDH and HPRT1 (loading control) using the delta-delta Ct method and normalized to the control group for graphing. Quantitative PCR (qPCR) primers were designed using the Primer3Plus tool (http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi) and synthesized by Integrated DNA Technologies. Primers used in this study are provided in Supplementary Table 5.
siRNA/ASO-mediated gene knockdown
Mammalian cells were seeded in a 6-well plate format at the density ranging from 100,000–250,000 cells per well. 12 h post seeding, cells were transfected with 25 nM of gene-targeting ON-TARGETplus SMARTpool siRNAs (or ASOs) or non-targeting pool siRNAs (or ASOs) as negative control (Dharmacon) using the RNAiMAX reagent (Life Technologies; catalog no: 13778075) on two consecutive days, following manufacturer’s instructions. 72 h after transfection, total RNA and protein were extracted to confirm efficient ( > 80%) knockdown of the target genes. For the siRNA-treated VCaP DMSO/EPZ-6438 RNA-seq experiment (Fig. 3k), cells were pre-treated with control siRNA (siNC) or siRNA targeting NSD1, NSD2, or NSD1/2 (siNSD1, siNSD2) for 30 days, followed by 72 h of EPZ-6438 treatment. Catalog numbers and guide RNA sequences of siRNA SMARTpools (Dharmacon) are provided in Supplementary Table 5.
CRISPR-Cas9-mediated gene knockout
For gene knockouts, cells were seeded in a 6-well plate at a density of 200,000–300,000 cells per well and transduced with viral particles with lentiCRISPR-V2 plasmids coding either non-targeting (sgNC) or sgRNAs targeting NSD1 and NSD2. This was followed by 3 days of puromycin selection, after which proliferation assays were carried out as described below. The lentiCRISPR-V2 vector was a gift from Dr. Feng Zhang’s lab (Addgene plasmid #52961). sgRNA sequences are provided in Supplementary Table 5.
Proliferation assays
For siRNA growth assays, cells were directly plated in a 96-well plate at the density of 2,500–8,000 cells per well and transfected with gene-specific or non-targeting siRNAs, as described above, on day 0 and day 1. Every treatment was carried out in six independent replicate wells. CellTiter-Glo reagent (Promega) was used to assess cell viability at multiple time points after transfection, following the manufacturer’s protocol. Data were normalized to readings from siNC treatment on day 1 and plotted as relative cell viability to generate growth curves. Alternatively, for CRISPR sgRNA growth assays, cells were treated as described above for target gene inactivation and seeded into a 96-well plate at 2500 cells per well, with five-six replicates per group.
Matrigel invasion assay
LNCaP CRISPR clones were grown in 10% CSS-supplemented medium for 48 h for androgen starvation. Matrigel-coated invasion chambers were used and additionally coated with polyethylene terephthalate membrane to allow for fluorescent quantification of the invaded cells (Biocoat: 24-well format, no. 354166). On the upper layer of the chamber, fifty thousand starved cells were resuspended in serum-free medium and were added to each invasion chamber while 20% FBS-supplemented medium was added to the bottom wells to serve as a chemoattractant. 12 h later, medium from the bottom well was aspirated and replaced with 1x HBSS (Gibco) containing 2 μg/ml Calcein-green AM dye (ThermoFisher Scientific; C3100MP) and incubated for 30 min at 37 °C. Invasion chambers were then placed in a fluorescent plate reader (Tecan-Infinite M1000 PRO), and fluorescent signals from the invaded cells at the bottom were averaged across 16 distinct regions per chamber to determine the extent of invasion. For rescue experiments, stable lines overexpressing the NSD2 isoforms were generated. Briefly, to LNCaP NSD2-KO lines, GFP or NSD2-Long isoform containing viruses were added. These lines were then used to perform the invasion assay as described above.
RNA-seq and analysis
RiboErase RNA-seq libraries were prepared using 200–1,000 ng total RNA. Ribosomal RNA was removed by enzymatic digestion of the specific probe-bound duplex rRNA (KAPA RNA Hyper+RiboErase HMR, Roche) and then fragmented to around 200-300 bp with heat in the fragmentation buffer. Following this, double-stranded cDNA was generated, and end-repair and ligation was performed using New England Biolabs (NEB) adapters. Final library preparation was performed by amplification with the 2x KAPA HiFi HotStart mix and NEB dual barcode following the manufacturer’s protocol. Library quality was measured on an Agilent 2100 Bioanalyzer for product size and concentration. Paired-end libraries were sequenced with the Illumina HiSeq 2500, (2 × 100 nucleotide read length) with sequence coverage to 15–20 M paired reads.
RNA data was first processed using kallisto (version 0.46.1)54. Then analysis was performed in R, first read counts were normalized and filtered (counts >10) using EdgeR55 (edgeR_3.39.6), and differential expression was performed using Limma-Voom (limma_3.53.10)56. GSEA was performed using fgsea (fgsea_1.24.0)57 and comparisons were made to several signatures, including an experimentally derived AR signature, the human hallmark MsigDB signatures (/www.gsea-msigdb.org), and the hallmark androgen response signature (HALLMARK_ANDROGEN_RESPONSE.v7.5.1.gmt). In addition, R packages tidyverse, gtable, gplots, ggplot2 and EnhancedVolcano (EnhancedVolcano_1.15.0) were also used for generating summary figures (R version 4.2.1 (refs. 58–60)).
ChIP-seq and data analysis
ChIP experiments were carried out using the Ideal ChIP-seq Kit for Transcription Factors or Histones (Diagenode) as per the manufacturer’s protocol. Chromatin from 2 × 106 cells (for transcription factors) and 1×106 cells (for histones) was used for each ChIP reaction with 4 or 2 μg of the target protein antibody, respectively. In brief, cells were trypsinized and washed twice with 1× PBS, followed by crosslinking for 8 min in 1% formaldehyde solution. Crosslinking was terminated by the addition of 1/10 volume 1.25 M glycine for 5 min at room temperature followed by cell lysis and sonication (Bioruptor, Diagenode), resulting in an average chromatin fragment size of 200 bp. Fragmented chromatin was then used for immunoprecipitation using various antibodies, with overnight incubation at 4 °C. ChIP DNA was de-crosslinked and purified using the iPure Kit V2 (Diagenode) using the standard protocol. Purified DNA was then prepared for sequencing as per the manufacturer’s instructions (Illumina). ChIP samples (1–10 ng) were converted to blunt-ended fragments using T4 DNA polymerase, Escherichia coli DNA polymerase I large fragment (Klenow polymerase), and T4 polynucleotide kinase (New England BioLabs (NEB)). A single adenine base was added to fragment ends by Klenow fragment (3′ to 5′ exo minus; NEB), followed by ligation of Illumina adaptors (Quick ligase, NEB). The adaptor-ligated DNA fragments were enriched by PCR using the Illumina Barcode primers and Phusion DNA polymerase (NEB). PCR products were size selected using 3% NuSieve agarose gels (Lonza) followed by gel extraction using QIAEX II reagents (Qiagen). Libraries were quantified and quality checked using the Bioanalyzer 2100 (Agilent) and sequenced on the Illumina HiSeq 2500 Sequencer (125-nt read length).
ChIP-seq analysis was carried out by first assessing reads and performing trimming using Trimmomatic version 0.39 (settings TruSeq3-PE-2.fa:2:30:10, minlen 50)61. Paired-end reads were aligned to hg38 (GRCh38) human genome reference using bwa (“bwa mem” command with options −5SP -T0, version 0.7.17-r1198-dirty)62. Alignments were then filtered using both samtools63 (v1.1,quality score cutoff of 20) and picard64 MarkDuplicates (v(2.26.0-1-gbaf4d27-SNAPSHOT), removed duplicates). Peak calling was performed using MACS2 (v2.2.7.1)65 using narrowpeak setting for narrow peaks and a second set for broad peaks (for example, H3K27ac,–broad -B–cutoff-analysis–broad-cutoff 0.05–max-gap 500). Finally, bedtools (v2.27.1)66 was used to remove blacklisted regions of the genome from the peak list (Encode’s exclusion list ENCFF356LFX.bed). UCSC’s tool wigtoBigwig (v2.8) was used for conversion to bigwig formats67.
Overlap analysis of ChIP-seq data
Peak lists from MACS were compared between samples using R package ChIPpeakAnno68–70. Peaks within 500 bp of each other were reduced to single peaks. Overlaps were calculated using settings maxgap = −1L, minoverlap=0 L, ignore.strand=TRUE, connectedPeaks=c(‘keepAll’, ‘min’, ‘merge’). Comparisons of enrichment sites to the known gene database (TxDb.Hsapiens.UCSC.hg38.knownGene) were performed using R package ChIPseeker. A distance of ±1 kb was used to assess relative distance from gene regions.
HOMER motif calling
De novo and known motif enrichment analysis was performed using HOMER (version v.4.10)42,71. Custom motif matrices were generated manually, then assigned score thresholds using HOMER’s utility seq2profile, allowing for two mismatches. This setting was chosen after iteratively comparing performance with the pre-existing FOXA1:AR motif. Further customization was achieved by checking for presence of motif elements with different spacings, ranging from 0-8 ‘N’s added between elements, and flipping the order of elements in each of these: FOXA1-ARE, ARE-FOXA1, FOXA-N-ARE, ARE-N-FOXA1, FOXA1-NN-ARE, ARE-NN-FOXA1, etc.
Custom motifs were then further validated using XSTREME (v5.5.5)72 from the MEME Suite71 to check for additional configurations and variations in padding between motif elements.
Enrichment heatmaps
The software Deeptools (v3.5.1) was used to generate enrichment plots and read-density heatmaps. A reference point parameter of ±2.5 kb for histone signals and ±1.5 kb for AR/FOX signals was used. Other settings included using ‘skipzeros’, ‘averagetype mean,’ and ‘plotype se’. The Encode blacklist ENCFF356LFX was used73.
Motif and signal plots
Sushi (Sushi_1.32.0) package in R was used to layer signal tracks. The plotBedgraph(), plotGenes(), plotBed() functions were used with output from ChIP-seq alignments and output from HOMER motif enrichment analysis74.
Superenhancer analysis
Super-enhancer regions were identified with findPeaks function from HOMER (version v.4.10)42 using options “-style super -o auto”. In addition, the option “-superSlope −1000” was added to include all potential peaks, which were used to generate the super-enhancer plot (super-enhancer score versus ranked peaks). The slope value of greater than or equal to 1 was used to identify super-enhancer clusters. The input files to findPeaks were tag directories generated from alignment files in SAM format with makeTagDirectory function from HOMER. Super-enhancer scores were plotted using the normalized tag count values between the datasets.
Single-cell data analysis
Three public scRNA-seq datasets from primary PCa were downloaded from GEO or a website provided by the author (GSE193337, GSE185344, www.prostatecellatlas.org)75. Using cell annotation from the Tuong et al. dataset as reference, luminal cells were annotated for the other two datasets with the label transfer method of Seurat29. Pseudo-bulk expression profiles30 were generated by summing counts from all cells annotated as luminal cells for each patient (tumor and normal samples separately). Normalization was achieved by computing normalization factors with the trimmed mean of M-values method31 and applying the cpm function from edgeR (v3.36.0)32. Box plots of NSD2 and PCA3 expression were generated with ggpubr33 and paired Wilcoxon test was used to test the significance of the difference between benign and tumor (only patients with paired benign and tumor samples were included).
IHC and immunofluorescence
IHC was performed on 4-μm-thick formalin-fixed, paraffin-embedded tissue sections using anti-NSD2 mouse monoclonal primary antibody (catalog no. ab75359, Abcam), anti-AR rabbit monoclonal primary antibody (catalog no. 760-4605, Roche-Ventana), and anti-CK-8 rabbit monoclonal primary antibody (catalog no. ab53280, Abcam). Singleplex IHC was carried out on the Ventana ULTRA automated slide staining system (Roche-Ventana Medical Systems) using the OmniView Universal diaminobenzidine detection kit (catalog no. 760-500, Roche-Ventana) and hematoxylin II (catalog no. 790-2208, Roche-Ventana) for counterstain. Staining was evaluated under 100× and 200× magnification using a brightfield microscope.
Assessment of drug synergism
To determine the synergy between two drug treatments, cells were treated with increasing concentrations of either drug for 120 h, followed by the determination of viable cells using the CellTiter-Glo Luminescent Cell Viability Assay (Promega). The experiment was carried out in four biological replicates. The data were expressed as percentage inhibition relative to baseline, and the presence of synergy was determined by the Bliss method using the synergy finder R package.
Statistics and reproducibility
All immunoblot experiments were repeated at least two to three times. For immunofluorescent staining experiments, number of biological replicates used in each case are noted in the figure legend. While representative images are shown in some panels, for example Figure 3b and Extended Data Fig. 5e, quantitation from all independent replicates is included. No statistical methods were used to predetermine sample sizes for any experiments. For all analyses, data distribution was assumed to be normal, but this was not formally tested. All immunofluorescence data quantification was performed in a double-blinded manner by in-house pathologists. For in vivo animal experiments.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41588-024-01893-6.
Supplementary information
Source data
Acknowledgements
We thank R. Wang from the sequencing team at the Michigan Center for Translational Pathology (MCTP) for generating NGS libraries and coordinating sequencing. We thank J. Waninger from MCTP for her technical help with size exclusion chromatography. We thank R. Rebernick for help with patient gene correlation analyses. We also acknowledge critical support from the University of Michigan Biomedical Research Core Facilities, especially the Vector, Flow Cytometry, and Proteomics and Peptide Synthesis Cores. We thank C. Vakoc from Cold Spring Harbor Laboratories for generously sharing the domain-focused epigenetics sgRNA library. We thank A. Heller, O. Rivera, A. Pawar and R. Natesan from the University of Pennsylvania for providing technical and bioinformatics assistance. This research was supported by the following mechanisms: Prostate Cancer Foundation (PCF), Prostate Specialized Programs of Research Excellence (SPORE) Grant P50-CA186786, National Cancer Institute Outstanding Investigator Award R35-CA231996, National Cancer Institute P30-CA046592 (Rogel Cancer Center investigators, including A.P. and A.M.C.), Early Detection Research Network Grant U2C CA271854 (A.M.C.), and National Cancer Institute R00-CA187664. A.P. is supported by the NIH/NCI K00 fellowship (K00-CA245825), Michigan SPORE Career Enhancement Program, PCF Young Investigator Award, and Rogel Fellowship. L.X. is supported by a Department of Defense Prostate Cancer Research Program Idea Development Award (W81XWH-21-1-0500) and a PCF Young Investigator Award. A.M.C. is a Howard Hughes Medical Institute Investigator, A. Alfred Taubman Scholar, and American Cancer Society Professor. I.A.A. is supported by grants from the National Institute of Health and Department of Defense (R01-CA249210-0 and W81XWH-17-0404).
Extended data
Author contributions
A.P., A.M.C. and I.A.A. conceived the study and the experiments; A.P. and S.E. designed and carried out the multiomics and functional experiments with assistance from J.L., S.E.C., Y.L., L.X., X.W., T.H., Y.Q., P.G., M.J., R.R., S.M., M.P., E.M., J.C.T., M.L., F.S. and X.C.; B.K.V. designed and carried out the fragment-based coimmunoprecipitation and functional experiments with assistance from S.A. and R.R.; B.K.V. and C.K.D. conducted in vivo experiments. A.P. and E.Y. carried out all the bioinformatics analyses with assistance from J.G., S.V. and M.A.; Y.Z. analyzed the single-cell RNA-seq datasets; O.T. provided guidance; L.L., C.H., Z.W. and K.D. were involved in the synthesis and structural validation of the LLC0150 compound; A.P. wrote the manuscript and developed the figures with feedback from A.M.C. and I.A.A.; S.E. wrote the Methods section, with help from B.K.V., E.Y., Z.W., R.M. and Y.Z. Next-generation sequencing related to data shown in Figs. 1c, 3k,m and 4d as well as related supplementary figures was performed at the University of Michigan, whereas next-generation sequencing related to data shown in Figs. 1g,i, 2a–e,i,j and 4e,f as well as related supplementary figures was carried out at the University of Pennsylvania. All authors discussed the results, provided feedback and reviewed the final manuscript.
Peer review
Peer review information
Nature Genetics thanks Cory Abate-Shen, Gong-Hong Wei and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
All data are available in the manuscript or the supplementary information. Raw next-generation sequencing data, including ChIP-seq and RNA-seq, generated in this study are deposited in the Gene Expression Omnibus (GEO) repository (accession number: GSE242737) at National Center for Biotechnology Information. ChIP-seq data from normal, primary PCa and mCRPC were pulled from GEO repositories GSE130408 and GSE70079. Three public scRNA-seq datasets from primary PCa were downloaded from GEO (GSE193337 and GSE185344) or a web portal provided by the authors. Source data are provided with this paper.
Code availability
All custom codes used for data analyses are freely available from the following public repositories: Github: https://github.com/mctp/NSD2_req_subunit (ref. 76) and Zenodo: 10.5281/zenodo.12979564 (ref. 77).
Competing interests
All the authors declare no competing financial interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Sanjana Eyunni, Brijesh Kumar Verma.
Contributor Information
Abhijit Parolia, Email: aparolia@umich.edu.
Arul M. Chinnaiyan, Email: arul@umich.edu
Irfan A. Asangani, Email: asangani@upenn.edu
Extended data
is available for this paper at 10.1038/s41588-024-01893-6.
Supplementary information
The online version contains supplementary material available at 10.1038/s41588-024-01893-6.
References
- 1.Heinlein, C. A. & Chang, C. Androgen receptor in prostate cancer. Endocr. Rev.25, 276–308 (2004). [DOI] [PubMed] [Google Scholar]
- 2.Nadal, M. et al. Structure of the homodimeric androgen receptor ligand-binding domain. Nat. Commun.8, 14388 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Shaffer, P. L., Jivan, A., Dollins, D. E., Claessens, F. & Gewirth, D. T. Structural basis of androgen receptor binding to selective androgen response elements. Proc. Natl Acad. Sci. USA101, 4758–4763 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Mills, I. G. Maintaining and reprogramming genomic androgen receptor activity in prostate cancer. Nat. Rev. Cancer14, 187–198 (2014). [DOI] [PubMed] [Google Scholar]
- 5.Pomerantz, M. M. et al. The androgen receptor cistrome is extensively reprogrammed in human prostate tumorigenesis. Nat. Genet.47, 1346–1351 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pomerantz, M. M. et al. Prostate cancer reactivates developmental epigenomic programs during metastatic progression. Nat. Genet.52, 790–799 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wong, Y. N. S., Ferraldeschi, R., Attard, G. & de Bono, J. Evolution of androgen receptor targeted therapy for advanced prostate cancer. Nat. Rev. Clin. Oncol.11, 365–376 (2014). [DOI] [PubMed] [Google Scholar]
- 8.Robinson, D. et al. Integrative clinical genomics of advanced prostate cancer. Cell161, 1215–1228 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.& Abida, W. et al. Genomic correlates of clinical outcome in advanced prostate cancer.Proc. Natl. Acad. Sci. USA116, 11428–11436 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sahu, B. et al. Dual role of FoxA1 in androgen receptor binding to chromatin, androgen signalling and prostate cancer. EMBO J.30, 3962–3976 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Parolia, A. et al. Distinct structural classes of activating FOXA1 alterations in advanced prostate cancer. Nature571, 413–418 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Armenia, J. et al. The long tail of oncogenic drivers in prostate cancer. Nat. Genet.50, 645–651 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Berchuck, J. E. et al. The prostate cancer androgen receptor cistrome in African American men associates with upregulation of lipid metabolism and immune response. Cancer Res.82, 2848–2859 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Grbesa, I. et al. Reshaping of the androgen-driven chromatin landscape in normal prostate cells by early cancer drivers and effect on therapeutic sensitivity. Cell Rep.36, 109625 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Augello, M. A. et al. CHD1 loss alters AR binding at lineage-specific enhancers and modulates distinct transcriptional programs to drive prostate tumorigenesis. Cancer Cell35, 603–617.e8 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Asangani, I. A. et al. Therapeutic targeting of BET bromodomain proteins in castration-resistant prostate cancer. Nature510, 278–282 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Rasool, R. U. et al. CDK7 inhibition suppresses castration-resistant prostate cancer through MED1 inactivation. Cancer Discov.9, 1538–1555 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Xiao, L. et al. Targeting SWI/SNF ATPases in enhancer-addicted prostate cancer. Nature601, 434–439 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Welti, J. et al. Targeting the p300/CBP axis in lethal prostate cancer. Cancer Discov.11, 1118–1137 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Faivre, E. J. et al. Selective inhibition of the BD2 bromodomain of BET proteins in prostate cancer. Nature578, 306–310 (2020). [DOI] [PubMed] [Google Scholar]
- 21.Lasko, L. M. et al. Discovery of a selective catalytic p300/CBP inhibitor that targets lineage-specific tumours. Nature550, 128–132 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jin, L. et al. Therapeutic targeting of the CBP/p300 bromodomain blocks the growth of castration-resistant prostate cancer. Cancer Res.77, 5564–5575 (2017). [DOI] [PubMed] [Google Scholar]
- 23.Schmitges, F. W. et al. Histone methylation by PRC2 is inhibited by active chromatin marks. Mol. Cell42, 330–341 (2011). [DOI] [PubMed] [Google Scholar]
- 24.Hansen, K. H. et al. A model for transmission of the H3K27me3 epigenetic mark. Nat. Cell Biol.10, 1291–1300 (2008). [DOI] [PubMed] [Google Scholar]
- 25.Poulin, M. B. et al. Transition state for the NSD2-catalyzed methylation of histone H3 lysine 36. Proc. Natl Acad. Sci. USA113, 1197–1201 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Malgeri, U. et al. Detection of t(4;14)(p16.3;q32) chromosomal translocation in multiple myeloma by reverse transcription-polymerase chain reaction analysis of IGH-MMSET fusion transcripts. Cancer Res.60, 4058–4061 (2000). [PubMed] [Google Scholar]
- 27.Chesi, M. et al. The t(4;14) translocation in myeloma dysregulates both FGFR3 and a novel gene, MMSET, resulting in IgH/MMSET hybrid transcripts. Blood92, 3025–3034 (1998). [PubMed] [Google Scholar]
- 28.Vo, J. N. et al. The genetic heterogeneity and drug resistance mechanisms of relapsed refractory multiple myeloma. Nat. Commun.13, 3750 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jaffe, J. D. et al. Global chromatin profiling reveals NSD2 mutations in pediatric acute lymphoblastic leukemia. Nat. Genet.45, 1386–1391 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Li, J. et al. NSD2-E1099K mutation leads to glucocorticoid-resistant B cell lymphocytic leukemia in mice. Blood136, 3–4 (2020).32614960 [Google Scholar]
- 31.Kuo, A. J. et al. NSD2 links dimethylation of histone H3 at lysine 36 to oncogenic programming. Mol. Cell44, 609–620 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Xiao, L. et al. Epigenetic reprogramming with antisense oligonucleotides enhances the effectiveness of androgen receptor inhibition in castration-resistant prostate cancer. Cancer Res.78, 5731–5740 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ku, S. Y. et al. Rb1 and Trp53 cooperate to suppress prostate cancer lineage plasticity, metastasis, and antiandrogen resistance. Science355, 78–83 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Shi, J. et al. Discovery of cancer drug targets by CRISPR-Cas9 screening of protein domains. Nat. Biotechnol.33, 661–667 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Groner, A. C. et al. TRIM24 is an oncogenic transcriptional activator in prostate cancer. Cancer Cell29, 846–858 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fong, K.-W., Zhao, J. C., Song, B., Zheng, B. & Yu, J. TRIM28 protects TRIM24 from SPOP-mediated degradation and promotes prostate cancer progression. Nat. Commun.9, 5007 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yang, P. et al. Histone methyltransferase NSD2/MMSET mediates constitutive NF-κB signaling for cancer cell proliferation, survival, and tumor growth via a feed-forward loop. Mol. Cell. Biol.32, 3121–3131 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ezponda, T. et al. The histone methyltransferase MMSET/WHSC1 activates TWIST1 to promote an epithelial-mesenchymal transition and invasive properties of prostate cancer. Oncogene32, 2882–2890 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Asangani, I. A. et al. Characterization of the EZH2-MMSET histone methyltransferase regulatory axis in cancer. Mol. Cell49, 80–93 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Aytes, A. et al. NSD2 is a conserved driver of metastatic prostate cancer progression. Nat. Commun.9, 5201 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Li, N. et al. AKT-mediated stabilization of histone methyltransferase WHSC1 promotes prostate cancer metastasis. J. Clin. Invest.127, 1284–1302 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell38, 576–589 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hnisz, D. et al. Super-enhancers in the control of cell identity and disease. Cell155, 934–947 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Tomlins, S. A. et al. Distinct classes of chromosomal rearrangements create oncogenic ETS gene fusions in prostate cancer. Nature448, 595–599 (2007). [DOI] [PubMed] [Google Scholar]
- 45.Nabet, B. et al. The dTAG system for immediate and target-specific protein degradation. Nat. Chem. Biol.14, 431–441 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Kang, H.-B. et al. The histone methyltransferase, NSD2, enhances androgen receptor-mediated transcription. FEBS Lett.583, 1880–1886 (2009). [DOI] [PubMed] [Google Scholar]
- 47.Finogenova, K. et al. Structural basis for PRC2 decoding of active histone methylation marks H3K36me2/3. Elife9, e61964 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Drosos, Y. et al. NSD1 mediates antagonism between SWI/SNF and polycomb complexes and is required for transcriptional activation upon EZH2 inhibition. Mol. Cell82, 2472–2489.e8 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dilworth, D. et al. A chemical probe targeting the PWWP domain alters NSD2 nucleolar localization. Nat. Chem. Biol.18, 56–63 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Shen, M., Demers, L.-K., Bailey, S. D. & Labbé, D. P. To bind or not to bind: cistromic reprogramming in prostate cancer. Front. Oncol.12, 963007 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Schweizer, M. T. et al. Effect of bipolar androgen therapy for asymptomatic men with castration-resistant prostate cancer: results from a pilot clinical study. Sci. Transl. Med.7, 269ra2 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Markowski, M. C. et al. Clinical efficacy of bipolar androgen therapy in men with metastatic castration-resistant prostate cancer and combined rumor-suppressor loss. Eur. Urol. Open Sci.41, 112–115 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Markowski, M. C. et al. A multicohort open-label phase II trial of bipolar androgen therapy in men with metastatic castration-resistant prostate cancer (RESTORE): a comparison of post-abiraterone versus post-enzalutamide cohorts. Eur. Urol.79, 692–699 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol.34, 525–527 (2016). [DOI] [PubMed] [Google Scholar]
- 55.Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics26, 139–140 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res.43, e47 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Korotkevich, G., Sukhov, V. & Sergushichev, A. Fast gene set enrichment analysis. Preprint at https://www.biorxiv.org/content/10.1101/060012v2.full (2019).
- 58.Wickham, H. Ggplot2. Wiley Interdiscip. Rev. Comput. Stat.3, 180–185 (2011). [Google Scholar]
- 59.Wickham, H. & Pedersen, T. L. gtable: Arrange grobs in tables. R package version 0.3. 0.
- 60.Blighe, K. EnhancedVolcano: Publication-ready volcano plots with enhanced colouring and labeling. Githubhttps://github.com/kevinblighe/EnhancedVolcano (2024).
- 61.Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics25, 1754–1760 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience10, giab008 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Institute, B. ‘Picard Toolkit’, Broad institute, GitHub repository. Picard Toolkithttps://broadinstitute.github.io/picard/ (2024).
- 65.Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol.9, R137 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics26, 841–842 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Kent, W. J., Zweig, A. S., Barber, G., Hinrichs, A. S. & Karolchik, D. BigWig and BigBed: enabling browsing of large distributed datasets. Bioinformatics26, 2204–2207 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Zhu, L. J. Integrative analysis of ChIP-chip and ChIP-seq dataset. Methods Mol. Biol.1067, 105–124 (2013). [DOI] [PubMed] [Google Scholar]
- 69.Yu, G., Wang, L.-G. & He, Q.-Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics31, 2382–2383 (2015). [DOI] [PubMed] [Google Scholar]
- 70.Wang, Q. et al. Exploring Epigenomic Datasets by ChIPseeker. Curr. Protoc.2, e585 (2022). [DOI] [PubMed] [Google Scholar]
- 71.Bailey, T. L., Johnson, J., Grant, C. E. & Noble, W. S. The MEME Suite. Nucleic Acids Res.43, W39–W49 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Grant, C. E. & Bailey, T. L. XSTREME: Comprehensive motif analysis of biological sequence datasets. Preprint at bioRxiv10.1101/2021.09.02.458722 (2021).
- 73.Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res.44, W160–W165 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Phanstiel, D. H., Boyle, A. P., Araya, C. L. & Snyder, M. P. Sushi.R: flexible, quantitative and integrative genomic visualizations for publication-quality multi-panel figures. Bioinformatics30, 2808–2810 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Killian, T. & Gatto, L. Exploiting the DepMap cancer dependency data using the depmap R package. F1000Res.10, 416 (2021). [Google Scholar]
- 76.NSD2_req_subunit: Code for Paper: NSD2 Is a Requisite Subunit of the AR/FOXA1 Neo-Enhanceosome in Promoting Prostate Tumorigenesis. Githubhttps://github.com/mctp/NSD2_req_subunit (2024). [DOI] [PMC free article] [PubMed]
- 77.Eleanor, C. mctp/NSD2_req_subunit: v1.0.0. 10.5281/zenodo.12979565
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are available in the manuscript or the supplementary information. Raw next-generation sequencing data, including ChIP-seq and RNA-seq, generated in this study are deposited in the Gene Expression Omnibus (GEO) repository (accession number: GSE242737) at National Center for Biotechnology Information. ChIP-seq data from normal, primary PCa and mCRPC were pulled from GEO repositories GSE130408 and GSE70079. Three public scRNA-seq datasets from primary PCa were downloaded from GEO (GSE193337 and GSE185344) or a web portal provided by the authors. Source data are provided with this paper.
All custom codes used for data analyses are freely available from the following public repositories: Github: https://github.com/mctp/NSD2_req_subunit (ref. 76) and Zenodo: 10.5281/zenodo.12979564 (ref. 77).