Abstract
Lineage plasticity, the ability of a cell to alter its identity, is an increasingly common mechanism of adaptive resistance to targeted therapy in cancer. An archetypal example is the development of neuroendocrine prostate cancer (NEPC) after treatment of prostate adenocarcinoma (PRAD) with inhibitors of androgen signaling. NEPC is an aggressive variant of prostate cancer that aberrantly expresses genes characteristic of neuroendocrine (NE) tissues and no longer depends on androgens. Here, we investigate the epigenomic basis of this resistance mechanism by profiling histone modifications in NEPC and PRAD patient-derived xenografts (PDXs) using chromatin immunoprecipitation and sequencing (ChIP-seq). We identify a vast network of cis-regulatory elements (N~15,000) that are recurrently activated in NEPC. The FOXA1 transcription factor (TF), which pioneers androgen receptor (AR) chromatin binding in the prostate epithelium, is reprogrammed to NE-specific regulatory elements in NEPC. Despite loss of dependence upon AR, NEPC maintains FOXA1 expression and requires FOXA1 for proliferation and expression of NE lineage-defining genes. Ectopic expression of the NE lineage TFs ASCL1 and NKX2-1 in PRAD cells reprograms FOXA1 to bind to NE regulatory elements and induces enhancer activity as evidenced by histone modifications at these sites. Our data establish the importance of FOXA1 in NEPC and provide a principled approach to identifying cancer dependencies through epigenomic profiling.
Subject terms: Prostate cancer, Epigenomics
The molecular processes that lead to neuroendocrine prostate cancer after treating prostate adenocarcinoma (PRAD) are not well understood. Here the authors show that regulation by FOXA1 and changes in the epigenomic profile drive the transition from PRAD to a neuroendocrine phenotype.
Introduction
In recent years, potent AR pathway inhibitors have extended the survival of patients with metastatic prostate cancer1,2. Prostate tumors inevitably escape AR inhibition through reactivation of AR signaling or, increasingly, via lineage plasticity3,4. The mechanisms underlying lineage plasticity remain unclear but likely involve transdifferentiation of PRAD to NEPC rather than de novo emergence of NEPC. NEPC and PRAD tumors from an individual patient share many somatic DNA alterations, implying a common ancestral tumor clone5. While the genomic profiles of NEPC and PRAD are relatively similar, their gene expression profiles and clinical behavior differ markedly6. We therefore set out to characterize epigenomic differences between NEPC and PRAD, hypothesizing that reprogramming of distinct regulatory elements drives their divergent phenotypes.
In this study, we profile histone modifications in NEPC and PRAD patient-derived xenografts (PDXs), identifying ~15,000 regulatory elements that are dormant in PRAD but consistently activated in NEPC. A significant portion of NEPC-enriched regulatory elements are bound in NEPC by FOXA1, a transcription factor associated with prostate development and AR-mediated transcription7,8. The FOXA1 cistrome, or set of binding sites, is extensively reprogrammed in NEPC, with loss of FOXA1 binding at PRAD-enriched regulatory elements and gain of FOXA1 at NEPC-enriched regulatory elements. Unexpectedly, FOXA1 remains active in NEPC despite the loss of luminal identity and AR expression. FOXA1 is necessary for proliferation and neuroendocrine gene expression in experimental models of NEPC. Ectopic expression of NEPC-associated transcription factors is sufficient to reprogram the FOXA1 cistrome in PRAD to resemble its counterpart in NEPC and activate NEPC transcriptional programs. Our data indicate a dependency of NEPC upon FOXA1, which may have therapeutic implications.
Results
Comparative epigenomic profiles of PRAD and NEPC
We performed ChIP-seq for the histone post-translational modification H3K27ac to identify active regulatory elements in the LuCaP PDX series9, a set of xenografts derived from advanced PRAD (N = 22) and treatment-emergent NEPC (N = 5). We identified a median of 55,095 H3K27ac peaks per sample (range 37,599–74,640) (Supplementary Data 1). Notably, the transcriptomes of the LuCaP PDXs reflect differences in gene expression observed between clinical PRAD and NEPC metastases (Supplementary Fig. 1a), indicating their relevance to clinical prostate cancer.
Unsupervised hierarchical clustering and principal component analysis based on genome-wide H3K27 acetylation cleanly partitioned NEPC and PRAD LuCaP PDXs (Fig. 1a and Supplementary Fig. 1b, c). We identified 14,985 sites with eight-fold or greater increases in H3K27 acetylation in NEPC compared to PRAD at an adjusted p-value of 10−3. We termed these sites neuroendocrine-enriched candidate regulatory elements (“Ne-CREs”; Fig. 1b, Supplementary Data 2, and Supplementary Fig. 1d). A smaller set of sites (4338) bore greater H3K27ac signal in PRAD (termed “Ad-CREs”). Liver metastases from clinical NEPC and PRAD demonstrated enrichment of H3K27ac at Ne-CREs and Ad-CREs, respectively, confirming that the LuCaP PDX models reflect lineage-specific epigenomic features of clinical prostate tumors (Supplementary Fig. 1e).
Ad-CREs were found near prostate-lineage genes such as KLK3, HOXB13, and NKX3-1, while Ne-CREs resided near genes enriched for neuronal and developmental annotations, including CHGA, ASCL1, and SOX210 (Fig. 1c and Supplementary Table 1). Genes with higher expression in NEPC compared to PRAD were enriched for nearby Ne-CREs (Supplementary Fig. 1f) and formed three-dimensional contacts with a greater number of Ne-CREs as assessed by H3K27ac HiChIP (Fig. 1d, Supplementary Fig. 1g, h, and Supplementary Data 3 and 4). For example, ASCL1, which encodes a neural lineage TF that is highly upregulated in NEPC (Supplementary Fig. 1a), interacts with 15 gene-distal Ne-CREs between 280 and 465 kb telomeric to ASCL1, including two NEPC-restricted super-enhancers within intronic regions of C12ORF42 (Fig. 1e). These results suggest that Ne-CREs regulate neuroendocrine transcriptional programs through interaction with NEPC gene promoters.
We nominated candidate TFs that may orchestrate NEPC lineage gene expression by binding to Ne-CREs. Lineage-defining TF genes often reside within densely H3K27-acetylated super-enhancers11 and form core regulatory circuits, or “cliques”, by mutual binding of one another’s cis-regulatory regions12,13. Several TFs showed clique enrichment specifically in NEPC (Fig. 1f) and/or were encompassed by NEPC-restricted super-enhancers (Supplementary Fig. 2), including known NE lineage TFs (e.g., ASCL1 and INSM1) and candidates such as HOXB2-5.
FOXA1 is an essential transcription factor in NEPC
Notably, a single TF gene, FOXA1, demonstrated clique enrichment in all NEPC and PRAD LuCaP PDXs (Fig. 1f). FOXA1 is a pioneer TF of endodermal tissues7 with a critical role in prostate development8 but no characterized function in NEPC. The forkhead motif recognized by FOXA1 was the second most significantly enriched nucleotide sequence within Ne-CREs (Fig. 1g). FOXA2, a previously-reported NEPC TF14, does not wholly account for the forkhead motif enrichment because FOXA2 was not expressed in several NEPC samples (Fig. 2a, b and Supplementary Fig. 3a). In contrast, FOXA1 was expressed in all NEPCs (Fig. 2a, b and Supplementary Table 2), as well as in resident neuroendocrine cells of benign prostate tissue (Supplementary Fig. 3).
Multiple lines of investigation supported a pivotal role of FOXA1 in NEPC. A super-enhancer encompassed FOXA1 in all NEPC LuCaP PDXs (Fig. 2c and Supplementary Fig. 2). In NEPC, the FOXA1 promoter shed contacts with its regulatory region identified in PRAD15 and looped to a distinct NEPC-restricted super-enhancer (Fig. 2d). Both the distal super-enhancer and promoter were co-bound by FOXA1 and ASCL1, suggesting an auto-regulatory circuit that is characteristic of master transcriptional regulators16. Suppression of FOXA1 in a variety of NEPC cellular models17,18 demonstrated that FOXA1 is essential for cellular proliferation and expression of NE markers, including NE lineage TFs such as FOXA2 and INSM1 (Fig. 2e–h). Analysis of a published shRNA screen confirmed a dependency on FOXA1 in the NEPC cell line NCI-H660 (Fig. 2i). Thus, FOXA1 exhibits several features of a master transcriptional regulator in NEPC.
We profiled FOXA1-binding sites in NEPC and PRAD using ChIP-seq. FOXA1 relocates to a distinct set of binding sites in NEPC PDXs (Fig. 3a), which overlap with the majority of Ne-CREs (Fig. 3b). In PRAD, Ne-CREs were devoid of FOXA1 binding and heterochromatic as assayed by ATAC-seq, but they acquired FOXA1 binding and chromatin accessibility in NEPC (Fig. 3c). Conversely, Ad-CREs lost FOXA1 binding in NEPC and became less accessible by ATAC-seq. These changes were not due to FOXA1 mutations, which can promote neuroendocrine gene transcription19, because all NEPC LuCaPs contained wild-type FOXA1 sequences. To contextualize the extent of FOXA1 reprogramming in NEPC, we compared FOXA1-binding profiles in normal prostate epithelium, localized PRAD, and PDXs derived from metastatic PRAD. At the same level of stringency, fewer than 500 sites exhibited differential FOXA1 binding between these categories; by comparison, FOXA1 binding was gained at 20,935 and lost at 29,308 sites in NEPC compared to metastatic PRAD (Fig. 3d).
NEPC genes contain bivalent promoter marks in PRAD
We sought to understand the mechanism by which FOXA1 binding is reprogrammed in NEPC. In addition to DNA sequence, cooperative binding with partner TFs is an important determinant of pioneer factor localization20. Since the motifs recognized by ASCL1 and NKX2-1 were highly enriched at Ne-CREs (Fig. 1g), we tested whether overexpression of these TFs in the PRAD cell line LNCaP could induce FOXA1 binding at Ne-CREs. Overexpression of ASCL1 and NKX2-1 (A + N) increased FOXA1 binding at NEPC-enriched FOXA1-binding sites (Fig. 4a, b) and induced H3K27 acetylation of Ne-CREs (Fig. 4c–f). ASCL1 co-localized with FOXA1 at NEPC-enriched FOXA1-binding sites and Ne-CREs (Fig. 4g–h). A+N expression recapitulated global transcriptional changes between NEPC and PRAD, including suppression of AR and induction of SYP and CHGA (Fig. 4i–k). Thus, ectopic expression of ASCL1 and NKX2-1 is sufficient to partially reprogram FOXA1 binding in PRAD to Ne-CREs and induce de novo H3K27 acetylation at these regions, with resultant NEPC gene expression.
Despite intense interest, it remains unclear why PRAD can adopt a seemingly unrelated lineage to overcome androgen blockade, while most cancers do not dramatically alter their cellular identity throughout treatment. Lineage tracing studies have demonstrated that the epithelial cells that give rise to PRAD share a common developmental progenitor with resident neuroendocrine cells in the prostate21,22. In this common progenitor cell, Ne-CREs and their FOXA1-binding sites might be physiologically poised for activation upon commitment to a neuroendocrine lineage. In support of this model, genes that are highly expressed in normal neuroendocrine prostate cells are also highly expressed in NEPC (Fig. 5a), and are enriched for nearby Ne-CREs and NEPC-restricted FOXA1-binding sites (Fig. 5b). Additionally, Ne-CREs are relatively hypomethylated in normal prostate tissue and PRAD despite absence of H3K27 acetylation, a feature of decommissioned enhancers that were active in development (Fig. 5c)23,24.
We hypothesized that a neuroendocrine epigenomic program is encoded in the developmental history of the prostate, thereby priming NEPC genes for inappropriate activation under the selective pressure of androgen blockade. Consistent with this hypothesis, many genes that become highly expressed in NEPC have “bivalent” (H3K4me3+/H3K27me3+) promoter histone marks in normal prostate tissue and PRAD (Fig. 6a). Bivalent genes are thought to be poised for lineage-specific activation upon removal of H3K27me3 at the appropriate stage of development25,26. Our data suggested that a similar principle underlies transcriptional changes in neuroendocrine differentiation of prostate cancer. H3K27me3 levels decreased in NEPC compared to PRAD at 633 gene promoters, which were enriched for binding sites of the REST repressor of neuronal lineage transcription27 (Supplementary Fig. 4). Similar numbers of these promoters were bivalent (H3K4me3+/H3K27me3+; n = 195) and repressed (H3K4me3−/H3K27me3+; n = 229) in PRAD (Fig. 6b). Critically, however, genes with bivalent (H3K4me3+) promoters in PRAD became more highly expressed in NEPC (Fig. 6c) than H3K4me3− genes. These bivalent genes, which included NEPC TFs ASCL1, INSM1, and SOX2, may have been prepared for activation in the development of a prostate progenitor cell. Their residual H3K4me3 and promoter hypomethylation (Fig. 6d) suggest heightened potential for reactivation24 in NEPC with the disruption of pro-luminal AR-driven transcriptional programs.
Discussion
In summary, our work demonstrates that the cis-regulatory landscape of prostate cancer is extensively reprogrammed in NEPC. Epigenomic profiling of human NEPC xenografts supports a central role of FOXA1 in this reprogramming. The FOXA1 cistrome shifts dramatically between NEPC and PRAD, with gain of FOXA1-binding sites at NEPC regulatory elements and loss of FOXA1 at PRAD elements. FOXA1 exhibits features of a master transcriptional regulator. It is encompassed by a super-enhancer in NEPC and is involved in core regulatory circuits with neuronal lineage TFs such as ASCL1. Future studies will be necessary to determine whether the shift in FOXA1-binding sites is required for neuroendocrine differentiation, or merely correlative. In either case, we establish FOXA1 as a dependency in NEPC. The finding that FOXA1 is essential in NEPC has perhaps been overlooked because candidate drivers of NEPC have been identified mainly based on differential transcript expression or somatic DNA alterations6,10,28,29. Our study demonstrates that epigenomic profiling can identify cancer dependencies that are difficult to detect based on mutational and transcriptional profiling alone.
A dependency of NEPC on FOXA1 is unexpected based on prior work. FOXA1 has been reported to inhibit neuroendocrine differentiation of prostate adenocarcinoma, based on the observations that FOXA1 is downregulated in NEPC and that FOXA1 knock-down induces neuroendocrine features in PRAD cell lines30. Our data demonstrate that FOXA1 remains crucial in NEPC despite consistent, modest transcript downregulation in NEPC compared to PRAD. Our H3K27ac HiChIP data reveal that in NEPC, FOXA1 contacts distal super-enhancers that are distinct from its PRAD enhancers and contain binding sites for NE-associated TFs such as ASCL1 and INSM1 (Fig. 2d and Supplementary Fig. 5). Thus, an NEPC-specific regulatory program may maintain FOXA1 expression at lower levels that are conducive to NE gene expression, reconciling our findings with the reported pro-neuroendocrine effects of partial FOXA1 suppression in PRAD30. This hypothesis should be tested in future mechanistic studies. While our data show that FOXA1 is essential in NEPC, further studies are required to determine if FOXA1 cistrome reprogramming directly activates Ne-CREs and to assess its role dynamic lineage plasticity.
FOXA1 may have a more general role in controlling neuroendocrine differentiation. For example, in small cell lung cancer (SCLC), a neuroendocrine lung cancer variant that can emerge de novo or from EGFR-mutant lung adenocarcinoma after targeted kinase inhibition, FOXA1 is highly expressed and encompassed by a super-enhancer31. We observe extensive H3K27 acetylation in SCLC cell lines specifically at Ne-CREs and NEPC-enriched FOXA1-binding sites, suggesting similar enhancer usage between in SCLC and NEPC (Supplementary Fig. 6), consistent with recent reports28,32. The large set of Ne-CREs and NEPC-enriched FOXA1-binding sites could aid the pathologic diagnosis of neuroendocrine differentiation, which can be challenging and relies on only a handful of markers. Ultimately, therapeutic targeting of FOXA1 and/or proteins that collaborate with or covalently modify this TF33 presents an attractive strategy as FOXA1 is a common vulnerability in both PRAD and NEPC.
Methods
Patient-derived xenograft and tissue specimens
LuCaP patient-derived xenografts (PDXs) have been described previously9,34,35 with the exception of LuCaP 208.1. LuCaP 208.1 was derived from treatment-emergent NEPC and demonstrates typical small cell histology. All LuCaP PDXs, including LuCaP 208.1 were derived from resected prostate cancer with explicit written consent of patient donors as described previously9 under a protocol approved by the University of Washington Human Subjects Division IRB (#39053). PDXs were generated in compliance with all relevant ethical regulations for animal testing and research under protocol #39053. The University of Washington Institutional Animal Care and Use Committee approved all animal procedures including generation and processing of PDXs, including LuCaP 208.1. Liver metastasis needle biopsy specimens were obtained from the Dana-Farber Cancer Institute Gelb Center biobank and were collected under a DFCI/Harvard Cancer Center IRB-approved protocol (01-045) with informed consent of patients. Metastases were reviewed by a clinical pathologist. The NEPC metastasis was obtained from a patient with de novo metastatic prostate adenocarcinoma after 17 months of androgen deprivation therapy with leuprolide and bicalutamide. Immunohistochemistry revealed staining for synaptophysin, chromogranin, and NKX3-1 (weak), and absence of RB1, AR, and PSA.
Epigenomic profiling
chromatin immunoprecipitation in LuCaP PDXs
Frozen tissue (20–30 mg for histone mark ChIP and 50–80 mg for transcription factor ChIP) was pulverized using the CryoPREP dry impactor system (Covaris). The tissue was then fixed using 1% formaldehyde (Thermo fisher) in phosphate-buffered saline (PBS) for 18 min either at 37 degrees Celsius (histone mark ChIP) or at room temperature (transcription factor ChIP) and was quenched with 125 mM glycine. Chromatin was lysed in ice-cold lysis buffer (50 mM Tris, 10 mM EDTA, 1% SDS with protease inhibitor for histone mark ChIP; 0.1% SDS, 0.5% sodium deoxycholate and 1% NP-40 with protease inhibitor for transcription factor ChIP) and was sheared to 300–800 bp using the Covaris E220 sonicator (105 watt peak incident power, 5% duty cycle, 200 cycles/burst for 10 min for histone mark ChIP; 140 watt peak incident power, 5% duty cycle, 200 cycles/burst for 20 min for transcription factor ChIP). Five volumes of dilution buffer (1% Triton X-100, 2 mM EDTA, 150 mM NaCl, 20 mM Tris-HCl pH 8.1) were added to chromatin for histone mark ChIP. The sample was then incubated with antibodies (H3K27ac, Diagenode, C15410196; H3K27me3, Cell Signaling 9733S; H3K4me3, Diagenode C15410003 premium; FOXA1, ab23738, Abcam) coupled with protein A and protein G beads (Life Technologies) at 4 degrees Celsius overnight. The chromatin was washed with RIPA wash buffer (100 mM Tris pH 7.5, 500 mM LiCl, 1% NP-40, 1% sodium deoxycholate) for 10 min six times and rinsed with TE buffer (pH 8.0) once. DNA was purified using MinElute column followed by incubation in the de-crosslinking buffer (1% SDS, 0.1 M NaHCO3 with Proteinase K and RNase A) at 65 degrees Celsius.
LNCaP ChIP
ChIP in LNCaP was performed according to published protocols36. Ten million cells were fixed with 1% formaldehyde at room temperature for 10 min and quenched. Cells were collected in lysis buffer (1% NP-40, 0.5% sodium deoxycholate, 0.1% SDS and protease inhibitor (#11873580001, Roche) in PBS)37. Chromatin was sonicated to 300–800 bp using a Covaris E220 sonicator (140 watt peak incident power, 5% duty cycle, 200 cycleburtst). Antibodies (FOXA1, ab23738, Abcam; H3K27ac, C15410196, Diagenode; ASCL1, ab74065) were incubated with 40 μl of Dynabeads protein A/G (Invitrogen) for at least 6 h before immunoprecipitation of the sonicated chromatin overnight. Chromatin was washed with LiCl wash buffer (100 mM Tris pH 7.5, 500 mM LiCl, 1% NP-40, 1% sodium deoxycholate) six times for 10 min sequentially.
ChIP sequencing
Sequencing libraries were generated from purified IP sample DNA using the ThruPLEX-FD Prep Kit (Rubicon Genomics). Libraries were sequenced using 150-base paired-end reads on an Illumina platform (Novogene).
ATAC-seq
LuCaP PDX tissues were resuspended and dounced in 300 ul of RSB buffer (10 mM Tris-HCl pH 7.4, 10 mM NaCl, and 3 mM MgCl2 in water) containing 0.1% NP-40, 0.1% Tween-20, and 0.01% digitonin. Homogenates were transferred to a 1.5 ml microfuge tube and incubated on ice for 10 min. Nuclei were filtered through a 40 μm cell strainer and nuclei were washed with RSB buffer and counted. Fifty thousand nuclei were resuspended in 50 μl of transposition mix38 (2.5 μl transposase (100 nM final), 16.5 μl PBS, 0.5 μl 1% digitonin, 0.5 μl 10% Tween-20, and 5 μl water) by pipetting up and down six times. Transposition reactions were incubated at 37 °C for 30 min in a thermomixer with shaking at 1000 r.p.m. Reactions were cleaned with Qiagen columns. Libraries were amplified39 and sequenced on an Illumina Nextseq 500 with 35 base paired-end reads.
ChIP-seq data analysis
ChIP-sequencing reads were aligned to the human genome build hg19 using the Burrows-Wheeler Aligner (BWA) version 0.7.1740. Non-uniquely mapping and redundant reads were discarded. MACS v2.1.1.2014061641 was used for ChIP-seq peak calling with a q-value (FDR) threshold of 0.01. ChIP-seq data quality was evaluated by a variety of measures, including total peak number, FrIP (fraction of reads in peak) score, number of high-confidence peaks (enriched >10-fold over background), and percent of peak overlap with DNAse hypersensitivity (DHS) peaks derived form the ENCODE project. ChIP-seq peaks were assessed for overlap with gene features and CpG islands using annotatr42. IGV v2.8.243 was used to visualize normalized ChIP-seq read counts at specific genomic loci. ChIP-seq heatmaps were generated with deepTools v3.3.144 and show normalized read counts at the peak center ±2 kb unless otherwise noted. Overlap of ChIP-seq peaks was assessed using BEDTools v2.26.0. Peaks were considered overlapping if they shared one or more base pairs.
Identification and annotation of PRAD- and NEPC-enriched ChIP-seq peaks
Sample–sample clustering, principal component analysis, and identification of lineage-enriched peaks were performed using Cobra v2.045 (https://bitbucket.org/cfce/cobra/src/master/), a ChIP-seq analysis pipeline implemented with Snakemake46. ChIP-seq data from PRAD and NEPC LuCaP PDXs were compared to identify H3K27ac, H3K27me3, and FOXA1 peaks with significant enrichment in the NEPC or PRAD lineage. Only LuCaP PDXs from distinct patients were included, with the exception of the H3K27me3 differential peak analysis, which included both LuCaP 145.1 and 145.2, two LuCaP PDXs derived from distinct NEPC metastases from a single patient. A union set of peaks for each histone modification or TF was created using BEDTools. narrowPeak calls from MACS were used for H3K27ac and FOXA1, while broadPeak calls were used for H3K27me3. The number of unique aligned reads overlapping each peak in each sample was calculated from BAM files using BEDtools. Read counts for each peak were normalized to the total number of mapped reads for each sample. Quantile normalization was applied to this matrix of normalized read counts. Using DEseq2 v1.14.147, lineage-enriched peaks were identified at the indicated FDR-adjusted p-value (padj) and log2 fold-change cutoffs (H3K27ac, padj < 0.001, |log2 fold-change| >3; FOXA1, padj < 0.001, |log2 fold-change| >2; H3K27me3, padj < 0.01, |log2 fold-change| > 1). Unsupervised hierarchical clustering was performed based on Spearman correlation between samples. Principal component analysis was performed using the prcomp R function. Enriched de novo motifs in differential peaks were detected using HOMER version 4.7. The top non-redundant motifs were ranked by adjusted p-value.
The GREAT tool48 (V3.0) was used to asses for enrichment of Gene Ontology (GO) and MSigDB perturbation annotations among genes near differential ChIP-seq peaks, assigning each peak to the nearest gene within 500 kb. The cistromedb toolkit (http://dbtoolkit.cistrome.org/) was used to compare ChIP-seq peaks for overlap with peaks from a large database of uniformly analyzed published ChIP-seq data (quantified as a “GIGGLE score”)49. Published TFs and histone marks were ranked by similarity to the querry dataset based on the top 1000 peaks in each published dataset. Prior to cistromedb toolkit analysis, ChIP-seq peaks were mapped from hg19 to hg38 using the UCSC liftover tool (https://genome.ucsc.edu/cgi-bin/hgLiftOver).
For analysis of H3K27 acetylation in lung cancer at lineage-enriched candidate regulatory elements, fastq files were generated from sequence read archives (SRA) from published ChIP-seq experiments for SCLC50 and LUAD51,52 (SRA numbers SRR568435, SRR3098556, SRR4449027, SRR4449025, and SRR6124068).
For Fig. 5c, H3K27ac ChIP-seq peaks from primary peripheral blood monocytes (ENCFF540CVX) and epithelial keratinocytes (ENCFF943CBQ)53 were used as a comparator to peaks derived from LuCaP PDXs. For these comparisons, monocyte and keratinocyte peaks within 1 kb of a LuCaP peak were excluded.
RNA-seq and differential expression analysis
RNA-seq data from human adenocarcinoma and NEPC have been reported previously5 and were obtained from dbGaP (accession number phs000909.v1.p1). Transcriptomes were sequenced from two replicates from each of five PRAD LuCaP PDXs (23, 77, 78, 81, and 96) and five NEPC LuCaP PDXs (49, 93, 145.1, 145.2, and 173.1). RNA concentration, purity, and integrity were assessed by NanoDrop (Thermo Fisher Scientific Inc.) and Agilent Bioanalyzer. RNA-seq libraries were constructed from 1 μg total RNA using the Illumina TruSeq Stranded mRNA LT Sample Prep Kit according to the manufacturer’s protocol. Barcoded libraries were pooled and sequenced on the Illumina HiSeq 2500 generating 50 bp paired-end reads. FASTQ files were processed using the VIPER workflow54. Read alignment to human genome build hg19 was performed with STAR v 2.7.0f55. Cufflinks was used to assemble transcript-level expression data from filtered alignments56. Differential gene expression analysis (NEPC vs. PRAD) was conducted using DESeq247.
H3K27ac HiChIP
Pulverized frozen tissue from LuCaP 173.1 was fixed with 1% formaldehyde in PBS at room temperature for 10 min36. Sample was incubated in lysis buffer and digested with MboI (NEB) for 4 h. After 1 h of biotin incorporation with biotin dATP, the sample was ligated using T4 DNA ligase for 4 h. Chromatin was sheared using 140 PIP, 5% duty cycle, and 200 cycles/burst for 8 min in shearing buffer composed of 1% NP-40, 0.5% sodium deoxycholate, and 0.1% SDS in PBS (LNCaP) or using 100 PIP, 5% duty cycle, 200 cycles/burst for 3 min in 1% SDS, 50 mM Tris (pH 8.1), and 5 mM EDTA (LuCaP 173.1). ChIP was then performed using H3K27Ac antibody (Diagenode, C1541019)57.
Immunoprecipitated sample was pulled down with streptavidin C1 beads (Life Technologies) and treated with Transposase (Illumina). Amplification was performed for the number of cycles required to reach 1/3 of the maximal fluorescence on quantitative PCR (qPCR) plot with SYBR® Green I(Life Technologies). Libraries were sequenced using 150-base paired-end reads on the Illumina platform (Novogene).
Alignment and filtering using HiC-Pro
We processed paired-end fastq files using HiC-Pro58 to generate intra- and inter-chromosomal contact maps. The reads were first trimmed to remove adaptor sequences using Trim Galore (https://github.com/FelixKrueger/TrimGalore). Default settings from HiC-Pro were used to align reads to the hg19 human genome, assign reads to MboI restriction fragments, and remove duplicate reads. Only uniquely mapped valid read pairs involving two different restriction fragments were used to build the contact maps.
FitHiChIP
We applied FitHiChIP59 for bias-corrected peak calling and DNA loop calling.
We used MACS2 broadPeak peak calls from H3K27ac ChIP-seq in LuCaP 173.1 (NEPC). 44,609 peaks were called at a q-value < 0.01. We used a 5 kb resolution and considered only interactions between 5 kb and 3 Mb. We used peak-to-peak (stringent) interactions for the global background estimation of expected counts (and contact probabilities for each genomic distance), and peak-to-all interactions for the foreground, meaning at least one anchor must overlap a H3K27ac peak. The corresponding FitHiChiP options specified are “IntType = 3” and “UseP2PBackgrnd = 1”.
Assignment of enhancer-promoter interactions using H3K27ac HiChIP data
NCBI RefSeq genes (hg19) were downloaded from the UCSC genome table browser (https://genome.ucsc.edu/cgi-bin/hgTables). Only uniquely mapping genes were considered. The longest transcript was selected for genes with multiple annotated transcripts. We searched for H3K27ac HiChIP loops with one anchor (defined with a 5 kb window) overlapping a region between 0 and 5 kb upstream of a gene transcriptional start site. We selected subset of these loops for which the second anchor (with a 5 kb window) overlapped with H3K27ac peaks identified by ChIP-seq in LuCaP 173.1 (NEPC) or with NEPC-enriched H3K27ac peaks (Ne-CREs). Gene promoters and distal H3K27ac peaks/Ne-CREs were considered looped if each overlapped with an anchor of the same high-confidence H3K27ac HiChIP loop(s). To examine the association of regulatory element looping with gene expression, genes were binned by the number of distinct, looped Ne-CREs or H3K27ac peaks. Differential expression between NEPC and PRAD LuCaP PDXs, as assessed by DESeq2 analysis of LuCaP RNA-seq data, was plotted for genes in each bin. Wilcoxon rank-sum p-values were calculated for differential expression of genes looped to one versus two or more H3K27ac/Ne-CRE peaks. A p-value < 0.01 was considered significant.
Master transcription factor analysis
Super-enhancer ranking analyses
Enhancer and super-enhancer (SE) calls were obtained using the Rank Ordering of Super-enhancer (ROSE2) algorithm11. We selected SEs assigned to transcription factors (TFs)60,61, and for each sample, we obtained the ranks of all TF SEs. Considering only the top 5% TFs by median ranking in NEPC or PRAD, we applied a one-sided Mann–Whitney U-test to identify lineage-enriched TF SEs (FDR = 10%).
Clique enrichment and clustering analysis
Clique enrichment scores (CESs) for each TF were calculated using clique assignments from Coltron62. Coltron assembles transcriptional regulatory networks (cliques) based on H3K27 acetylation and TF-binding motif analysis. The clique enrichment score for a given TF is the number of cliques containing the TF divided by the total number of cliques. We incorporated ATAC-seq data to restrict the motif search to regions of open chromatin. Using the CES, we performed clustering (distance = Canberra, agglomeration method = ward.D2) considering only TFs that appear in cliques in at least 80% of the samples in at least one lineage group (4 out of 5 NEPC and 11 out of 14 PRAD).
Motif enrichment at super-enhancers with loops to the FOXA1 locus
H3K27ac HiChIP data were used to select distal SEs that form three-dimensional contacts with the FOXA1 locus. We used the Coltron algorithm to search for TF motifs in ATAC-seq peaks within these SEs. We considered all TFs that were categorized as expressed by Coltron based on H3K27ac levels at the TF gene locus. Motif enrichment for a TF was calculated as the total number of non-overlapping base pairs (bp) covered by the TF motif, divided by the summed length (in bp) of the SEs. Values in the heatmap legend correspond to percent coverage (i.e., the largest value corresponds to 0.4%).
FOXA1 mutational profiling
FOXA1 mutational status was assessed from exome sequence data (62x–110X depth of coverage). Each LuCaP PDX was sequenced using the Illumina Hi-seq platform with 100 bp paired-end reads. Hybrid capture was performed SeqCapV3. Mouse genome subtraction was performed using the mm10 genome build and reads were aligned to human reference genome hg19. For sequence analysis, bam files processed as per Genome Analysis Toolkit (GATK) best practice guideline63. We Used MuTect2 and HaplotypeCaller for mutation calls. All mutations were manually reviewed and subsequently annotated using Annovar64. Copy number was derived using the Sequenza R package65. Factera66 was used to predict structural events involving FOXA1.
FOXA1 siRNA knock-down
WCM154 organoids were cultured and maintained as described18. Organoids were dissociated to single cells using TrypLE (ThermoFisher). One million cells were resuspended in 20 μl of electroporation buffer (BTXpress) and mixed with 60 pmole of control or FOXA1 On-target pool siRNA (Dharmacon). Then organoid-siRNA mixtures were transferred to a 16-well NucleocuvetteTM Strip and nucleofection was performed in a 4D-Nucleofector (Lonza). Following nucleofection, 105 organoids cells were grown in a 12-well plate coated with 1% collagen I (ThermoFisher) for 7 days. Both adherent and floating cells were collected and stained with 0.4% trypan blue solution (ThermoFisher). Total cell numbers were measured by a hemocytometer. Cell proliferation with FOXA1 knock-down was normalized to control siRNA cells.
FOXA1 shRNA knock-down
LNCaP, LNCaP 42D, and LNCaP 42F cells were seeded in parallel 6-well plates at 500, 500, or 100 k, respectively. Twenty-four hours later, cells were infected with lentivirus containing shRNAs targeting GFP control or FOXA1. Forty-eight hours following infection, equal cell numbers were seeded, and proliferation was assayed 6 days later using a Vi-Cell. Seventy-two hours following infection, a second plate infected in parallel was harvested for immunoblotting. The target sequence against GFP was CCACATGAAGCAGCACGACTT (shGFP). The target sequences against FOXA1 were GCGTACTACCAAGGTGTGTAT (shFOXA1-1) and TCTAGTTTGTGGAGGGTTAT (shFOXA1-2).
FOXA1 CRISPR-Cas9 knock-out
Blasticidin-resistant Cas9-positive LNCaP, LNCaP 42D, and LNCaP 42F cells were cultured in 20 μg/ml blasticidin (Thermo Fisher Scientific, NC9016621) for 72 h to select for cells with optimal Cas9 activity. LNCaP, LNCaP 42D, and LNCaP 42F cells were seeded in parallel 6-well plates at 300, 300, 300, or 60 k, respectively. Cells were infected after 24 h with lentiviruses expressing sgRNAs targeting GFP control or FOXA1. Cells were subject to puromycin selection and harvested for immunoblot after 3 days. Six days following selection, cell viability was determined using a Vi-Cell. The target sequences against GFP were AGCTGGACGGCGACGTAAA (sgGFP1) and GCCACAAGTTCAGCGTGTCG (sgGFP2). The target sequences against FOXA1 were GTTGGACGGCGCGTACGCCA (sgFOXA1-1), GTAGTAGCTGTTCCAGTCGC (sgFOXA1-2), CAGCTACTACGCAGACACGC (sgFOXA1-3), and ACTGCGCCCCCCATAAGCTC (sgFOXA1-4).
Western blots
For WCM154 western blots, cell pellets were lysed in RIPA buffer (MilliporeSigma, 20–188) supplemented with Protease/Phosphatase Inhibitor Cocktail (Cell Signaling Technology, 5872S). Protein concentrations were assayed with a Pierce BCA Protein Assay Kit (Thermo Fisher Scientific, PI23225), and protein was subsequently denatured in NuPAGE LDS sample buffer (Thermo Fisher Scientific, NP0007) containing 5% β-Mercaptoethanol. Thirteen micrograms of each protein sample was loaded onto NuPAGE 4–12% Bis-Tris Protein gels (Thermo Fisher Scientific), and samples were run in NuPAGE MOPS SDS Running Buffer (Thermo Fisher Scientific, NP0001). Following electrophoresis, proteins were transferred to nitrocellulose membranes via an iBlot apparatus (Thermo Fisher Scientific). After blocking in Odyssey Blocking Buffer (LI-COR Biosciences, 927-70010) for 1 h at room temperature, membranes were cut and incubated in primary antibodies diluted 1:1000 in Odyssey Blocking Buffer overnight at 4 °C. The next morning, membranes were washed three times with phosphate-buffer saline, 0.1% Tween (PBST) and then incubated with fluorescent anti-rabbit secondary antibodies (Thermo Fisher Scientific, NC9401842) for 1 h at room temperature. Membranes underwent five PBST washes and were then imaged using an Odyssey Imaging System (LI-COR Biosciences). Primary antibodies used include FOXA1 (Cell Signaling Technology, 58613S) and β-actin (Cell Signaling Technology, 8457L).
For LNCaP, LNCaP 42D, and LNCaP 42F western Blots, cell lysate was extracted using RIPA lysis buffer (Sigma) containing protease inhibitor (Roche) and phosphatase inhibitor (ThermoFisher). Fifty micrograms of protein was subjected to a 4–15% Mini-PROTEAN Precast electrophoresis gel (Bio-Rad) then transferred to 0.22 µm nitrocellulose membrane (Bio-Rad) and blocked in 5% blotting grade blocker (Bio-Rad). Membranes were incubated with primary antibodies overnight (FOXA1, Abcam, 1:2000, ab23738; Synaptophysin, Cell Marque, 1:5000, MRQ-40; INSM1, Santa Cruz, 1:2000, sc-377428; FOXA2, Abcam; 1:2500, ab108422; Chromogranin A, Abcam, 1:2000, ab15160; Vinculin, Cell signaling, 1:5000, #13901). Membranes were then washed in 1x Tris-buffered saline with 0.5% Tween-20 (Boston BioProducts) and incubated with secondary antibodies (mouse, Bio-Rad, 1:2500; rabbit, Bio-Rad, 1:2500). Western HRP substrate kit was used to detect chemiluminescent signal (Millipore, Classico).
Analysis of FOXA1-binding sites across prostate cancer states
FOXA1 cistromes were compared across different states of prostate cancer progression (normal prostate, prostate-localized adenocarcinoma, PDXs derived from metastatic castration resistant prostate cancer, and PDXs derived from NEPC). FOXA1 ChIP from normal prostate tissue and prostate-localized adenocarcinoma will be reported separately (Pomerantz et al., submitted). For normal prostate tissue FOXA1 ChIP, tissue cores were obtained from regions of prostatectomy specimens with dense epithelium and no evidence of neoplasia on review by a genitourinary pathologist. PDX samples used are listed in Supplementary Data 1. PDXs derived from localized prostate cancer were excluded from this analysis. As the normal prostate and localized adenocarcinoma samples were sequenced with single-end sequencing with an average of ~20 million reads, paired-end sequencing data from LuCaP PDXs were down-sampled to 20 M reads, using a single-end trimmed to 75 base pairs using seqtk (https://github.com/lh3/seqtk).
Pairwise comparisons were made between normal prostate (N = 5) and localized PRAD (N = 5), localized PRAD and metastatic PRAD PDXs (N = 11), and metastatic PRAD PDXs and NEPC PDXs (N = 5) using DESeq2 as described above. Peaks were considered significantly different between groups at a log2 |fold-change| threshold of 2 and FDR-adjusted p-value threshold of 0.001. “Shared” peaks were defined as the intersection of all peaks that were present in each group but not significantly different in any comparison.
Immunohistochemistry
Immunohistochemistry was performed on tissue microarray (TMA) sections. TMA slides were stained for FOXA1 (Abcam ab170933, 1:100 dilution with 10 mM NaCitrate antigen retrieval) and FOXA2 (Abcam ab108422, 1:500 dilution with 10 mM NaCitrate antigen retrieval) using a standard procedure67. Rabbit IgG was used as a negative control. Nuclear staining intensity was assigned levels 0, 1+, 2+, or 3+ and H-scores were calculated as: [1 x (% of 1+ cells) + 2 x (% of 2+ cells) + 3 x (% of 3+ cells)]. Evaluations were performed in a blinded fashion.
ASCL1/NKX2-1 overexpression in LNCaP
Transduction of LNCaP cells with ASCL1 and NKX2-1
The open reading frames of ASCL1 and NKX2-1 were cloned into the pLX_TRC302 lentiviral expression vector (Broad Institute) using the gateway recombination system. A construct expressing eGFP (pLX_TRC302_GFP) was used as a negative control. Viruses were generated by transfecting 293T cells with packaging vectors pVsVg and pDelta8.9. Supernatant was collected after 48 h. LNCaP cells were transduced in the presence of 4 μg/ml polybrene and harvested after 3 days for RNA-seq, ATAC-seq, and ChIP-seq.
ChIP seq was performed as described above, using 10–15 million cells fixed with 1% paraformaldehyde for 10 min at room temperature, followed by quenching with glycine. RNA was isolated using QIAGEN RNeasy Plus Kit and cDNA synthesized using Clontech RT Advantage Kit. Quantitative PCR was performed on a Quantstudio 6 using SYBR green. Primers used for quantitative reverse transcription PCR (qRT-PCR) are listed in Supplementary Table 4.
Analysis of promoter H3K4 and H3K27 trimethylation
Refseq gene coordinates (hg19) were compiled, selecting the longest isoform where multiple were annotated. Normalized tag counts from H3K27me3 and H3K4me3 ChIP-seq within 2 kb of each transcriptional start site (TSS) were calculated for each sample, then averaged across multiple samples in each group (five NEPC PDXs, five PRAD PDXs, three normal prostates; Pomerantz et al., submitted). Contours were calculated using the R function geom_density_2d from the ggplot2 package; they represent the 2d kernel density estimation for all included transcriptional start sites. Gene promoters were assigned “active”, “bivalent”, “unmarked”, and “repressed” annotations based on H3K4me3 and H3K27me3 levels. High/low cutoffs for these marks were determined as follows. First, the H3K4me3 normalized tag counts near each TSS were fit to two normal distributions using the normalmixEM R function from the mixtools R package. The cutoff between H3K4me3-high and -low was set at four standard deviations below the mean value of the H3K4me3-high distribution. Next, the normalized H3K27me3 tag counts near H3K4me3-high TSSs were fit to two normal distributions. The cutoff for H3K27me3-high promoters was set at four standard deviations above the mean value of the H3K27me3-low distribution. The Pearson Chi-squared test was used to quantify significance of enrichment of NEPC-upregulated genes in the “bivalent” quadrant compared to “repressed” or “unmarked” quadrants. NEPC-upregulated genes were defined as those with log2 fold-change >3 and adjusted p-value < 1 x 10−6 in NEPC vs. PRAD. The results of the analysis were robust to using other p-value and differential expression thresholds.
Methylation analysis of normal prostate
Whole-genome bisulfite sequencing data from histologically normal prostate tissue were reported previously68 and processed in our prior report69. Briefly, bases with a Phred score below 20 were trimmed and adapter sequences were discarded using Trim Galore! (version 0.4.4_dev; https://github.com/FelixKrueger/TrimGalore). Trimmed reads were mapped to the hg19 reference genome using Bowtie2 and Bismark v0.19.070. Methylated read counts and total coverage for CpGs were extracted using the bismark_methylation_extractor command and portions of methylated CpGs were computed with the bsseq Bioconductor package71. CpG methylation at indicated sites was visualized using deepTools44 v3.3.1.
Statistical tests
All statistical tests were two-sided except where otherwise indicated.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work was supported by the PNW Prostate Cancer SPORE P50 CA097186, DOD W81XWH-19-1-0565, DOD W81XWH-17-1-0415, P01 CA163227, R01 CA193910, R01 CA233863, The Prostate Cancer Foundation, The PhRMA Foundation, The Richard M. Lucas Foundation, the European Union’s Horizon 2020 Research and Innovation program under the Marie Skłodowska-Curie grant agreement No. 754490, the National Cancer Institute (T32CA009172), and by Rebecca and Nathan Milikowsky. We would like to thank the patients who generously donated tissue that made this research possible.
Source data
Author contributions
S.C.B. analyzed ChIP-seq data and wrote the manuscript. X.Q. assisted with ChIP-seq data analysis. D.Y.T., J.H. and T.A. performed ASCL1/NKX2-1 overexpression experiments. S.Y.K. performed FOXA1 siRNA experiments under supervision of H.B. J.H., T.A., R.A. and S.A. performed FOXA1 shRNA and CRISPR experiments under the supervision of W.C.H. and K. Kelly. E.O., C.B. and S.A.A. performed ChIP-seq experiments. J.-H.S. performed HiChIP experiments. C.G. and B.P analyzed HiChIP data. R.I.C. and M.A.S.F. performed core regulatory analysis under supervision of K.L. P.C. and K.L. performed ATAC-seq under supervision of H.W.L. and M.B. M.H. and A.N. assisted with procurement of clinical samples. J.E.B. and K. Korthauer assisted with analysis of WGBS methylation data. L.B. performed immunohistochemistry experiments. I.M.C. and A.K. performed RNA-seq under the supervision of P.S.N. H.M.N., C.M. and E.C. provided LuCaP PDXs. A.S. assisted with analysis of tumor RNA-seq data under the supervision of L.E. A.Z. provided cell lines. N.D.S performed gene sequencing of LuCaPs. E.C., M.M.P. and M.L.F. supervised the project.
Data availability
The ChIP-seq data generated in this study (sequencing reads in fastq format and normalized read counts in bigwig format) have been deposited in GEO under accession number GSE161948. The RNA-seq data from clinical NEPC are available in dbGaP under accession code phs000909.v.p1. scRNAseq data used in this study are available at in GEO under accession number GSE120716. Lung cancer H3K27ac ChIP-seq data are available in SRA under accession numbers SRR568435, SRR3098556, SRR4449027, SRR4449025, and SRR6124068. H3K27ac ChIP-seq peaks from primary peripheral blood monocytes and keratinocytes are available from ENCODE (https://www.encodeproject.org/) under accession numbers ENCFF540CVX and ENCFF943CBQ. The WGBS data used in this study were obtained from the authors and are available on request by contacting Dr. Jianhua Luo at the Department of Pathology of the University of Pittsburgh (luoj@msx.upmc.edu)68. LuCaP PDXs are available upon request from Dr. Eva Corey (ecorey@uw.edu). Source data are provided with this paper.
Code availability
Code for the Cobra pipeline45 used for the analyses in this paper is available at https://bitbucket.org/cfce/cobra/src/master/.
Competing interests
W.C.H. is a consultant for Thermo Fisher, Solasta Ventures, iTeos, Frontier Medicines, Tyra Biosciences, MPM Capital, KSQ Therapeutics, and Paraxel and is a founder of KSQ Therapeutics. All other authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-22139-7.
References
- 1.Scher H, et al. Increased survival with enzalutamide in prostate cancer after chemotherapy. N. Engl. J. Med. 2012;367:1187–1197. doi: 10.1056/NEJMoa1207506. [DOI] [PubMed] [Google Scholar]
- 2.de Bono JS, et al. Abiraterone and increased survival in metastatic prostate cancer. N. Engl. J. Med. 2011;364:1995–2005. doi: 10.1056/NEJMoa1014618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Aggarwal R, et al. Clinical and genomic characterization of treatment-emergent small-cell neuroendocrine prostate cancer: a multi-institutional prospective study. J. Clin. Oncol. 2018;36:22492–22503. doi: 10.1200/JCO.2017.77.6880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bluemn E, et al. Androgen Receptor pathway-independent prostate cancer is sustained through FGF signaling. Cancer Cell. 2017;32:474–489.e6. doi: 10.1016/j.ccell.2017.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Beltran H, et al. Divergent clonal evolution of castration resistant neuroendocrine prostate cancer. Nat. Med. 2016;22:298–305. doi: 10.1038/nm.4045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Davies AH, Beltran H, Zoubeidi A. Cellular plasticity and the neuroendocrine phenotype in prostate cancer. Nat. Rev. Urol. 2018;15:271–286. doi: 10.1038/nrurol.2018.22. [DOI] [PubMed] [Google Scholar]
- 7.Zaret KS, Carroll JS. Pioneer transcription factors: establishing competence for gene expression. Genes Dev. 2011;25:22227–22241. doi: 10.1101/gad.176826.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Friedman J, Kaestner K. The Foxa family of transcription factors in development and metabolism. Cell Mol. Life Sci. 2006;63:2317–2328. doi: 10.1007/s00018-006-6095-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Nguyen H, et al. LuCaP prostate cancer patient‐derived xenografts reflect the molecular heterogeneity of advanced disease and serve as models for evaluating cancer therapeutics. Prostate. 2017;77:654–671. doi: 10.1002/pros.23313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mu P, et al. SOX2 promotes lineage plasticity and antiandrogen resistance in TP53- and RB1-deficient prostate cancer. Science. 2017;355:84–88. doi: 10.1126/science.aah4307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Whyte WA, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153:307–319. doi: 10.1016/j.cell.2013.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lin C, et al. Active medulloblastoma enhancers reveal subgroup-specific cellular origins. Nature. 2016;530:57. doi: 10.1038/nature16546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ott CJ, et al. Enhancer architecture and essential core regulatory circuitry of chronic lymphocytic leukemia. Cancer Cell. 2018;34:982–995.e7. doi: 10.1016/j.ccell.2018.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Qi J, et al. Siah2-dependent concerted activity of HIF and FoxA2 regulates formation of neuroendocrine phenotype and neuroendocrine prostate tumors. Cancer Cell. 2010;18:23–38. doi: 10.1016/j.ccr.2010.05.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Parolia A, et al. Distinct structural classes of activating FOXA1 alterations in advanced prostate cancer. Nature. 2019;571:413–418. doi: 10.1038/s41586-019-1347-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jaenisch R, Young R. Stem cells, the molecular circuitry of pluripotency and nuclear reprogramming. Cell. 2008;132:567–582. doi: 10.1016/j.cell.2008.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bishop JL, et al. The master neural transcription factor BRN2 is an androgen receptor-suppressed driver of neuroendocrine differentiation in prostate cancer. Cancer Discov. 2017;7:54–71. doi: 10.1158/2159-8290.CD-15-1263. [DOI] [PubMed] [Google Scholar]
- 18.Puca L, et al. Patient derived organoids to model rare prostate cancer phenotypes. Nat. Commun. 2018;9:2404. doi: 10.1038/s41467-018-04495-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Adams EJ, et al. FOXA1 mutations alter pioneering activity, differentiation and prostate cancer phenotypes. Nature. 2019;571:408–412. doi: 10.1038/s41586-019-1318-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Donaghey J, et al. Genetic determinants and epigenetic effects of pioneer-factor occupancy. Nat. Genet. 2018;50:250–258. doi: 10.1038/s41588-017-0034-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ousset M, et al. Multipotent and unipotent progenitors contribute to prostate postnatal development. Nat. Cell Biol. 2012;14:1131. doi: 10.1038/ncb2600. [DOI] [PubMed] [Google Scholar]
- 22.Pignon J-C, et al. p63-expressing cells are the stem cells of developing prostate, bladder, and colorectal epithelia. Proc. Natl Acad. Sci. 2013;110:8105–8110. doi: 10.1073/pnas.1221216110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hon GC, et al. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nat. Genet. 2013;45:1198–1206. doi: 10.1038/ng.2746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Jadhav U, et al. Extensive recovery of embryonic enhancer and gene memory stored in hypomethylated enhancer DNA. Mol. Cell. 2019;74:542–554.e5. doi: 10.1016/j.molcel.2019.02.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bernstein BE, et al. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell. 2006;125:315–326. doi: 10.1016/j.cell.2006.02.041. [DOI] [PubMed] [Google Scholar]
- 26.Mohn F, et al. Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors. Mol. Cell. 2008;30:755–766. doi: 10.1016/j.molcel.2008.05.007. [DOI] [PubMed] [Google Scholar]
- 27.Schoenherr C, Anderson D. The neuron-restrictive silencer factor (NRSF): a coordinate repressor of multiple neuron-specific genes. Science. 1995;267:1360–1363. doi: 10.1126/science.7871435. [DOI] [PubMed] [Google Scholar]
- 28.Park J, et al. Reprogramming normal human epithelial tissues to a common, lethal neuroendocrine cancer lineage. Science. 2018;362:91–95. doi: 10.1126/science.aat5749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ku S, et al. Rb1 and Trp53 cooperate to suppress prostate cancer lineage plasticity, metastasis, and antiandrogen resistance. Science. 2017;355:78–83. doi: 10.1126/science.aah4199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kim J, et al. FOXA1 inhibits prostate cancer neuroendocrine differentiation. Oncogene. 2017;36:4072–4080. doi: 10.1038/onc.2017.50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Borromeo MD, et al. ASCL1 and NEUROD1 reveal heterogeneity in pulmonary neuroendocrine tumors and regulate distinct genetic programs. Cell Rep. 2016;16:1259–1272. doi: 10.1016/j.celrep.2016.06.081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Balanis NG, et al. Pan-cancer convergence to a small-cell neuroendocrine phenotype that shares susceptibilities with hematological malignancies. Cancer Cell. 2019;36:17–34.e7. doi: 10.1016/j.ccell.2019.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Gao S, et al. Chromatin binding of FOXA1 is promoted by LSD1-mediated demethylation in prostate cancer. Nat. Genet. 2020;52:1011–1017. doi: 10.1038/s41588-020-0681-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Beshiri ML, et al. A PDX/Organoid biobank of advanced prostate cancers captures genomic and phenotypic heterogeneity for disease modeling and therapeutic screening. Clin. Cancer Res. 2018;24:14332–14345. doi: 10.1158/1078-0432.CCR-18-0409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Labrecque MP, et al. Molecular profiling stratifies diverse phenotypes of treatment-refractory metastatic castration-resistant prostate cancer. J. Clin. Invest. 2019;129:4492–4505. doi: 10.1172/JCI128212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pomerantz MM, et al. The androgen receptor cistrome is extensively reprogrammed in human prostate tumorigenesis. Nat. Genet. 2015;47:1346–1351. doi: 10.1038/ng.3419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Johnson DS, Mortazavi A, Myers RM, Wold B. Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007;316:1497–1502. doi: 10.1126/science.1141319. [DOI] [PubMed] [Google Scholar]
- 38.Corces MR, et al. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods. 2017;14:959–962. doi: 10.1038/nmeth.4396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Buenrostro J, Wu B, Chang H, Greenleaf W. ATAC-seq: a method for assaying chromatin accessibility genome-wide. Curr. Protoc. Mol. Biol. 2015;109:21.29.1–21.29.9. doi: 10.1002/0471142727.mb2129s109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cavalcante RG, Sartor MA. annotatR: genomic regions in context. Bioinformatics. 2017;33:12381–12383. doi: 10.1093/bioinformatics/btx183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Robinson JT, et al. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ramírez F, Dündar F, Diehl S, Grüning BA, Manke T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 2014;42:WW187–W191. doi: 10.1093/nar/gku365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Qiu, X. et al. CoBRA: Containerized bioinformatics workflow for reproducible chip/atac-seq analysis-from differential peak calling to pathway analysis. Preprint at bioRxiv10.1101/2020.11.06.367409 (2020).
- 46.Köster J, Rahmann S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics. 2012;28:2520–2522. doi: 10.1093/bioinformatics/bts480. [DOI] [PubMed] [Google Scholar]
- 47.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McLean CY, et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 2010;28:495–501. doi: 10.1038/nbt.1630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Layer RM, et al. GIGGLE: a search engine for large-scale integrated genome analysis. Nat. Methods. 2018;15:123–126. doi: 10.1038/nmeth.4556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Huang Y-H, et al. POU2F3 is a master regulator of a tuft cell-like variant of small cell lung cancer. Genes Dev. 2018;32:915–928. doi: 10.1101/gad.314815.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Abraham BJ, et al. Small genomic insertions form enhancers that misregulate oncogenes. Nat. Commun. 2017;8:14385. doi: 10.1038/ncomms14385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Handoko L, et al. JQ1 affects BRD2-dependent and independent transcription regulation without disrupting H4-hyperacetylated chromatin states. Epigenetics. 2018;13:410–431. doi: 10.1080/15592294.2018.1469891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature489, 57–74 (2012). [DOI] [PMC free article] [PubMed]
- 54.Cornwell M, et al. VIPER: visualization pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis. BMC Bioinforma. 2018;19:135. doi: 10.1186/s12859-018-2139-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Trapnell C, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Mumbach MR, et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods. 2016;13:1919–1922. doi: 10.1038/nmeth.3999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Servant N, et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16:259. doi: 10.1186/s13059-015-0831-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Bhattacharyya, S., Chandra, V., Vijayanand, P., Ay, F. FitHiChIP: identification of significant chromatin contacts from HiChIP data. Nat. Commun.10, 4221 (2019). [DOI] [PMC free article] [PubMed]
- 60.D’Alessio AC, et al. A systematic approach to identify candidate transcription factors that control cell identity. Stem Cell Rep. 2015;5:763–775. doi: 10.1016/j.stemcr.2015.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Lambert SA, et al. The human transcription factors. Cell. 2018;172:650–665. doi: 10.1016/j.cell.2018.01.029. [DOI] [PubMed] [Google Scholar]
- 62.Federation, A. J. et al. Identification of candidate master transcription factors within enhancer-centric transcriptional regulatory networks. Preprint at bioRxiv10.1101/345413 (2018).
- 63.Van der Auwera GA, et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinforma. 2013;43:11.10.1–11.10.33. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:1e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Favero F, et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 2015;26:64–70. doi: 10.1093/annonc/mdu479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Newman AM, et al. FACTERA: a practical method for the discovery of genomic rearrangements at breakpoint resolution. Bioinformatics. 2014;30:23390–23393. doi: 10.1093/bioinformatics/btu549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Nguyen HM, et al. Cabozantinib inhibits growth of androgen-sensitive and castration-resistant prostate cancer and affects bone remodeling. PLoS ONE. 2013;8:e78881. doi: 10.1371/journal.pone.0078881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Yu YP, et al. Whole-genome methylation sequencing reveals distinct impact of differential methylations on gene transcription in prostate cancer. Am. J. Pathol. 2013;183:1960–1970. doi: 10.1016/j.ajpath.2013.08.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Takeda DY, et al. A somatically acquired enhancer of the Androgen Receptor is a noncoding driver in advanced prostate cancer. Cell. 2018;174:422–432.e13. doi: 10.1016/j.cell.2018.05.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics. 2011;27:11571–11572. doi: 10.1093/bioinformatics/btr167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Hansen KD, Langmead B, Irizarry RA. BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 2012;13:1R83. doi: 10.1186/gb-2012-13-10-r83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Tsherniak A, et al. Defining a cancer dependency map. Cell. 2017;170:564–576.e16. doi: 10.1016/j.cell.2017.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Henry GH, et al. A cellular anatomy of the normal adult human prostate and prostatic urethra. Cell Rep. 2018;25:3530–3542.e5. doi: 10.1016/j.celrep.2018.11.086. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The ChIP-seq data generated in this study (sequencing reads in fastq format and normalized read counts in bigwig format) have been deposited in GEO under accession number GSE161948. The RNA-seq data from clinical NEPC are available in dbGaP under accession code phs000909.v.p1. scRNAseq data used in this study are available at in GEO under accession number GSE120716. Lung cancer H3K27ac ChIP-seq data are available in SRA under accession numbers SRR568435, SRR3098556, SRR4449027, SRR4449025, and SRR6124068. H3K27ac ChIP-seq peaks from primary peripheral blood monocytes and keratinocytes are available from ENCODE (https://www.encodeproject.org/) under accession numbers ENCFF540CVX and ENCFF943CBQ. The WGBS data used in this study were obtained from the authors and are available on request by contacting Dr. Jianhua Luo at the Department of Pathology of the University of Pittsburgh (luoj@msx.upmc.edu)68. LuCaP PDXs are available upon request from Dr. Eva Corey (ecorey@uw.edu). Source data are provided with this paper.
Code for the Cobra pipeline45 used for the analyses in this paper is available at https://bitbucket.org/cfce/cobra/src/master/.