Skip to main content
eLife logoLink to eLife
. 2021 Jan 11;10:e64090. doi: 10.7554/eLife.64090

Single-cell chromatin accessibility profiling of glioblastoma identifies an invasive cancer stem cell population associated with lower survival

Paul Guilhamon 1,2, Charles Chesnelong 2, Michelle M Kushida 2, Ana Nikolic 3,4,5, Divya Singhal 3,4,5, Graham MacLeod 6, Seyed Ali Madani Tonekaboni 1,7, Florence MG Cavalli 2, Christopher Arlidge 1, Nishani Rajakulendran 6, Naghmeh Rastegar 2, Xiaoguang Hao 3,8,9, Rozina Hassam 3,8,9, Laura J Smith 10, Heather Whetstone 2, Fiona J Coutinho 2, Bettina Nadorp 1, Katrina I Ellestad 3,4, H Artee Luchman 3,8,9, Jennifer Ai-wen Chan 3,4,11, Molly S Shoichet 10, Michael D Taylor 2,12,13, Benjamin Haibe-Kains 1,7,14,15,16, Samuel Weiss 3,8,9,17, Stephane Angers 6,18, Marco Gallo 3,4,5,17, Peter B Dirks 2,12,15,, Mathieu Lupien 1,7,
Editors: Lynne-Marie Postovit19, Kevin Struhl20
PMCID: PMC7847307  PMID: 33427645

Abstract

Chromatin accessibility discriminates stem from mature cell populations, enabling the identification of primitive stem-like cells in primary tumors, such as glioblastoma (GBM) where self-renewing cells driving cancer progression and recurrence are prime targets for therapeutic intervention. We show, using single-cell chromatin accessibility, that primary human GBMs harbor a heterogeneous self-renewing population whose diversity is captured in patient-derived glioblastoma stem cells (GSCs). In-depth characterization of chromatin accessibility in GSCs identifies three GSC states: Reactive, Constructive, and Invasive, each governed by uniquely essential transcription factors and present within GBMs in varying proportions. Orthotopic xenografts reveal that GSC states associate with survival, and identify an invasive GSC signature predictive of low patient survival, in line with the higher invasive properties of Invasive state GSCs compared to Reactive and Constructive GSCs as shown by in vitro and in vivo assays. Our chromatin-driven characterization of GSC states improves prognostic precision and identifies dependencies to guide combination therapies.

Research organism: Human

Introduction

Glioblastoma (GBM) is a lethal form of brain cancer with standard surgery and radiation giving a median survival of only 12.6 months (Carlsson et al., 2014). The addition of temozolomide chemotherapy provides only an additional 2.5 months in the small subset of responsive patients (Stupp et al., 2005). Despite extensive characterization and stratification of the bulk primary tumors, no targeted therapies have been successfully developed (Carlsson et al., 2014; von Neubeck et al., 2015). GBM tumors are rooted in self-renewing tumor-initiating cells commonly referred to as glioblastoma stem cells (GSCs) (Venere et al., 2011) that drive disease progression in vivo (Chen et al., 2012; Gallo et al., 2015) and display resistance to chemo- and radiotherapy leading to disease recurrence (Bao et al., 2006). The promise of therapeutically targeting self-renewing tumor-initiating cancer cells depends on our capacity to capture the full range of heterogeneity within this population from individual tumors. Intratumoral heterogeneity within primary GBM has recently been documented through single-cell RNA-seq experiments and revealed a continuum between four cellular states (Neftel et al., 2019): neural-progenitor-like (NPC), oligodendrocyte-progenitor-like (OPC), astrocyte-like (AC), and mesenchymal-like (MES) (Neftel et al., 2019). A subsequent study (Wang et al., 2019) using single-cell gene-centric enrichment analysis placed GBM cells along a single axis of variation from proneural to mesenchymal transcriptional profiles, with cells expressing stem-associated genes lying at the extremes of this axis. Hence, primary GBM consists of distinct states, across which stem-like cells appear to be found. Whether these stem-like cells found across GBM states represent functionally distinct GSC populations with tumor-initiating properties and unique dependencies remains to be established to guide therapeutic progress. To address this issue, we combined single-cell technologies to define GSC composition in primary GBM with functional assays to reveal the unique dependencies across GSCs, reflective of invasive, constructive, and reactive states that relate to patient outcome.

Results

Chromatin accessibility readily discriminates stem from mature cell populations (Stergachis et al., 2013), which can be resolved at the single-cell level through single-cell ATAC-seq (Buenrostro et al., 2018; Corces et al., 2016), taking into account non-gene centric features, such as accessibility of noncoding elements and total amount of accessible DNA sequences. Applying single-cell chromatin accessibility profiling (scATAC-seq) across four primary adult GBM tumors (3797 cells), wild type for both IDH1 and IDH2, revealed seven to nine accessibility modules in each tumor based on unsupervised clustering (Figure 1A). We assigned cells to each of the four scRNA-seq-derived cellular states (Neftel et al., 2019) based on individual cells’ chromatin accessibility enrichment scores for the promoter regions of each state’s signature genes. Across the four tumors, 35–55.2% of the cells were significantly enriched for at least one state’s signature genes (Figure 1—figure supplement 1A). The MES state reported from scRNA-seq (Neftel et al., 2019) dominates the identity of two or more modules reported from chromatin accessibility in every tumor (Figure 1B,C, Figure 1—figure supplement 1B). In contrast, the NPC and OPC states are mixed within the same module defined based on chromatin accessibility, dominating over the other states typically in at least two modules. Cells assigned to the AC state did not preferentially cluster within a single module reported from chromatin accessibility (Figure 1B,C, Figure 1—figure supplement 1B). Collectively, our results suggest that chromatin accessibility reflects a greater stratification of the MES state and detects similarities between the OPC and NPC states and heterogeneity within the AC state.

Figure 1. The diverse glioblastoma (GBM) cancer stem cell pool.

(A) UMAP (Uniform Manifold Approximation and Projection) representation of chromatin accessibility across four primary GBM. (B) UMAPs with tumor cells assigned to cellular states. (C) UMAP modules are grouped by dominant cellular state. (D) UMAPs with cancer stem cells highlighted based on the enrichment of GBM cancer stem transcription factor promoters. (E) Distribution of cancer stem cells across the modules dominated by each cellular state.

Figure 1.

Figure 1—figure supplement 1. The distribution of cell states across tumor cells.

Figure 1—figure supplement 1.

(A) Percentage of each tumor’s cells significantly enriched with each state’s signature genes. (B) Number of tumors with at least one UMAP module dominated by a given cell state.

To identify putative cancer stem cells within each primary tumor, we next focused on the level of chromatin accessibility at promoters of 19 transcription factors previously associated with self-renewal and tumor-propagating capacity in GBM (Suvà et al., 2014; Figure 1D). Individual cells scoring as putative cancer stem cells were not restricted to a unique module defined by chromatin accessibility but were distributed across a subset of modules, suggesting heterogeneity across cancer stem cells in primary GBM, in agreement with reports relying on single GBM cell labeling (Corces et al., 2016; Gallo et al., 2015; Lan et al., 2017; Liau et al., 2017; Meyer et al., 2015; Miller et al., 2017; Orzan et al., 2017; Park et al., 2017; Rheinbay et al., 2013; Stergachis et al., 2013; Suvà et al., 2014), assessing the heterogeneity of self-renewing tumor-initiating cells. Putative cancer stem cells identified in primary GBM through scATAC-seq were found in modules ascribed to every one of the four cellular states defined by gene expression (Neftel et al., 2019), predominantly within NPC and OPC containing modules and a smaller fraction (<10%) in MES-specific modules across all four tumors (Figure 1E). This suggests that the core transcriptional unit of cancer stem cells in primary GBM (Suvà et al., 2014) is not restricted to a unique population defined by its global transcriptional or chromatin accessibility profile with the resolution achieved with current single-cell technologies.

To further probe the heterogeneity in chromatin accessibility within the GBM cancer stem cell pool, we derived GSC populations from 27 adult IDH wild-type GBM tumors (Pollard et al., 2009) and profiled their chromatin accessibility by bulk ATAC-seq (Figure 2A). Each patient-derived GSC showed a similar enrichment for accessible chromatin regions in promoters and 5′UTRs, and depletion in introns and distal intergenic regions (Figure 2B). Collectively, we uncovered 92% of the total predicted regions of accessible chromatin (255,891 regions) within GSCs based on a saturation analysis using a self-starting non-linear regression model across the 27 samples (Figure 2C). We next assessed the similarity between these GSCs and the putative cancer stem cells found by scATAC-seq in the four primary GBMs. GSCs were identified within each tumor by calculating the enrichment of accessible chromatin regions shared by a majority of GSCs (>14/27) in each tumor cell (Figure 2D). On average, 11.3% of cells in each primary GBM were labeled as GSCs. Comparing the distribution of GSCs across the seven to nine modules defined by scATAC-seq to that of the 19 transcription factor-derived cancer stem cell signature demonstrates concordance between the two signatures (Figure 2E). Moreover, the enrichment z-scores for both cancer stem signatures (i.e. stem transcription factors signature and GSC chromatin accessibility signature) are significantly correlated across cells in all four tumors (p≤1.6×10−5) (Figure 2F). The overlap in cells significantly enriched for either stem signatures across the four tumors is also significant (hypergeometric test p-value=8.8×10−8) Additionally, an average of 91.2% (85.1–100%) of the cells identified by either signature display the hallmark GBM copy number changes at chromosomes 7 and 10, confirming their neoplastic status (Figure 2—figure supplement 1). Collectively, these results demonstrate that the patient-derived GSC populations reflect the chromatin identity of putative cancer stem cells found in primary brain tumors, highlighting the value of these GSCs as models to deepen our understanding of individual cells within primary GBM with features found in self-renewing tumor-initiating cells. Accordingly, spectral clustering of the 27 patient-derived GSC ATAC profiles identifies three distinct states of self-renewing tumor-initiating cells (Figure 3A). Expression profiling of these 27 GSCs by RNA-seq reveals GSCs significantly enriched for the signatures of each of the three TCGA GBM subtypes (proneural, classical, and mesenchymal) (Wang et al., 2017; Figure 3B). However, the assignment of the proneural, classical, and mesenchymal subtypes across GSCs did not match the three clusters identified from ATAC-seq (Figure 3B). Conversely, clustering the GSCs by gene expression, independently of their chromatin accessibility, did largely recapitulate the GSC states defined by chromatin accessibility (Figure 3B). This suggests that the mismatch between the GSC cluster from chromatin accessibility profiles and TCGA expression subtypes is not mainly due to differences between chromatin accessibility and gene expression. Potential alternative causes for this observed mismatch include the absence of a tumor microenvironment in the GSC populations or that given the TCGA subtypes were determined from bulk GBM, they may not fully capture the nature of rarer populations found within a tumor, such as the cancer stem cell populations.

Figure 2. GSCs recapitulate the glioblastoma (GBM) cancer stem cell population.

(A) Schematic representation of the GSC derivation process, from patient tumor to GSC-enriched population. (B) Genomic feature enrichment of accessible chromatin peaks. (C) Saturation curve for the 27 GSCs. (D) UMAPs with GSCs highlighted based on the enrichment of shared accessible regions across GSCs. (E) Proportion of UMAP modules assigned to cancer stem cells and GSCs. (F) Correlation of z-scores for each signature for each cell in each primary GBM.

Figure 2.

Figure 2—figure supplement 1. Identification of tumor cells through characteristic copy number changes.

Figure 2—figure supplement 1.

(A–D) UMAPs showing cells confirmed to be GBM tumor cells, with gains of chr7 or losses of chr10. A: G4218, B: G4250, C: G4275; D: G4349.

Figure 3. Three glioblastoma stem cell (GSC) states driven by chromatin accessibility.

(A) Spectral clustering of ATAC-seq signal across peaks in 27 GSCs. (B) Spectral clustering of gene expression across GSCs and comparison to chromatin-derived GSC states. Enrichment of TCGA subtypes across GSCs and comparison to GSC states as determined by ATAC. The GSEA (Gene Set Enrichment Analysis) gene sets each contained 50 genes and the enrichment scores ranged from 0.16 to 0.73. (C) Gene set enrichment analysis in each GSC state. All displayed terms are significantly enriched (q-value < 0.05). (D) Copy number alterations (CNAs) across GSCs identified from DNA methylation array data cluster GSCs into five subgroups. (E) Number and percentage of peaks unique and shared in each GSC state. (F) Saturation analysis of each individual state.

Figure 3.

Figure 3—figure supplement 1. Distinct chromatin accessibility profiles across glioblastoma stem cell states.

Figure 3—figure supplement 1.

(A–C) Overlaid ATAC signal tracks for the 27 GSCs at the 92 most differentially accessible regions between GSC states as determined by pairwise Wilcoxon test and a median signal fold change equal or greater than 2. The region coordinates can be found in the Supplementary file 2. (A) Most accessible in Reactive state GSCs. (B) Most accessible in Constructive state GSCs. (C) Most accessible in Invasive state GSCs.

Gene set enrichment analysis with GSEA (Gene Set Enrichment Analysis) (Subramanian et al., 2005) and g:profiler (Reimand et al., 2016) using genes exclusively enriched for both expression and promoter chromatin accessibility in each subtype reported significantly enriched terms defining the largest GSC state as a Reactive state, with terms related to immune cells and response (Figure 3C, top panel). A second GSC state was enriched for Constructive gene sets involved in brain, neuron, and glial cell development (Figure 3C, middle panel). The third and smallest GSC state presented an Invasive state characterized by terms relating to the extracellular matrix and angiogenesis (Figure 3C, bottom panel). We next mapped copy number alterations (CNAs) across the 27 GSCs by applying the molecular neuropathology classifier tool (Capper et al., 2018) to DNA methylation data from the GSCs (Figure 3D). While common GBM CNAs, including EGFR gains and CDKN2A/B loss, were observed, the CNA-based classification of GSCs failed to match the three chromatin accessibility-derived states, suggesting the three GSC states are not defined by somatic copy number events (Figure 3D). Further comparison of the accessible chromatin in each GSC state reveals that only a small subset of accessible chromatin regions drives the three GSC states (Figure 3E, Figure 3—figure supplement 1). Our ability to discriminate GSC state-specific regions of accessible chromatin is reflective of the comprehensiveness of our cohort to saturate the detection of accessible regions to 93%, 88%, and 71% across the Reactive (n = 13), Constructive (n = 9), and Invasive (n = 5) state GSCs, respectively (Figure 3F).

Considering that regions of accessible chromatin serve as binding sites for transcription factors engaging in gene expression regulation, we next tested for DNA recognition motif family enrichment across regions exclusively accessible in Reactive, Constructive, or Invasive GSC states (Figure 4A, Figure 4—figure supplement 1A). The most enriched DNA recognition motif families in each state were either depleted or showed low-level enrichment in the other states. Specifically, the DNA recognition motifs for the interferon-regulatory factor (IRF) and Cys2-His2 zinc finger (C2H2 ZF) transcription factor families were enriched in the Reactive state (Figure 4A, top panel). Regulatory factor X (RFX) and basic helix-loop-helix (bHLH) DNA recognition motifs were enriched in the Constructive state (Figure 4A, middle panel), while the Forkhead motif family was enriched in the Invasive state (Figure 4A, bottom panel). Genome-wide CRISPR/Cas9 essentiality screens (Figure 4—figure supplement 1B) in three Reactive, two Constructive and one Invasive GSC (MacLeod et al., 2019) revealed the preferential requirement for expressed transcription factors (Figure 4—figure supplement 1C–E) recognizing the enriched DNA recognition motif in a state-specific manner (Figure 4B). Specifically, the SP1 regulatory network is preferentially essential in the Reactive state GSCs (Figure 4B, top panel), ASCL1, OLIG2, AHR, and NPAS3 are uniquely essential in the Constructive state GSCs (Figure 4B, middle panel) and FOXD1 is essential only in the Invasive state GSC (Figure 4B, bottom panel). Notably, SP1 itself is exclusively essential in only one Reactive GSC (G564). However, of the 36 transcription factors from the Reactive-enriched families (IRF and C2H2 ZF) that were essential in at least one Reactive GSC and not in any of the Constructive or Invasive GSCs, 13 are directly regulated by SP1 (Figure 4—figure supplement 1C), thus suggesting that the SP1 regulatory network as a whole, rather than SP1 on its own, is key in the Reactive GSC state. Notably, all six transcription factors display significantly higher expression in GBM compared to normal brain (Tang et al., 2017; Figure 4—figure supplement 1F), further supporting their function as key regulators of tumor initiation and development. An additional gene set enrichment analysis combining genes exclusively essential to each state with the putative targets of the key transcription factors outlined above identifies additional enriched terms supporting the identities of the three GSC states as Reactive, Constructive, and Invasive (Figure 4—figure supplement 1G).

Figure 4. Functional diversity between glioblastoma stem cell (GSC) states and intra-tumor heterogeneity.

(A) Motif family enrichment in each cluster; log2(fold enrichment) > 0.5 threshold selected based on the distribution of values in each cluster (Figure 2—figure supplement 1). (B) Z-score distribution of key essential genes in each cluster. Red line corresponds to the empirically determined threshold for essentiality in each tested line, scaled, and adjusted to zero. Boxplot whiskers in this case extend to data extremes. Side barplots show the total count of the key cluster-specific regulators found essential in each subtype. (C) UMAPs with GSCs from each state highlighted based on the enrichment of the top differentially accessible regions in each GSC state.

Figure 4.

Figure 4—figure supplement 1. dentification and characterization of essential transcription factors across GSC states.

Figure 4—figure supplement 1.

(A) The log2(fold enrichment) for all motif families were ordered and plotted. A cut-off threshold to select families for follow-up was selected based on the steep inflection of the curve above 0.5. (B) Schematic of drop-out essentiality screen using GSCs stably expressing Cas9 and gRNA libraries. (C–E) The expression values determined by RNA-seq for all transcription factors whose motif is enriched in each GSC state and exclusively essential in at least one GSC of each state is plotted. Bold colors highlight factors essential in all tested GSCs in that state. In the case of the Reactive state, blue boxplots correspond to SP1 and members of its regulatory network. (F) Expression levels of key transcription factors in tumor and normal samples analyzed and displayed using GEPIA (Tang et al., 2017). (G) Gene enrichment analysis of subtype-specific essential genes (Essentiality), upregulated and differentially accessible genes (ATAC and RNA combined), and putative targets of key transcription factors.

Previous work suggests that GBM tumors harbor a heterogeneous population of GSCs (Lan et al., 2017; Meyer et al., 2015; Patel et al., 2014). We therefore quantified the presence of Reactive, Constructive, and Invasive cancer stem cells in our four primary GBM based on their scATAC-seq profiles. The Constructive state was dominant in every primary tumor, ranging from 9 to 21% of all cells captured by scATAC-seq (Figure 4C). The Reactive and Invasive states accounted for only 0–9% of all cells, in varying proportions from one tumor to another (Figure 4C). Collectively, our results further support the heterogeneous nature of cancer stem cells that populate primary tumors.

While various classifications of GBMs and/or their constitutive bulk and stem tumor cells have been reported, with some associating with patient survival (Neftel et al., 2019; Patel et al., 2014; Cancer Genome Atlas Research Network et al., 2010; Wang et al., 2017; Yin et al., 2019; Zuo et al., 2019), molecular signatures in adult GBM stratifying the poorer prognosis IDH wild-type patients by survival are lacking. We performed intracranial xenografts of 37 IDH wild-type GSC populations and classified the transplanted cells by their GSC state to perform a differential survival analysis (Figure 5A). The overall survival times of the transplanted mice grouped by GSC state were significantly different (LogRank test p=0.041), with the Invasive state GSCs leading to the worst prognosis. Relative to the mice injected with Reactive state GSCs, mice with Constructive state GSCs and Invasive state GSCs had hazard ratios of 1.3 (95% confidence interval [CI]: 0.57–2.97) and 3.5 (95% CI: 1.2–10.49), respectively.

Figure 5. Glioblastoma stem cell (GSC) invasiveness is associated with survival in glioblastoma (GBM).

(A) Kaplan–Meier plot for orthotopic xenografts grouped by GSC state. For each of the 37 GSCs used in the xenograft survival analysis, the median survival value was used from multiple mice injected with cells from each GSC (average number of mice injected/GSC = 5). The dotted lines indicate median survival. The pairwise p-values are also significant for Invasive vs Reactive (p=0.02) and Invasive vs Constructive (p=0.045) but not for Reactive vs Constructive (p=0.45). (B) Invaded area over time normalized to t0 by three to four representative GSCs of each GSC state and human fetal neural stem cells (hfNSCs) used as controls. See Figure 5—figure supplement 1A for individual GSC invasion. (C) Invasion images of representative hfNSCs and GSCs at t0, 6 hr, and 12 hr. Scale bar is 150 µm. (D) TCGA samples ordered by increasing concordance with Invasive GSCs and grouped into three subgroups: <1, 1–1.65, >1.65. (E) Kaplan–Meier plot for TCGA samples grouped by concordance with Invasive GSCs. The dotted line indicates median survival. When considering pairwise comparisons, only the Invasive-high and Invasive-low subgroups were significantly different (p=0.0043). Further subgrouping of the TCGA samples into smaller intervals of concordance z-score yielded no benefit, preserving the Invasive-high subgroup as the only one with significantly poorer prognosis (Figure 5—figure supplement 2B,C).

Figure 5.

Figure 5—figure supplement 1. Invasive properties across glioblastome stem cell states.

Figure 5—figure supplement 1.

(A) Invaded area over time normalized to t0 by three to four representative GSCs of each GSC state and hfNSCs used as controls. For the most invasive GSCs, data collection was stopped when invasion from neighboring spheres converged. (B–C) Human-specific staining of mouse brains injected with Invasive GSCs G411 (B) and G837 (C) with higher magnification (black inset) at the mass boundaries.

Figure 5—figure supplement 2. Assessment of Invasive GSC score classification.

Figure 5—figure supplement 2.

(A) Kaplan–Meier plot for TCGA samples grouped by GSC state. The dotted lines indicate median survival. (B) TCGA samples ordered by increasing concordance with Invasive GSCs and grouped into eight subgroups. (C) Kaplan–Meier plot for TCGA samples grouped by concordance with Invasive GSCs. The dotted line indicates median survival. (D) Kaplan–Meier plot for TCGA samples from both RNA-seq and microarray-based expression grouped by GSC state. The dotted lines indicate median survival. (E) Kaplan–Meier plot for TCGA samples from both RNA-seq and microarray-based expression grouped by concordance with Invasive GSCs. The dotted line indicates median survival.

In light of this finding, we sought to evaluate the invasive properties of the Invasive GSC state population. To that end, we conducted an invasion assay (Restall et al., 2018) wherein neurospheres from representative GSCs of each state (four Reactive, three Constructive, and three Invasive; three independent biological replicates for each GSC) are embedded in a collagen matrix and invasion is monitored in real time (Figure 5B,CFigure 5—figure supplement 1A). GSCs of the Invasive state invaded strikingly faster and further into the collagen matrix and overall displayed a three times greater invasive ability than that of the Reactive and Constructive states. This invasiveness was further confirmed in vivo through human-specific staining of mouse brains injected with Invasive-state GSCs (Figure 5—figure supplement 1B–C).

Next, we investigated the prognostic value of the GSC states using the TCGA GBM cohort (IDH1 and IDH2 wild type, n = 144). When classified by dominant GSC state, TCGA tumors display the same trend as the xenografts with Invasive state-dominated tumors showing the lowest survival (Figure 5—figure supplement 2A). However, with only two tumors classified as Invasive-dominant, the difference in survival between the three patient groups was not statistically significant (p=0.3) (Figure 5—figure supplement 2A). We proceeded to rank the TCGA tumors solely by their concordance to Invasive GSCs and classified the patient tumors into Invasive-low (z-score < 1), Invasive-mid (z-score = 1–1.65), and Invasive-high (z-score ≥ 1.65) groups (Figure 5D). With this stratification method, median patient survival per group not only decreased with increasing Invasive GSC score, but we also identified an Invasive-high subset of tumors with significantly lower survival (p=0.019, HR (Hazard Ratio) = 2.8, 95% CI: [1.3–5.81]) (Figure 5E). This was further validated using an additional cohort of TCGA samples with microarray gene expression (Figure 5—figure supplement 2D–E). These results show that cancer stem cell states defined based on the chromatin accessibility in GSCs can identify transcriptional programs associated with poor prognosis and can serve as a signature to identify high-risk patients in IDH wild-type GBM.

Discussion

Defining the nature of self-renewing tumor-initiating cells in primary GBM is required to identify vulnerabilities for therapeutic intervention. Quantifying their heterogeneity within tumors can guide treatment strategies and assist in predicting the course of disease progression. Here we show that chromatin accessibility assays capture a heterogeneity across self-renewing tumor-initiating cells in primary GBM that extends beyond their genetic diversity and underlies the heterogeneity in bulk progeny (Meyer et al., 2015). This heterogeneity aligns with diversity in the three-dimensional genome organization of GSCs (Johnston et al., 2019) and agrees with how the three-dimensional genome organization instructs cis-regulatory plexuses underlying gene regulation (Bailey et al., 2016; Kim et al., 2016; Sallari et al., 2017; Schmitt et al., 2016; Zheng and Xie, 2019). While the chromatin accessibility signature derived from GSCs robustly identified cancer stem cells across the four primary GBM tumors used in this study, a larger cohort will be required to establish whether this signature is always sufficient to capture all cancer stem cells in primary GBMs.

We further reveal a specific cancer stem state that is significantly predictive of patient survival and can be used as a signature to identify high-risk patients. Specifically, the Invasive GSC signature draws from a population with enhanced ability to invade and spread compared to other GSC states. Collectively, our results argue for distinct GSC populations whose composition in tumors impacts survival.

Our results also highlight dependencies unique to each cancer stem state. Specifically, the Reactive GSC state-associated transcription factor SP1 and its regulatory partners are involved in cellular differentiation and growth, apoptosis, response to DNA damage, chromatin remodeling (O'Connor et al., 2016), stimulation of TERT expression in cancer stem cells (Liu et al., 2016), and increased stemness and invasion in GBM (Lee et al., 2014). In contrast, the Constructive GSC state relies on transcription factors including OLIG2, a known GSC marker (Trépant et al., 2015); AHR involved in tumor microenvironment responses and metabolic adaptation (Gabriely et al., 2017); NPAS3, a regulator of Notch signaling and neurogenesis (Michaelson et al., 2017); and ASCL1, a critical regulator of GSC differentiation and marker of sensitivity to Notch inhibition in GSCs (Park et al., 2017; Rajakulendran et al., 2019). Finally, the Invasive GSC state relies on FOXD1, a pluripotency regulator and determinant of tumorigenicity in GSCs regulating expression of the aldehyde dehydrogenase ALDH1A3, a functional marker for invasive GSCs (Cheng et al., 2016; Koga et al., 2014). These results support developing combination therapy using targeting agents against each GSC state, such as Notch inhibitors (Park et al., 2017) and small molecule inhibitors of ALDH (Cheng et al., 2016), to eradicate self-renewing tumor-initiating cells with the hope to cure GBM patients.

Materials and methods

Patient samples and cell culture

All tissue samples were obtained following informed consent from patients, and all experimental procedures were performed in accordance with the Research Ethics Board at The Hospital for Sick Children (Toronto, Canada), the University of Calgary Ethics Review Board, and the Health Research Ethics Board of Alberta – Cancer Committee (HREBA). Approval to pathological data was obtained from the respective institutional review boards. Patient tumor tissue samples were dissociated in artificial cerebrospinal fluid followed by treatment with enzyme cocktail at 37°C. Patient tumor-derived GSCs were grown as adherent monolayer cultures in serum-free medium (SFM) as previously described (Pollard et al., 2009). Briefly, cells were grown adherently on culture plates coated with poly-l-ornithine and laminin. Serum-free NS cell self-renewal media (NS media) consisted of Neurocult NS-A Basal media, supplemented with 2 mmol/L l-glutamine, N2 and B27 supplements, 75 μg/mL bovine serum albumin, 10 ng/mL recombinant human EGF (rhEGF), 10 ng/mL basic fibroblast growth factor (bFGF), and 2 μg/mL heparin. A subset (22/37) of the GSCs used for orthotopic xenografts were grown as non-adherent spheres prior to single-cell dissociation and injection into the mice. Briefly, SFM was used to initiate GSC cultures. Non-adherent spheres formed after 7–21 days in culture and were expanded, then cryopreserved in 10% dimethyl sulfoxide (DMSO; Sigma-Aldrich) in SFM until used in experiments.

ATAC-seq

ATAC-seq was used to profile the accessible chromatin landscape of 27 patient tumor-derived GSCs. Fifty thousand cells were processed from each sample as previously described (Buenrostro et al., 2013; Corces et al., 2017). The resulting libraries were sequenced with 50 bp single-end reads, which were mapped to hg19. Reads were filtered to remove duplicates, unmapped or poor-quality (Q < 30) reads, mitochondrial reads, chrY reads, and those overlapping the ENCODE blacklist. Following alignment, accessible chromatin regions/peaks were called using MACS2. Default parameters were used except for the following: --keep-dup all -B --nomodel --SPMR -q 0.05 --slocal 6250 --llocal 6250. The signal intensity was calculated as the fold enrichment of the signal per million reads in a sample over a modeled local background using the bdgcmp function in MACS2. Spectral clustering implemented in the SNFtool package (Wang et al., 2014) was run on the SNF-fused similarity matrix to obtain the groups corresponding to k = 2–12. Enrichment for genomic features was calculated using CEAS (Shin et al., 2009). The GSC state labels for each sample can be found in Supplementary file 1, and the coordinates of discriminating ATAC regions for each GSC state are in Supplementary file 2.

A hypergeometric test was used to determine whether there was any sex bias within the three GSC states. All tests resulted in p-value>0.05 except in the Invasive cluster where three out of five patients are female, giving a p-value=0.047. Given the small size of the Invasive group and the impossibility of obtaining an even number of males and females in a group of 5, this result was not considered to indicate sex bias within the cohort.

A given chromatin region was considered exclusive to one of the clusters if it was called as a peak in any of the cluster’s samples using a q-value filter of 0.05 and was not called as a peak in any of the other samples using a q-value filter of 0.2, in order to ensure stringency of exclusivity.

The ATAC-seq saturation analysis was performed by randomizing the order of samples and successively calculating the number of additional peaks discovered with the addition of each new sample. This process was repeated 10,000 times and averaged. A self-starting non-linear regression model was then fitted to the data to estimate the level of saturation reached.

For the xenograft survival analysis, 11/37 GSCs used overlap with the cohort of 27 described above. The other 26/37 GSCs were profiled by ATAC-seq independently following the same protocol described above and assigned to a GSC state as described below. Similarly, samples G432, G440, and G472 used in the essentiality gene analysis were profiled by ATAC-seq independently following the same protocol described above and assigned to a GSC state as follows: the signal obtained from MACS2 for each sample was mapped to the peak catalog of the original cohort of 27 GSCs. Each sample was then allocated to a GSC state through unsupervised hierarchical clustering with the original cohort of 27 GSCs.

Single-cell ATAC-seq

The four tumors used were G4218 (primary GBM, IDH wt, male, 64 years), G4250 (primary GBM, IDH wt, male, 73 years), G4275 (primary GBM, IDH wt, female, 52 years), and G4349 (primary GBM, IDH wt, male, 62 years). Fragments of tumor were received fresh from the operating room, and blunt dissected into individual fragments of approximately 0.3–0.7 cm3. Each fragment was placed in 1 mL of freezing media (400 μL of NeuroCult NS-A Basal medium with proliferation supplement (StemCell Technologies; #05751) containing 20 μg/mL rhEGF (Peprotech, AF-100–15), 10 μg/mL bFGF (StemCell Technologies, #78003), and 2 μg/mL heparin (StemCell Technologies, #07980); 500 μL of 25% bovine serum albumin (BSA) (Millipore-Sigma; A9647) in Dulbecco's modified Eagle's medium, and 100 μL DMSO (Millipore-Sigma; D2650) in a 2 mL cryotube, and placed at −80 C in a CoolCell for at least 24 hr. Samples were then stored at −80C until use. Cryopreserved primary GBM samples were washed at 1000 RPM for 5 min in phosphate-buffered saline (PBS) to remove DMSO, and then transferred to 1.5 mL tubes. Samples were resuspended in cold ATAC resuspension buffer (10 mM Tris–HCl pH 7.4, 10 mM NaCl, 3 mM MgCl2, 0.1% NP-40, 0.1% Tween-20, 0.01% Digitonin, 1% BSA in PBS) on ice and dissociated using a wide-bore P1000 pipette tip and vortexing, followed by 10 min of incubation on ice. Cells were spun down at 500 x g for 5 min at 4°C, washed in the ATAC resuspension buffer, spun down again, and resuspended in ATAC-Tween wash buffer (10 mM Tris–HCl pH 7.4, 10 mM NaCl, 3 mM MgCl2, 0.1% Tween-20, 1% BSA in PBS), then passed through a cell strainer top FACS tube (Falcon; #38030) to remove debris. Nuclei quality and quantity was evaluated using trypan blue on an Invitrogen Countess II device in duplicate, and a subset of nuclei was spun down in a fresh tube and resuspended in 10× sample dilution buffer. Nuclei were then used for single-cell ATAC-seq library construction using the Chromium Single Cell ATAC Solution v1.0 kit (10× Genomics) on a Chromium controller. Completed libraries were further quality checked for fragment size and distribution using an Agilent TapeStation prior to sequencing. Single-cell ATAC-seq samples were sequenced on a NextSeq 500 (Illumina) instrument with 50 bp paired-end reads at the Centre for Health Genomics and Informatics (CHGI) at the University of Calgary.

The raw sequencing data was demultiplexed using cellranger-atac mkfastq (Cell Ranger ATAC, version 1.0.0, 10× Genomics). Single-cell ATAC-seq reads were aligned to the hg19 reference genome (hg19, version 1.1.0, 10× Genomics) and quantified using cellranger-atac count function with default parameters (Cell Ranger ATAC, version 1.1.0, 10× Genomics). The resulting data were analyzed using the chromVAR (Schep et al., 2017) and Signac (Stuart et al., 2019) R packages (v1.4.1). The number of accessibility modules in each sample was determined using the ElbowPlot method implemented in Signac. Similarity between individual cells and GSC states was assessed using the deviation scores calculated by chromVAR within the single-cell data for significantly differentially accessible sets of peaks (fold change signal difference > 2 and Wilcoxon test q-value ≤ 0.05) between the states as determined by bulk ATAC-seq. Similarity between individual cells and the expression-derived cellular states was assessed using the deviation scores calculated by chromVAR within the single-cell data for promoter regions of the signature genes of each of the cellular states (Neftel et al., 2019). A twofold cut-off was used to determine the dominance of a UMAP module by an individual or group of cellular states. Similarity between individual cells and the GBM cancer stem cell signatures was assessed using the deviation scores calculated by chromVAR within the single-cell data for either promoter regions of the 19 transcription factors identified as markers of cancer stem cells in GBM (Suvà et al., 2014) or accessible chromatin regions shared by a majority of GSCs (>14/27). An average of 11.3% of cells in each tumor was identified as a GSC: G4218: 11.3%, G4250: 8.6%, G4275: 8.1%, G4349: 17.4%.

Copy number variants in single cells were determined using CONICSmat (Müller et al., 2018) with default parameters using the gene activity matrix generated by Signac as input. We focused on chr7 gains and chr10 losses as they are hallmark chromosomal changes in GBM and found the following fractions of cells carrying these CNVs, on average across the four tumors: 76% of all cells, 88% of cells allocated to scRNA-seq cellular states (Neftel et al., 2019), 95% of cancer stem cells based on the 19 gene signature, 91% of GSCs based on shared accessible regions between 14/27 GSC populations, and 94% of GSCs identified based on the state-specific signatures.

DNA methylation arrays

Bisulfite conversion of DNA for methylation profiling was performed using the EZ DNA Methylation kit (Zymo Research) on 500 ng genomic DNA from all 27 samples. Conversion efficiency was quantitatively assessed by quantitative PCR (qPCR). The Illumina Infinium MethylationEPIC BeadChips were processed as per manufacturer’s recommendations. The R package ChAMP v2.6.4 (Morris et al., 2014) was used to process and analyze the data. For the copy number analysis, the raw IDAT files were uploaded to the MNP tool (Capper et al., 2018), which directly compares the copy number profile estimated from the probe intensities on the methylation array to the distribution observed across thousands of brain tumors in its database.

RNA-seq

RNA was extracted from GSCs using the Qiagen RNeasy Plus kit. RNA sample quality was measured by Qubit (Life Technologies) for concentration and by Agilent Bioanalyzer for RNA integrity. All samples had RIN above 9. Libraries were prepared using the TruSeq Stranded mRNA kit (Illumina). Two hundred nanograms from each sample were purified for polyA tail containing mRNA molecules using poly-T oligo attached magnetic beads, then fragmented post-purification. The cleaved RNA fragments were copied into first strand cDNA using reverse transcriptase and random primers. This is followed by second strand cDNA synthesis using RNase H and DNA Polymerase I. A single ‘A’ base was added and adapter ligated followed by purification and enrichment with PCR to create cDNA libraries. Final cDNA libraries were verified by the Agilent Bioanalyzer for size and concentration quantified by qPCR. All libraries were pooled to a final concentration of 1.8 nM, clustered, and sequenced on the Illumina NextSeq500 as a pair-end 75 cycle sequencing run using v2 reagents to achieve a minimum of ~40 million reads per sample. Reads were aligned to hg19 using the STAR aligner v2.4.2a (Dobin et al., 2013), and transcripts were quantified using RSEM v1.2.21 (Li and Dewey, 2011) or vst transformed using DESeq2 (Anders and Huber, 2010).

Motif enrichment

Regions exclusively accessible in one of the GSC states and not the others were used as input sequences for the motif enrichment, while the full ATAC-seq catalog served as the background set when running HOMER v4.7 to detect enrichments of transcription factor binding motifs. Enriched motifs were then grouped into families based on similarities in DNA-binding domains using the CIS-BP database (Weirauch et al., 2014). Each family was assigned the fold-enrichment value of the most enriched motif within the family.

The transcription factors whose motifs were found enriched in Reactive-exclusive accessible regions were run together through GSEA (Subramanian et al., 2005), and the gene set corresponding to genes potentially regulated by SP1 was identified as significantly enriched (GSEA gene set GGGCGGR_SP1_Q6). The expression levels of key transcription factors in tumor and normal samples were analyzed and displayed using GEPIA (Tang et al., 2017).

Gene essentiality screen

Illumina sequencing reads from genome-wide TKOv1 CRISPR screens in patient-derived GSCs (MacLeod et al., 2019) were mapped using MAGECK (Li et al., 2014) and analyzed using the BAGEL algorithm with version two reference core essential genes/non-essential genes (Hart et al., 2017; Hart et al., 2015). Resultant raw Bayes factor (BF) statistics were used to determine essentiality of transcription factor genes using a minimum BF of 3 and a 5% false discovery rate cut-off. For visualization purposes only, the essentiality scores were scaled and the individual GSC essentiality thresholds subtracted from each score to obtain a common threshold at 0 across GSCs. In the GSEA analysis of essentiality genes, only those genes found essential only in a given GSC state were used, giving sets of 12, 29, and 235 genes for the Reactive, Constructive, and Invasive GSC states, respectively.

Orthotopic xenografts

All animal procedures were performed according to and approved by the Animal Care Committee of the Hospital for Sick Children or the University of Calgary. All attempts are made to minimize the handling time during surgery and treatment so as not to unduly stress the animals. Animals are observed daily after surgery to ensure there are no unexpected complications. For intracranial xenografts, 100,000 GSC cells were stereotactically injected into the frontal cortex of 6–8 weeks old female NOD/SCID or C17/SCID mice. Mice were monitored and euthanized once neurological symptoms were observed or at the experimental end point of 12 months.

Invasion assay

GSCs and human fetal neural stem cells (hfNSCs; used as controls) were seeded in flasks maintained vertically to limit adherence and incubated at 37°C, 5% CO2 until the average neurosphere size reached approximately 150 μm. Neurospheres were then collected and allowed to settle by gravity to the bottom of a prechilled 1.5 mL conical tubes for 5 min on ice. Following aspiration of the supernatant, spheres were re-suspended in 0.4 mg/mL type I collagen (Cultrex Rat Collagen I, Trevigen, 3440–005) on ice. The suspended spheres were dispensed in prechilled 96-well plates (100 μL/well). The plates were maintained on ice for 5 min to allow spheres to settle at the same level at the bottom of the well and then transferred to the incubator at 37°C for 15 min to allow polymerization of the collagen. Plates were then placed in an IncuCyte (Essen Bioscience, Ann Arbor, MI), and invasion was monitored every hour for 12 hr. To quantify invasion of the cells from the embedded spheres into the collagen matrix, the area of the spheres at each time point was normalized to the area of the spheres at T0. Invasion experiments were performed at least in triplicates for each GSC and hfNSC lines. For each replicate, invasion was measured based on a minimum of three spheres.

Immunohistochemistry

Tissue samples were formalin fixed and paraffin embedded. Serial sections deparaffinized and rehydrated through an alcohol gradient to water, and antigen retrieval in citrate buffer pH 6.0 was used for the human nucleolin antibody at 5.0 g/mL (ab13541) (Abcam, Cambridge, MA). Endogenous peroxide activity and nonspecific binding were blocked with 3%(vol/vol) peroxide and 2% (vol/vol) normal horse serum. Primary antibody and anti-mouse ImmPRESS-HRP secondary antibody were incubated for 1 hr and visualized using 3,3′-diaminobenzidine (Vectorlabs, Burlingame, CA). Normal horse serum or monoclonal IgM was used in control sections.

Survival analysis

Survival analysis on xenografts and TCGA data was performed using R packages survival (Therneau, 2015) and survminer (‘Drawing Survival Curves using ‘ggplot2’ [R package survminer, 2020]’, n.d.). The LogRank test was used in every analysis. See ATAC-seq section for details on how each GSC used in the orthotopic xenografts was assigned to a GSC state. For each of the 37 GSCs used in the xenograft survival analysis, the median survival value was used from multiple mice injected with each GSC. Each GSC was injected into one to six mice (mean number of mice used = 5; only one GSC has a single replicate: G489, a Constructive state GSC).

TCGA samples for this analysis were selected as follows: 144 IDH-wt adult GBM samples for which RNA-seq, IDH mutation status, and survival information were available. TCGA samples were assigned to individual GSC states in the following ways. (1) Using the unsupervised clustering of RNA-seq data presented in Figure 3B, the 23/27 GSCs that displayed matched GSC state assignments by RNA-seq and ATAC-seq were used in this analysis. (2) Genes preferentially enriched in each GSC state were determined using DEseq2 (Anders and Huber, 2010) (q ≤ 0.05 and fold change ≥ 2). (3) The mean log2(FPKM+1) value for each of these genes over all GSCs in each state was calculated to obtain a single representative value for each gene in each of the three GSC states. (4) The concordance index was then calculated between each TCGA sample and each GSC state, and individual TCGA samples were assigned to the GSC state with the highest score. Similarly, to assign TCGA samples to the three Invasive groups (Invasive-low, -mid, and -high), the concordance to Invasive GSCs as calculated above was used. The z-score for each sample was then used to classify each TCGA sample into the three subgroups of Invasive-low (Invasive z-score < 1), Invasive-mid (Invasive z-score = 1–1.65), and Invasive-high (Invasive z-score ≥ 1.65). When changing the Invasive z-score thresholds for grouping the TCGA samples, the most Invasive-high subgroup remains associated with the lowest survival (Figure 5—figure supplement 2B,C).

A further 158 samples that did not overlap with the RNA-seq set for which microarray expression data, survival information, and IDH status were available were used in a validation set. Individual samples were assigned to GSC clusters or C3 score classes as described above. The scores were then combined with those of the RNA-seq set for the survival analysis shown in Figure 5—figure supplement 2D–E.

Acknowledgements

We would like to thank the Princess Margaret Genomics Centre for their assistance in this study and 10× Genomics, especially Adam Jerauld, Ariel Royall, and John Chevillet for training and support. We also thank Aude Gerbaud and Stacey Krunholtz for their help with figure formatting and David Vetrie for his thoughtful input on the manuscript.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Peter B Dirks, Email: peter.dirks@sickkids.ca.

Mathieu Lupien, Email: mlupien@uhnres.utoronto.ca.

Lynne-Marie Postovit, University of Alberta, Canada.

Kevin Struhl, Harvard Medical School, United States.

Funding Information

This paper was supported by the following grants:

  • Stand Up To Cancer Canada, Genome Canada, CIHR, OICR, AACR SU2C-AACR-DT-19-15 to Michael D Taylor, Samuel Weiss, Peter B Dirks, Mathieu Lupien.

  • CIHR TGH-158221 to Stephane Angers, Peter B Dirks, Mathieu Lupien.

  • CIHR MFE 338954 to Paul Guilhamon.

  • Princess Margaret Cancer Foundation to Mathieu Lupien.

  • CIHR to H Artee Luchman, Samuel Weiss.

  • Terry Fox Research Institute to Stephane Angers, Peter B Dirks.

  • Hospital for Sick Children to Peter B Dirks.

  • Canadian Cancer Society to Stephane Angers.

  • NSERC to Marco Gallo.

  • Alliance for Cancer Gene Therapy to Marco Gallo.

  • Jessica's Footprint to Peter B Dirks.

  • Hopeful Minds to Peter B Dirks.

  • The Bresler Family to Peter B Dirks.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Writing - original draft, Writing - review and editing.

Investigation.

Investigation.

Investigation.

Investigation.

Resources.

Formal analysis.

Data curation.

Investigation.

Resources.

Investigation.

Investigation.

Investigation.

Investigation, Visualization.

Investigation.

Project administration.

Formal analysis.

Investigation.

Supervision.

Resources.

Supervision.

Supervision.

Supervision.

Supervision.

Supervision.

Supervision, Writing - review and editing.

Conceptualization, Supervision, Writing - original draft, Writing - review and editing.

Conceptualization, Supervision, Funding acquisition, Writing - original draft, Project administration, Writing - review and editing.

Ethics

Human subjects: All tissue samples were obtained following informed consent from patients, and all experimental procedures were performed in accordance with the Research Ethics Board at The Hospital for Sick Children (Toronto, Canada), the University of Calgary Ethics Review Board, and the Health Research Ethics Board of Alberta - Cancer Committee (HREBA). Approval for access to pathological data was obtained from the respective institutional review boards.

Animal experimentation: All animal procedures were performed according to and approved by the Animal Care Committee of the Hospital for Sick Children or the University of Calgary. All attempts are made to minimize the handling time during surgery and treatment so as not to unduly stress the animals. Animals are observed daily after surgery to ensure there are no unexpected complications.

Additional files

Supplementary file 1. List of GSC state labels, patient age, and sex.
elife-64090-supp1.zip (447B, zip)
Supplementary file 2. Coordinates of discriminating ATAC regions for each GSC state.
elife-64090-supp2.zip (3.1KB, zip)
Transparent reporting form

Data availability

The GSCs are available upon reasonable request from PBD and SW. The GSC ATAC-seq and DNA methylation data have been deposited at GEO (GSE109399). The scATAC-seq data has been deposited at GEO (GSE139136). RNA-seq data are available at EGA (EGAS00001003070).

The following datasets were generated:

Guilhamon P, Kushida MM, Dirks PB, Lupien M. 2019. Epigenetic characterization of glioblastoma stem cells. NCBI Gene Expression Omnibus. GSE109399

Nikolic A, Ellestad K, Singhal D, Gallo M. 2021. Single-cell ATAC-Seq of Adult GBM. NCBI Gene Expression Omnibus. GSE139136

Guilhamon P, Kushida MM, Cavalli FMG, Dirks PB, Lupien M. 2018. RNA-seq of Glioblastoma stem cells. EGA. EGAS00001003070

References

  1. Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biology. 2010;11:R106. doi: 10.1186/gb-2010-11-10-r106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bailey SD, Desai K, Kron KJ, Mazrooei P, Sinnott-Armstrong NA, Treloar AE, Dowar M, Thu KL, Cescon DW, Silvester J, Yang SY, Wu X, Pezo RC, Haibe-Kains B, Mak TW, Bedard PL, Pugh TJ, Sallari RC, Lupien M. Noncoding somatic and inherited single-nucleotide variants converge to promote ESR1 expression in breast Cancer. Nature Genetics. 2016;48:1260–1266. doi: 10.1038/ng.3650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bao S, Wu Q, McLendon RE, Hao Y, Shi Q, Hjelmeland AB, Dewhirst MW, Bigner DD, Rich JN. Glioma stem cells promote radioresistance by preferential activation of the DNA damage response. Nature. 2006;444:756–760. doi: 10.1038/nature05236. [DOI] [PubMed] [Google Scholar]
  4. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nature Methods. 2013;10:1213–1218. doi: 10.1038/nmeth.2688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Buenrostro JD, Corces MR, Lareau CA, Wu B, Schep AN, Aryee MJ, Majeti R, Chang HY, Greenleaf WJ. Integrated Single-Cell analysis maps the continuous regulatory landscape of human hematopoietic differentiation. Cell. 2018;173:1535–1548. doi: 10.1016/j.cell.2018.03.074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cancer Genome Atlas Research Network. Verhaak RG, Hoadley KA, Purdom E, Wang V, Qi Y, Wilkerson MD, Miller CR, Ding L, Golub T, Mesirov JP, Alexe G, Lawrence M, O'Kelly M, Tamayo P, Weir BA, Gabriel S, Winckler W, Gupta S, Jakkula L, Feiler HS, Hodgson JG, James CD, Sarkaria JN, Brennan C, Kahn A, Spellman PT, Wilson RK, Speed TP, Gray JW, Meyerson M, Getz G, Perou CM, Hayes DN. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell. 2010;17:98–110. doi: 10.1016/j.ccr.2009.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Capper D, Jones DTW, Sill M, Hovestadt V, Schrimpf D, Sturm D, Koelsche C, Sahm F, Chavez L, Reuss DE, Kratz A, Wefers AK, Huang K, Pajtler KW, Schweizer L, Stichel D, Olar A, Engel NW, Lindenberg K, Harter PN, Braczynski AK, Plate KH, Dohmen H, Garvalov BK, Coras R, Hölsken A, Hewer E, Bewerunge-Hudler M, Schick M, Fischer R, Beschorner R, Schittenhelm J, Staszewski O, Wani K, Varlet P, Pages M, Temming P, Lohmann D, Selt F, Witt H, Milde T, Witt O, Aronica E, Giangaspero F, Rushing E, Scheurlen W, Geisenberger C, Rodriguez FJ, Becker A, Preusser M, Haberler C, Bjerkvig R, Cryan J, Farrell M, Deckert M, Hench J, Frank S, Serrano J, Kannan K, Tsirigos A, Brück W, Hofer S, Brehmer S, Seiz-Rosenhagen M, Hänggi D, Hans V, Rozsnoki S, Hansford JR, Kohlhof P, Kristensen BW, Lechner M, Lopes B, Mawrin C, Ketter R, Kulozik A, Khatib Z, Heppner F, Koch A, Jouvet A, Keohane C, Mühleisen H, Mueller W, Pohl U, Prinz M, Benner A, Zapatka M, Gottardo NG, Driever PH, Kramm CM, Müller HL, Rutkowski S, von Hoff K, Frühwald MC, Gnekow A, Fleischhack G, Tippelt S, Calaminus G, Monoranu CM, Perry A, Jones C, Jacques TS, Radlwimmer B, Gessi M, Pietsch T, Schramm J, Schackert G, Westphal M, Reifenberger G, Wesseling P, Weller M, Collins VP, Blümcke I, Bendszus M, Debus J, Huang A, Jabado N, Northcott PA, Paulus W, Gajjar A, Robinson GW, Taylor MD, Jaunmuktane Z, Ryzhova M, Platten M, Unterberg A, Wick W, Karajannis MA, Mittelbronn M, Acker T, Hartmann C, Aldape K, Schüller U, Buslei R, Lichter P, Kool M, Herold-Mende C, Ellison DW, Hasselblatt M, Snuderl M, Brandner S, Korshunov A, von Deimling A, Pfister SM. DNA methylation-based classification of central nervous system tumours. Nature. 2018;555:469–474. doi: 10.1038/nature26000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Carlsson SK, Brothers SP, Wahlestedt C. Emerging treatment strategies for glioblastoma multiforme. EMBO Molecular Medicine. 2014;6:1359–1370. doi: 10.15252/emmm.201302627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen J, Li Y, Yu TS, McKay RM, Burns DK, Kernie SG, Parada LF. A restricted cell population propagates glioblastoma growth after chemotherapy. Nature. 2012;488:522–526. doi: 10.1038/nature11287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cheng P, Wang J, Waghmare I, Sartini S, Coviello V, Zhang Z, Kim SH, Mohyeldin A, Pavlyukov MS, Minata M, Valentim CL, Chhipa RR, Bhat KP, Dasgupta B, La Motta C, Kango-Singh M, Nakano I. FOXD1-ALDH1A3 signaling is a determinant for the Self-Renewal and tumorigenicity of mesenchymal glioma stem cells. Cancer Research. 2016;76:7219–7230. doi: 10.1158/0008-5472.CAN-15-2860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Corces MR, Buenrostro JD, Wu B, Greenside PG, Chan SM, Koenig JL, Snyder MP, Pritchard JK, Kundaje A, Greenleaf WJ, Majeti R, Chang HY. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nature Genetics. 2016;48:1193–1203. doi: 10.1038/ng.3646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Corces MR, Trevino AE, Hamilton EG, Greenside PG, Sinnott-Armstrong NA, Vesuna S, Satpathy AT, Rubin AJ, Montine KS, Wu B, Kathiria A, Cho SW, Mumbach MR, Carter AC, Kasowski M, Orloff LA, Risca VI, Kundaje A, Khavari PA, Montine TJ, Greenleaf WJ, Chang HY. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nature Methods. 2017;14:959–962. doi: 10.1038/nmeth.4396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gabriely G, Wheeler MA, Takenaka MC, Quintana FJ. Role of AHR and HIF-1α in glioblastoma metabolism. Trends in Endocrinology & Metabolism. 2017;28:428–436. doi: 10.1016/j.tem.2017.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gallo M, Coutinho FJ, Vanner RJ, Gayden T, Mack SC, Murison A, Remke M, Li R, Takayama N, Desai K, Lee L, Lan X, Park NI, Barsyte-Lovejoy D, Smil D, Sturm D, Kushida MM, Head R, Cusimano MD, Bernstein M, Clarke ID, Dick JE, Pfister SM, Rich JN, Arrowsmith CH, Taylor MD, Jabado N, Bazett-Jones DP, Lupien M, Dirks PB. MLL5 orchestrates a Cancer Self-Renewal state by repressing the histone variant H3.3 and Globally Reorganizing Chromatin. Cancer Cell. 2015;28:715–729. doi: 10.1016/j.ccell.2015.10.005. [DOI] [PubMed] [Google Scholar]
  16. Hart T, Chandrashekhar M, Aregger M, Steinhart Z, Brown KR, MacLeod G, Mis M, Zimmermann M, Fradet-Turcotte A, Sun S, Mero P, Dirks P, Sidhu S, Roth FP, Rissland OS, Durocher D, Angers S, Moffat J. High-Resolution CRISPR screens reveal fitness genes and Genotype-Specific Cancer liabilities. Cell. 2015;163:1515–1526. doi: 10.1016/j.cell.2015.11.015. [DOI] [PubMed] [Google Scholar]
  17. Hart T, Tong AHY, Chan K, Van Leeuwen J, Seetharaman A, Aregger M, Chandrashekhar M, Hustedt N, Seth S, Noonan A, Habsid A, Sizova O, Nedyalkova L, Climie R, Tworzyanski L, Lawson K, Sartori MA, Alibeh S, Tieu D, Masud S, Mero P, Weiss A, Brown KR, Usaj M, Billmann M, Rahman M, Costanzo M, Myers CL, Andrews BJ, Boone C, Durocher D, Moffat J. Evaluation and design of Genome-Wide CRISPR/SpCas9 knockout screens. G3: Genes, Genomes, Genetics. 2017;7:2719–2727. doi: 10.1534/g3.117.041277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Johnston MJ, Nikolic A, Ninkovic N, Guilhamon P, Cavalli FMG, Seaman S, Zemp FJ, Lee J, Abdelkareem A, Ellestad K, Murison A, Kushida MM, Coutinho FJ, Ma Y, Mungall AJ, Moore R, Marra MA, Taylor MD, Dirks PB, Pugh TJ, Morrissy S, St Croix B, Mahoney DJ, Lupien M, Gallo M. High-resolution structural genomics reveals new therapeutic vulnerabilities in glioblastoma. Genome Research. 2019;29:1211–1222. doi: 10.1101/gr.246520.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Kim K, Jang K, Yang W, Choi EY, Park SM, Bae M, Kim YJ, Choi JK. Chromatin structure-based prediction of recurrent noncoding mutations in Cancer. Nature Genetics. 2016;48:1321–1326. doi: 10.1038/ng.3682. [DOI] [PubMed] [Google Scholar]
  20. Koga M, Matsuda M, Kawamura T, Sogo T, Shigeno A, Nishida E, Ebisuya M. Foxd1 is a mediator and Indicator of the cell reprogramming process. Nature Communications. 2014;5:3197. doi: 10.1038/ncomms4197. [DOI] [PubMed] [Google Scholar]
  21. Lan X, Jörg DJ, Cavalli FMG, Richards LM, Nguyen LV, Vanner RJ, Guilhamon P, Lee L, Kushida MM, Pellacani D, Park NI, Coutinho FJ, Whetstone H, Selvadurai HJ, Che C, Luu B, Carles A, Moksa M, Rastegar N, Head R, Dolma S, Prinos P, Cusimano MD, Das S, Bernstein M, Arrowsmith CH, Mungall AJ, Moore RA, Ma Y, Gallo M, Lupien M, Pugh TJ, Taylor MD, Hirst M, Eaves CJ, Simons BD, Dirks PB. Fate mapping of human glioblastoma reveals an invariant stem cell hierarchy. Nature. 2017;549:227–232. doi: 10.1038/nature23666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lee WS, Kwon J, Yun DH, Lee YN, Woo EY, Park M-J, Lee J-S, Han Y-H, Bae IH. Specificity protein 1 expression contributes to Bcl-w-Induced aggressiveness in glioblastoma multiforme. Molecules and Cells. 2014;37:17–23. doi: 10.14348/molcells.2014.2161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Li W, Xu H, Xiao T, Cong L, Love MI, Zhang F, Irizarry RA, Liu JS, Brown M, Liu XS. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biology. 2014;15:554. doi: 10.1186/s13059-014-0554-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Liau BB, Sievers C, Donohue LK, Gillespie SM, Flavahan WA, Miller TE, Venteicher AS, Hebert CH, Carey CD, Rodig SJ, Shareef SJ, Najm FJ, van Galen P, Wakimoto H, Cahill DP, Rich JN, Aster JC, Suvà ML, Patel AP, Bernstein BE. Adaptive chromatin remodeling drives glioblastoma stem cell plasticity and drug tolerance. Cell Stem Cell. 2017;20:233–246. doi: 10.1016/j.stem.2016.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Liu T, Yuan X, Xu D. Cancer-Specific telomerase reverse transcriptase (TERT) Promoter mutations: biological and clinical implications. Genes. 2016;7:38. doi: 10.3390/genes7070038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. MacLeod G, Bozek DA, Rajakulendran N, Monteiro V, Ahmadi M, Steinhart Z, Kushida MM, Yu H, Coutinho FJ, Cavalli FMG, Restall I, Hao X, Hart T, Luchman HA, Weiss S, Dirks PB, Angers S. Genome-Wide CRISPR-Cas9 screens expose genetic vulnerabilities and mechanisms of temozolomide sensitivity in glioblastoma stem cells. Cell Reports. 2019;27:971–986. doi: 10.1016/j.celrep.2019.03.047. [DOI] [PubMed] [Google Scholar]
  28. Meyer M, Reimand J, Lan X, Head R, Zhu X, Kushida M, Bayani J, Pressey JC, Lionel AC, Clarke ID, Cusimano M, Squire JA, Scherer SW, Bernstein M, Woodin MA, Bader GD, Dirks PB. Single cell-derived clonal analysis of human glioblastoma links functional and genomic heterogeneity. PNAS. 2015;112:851–856. doi: 10.1073/pnas.1320611111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Michaelson JJ, Shin MK, Koh JY, Brueggeman L, Zhang A, Katzman A, McDaniel L, Fang M, Pufall M, Pieper AA. Neuronal PAS domain proteins 1 and 3 are master regulators of neuropsychiatric risk genes. Biological Psychiatry. 2017;82:213–223. doi: 10.1016/j.biopsych.2017.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Miller TE, Liau BB, Wallace LC, Morton AR, Xie Q, Dixit D, Factor DC, Kim LJY, Morrow JJ, Wu Q, Mack SC, Hubert CG, Gillespie SM, Flavahan WA, Hoffmann T, Thummalapalli R, Hemann MT, Paddison PJ, Horbinski CM, Zuber J, Scacheri PC, Bernstein BE, Tesar PJ, Rich JN. Transcription elongation factors represent in vivo Cancer dependencies in glioblastoma. Nature. 2017;547:355–359. doi: 10.1038/nature23000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Morris TJ, Butcher LM, Feber A, Teschendorff AE, Chakravarthy AR, Wojdacz TK, Beck S. ChAMP: 450k chip analysis methylation pipeline. Bioinformatics. 2014;30:428–430. doi: 10.1093/bioinformatics/btt684. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Müller S, Cho A, Liu SJ, Lim DA, Diaz A. CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones. Bioinformatics. 2018;34:3217–3219. doi: 10.1093/bioinformatics/bty316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Neftel C, Laffy J, Filbin MG, Hara T, Shore ME, Rahme GJ, Richman AR, Silverbush D, Shaw ML, Hebert CM, Dewitt J, Gritsch S, Perez EM, Gonzalez Castro LN, Lan X, Druck N, Rodman C, Dionne D, Kaplan A, Bertalan MS, Small J, Pelton K, Becker S, Bonal D, Nguyen QD, Servis RL, Fung JM, Mylvaganam R, Mayr L, Gojo J, Haberler C, Geyeregger R, Czech T, Slavc I, Nahed BV, Curry WT, Carter BS, Wakimoto H, Brastianos PK, Batchelor TT, Stemmer-Rachamimov A, Martinez-Lage M, Frosch MP, Stamenkovic I, Riggi N, Rheinbay E, Monje M, Rozenblatt-Rosen O, Cahill DP, Patel AP, Hunter T, Verma IM, Ligon KL, Louis DN, Regev A, Bernstein BE, Tirosh I, Suvà ML. An integrative model of cellular states, plasticity, and genetics for glioblastoma. Cell. 2019;178:835–849. doi: 10.1016/j.cell.2019.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. O'Connor L, Gilmour J, Bonifer C. The role of the ubiquitously expressed transcription factor Sp1 in Tissue-specific transcriptional regulation and in disease. The Yale Journal of Biology and Medicine. 2016;89:513–525. [PMC free article] [PubMed] [Google Scholar]
  35. Orzan F, De Bacco F, Crisafulli G, Pellegatta S, Mussolin B, Siravegna G, D'Ambrosio A, Comoglio PM, Finocchiaro G, Boccaccio C. Genetic evolution of glioblastoma Stem-Like cells from primary to recurrent tumor. Stem Cells. 2017;35:2218–2228. doi: 10.1002/stem.2703. [DOI] [PubMed] [Google Scholar]
  36. Park NI, Guilhamon P, Desai K, McAdam RF, Langille E, O'Connor M, Lan X, Whetstone H, Coutinho FJ, Vanner RJ, Ling E, Prinos P, Lee L, Selvadurai H, Atwal G, Kushida M, Clarke ID, Voisin V, Cusimano MD, Bernstein M, Das S, Bader G, Arrowsmith CH, Angers S, Huang X, Lupien M, Dirks PB. ASCL1 reorganizes chromatin to direct neuronal fate and suppress tumorigenicity of glioblastoma stem cells. Cell Stem Cell. 2017;21:209–224. doi: 10.1016/j.stem.2017.06.004. [DOI] [PubMed] [Google Scholar]
  37. Patel AP, Tirosh I, Trombetta JJ, Shalek AK, Gillespie SM, Wakimoto H, Cahill DP, Nahed BV, Curry WT, Martuza RL, Louis DN, Rozenblatt-Rosen O, Suvà ML, Regev A, Bernstein BE. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science. 2014;344:1396–1401. doi: 10.1126/science.1254257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Pollard SM, Yoshikawa K, Clarke ID, Danovi D, Stricker S, Russell R, Bayani J, Head R, Lee M, Bernstein M, Squire JA, Smith A, Dirks P. Glioma stem cell lines expanded in adherent culture have tumor-specific phenotypes and are suitable for chemical and genetic screens. Cell Stem Cell. 2009;4:568–580. doi: 10.1016/j.stem.2009.03.014. [DOI] [PubMed] [Google Scholar]
  39. R package survminer 0.4.6Drawing Survival Curves Using “Ggplot2”. 2020 http://finzi.psych.upenn.edu/library/survminer/html/ggsurvplot.html
  40. Rajakulendran N, Rowland KJ, Selvadurai HJ, Ahmadi M, Park NI, Naumenko S, Dolma S, Ward RJ, So M, Lee L, MacLeod G, Pasiliao C, Brandon C, Clarke ID, Cusimano MD, Bernstein M, Batada N, Angers S, Dirks PB. Wnt and notch signaling govern self-renewal and differentiation in a subset of human glioblastoma stem cells. Genes & Development. 2019;33:498–510. doi: 10.1101/gad.321968.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Reimand J, Arak T, Adler P, Kolberg L, Reisberg S, Peterson H, Vilo J. G:profiler-a web server for functional interpretation of gene lists (2016 update) Nucleic Acids Research. 2016;44:W83–W89. doi: 10.1093/nar/gkw199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Restall I, Bozek DA, Chesnelong C, Weiss S, Luchman HA. Live-Cell imaging assays to study glioblastoma brain tumor stem cell migration and invasion. Journal of Visualized Experiments. 2018;1:e58152. doi: 10.3791/58152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Rheinbay E, Suvà ML, Gillespie SM, Wakimoto H, Patel AP, Shahid M, Oksuz O, Rabkin SD, Martuza RL, Rivera MN, Louis DN, Kasif S, Chi AS, Bernstein BE. An aberrant transcription factor network essential for wnt signaling and stem cell maintenance in glioblastoma. Cell Reports. 2013;3:1567–1579. doi: 10.1016/j.celrep.2013.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sallari RC, Sinnott-Armstrong NA, French JD, Kron KJ, Ho J, Moore JH, Stambolic V, Edwards SL, Lupien M, Kellis M. Convergence of dispersed regulatory mutations predicts driver genes in prostate Cancer. bioRxiv. 2017 doi: 10.1101/097451. [DOI]
  45. Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nature Methods. 2017;14:975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Schmitt AD, Hu M, Ren B. Genome-wide mapping and analysis of chromosome architecture. Nature Reviews Molecular Cell Biology. 2016;17:743–755. doi: 10.1038/nrm.2016.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Shin H, Liu T, Manrai AK, Liu XS. CEAS: cis-regulatory element annotation system. Bioinformatics. 2009;25:2605–2606. doi: 10.1093/bioinformatics/btp479. [DOI] [PubMed] [Google Scholar]
  48. Stergachis AB, Neph S, Reynolds A, Humbert R, Miller B, Paige SL, Vernot B, Cheng JB, Thurman RE, Sandstrom R, Haugen E, Heimfeld S, Murry CE, Akey JM, Stamatoyannopoulos JA. Developmental fate and cellular maturity encoded in human regulatory DNA landscapes. Cell. 2013;154:888–903. doi: 10.1016/j.cell.2013.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, Hao Y, Stoeckius M, Smibert P, Satija R. Comprehensive Integration of Single-Cell Data. Cell. 2019;177:1888–1902. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Stupp R, Mason WP, van den Bent MJ, Weller M, Fisher B, Taphoorn MJB, Belanger K, Brandes AA, Marosi C, Bogdahn U, Curschmann J, Janzer RC, Ludwin SK, Gorlia T, Allgeier A, Lacombe D, Cairncross JG, Eisenhauer E, Mirimanoff RO. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. New England Journal of Medicine. 2005;352:987–996. doi: 10.1056/NEJMoa043330. [DOI] [PubMed] [Google Scholar]
  51. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. PNAS. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Suvà ML, Rheinbay E, Gillespie SM, Patel AP, Wakimoto H, Rabkin SD, Riggi N, Chi AS, Cahill DP, Nahed BV, Curry WT, Martuza RL, Rivera MN, Rossetti N, Kasif S, Beik S, Kadri S, Tirosh I, Wortman I, Shalek AK, Rozenblatt-Rosen O, Regev A, Louis DN, Bernstein BE. Reconstructing and Reprogramming the Tumor-Propagating Potential of Glioblastoma Stem-like Cells. Cell. 2014;157:580–594. doi: 10.1016/j.cell.2014.02.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Research. 2017;45:W98–W102. doi: 10.1093/nar/gkx247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Therneau T. 2.38A Package for Survival Analysis in S. 2015
  55. Trépant A-L, Bouchart C, Rorive S, Sauvage S, Decaestecker C, Demetter P, Salmon I. Identification of OLIG2 as the most specific glioblastoma stem cell marker starting from comparative analysis of data from similar DNA chip microarray platforms. Tumor Biology. 2015;36:1943–1953. doi: 10.1007/s13277-014-2800-5. [DOI] [PubMed] [Google Scholar]
  56. Venere M, Fine HA, Dirks PB, Rich JN. Cancer stem cells in gliomas: identifying and understanding the apex cell in Cancer's hierarchy. Glia. 2011;59:1148–1154. doi: 10.1002/glia.21185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. von Neubeck C, Seidlitz A, Kitzler HH, Beuthien-Baumann B, Krause M. Glioblastoma multiforme: emerging treatments and stratification markers beyond new drugs. The British Journal of Radiology. 2015;88:20150354. doi: 10.1259/bjr.20150354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wang B, Mezlini AM, Demir F, Fiume M, Tu Z, Brudno M, Haibe-Kains B, Goldenberg A. Similarity network fusion for aggregating data types on a genomic scale. Nature Methods. 2014;11:333–337. doi: 10.1038/nmeth.2810. [DOI] [PubMed] [Google Scholar]
  59. Wang Q, Hu B, Hu X, Kim H, Squatrito M, Scarpace L, deCarvalho AC, Lyu S, Li P, Li Y, Barthel F, Cho HJ, Lin YH, Satani N, Martinez-Ledesma E, Zheng S, Chang E, Sauvé CG, Olar A, Lan ZD, Finocchiaro G, Phillips JJ, Berger MS, Gabrusiewicz KR, Wang G, Eskilsson E, Hu J, Mikkelsen T, DePinho RA, Muller F, Heimberger AB, Sulman EP, Nam DH, Verhaak RGW. Tumor evolution of Glioma-Intrinsic gene expression subtypes associates with immunological changes in the microenvironment. Cancer Cell. 2017;32:42–56. doi: 10.1016/j.ccell.2017.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wang L, Babikir H, Müller S, Yagnik G, Shamardani K, Catalan F, Kohanbash G, Alvarado B, Di Lullo E, Kriegstein A, Shah S, Wadhwa H, Chang SM, Phillips JJ, Aghi MK, Diaz AA. The phenotypes of proliferating glioblastoma cells reside on a single Axis of variation. Cancer Discovery. 2019;9:1708–1719. doi: 10.1158/2159-8290.CD-19-0329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Weirauch MT, Yang A, Albu M, Cote AG, Montenegro-Montero A, Drewe P, Najafabadi HS, Lambert SA, Mann I, Cook K, Zheng H, Goity A, van Bakel H, Lozano JC, Galli M, Lewsey MG, Huang E, Mukherjee T, Chen X, Reece-Hoyes JS, Govindarajan S, Shaulsky G, Walhout AJM, Bouget FY, Ratsch G, Larrondo LF, Ecker JR, Hughes TR. Determination and inference of eukaryotic transcription factor sequence specificity. Cell. 2014;158:1431–1443. doi: 10.1016/j.cell.2014.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Yin W, Tang G, Zhou Q, Cao Y, Li H, Fu X, Wu Z, Jiang X. Expression profile analysis identifies a novel Five-Gene signature to improve prognosis prediction of glioblastoma. Frontiers in Genetics. 2019;10:419. doi: 10.3389/fgene.2019.00419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Zheng H, Xie W. The role of 3D genome organization in development and cell differentiation. Nature Reviews Molecular Cell Biology. 2019;20:535–550. doi: 10.1038/s41580-019-0132-4. [DOI] [PubMed] [Google Scholar]
  64. Zuo S, Zhang X, Wang L. A RNA sequencing-based six-gene signature for survival prediction in patients with glioblastoma. Scientific Reports. 2019;9:2615. doi: 10.1038/s41598-019-39273-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Lynne-Marie Postovit1
Reviewed by: Roel Verhaak2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The current manuscript by Guilhamon et al. uses single cell sequencing technologies to identify and then characterize heterogeneities among glioblastoma (GBM) stem cells (GSC), which extend beyond current disease classifications. This is the first study to apply single cell sequencing specifically to GSCs with the goal of identifying GSC subtypes using primarily chromatin accessibility signatures. Moreover, the use of chromatin accessibility signatures in combination with TF motif prediction to guide identification of transcriptional dependencies is novel in the context of GSC studies.

Decision letter after peer review:

[Editors’ note: the authors submitted for reconsideration following the decision after peer review. What follows is the decision letter after the first round of review.]

Thank you for submitting your work entitled "Single-cell chromatin accessibility in glioblastoma delineates cancer stem cell heterogeneity predictive of survival" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by a Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Roel Verhaak (Reviewer #3).

Our decision has been reached after consultation between the reviewers. Based on these discussions and the individual reviews below, we regret to inform you that your work will not be considered further for publication in eLife, due to the need for revisions that would likely take longer than 2-3 months. We would however encourage resubmission once the revisions are completed.

Summary

The current manuscript by Guilhamon et al. uses single cell sequencing technologies to identify and then characterize heterogeneities among glioblastoma (GBM) stem cells (GSC), which extend beyond current disease classifications. The paper was reviewed by three experts, who all agreed that the work will add meaningful knowledge to the field; but that more experimentation would be needed to better support the claims as made. The reviewers agreed that this is the first study to apply single cell sequencing specifically to GSCs with the goal of identifying GSC subtypes using primarily chromatin accessibility signatures. In particular, the use of chromatin accessibility signatures in combination with TF motif prediction to guide identification of transcriptional dependencies is novel in the context of GSC studies. However, it was felt that a number of enhancements would be needed, including greater validation of the cell samples, the inclusion of additional samples, and clarification of some of the methodologies employed. We encourage resubmission once these issues have been addressed.

Major concerns:

1) The tumor cell and GSC population enrichment should be validated using functional and genomic methods (including evidence for either Chr7 amplification or Chr10 deletion), and additional patient samples should be analyzed, preferably without extended culture times. Indeed, there appeared to be a low correlation between GBM stem TF enrichment and GSC signature enrichment and the degree of concordance between samples was not fully developed. It would be important to have more samples for many reasons, including the idea presented by the authors of differences between tumors in the dominant cell type. GBM has significant variation spatially, as well, so having only 4 tumors seems underpowered.

2) Details about the tumor collection should be included as should full information about tumor genetics, patient information, and treatment for each sample.

3) Figure 1 should be revised to clarify cells which fail to classify and to signify whether shared chromatin states exist across patients.

4) Figure 3 should be revised. Data pertaining to GSCs should be compared to the published GBM scRNAseq data to check the subtypes (Neftel et al., 2019, etc.). In addition, the top and bottom panels could be combined, and the sample assignment of TCGA subtypes, chromatin clusters and expression clusters should be better presented in the sankey diagram. The GSC samples and color bars showing chromatin clusters should be kept consistent between the top and the bottom panels. Figure 3A is missing a legend as there is no indication of what the intensity values represent. For Figure 3B, please clearly indicate an associated P-value testing for the enrichment of TCGA subtypes among the GSC ATAC states. Figure 3C should present cut-offs for what was considered statistically significant (i.e., a dotted line). Importantly, "Figure 3—figure supplement 1" presents the most differentially accessible regions between GSC states. Notably, ChrX is present in all. Chromatin accessibility read-outs might be significantly impacted by the presence of an inactive X chromosome. Hence an imbalance in the sex distribution in the GSC states might pick up sex-specific and not tumor-specific differences. Thus, the authors should show that these regions do not vary with patient sex. Finally, the evidence that "GSC states are not genetically defined (Figure 3D)" is derived from DNA methylation copy number, and a signature of selected genes. Bona fide GBM genes such as PDGFRA, NF1, AKT3, QKI should also be included in this analysis and the possibility that the GSC states cluster by frequent chromosome arm-level events (chr7, chr19, chr20 gains and chr10 loss) should be entertained. If they still do not cluster by GSC state, then it is perhaps more accurate to say that "GSC states are not defined by somatic copy number events" since there is no gene mutation data presented for these samples.

5) The validation of the states in the TCGA data are not strong. The cut-offs seem arbitrary and most of the samples seem to fall into a single group. Is this due to sampling bias? There is no connection to tumor genetics. Also, the bulk GBM ATACseq tumor profiles were generated in the TCGA datasets that presumably have accompanying RNA, DNA, and methylation data (Corces et al., Science 2018). While this represents a small dataset, it would be advantageous to incorporate analysis of these more complex tissue samples alongside the already presented data to provide confidence in generalizability. The concordance between RNA-seq and ATAC-seq data for the paired datasets should also be presented.

6) Invasion assays should include all subtypes in order to highlight properties of the invasive subtype.

7) In order to support the claim that the "Invasive-high" group confers a worse prognosis, a multivariate analysis should be employed if possible.

8) There are a number of clerical errors that should be addressed:

– Figure 4—figure supplement 3A is blank

– Please double check all sample names, in Figure 3B, G613 changes from invasive to reactive and there are several instances where the sample names do not match up.

– If other samples (outside of the 27 discussed in Figure 3) were used in Figure 4, it should be made clear. G472, G440, and G473 are not in Figure 3

– A table with all the sample names, GSC subtype identified through ATAC-seq, and validation of GSC should be included

– Figure 4—figure supplement 1 G is missing labels

– Please include the GSEA gene set size.

– Describe how many essentiality genes were included for GSEA genes

– Figure 4D should be revised to better represent the number of mice were used. A detailed experimental design and the survival curve analysis should also be provided.

– Figure 2—figure supplement 1 lacks sample annotation in both the image and the figure legend.

– Figure 4—figure supplement 3 was blacked out in the version provided for review.

– There appears to be a typographical error at the start of Results paragraph five, "FOXD1 is essential only in the Reactive state […]". This seems like it should be, "Invasive".

Reviewer #1:

The current manuscript by Guilhamon et al. uses single cell sequencing technologies to identify and then characterize heterogeneities among glioblastoma (GBM) stem cells (GSC), which extend beyond current disease classifications. Single cell technologies have already been applied to GBM in various publications. The identification of an invasive GSC subtype associated with poor survival is not novel, as it largely complements the previously identified mesenchymal GSC subtype. However, as far as I'm aware this is the first study to apply single cell sequencing specifically to glioblastoma stem cells (GSCs) with the goal of identifying GSC subtypes using primarily chromatin accessibility signatures. In particular, the use of chromatin accessibility signatures in combination with TF motif prediction to guide identification of transcriptional dependencies is novel in the context of GSC studies. Thus, this manuscript would constitute a meaningful contribution to the field of GBM and epigenetics.

– Validation of GSC population enrichment should be done and shown. Particularly because despite what is described in the text, Figure 2F show low correlation between GBM stem TF enrichment and GSC signature enrichment. 2 out of 4 primary GBM show very little correlation. Therefore, GSC enrichment should be validated with other methods, such as staining for known neural stem cell markers or clustering/enrichment/scoring based on validated GSC gene sets.

– Chromatin accessibility scores at promoter regions is used as a surrogate for gene expression for numerous analyses. What is the concordance between RNA-seq and ATAC-seq for the paired datasets? Particularly, what is the concordance for the subtype specific peaks? The concordance of gene expression and chromatin accessibility feels particularly important when exclusive peaks only account for ~5-15% of all peaks.

– Invasion assays should include all subtypes in order to highlight properties of the invasive subtype. Additionally, the author's conclusions would be strengthened by the combination of FOXD1 knockdown/inhibition with the invasion assays.

– Figure 4—figure supplement 3A is blank.

– Please double check all sample names, in Figure 3B, G613 changes from invasive to reactive. There are several instances where the sample names do not match up.

– If other samples (outside of the 27 discussed in Figure 3) were used in Figure 4, it should be made clear. G472, G440, and G473 are not in Figure 3.

– Please include table with all the sample names, GSC subtype identified through ATAC-seq, and validation of GSC.

– Figure 4—figure supplement 1 G is missing labels.

Reviewer #2:

In this manuscript, Guilhamon et al. describe a single cell ATACseq study in GBM patient tissues and bulk ATACseq in a panel of patient-derived GBM stem cell cultures. Using this dataset, the authors studied the three states in glioblastoma stem cell cultures, which presents some interesting findings to the biology of GBM. The dataset generated in this study is useful to the field. There are some concerns to be addressed:

The single cell data would seem to be potentially the most important, but the number of samples is small and the degree of concordance between samples was not fully developed. It would be important to have more samples for many reasons, including the idea presented by the authors of differences between tumors in the dominant cell type. GBM has significant variation spatially, as well, so having only 4 tumors seems underpowered.

There are missing details about the tumor collection. The three groups are designated to include reactive, which has immune cell signatures, and invasive. Often the tumor does not include areas of necrosis and invasion in most surgical specimens.

Why were the GSC cultured for 5-12 passages? This would seem to induce potential selection or change the cells. If the laminin conditions are optimal, why weren't the GSC studied immediately? It would be helpful to include support for the LDAs and xenograft studies for each of the lines.

Please include the full information about tumor genetics, patient information, and treatment for each sample.

There are many very strong statements made. It would seem premature to call single cells GSC only based on transcription. I am not aware that any profile is validated to claim absolutely to identify GSC. This is but one of the leaps that were made. I would ask the authors to be more careful and simply identify exactly what was measured.

There is no direct comparison of GBM tissues and GSC cultures. The conclusion that GSCs "reflect the chromatin identity of putative cancer stem cells found in primary brain tumors" should be supported by more evidence. For examples, direct comparison of some important drivers or markers at the chromatin level should be provided.

The analysis of GBM cancer stem cell signature and 4 cellular states is interesting. Can the authors explain why the putative cancer stem cells were under-represented in MES cell state? Did the authors consider or separate transcription factors or markers for cancer stem cells of different types in GBM? Is there any difference between stem and non-stem cells from the same cellular state?

Is there overlap between 4 GBM tissues with scATAC data and patients with GSC culture?

The description of the mismatch between chromatin and TCGA subtype is confusing (Figure 3B). According to the description, GSCs are believed to be a relatively pure population, can the authors compare GSCs to the published GBM scRNAseq data to check the subtypes (Neftel et al., 2019, etc.)? In addition, the top and bottom panels could be combined, and the sample assignment of TCGA subtypes, chromatin clusters and expression clusters should be better presented in the sankey plot. The GSC samples and color bars showing chromatin clusters are not consistent between the top and the bottom panels, making it really difficult to understand.

It's interesting that many GSCs express immune cell related genes, which should be further explained, as the enrichment significance (FDR) is not very strong.

The differentially expressed gene analysis between three GSC states should be performed to provide state markers to have a better understanding of cellular states.

Figure 4D is difficult to understand. What's the meaning of the n? How many mice were used in the experiment? A detailed experimental design and the survival curve analysis should be provided.

The CRISPR screen would be beneficial to validate, including additional direct studies of specific targets in a larger cohort of the GSC.

The validation of the states in the TCGA data are not strong. The cutoffs seem arbitrary and most of the samples seem to fall into a single group. Is this due to sampling bias? There is no connection to tumor genetics.

Reviewer #3:

Guilhamon et al. performed single-cell ATACseq in four primary IDH-wildtype glioblastomas to derive chromatin accessibility signatures of heterogeneous glioma stem cell populations. The scATACseq primary tumor analysis suggested the existence of three major modules: Reactive, Constructive, and Invasive that define glioma stem populations and are present to varying degrees within a single tumor. The module/cell state relevance is supported by leveraging new and existing functional assay data in sets of patient-derived GSC populations. The authors then demonstrate that transcription factors nominated by DNA recognition motif enrichment in each GSC state are preferentially essential through analysis of a recently published genome-wide CRISPR/Cas9 essentiality screen. Finally, evidence is presented to suggest that these chromatin signatures are associated with survival in a xenograft model and TCGA subjects.

Overall, this study addresses an important biological question of glioblastoma stem cell heterogeneity and is one of the earliest papers investigating scATAC glioblastoma profiles. However, the conclusions drawn are not fully supported by the results and in some cases the technical details of the experiment/analysis are lacking. The following suggestions are intended to help strengthen this work.

1) The clarity of the paper would be aided by additional technical descriptions throughout the manuscript. For example:

a) The four GBM samples profiled using scATACseq were classified using the Neftel signatures, on the basis of accessibility of signature gene promoters. This approach will bin individual cells whether the classification is accurate, or not. How can we be sure that this approach accurately provides Neftel classification?

b) The same classification method is suggested to result in greater stratification of the Neftel MES category. This result may be confounded by the presence of tumor-associated microglia that will be classified as mesenchymal. Were all classified cells tumor cells and how was this validated?

c) It was not fully explained why single cells without evidence for either Chr7 amplification or Chr10 deletion were included in further analyses regarding glioma stem cell populations. The cells that were not labelled by "GBM CNV+" (Figure 2—figure supplement 1) also seem to less frequently be classified as "STEM" (Figure 1D, especially G4250 and G4275). Are these cells of poorer quality or do they represent non-tumor cell types? It would also be helpful to know whether the authors enriched for tumor cells during the dissociation process (otherwise one would expect non-tumor cells too), the number of nuclei that were loaded for 10X, basic quality control metrics for each sample, and if cells identified as non-tumor were excluded from construction of the chromatin modules.

2) Figure 1 was challenging to follow with the multiple annotations of the patient UMAPs. For instance, in panels 1B and 1D the original numbered clusters remain for some not all cells. Are these cells that failed to confidently classify as cell states (i.e., AC, MES, NPC, OPC) or stem states? Perhaps consider "graying" out cells that fail to classify and explain why not all tumor cells might have a cell state classification would benefit the audience. I could not find mention of this in the Results or Materials and methods sections. Also, what happens when all cells from the four subjects are analyzed together in a single analysis? Do shared chromatin states that exist across patients emerge or is there highly specific patient clustering that reflects inter-patient heterogeneity? Panel C in this figure was also difficult to understand. None of the modules equal to 100%, is this because some cells failed to classify?

3) Bulk GBM ATACseq tumor profiles were generated in the TCGA dataset that presumably have accompanying RNA, DNA, and methylation data (Corces et al. Science 2018). While this represents a small dataset, it would be advantageous to incorporate analysis of these more complex tissue samples alongside the already presented data to provide confidence in generalizability.

4) Figure 3 appears to be missing some critical information. Figure 3A is missing a legend as there is no indication what the intensity values represent. For Figure 3B, is there an associated P-value testing for the enrichment of TCGA subtypes among the GSC ATAC states? Figure 3C could present cut-offs for what was considered statistically significant (i.e., a dotted line). Importantly, "Figure 3—figure supplement 1" presents the most differentially accessible regions between GSC states. While it is difficult to read the all the regions it's notable that ChrX is present in all. I imagine that chromatin accessibility read-outs might be significantly impacted by the presence of an inactive X chromosome. If there is an imbalance in the sex distribution in the GSC states, then this might skew to picking up sex-specific and not tumor-specific differences. Have the authors shown that these regions do not vary with patient sex.

5) The set of GSCs making up the Invasive subtype are all classified as Mesenchymal by gene expression (Figure 3B), consider calling it as such for consistency?

6) The evidence that "GSC states are not genetically defined (Figure 3D)" is derived from DNA methylation copy number, and a signature of selected genes. Is this sufficient? What about bona fide glioblastoma genes such as PDGFRA, NF1, AKT3, QKI? Did DNA copy number profiles align with transcriptional subtypes? Do the GSC states cluster by frequent chromosome arm-level events (chr7, chr19, chr20 gains and chr10 loss)? If they still do not cluster by GSC state, then it is perhaps more accurate to say that "GSC states are not defined by somatic copy number events" since there is no gene mutation data presented for these samples.

7) More than one publication has speculated to be able to stratify IDH wild type glioblastoma into prognostically relevant groups, i.e. PMID 30753603. For a tumor type with a single treatment that is not very effective, it is unclear that predicting prognosis has relevance.

8) In a multi-variate model that includes known prognostic factors (i.e., subject age and MGMT promoter methylation status) does the "Invasive-high" group still confer a worse prognosis? This would aid the author's claim that the signature is "predictive of low patient survival" independent on known prognostic factors.

9) Single-nucleus ATACseq of IDH mutant glioma has been reported (PMID 31806013) and there is some overlap in the gene sets that were identified to associate with cell states and survival, and it would be appropriate to cite this literature.

[Editors’ note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Single-cell chromatin accessibility of glioblastoma identifies invasive cancer stem cell linked to lower survival" for further consideration by eLife. Your revised article has been evaluated by Kevin Struhl (Senior Editor) and a Reviewing Editor.

The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below:

The current manuscript by Guilhamon et al. uses single cell sequencing technologies to identify and then characterize heterogeneities among glioblastoma (GBM) stem cells (GSC), which extend beyond current disease classifications. The paper will add meaningful knowledge to the field and revisions have enhanced this manuscript. For example, validation of the cells was provided, methodologies were clarified, and impressive data related to the invasive subtype was provided. Additional samples would have enhanced the study, but were difficult to obtain given the COVID-19 pandemic. Overall, though, this would be the first study to apply single cell sequencing specifically to GSCs with the goal of identifying GSC subtypes using primarily chromatin accessibility signatures. Moreover, the use of chromatin accessibility signatures in combination with TF motif prediction to guide identification of transcriptional dependencies is novel in the context of GSC studies. These strengths overcome weaknesses associated with small sample numbers.

1) Please clarify what percentage of cells were estimated to be stem cells, among the scATACseq datasets.

2) Please modify the statement "A more likely possibility is that given the TCGA subtypes were determined from bulk GBM, they may not fully capture the nature of rarer populations found within a tumor, such as the cancer stem cell populations." Due to the amount of testing needed to test this, please modify this explanation to be broader and to also include the possibility that this is due at least in part to differences in microenvironments.

3) Please modify the statement "No molecular signature in GBM has so far been reported that can significantly stratify the poorer prognosis IDH wild-type patients by survival." This is incorrect. DNA methylation profiling classifies IDH wild type tumors into several categories including Mesenchymal-like (Ceccarelli et al., Cell, 2016), which shows significantly worse outcomes even among only MGMT-unmethylated tumors.

4) Regarding the statement "Using the TCGA IDH wild type cohort (n=144)", please clarify that: a) this cohort relates to the subset of GBM IDH wildtype cases with available RNAseq data (presumably); and b) that the GSC states were converted from a chromatin accessibility signature to a RNA expression signature.

5) There is no obvious reason why the TCGA IDH wildtype cohort could not also be classified on the basis of the available Affymetrix microarray data, which would enable analysis of a substantially larger cohort in pursuit of a survival difference (n=400). Please include this analysis.

6) Please include a small paragraph in the Discussion about the limitations of the study; particularly as they relate to a potentially small samples size.

Reviewer #3:

A few remaining comments with respect to the revision:

1) It would be helpful to clarify what percentage of cells were estimated to be stem cells, among the scATACseq datasets.

2) "A more likely possibility is that given the TCGA subtypes were determined from bulk GBM, they may not fully capture the nature of rarer populations found within a tumor, such as the cancer stem cell populations." While in this reviewer's opinion the most likely explanation is the presence/absence of a tumor microenvironment, we won’t know without a substantial amount of extra testing, which is beyond the scope. Please modify as to keep the explanations broad.

3) "No molecular signature in GBM has so far been reported that can significantly stratify the poorer prognosis IDH wild-type patients by survival." This is incorrect. DNA methylation profiling classifies IDH wild type tumors into several categories including Mesenchymal-like (Ceccarelli et al., Cell, 2016), which shows significantly worse outcomes even among only MGMT-unmethylated tumors.

4) "Using the TCGA IDH wild type cohort (n=144)" it would help to clarify that a. this cohort relates to the subset of GBM IDH wildtype cases with available RNAseq data (presumably); and b. that the GSC states were converted from a chromatin accessibility signature to a RNA expression signature.

5) There is no obvious reason why the TCGA IDH wildtype cohort could not also be classified on the basis of the available Affymetrix microarray data, which would enable analysis of a substantially larger cohort in pursuit of a survival difference (n=400).

eLife. 2021 Jan 11;10:e64090. doi: 10.7554/eLife.64090.sa2

Author response


[Editors’ note: the authors resubmitted a revised version of the paper for consideration. What follows is the authors’ response to the first round of review.]

Major concerns:

1) The tumor cell and GSC population enrichment should be validated using functional and genomic methods (including evidence for either Chr7 amplification or Chr10 deletion), and additional patient samples should be analyzed, preferably without extended culture times. Indeed, there appeared to be a low correlation between GBM stem TF enrichment and GSC signature enrichment and the degree of concordance between samples was not fully developed. It would be important to have more samples for many reasons, including the idea presented by the authors of differences between tumors in the dominant cell type. GBM has significant variation spatially, as well, so having only 4 tumors seems underpowered.

Evidence of chr7 and chr10 alterations at the single cell level can be found in Figure 2—figure supplement 1A-D, with details presented in the Materials and methods section “Single cell ATAC-seq”, and in the main text. To further explore the concordance between samples in response to this and other comments below, we also annotated single cells that failed to classify into one of the Neftel et al. subtypes, and the number of tumours sharing UMAP modules of each state. These data can be found in Figure 1—figure supplement 1A-B, and in the main text.

We also conducted additional analyses into the concordance of stem cell calls by the two signatures (GSCs and the 19 stem TF signature by Suva et al., 2014). We found that the overlap in cells significantly enriched for either stem signature across the four tumours was significant (hypergeometric test p-value = 8.8 x 10−8), and this has been added to the main text.

Restrictions due to the covid-19 pandemic prevented us from obtaining and processing new samples for scATAC-seq. We however attempted to incorporate new samples into our single-cell analysis using publicly available data from the Wang et al. (Cancer Discovery, 2019) study. That study contained four IDH-wt GBM samples processed for scATAC-seq, of which three had raw data available (one sample’s data was corrupted on the EGA archive). Unfortunately, only one of the three samples had more than 350 cells available after quality control, and that sample had only 19% sequencing saturation (as compared to ~75% in our samples). Under these circumstances it was unfeasible to integrate these data into our study and perform on them the same analyses used on our samples.

Collectively, the data presented in this manuscript represent the most comprehensive study to date of single cell chromatin accessibility in glioblastoma and, as highlighted in your summary above, provide novel insights and a valuable resource to the community.

2) Details about the tumor collection should be included as should full information about tumor genetics, patient information, and treatment for each sample.

We have included details of tumor collection for those samples used in the scATAC-seq as well as known patient characteristics (IDH status, sex, and age) in the Materials and methods section “Single cell ATAC-seq”. Details on sample collection for the GSCs are included in the Materials and methods section “Patient samples and cell culture”. Details on the GSC state of each GSC are in Supplementary Table 1.

3) Figure 1 should be revised to clarify cells which fail to classify and to signify whether shared chromatin states exist across patients.

We have added that information in the form of stacked bar plots in Figure 1—figure supplement 1A.

4) Figure 3 should be revised. Data pertaining to GSCs should be compared to the published GBM scRNAseq data to check the subtypes (Neftel et al., 2019, etc.). In addition, the top and bottom panels could be combined, and the sample assignment of TCGA subtypes, chromatin clusters and expression clusters should be better presented in the sankey diagram. The GSC samples and color bars showing chromatin clusters should be kept consistent between the top and the bottom panels. Figure 3A is missing a legend as there is no indication of what the intensity values represent. For Figure 3B, please clearly indicate an associated P-value testing for the enrichment of TCGA subtypes among the GSC ATAC states. Figure 3C should present cut-offs for what was considered statistically significant (i.e., a dotted line). Importantly, "Figure 3—figure supplement 1" presents the most differentially accessible regions between GSC states. Notably, ChrX is present in all. Chromatin accessibility read-outs might be significantly impacted by the presence of an inactive X chromosome. Hence an imbalance in the sex distribution in the GSC states might pick up sex-specific and not tumor-specific differences. Thus, the authors should show that these regions do not vary with patient sex. Finally, the evidence that "GSC states are not genetically defined (Figure 3D)" is derived from DNA methylation copy number, and a signature of selected genes. Bona fide GBM genes such as PDGFRA, NF1, AKT3, QKI should also be included in this analysis and the possibility that the GSC states cluster by frequent chromosome arm-level events (chr7, chr19, chr20 gains and chr10 loss) should be entertained. If they still do not cluster by GSC state, then it is perhaps more accurate to say that "GSC states are not defined by somatic copy number events" since there is no gene mutation data presented for these samples.

Following the reviewer’s recommendations we have combined the top and bottom panels of Figure 3B, and corrected the sample labelling on the heatmap. We thank the reviewers for pointing out that error. We have additionally added a legend to both heatmaps on Figure 3A and 3B and added the GSEA Enrichment Score range to Figure 3B.

In Figure 3C only significant (q-value <=0.05 after multiple test correction) terms are displayed. We have included this in the revised figure legend.

The reviewers correctly point out that sex imbalance between groups could have been a concern; we had verified there was none early in the study but failed to discuss it in the original version of this manuscript. This is now included in the Materials and methods section “ATAC-seq”.

On the reviewers’ recommendation, we have extensively updated Figure 3D, including additional samples, as well as more GBM-related genes and the large genomic events most commonly found in GBM, including on chromosomes 7,10,19 and 20. The conclusions from this analysis are maintained and we have amended the text as suggested.

5) The validation of the states in the TCGA data are not strong. The cut-offs seem arbitrary and most of the samples seem to fall into a single group. Is this due to sampling bias? There is no connection to tumor genetics. Also, the bulk GBM ATACseq tumor profiles were generated in the TCGA datasets that presumably have accompanying RNA, DNA, and methylation data (Corces et al. Science 2018). While this represents a small dataset, it would be advantageous to incorporate analysis of these more complex tissue samples alongside the already presented data to provide confidence in generalizability. The concordance between RNA-seq and ATAC-seq data for the paired datasets should also be presented.

The cut-offs and methods used in the TCGA state labelling are detailed in the Materials and methods section “Survival Analysis”. They were set at z-score of 1.65 (which corresponds to a p-value of 0.05) for the Invasive-high to identify those patients with a significant enrichment of the Invasive GSC signature. For the Invasive-mid patients, we chose a z-score between 1 and 1.65 in order to identify those patients who displayed an important enrichment of the signature even though it did not reach significance. In addition, we show in Figure 5—figure supplement 2B-C, that changing these thresholds does not alter the conclusions of the analysis.

While we appreciate the suggestion to use the Corces et al. datasets to match expression and chromatin accessibility, the 9 GBM ATAC-seq samples used in that study do not have RNA-seq data available. This can be verified in the supplementary table of the study as well as directly on the GDC data portal.

6) Invasion assays should include all subtypes in order to highlight properties of the invasive subtype.

We have performed numerous additional invasion assays to confirm the invasive properties of the invasive GSC state. Using a minimum of three representative GSC lines from each of the three GSC states as well as two human fetal neural stem cells (hfNSCs) as controls, we performed collagen invasion assays. For each GSC, the assay was repeated in triplicate, and at least 3 spheres were measured for each replicate. The results confirm that Invasive state GSCs are over 3-fold more invasive than GSCs from the other states or the control hfNSCs. These data can be found in Figure 5 B-C, Figure 5—figure supplement 1, in the Materials and methods section “Invasion assays”, and in the main text.

7) In order to support the claim that the "Invasive-high" group confers a worse prognosis, a multivariate analysis should be employed if possible.

With this analysis, we were investigating whether a particular GSC subtype enriched in a given tumour might be associated with survival. To do this we calculated the hazard ratios of the different GSC states both in the xenografts and TCGA data as shown in the Results. As no other molecular signature in GBM has so far been reported that can significantly stratify the poorer prognosis IDH wild-type patients by survival, and as using profiles from GSCs specifically has not been attempted before, there are no known covariates we could integrate into this analysis.

8) There are a number of clerical errors that should be addressed:

– Figure 4—figure supplement 3A is blank

– Please double check all sample names, in Figure 3B, G613 changes from invasive to reactive and there are several instances where the sample names do not match up.

– If other samples (outside of the 27 discussed in Figure 3) were used in Figure 4, it should be made clear. G472, G440, and G473 are not in Figure 3

– A table with all the sample names, GSC subtype identified through ATAC-seq, and validation of GSC should be included

– Figure 4—figure supplement 1 G is missing labels

– Please include the GSEA gene set size.

– Describe how many essentiality genes were included for GSEA genes

– Figure 4D should be revised to better represent the number of mice were used. A detailed experimental design and the survival curve analysis should also be provided.

– Figure 2—figure supplement 1 lacks sample annotation in both the image and the figure legend.

– Figure 4—figure supplement 3 was blacked out in the version provided for review.

– There appears to be a typographical error at the start of Results paragraph five, "FOXD1 is essential only in the Reactive state […]". This seems like it should be, "Invasive".

We are grateful to the reviewers for pointing out these errors. We have corrected them as follows:

– The invasion data is no longer in Figure 4—figure supplement 3A but in Figure 5 and Figure 5—figure supplement 1.

– The labels in Figure 3B have been corrected

– Additional samples used beyond the 27 discussed in Figures 2 and 3 are outlined in the Materials and methods section “ATAC-seq”

– A table with sample names and GSC states has been added as Supplementary file 1

– Labels have been added to Figure 4—figure supplement 1G

– The GSEA gene set sizes have been added to the figure legend

– Essentiality gene numbers used in the GSEA analysis have been added to the Materials and methods section “Gene Essentiality Screen”

– We have added the number of mice used for each of the 37 GSCs in the Materials and methods section “Survival Analysis” and in the figure legend. An average of 5 mice were used for each GSC.

– Annotation has been added to Figure 2—figure supplement 1 in the image and figure

– As mentioned above, the invasion data is no longer in Figure 4 but in Figure 5 and Figure 5—figure supplement 1.

– The typographical error related to FOXD1 has been corrected in the main text.

[Editors’ note: what follows is the authors’ response to the second round of review.]

The current manuscript by Guilhamon et al. uses single cell sequencing technologies to identify and then characterize heterogeneities among glioblastoma (GBM) stem cells (GSC), which extend beyond current disease classifications. The paper will add meaningful knowledge to the field and revisions have enhanced this manuscript. For example, validation of the cells was provided, methodologies were clarified, and impressive data related to the invasive subtype was provided. Additional samples would have enhanced the study, but were difficult to obtain given the COVID-19 pandemic. Overall, though, this would be the first study to apply single cell sequencing specifically to GSCs with the goal of identifying GSC subtypes using primarily chromatin accessibility signatures. Moreover, the use of chromatin accessibility signatures in combination with TF motif prediction to guide identification of transcriptional dependencies is novel in the context of GSC studies. These strengths overcome weaknesses associated with small sample numbers.

1) Please clarify what percentage of cells were estimated to be stem cells, among the scATACseq datasets.

This information has now been added within the Results section as well as within the Materials and methods (Single Cell Atac section).

2) Please modify the statement "A more likely possibility is that given the TCGA subtypes were determined from bulk GBM, they may not fully capture the nature of rarer populations found within a tumor, such as the cancer stem cell populations." Due to the amount of testing needed to test this, please modify this explanation to be broader and to also include the possibility that this is due at least in part to differences in microenvironments.

We have now broadened the statement in the main text and included the possibility of the absence of a tumor microenvironment in the GSC populations. This change can be found in the Results section.

3) Please modify the statement "No molecular signature in GBM has so far been reported that can significantly stratify the poorer prognosis IDH wild-type patients by survival." This is incorrect. DNA methylation profiling classifies IDH wild type tumors into several categories including Mesenchymal-like (Ceccarelli et al., Cell, 2016), which shows significantly worse outcomes even among only MGMT-unmethylated tumors.

In our own review of the manuscript cited by the reviewer, we could not find data that directly supports the identification of a subset of adult IDH-wt GBM tumors with a worse prognosis, particularly as the cohorts used mixed both low-grade gliomas and GBM as well as pediatric and adult samples. However, in order to clarify the statement in the manuscript, we have modified it to reflect that this claim applies only to IDH-wt adult GBM and phrased as follows “molecular signature in adult GBM stratifying the poorer prognosis IDH wild-type patients by survival are lacking.”

4) Regarding the statement "Using the TCGA IDH wild type cohort (n=144)", please clarify that: a) this cohort relates to the subset of GBM IDH wildtype cases with available RNAseq data (presumably); and b) that the GSC states were converted from a chromatin accessibility signature to a RNA expression signature.

We have expanded the Materials and methods section related to the Survival Analysis to include these clarifications and specify how the TCGA cohort was identified as well as the full details of the conversion between the chromatin accessibility and RNA expression signatures.

5) There is no obvious reason why the TCGA IDH wildtype cohort could not also be classified on the basis of the available Affymetrix microarray data, which would enable analysis of a substantially larger cohort in pursuit of a survival difference (n=400). Please include this analysis.

We thank the reviewer for this suggestion. Incorporating the microarray expression data confirms our findings. We have included the results of this analysis in the following text and figures: Results , Figure 5—figure supplement 2D-E and Materials and methods survival analysis section.

Of note, the results of the survival analysis obtained using the TCGA microarray data are very similar to those obtained using the RNAseq (p-value=0.017 vs 0.019) and are featured in the supplementary figure as validation. However, as only 54% of our signature genes were included in the microarray dataset, the original classification based on RNAseq remains more robust and we have elected to keep that original classification within the main manuscript figure.

6) Please include a small paragraph in the Discussion about the limitations of the study; particularly as they relate to a potentially small samples size.

We have now included a paragraph discussion this limitation in the Discussion section.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Guilhamon P, Kushida MM, Dirks PB, Lupien M. 2019. Epigenetic characterization of glioblastoma stem cells. NCBI Gene Expression Omnibus. GSE109399
    2. Nikolic A, Ellestad K, Singhal D, Gallo M. 2021. Single-cell ATAC-Seq of Adult GBM. NCBI Gene Expression Omnibus. GSE139136
    3. Guilhamon P, Kushida MM, Cavalli FMG, Dirks PB, Lupien M. 2018. RNA-seq of Glioblastoma stem cells. EGA. EGAS00001003070

    Supplementary Materials

    Supplementary file 1. List of GSC state labels, patient age, and sex.
    elife-64090-supp1.zip (447B, zip)
    Supplementary file 2. Coordinates of discriminating ATAC regions for each GSC state.
    elife-64090-supp2.zip (3.1KB, zip)
    Transparent reporting form

    Data Availability Statement

    The GSCs are available upon reasonable request from PBD and SW. The GSC ATAC-seq and DNA methylation data have been deposited at GEO (GSE109399). The scATAC-seq data has been deposited at GEO (GSE139136). RNA-seq data are available at EGA (EGAS00001003070).

    The following datasets were generated:

    Guilhamon P, Kushida MM, Dirks PB, Lupien M. 2019. Epigenetic characterization of glioblastoma stem cells. NCBI Gene Expression Omnibus. GSE109399

    Nikolic A, Ellestad K, Singhal D, Gallo M. 2021. Single-cell ATAC-Seq of Adult GBM. NCBI Gene Expression Omnibus. GSE139136

    Guilhamon P, Kushida MM, Cavalli FMG, Dirks PB, Lupien M. 2018. RNA-seq of Glioblastoma stem cells. EGA. EGAS00001003070


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES