Abstract
Recent longitudinal studies of glioblastoma (GBM) have demonstrated a lack of apparent selection pressure for specific DNA mutations in recurrent disease. Single-cell lineage tracing has shown that GBM cells possess a high degree of plasticity. Together this suggests that phenotype switching, as opposed to genetic evolution, may be the escape mechanism that explains the failure of precision therapies to date. We profiled 86 primary-recurrent patient-matched paired GBM specimens with single-nucleus RNA, single-cell open-chromatin, DNA and spatial transcriptomic/proteomic assays. We found that recurrent GBMs are characterized by a shift to a mesenchymal phenotype. We show that the mesenchymal state is mediated by activator protein 1. Increased T-cell abundance at recurrence was prognostic and correlated with hypermutation status. We identified tumor-supportive networks of paracrine and autocrine signals between GBM cells, nonmalignant neuroglia and immune cells. We present cell-intrinsic and cell-extrinsic targets and a single-cell multiomics atlas of GBM under therapy.
Subject terms: Tumour heterogeneity, Cancer genomics, Cancer, Cancer therapeutic resistance
Wang et. al. perform single-cell and spatial analyses of paired primary and recurrent samples from patients receiving standard-of-care therapy for GBM and find changes in tumor signaling pathways and the microenvironment with targetable potential.
Main
The genetics of glioblastoma (GBM), the most common and aggressive primary malignancy of the adult brain, have been extensively characterized1–3. Despite this fact, GBMs have proven resistant to all genotoxic therapies employed in clinical trials thus far. Recent longitudinal studies of GBMs based on bulk DNA sequencing demonstrate a lack of selection pressure for DNA mutations that are private to either primary or recurrent disease, a lack of association between genetic selection pressure and standard therapy and a remarkable clonal stability under therapy4. These findings are consistent with recent molecular analyses of spatially mapped biopsies from GBM specimens, which find minimal evidence of intra-tumor regional heterogeneity in clonal mutations5. Taken together, these results support the hypothesis that selection pressure for or against specific mutations occurs mostly during initial malignant transformation and that standard chemoradiation therapy does not apply significant additional selection pressure at the level of genomic alteration.
On the other hand, recent single-cell/single-nucleus RNA-sequencing (sc/snRNA-seq) studies of primary disease have demonstrated that GBM cells exhibit a high degree of plasticity at the phenotypic level and apparent transitions between cellular states6–9; however, it is unknown to what extent standard therapy, temozolomide (TMZ) chemotherapy, ionizing radiation (IR) and surgical resection, applies a selection pressure for or against specific cell types at recurrence. The extent to which standard therapy shapes the milieu of tumor-associated immune cells and nonmalignant neuroglia is not fully understood. To address this, we performed snRNA-seq on a cohort of patient-matched primary-recurrent paired specimens (n = 86), together with single-cell assay for transposase-accessible chromatin via sequencing (scATAC-seq), spatial transcriptomics (ST), spatial proteomics (SP) and exome sequencing (exome-seq) for specific subcohorts (Fig. 1a). We present a single-cell multi-omics atlas of GBM under therapy. This resource fills a gap in our knowledge of GBM at recurrence and has allowed us to derive several clinically relevant findings.
We extend our previous finding that proliferating GBM cells lie on a single axis of variation, ranging from the Verhaak proneural (PN) to mesenchymal (MES) phenotypes to the context of recurrent GBM. On average, we found that GBM patients undergo a PN-to-MES shift at recurrence, concomitant with an increase in the birth rate of MES cells in recurrent tumors and supported by paracrine signals from the tumor microenvironment. We identified gene-expression correlates of the re-entry of previously quiescent MES cells into the cell cycle at recurrence and found that targeting these genes decreases GBM cell viability. We identified chromosome-arm-scale copy-number variants (CNVs) that correlate with the MES phenotype, not previously described. Hypermutation status was found to be a predictor of increased T-cell infiltration at recurrence. Moreover, increased T-cell infiltration at recurrence was prognostic. Last, we mapped intercellular paracrine and autocrine signals between neoplastic cells, non-neoplastic neuroglia and immune cells via snRNA-seq. We validated coexpression of these signals in situ via ST and SP. We showed that these signals enhance GBM cell viability in a panel of low-passage patient-derived cell lines and are targetable therapeutically. Taken together, these studies address fundamental questions of GBM cellular biology and the selection pressure applied by standard therapy, as well as provide therapeutic targets for further development.
Results
Single-cell transcriptomics of longitudinal GBM specimens
We profiled 86 longitudinal fresh-frozen tissue specimens from 49 patients undergoing surgical resection for GBM via snRNA-seq. For 36 patients we profiled paired specimens from the primary untreated tumor and matched first recurrence (Fig. 1a and Supplementary Table 1), additionally we profiled four untreated-primary and six first recurrence unmatched specimens. At the time of the first recurrence all patients had been treated only with standard-of-care therapy (TMZ, IR and surgical resection). The cohort’s age ranged from 35–76 years and had a male-to-female ratio of 1.2. Nuclei isolation from frozen tissue, nuclei capture and library preparation were performed as previously described8, yielding 254,288 transcriptomes. We found that data quality metrics for our single-nucleus data, such as number of features per cell (Extended Data Fig. 1a) or doublet rate (Extended Data Fig. 1b), met or exceeded the quality for recent GBM single-cell studies7,9. Neoplastic cells (daughters of the tumor-initiating cell) were separated from nonmalignant neuroglia, endothelial and immune cells via our previously described approaches10–12, which include an analysis of gene signatures as well as expressed mutations (Fig. 1b–d). Mutation profiles were validated by DNA-amplicon sequencing via the UCSF500 panel for 52 of 86 cases, the mutational status of isocitrate dehydrogenase (IDH) was assessed for all patients via sequencing and and/or immunohistochemistry. Four samples were identified as having mutations in IDH1 and were excluded from further analysis. We use the term GBM to refer to IDH-wild-type GBM from hereafter. On average, samples from recurrent tumors had less purity than primary cases (Fig. 1d and Extended Data Fig. 1c). All cell types found were represented in specimens from across all lobes of the brain, a spectrum of ages and both sexes (Extended Data Fig. 1d–g).
Meta-analysis supports a proneural–mesenchymal axis
We recently showed that the phenotypes of proliferating primary GBM cells have a dominant axis of variation that ranges from the MES to PN transcriptional subtypes8. Subsequent scRNA-seq studies of primary GBM specimens and patient-derived tumor-propagating cells have produced similar findings6. We performed a meta-analysis of snRNA-seq data from our primary tumors and other recent single-cell studies of primary GBMs7,9. An unbiased multiple-factor analysis (MFA; Methods), an extension of principal-component analysis (PCA) to multiple tables, showed that the largest contribution to variation in primary GBM neoplastic cells was an axis between MES (for example CD44 and CHI3L1) and PN (for example OLIG2 and DLL3) expression programs (Fig. 2a,b and Supplementary Tables 2 and 3). Inter-table analysis demonstrated nearly equal contribution to overall variance from each of the studies included, indicating that this result was not due to inter-laboratory technical effects (Fig. 2c). The second largest source of variation in this analysis, consistent with Wang et al., was the expression of markers of mitotic cells such as MKI67. We found that the same result held in both primary and recurrent specimens; however, the distribution of cells along the PN–MES axis shifts at recurrence (Fig. 2d,e). Using our previously described approach, we classified all neoplastic cells as either PN or MES11. This classification agreed with PCA analysis (Fig. 2d,e). We found a significant association between patient age and the MES phenotype (Extended Data Fig. 2a–c); however, there was no such association with tumor location (Extended Data Fig. 2d). While we did see an association between age and sex in our data, we did not find a significant association between sex and the MES phenotype (Extended Data Fig. 2e,f). Last, we identified megabase-scale CNVs in the snRNA-seq data using patient-matched exome sequencing (exome-seq) as validation (Extended Data Fig. 2g and Methods). We found significant associations with between several prevalent chromosome-arm level CNVs and the MES phenotype (Fig. 2f,g and Extended Data Fig. 2g).
A mesenchymal shift in recurrent disease
A shift toward the MES phenotype at recurrence is a hallmark of therapy resistance3,13–15. Considering paired cases only and using a paired test, we found a significant increase in the percentage of MES cells per patient on average at recurrence (Fig. 3a). This MES shift was also observed in our bulk RNA-seq data (Extended Data Fig. 3a). While we found no significant difference in the percentage of cycling cells overall when comparing paired primary and recurrent cases (Fig. 3b), cases that underwent a MES shift showed a marked increase in the percentage of MES cycling cells at recurrence (Fig. 3c). The MES shift, as previously described3, could in principle be explained by at least several factors: a preferential resistance of MES cells to therapy, activation of a MES expression program within non-MES cells and/or a change in the birth rates of non-MES and MES cells at recurrence. We found that the latter, an increase in the proliferation rate within the MES population, has a clear contribution. When we performed RNA velocity analysis to estimate rates of gene transcription (Methods) we found modest and infrequent positive velocities for MES genes, in non-MES cells (Extended Data Fig. 3b). PCA of MES cells from recurrent cases identified cycling versus quiescence as their primary axis of variation. RNA velocities indicated a unidirectional transition from quiescent to cycling MES cells (Fig. 3d,e and Extended Data Fig. 3c). There was no significant difference in the fractions of unspliced versus spliced total transcripts observed when comparing cycling to quiescent populations (Extended Data Fig. 3d), indicating that this result is not due to technical bias. Genes that correlated with progression from quiescence to cycling (Supplementary Table 4) were over-represented in the DNA-damage response pathway (Extended Data Fig. 3c,e,f). Conversely, transforming growth factor-β pathway genes were upregulated in quiescent MES cells.
Single-cell open-chromatin profiling implicates AP1
We performed scATAC-seq on 10 IDH-wild-type GBMs: four primary-recurrent pairs, one unmatched primary case and one unmatched recurrent case. This yielded 22,214 sequenced cells. Additionally, we included in our analysis four primary IDH-wild-type GBMs that we had profiled via snATAC-seq previously8. Neoplastic cells were separated from immune cells and nonmalignant glia based on detected mutations and a clustering of gene activity profiles. To identify cell states based on transcription factor activity, we scanned scATAC-seq reads for over-represented transcription factor motifs compared to a data-driven background model (Methods). Neoplastic cells were then hierarchically clustered based on inter-cell correlations in motif frequency deviances from background, identifying clusters of cells with similar transcription factor utilization (Fig. 3f,g). In both the primary and recurrent GBM hierarchical clusters, at their top levels, neoplastic cells were split into two states that bore hallmarks of the PN and MES phenotypes, respectively. In particular, the first cluster was enriched for proneural transcription factors (for example OLIG2 and NEUROG1). The second had activator protein 1 (AP1) complex components over-represented, some of which have been previously described as regulating MES gene expression16. Motif analysis of scATAC-seq provides information about differential targeting of transcription factors. This is often independent of differential transcription factor expression per se. Nonetheless, we did find that some of these cluster-specific transcription factors were also differentially expressed between PN and MES cells in our snRNA-seq data (Fig. 3h), further supporting our interpretation of these two clusters as consisting of PN and MES cells, respectively. Consistent with the MES shift identified in our snRNA-seq data, we also found evidence for the MES shift in our scATAC-seq data. In particular, the MES cluster increased in relative size at recurrence (primary, 60% PN and 40% MES; recurrent, 51% PN and 49% MES; Fig. 3f,g). Moreover, the PN and MES cells became more polarized in their respective phenotypes at recurrence with the MES cluster becoming more homogenous; median intra-MES cell correlation increased from 0.08 to 0.20, whereas PN-to-MES cell correlations dropped from −0.12 to −0.180.18 in the median. A differential motif-enrichment test confirmed that PN and MES transcription factors were over-represented in primary and recurrent GBM specimens, respectively (Fig. 3i and Supplementary Table 5). Last, when we identified peaks from the snATAC-seq data we found the same enrichments for proneural and AP1 transcription factor motifs in the peaks specific to primary versus recurrent cases, respectively (Extended Data Fig. 3g,h and Supplementary Tables 6–9).
Radiation induced AP1 enhances mesenchymal genes
We sought to elucidate the AP1 regulome in GBM and determine whether it could be targeted for therapeutic benefit. To accomplish this, we correlated snRNA-seq gene expression with scATAC-seq chromatin accessibility in cis, using a latent-space approach to infer cis-regulatory enhancer activity (Methods). We then scanned these enhancers for AP1 recognition motifs and cross-referenced the results with genes that were differentially expressed between PN and MES neoplastic cells. This identified genes specifically expressed in MES cells, with MES-specific cis-regulatory enhancers that are targeted by AP1 (Fig. 3j,k). All previously described hallmarks of the MES phenotype16–18 were thusly identified as part of the AP1 regulome in GBM. To functionally validate AP1 regulation of these genes, we exploited low-passage (p3–6) cell lines derived from specimens of human recurrent GBMs, cultured as monolayers in defined factors (Methods). We genotyped these lines via UCSF500 DNA-amplicon sequencing and verified that they closely match their parental tumors (Extended Data Fig. 3i). We then treated these cells with an AP1 inhibitor for 48 h and performed scRNA-seq on treated cells and untreated controls (Methods). We found that AP1 inhibition regressed the expression of genes with AP1-regulated enhancers, including genes associated with stemness (for example VIM and MIR99AHG), MES hallmarks (for example CD44 and YAP1), invasiveness (for example TNC and FN1), inflammation (for example NFKB1, FYN and IL1B), IR resistance (for example TLK1) and others (Fig. 4a–e and Extended Data Fig. 4a,b). All these genes were part of the AP1 regulome inferred from in vivo human data and had MES-specific cis-regulatory enhancers that were targeted by AP1. AP1 inhibition did not significantly decrease GBM cell proliferation, although AP1 inhibition did synergize with IR (Fig. 4f). After approximately 48 h of AP1 inhibitor treatment, GBM cells that had been growing as a monolayer on basement membrane extract (BME)-coated plates detached from the plate and continued to grow as floating spheroids (Fig. 4g). Moreover, AP1 inhibition completely abrogated the ability of GBM cells to form colonies in BME. These findings are consistent with our inference of AP1 regulation of TNC, FN1 and other matricellular genes. To determine whether AP1 is induced by IR, we employed an immunocompetent, intracranial murine model19. We treated tumor-bearing mice with IR using a fractionated schedule of 5 Gy on days 10, 12 and 14 after implantation (Methods). This resulted in a significant (28%, P = 0.039) extension of survival (Fig. 4h). Upon euthanasia, mice were perfused and tissue from the injection site was collected. We observed a significant increase in the expression of AP1-component genes, as well as MES hallmark genes, AP1-regulated mediators of inflammation and the DNA-damage response (Fig. 4i and Extended Data Fig. 4d). Taken together, these results indicate that AP1 is induced in GBM upon IR treatment and that AP1 positively regulates the MES hallmarks of inflammation, IR resistance and invasiveness through cis-regulatory enhancers.
Targeting the AP1 regulome for therapeutic benefit
We screened genes from the AP1 regulome against databases of known drug interactions, including off-label activity (Fig. 4j–m and Extended Data Fig. 4e.f). Drug candidates were further prioritized based on evidence of blood–brain barrier permeability. We identified two antipsychotics and an antidepressant that target the AP1-regulated gene TLK1 (Fig. 4d,i) that showed cytotoxic activity in vitro (Fig. 4j). In particular, thioridazine synergized with IR (Fig. 4k), consistent with the role of TLK1 in DNA-damage repair. Additionally, the histone deacetylase HDAC9 and lysine methyltransferases KMT5A, KMT2C were implicated targets of AP1 (Fig. 3j and Extended Data Fig. 4a,b,d) and inhibiting these pathways showed synergy with IR and TMZ (Fig. 4l,m). Having determined that standard therapy selects for a MES phenotype in neoplastic cells, we next sought to analyze how immune cells in the GBM microenvironment respond to standard of care.
Tumor-associated innate immune cells show limited activation
Tumor-associated innate immune cells in the aggregate represented 12.6% and 16.5% of cells profiled, on average, in primary and recurrent tumors respectively (Fig. 5a). There was a significant increase in the percent of innate immune cells classified as bone marrow-derived monocytic (BMDM) lineage cells at recurrence and a significant decrease in the relative proportion of central nervous-system-resident microglia. An unbiased PCA of microglia and BMDM cells, including monocytes and their differentiated progeny, identified ontogeny as their primary source of variation (Fig. 5b). The second principal component (PC) stratified cells by the expression of regulators of inflammation (for example NFKB1), antigen presentation (for example CD74) and reactivity to a hypoxic microenvironment (for example HIF1A). We then scored microglia and BMDM for activation status, according to canonical markers of the pro-inflammatory (M1) and inflammation-resolving (M2) phenotypes, against a data-driven background model (Methods). Cells that did not express either program above background levels were classified as M0. We found that M0 cells had negative PC2 scores on average, whereas M1 and M2 cells had positive scores on average (Fig. 5c). This indicates that the second largest source of variation in innate immune cells is activation status. Consistent with recent findings20, the majority of GBM-associated innate immune cells resided in an M0 state in primary GBM, based on canonical markers. While there were significant increases in the percentages of activated cells at recurrence, pluralities of both microglia and BMDM remained M0 (Fig. 5d).
T cells invade the tumor in outliers with superior prognosis
Tumor-associated lymphocytes consisted of only 0.9% and 1.7% of cells in primary and recurrent GBMs, respectively, with no significant change at recurrence in paired samples. When we performed clustering on lymphocytes (Fig. 5e and Extended Data Fig. 5a,b), the majority were identified to be in an exhausted state based on the expression of immune checkpoints (Fig. 5f). The percentage of exhausted T cells significantly increased at recurrence, at the expense of regulatory and proliferating T-cell percentages (Fig. 5g). Although T cells in total represented less than 1% of cells found in most specimens, T-cell abundance at recurrence did correlate with a significant increase in overall survival (Fig. 5h and Extended Data Fig. 5c). This result was apparently driven by 16% of recurrent cases which had levels of T-cell infiltration 2–20-fold higher than average. To identify correlates of these T-cell outliers we first assessed hypermutation status, as defined by greater than 20 mutations per megabase of DNA21,22. Hypermutation correlated with significantly higher levels of T-cell infiltration at recurrence when compared to either non-hypermutated recurrent cases or patient-matched primary cases (Fig. 5h and Extended Data Fig. 5d–f). While neurofibromin 1 (NF1) mutation status correlated with significantly increased innate immune-cell infiltration in primary GBMs (Extended Data Fig. 5g), this correlation did not hold in recurrent cases (Extended Data Fig. 5h). In contrast to previous reports23, NF1 mutations did not correlate with T-cell abundance (Extended Data Fig. 5i,j). Female sex was another correlate of increased T-cell infiltration at recurrence, but this association did not hold in primary cases (Extended Data Fig. 5k,l). When we assessed mismatch-repair (MMR) gene expression and mutation status we found that MMR expression, but not mutation status, correlated with increased T-cell infiltration in primary GBMs alone (Extended Data Fig. 5n–q). Consistent with these findings, MMR expression correlated with mutational burden only in primary GBMs (Extended Data Fig. 5r). To validate T-cell levels measured by snRNA-seq, we performed immunohistochemistry for CD8 (a marker of activated T cells) on slides from three T-cell outlier recurrent GBMs and their non-outlier patient-matched primary tumors (Fig. 5i and Extended Data Fig. 5s,t). The primary tumors had typical low levels of CD8+ T-cell abundance and CD8+ T cells were sparse, isolated and frequently confined to the perivascular space. On the other hand, the matched recurrences showed robust T-cell invasion of the cellular tumor.
Spatial analysis yields cell-extrinsic therapeutic targets
To further evaluate immune outlier cases, we profiled six formalin-fixed paraffin-embedded (FFPE) slides from three recurrent outlier cases and their matched primary cases by SP, using the Nanostring GeoMx platform (Methods). For tissue visualization, we performed immunofluorescence (IF) for glial fibrillary acidic protein (GFAP) (broadly expressed in glia), CD68 (enriched in innate immune cells), CD45 (a pan-immune marker) and DNA. Approximately 12 regions of interest (ROIs) were profiled for 17 immune-related targets per slide (Fig. 6a). Having clustered our normalized SP data, we identified regions devoid of immune infiltrates and regions expressing high levels of markers of T cells (for example CD4). We observed two types of T-cell-rich regions in the SP data, the first correlated with expression of immune checkpoints (PD-1 and PD-L1), antigen presentation/presenting-cells (CD68, CD11c, HLA-DR and B2M) and blood vessels (SMA and PanCk). The second T-cell rich cluster showed little expression of PD-1 or PD-L1. Notably, all outlier cases contained some ROIs that were enriched for B-cell markers, such as CD20, along with markers of dendritic cells (for example CD11c) and T cells (for example CD4). IF on adjacent slides for Iba, CD3 and CD20 confirmed the aggregation of B cells, T cells and monocytic-lineage cells that were consistent with the presence of tertiary lymphatic structures, for two of the three outlier cases surveyed (Fig. 6b).
We next sought to expand our spatial analysis more broadly by performing ST on ten slides from five patient-matched primary-recurrent pairs (for example Extended Data Fig. 6a). We compared ST datasets via hierarchical clustering (Fig. 6c) and performed deconvolution analysis of the ST data based on our snRNA-seq derived signatures (Extended Data Fig. 6b and Methods). This analysis divided samples into four clusters based on two gene sets. The first two clusters were enriched for markers of proliferating tumor cells and depleted of characteristic markers of immune cells (Extended Data Fig. 6c and Supplementary Tables 10 and 11). These two clusters also had higher PN cell percentages. By contrast the third and fourth clusters overexpressed class-II interferon-signaling genes, class I and II human leukocyte antigen (HLA) and markers of innate and adaptive immune cells. We reasoned that an integrated analysis of ST and snRNA-seq data could be used to map paracrine signals in the GBM microenvironment and identify therapeutic targets. To that end, we first used CellChat to infer receptor–agonist interactions between different GBM cell types using snRNA-seq data (Methods). This approach integrates databases of agonist–receptor interactions with agonist–receptor subunit and antagonist expression levels to infer cell-type-specific signaling networks. This identified a diverse network consisting largely of secreted ligands and their receptors. We mapped differences between primary and recurrent specimens, differences between PN and MES neoplastic cell types and differences between monocytic-lineage cells based on ontogeny (Fig. 6d, Extended Data Fig. 6d and Supplementary Table 12). To validate these findings from snRNA-seq in the ST data, we computed Pearson correlations between agonist–receptor gene pairs across ROIs and samples for the hits from our CellChat analysis (Extended Data Fig. 6e and Supplementary Table 12).
We next focused on signaling between neoplastic cells and nonmalignant glia, as recurrent disease often emanates from regions of diffuse infiltration and glial interactions in that niche are poorly understood. We leveraged two recurrent GBM cases (SF11082 and SF12407) where the interface between tumor and nonmalignant tissue could be discerned in FFPE slides (Fig. 7 and Extended Data Figs. 7 and 8). Many of the ligand–receptor interactions that we inferred from the above snRNA-seq/ST co-analysis were positively correlated with progression from tumor to adjacent tissue across all three serial ROI ladders in SF11082 (Fig. 7a and Extended Data Fig. 8a). Correlated genes were enriched for ontology annotations that included WNT, growth factor, cytokine and chemokine signaling (Extended Data Fig. 8c,d). We validated a tractable number of these paracrine networks in adjacent sections via RNAscope duplex in situ hybridization (RNAscope). In particular, we assessed IGF1-IGF1R, PTN-PTPRZ1, WNT3A-LRP6 and WNT2B-LRP6 coexpression in SF11082 and SF12407. We then quantified the frequencies of receptor-positive, ligand-positive and double-positive cells (Methods). For IGF and WNT pathway genes, we identified positive gradients of receptor-expressing and ligand-expressing cells as we transitioned from tumor to nonmalignant tissue (Fig. 7b,c and Extended Data Figs. 7 and 8e,f). We observed fourfold to tenfold increases in the frequencies of both receptor-expressing and ligand-expressing cells in diffusely infiltrated nonmalignant adjacent tissue when compared to regions of dense cellular tumor (Fig. 7 and Extended Data Figs. 7 and 8f). Notably, for IGF and WNT pathways, cells that were double positive for both the receptor and cognate ligand were infrequent. Instead, ligand-positive putative neoplastic cells were found adjacent to receptor-positive apparent nonmalignant glia, validating our predicted paracrine networks (Fig. 7, Extended Data Figs. 7 and 8e and Supplementary Table 12). By contrast, the PTN-PTPRZ1 RNAscope assays indicated higher levels of autocrine signaling in regions of dense tumor, evidenced by a nearly fourfold drop in double-positive cells when comparing regions in the cellular tumor to adjacent tissue. While the frequency of cells expressing PTN increased in the transition from tumor to adjacent tissue, the frequency of cells expressing PTPRZ1 decreased in an anticorrelated fashion. This is consistent with the asymmetric signaling from nonmalignant to malignant glia that was predicted by our snRNA-seq/ST network analysis for PTN-PTPRZ1 (Extended Data Fig. 7a).
To assess the functional impact of these signals, we treated our panel of cell lines with recombinant IGF1, PTN and WNT3A (Fig. 8). We also tested recombinant CNTF, HGF, LIF and OSM (Methods), as these pathways had likewise been implicated in our network analysis. We found that each of these treatments significantly increased proliferation in at least some cell lines and that CNTF, PTN or WNT3A treatment robustly and consistently increased proliferation across all lines (Fig. 8a), in a dose-dependent fashion (Fig. 8b). WNT3A treatment provided a modest, but statistically significant degree of protection against TMZ chemotherapy (Fig. 8c). The baseline rate of proliferation could be recovered post WNT3A treatment via WNT3A inhibition (Fig. 8d). Moreover, WNT3A significantly enhanced colony formation and conferred resistance to IR (Fig. 8f–h).
Discussion
We constructed a single-cell atlas of GBM under therapy, including RNA, open-chromatin and spatial readouts. Additionally, we have derived results from this resource regarding the molecular basis for MES transition, genetic correlates of the MES phenotype, the relationship between hypermutation status and T-cell infiltration, as well as cell-intrinsic and cell-extrinsic targets for therapeutic development.
We previously found that the largest source of variation in the phenotypes of proliferating cells from primary GBMs is the PN–MES axis8. Our meta-analysis of public and in-house snRNA-seq data (Fig. 2a–e), as well as our analysis of open-chromatin signatures (Fig. 3f–i), supports this finding and extends it to recurrent GBM. A recent study that profiled GBM stem cells found a similar gradient between what they termed developmental and injury-response states, bearing markers of the PN and MES subtypes, respectively6. Another recent study of GBM stem cells likewise reported a single axis of variation in phenotypes between what they call MES and non-MES cell types, which strongly correlated with our PN and MES signatures at both the level of gene expression as well as DNA methylation patterns24. Gangoso et al. also clearly show that a component of the MES signature is induced by immune cells and likely driven by interferon exposure.
These results dovetail with the recent findings of Schmitt et al., who used a reporter system driven by PN- and MES-specific enhancer sequences25. Their studies indicate that the MES phenotype is inducible, for example via exposure to TNF-α and driven by NF-κB upregulation. This is consistent with previous studies in PN GBM models13. The notion that the MES phenotype is inducible is supported by our analysis of the AP1 regulome (Figs. 3 and 4). This analysis shows that most MES genes are enhanced by AP1. It is well known that AP1 regulates responses to external stimuli, such as growth factors, cytokines and reactive oxygen species. We were able to induce AP1 expression and downstream MES hallmarks in vivo via treatment with IR. We also found a correlation between whole-chromosome copy-number gains on chromosomes 19 and 20 and the MES signature (Fig. 2). Taken together these results support the hypothesis that the MES phenotype is inducible yet biased by genetics. In the reporter system of Schmitt et al. the induction of a MES phenotype was reversible. Here too, we could regress MES genes via AP1 inhibition (Fig. 4). Strategies to reverse the MES phenotype would clearly be therapeutically relevant.
The basis for the MES shift in recurrent GBM is not completely understood. Multiple factors that are related to standard therapy could, in principle, contribute to this shift, for example changes in the cellular rates of division within MES and non-MES populations in response to therapy, a phenotypic shift within individual cells to a MES phenotype, preferential resistance to standard therapy by quiescent MES cells or genetic alterations that bias cells toward the MES state. There is certainly evidence in our data and from recent studies supporting all these mechanisms as factors. In our data the contribution from phenotypic shifts within individual cells was modest when measured via RNA velocity (Extended Data Fig. 3b); however, because some of these samples have been taken months after treatment pressure is removed, we cannot rule out a phenotypic shift occurring within individual cells in response to treatment playing a significant role in establishing the increased prevalence of MES cells seen at this later stage. A clear contribution to the MES shift at recurrence seems to be a marked increase in the percentage of MES cells which are cycling (Fig. 3b,c). This percentage stands in stark contrast to the strong correlation observed between the MES phenotype and quiescence in primary, treatment-naive GBM8,9.
Recently, Alexander et al. found that Olig2+ radiosensitive proliferating cells consisted of the bulk of treatment-naive PN murine gliomas26; however, side-population, immunohistochemistry (IHC) and scRNA-seq analysis also identified Olig2−/Nestin+ radioresistant perivascular cells that proliferated in response to radiotherapy26. The analysis by Alexander et al. of upstream transcription factors implicated the AP1 component FOSL2 and other genes that have been associated with the MES phenotype in previous reports16 and in our snRNA-seq and scATAC-seq (Figs. 3f–i and 4a–e). These findings are also consistent with the Holland laboratory’s previous report of a PN-to-MES shift in this model following radiotherapy14. Taken together, ours and previous studies support the premise of a quiescent stem-like cell with MES characteristics that is resistant to IR and TMZ, but that re-enters the cell cycle following therapy and becomes a driver of recurrent disease.
We observed an increase in the relative proportion of monocytic-lineage cells from the periphery at recurrence (Fig. 5a). We and others have shown that the abundances of myeloid-derived cells from peripheral blood and MES neoplastic cells correlate across patients and that these cell types co-occupy distinct tumor-anatomical niches8,10,27. For hypermutated cases, recurrence also correlated with a significant increase in T-cell abundance (Fig. 5h). By contrast, Hodges et al. concluded no association between tumor mutational burden (TMB) and T-cell infiltration based on IHC for PD-1, PD-L1 and CD8 in human GBMs21; however, while the cohort sizes interrogated for PD-1 and PD-L1 were sizable according to Hodges et al. (n = 94 and 189, respectively) the cohort interrogated for CD8 (n = 9) was less so and other pan-T-cell markers were not considered. Touat et al. tested for an association between MMR deficiency and T-cell infiltration in GBM (again via IHC) and found none, but TMB and T-cell abundance were not compared22. Our results most closely align with those of Wang et al.3, which we extend with our finding of a significant increase in T cells in high-TMB recurrences compared to recurrent cases with mid and low TMBs. TMZ treatment has been associated with hypermutation status at recurrence28, although hypermutation status has not been reported to convey increased survival4. By contrast, we found that T-cell abundance was correlated with a significant increase in survival (Fig. 5h). This seems to be driven by a cohort of outlier recurrent cases, whose T-cell abundances were two to eightfold higher than average. T cells represent just over 1% of the cellular tumor on average, so this is not high in absolute numbers; however, it indicates that (1) GBMs with a higher mutational burden are potentially more immunogenic; and (2) standard care has a treatment effect on the adaptive immune response; T-cell outlier cases are all recurrences and TMB correlates with significant increases in T-cell abundance only at recurrence (Fig. 5h and Extended Data Fig. 5d,e). T cells in most tumors were either isolated cells in a field of apparent glia or confined to a perivascular space (for example Fig. 5i). By contrast, T cells in the outlier recurrent cases robustly invade the cellular tumor (Fig. 5i and Extended Data Figs. 5s,t and 6a).
The innate immune compartment showed a remarkable lack of activation in primary GBM according to standard markers for the M1 and M2 phenotypes (Fig. 5b–d). This is consistent with previous reports20; however, we found significant increases in the percentages of activated innate immune cells at recurrence (Fig. 5d). Although we did not see a significant increase in the total abundance of innate immune cells in recurrent GBMs (Fig. 5a, top), we did find a significant increase in the relative abundance of innate immune cells which were classified as being derived from circulation (Fig. 5a, bottom). At the same time, the M1/M2 paradigm is overly simplistic and many patterns of non-canonical innate immune activation have been described10. A spectrum model may be more suitable than a bimodal M1/M2 classification29.
The tumor microenvironment shapes the composition of GBM neoplastic cells30. We modeled autocrine and paracrine signaling networks in GBM through an analysis of receptor and paired agonist expression in our snRNA-seq data (Fig. 6d, Extended Data Fig. 6d,e and Supplementary Table 12). These networks were subsequently validated via ST, RNAscope and in vitro analysis (Fig. 7 and Extended Data Fig. 7). Many of the inferred networks represent well-studied pathways in GBM: inflammation (for example interleukin-1–4 and type II interferon), immune-cell chemotaxis (for example colony-stimulating factor and CCL/CXCL) and angiogenesis (for example platelet-derived growth factor and vascular endothelial growth factor). These pathways were active in the initial disease and remained persistently active through recurrence; however, others were specifically upregulated in recurrent GBM, particularly in MES cells. For example, MES cells expressed receptors for WNT, NRG, NGF and IGF pathway genes specifically at recurrence (for example Fig. 6d, Extended Data Fig. 6d and Supplementary Table 12). This may reflect differences in microenvironment composition at recurrence. We observed a greater abundance of nonmalignant oligodendrocyte-lineage cells and innate immune cells derived from peripheral blood in recurrent specimens (Fig. 5a).
Early single-cell studies in GBM were limited to working with fresh tissue. Although these studies yielded unprecedented insights into GBM cellular composition and the tumor microenvironment, the requirement for prospective sample collection limited our statistical power as well as the types of cohorts we could assemble. Recent seminal studies demonstrated that nuclei could be efficiently extracted from archival frozen tumor specimens for single-nucleus profiling and that the resulting data were quantitative and comparable to scRNA-seq data31. This advance opened tissue archives to studies such as the one presented here; however, while profiling nuclear RNA gives an accurate quantification of relative gene expression there is a loss of information from a lack of mitochondrial RNA in snRNA-seq. In addition to the information about metabolism that can be gleaned from mitochondrial RNA24, expressed mitochondrial mutations can be used for single-cell phylogenetics8,32. More generally, our droplet-based approach yields 3′-enriched coverage that is intended only for quantification of gene expression. Single-cell alternative splicing, an increasingly recognized contributor to tumor immunogenicity33, and single-cell analysis of expressed mutations are not optimal in these data. Last, epigenetic regulation is increasingly understood as an important lens through which therapeutically relevant processes in GBM can be understood, for example the MES shift at recurrence (Figs. 3 and 4), stemness and tumorigenicity34, oncogene amplification35 and sex differences36, to name a few. Single-cell epigenetic analyses of longitudinal brain tumor specimens are lacking. Given the plethora of new modalities available37,38, the rationale for doing these studies is strong. The resource generated by this study broadly informs disparate aspects of glioma biology. Further work will be required to functionally test many of the hypotheses generated from this atlas.
Methods
Ethical approval
Study protocols and sample use were approved by the University of California, San Francisco (UCSF) Institutional Review Board. All clinical samples were analyzed in a de-identified fashion. All experiments were carried out in conformity to the principles set out in the Declaration of Helsinki as well as the Department of Health and Human Services Belmont Report. Informed written consent was provided by all patients.
Tumor tissue acquisition
We obtained fresh-frozen and FFPE tissue specimens from patients undergoing surgical resection for glioma at UCSF. De-identified samples were provided by the UCSF Neurosurgery Tissue Bank.
Statistics and reproducibility
No statistical method was used to predetermine sample size. All available GBM specimens in the UCSF Brain Tumor Center Tissue Bank were profiled, representing decades of biobanking at UCSF. No data were excluded from the analyses. Randomization and blinding was used for all in vitro and in vivo experiments. The Wilcoxon signed-rank test for paired samples was used to compare percentages of cell types between primary and paired recurrent specimens. As the Wilcoxon test is nonparametric, we did not formally test for normality of the data. A log-rank test was used to assess significance in Kaplan–Meier survival analysis. A two-sided likelihood-ratio test between hurdle models was used to assess differential gene expression between single-cell datasets and were adjusted for multiple hypothesis testing via the Benjamini–Hochberg method. Fisher’s exact test was used to test for genotype–phenotype associations.
Nuclei isolation
For snRNA-seq, nuclei were extracted from frozen tissues following the ‘Frakenstein’ protocol developed by L. Martelotto, Melbourne, Centre for Cancer Research, Victorian Comprehensive Cancer Centre and available from 10x Genomics (https://community.10xgenomics.com/t5/Customer-Developed-Protocols/ct-p/customer-protocols). For snATAC-seq, frozen tissues were digested mechanically in a Dounce grinder with 500 µl of lysis buffer (Sigma). The lysate was strained through a 40-μm strainer, pelleted, washed and resuspended in 500 µl nuclei wash buffer (10x Genomics). Nuclei were subsequently purified via centrifugation in a sucrose-based density gradient, pelleted, washed and resuspended in tagmentation buffer (10x Genomics).
Cell derivation, culture and in vitro viability assays
Fresh tumor tissues were dissociated mechanically with a scalpel and then enzymatically (32 mg collagenase IV, 10 mg deoxyribonuclease I, 20 mg soybean trypsin inhibitor and 10 ml DPBS) at 37 °C under rotation for 15 min. Tissue lysate was further dissociated via pipetting and incubated at 37 °C for another 15 min with rotation. The lysate was filtered through a 70-μm strainer then a 40-μm strainer, washed twice and resuspended in RBC lysis buffer for 5 min. Cells were spun down, washed and resuspended in culture medium (DMEM/F12 GlutaMAX, 0.5% N2, 0.5% B27 without vitamin A, 1% antibiotic-antimycotic, 0.5% NEAA, 20 ng ml−1 epidermal growth factor and 20 ng ml−1 fibroblast growth factor). Cells were cultured in Matrigel-coated plates at 37 °C and 5% CO2.
Where indicated, cell viability was assessed via AlamarBlue (Invitrogen) reduction. Briefly, cells were seeded in 96-well plates at a density of 5,000 cells per well with DMEM/F12 complete medium and held overnight at 37 °C and 5% CO2. The medium was then aspirated and test compounds diluted in culture medium were administered at the reported concentrations. After culture for the reported time periods, the cells were washed with PBS and resuspended in 9 ml of culture medium plus 1 ml of AlamarBlue (Invitrogen). The plate which was then incubated for 4 h. Cell viability was assessed via absorbance using a microplate reader and compared to vehicle-treated control wells. In other experiments, proliferation was measured by trypan blue cell counting via a Countess II.
For WNT3A inhibition endo-IWR 1 (R&D Sys) was used. This is a small molecule inhibitor of Axin turnover resulting in stabilization of the β-catenin and suppression of Wnt signal transduction. The diastereomeric form of IWR 1, exo-IWR 1, which exhibits decreased Axin stabilizing activity compared to endo-IWR 1 was used as a control.
Clonogenic assays
Cell we seeded at a density of 600 cells per well in the middle wells of six-well plates that were coated with 2% Cultrex TM BME (R&D Sys A1569601). Cells were incubated at 37 °C and 5% CO2 for 2 h to allow attachment. Subsequently, plates were exposed to 3 or 4 Gy of gamma radiation (JL Shepherd & Associates) in rotation function without plate cover via a cesium-137 source emitting at a fixed dose rate of 2.46 Gy min−1. After 10 d, colonies were fixed using 6% glutaraldehyde diluted in DPBS and stained with 1 ml of Crystal Violet. ImageJ (v.1.51 h) was used for automated colony counting.
Multiplex immunofluorescence
FFPE tumor sections were profiled by multiplex immunofluorescence using a Discovery XT autostainer (Ventana Medical Systems) with appropriate controls. Antibodies used were CD3 (Leica, Clone LN10, 1:100 dilution), CD20 (Leica, Clone L26, 1:200 dilution) and Iba1 (Wako Chemicals, 019-19741, 1:500 dilution).
Dual RNAscope
FFPE sections were evaluated by dual RNAscope chromogenic in situ hybridization assay for the expression of ligand–receptor pairs using Advanced Cell Diagnostics probes specific for PTN (838191) and PTPRZ1 (584789-C2), WNT2B (453369) or WNT3A (429439) and LRP6 (custom probe, C2) and IGF1 (313039) and IGF1R (415819-C2). The RNA Probe PPIB (313909) and dapB (312039) were used as positive and negative control probes, respectively. Cell segmentation, classification and staining quantification were performed via QuPath (v.0.3.2).
Murine IR assay
All animal experiments were conducted in compliance with protocols approved by the Institutional Animal Care and Use Committee at UCSF, following the National Institutes of Health (NIH) guidelines for animal care. The UCSF Institutional Animal Care and Use Committee maximal tumor burden of 20 mm in any direction was not exceeded. Ten-week-old C57BL/6J female mice were purchased from the Jackson Laboratory (000664) and housed in the UCSF animal facility 1 week before tumor transplantation at temperatures of 65–75 °F (~18–23 °C) with 40–60% humidity and a 14–10-h light–dark cycle. An aliquot of 1,600 SB28 cells19 were injected into the right frontal cortex at the coordinate Bregma, AP +2.0 mm, ML +2.0 mm and DV −2.0 mm in a cohort of six mice. All mice developed tumors, based on bioluminescence imaging. On days 10, 12 and 14 after tumor transplantation, 5 Gy head-only irradiation was given to three randomly chosen tumor-bearing animals as previously described39. Endpoints were determined by weight loss and neurological symptoms. Upon euthanasia, brain tumors were quickly dissected after perfusion (cold 1× PBS) and snap frozen in liquid nitrogen, then stored at −80 °C before use.
Spatial transcriptomics and proteomics assays
For the ST assay, FFPE tissue blocks were reviewed for tumor purity and ten 5-µm sections were cut by the UCSF Neurosurgery Tissue Core. Slides were baked at 37 °C overnight and then deparaffinized, rehydrated, antigen-retrieved for 20 min at 100 °C and digested with proteinase-K 0.1 µg ml−1 for 15 min in a Leica BOND-RX. Samples were post-fixed in neutral-buffered formalin for 10 min and hybridized to the Cancer-Transcriptome Atlas (>1,800 targets) UV-photocleavable barcode-conjugated RNA in situ hybridization probe set overnight. Samples were washed to remove off-target probes and counterstained with morphology markers for 2 h. The morphology markers consisted of 1:25 dilution SYTO13 (Invitrogen), 1:100 dilution anti-CD3 Alexa Fluor 647 (UMAB54, Origene), 1:200 dilution anti-CD68-Alexa Fluor 594 (KP1, Santa Cruz) and 1:400 dilution anti-GFAP-Alexa Fluor 488 (GA5, Invitrogen). IF imaging, ROI selection, spatially indexed barcode cleavage and collection were performed on a GeoMx Digital Spatial Profiling instrument (NanoString). Approximately 12 ROIs were collected per sample. Photoreleased GeoMx DSP oligonucleotide tags containing RNA IDs and a unique molecular identifier were collected from each ROI. After PCR with dual-indexing Illumina i5 and i7 primers, the library was purified with AMPure XP beads (Beckman Coulter), quantitated with a Qubit (Themo Fisher Scientific) and quality was checked with a Bioanalyzer (Agilent). Paired-end sequencing was performed on NextSeq 550 and NextSeq 2000 instruments.
Similarly, for the SP assay, FFPE tissue blocks were reviewed for tumor purity and six 5-µm sections were cut by the UCSF Neurosurgery Tissue Core. Slides were baked at 37 °C overnight, deparaffinized, rehydrated, antigen-retrieved in a pressure cooker for 15 min at 100 °C at high pressure. Samples were then incubated overnight with the GeoMx Immune Cell Profiling Protein Core antibodies (NanoString) containing UV-photocleavable barcode-conjugated antibodies against 17 targets and 6 control targets. At the same time, the samples were incubated with morphology antibodies consisting of SYTO83 (100 nM final concentration), 1:200 dilution anti-CD68-Alexa Fluor 594 (clone KP1), 1:200 dilution anti-CD45-Alexa Fluor 647 (clone 2B11 + PD7/26) and 1:400 dilution anti-GFAP-Alexa Fluor 488 (clone GA5). IF imaging, ROI selection, spatially indexed barcode cleavage and collection were performed on a GeoMx Digital Spatial Profiling instrument (NanoString) by GENEWIZ. Approximately ten ROIs were collected per sample. The resulting photocleavable barcode tags were subsequently detected and counted using an nCounter Prep Station and Digital Analyzer (NanoString).
10x Genomics-based snRNA-seq/snATAC-Seq
Single-nucleus capture, reverse transcription, cell lysis and library preparation for snRNA-seq were performed on the 10x Genomics platform as per manufacturer’s protocol. Approximately 15,000 nuclei were loaded per capture. For snATAC-seq assay, tagmentation, nuclei capture and library prep were likewise performed via the 10x Genomics platform as per manufacturer’s protocol. Sequencing was performed on an Illumina NovaSeq with 10x Genomics recommended parameters.
SnRNA-seq data preprocessing
The preprocessing of snRNA-seq data was performed as described previously8. We utilized CellRanger (v.3.0.2) for alignment and gene expression quantification, following the guidelines from the CellRanger website (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/advanced/references#premrna). We filtered cells that have >2.5% mitochondrial read counts and <200 expressed genes. DoubletFinder (v.2.0.2)40 was used to remove doublets and was run using the first ten PCs and default parameters.
Copy-number variation analysis of snRNA-seq and snATAC-seq
CONICSmat (v.1.0) was used to assess the presence/absence of somatic CNVs in 10x snRNA-seq data41. We retained CNVs with a CONICSmat likelihood-ratio test <0.05 and a difference in Bayesian Criterion >50. For each CNV we used a cutoff of posterior probability >0.5 in the CONICSmat mixture model to infer the presence/absence of that CNV in a given cell. The presence/absence of somatic CNVs in 10x snATAC-seq data was likewise estimated with CONICSmat. Here, the gene activity of cells generated by snapATAC (v.1.0.0)42 was used as input to perform CNV analysis.
Exome sequencing and copy-number variant identification
The Targeted DNA Seq Library Reagent kit was used for exome capture on tumor samples for selected patients. Libraries were sequenced on an Illumina-HiSeq 4000 using 150-bp paired-end reads. Reads were first trimmed and filtered with TrimGalore v.0.6.5 (parameter, −q = 30) (https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) and Cutadapt v.3.4 (ref. 43). The quality control passed reads were mapped to the human Grch38 genome with BWA and only uniquely matched paired reads were used for analysis44. PicardTools (http://broadinstitute.github.io/picard/) and the GATK toolkit carried out quality score recalibration, duplicate removal and realignment around indels45. CNVs were inferred with CNVkit v.0.9.6 (segment function, P value threshold <1 × 10−3)46.
Dimensionality reduction, clustering and cell-type classification for snRNA-seq
The snRNA-seq data were processed with Seurat v.3 (ref. 47). Data were normalized via the LogNormalize method with scale.factor of 10,000 using the NormalizeData function. Highly variable genes were identified via Seurat using the mean.var.plot method with default parameters. Based on these genes, a PCA was performed and the first 15 PCs were retained for clustering and visualization via t-SNE. Neoplastic cells were separated from non-neoplastic cells based on the presence of CNVs. For neoplastic cells, MES and PN cell-type labels were assigned via ELSA, an ensemble-learning approach that has been trained on historical data8,11. For non-neoplastic cells, cell clusters were identified via the ‘FindClusters’ function via the Louvain algorithm with the resolution parameter of 0.52. Cluster-specific genes were identified via the FindAllMarkers function in Seurat v.3 with Rstudio running R v.3.6.0 (ref. 47) via a MAST test and used to assign cell-type labels.
Multiple-factor analysis
To perform MFA, we obtained 10x scRNA-seq data from Neftel et al.9 (GSE131928) and Couturier et al.7 (EGAS00001004422). Cells with more than 5% mitochondrial read counts were filtered for both datasets and cells with at least 200 expressed genes were retained for analysis. The gene matrix for each dataset was filtered to only retain the highly variable genes, identified via the FindVariableFeatures function of Seurat v.3 with selection.method ‘mvp’. This was applied to each dataset separately. Then genes that were identified as variable in at least two datasets were used for the MFA (1,411 genes in total). The FactoMiner package48 was used to perform MFA. The contributions of each dataset to total variance explained, as well as the genes’ qualities of representation by dimensions one and two (cos2), was computed via the fviz_contrib function from the factoextra package49.
Single-nucleotide variant calling in snRNA-seq and UCSF500 genotypes
The bam file of 10x snRNA-seq data generated via CellRanger was used to perform single-nucleotide variant calling by pooling reads by patient and running the GATK RNA-seq best-practices pipeline (https://software.broadinstitute.org/gatk/best-practices/workflow?id=11164). Variant assignments in single cells were then assessed via the VarTrix (v.1.0) tool (https://github.com/10xgenomics/vartrix). Variants were annotated with the Annovar software package50. The UCSF500 mutation panel is a clinical assay that uses an amplicon sequencing-based genotyping approach that compares a tumor tissue sample and a patient-matched blood control. For the mutation calling of UCSF500, reads were mapped to the human genome reference with BWA44. PicardTools (http://broadinstitute.github.io/picard) and the GATK toolkit45 carried out quality score recalibration, duplicate removal and realignment around indels. Somatic single-nucleotide variants were detected with MuTect (https://www.broadinstitute.org/cancer/cga/mutect) for each tumor–control pair. The mutations were annotated with the Annovar software package.
RNA velocity analysis
RNA velocities were computed via scVelo using default parameters51. MES neoplastic cells from recurrent GBMs were used as input. SnRNA-seq data were filtered and normalized via scvelo.pp.filter_and_normalize with parameter min_shared_counts = 20. The first 30 PCs were used to compute moments for velocity estimation via scvelo.pp.moments. The terminal states and latent time of cycling cells were computed via scvelo.tl.latent_time function included in scVelo. The putative driver genes of transcriptional changes were systematically identified by high likelihoods in the dynamic model. In particular, the top 300 highest-likelihood genes were used to generate a heat map via the scvelo.pl.heatmap function. Genes that showed a Pearson correlation with pseudotime of .1 or higher at an adjusted P value52 of q < 0.05 were used for Gene Ontology analysis via WebGestalt using the WikiPathway reference (Fig. 3e).
Monocytic cell classification and analysis
To classify the activation status of monocytic-lineage cells, signature genes of classically (M1: CCL2/3/4/5/8, CCR7, CD74, CSF2, CXCL10, HLA-DRA/B, IFNG, IL1B, IL1R1, IL6, INOS, IRF5, NFKB1, TLR2/4 and TNF) and alternatively activated (M2: ARG1, CD74, CCL1/17/22/, CXCL16, CXCR4, HLA-DRA/B, IL10, IL4, IRF4, MRC1, NFKB1, TGFB1 and TNF) phenotypes were aggregated from previous reports53–57. The M1 and M2 score for each cell was calculated via AddModuleScore function with the above M1/M2 signatures as input. This routine compares average signature levels to a data-driven background distribution. Cells were assigned M0 status if the M1 and M2 signature scores were both less than 0.0. Otherwise, activation status was determined by the higher of the M1 and M2 signature scores.
T-cell phenotype classification and analysis
We used Seurat to perform dimensionality reduction, clustering and visualization for T cells. In particular, data were normalized via the NormalizeData function using the LogNormalize method. Highly variable genes were identified via the FindVariableFeatures function using the mvp method (mean.cutoff of c(0.1, 8) and dispersion.cutoff = c(1, Inf)) and other parameters set to default values. PCA was performed based on these genes. The first 15 PCs were retained for clustering via a k-nearest neighbor graph and visualized via t-SNE. A heat map was generated via DoHeatmap with the average expression calculated with the AverageExpression function. A boxplot of different T-cell types was generated via ggplot with geom_boxplot in R v.3.6.0.
snATAC-seq data processing and analysis
The CellRanger ATAC software (v.1.1.0) was used for read alignment, deduplication and identifying transposase cut sites (https://support.10xgenomics.com/single-cell-atac/software/pipelines/latest/algorithms/overview). The output matrix of CellRanger was further processed via the snapATAC package (https://github.com/r3fang/SnapATAC)42. We selected the highest quality barcodes for each case based on two criteria: (1) number of filtered fragments >1,000; and (2) fragments in promoter ratio >0.2 for the case. Clustering was performed using Seurat v.3 SNN-graph clustering via the ‘FindClusters’ routine, with gene body-accessibility scores generated by the snapATAC package as input. Transcription factor motif frequency deviations from a data-driven background model were calculated via the computeDeviations function in chromVAR (v.1.6.0) with default parameters58, using only neoplastic cells as input. Differential motif deviances were computed via a t-test and controlled for multiple hypothesis testing via fdrtool52. Differentially accessible regions, peaks and motif enrichments on differential peaks (relative to a genome-wide background) were computed using snapATAC’s ‘findDAR’, ‘runMACSForAll’ and ‘runHomer’ respectively, run with default parameters. Heat maps of differential peaks were created in deepTools v.3.4.0 (ref. 59).
Cell–cell communication analysis
The CellChat package was used to assess cell–cell communication via interaction-network analysis60. A Seurat object was used as input for CellChat following their standard protocol as described in https://github.com/sqjin/CellChat. Circle plots and dot plots were generated via netVisual_aggregate, vertex.size = groupSize and netAnalysis_dot resp. Data from primary GBM cases were processed separately from data from recurrent cases and compared a posteriori.
Data processing for spatial transcriptomics and proteomics
The SP data were normalized by ROI surface area. The ST data were normalized against the 75th percentile of signal (Q3 normalization). Multi-subject single-cell deconvolution was used for deconvolution of the ST data based on cell-type signatures determined from snRNA-seq data used as input61. Heat maps were generated via pheatmap package (https://cran.r-project.org/web/packages/pheatmap/index.html). To validate putative ligand–receptor interactions via ST, the expression of each ligand–receptor pair obtained from CellChat were used to calculate a Pearson’s correlation coefficient across ROIs. These were adjusted for multiple hypothesis testing via fdrtool52. Representative IF images for specific pathways were obtained based on sorting for correlation and then for expression of ligand–receptor pairs in that pathway.
Survival analysis
Survival analysis was conducted in R. Samples were divided into two groups T-cell high and T-cell low based on T-cell percentage at a cutoff of 1%. Kaplan–Meier plots of overall survival and elapsed days to recurrence for recurrent GBM were generated via ggsurvplot function in Survminer package. P values were calculated via a log-rank test.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work was supported by research awards from NIH/NLM R01LM013897 and UC/CRCC CRN-19-586041 to A.D.; the UCSF Glioma Precision Medicine Program to A.D., S.C., J.J.P. and D.S.; NIH/NINDS R01CA246722 to A.D, N.G. and S.R.; UCSF Brain Tumor SPORE Biorepository NIH/NCI P50CA097257 to J.J.P.; and a gift from the Panattoni family to A.D. and J.J.P. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Extended data
Source data
Author contributions
A.D. designed and supervised the study. H.B., J.J. and K.S. performed tissue processing, library preps and in vitro assays with input from S.J., under the supervision of A.D. The longitudinal specimens used in this study represent decades of biobanking by the UCSF Brain Tumor Center Tissue Repository, under the supervision of J.J.P.; they screened archival tissue specimens for suitability of use, tumor purity and tissue quality, additionally they performed IHC and RNAscope assays. D.S. performed genotyping via the UCSF500 assay. S.C, J.J.P. and A.D. identified cases for inclusion in the study. X.F. performed the murine studies under the supervision of N.G., S.R. and A.D. L.W. performed all bioinformatics analyses under the supervision of A.D. D.R. derived the cell lines used. A.D. and L.W. wrote the manuscript with input from all authors. All authors read and approved the final manuscript.
Peer review
Peer review information
Nature Cancer thanks Lucy Stead and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Julia Simundza, in collaboration with the Nature Cancer team.
Data availability
The study data, in the form of raw sequenced reads, are available from the European Genome–phenome Archive repository (https://www.ebi.ac.uk/ega/home), accession EGAS00001004909. Spatial image data, processed expression and peak Supplementary Tables are available from the Gene Expression Omnibus repository (https://www.ncbi.nlm.nih.gov/geo/), accession code GSE174554. Previously published scRNA-seq data that were re-analyzed here are available from https://github.com/mbourgey/scRNA_GBM and from the Gene Expression Omnibus, accession code GSE131928. All other data supporting the findings of this study are available from the corresponding author on reasonable request. Source data are provided with this paper.
Code availability
Custom code used is available from GitHub at https://github.com/linwang6/Longitudinal_glioblastoma_paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Lin Wang, Jangham Jung, Husam Babikir.
Extended data
is available for this paper at 10.1038/s43018-022-00475-x.
Supplementary information
The online version contains supplementary material available at 10.1038/s43018-022-00475-x.
References
- 1.Ceccarelli M, et al. Molecular profiling reveals biologically discrete subsets and pathways of progression in diffuse glioma. Cell. 2016;164:550–563. doi: 10.1016/j.cell.2015.12.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Verhaak RGW, et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell. 2010;17:98–110. doi: 10.1016/j.ccr.2009.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wang Q, et al. Tumor evolution of glioma-intrinsic gene expression subtypes associates with immunological changes in the microenvironment. Cancer Cell. 2017;32:42–56. doi: 10.1016/j.ccell.2017.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Barthel FP, et al. Longitudinal molecular trajectories of diffuse glioma in adults. Nature. 2019;576:112–120. doi: 10.1038/s41586-019-1775-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Mathur, R. et al. EPCO-31. Epigenomic intratumoral heterogeneity of glioblastoma in three-dimensional space. Neuro Oncol.10.1093/neuonc/noaa215.310 (2020).
- 6.Richards LM, et al. Gradient of developmental and injury response transcriptional states defines functional vulnerabilities underpinning glioblastoma heterogeneity. Nat. Cancer. 2021;2:157–173. doi: 10.1038/s43018-020-00154-9. [DOI] [PubMed] [Google Scholar]
- 7.Couturier CP, et al. Single-cell RNA-seq reveals that glioblastoma recapitulates a normal neurodevelopmental hierarchy. Nat. Commun. 2020;11:3406. doi: 10.1038/s41467-020-17186-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wang L, et al. The phenotypes of proliferating glioblastoma cells reside on a single axis of variation. Cancer Discov. 2019;9:1709–1719. doi: 10.1158/2159-8290.CD-19-0329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Neftel, C. et al. An integrative model of cellular states, plasticity, and genetics for glioblastoma. Cell10.1016/j.cell.2019.06.024 (2019). [DOI] [PMC free article] [PubMed]
- 10.Müller S, et al. Single-cell profiling of human gliomas reveals macrophage ontogeny as a basis for regional differences in macrophage activation in the tumor microenvironment. Genome Biol. 2017;18:234. doi: 10.1186/s13059-017-1362-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wang L, Catalan F, Shamardani K, Babikir H, Diaz A. Ensemble learning for classifying single-cell data and projection across reference atlases. Bioinformatics. 2020;36:3585–3587. doi: 10.1093/bioinformatics/btaa137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Müller S, et al. Single-cell sequencing maps gene expression to mutational phylogenies in PDGF and EGF driven gliomas. Mol. Syst. Biol. 2016;12:889. doi: 10.15252/msb.20166969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bhat KPL, et al. Mesenchymal differentiation mediated by NF-κB promotes radiation resistance in glioblastoma. Cancer Cell. 2013;24:331–346. doi: 10.1016/j.ccr.2013.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Halliday J, et al. In vivo radiation response of proneural glioma characterized by protective p53 transcriptional program and proneural-mesenchymal shift. Proc. Natl Acad. Sci. USA. 2014;111:5248–5253. doi: 10.1073/pnas.1321014111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Piao Y, et al. Acquired resistance to anti-VEGF therapy in glioblastoma is associated with a mesenchymal transition. Clin. Cancer Res. 2013;19:4392–4403. doi: 10.1158/1078-0432.CCR-12-1557. [DOI] [PubMed] [Google Scholar]
- 16.Carro MS, et al. The transcriptional network for mesenchymal transformation of brain tumours. Nature. 2010;463:318–325. doi: 10.1038/nature08712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Behnan J, Finocchiaro G, Hanna G. The landscape of the mesenchymal signature in brain tumours. Brain. 2019;142:847–866. doi: 10.1093/brain/awz044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kim, Y. et al. Perspective of mesenchymal transformation in glioblastoma. Acta Neuropathol. Commun.10.1186/s40478-021-01151-4 (2021). [DOI] [PMC free article] [PubMed]
- 19.Kosaka A, Ohkuri T, Okada H. Combination of an agonistic anti-CD40 monoclonal antibody and the COX-2 inhibitor celecoxib induces anti-glioma effects by promotion of type-1 immunity in myeloid cells and T-cells. Cancer Immunol. Immunother. 2014;63:847–857. doi: 10.1007/s00262-014-1561-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Gabrusiewicz K, et al. Glioblastoma-infiltrated innate immune cells resemble M0 macrophage phenotype. JCI Insight. 2016;1:0–19. doi: 10.1172/jci.insight.85841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hodges TR, et al. Mutational burden, immune checkpoint expression, and mismatch repair in glioma: implications for immune checkpoint immunotherapy. Neuro Oncol. 2017;1:1–11. doi: 10.1093/neuonc/nox026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Touat M, et al. Mechanisms and therapeutic implications of hypermutation in gliomas. Nature. 2020;580:517–523. doi: 10.1038/s41586-020-2209-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Caleb Rutledge W, et al. Tumor-infiltrating lymphocytes in glioblastoma are associated with specific genomic alterations and related to transcriptional class. Clin. Cancer Res. 2013;19:4951–4960. doi: 10.1158/1078-0432.CCR-13-0551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gangoso, E. et al. Glioblastomas acquire myeloid-affiliated transcriptional programs via epigenetic immunoediting to elicit immune evasion. Cell10.1016/j.cell.2021.03.023 (2021). [DOI] [PMC free article] [PubMed]
- 25.Schmitt, M. J. et al. Phenotypic mapping of pathological crosstalk between glioblastoma and innate immune cells by synthetic genetic tracing. Cancer Discov.10.1158/2159-8290.cd-20-0219 (2020). [DOI] [PMC free article] [PubMed]
- 26.Alexander J, et al. Multimodal single-cell analysis reveals distinct radioresistant stem-like and progenitor cell populations in murine glioma. Glia. 2020;68:2486–2502. doi: 10.1002/glia.23866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Puchalski, R. B. et al. An anatomic transcriptional atlas of human glioblastoma. Science10.1126/science.aaf2666 (2018). [DOI] [PMC free article] [PubMed]
- 28.Johnson BE, et al. Mutational analysis reveals the origin and therapy-driven evolution of recurrent glioma. Science. 2014;343:189–193. doi: 10.1126/science.1239947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Xue J, et al. Transcriptome-based network analysis reveals a spectrum model of human macrophage activation. Immunity. 2014;40:274–288. doi: 10.1016/j.immuni.2014.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Pine AR, et al. Tumor microenvironment is critical for the maintenance of cellular states found in primary glioblastomas. Cancer Discov. 2020;10:964–979. doi: 10.1158/2159-8290.CD-20-0057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Slyper M, et al. A single-cell and single-nucleus RNA-seq toolbox for fresh and frozen human tumors. Nat. Med. 2020;26:792–802. doi: 10.1038/s41591-020-0844-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ludwig LS, et al. Lineage tracing in humans enabled by mitochondrial mutations and single-cell genomics. Cell. 2019;176:1325–1339. doi: 10.1016/j.cell.2019.01.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wang L, et al. The evolution of alternative splicing in glioblastoma under therapy. Genome Biol. 2021;22:1–15. doi: 10.1186/s13059-021-02259-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Gimple RC, et al. Glioma stem cell-specific superenhancer promotes polyunsaturated fatty-acid synthesis to support EGFR signaling. Cancer Discov. 2019;9:1248–1267. doi: 10.1158/2159-8290.CD-19-0061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Morton AR, et al. Functional enhancers shape extrachromosomal oncogene amplifications. Cell. 2019;179:1330–1341. doi: 10.1016/j.cell.2019.10.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kfoury N, Qi Z, Prager BC, Wilkinson MN, Broestl L. Brd4-bound enhancers drive cell-intrinsic sex differences in glioblastoma. Proc. Natl Acad. Sci. USA. 2021;118:e2017148118. doi: 10.1073/pnas.2017148118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gaiti F, et al. Epigenetic evolution and lineage histories of chronic lymphocytic leukaemia. Nature. 2019;569:576–580. doi: 10.1038/s41586-019-1198-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kaya-Okur HS, et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat. Commun. 2019;10:1–10. doi: 10.1038/s41467-019-09982-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Feng X, et al. Colony-stimulating factor 1 receptor blockade prevents fractionated whole-brain irradiation-induced memory deficits. J. Neuroinflamm. 2016;13:1–13. doi: 10.1186/s12974-016-0671-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.McGinnis, C. S., Murrow, L. M. & Gartner, Z. J. DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 10.1016/j.cels.2019.03.003 (2019). [DOI] [PMC free article] [PubMed]
- 41.Müller, S., Cho, A., Liu, S. J., Lim, D. A. & Diaz, A. CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones. Bioinformatics10.1093/bioinformatics/bty316/4979546 (2018). [DOI] [PMC free article] [PubMed]
- 42.Fang R, et al. Comprehensive analysis of single cell ATAC-seq data with SnapATAC. Nat. Commun. 2021;12:1337. doi: 10.1038/s41467-021-21583-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J.10.14806/ej.17.1.200 (2011).
- 44.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Talevich E, Shain AH, Botton T, Bastian BC. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput. Biol. 2016;12:1–18. doi: 10.1371/journal.pcbi.1004873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Stuart, T. et al. Comprehensive integration of single-cell data. Cell10.1016/j.cell.2019.05.031 (2019). [DOI] [PMC free article] [PubMed]
- 48.Lê, S., Josse, J. & Husson, F. FactoMineR: an R package for multivariate analysis. J. Stat. Softw. 10.18637/jss.v025.i01 (2008).
- 49.Kassambara, A. & Mundt, F. factoextra: extract and visualize the results of multivariate data analyses. R Package Version 1.0.7 (2020).
- 50.Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:1–7. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Bergen, V., Lange, M., Peidli, S., Wolf, F. A. & Theis, F. J. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 10.1038/s41587-020-0591-3 (2020). [DOI] [PubMed]
- 52.Strimmer K. fdrtool: a versatile R package for estimating local and tail area-based false discovery rates. Bioinformatics. 2008;24:1461–1462. doi: 10.1093/bioinformatics/btn209. [DOI] [PubMed] [Google Scholar]
- 53.Allavena, P., Sica, A., Garlanda, C. & Mantovani, A. The Yin-Yang of tumor-associated macrophages in neoplastic progression and immune surveillance. Immunol. Rev.10.1111/j.1600-065X.2008.00607.x (2008). [DOI] [PubMed]
- 54.Gordon S, Taylor PR. Monocyte and macrophage heterogeneity. Nat. Rev. Immunol. 2005;5:953–964. doi: 10.1038/nri1733. [DOI] [PubMed] [Google Scholar]
- 55.Mosser, D. M. & Edwards, J. P. Exploring the full spectrum of macrophage activation. Nat. Rev. Immunol.10.1038/nri2448 (2008). [DOI] [PMC free article] [PubMed]
- 56.Sica, A. & Bronte, V. Altered macrophage differentiation and immune dysfunction in tumor development. J. Clin. Investig.10.1172/JCI31422 (2007). [DOI] [PMC free article] [PubMed]
- 57.Sica, A. & Mantovani, A. Macrophage plasticity and polarization: in vivo veritas. J. Clin. Investig.10.1172/JCI59643 (2012). [DOI] [PMC free article] [PubMed]
- 58.Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. ChromVAR: Inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods. 2017;14:975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 10.1093/nar/gku365 (2014). [DOI] [PMC free article] [PubMed]
- 60.Jin S, et al. Inference and analysis of cell–cell communication using CellChat. Nat. Commun. 2021;12:1–20. doi: 10.1038/s41467-021-21246-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Wang, X., Park, J., Susztak, K., Zhang, N. R. & Li, M. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nat. Commun. 10.1038/s41467-018-08023-x (2019). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The study data, in the form of raw sequenced reads, are available from the European Genome–phenome Archive repository (https://www.ebi.ac.uk/ega/home), accession EGAS00001004909. Spatial image data, processed expression and peak Supplementary Tables are available from the Gene Expression Omnibus repository (https://www.ncbi.nlm.nih.gov/geo/), accession code GSE174554. Previously published scRNA-seq data that were re-analyzed here are available from https://github.com/mbourgey/scRNA_GBM and from the Gene Expression Omnibus, accession code GSE131928. All other data supporting the findings of this study are available from the corresponding author on reasonable request. Source data are provided with this paper.
Custom code used is available from GitHub at https://github.com/linwang6/Longitudinal_glioblastoma_paper.