Summary
Esophageal adenocarcinoma (EAC) is a highly lethal cancer of the upper gastrointestinal tract with rising incidence in western populations. To decipher EAC disease progression and therapeutic response, we perform multiomic analyses of a cohort of primary and metastatic EAC tumors, incorporating single-nuclei transcriptomic and chromatin accessibility sequencing along with spatial profiling. We recover tumor microenvironmental features previously described to associate with therapy response. We subsequently identify five malignant cell programs, including undifferentiated, intermediate, differentiated, epithelial-to-mesenchymal transition, and cycling programs, which are associated with differential epigenetic plasticity and clinical outcomes, and for which we infer candidate transcription factor regulons. Furthermore, we reveal diverse spatial localizations of malignant cells expressing their associated transcriptional programs and predict their significant interactions with microenvironmental cell types. We validate our findings in three external single-cell RNA sequencing (RNA-seq) and three bulk RNA-seq studies. Altogether, our findings advance the understanding of EAC heterogeneity, disease progression, and therapeutic response.
Keywords: esophageal adenocarcinoma, gastrointestinal cancer, computational biology, epigenetics, transcriptomics, spatial transcriptomics, single cell, oncology, bioinformatics
Graphical abstract

Highlights
-
•
Multi-omics reveals five distinct malignant programs in esophageal adenocarcinoma
-
•
Malignant programs exhibit unique chromatin accessibility and epigenetic plasticity
-
•
Spatial mapping reveals distinct tumor-microenvironment interactions and programs
-
•
Malignant programs correlate with clinical stages and therapy responses in patients
Yates and Mathey-Andrews et al. dissect the cellular and spatial heterogeneity of esophageal adenocarcinoma across clinical stages. They identify five malignant cell programs with distinct transcriptomic and epigenetic profiles, linked to disease progression, prognosis, and immune response. These programs interact with the tumor microenvironment, highlighting potential therapeutic targets.
Introduction
Esophageal adenocarcinoma (EAC) is believed to arise from Barrett’s esophagus, an uncommon metaplastic condition.1,2,3,4,5,6,7 EAC is exceptionally lethal, with a 5-year survival rate of less than 5% for patients with non-resectable disease or detectable metastases, representing over half of diagnosed patients.7,8 The recalcitrant and heterogeneous response to treatment underscores the need to understand EAC progression at a cellular level and delineate malignant cell and tumor microenvironment (TME) heterogeneity in therapy-resistant and metastatic settings.4,9
While recent studies explored EAC at single-cell resolution to identify candidate immune and stromal cell types relevant to pathogenesis,9,10 malignant cell states and their heterogeneity in EAC across disease stages—crucial for predicting disease progression, metastasis, and therapeutic response—remain largely undetermined.11,12 Moreover, epigenetic heterogeneity, vital for understanding malignant cell plasticity,12 as well as spatial relationships between distinct cell types and states, remains unexplored in EAC. Given recent advances of single-cell and spatial transcriptomics (ST) technologies,13,14,15 we hypothesized that joint inference of transcriptional, epigenetic, and spatial heterogeneity in EAC across disease stages, metastatic foci, and therapeutic exposures may provide insights into programs dictating lethal disease. Our analysis uncovered malignant cell programs and their spatial localizations and interactions with microenvironmental cell types that inform EAC disease progression and therapeutic resistance.
Results
Characterizing the transcriptional and chromatin accessibility landscape of primary and metastatic EAC
In our discovery cohort, we analyzed 13 biopsies from therapy-naive and therapy-exposed patients with EAC. Multiome sequencing (single-nuclei RNA sequencing [snRNA-seq] and single-nuclei assay for transposase-accessible chromatin [ATAC] sequencing [snATAC-seq]) was performed on samples from 10 patients. Visium ST was conducted on a subset of 5 matched samples from 3 patients (Figures 1A and S1; Table S1; STAR Methods). Additionally, Xenium ST was performed on 6 samples from 3 patients and on 10 samples from 3 additional patients profiled exclusively using Xenium (Figures 1A and S1; Table S1; STAR Methods). OncoPanel18 analysis provided genomic profiling data for seven patients, revealing that all were mismatch repair proficient but exhibited a heterogeneous genetic background. Recurrent genetic alterations included TP53 mutations (5/7 patients), CDKN2A mutations (3/7), and amplifications in KRAS (3/7) and CDH4 (3/7) (Tables S2, S3, S4, and S5).
Figure 1.
EAC primary and metastatic samples show a diverse landscape of TME and malignant cells in transcriptomic and epigenetic data
(A) Schematic representation of the study workflow. Biopsies from 10 patients in our discovery cohort, including normal adjacent tissue (NAT), primary tissue, and metastatic samples, were subjected to single-nuclei RNA and ATAC sequencing using 10X Chromium technology. For a subset of these patients as well as three additional patients, matched primary and metastatic samples were profiled with 10X Visium and 10X Xenium spatial transcriptomics (ST) technologies. For single-nuclei data, cells were annotated by cell type and categorized into malignant and TME components. TME subtypes were linked to metastasis, with validation against an external pan-cancer fibroblast atlas.16 The malignant cell components underwent analysis using consensus non-negative matrix factorization (cNMF) to uncover malignant programs, which were further characterized for transcriptional and epigenetic heterogeneity at a single-cell and spatial level and candidate master transcription factors. External validation was performed in two single-cell validation cohorts,9,10 and associations with clinical and molecular characteristics, as well as survival, were assessed in three bulk validation cohorts.7,10,17
(B) Uniform manifold approximation and projection (UMAP) representation of the full cohort in Harmony-corrected integrated transcriptomic data, with major cell type compartments labeled and cell counts indicated.
(C) Proportion of major cell types in each sample based on transcriptomic data, with percentages for compartments representing over 5% of the total sample composition.
(D) UMAP representation of the full cohort in Harmony-corrected integrated ATAC data, with cell type annotations transferred from the RNA annotations. “NA” denotes cells without paired associated RNA information.
(E) Proportion of major cell types in each sample based on ATAC data, with percentages for compartments representing over 5% of the total sample composition.
After preprocessing, we identified 70,658 high-quality cells with expression information for 21,444 genes within the snRNA-seq data and 33,705 cells with chromatin accessibility information for 311,978 genomic regions within the snATAC-seq data, represented for visualization purposes only in Harmony19-corrected space (STAR Methods; Figures 1B–1E and S1). Seven major cellular compartments were delineated: carcinoma, epithelial/nerve, myeloid, muscle, fibroblast, and lymphoid, for which we uncovered various cell subtypes (Figures 1B and S2). Malignant cells represented an average of 54% of all cells across tumor samples (interquartile range: 48%–63%; Figure 1C).
The EAC TME contains distinct macrophage, lymphoid, and fibroblast populations
Although the response of EAC to immunotherapy can vary, recent studies have demonstrated that specific myeloid cell subtypes within EACs are associated with the effectiveness of immune checkpoint inhibitors (ICIs).9 We found 5 distinct cell subtypes within the myeloid compartment, including two tumor-associated macrophage (TAM) populations (TAM1 and TAM2; Figures 2A and 2B). TAM1 cells, exhibiting pro-inflammatory gene expression patterns,21,22,23 were significantly enriched in tumor samples, whereas TAM2 cells, exhibiting characteristics of anti-inflammatory macrophages,24,25 were differentially enriched in normal adjacent tissue though still detectable in tumor tissue (one-sample t tests p < 0.0001) (Figure 2C).20
Figure 2.
The EAC TME contains several pro- and anti-inflammatory populations of macrophages and RUNX1/RUNX2/PRRX1/BNC2-regulated inflammatory cancer-associated fibroblasts enriched in metastatic samples
(A) Uniform manifold approximation and projection (UMAP) representation of the myeloid compartment in Harmony-corrected integrated transcriptomic data, with annotated subtypes indicated.
(B) Proportion of myeloid subtypes per patient.
(C) Distribution of Milo20 fold change scores between normal-adjacent and tumor samples for myeloid cells; Milo scores measure differential abundances of specific cell subtypes by assigning cells to overlapping neighborhoods in a k-nearest neighbor graph.
(D) Marker genes of annotated myeloid subtypes, with cells grouped by subtype and expression information provided.
(E) UMAP representation of the fibroblast compartment in Harmony-corrected integrated transcriptomic data, with annotated subtypes indicated.
(F) Proportion of fibroblast subtypes per patient.
(G) Distribution of Milo fold change scores between metastatic and primary tumor samples for fibroblast subtypes, with labeling and exclusion criteria similar to (C).
(H) Marker genes of annotated fibroblast subtypes, with cells grouped by subtype and expression information provided.
(I) Distribution of the inflammatory cancer-associated fibroblast (CAF) score in the stromal compartment of Carroll et al.’s9 cohort, stratified by response to immune checkpoint inhibitor (ICI) therapy: clinical benefit (CB) and no clinical benefit (NCB). The inflammatory CAF program is scored on the entire cohort. Paired measurements of patients were made before treatment (PreTx) and after a 4-week ICI treatment window (ICI-4W). The distribution of the inflammatory CAF score is compared among the CB and NCB groups across PreTx and ICI-4W time points. Significance testing is conducted using a Mann-Whitney test to assess differences between the CB and NCB groups.
(J–L) Results for SCENIC+-derived transcription factor (TF) candidates for inflammatory fibroblasts, with cells grouped by subtype and Z scores of TF expression (J), eRegulon gene-based expression (K), and eRegulon region-based expression (L) are shown.
(M) TF gene expression correlation with inflammatory CAF score in the external pan-cancer fibroblast validation cohort of Luo et al.,16 with candidate TFs identified with the SCENIC+ analysis highlighted.
(N) Correlation of all available TFs’ gene expression and SCENIC-estimated gene-based eRegulon score with the inflammatory CAF score in the pan-cancer fibroblast atlas.16 Only PRRX1’s eRegulon activity, but not BNC2 and RUNX1/2, was estimated using SCENIC.
These TAM subpopulations resembled previously described populations in the pan-cancer tumor-infiltrating myeloid cell atlas26 and a study in EAC by Carroll et al.9 (Figure S3). Importantly, the TAM1 cells resembled the TAMs from the latter study, linked to higher monocyte content and selective ICI response. In contrast, TAM2 appeared similar to the M2 macrophages from the same study, linked to lower monocyte content and resistance to ICI.
Lymphoid cells in the TME also play a critical role in therapy response and cancer progression.27,28,29 Within the lymphoid compartment, we identified five distinct subtypes: B cells, CD4+ T cells (TCD4+), CD8+ T cells (TCD8+), regulatory T cells (Tregs), and natural killer cells (Figure S2). B cells, which have been recently linked to ICI response through their involvement in forming tertiary lymphoid structures,30,31 were notably depleted in metastatic samples (one-sample t test p < 0.001, Figure S2). This aligns with previous findings in melanoma32 and colorectal cancer,33 where B cell depletion is linked to enhanced tumor growth and metastasis. Given their connection to tertiary lymphoid structures and their impact on ICI response, strategies aimed at restoring or modulating B cell populations may hold potential for improving immunotherapy efficacy in EAC.
Finally, cancer-associated fibroblasts (CAFs) have also been previously implicated in tumor progression and therapy resistance.34,35 We identified four distinct CAF populations in our cohort (Figures 2E and 2F), including an inflammatory CAF population (iCAF) (expressing e.g., CDH11, RUNX1, and COL1A1) enriched in metastatic EAC tumor samples and non-activated fibroblasts displaying relative abundance in primary EAC tumors (one-sample t test p < 0.0001) (Figures 2G and 2H; STAR Methods).20 These CAF populations were also consistently recovered in external pan-cancer and EAC-specific cohorts, encompassing a total of 246 tumor samples (Figure S4).9,16
We next examined whether the presence of iCAFs correlated with selective ICI response in an external cohort.9 Among the non-clinical benefit (non-CB) patient group (defined as the group of patients showcasing less than 12 months of progression-free survival), there was a significant decrease in iCAF gene signature scores following ICI treatment consistent across patients. In contrast, a minimal increase in the iCAF score (that was inconsistent across patients) was observed pre- and post-ICI in the clinical benefit (CB) group (Figures 2I and S4).
Leveraging our paired snRNA-seq/ATAC-seq data, we used SCENIC+36 to identify candidate master transcription factor (mTF) regulons associated with the iCAF population.36 RUNX1, RUNX2, PRRX1, and BNC2, previously implicated in various oncogenic processes,37,38,39,40,41 were nominated as candidate mTFs of these cells (Figures 2J–2L) and further corroborated within the external pan-cancer CAF atlas16 (Figures 2M and 2N). Notably, attempts to link the iCAF-identified mTFs to ICI response revealed no strong associations, primarily due to high dropout rates. This finding further emphasizes the value of using cell subtype signatures over individual genes (Figure S4).
Overall, distinct macrophage, lymphoid, and CAF cells associated with therapy response populate the microenvironment of both primary and metastatic EAC.
Five malignant cell programs are identified across primary and metastatic EAC tumor samples
In contrast to TME investigations, tumor-intrinsic cellular programs relevant to progression, metastasis, and therapy resistance in EAC remain poorly understood.11,12,42 To uncover unique gene activity programs operant among the EAC tumor compartment, we employed consensus non-negative matrix factorization (cNMF) and identified five cNMF programs consistently present across different patients (cNMF1 to cNMF5) (Figures 3A–3C; STAR Methods). We conducted gene set enrichment analysis (GSEA) to assess enrichment of established biological pathways from MSigDB within the five cNMF malignant cell programs and then compared the identified programs with the pan-cancer tumor cell programs from Gavish et al.42 and the Barrett’s esophagus programs described by Nowicki-Osuch et al.1 (Figures 3D and S5).
Figure 3.
Five recurrent transcriptomic programs characterize EAC malignant cells with distinct RNA profiles
(A) Illustration of the methodology employed for identifying transcriptomic programs. For each patient, consensus non-negative matrix factorization (cNMF) is performed on the malignant cell compartment, followed by manual filtration to retain high-quality programs characterized by gene weightings. Pairwise cosine similarity between programs across all patients is computed to cluster programs using hierarchical clustering with average linkage.
(B) Cosine similarity matrix representing the similarity between cNMF-derived programs across all samples, clustered using hierarchical clustering with average linkage. The five identified programs (cNMF1 through cNMF5) are delineated.
(C) UMAP representation of the malignant cell compartment using unintegrated transcriptomic data, colored according to their program score (cNMF1 through cNMF5) and sample ID.
(D) GSEA enrichment of the five programs in the 50 hallmarks of cancer, based on genes ranked according to their weight contribution to cNMF programs. Hallmarks are grouped according to category. Enrichments that did not reach significance (FDR = 0.05) are blanked out.
(E) GSEA enrichment plots for selected programs described by Nowicki et al. in Barrett’s esophagus.
(F) GSEA enrichment plots for hallmarks G2M checkpoint in cNMF2 and epithelial-to-mesenchymal transition in cNMF3.
(G) Distribution of the five program scores in metastatic and primary samples. Significance is computed using the Mann-Whitney U test. The difference in median score is indicated as Δ.
(H and I) Cosine similarity between programs derived with cNMF in external datasets and cNMF1 through cNMF5 programs, derived in the Carroll et al. dataset (H) and in the Croft et al. dataset (I). The cosine similarity is computed between the cNMF-derived gene weights of programs for all patients in the external datasets and the median gene weight associated with each cNMF program derived in the discovery set.
cNMF1 resembled the intermediate columnar profile in Barrett’s esophagus1 (normalized enrichment score [NES] = 2.5, false discovery rate [FDR] q < 0.0001; Figure 3E) and showed enrichment in MYC targets, oxidative phosphorylation, and MTORC1 signaling pathways, akin to previously described Gavish et al.’s programs “EMT-III” and “Interferon/MHC-II (II).” cNMF2 exhibited properties consistent with a cell cycling program (NES = 2.5, FDR q < 0.0001; Figure 3F), reminiscent of Gavish et al.’s program “Cell cycle G2/M.” cNMF3 resembled a classical epithelial-to-mesenchymal transition (EMT) program (NES = 2.0, FDR q < 0.0001; Figure 3F), enriched in EMT and WNT beta-catenin pathways, aligned with Gavish et al.’s program “EMT-I.” cNMF4 resembled the differentiated Barrett’s esophagus program (NES = 3.8, FDR q < 0.0001; Figure 3E),1 displayed enrichment in tumor necrosis factor, interferon-gamma, and interferon-alpha signaling, and appeared similar to Gavish et al.’s programs “PDAC-classical,” “PDAC-related,” and “Epithelial senescence.” Finally, cNMF5 resembled the undifferentiated Barrett’s esophagus program (NES = 2.6, FDR q < 0.0001; Figure 3E).
Moreover, cNMF4 (differentiated esophagus program) was significantly enriched in malignant cells of primary EAC tumors (difference in median score between primary and metastatic malignant cells Δ = −0.35, p < 0.0001), while cNMF5 (undifferentiated esophagus program) exhibited a slight enrichment in malignant cells of metastatic EAC samples (Δ = 0.07, p < 0.0001) (Figure 3G).
To validate the robustness of the malignant cell cNMF programs uncovered in our study, we similarly performed cNMF on two external single-cell datasets sourced from Croft et al.10 and Carroll et al.,9 across an aggregate of 6,838 malignant cells from 17 patient tumors. In the Carroll et al. dataset, we identified several programs consistent with cNMF1, cNMF2, cNMF4, and cNMF5; moreover, in the Croft et al. dataset, we observed enrichment of cNMF1, cNMF3, and cNMF4 programs, supporting the generalizability of the identified malignant cell programs across datasets (Figures 3F, 3G, and S5).
The five malignant cell programs displayed differential chromatin accessibility patterns and epigenetic plasticity
We next leveraged the paired snRNA-seq/ATAC-seq data to interrogate the connection between observed transcriptional programs and epigenetic diversity, aiming to decipher whether distinct EAC malignant cell programs correspond to specific chromatin accessibility patterns (Figure 4A). We correlated the score of malignant cNMF programs with the normalized ATAC peak counts and identified significant associations between all cNMF programs and differentially accessible chromatin regions, denoted as cNMF-related peaks (Figure 4B). We uncovered distinct chromatin accessibility patterns across cells representing cNMF programs (200 top-scoring cells, STAR Methods), with several genes of interest displaying differential promoter accessibility, including AKT2,43 MKI67,44 SPARC,45,46 BHLHE41,47,48 and ANXA1149 (Figures 4C and 4D). These variations in promoter and enhancer accessibility suggest a potential functional link between epigenetic alterations and evolution trajectories of tumor cells. Of note, cNMF1, cNMF2, and cNMF3 generally displayed less distinct chromatin accessibility profiles than cNMF4 and cNMF5 (Figure 4C).
Figure 4.
EAC malignant cell programs display unique ATAC profiles and epigenetic plasticity
(A) UMAP representation of the malignant cell compartment using unintegrated snATAC-seq data, color-coded according to their cNMF gene signature score (cNMF1 through cNMF5) and sample ID. The program score is transferred from the RNA annotation.
(B) Number of open chromatin regions significantly correlated with each program (FDR < 0.05, Pearson’s R > 0.1).
(C) Heatmap illustrating chromatin accessibility in cNMF-associated regions for representative program cells. Cells are scored using cNMF signatures derived from RNA, with the top 5% unique cells in each score selected as representative cells. The top 200 regions with the higher correlation between chromatin accessibility and each program are represented.
(D) Chromatin accessibility of representative cNMF program cells for genes of interest. Genes are selected based on their association with the regions of the highest correlation between chromatin accessibility and gene signature scores of cNMF programs. Chromatin accessibility of promoters for AKT2, MKI67, SPARC, BHLHE41, and ANXA11 is depicted for representative cells of cNMF1 through cNMF5 and all remaining carcinoma cells.
(E) This heatmap illustrates the accuracy of classifying cells into cNMF programs based on chromatin accessibility profiles. Cells are scored using the average Z score of chromatin accessibility across the top 200 cNMF-associated regions, and the highest score determines their chromatin accessibility identity. Each point represents the percentage of cells with a given transcriptional identity (e.g., cNMF3) classified under a specific chromatin accessibility identity (e.g., cNMF1). Off-diagonal points indicate cells whose transcriptional and chromatin accessibility identities differ, suggesting potential plasticity or transitions between programs.
(F) Distribution of the epigenetic plasticity scores across representative cells of cNMF1 to cNMF5. Average Z scores of ATAC accessibility vectors are transformed into a probability distribution using a softmax transformation with temperature, and the plasticity score is computed as the Shannon entropy over the resulting probability distribution.
(G) Representation of the candidate master transcription factors (mTFs) associated with programs consistent across datasets. We jointly model chromatin accessibility and gene expression to obtain candidate master transcription factors for each cNMF program in the discovery cohort that are subsequently validated in the two external validation cohorts. The identified mTFs consistent across datasets are represented.
Epigenetic plasticity, particularly the modulation of chromatin accessibility in malignant cells, is a recognized hallmark of cancer.50 To determine if the identified cNMF programs exhibited chromatin states that facilitate transcriptional program diversity (epigenetic plasticity, as defined by Burdziak et al.51), we compared the paired transcriptional gene expression and chromatin accessibility profiles among malignant cells. Additionally, we analyzed the distribution of epigenetic plasticity scores within the malignant cells representing each cNMF program (Figures 4E and 4F; STAR Methods).
We assigned cells two identities, a transcriptional identity based on the highest score of signature genes and a chromatin accessibility identity based on the expression of cNMF-related peaks. Cells with strong cNMF4 and cNMF5 signature scores (within the top 5% of score distribution; STAR Methods), representing differentiated and undifferentiated programs, respectively, exhibited mostly concordant transcriptional gene expression and chromatin accessibility identities, as well as low epigenetic plasticity, consistent with the hypothesized stable identity of these programs (Figures 4E and 4F). Conversely, cells from the cell cycling program, cNMF2, displayed discordant expression of chromatin accessibility patterns characteristic of different programs along with high epigenetic plasticity.52,53 cNMF1 also displayed high epigenetic plasticity, and certain malignant cells expressing the program had chromatin accessibility profiles that also associated with cNMF4 and cNMF5, consistent with the proposed intermediate nature of cNMF1 between the continuum represented by cNMF5 and cNMF4 programs (Figures 4E and 4F).
Furthermore, cells within the EMT-like cNMF3 program displayed mixed chromatin accessibility identity and high epigenetic plasticity, consistent with previous observations of EMT state plasticity and its reversible nature.54,55 Based on the snATAC-seq scores, i.e., the average Z score of normalized counts over cNMF-related peaks, we speculate that cNMF3 cells predominantly originate from the cNMF1 and cNMF5 pools rather than the cNMF4 pool, potentially suggesting that terminally differentiated EAC cells do not undergo EMT (Figure 4E).
Predicted transcription factor regulons of the malignant cell programs
To ascertain whether the identity of malignant cell cNMF programs was governed by a specific set of mTFs, we next inferred the gene regulatory network underlying cell programs in our dataset leveraging the paired multiome data with SCENIC+36 and also evaluated these findings in the external Croft el al. and Carroll et al. datasets for reproducibility (STAR Methods; Figures 4G and S6).
Candidate mTFs included E2F756 for cNMF2; ZEB1,57,58 TCF7L1,59 and MAFB60,61 for cNMF3; FOXO1 and FOXO3,62 MXD1,63,64 LCOR,65,66 CREB3L1,67,68,69 MAFK,70 PPARD,71,72 and HNF4A1,73,74,75 (tumor suppressor transcription factors [TFs] and/or associated with favorable prognosis) for cNMF4; and MECOM76,77 and HMGA278,79 for cNMF5. Notably, no mTF was robustly identified across datasets for cNMF1. We therefore identified a set of candidate mTFs reproducibly associated with each malignant cell program except cNMF1 in three independent datasets (summarized in Figure 4G). Lastly, the expression of genes coding for candidate mTFs identified for cNMF4 and cNMF5 was analyzed along the axis of expression of these two hypothesized opposing programs by ranking cells according to their relative cNMF4 to cNMF5 expression. The mTFs showed a consistent positive and negative gradient of expression along the cNMF5 to cNMF4 axis (Figure S6), supporting their role in orchestrating these program expressions.
Malignant and TME cell programs in EAC display differential spatial enrichment in defined tumor regions
To assess whether the malignant cell programs identified in the snRNA-seq data exhibit spatial heterogeneity within individual EAC tumor samples, we performed Visium and Xenium ST on additional EAC tissue from matched patients (STAR Methods). For Visium samples, we categorized ST spots into pure tumor regions, mixed regions containing both malignant cells and TME cells in similar proportions, or regions of normal tissue, using CNV (copy-number variation) profiles inferred from the spatial data (STAR Methods, Figures 5A and S7). Deconvoluted ST spot cell type proportions and gene expression80 broadly agreed with CNV assignments (Figures S7 and S8).We scored the five malignant cell cNMF programs based on the corrected, deconvoluted carcinoma-specific gene expression matrix and found that they displayed distinct spatial distributions within the EAC tumor samples (Figures 5A and S7).
Figure 5.
Single-nuclei-derived transcriptional programs highlight different spatial regions of EAC tumors
(A) Spatial transcriptomics (ST) slides of P8 primary tumor A, colored according to cNMF program score and the CNV-derived label. For each spot, we infer the CNV profile with inferCNV and assign spots to tumor, mixed, and normal status. cNMF scores are computed as the average Z score of signature genes using the deconvolved carcinoma-specific gene expression profile of spots derived with Cell2Location.
(B) Average cNMF score according to the position of the spots compared to the tumor-leading edge. For each tumor spot, we compute the distance to the edge as the shortest path to a normal or mixed spot. The distribution of cNMF scores with standard error is represented for normal spots, mixed spots, and spots of a certain distance to the edge.
(C) cNMF scores for carcinoma cells and cell type annotations in a subset of Xenium-profiled samples (P4_A, P11_B, and P12_D). cNMF scores are computed as the average Z score across all carcinoma cells of signature genes included in the Xenium panel.
(D) CellCharter cluster assignments for a subset of Xenium-profiled samples (P4_A, P11_B, and P12_D).
(E) CellCharter cluster cell type proportion: each cell is assigned a cluster and we represent the proportion of the different cell types in each of the CellCharter clusters.
(F) Distribution of cNMF scores of carcinoma cells belonging to the CellCharter clusters with a substantial amount of carcinoma cells (>5%), CC1, CC2, CC3, and CC4. For all comparisons, Mann-Whitney U p < 0.000005.
In most samples, cNMF1 was primarily expressed in the tumor core, characterized by greater distances from the periphery, while cNMF4 was more localized at the tumor edge (Figure 5B). The spatial distribution of cNMF5 and cNMF2 varied, whereas cNMF3, which was less frequently detected in snRNA-seq, was expressed in only three samples (P8_A, P8_B, and P5) and exhibited a scattered enrichment pattern across the tumor (Figures 5A and S7). To further validate the spatial organization of these programs, we analyzed Xenium-profiled data (which allows for true single-cell resolution) from 16 samples spanning six patients, including three profiled exclusively with Xenium. Using canonical marker analysis, we annotated cell types and scored cNMF programs in carcinoma cells (STAR Methods). The cNMF programs showed distinct spatial localization within the tumor, consistent with the Visium data (Figures 5C, S9, and S10). Notably, cNMF4 and cNMF5 were strongly expressed in specific tumor regions and were often co-expressed in regions with high cNMF1 expression, reinforcing its intermediate identity. cNMF2-expressing cells were frequently clustered in small regions of cycling cells, whereas cNMF3-expressing cells were more diffusely distributed across the tumor, except in localized areas, consistent with the Visium data observations.
To systematically identify shared spatial niches within tumors, we applied the CellCharter (CC) method,81 which uncovered eight distinct CC clusters (CC1–CC8) across all 16 samples, each enriched in specific cell types and exhibiting distinct spatial patterns (Figures 5D, 5E, and S10). Clusters CC1, CC2, CC3, and CC4 represented carcinoma cells with varying degrees of non-malignant cell infiltration (carcinoma content 8%–87%; Figure 5E). CC1 was composed mostly of carcinoma cells, CC2 showed infiltration by myeloid, stromal, and T cells, CC3 contained both carcinoma and normal epithelial cells with some immune infiltration, and CC4 was highly infiltrated with stromal and myeloid cells. The remaining clusters displayed <5% of carcinoma cells. CC5, CC6, and CC7 represented stromal and immune niches, with CC5 enriched in myeloid cells, CC6 dominated by stromal cells, and CC7 enriched in stromal and B cells. Finally, CC8 represented a normal epithelial cell niche (Figure 5E).
We next examined whether carcinoma cells in different spatial niches expressed distinct cNMF programs (Figure 5F; for all comparisons, Mann-Whitney U p < 0.000005). In clusters highly infiltrated with immune cells (CC2 and CC4), carcinoma cells exhibited strong expression of the cNMF3 programs, while cells in the pure carcinoma cluster (CC1) showed similar levels of all programs except cNMF3. The CC3 cluster displayed high expression of both cNMF1 and cNMF4, consistent with the proposed differentiated identity of cNMF4, which may reflect early EAC cells clustering with normal epithelial cells. Interestingly, CC3 was frequently located at the tumor margin (Figures 5F and S10), aligning with the Visium findings that cNMF4 expression is enriched at the tumor edge.
These results demonstrate that malignant cell programs exhibit distinct and reproducible spatial patterns within EAC tumors. Similarly, stromal, lymphoid, and myeloid programs identified earlier also showed distinct spatial localization within EAC tumors, as shown by deconvolution estimates in Visium-profiled samples and the distribution of non-malignant CC clusters in Xenium-profiled samples (Figures 5D, S8, and S10). Among stromal programs, RUNX1 expression closely aligned with the iCAF score in the Visium data, whereas other candidate mTFs showed weaker correlations (Figure S8). However, in the sparse Xenium data, none of these candidate mTFs showed substantial expression. These findings establish RUNX1 expression as a marker for iCAFs and suggest its role in regulating this population. Together, these results confirm the existence of distinct tumor microenvironment programs and their spatial compartmentalization within EAC tumors.
Malignant cell programs in EAC correlate with clinical characteristics, patient prognosis, and predicted drug sensitivity
We then sought to determine whether any of the identified malignant cell cNMF programs were associated with distinct clinical prognostic stages and therapeutically relevant states. By projecting these programs into the primary EAC The Cancer Genome Atlas (TCGA) cohort, we observed that cNMF4 was significantly linked with lower T and N stages, whereas cNMF3 exhibited a moderate association with higher T stages, consistent with its EMT-like nature82 (Figures 6A and 6B). Other programs did not display significant associations with these clinical stages, and no malignant cell program showed significant associations with M staging (Figure S11).
Figure 6.
Discovered malignant programs have different clinical characteristics and predicted drug sensitivity
(A and B) Link between uncovered programs and (A) N stage, i.e., proxy of the number of nearby lymph nodes that have cancer, and (B) T stage, i.e., size and extent of the main tumor in the TCGA bulk cohort.7 Patients are scored using single-sample gene set enrichment analysis (ssGSEA) with a cancer-specific gene signature. Statistical testing is performed using the Mann-Whitney U test.
(C–E) Hazard ratio associated with scores in bulk validation cohorts of (C) TCGA, (D) Hoefnagel et al., and 17 (E) Carroll et al.9 Cox proportional hazard univariate models are employed using disease-specific survival for TCGA and overall survival for Hoefnagel et al. and Carroll et al. The p values are estimated using the Wald test.
(F) Distribution of the cNMF5 score in the malignant cell compartment of Carroll et al.’s cohort,9 stratified by response to immune checkpoint inhibitor (ICI) therapy: clinical benefit (CB) and no clinical benefit (NCB). The cNMF5 program is scored on the full cohort. Paired measurements of patients were made before treatment (PreTx) and after a 4-week ICI treatment window (ICI-4W). The distribution of the cNMF5 score is compared among the CB and NCB groups across PreTx and ICI-4W time points. Significance testing is conducted using a Mann-Whitney U test.
(G) Predicted drug sensitivity by program. scTherapy is used to infer which drugs may be effective against EAC cells exhibiting specific tumor programs. The upset plot represents the total number of drugs predicted to target cells expressing a specific program on the left, as well as the size of the intersection represented in the middle on the top. Drugs of interest are indicated on the plot.
(H) Selected predicted candidate combination therapies that could target cells in all five programs at a time are represented.
We also investigated whether the identified malignant programs were linked to specific mutations in TCGA dataset. Among these, cNMF1 scores were significantly elevated in patients with non-silent somatic mutations in CDKN2A (Figure S11). However, no significant associations were observed for the other cNMF scores, suggesting that highly recurrent somatic mutations observed in EAC are unlikely to be the primary drivers of cNMF program abundance.
We then investigated the relationship between the malignant cell programs and patient survival using a univariate Cox proportional hazard model in two external bulk EAC patient cohorts treated with conventional therapies, namely surgery and neoadjuvant chemotherapy (TCGA7 and Hoefnagel et al.17), and one external EAC patient cohort treated with ICI (Carroll et al.9) (STAR Methods). Higher cNMF4 scores were predictive of improved patient survival in the first two patient cohorts exposed to conventional therapies (p = 0.01 and p = 0.08, respectively), but not in the third patient cohort exposed to ICI (p = 0.49) (Figures 6C–6E). The association of cNMF4 with less aggressive clinical features in this context is consistent with other program-specific features previously shown (i.e., enrichment in primary tumors, differentiated transcriptional profile, and link to TFs associated with improved patient prognosis).
Finally, we investigated whether the cNMF programs displayed differential enrichment in therapy exposure categories of the external EAC patient cohort treated with ICIs. We assessed the distribution shift of the cNMF5 gene signature score in Carroll et al.’s single-cell data and observed that the score was high in patients experiencing CB to ICI both pre- and post-ICI exposure compared to non-CB patients (Mann-Whitney U p < 1e−5, Figure 6F). In addition, the cNMF5 program gene signature score was significantly lower post ICI exposure only in patients experiencing a CB (Figure 6F). These patterns were consistent on a per-individual patient sample basis (Figure S11).
To assess whether the identified programs are associated with differential sensitivity to existing therapies, we used scTherapy83 to predict drug responses in EAC cells expressing each program. Our analysis revealed a broad spectrum of predicted sensitivities, with cNMF2-expressing cells responsive to the largest number of compounds and cNMF3-expressing cells exhibiting the lowest overall drug sensitivity (Figure 6G). Notably, no single drug was predicted to target EAC cells in all five programs simultaneously; however, delanzomib, a next-generation proteasome inhibitor84; romidepsin, an HDAC inhibitor85; and SN-38, the active metabolite of irinotecan and a Food and Drug Administration (FDA)-approved chemotherapy previously shown to have potential in EAC86; were the only agents predicted to affect four out of five programs (Figure 6G; Table S6).
Leveraging these predictions, we identified potential combination therapies by pairing drugs that together putatively target all five cNMF programs (Figure 6H). Among these combinations, some included FDA-approved chemotherapy agents for EAC, such as paclitaxel and docetaxel. Specifically, the combination of paclitaxel with delanzomib84 or with irinotecan86 was predicted to effectively target all five programs. Additionally, the combination of docetaxel and onvansertib, a Polo-like kinase 1-selective therapeutic recently granted FDA fast-track designation for metastatic colorectal cancer,87 also emerged as a potential therapeutic strategy.
Thus, in aggregate, the cNMF programs identified in our study exhibited distinct associations with patient survival, therapy exposure status in external EAC cohorts, and predicted drug sensitivity. Notably, no single agent was predicted to target all five programs simultaneously, highlighting potential resistance mechanisms to standard chemotherapy. Furthermore, these findings pointed to rational combination strategies that could be explored in preclinical models to overcome therapy resistance, offering a potential path toward more effective treatment approaches.
Co-occurring groups of TME cells are linked with malignant cell programs
Lastly, given our findings of the key roles of tumor cell programs in EAC, we sought to understand whether and how the malignant cells interacted with specific TME cells (including the key myeloid and CAF populations described earlier). We first conducted an analysis of ecotypes, i.e., co-occurring abundance of tumor immune and stromal microenvironment cells, as measured in deconvolved data, in EAC.88 Leveraging the external TCGA, Hoefnagel et al., and Carroll et al. EAC patient cohorts (n = 268 patients), we identified two major ecotypes: “immune-desert” (predominantly comprising malignant cells and endothelial cells) and “immune-activated” (featuring a mixture of myeloid, lymphoid, and stromal cells; Figure S12; STAR Methods). A high cNMF3 gene signature score in deconvoluted samples was significantly associated with the immune-activated ecotype across all studies, consistent with findings on Xenium experiments and supporting the role of malignant-stromal and malignant-myeloid interactions in driving EMT89,90,91 (Figure S12). cNMF5 exhibited a significantly lower gene signature score in the immune-activated ecotype in two out of the three external cohorts (Figure S12).
Next, we predicted signaling interactions between malignant cells and TME cells using ST data (STAR Methods).92 Malignant cells exhibited predicted interactions with all TME compartments, with the strongest signals (i.e., highest impact on gene expression as predicted by node-centric expression models; STAR Methods) involving myeloid and lymphoid cells (STAR Methods; Figure S12).9 To explore spatially activated signaling pathways, we performed computational cell-cell interaction analyses, assessing ligand-receptor interactions from cell-cell contact within 200 μm and secreted ligand-receptor interactions within 400 μm (STAR Methods; Figures 7A–7E).93 This analysis revealed numerous autocrine and paracrine signaling pathways associated with malignant and TME program expression. For instance, CEACAM signaling was predominantly produced and received by cNMF4-expressing cells, suggesting a self-reinforcing loop, while platelet-derived growth factor signaling was secreted primarily by CAFs and strongly received by both CAFs and cNMF3-high malignant cells (Figures 7A–7D).
Figure 7.
Uncovered malignant programs show associations with clinical and molecular characteristics, prognosis, and distinct ecotypes
(A–D) Pathways significantly correlated with TME and malignant programs. Using COMMOT, sender and receiver activity scores were assigned to each spot for specific pathways and correlated with program scores. The heatmap displays the median Pearson’s r across samples for pathways significantly correlated with a program in at least 4/5 samples; (A) and (B) depict significant sender and receiver signals for cell-cell contact pathways, while (C) and (D) show significant signals for secreted pathways.
(E) Directionality of THY1, MK, and MIF signaling in sample P8_A. Spots are colored based on sender activity for each pathway, with arrows indicating the estimated direction of signaling. Yellow outlines mark tumor/mixed tissue boundaries. Bottom graphs display program scores significantly correlated with THY1, MK, and MIF sender signals.
(F) Receiver activity of the THY1, MK, and MIF pathways in two Xenium samples (P4_A and P13_C). The top illustrates the average expression of receptor genes included in the Xenium panel. The left inset shows the distribution of average receptor gene expression across relevant cell populations, while the right inset highlights cell types predicted to participate in these pathways based on COMMOT analysis.
(G) Schematic representation of inferred interactions between malignant cells expressing cNMF programs and iCAFs, TAM1 macrophages, and DCs. Recurrent interactions between cells are depicted with colored arrows.
Among contact-dependent signaling pathways, THY1 signaling was particularly enriched in iCAFs, with strong receiver activity in iCAFs, TAM1 macrophages, and dendritic cells (DCs) (Figures 7A and 7B). THY1 signaling facilitates leukocyte transmigration via integrin binding and supports immune cell recruitment to inflammatory sites.94,95,96 Thy-1+ CAFs have also been linked to increased collagen deposition, contributing to extracellular matrix stiffness and remodeling, which may alter TME architecture.97 We also identified significant midkine (MK) signaling activity, with strong sender activity in cNMF1- and cNMF4-expressing regions and receiver activity in these populations and myeloid-rich areas (Figures 7C and 7D). MK signaling was spatially enriched within tumor regions, though no consistent signaling directionality was observed (Figure 7E). Given MK’s known role in tumorigenesis, its expression by malignant cells may drive immune-tolerant macrophage activation and promote an inflamed but ICI-resistant TME.98,99,100,101
Macrophage migration inhibitory factor (MIF) signaling was also highly active, with cNMF1-expressing cells acting as major senders and cNMF3-high cells, iCAFs, TAM1 macrophages, and DCs showing strong receiver activity (Figures 7C and 7D). MIF expression was concentrated within tumor regions (Figure 7E), aligning with its established role in promoting gastric and other carcinomas,102,103,104 as well as driving tumorigenesis through chronic inflammation and immune evasion.105,106
We validated these interactions using the Xenium-profiled data, analyzing genes from the Xenium panel (Figure S13). Cells predicted to interact through the THY1, MK, and MIF pathways frequently showed close spatial proximity (Figures 7F and S10). Moreover, these cells consistently displayed strong receiver activity. Across samples, iCAFs, TAM1 macrophages, and DCs exhibited robust expression of THY1 receptor genes; carcinoma cells, iCAFs, and TAM1 macrophages expressed MK receptor genes; and carcinoma cells, TAM1 macrophages, and DCs expressed MIF receptor genes (Figures 7F and S13). Of the ligands included in the Xenium panel, only THY1 ligand expression could be assessed, and it was consistently overexpressed in iCAFs (Figure S13).
Together, these findings highlight complex and spatially organized signaling networks between malignant cells and the TME, identifying potentially targetable pathways that could influence EAC progression and therapeutic response (Figure 7G).
Discussion
Despite progress in dissecting EAC and Barrett’s esophagus biology, as well as relating biological programs to selective therapeutic response across treatment modalities, the complexities of its malignant cell compartment, epigenetic variations, and disease progression remain poorly understood. Leveraging a multi-modal profiling strategy across primary and metastatic EAC samples, our study unveiled considerable heterogeneity within and between tumors. In addition to identifying previously described myeloid and stromal compartments, this study defines EAC malignant cell heterogeneity across primary and metastatic sites in distinct clinical settings across transcriptomic, chromatin accessibility, and spatial dimensions. We identified five major malignant cell programs, shared across patients in our study and external EAC patient cohorts, that possessed distinct chromatin accessibility profiles and spatial distributions. Among the programs identified, cNMF5, cNMF1, and cNMF4 delineated a continuum from undifferentiated to differentiated programs, mirroring a trajectory observed in Barrett’s esophagus,1 cNMF2 represented a cell cycling program, and cNMF3 emerged as a rarer EMT-associated program. Furthermore, we identified candidate TFs for various programs and a concordance between transcriptional programs and estimated epigenetic plasticity, contributing to the growing evidence emphasizing the significance of epigenetic plasticity as a facilitator of cancer progression and metastasis through increased heterogeneity.107,108,109,110,111 We highlighted the differential spatial distribution of malignant cell program gene signature scores and the complexity of the tumor ecosystem, further supporting prior studies reporting that malignant cells express distinct cellular pathways at the tumor edge and core.112 Finally, we identified recurrent interactions between cells with high expression of cNMF programs and TME cell types, which could in turn influence therapy response, notably to ICI or targeted therapies that have shown strong dependency to TME cells and malignant cell heterogeneity.113,114,115,116
In particular, the identified association between iCAF presence and ICI response suggested potential mechanisms through which iCAFs may mediate immune evasion. iCAFs secrete cytokines (e.g., interleukin-6) and chemokines (e.g., CXCL9 and CXCL10) that recruit various immune cells to the TME, including immunosuppressive populations such as Tregs and myeloid-derived suppressor cells, as well as cytotoxic CD8+ T cells.117,118,119 During ICI therapy, the resulting immune-activated environment may reduce immunosuppressive populations, indirectly destabilizing iCAFs by disrupting their interactions with other TME components. Additionally, ICI treatment could promote the repolarization of iCAFs into other fibroblast subsets.118 In patients where iCAF depletion occurs, the loss of these cells may hinder the influx of peripheral CD8+ T cells and weaken support for pre-existing CD8+ T cells, contributing to poor ICI outcomes in non-CB patients. Given the enrichment of iCAFs in metastatic tissue and their association with ICI response, future studies could investigate their potential role in this process. Furthermore, recent work has highlighted the heterogeneity of iCAFs, revealing multiple potential origins120 and diverse spatial distributions.121 These findings underscore the complex role of iCAFs in shaping immune responses and modulating ICI efficacy.
Broadly, our study underscores the clinical importance of tumor cell heterogeneity in primary and metastatic EAC, elucidating the association of distinct tumor cell states with clinical characteristics, ICI response, and potential TME interplay, marking a key step toward understanding EAC formation and progression.
Limitations of the study
There are several limitations in our study. Firstly, the discovery cohort comprised 10 samples, with 8 being tumor samples, hindering direct linkage between proportions of TME cells and clinical characteristics with malignant cell composition. Consequently, we mostly depended on bulk validation cohorts to further elucidate these associations. Second, sampling and processing biases may affect differential abundance testing and limit the interpretability of the results. Notably, metastatic tumors were obtained from diverse anatomical sites, meaning that observed TME composition differences may reflect site-specific characteristics rather than metastatic status alone. Third, one of the single-cell validation cohorts had very few malignant cells (∼400 cells), suggesting that larger, clinically integrated single-cell EAC cohorts with sufficient malignant cells are needed to further validate our cNMF results. Fourth, the relatively small size of the ST samples necessitates caution in interpreting quantitative conclusions. Fifth, while we obtained somatic panel sequencing data for seven of the eight tumors profiled with snRNA-seq and snATAC-seq, we were unable to generate matched whole-genome or exome sequencing data. As a result, we cannot rule out the potential influence of genetic heterogeneity on cell type composition and malignant program abundance. Sixth, while we used Xenium ST to validate findings from Visium ST, we did not perform immunohistochemistry or immunofluorescence profiling, which would have provided complementary protein-level validation of the transcriptomic data. Seventh, while we proposed candidate combination therapies based on our analyses, we did not experimentally test their efficacy. Lastly, the heterogeneous nature of the cohort, including variations in metastatic status, treatment regimen, and anatomical location, further emphasizes the need for larger and clinically representative molecular cohorts to further investigate the findings from this study.
Resource availability
Lead contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Eliezer M. Van Allen (EliezerM_VanAllen@dfci.harvard.edu).
Materials availability
This study did not generate new unique reagents.
Data and code availability
-
•
Raw sequencing data for single-nuclei RNA- and ATAC-seq generated in this study are deposited in the database of Genotypes and Phenotypes (https://www.ncbi.nlm.nih.gov/gap/) with accession number phs003438.v1. Processed data are available at the Broad Single Cell Portal (https://singlecell.broadinstitute.org/single_cell) (SCP: SCP3028 and SCP3029). Raw and processed data for Visium and Xenium ST profiled samples are available on Zenodo (https://zenodo.org/): with DOI https://doi.org/10.5281/zenodo.15341263. Existing single-cell and bulk sequencing data can be downloaded from EGA (EGAS00001006468, EGAS00001006469) for Carroll et al.’s paper. Remaining existing single-cell data can be downloaded from the Gene Expression Omnibus (GEO) website, accession numbers GSE222078 (Croft et al.) and GSE210347 (Luo et al.). Remaining existing bulk data can be downloaded from the GEO website, accession number GSE207527, for Hoefnagel et al.’s data, and from the UCSC Xena browser for the Cancer Genome Atlas ESCA cohort (https://xenabrowser.net/datapages/?cohort=TCGA%20Esophageal%20Cancer%20(ESCA)&removeHub=https%3A%2F%2Fxena.treehouse.gi.ucsc.edu%3A443). Details on how to download the data used for the analysis are detailed on GitHub: https://github.com/vanallenlab/EAC-multiome. Source data are provided with this paper.
-
•
All the code needed to reproduce this analysis is available on Zenodo (DOI: https://doi.org/10.5281/zenodo.15269379) and GitHub at the following address: https://github.com/vanallenlab/EAC-multiome.
-
•
Any additional information required to reanalyze the data reported in this work paper is available from the lead contact upon request.
Acknowledgments
The authors would like to thank the Single Cell Core at Harvard Medical School, Boston, MA, for performing the multiome snRNA-seq/ATAC-seq sample preparation. They would also like to thank the patients and their families, as well as hospital personnel. Funding sources that supported this project include the Ambrose Monell Foundation (E.M.V.A.), NIH R50CA265182 (J.P.), U2CCA233195 (E.M.V.A.), R01CA227388 (E.M.V.A.), R01CA279221 (E.M.V.A.). T32CA092203 (C.M.-A.), and Swiss National Science Foundation grant number 205321_207931 (J.Y.).
Author contributions
Conceptualization, J.Y., C.M.-A., V.B., and E.M.V.A.; methodology, J.Y., C.M.-A., V.B., and E.M.V.A.; formal analysis, J.Y., V.B., and E.M.V.A.; investigation, J.Y., C.M.-A., S.H., K.B., B.T., S.C., A.K., S.B., S.K.C., Z.L., A.K.R., N.S.S., V.B., and E.M.V.A.; resources, C.M.-A., J.P., A. Gagné, A. Garza, C.H., J.R., E.S., G.B., A.J.A., and M.C.; writing – original draft, J.Y.; writing – review and editing, all authors contributed; visualization, J.Y.; funding acquisition, V.B. and E.M.V.A.; supervision, V.B. and E.M.V.A.
Declaration of interests
E.M.V.A.—advisory/consulting: Enara Bio, Manifold Bio, Monte Rosa, Novartis Institute for Biomedical Research, Serinus Bio, and TracerDx; research support: Novartis, BMS, Sanofi, and NextPoint; equity: Tango Therapeutics, Genome Medical, Genomic Life, Enara Bio, Manifold Bio, Microsoft, Monte Rosa, Riva Therapeutics, Serinus Bio, Syapse, and TracerDx; travel reimbursement: none; patents: institutional patents filed on chromatin mutations and immunotherapy response, and methods for clinical interpretation; intermittent legal consulting on patents for Foaley & Hoag; editorial boards: Science Advances.
A.J.A. has consulted for Anji Pharmaceuticals, Affini-T Therapeutics, Arrakis Therapeutics, AstraZeneca, Boehringer Ingelheim, Kestrel Therapeutics, Merck & Co., Inc., Mirati Therapeutics, Nimbus Therapeutics, Oncorus, Inc., Plexium, Quanta Therapeutics, Revolution Medicines, Reactive Biosciences, Riva Therapeutics, Servier Pharmaceuticals, Syros Pharmaceuticals, T-knife Therapeutics, Third Rock Ventures, and Ventus Therapeutics. A.J.A. holds equity in Riva Therapeutics and Kestrel Therapeutics. A.J.A. has research funding from Amgen, AstraZeneca, Boehringer Ingelheim, Bristol Myers Squibb, Deerfield, Inc., Eli Lilly, Mirati Therapeutics, Nimbus Therapeutics, Novartis, Novo Ventures, Revolution Medicines, and Syros Pharmaceuticals.
Declaration of generative AI and AI-assisted technologies in the writing process
During the preparation of this work, the authors used ChatGPT in order to improve readability and language. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.
STAR★Methods
Key resources table
Experimental model and study participant details
Patients samples
The 13 patient samples (eleven tumor tissue and two non-paired normal adjacent tissue) were collected with written informed consent and ethics approval by the Dana-Farber Cancer Institute Institutional Review Board under protocol numbers 14–408, 03–189, or 17-000. The nomenclature designates: normal adjacent tissue samples as P1 and P2; primary tissue samples as P3, P4, and P5; metastatic samples as P6, P7, P8, P9, and P10; and samples exclusively profiled with Xenium P11, P12, and P13. There were two female and 11 male patients, ages 36 to 78 (average 61 years). Four patients were treatment-naïve while nine had received prior treatment. Further patient details are provided in Table S1.
Method details
Patient tissue sample collection and dissociation for multiome snRNA-seq/ATAC-seq
Nuclei isolation was performed on frozen biopsy specimens as previously described.122 Low-retention microcentrifuge tubes (Fisher Scientific, Hampton, NH, USA) were used throughout the procedure to minimize nuclei loss. Briefly, patient tissue was separated from optimal cutting temperature (OCT) by removing the OCT with sharp tweezers and scalpels. Tissues were then manually dissociated into a single-nuclei suspension by chopping the tissue with fine spring scissors for 10 min, homogenizing in TST solution, filtering through a 30 μm MACS SmartStrainer (Miltenyi Biotec, Germany), and centrifuging for 10 min at 500 g at 4C. The resulting nuclei pellet was resuspended in a lysis buffer to permeabilize the nuclei before centrifuging again for 10 min at 500 g at 4C. The final nuclei pellet was resuspended in 100 μl of 10x Genomics Diluted Nuclei Buffer and trypan blue-stained nuclei were counted by eye using INCYTO C-Chip Neubauer Improved Disposable Hemacytometers (VWR International Ltd., Radnor, PA, USA).
Approximately 16,000–25,000 nuclei per sample were loaded per channel of the Chromium Next GEM Chip J for processing on the 10x Chromium Controller (10x Genomics, Pleasanton, CA, USA) followed by transposition or cDNA generation and library construction according to manufacturer’s instructions (Chromium Next GEM Single Cell Multime ATAC + Gene Expression User Guide, Rev F). Libraries were normalized and pooled for sequencing on two NovaSeq SP-100 flow cells (Illumina, Inc., San Diego, CA, USA).
snRNA and snATAC multiome processing
snRNA-seq and snATAC paired data preprocessing
The paired snRNA-seq and snATAC-seq samples were sequenced using Illumina HiSeq X. Subsequently, the raw bcl files were aligned to the human reference genome GRCh38 for each sample via Cell Ranger Arc 2.0.
snRNA-seq specific processing and cell type annotation
To mitigate potential ambient RNA contamination within the RNA assay of the multiome data, we used Cellbender123 to computationally remove ambient RNA counts from each count matrix.
After, Scrublet124 was employed to identify cell barcodes that may be potential doublets from the ambient RNA-adjusted RNA count matrices, and these barcodes were subsequently removed. The resulting doublet-free ambient RNA-adjusted count matrices were then employed for further downstream analyses.
RNA assay quality control procedures were conducted for each individual patient sample using Scanpy.125 Cell barcodes with fewer than 200 unique genes expressed, genes expressed in fewer than three cells, and cell barcodes exhibiting greater than 20% of all RNA expression counts mapped to mitochondrial genes (pctMT) were filtered out. RNA expression per cell was normalized via counts per 10k (CP10k), i.e., dividing the counts by the library size of the cell and normalizing to 10,000 total counts per cell, followed by log(x+1) transformation. After performing Leiden clustering (resolution = 0.7) on the 15-nearest neighbor graph of the RNA assay per individual patient sample, component cell types were manually annotated by evaluating canonical marker gene expression per cluster identified through differential expression (DE) utilizing the overestimated variance t-test.
The copy number variation (CNV) profile of each cell per individual patient sample was computed utilizing a Python implementation of InferCNV (https://github.com/icbi-lab/infercnvpy), employing a mixture of non-malignant cells as a reference (annotated fibroblasts, endothelial cells, and immune cells) based on their presence in the sample. Cells were clustered according to their CNV profile using Leiden clustering, with clusters labeled as malignant or non-malignant depending on their average CNV score. Subsequently, cells were assigned a malignant or non-malignant status based on their cluster membership per individual patient sample.
Refinement of cell type annotation was performed by analyzing cells from all patients of a single type after integration. For each major TME cell type (T/NK, myeloid, endothelial, fibroblast, muscle), cells having a relatively lower pctMT (<15%) were further analyzed downstream. We strengthened the pctMT threshold only in the TME compartment, as malignant and epithelial cells can display higher basal levels of mitochondrial counts.126 Cells were subsetted per cell type and all cells of the same type were integrated using Harmony,19 followed by Leiden clustering to obtain subclusters. The integration was performed on a cell-type level rather than on the full set of cells to obtain more fine-grained integration. Manual annotation of subclusters was carried out using marker genes identified through differential gene expression with an overestimated variance t-test as before.
Annotations of myeloid cell populations were cross-referenced with pan-cancer myeloid annotations from Cheng et al.,26 while cancer-associated fibroblasts (CAF) cells were compared to pan-cancer CAF annotations from Luo et al.16 For visualization only, we integrated the fully annotated cohort using Harmony, opting not to use the cell-type-specific Harmony integration.
snATAC-seq specific preprocessing
The processed snATAC-seq data was acquired utilizing CellRanger Arc 2.0 (snapshot 28). Subsequently, the Signac package was employed for comprehensive processing of the ATAC data127 (https://stuartlab.org/signac/). Adhering to the guidelines outlined in the 10X multiome Signac vignette, the filtered counts and ATAC fragments obtained from CellRanger Arc 2.0 were utilized to re-call peaks using MACS2128 (https://pypi.org/project/MACS2/). Additionally, peaks located in non-standard chromosomes and genomic blacklisted regions were excluded. The consolidated peaks from all samples underwent further filtration, removing those with a width below 20 bp or exceeding 10,000 bp.
Cell type annotations were directly transferred from the snRNA annotations, as the RNA and ATAC measurements were paired. Cells excluded during standard quality control in the RNA measurements but not in the ATAC measurements were annotated as NA. Subsequently, a comprehensive quality control assessment was conducted on the entire set of cells across all samples. Cells with ATAC counts falling below 1000 or exceeding 100,000, a nucleosome signal surpassing 2, a TSS enrichment below 3, or a fraction read in peaks below 0.15 were filtered out.
Normalization of the ATAC count matrix was executed utilizing the term-frequency inverse-document-frequency (TF-IDF) transformation, following default parameters in Signac. Dimensionality reduction was carried out using Latent Semantic Indexing (LSI) with 40 components on the TF-IDF normalized matrix, with UMAP computed on the harmony-corrected LSI components.
snRNA-seq analysis
Differential abundance testing
Differential abundance testing for the myeloid, CAF, and lymphoid compartments was conducted employing the milopy package20 (https://github.com/emdann/milopy). Of note, the sampling bias, i.e., the fact the resection from the tumor tissue and adjacent normal tissue may vary in tissue size and baseline abundance and types of cells across the tissue, as well as the processing bias, i.e., the fact cells differentially suffer from dissociation and processing, might bias differential abundance testing and limit the interpretability of differential abundance results. For the myeloid compartment, differential abundance testing compared normal adjacent tissue with tumor tissue. For the CAF compartment, differential abundance testing compared primary with metastatic tissue. The Milo method was executed on the cell-type specific Harmony-corrected principal components (PC), utilizing a 20-nearest neighbors graph. Neighborhoods were assigned labels through majority voting: if over 60% of cells within a neighborhood belonged to an individual cell type, the neighborhood was labeled accordingly. Otherwise, the label "mixed" was assigned.
Malignant cell program discovery through consensus negative matrix factorization (cNMF) and characterization
To dissect the malignant cell compartment, we employed consensus non-negative matrix factorization (cNMF)129 (https://github.com/dylkot/cNMF) per individual patient sample and then aggregated the results as described below. cNMF was performed on a sample-level rather than on the full cohort to avoid detecting patient-specific programs primarily driven by technical factors such as batch effects or copy-number variation (CNV) profiles. Cells annotated as putatively malignant based on canonical marker gene expression but not from clustering on inferCNVpy copy number score were filtered. For each sample, cNMF was performed on the RNA counts matrix of the 2,000 most highly variable genes, selecting the number of components (k) based on recommended criteria (i.e., inspecting the error and stability plot and picking the smallest k that minimized error while maximizing stability). Density threshold was set to 0.1 for each sample. cNMF programs expressed in too few cells or showing expression of TME-related genes, potentially indicating contamination, were manually removed.
The cNMF gene expression programs generated per individual patient sample were characterized by a vector of weights per gene representing its contribution to the program. These programs were combined across all samples by calculating their pairwise cosine similarity after removing small (high score in <10 of cells) or contaminated programs. Hierarchical clustering with an average linkage method was then applied to group similar programs into five clusters. A cNMF program was defined by the median weight of clustered gene expression programs, with the top 100 contributing genes used as a gene signature for the cNMF program. Cells from all patients were scored for the resulting cNMF gene signatures using the scanpy scoring method, i.e., the average gene expression of signature genes subtracted with the average gene expression of control genes.
The programs were compared to pan-cancer programs described in Gavish et al.42 For each combination of program uncovered in our dataset and program uncovered in the Gavish et al. publication, we computed the fraction of genes that were found in both programs on the number of genes from the Gavish et al. programs captured in our dataset. We also compared the programs to the Barrett’s esophagus programs described by Nowicki-Osuch et al.1 using Gene Set Enrichment Analysis (GSEA).130 Finally, GSEA130 was run using the prerank function on the ranked list of genes associated with each program, using the hallmarks of cancer as a search database.131
Validation of malignant cell programs in external datasets
In order to assess the reproducibility of the malignant cell cNMF programs identified within our cohort, we conducted a similar analysis on the malignant cell compartment of two external single-cell RNA sequencing studies focusing on esophageal adenocarcinoma: the datasets from Carroll et al.9 and Croft et al.10 Following the methodology outlined in the previous section, we applied cNMF to derive programs for each sample in these external datasets. Subsequently, we computed the cosine similarity between each of these programs and the cNMF programs previously identified in our own dataset. This comparative analysis allowed us to determine the degree of recurrence and consistency of the identified programs across multiple independent datasets.
snATAC analysis and link with snRNA
Link between snRNA and snATAC
To establish a connection between the programs identified in the malignant cell compartment and ATAC peaks, we calculated the Pearson correlation in malignant cells between the TF-IDF-normalized peak accessibility and program score transferred from the snRNA-seq. We then filtered out peaks in the 25% least expressed category in malignant data before performing the correlation computation. Subsequently, we determined the false-discovery rate (FDR) corrected q-value associated with correlation for each peak. Peaks with an FDR q-value below 0.05 and a Pearson correlation coefficient exceeding 0.1 were considered significantly correlated with a specific program.
Representative cells and link between RNA and ATAC identity
To establish the connection between the transcriptomic and epigenetic characteristics of cells, we identified representative cells for each cNMF program. Specifically, we selected cells within the top 5% highest cNMF score for each program, ensuring exclusivity by removing cells that ranked in the top 5% for two or more programs. These cells were designated as cNMF representative cells and were utilized to depict genome tracks surrounding genes of interest. Notably, due to differential recovery rates of ATAC and RNA, the proportion of representative cells with paired ATAC measurements varied.
To characterize the ATAC identity of cells, we identified the top 200 most significantly correlated regions with each cNMF program as cNMF-associated regions. Subsequently, we computed the Z score for each region, estimating the mean and standard deviation across the population of cNMF representative cells. The ATAC data were then scored for each program using the mean Z score of cNMF-associated regions, and each cell was assigned an ATAC identity based on the maximum score. A comparison between RNA and ATAC identities was performed using a confusion matrix.
Drawing inspiration from previous work,51 we assigned a plasticity score to each cell using Shannon’s entropy as a measure of plasticity. We assigned a probability of belonging to a program using a softmax transformation with temperature. Let be the ATAC score associated with cNMFi in cell j; we transformed the score in probability
The temperature parameter T was chosen to optimize the calibration curve associated with RNA and ATAC identity correspondence (Figure S14). The plasticity score of cell j was then computed as
We finally computed the distribution of plasticity scores in the cNMF representative populations.
Spatial transcriptomics (ST) analysis
Visium ST data preparation and sequencing
FFPE-embedded tissue sections of 3–10 μm thickness were sectioned then placed on a slide. H&E staining was performed by Brigham and Women’s Hospital Pathology Department core facility. When available, 2–4 FFPE scrolls of 10–20 μm thickness were collected in microtubes and stored at −200C. RNA quality was assessed using FFPE scrolls or from tissue sections previously placed on a slide by gently removing the FFPE section with a sterile blade and immediately transferring it to a microtube. RNA extraction was carried out using a Qiagen RNeasy FFPE kit. RNA integrity, measured by DV200 value, was determined using the Agilent 4200 TapeStation with RNA High Sensitivity ScreenTape was used. FFPE H&E-stained slides were imaged according to the Visium CytAssist Spatial Gene Expression Imaging Guidelines Technical note. Briefly, using the Leica Aperio VERSA scanner microscope, slides were scanned at 10× magnification. Next, the hardest coverslip was removed, and the sample deparaffinized according to the 10X Genomics Visium CytAssist Spatial Tissue Preparation guide (CG000518 Rev C) and FFPE – deparaffinization and decrosslinking guide (CG000520 Rev B). Hardset coverslips were removed by immersing them in xylene for 10 min, twice for each slide. Then, slides were immersed in 100% ethanol for 3 min, 2 times, followed by immersion in 96% ethanol for 3 min twice and finally in 70% ethanol for 3 min. Slides were incubated overnight at 4°C before proceeding to destaining and decrosslinking according to the guidelines. Next, the slide was placed in the Visium CytAssist Tissue Slide Cassette and destained by incubating on a low profile thermocycler adapter in a thermal cycler (BioRad C1000 Touch) at 420C in 0.1 N HCL. Subsequently, decrosslinking with 10X buffers was performed at 950C for 1 h. All 5 downstream steps were followed according to the Visium CytAssist Spatial Gene Expression User Guide (CB000495, Rev E) including using 6.5 mm × 6.5 mm Visium capture area slides; (1) Probe hybridization; (2) Probe ligation, (3) Probe Release & Extension; (4) Pre-amplification and SPRIselect cleanup; (5) Visium CytAssist Spatial Gene Expression - Probe Based library construction. Visium Human Transcriptome probe set v2.0 used, which contains 18,536 genes targeted by 54,5018 probes. 2.4% (451) of these genes are excluded by default due to predicted off-target activity to a different gene. All cleanup methods were performed using SPRIselect beads (Beckman Coulter), Qiagen EB buffer, and 10X Magnetic separator. Cycle number determination for GEX sample index PCR was performed using Kapa SYBR Fast qPCR Master Mix and qPCR amplification plots were visualized on the 7900HT Real-Time PCR system. Dual Index TS Set A, contains a mix of one unique i7 and one unique i5 sample index was used for sample index PCR. GEX Post-Library Construction QC was performed on Agilent TapeStation DNA High-Sensitivity ScreenTape. Libraries were normalized and pooled for sequencing on NextSeq 150 flow cells (Illumina, Inc., San Diego, CA, USA).
Xenium ST data preparation
Formalin-fixed, paraffin-embedded (FFPE) tissue sections (5 μm) from six patients were placed on Xenium slides following the demonstrated protocol guide (CG000578). Deparaffinization and tissue permeabilization were performed according to the 10x Genomics Xenium User Guide (GC000580). Probe hybridization and cell segmentation staining were carried out following the 10x Genomics User Guide (CG000760 Rev B). The hybridized slide was then loaded onto the Xenium Analyzer in accordance with the User Guide (CG000584 Rev G). The 5K Prime Human Pan-Tissue & Pathway Panel was used for this study. Post-Xenium Analyzer run H&E staining was performed according to the demonstrated protocol CG000613.
We acknowledge the Molecular Imaging Core (mic.dana-farber.org) at Dana-Farber Cancer Institute for Xenium staining and imaging.
Visium ST data preprocessing and cell type annotation
Following the Visium spatial transcriptomics sequencing, the raw bcl files were demultiplexed using bcl2fastq and aligned to the human reference genome GRCh38 for each sample via SpaceRanger (v2.1.1). Quality control procedures were conducted individually for each patient using Squidpy.132 Spots with fewer than 5,000 counts, genes expressed in fewer than 10 spots, and spots exhibiting over 30% reads mapped to mitochondrial DNA (pctMT) were filtered out.
The copy number variation (CNV) profile of each cell was computed utilizing a Python implementation of InferCNV (https://github.com/icbi-lab/infercnvpy). To get an initial estimate of malignant versus normal ST spots, used as input to inferCNV, we employed a method inspired from the STARCH method initialization.133 Briefly, we ran PCA on the log(1+CP10K) normalized ST data and clustered the data using K-means (k=2). We assigned the cluster with the highest average expression to the tumor cluster and the remaining cluster to normal. Normal spots are used as reference for the inferCNV algorithm. We then clustered spots according to their CNV profile using Leiden clustering and assigned clusters with a strong CNV profile to tumor spots. Clusters with a similar CNV profile to the tumor spots but with a weaker overall signal were assigned to mixed spots. Finally, spots with no CNV profile or with a CNV profile opposite to the tumor profile were labeled as normal spots. Hence, this procedure yields a refined assignment to spots to mostly tumor, mixed tumor and TME, and mostly TME regions. We further refined the annotations by spatially smoothing annotations: if a tumor or normal spot contained one or zero spots in the 6-nearest neighbors of the same category, the label was reassigned to the majority label of the neighborhood (tumor, mixed, or normal).
We then computed the distance of each tumor spot to the periphery of the tumor using the shortest path to the nearest normal or mixed spot. Spots with a small assigned distance were hence located at the tumor periphery, while spots with a large assigned distance were located at the tumor core.
Xenium ST data preprocessing and cell type annotation
Xenium spatial transcriptomics data were preprocessed using the SOPA software,134 following established guidelines. Initial cell segmentation was performed using the Cellpose135 “cyto3” model with parameters recommended in the SOPA tutorial, using the DAPI and ATP1A1/CD45/E-Cadherin channels for detection. For each sample, cells with fewer than 10 counts were excluded from downstream analysis.
Cell type annotation was conducted through Leiden clustering on a 15-nearest neighbor graph constructed from a 50-component PCA. Clusters were manually annotated based on canonical marker analysis and comparison with single-nuclei population markers to ensure accurate cell type classification.
Deconvolution of visium ST data
To estimate the proportion of specific cell types within each spot as well as to obtain cell-type specific gene expression in the Visium data, we ran Cell2Location80 on each sample, using the full annotated snRNA-seq discovery cohort as reference. We trained the negative binomial model on the discovery cohort using default parameters to obtain estimated cell-type specific average gene profiles. We then ran the Cell2Location model, using as prior N = 5 average cells per spot and alpha = 20 (relaxed regularization). This yielded an estimated number of cells from a specific cell type per spot. We then sampled from the posterior distribution of the trained model to obtain cell-type specific gene expression per spot.
Scoring the cNMF programs and TME subtypes
For the Visium data, we used the carcinoma-specific gene expression matrix generated by Cell2Location to score the cNMF programs, using the top 100 cNMF contributing genes as a signature, similarly as for the snRNA-seq data. The matrix was first normalized using the log(1+CP10K) transformation. The cNMF score was computed as the average expression of the Z score of signature genes, where the Z score of a gene is computed as , where X is the original gene expression, (resp. ) is the gene average (resp. standard deviation) over all spots. We used a similar procedure to score the TME subtypes, using the corresponding deconvolved layer for scoring, i.e., myeloid-specific gene expression matrix for myeloid subtypes and fibroblast-specific for fibroblast subtypes.
For the Xenium data, the cNMF score was computed as the average Z-scored expression of the top 100 cNMF-contributing genes present in the Xenium panel. The Z score was calculated as , where X is the original gene expression, (resp. ) is the gene average (resp. standard deviation) of all carcinoma cells across all 16 samples in the dataset.
Spatial region discovery with CellCharter in Xenium data
To identify recurrent spatial regions, we applied the CellCharter81 computational tool. First, we computed a latent representation for all cells using scVI136 with default settings, conditioning on the sample to account for sample-specific effects. Next, we concatenated the embeddings from neighboring cells (up to three hops) and applied a Gaussian mixture model to the combined embeddings to identify spatial clusters.
Following the recommended protocol, we ran the procedure five times for each cluster number (k = 3 to 10) and selected the k-value that produced the highest clustering stability. Each cell was then assigned to a cluster, which was shared across patients.
Spatially constrained malignant and TME interaction with NCEM in visium data
To obtain estimates of interaction between cell types present in our data, we used node-centric expression models (NCEM)92 on the Cell2Location deconvolved data for Visium data, using the following tutorial to prepare the data (https://github.com/theislab/ncem_benchmarks/blob/main/notebooks/data_preparation/deconvolution/cell2location_human_lymphnode.ipynb) and the following tutorial to process the data (https://github.com/theislab/ncem_tutorials/blob/main/tutorials/type_coupling_visium.ipynb). In brief, the intensity of the interaction is estimated as the L1 norm of the significant coefficients of the model predicting the gene expression from the cell type and niche. In the visual representation, the strength of interactions is proportional to the width of the line linking two cell types; only cell types that have more than 25 significant coefficients (FDR p < 0.05) are linked in the graph.
Ligand-receptor interactions using COMMOT and Xenium validation
To identify significant cell-cell interactions at a local scale using Visium data, we applied COMMOT, an optimal transport-based spatial communication tool.93 COMMOT was run using the CellChat “cell-cell contact” and “secreted signaling” ligand-receptor databases, with interactions restricted to spots within 200μm for cell-cell contact, effectively limiting the analysis to neighboring spots, and 400μm for secreted signaling. The tool generates sender and receiver scores at the pathway level, which we correlated with the previously defined malignant, stromal, and myeloid program scores.
Pathways that showed consistent correlations across samples (detected in at least 3 of 5 patients) were selected for further investigation. We first visualized these interactions by plotting the median correlation scores between sender and receiver activities across patients. Additionally, we leveraged COMMOT’s inferred spatial signaling directionality to depict interactions within the THY1, MK, and MIF signaling pathways.
For Xenium validation, we computed the receiver score for each pathway by averaging the expression of receptors from the corresponding CellChat ligand-receptor database, when present in the Xenium panel. Only the ligand THY1 was available in the Xenium panel, allowing us to estimate sender activity only for the THY1 pathway. Receiver scores were then visualized in cells predicted to be involved in the interaction through COMMOT for each pathway: inflammatory CAF, TAM1 macrophages, and DCs for THY1; carcinoma, inflammatory CAF, and TAM1 for MK; and carcinoma, TAM1, and DCs for MIF.
Enhancer-driven gene regulatory network inference
To construct an enhancer-driven gene regulatory network (GRN), we utilized the SCENIC+ software.36 The SCENIC+ analysis was conducted at a sample level, with subsequent aggregation of the sample results. Samples with adequate ATAC recovery were included, excluding two normal adjacent samples and one primary sample (P1, P2 and P3).
Enhancer-driven GRN inference with SCENIC+
For each sample, we first created a pycisTopic object by integrating the filtered gene expression and cell type annotations, preprocessed according to the pipeline outlined in “snRNA-seq data preprocessing and cell type annotation,” along with the ATAC fragments obtained through the Cellranger ARC pipeline and the MACS2-called peaks. The analysis encompassed all cells that passed the Cellranger ARC filtering. Subsequently, we employed the serial Latent Dirichlet Allocation (LDA) implementation in pycisTopic, running models with 2, 4, 10, and 16 topics. The selection of the optimal model was based on a combination of metrics as recommended in the pycisTopic tutorial (https://pycistopic.readthedocs.io/en/latest/Single_sample_workflow-RTD.html).
The topic-region distributions were binarized using both the Otsu method and the top 3,000 regions per topic, as advised in the tutorial. Additionally, we computed the differentially accessible regions per cell type, utilizing the cell types annotated in the snRNA data. To identify enriched motifs in candidate enhancer regions, we executed pycisTarget with precomputed databases for motif enrichment and annotations obtained from the auxiliary data of cisTarget (https://resources.aertslab.org/cistarget/).
Subsequently, genes and regions expressed in less than 10% of the cells were filtered out, and the September 2019 Ensembl version was employed for annotation (https://sep2019.archive.ensembl.org/index.html). Finally, leveraging the paired snRNA- and snATAC-seq data and the motif enrichment matrix derived from pycisTarget, we inferred a GRN with SCENIC+ default parameters.
Identifying candidate master transcription factors associated with TME and malignant programs
To identify potential candidate master transcription factors (TFs) associated with cell types or programs, we analyzed the results of SCENIC+ following the tutorial (https://scenicplus.readthedocs.io/en/latest/Scenicplus_step_by_step-RTD.html). SCENIC+ provided outputs of enhancer-driven regulons (eRegulons), delineated as a transcription factor and its regulated genes and regions. The eRegulons were scored in each cell using AUCell, and the TF-eRegulon relationship was computed using pseudobulks for each cell type. High-quality eRegulons were selected, with those exhibiting a TF-eRegulon correlation below 0.2 being removed.
To identify candidate TFs associated with each major cell type in the TME, we calculated the regulon specificity score (RSS) for each eRegulon. Candidate TFs were considered associated with a TME cell type if they exhibited a significant RSS in at least two samples for this cell type. Subsequently, for each cell type, we determined the TF expression Z score on the entire cohort, along with the associated gene-based and region-based eRegulon Z-scores. Candidate TFs were evaluated for their consistent overexpression in the cell type of interest across all three Z-scores.
To identify candidate master TFs associated with the cNMF programs, we computed the correlation for each available TF between the cNMF program scores and all three measurements of TF activity (TF gene expression, gene-based eRegulon score, and region-based eRegulon score). The TFs were ranked based on their correlation with the three measurements, and the median rank of TFs across all three measurements was computed. Only TFs with a correlation exceeding 0.1 in all three modalities were selected. The top 20 TFs with the highest correlation mean across modalities were designated as candidate TFs.
Validation of the identified transcription factors in external datasets
To validate the association between the inflammatory cancer-associated fibroblast (CAF) phenotype we identified and its associated transcription factors (TFs), we utilized the pan-cancer CAF atlas provided by Luo et al.16 Furthermore, to confirm the link between the revealed cNMF programs and their respective TFs, we employed two previously mentioned external single-cell validation cohorts (Croft et al.10 and Carroll et al; 9).
Using the top 100 marker genes for the inflammatory CAF phenotype or the cNMF programs as signatures, we scored them using the scanpy scoring function in the external cohorts. To prevent overestimation of the correlation between the score and TF, we excluded the candidate TFs from the original signature. Due to the absence of associated scATAC-seq data, SCENIC+ could not be executed in external datasets. Therefore, we utilized the SCENIC program137 on the three external datasets to estimate regulon activity.
Subsequently, we computed the correlation between the inflammatory CAF score or cNMF scores and the expression of all known TFs listed by Lambert et al.138 (http://humantfs.ccbr.utoronto.ca/). Additionally, we calculated the correlation between the scores and the eRegulon score as determined by SCENIC. The candidate TFs identified in our dataset through SCENIC+ were highlighted among the most highly correlated TFs, considering both their correlation with TF gene expression and eRegulon score. Of note, we only computed the correlation between the scores uncovered in Carroll et al. (cNMF1, cNMF2, cNMF4, and cNMF5) and Croft et al. (cNMF1, cNMF3, and cNMF4).
Furthermore, to examine whether the trajectory of candidate mTFs aligned with the hypothesized trajectories across cNMF programs, we investigated the expression of candidate mTFs linked with cNMF4 and cNMF5, which were postulated as stable opposed programs. Cells were ranked based on the difference Δ=(cNMF4 score - cNMF5 score), representing the trajectory from cNMF5 toward cNMF4. Subsequently, cells were grouped into ten equally sized bins, and the average expression level of candidate mTFs along with their associated 95% confidence interval was estimated for each bin.
Link between malignant programs and clinical characteristics in external bulk datasets
To ascertain whether the identified cNMF programs correlate with clinical characteristics, we assessed the scores of these programs in external bulk datasets and examined their association with various clinical parameters. To minimize the inclusion of TME components in our signature, we curated a cancer-specific signature. We selected the top 200 genes with the highest weight and subsequently filtered out genes expressed in at least 10% of any major TME cell type (endothelial, fibroblast, muscle, myeloid, lymphoid), using the remaining genes as the signature for each program.
We retrieved data from patients with esophageal adenocarcinoma from the TCGA ESCA project.7 RNA-seq Fragments Per Kilobase of transcript per Million mapped reads (FPKM) data, non-silent mutation calls, and survival information from Liu et al.139 were obtained from the UCSC Xena browser (https://xenabrowser.net/datapages/). Additionally, clinical details were directly obtained from the TCGA Network study.7 RNA-seq expression data and clinical characteristics from the study by Hoefnagel et al.17 were also downloaded, with the RNA-seq raw counts transformed into transcript per million (TPM). Bulk RNA-seq expression data and associated clinical characteristics from the study by Carroll et al.9 were obtained and similarly transformed into TPM.
The cNMF programs were scored using single-sample Gene Set Enrichment Analysis (ssGSEA),140 with input data being FPKM for TCGA or TPM for Hoefnagel et al. or Carroll et al. The resulting scores were then correlated with TNM staging in the TCGA cohort. Survival analysis was conducted in the three cohorts using a univariate Cox Proportional Hazards model on standardized scores, employing default parameters from the lifelines package (https://lifelines.readthedocs.io/en/latest/fitters/regression/CoxPHFitter.html). For the Carroll et al. dataset, we used the subset of bulk expression obtained before the treatment (PreTx) to avoid introducing high correlation between patients. To link mutations to cNMF programs, we computed the point biserial correlation between scores and non-silent mutation calling and corrected p-values with FDR correction.
Link between TME and malignant programs and immune checkpoint inhibitor therapy clinical benefit
To assess whether presence of specific programs, notably the inflammatory CAF program and the malignant cNMF programs, was linked to response to immune checkpoint inhibitor (ICI) therapy, we compared the distribution of score programs across patients of the Carroll et al.9 with or without clinical benefit. Patients were categorized into two groups based on their response to ICI therapy: those experiencing clinical benefit (CB) and those with no clinical benefit (NCB), as annotated in the Carroll et al. cohort. Notably, some patients lacked CB annotations, and patients from the operable cohort in the single-cell atlas were excluded from the CB vs. NCB comparison, participating only in patient-level comparisons. Paired measurements were obtained for patients before treatment (PreTx), after a 4-week ICI treatment window (ICI-4W), and following combined chemotherapy and ICI treatment (PostTx) when available. The distribution of inflammatory CAF and cNMF scores was compared between CB and NCB groups across PreTx and ICI-4W timepoints. Patient-level comparisons, including PostTx when applicable, were also conducted. Significance testing was performed using the Mann-Whitney U test to determine differences between scores at PreTx, ICI-4W, and PostTx measurements.
Ecotype analysis using BayesPrism
To explore the potential association between the identified malignant programs and the TME composition, we conducted a deconvolution analysis on the three previously described independent cohorts (TCGA,7 Hoefnagel et al; ,17 and Carroll et al; 9). We used the BayesPrism algorithm141 with default parameters, with the single-cell data from the Carroll et al. study9 as a reference cohort. BayesPrism provided estimates of the proportions of various cell types present in the datasets.
Inspired by the methodology outlined by Wang et al.,88 we identified ecotypes within the datasets. Each sample was characterized based on its estimated proportion of cell types derived from the Carroll et al. dataset. Subsequently, we computed the Z score across all samples from both cohorts regarding the proportions of cell types. Euclidean pairwise distances were then calculated between all samples, followed by hierarchical clustering with Ward linkage to group samples with similar cell type proportions.
Using the methodology described in the preceding paragraph, we scored the identified programs and assessed the enrichment of scores in the uncovered ecotypes. This analysis aimed to elucidate any potential relationships between the malignant programs and the composition of the TME across the studied cohorts.
Prediction of drug sensitivity using scTherapy
To predict drug sensitivity for each malignant cNMF program, we employed scTherapy83 following its tutorial for subclone-specific drug response prediction, adapting the approach to use cNMF representative cells instead of subclones. We first identified differentially expressed genes specific to each program using these representative cells and applied the filtering criteria outlined in the tutorial to predict monotherapy sensitivity. For each program, we selected drugs predicted to have high or high-to-moderate efficacy. To identify potential combination therapies, we selected drug pairs that, together, were predicted to target all five cNMF programs.
Quantification and statistical analysis
Details of statistical analyses of the various experiments are described in the relevant methods section and in the respective figure legends and main text, if appropriate. If not specified, statistical analysis was carried out using Python, packages “scipy” and “statannotations”. To determine significance of difference between two groups we used the Mann-Whitney U test a p-value of <0.05 was considered significant. When necessary, we corrected for multiple testing using the False Discovery Rate (FDR) (indicated as q). To estimate the significance of the correlation between two groups we used Pearson’s correlation analysis. For Gene Set Enrichment Analysis (GSEA) we used their developed empirical test to estimate significance, which was corrected using FDR. Significance was set at q < 0.05. For survival analysis, the significance for the Cox proportional hazard test was computed using the Wald test, while for the Kaplan-Meier curves we used the two-side log-rank test.
Published: June 10, 2025
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.xcrm.2025.102188.
Contributor Information
Valentina Boeva, Email: valentina.boeva@inf.ethz.ch.
Eliezer M. Van Allen, Email: eliezerm_vanallen@dfci.harvard.edu.
Supplemental information
Only OncoPanel tested variants present in at least one sample are indicated.
The numbers correspond to the estimated number of copies of the gene (2 = no event, >4 = amplification, 0 = two copy deletion). Only genes present in at least one sample are indicated.
1 indicates the presence of the SV.
Drugs predicted to be efficacious against cells expressing specific programs are indicated by a 1.
References
- 1.Nowicki-Osuch K., Zhuang L., Jammula S., Bleaney C.W., Mahbubani K.T., Devonshire G., Katz-Summercorn A., Eling N., Wilbrey-Clark A., Madissoon E., et al. Molecular phenotyping reveals the identity of Barrett’s esophagus and its malignant transition. Science. 2021;373:760–767. doi: 10.1126/science.abd1449. [DOI] [PubMed] [Google Scholar]
- 2.Que J., Garman K.S., Souza R.F., Spechler S.J. Pathogenesis and Cells of Origin of Barrett’s Esophagus. Gastroenterology (New York, N. Y., 1943) 2019;157:349–364.e1. doi: 10.1053/j.gastro.2019.03.072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Jiang M., Li H., Zhang Y., Yang Y., Lu R., Liu K., Lin S., Lan X., Wang H., Wu H., et al. Transitional basal cells at the squamous-columnar junction generate Barrett’s oesophagus. Nature. 2017;550:529–533. doi: 10.1038/nature24269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Pennathur A., Gibson M.K., Jobe B.A., Luketich J.D. Oesophageal carcinoma. Lancet. 2013;381:400–412. doi: 10.1016/S0140-6736(12)60643-6. [DOI] [PubMed] [Google Scholar]
- 5.Derakhshan M.H., Arnold M., Brewster D.H., Going J.J., Mitchell D.R., Forman D., McColl K.E.L. Worldwide Inverse Association between Gastric Cancer and Esophageal Adenocarcinoma Suggesting a Common Environmental Factor Exerting Opposing Effects. Am. J. Gastroenterol. 2016;111:228–239. doi: 10.1038/ajg.2015.405. [DOI] [PubMed] [Google Scholar]
- 6.Brown L.M., Devesa S.S., Chow W.-H. Incidence of adenocarcinoma of the esophagus among white Americans by sex, stage, and age. J. Natl. Cancer Inst. 2008;100:1184–1187. doi: 10.1093/jnci/djn211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cancer Genome Atlas Research Network. Analysis Working Group: Asan University. BC Cancer Agency. Brigham and Women’s Hospital. Broad Institute. Brown University. Case Western Reserve University. Dana-Farber Cancer Institute. Duke University. Greater Poland Cancer Centre Integrated genomic characterization of oesophageal carcinoma. Nature. 2017;541:169–175. doi: 10.1038/nature20805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Enzinger P.C., Mayer R.J. Esophageal cancer. Esophageal cancer. N. Engl. J. Med. 2003;349:2241–2252. doi: 10.1056/NEJMra035010. [DOI] [PubMed] [Google Scholar]
- 9.Carroll T.M., Chadwick J.A., Owen R.P., White M.J., Kaplinsky J., Peneva I., Frangou A., Xie P.F., Chang J., Roth A., et al. Tumor monocyte content predicts immunochemotherapy outcomes in esophageal adenocarcinoma. Cancer Cell. 2023;41:1222–1241. doi: 10.1016/j.ccell.2023.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Croft W., Evans R.P.T., Pearce H., Elshafie M., Griffiths E.A., Moss P. The single cell transcriptional landscape of esophageal adenocarcinoma and its modulation by neoadjuvant chemotherapy. Mol. Cancer. 2022;21:200. doi: 10.1186/s12943-022-01666-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Dagogo-Jack I., Shaw A.T. Tumour heterogeneity and resistance to cancer therapies. Nat. Rev. Clin. Oncol. 2018;15:81–94. doi: 10.1038/nrclinonc.2017.166. [DOI] [PubMed] [Google Scholar]
- 12.Marusyk A., Janiszewska M., Polyak K. Intratumor Heterogeneity: The Rosetta Stone of Therapy Resistance. Cancer Cell. 2020;37:471–484. doi: 10.1016/j.ccell.2020.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ji A.L., Rubin A.J., Thrane K., Jiang S., Reynolds D.L., Meyers R.M., Guo M.G., George B.M., Mollbrink A., Bergenstråhle J., et al. Multimodal Analysis of Composition and Spatial Architecture in Human Squamous Cell Carcinoma. Cell. 2020;182:1661–1662. doi: 10.1016/j.cell.2020.08.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Neftel C., Laffy J., Filbin M.G., Hara T., Shore M.E., Rahme G.J., Richman A.R., Silverbush D., Shaw M.L., Hebert C.M., et al. An Integrative Model of Cellular States, Plasticity, and Genetics for Glioblastoma. Cell. 2019;178:835–849. doi: 10.1016/j.cell.2019.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bi K., He M.X., Bakouny Z., Kanodia A., Napolitano S., Wu J., Grimaldi G., Braun D.A., Cuoco M.S., Mayorga A., et al. Tumor and immune reprogramming during immunotherapy in advanced renal cell carcinoma. Cancer Cell. 2021;39:649–661. doi: 10.1016/j.ccell.2021.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Luo H., Xia X., Huang L.B., An H., Cao M., Kim G.D., Chen H.N., Zhang W.H., Shu Y., Kong X., et al. Pan-cancer single-cell analysis reveals the heterogeneity and plasticity of cancer-associated fibroblasts in the tumor microenvironment. Nat. Commun. 2022;13:6619. doi: 10.1038/s41467-022-34395-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hoefnagel S.J.M., Koemans W.J., Khan H.N., Koster J., Meijer S.L., van Dieren J.M., Kodach L.L., van Sandick J.W., Calpe S., Del Sancho-Serra C.M., et al. Identification of Novel Molecular Subgroups in Esophageal Adenocarcinoma to Predict Response to Neo-Adjuvant Therapies. Cancers. 2022;14 doi: 10.3390/cancers14184498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Garcia E.P., Minkovsky A., Jia Y., Ducar M.D., Shivdasani P., Gong X., Ligon A.H., Sholl L.M., Kuo F.C., MacConaill L.E., et al. Validation of OncoPanel: A targeted next-generation sequencing assay for the detection of somatic variants in cancer. Arch. Pathol. Lab Med. 2017;141:751–758. doi: 10.5858/arpa.2016-0527-OA. [DOI] [PubMed] [Google Scholar]
- 19.Korsunsky I., Millard N., Fan J., Slowikowski K., Zhang F., Wei K., Baglaenko Y., Brenner M., Loh P.R., Raychaudhuri S. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods. 2019;16:1289–1296. doi: 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dann E., Henderson N.C., Teichmann S.A., Morgan M.D., Marioni J.C. Differential abundance testing on single-cell data using k-nearest neighbor graphs. Nat. Biotechnol. 2022;40:245–253. doi: 10.1038/s41587-021-01033-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Liguori M., Digifico E., Vacchini A., Avigni R., Colombo F.S., Borroni E.M., Farina F.M., Milanesi S., Castagna A., Mannarino L., et al. The soluble glycoprotein NMB (GPNMB) produced by macrophages induces cancer stemness and metastasis via CD44 and IL-33. Cell. Mol. Immunol. 2021;18:711–722. doi: 10.1038/s41423-020-0501-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.van Leent M.M.T., Beldman T.J., Toner Y.C., Lameijer M.A., Rother N., Bekkering S., Teunissen A.J.P., Zhou X., van der Meel R., Malkus J., et al. Prosaposin mediates inflammation in atherosclerosis. Sci. Transl. Med. 2021;13 doi: 10.1126/scitranslmed.abe1433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Yi Y.-S., Kim H.G., Kim J.H., Yang W.S., Kim E., Jeong D., Park J.G., Aziz N., Kim S., Parameswaran N., Cho J.Y. Syk-MyD88 Axis Is a Critical Determinant of Inflammatory-Response in Activated Macrophages. Front. Immunol. 2021;12 doi: 10.3389/fimmu.2021.767366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhang X., Sun Y., Ma Y., Gao C., Zhang Y., Yang X., Zhao X., Wang W., Wang L. Tumor-associated M2 macrophages in the immune microenvironment influence the progression of renal clear cell carcinoma by regulating M2 macrophage-associated genes. Front. Oncol. 2023;13 doi: 10.3389/fonc.2023.1157861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Beyer M., Mallmann M.R., Xue J., Staratschek-Jox A., Vorholt D., Krebs W., Sommer D., Sander J., Mertens C., Nino-Castro A., et al. High-resolution transcriptome of human macrophages. PLoS One. 2012;7 doi: 10.1371/journal.pone.0045466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Cheng S., Li Z., Gao R., Xing B., Gao Y., Yang Y., Qin S., Zhang L., Ouyang H., Du P., et al. A pan-cancer single-cell transcriptional atlas of tumor infiltrating myeloid cells. Cell. 2021;184:792–809. doi: 10.1016/j.cell.2021.01.010. [DOI] [PubMed] [Google Scholar]
- 27.Raskov H., Orhan A., Christensen J.P., Gögenur I. Cytotoxic CD8+ T cells in cancer and cancer immunotherapy. Br. J. Cancer. 2021;124:359–367. doi: 10.1038/s41416-020-01048-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schumacher T.N., Thommen D.S. Tertiary lymphoid structures in cancer. Science. 2022;375 doi: 10.1126/science.abf9419. [DOI] [PubMed] [Google Scholar]
- 29.Chiossone L., Dumas P.-Y., Vienne M., Vivier E. Natural killer cells and other innate lymphoid cells in cancer. Nat. Rev. Immunol. 2018;18:671–688. doi: 10.1038/s41577-018-0061-z. [DOI] [PubMed] [Google Scholar]
- 30.Cabrita R., Lauss M., Sanna A., Donia M., Skaarup Larsen M., Mitra S., Johansson I., Phung B., Harbst K., Vallon-Christersson J., et al. Tertiary lymphoid structures improve immunotherapy and survival in melanoma. Nature. 2020;577:561–565. doi: 10.1038/s41586-019-1914-8. [DOI] [PubMed] [Google Scholar]
- 31.Helmink B.A., Reddy S.M., Gao J., Zhang S., Basar R., Thakur R., Yizhak K., Sade-Feldman M., Blando J., Han G., et al. B cells and tertiary lymphoid structures promote immunotherapy response. Nature. 2020;577:549–555. doi: 10.1038/s41586-019-1922-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.DiLillo D.J., Yanaba K., Tedder T.F. B cells are required for optimal CD4+ and CD8+ T cell tumor immunity: therapeutic B cell depletion enhances B16 melanoma growth in mice. J. Immunol. 2010;184:4006–4016. doi: 10.4049/jimmunol.0903009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Xu Y., Wei Z., Feng M., Zhu D., Mei S., Wu Z., Feng Q., Chang W., Ji M., Liu C., et al. Tumor-infiltrated activated B cells suppress liver metastasis of colorectal cancers. Cell Rep. 2022;40 doi: 10.1016/j.celrep.2022.111295. [DOI] [PubMed] [Google Scholar]
- 34.Chen Y., McAndrews K.M., Kalluri R. Clinical and therapeutic relevance of cancer-associated fibroblasts. Nat. Rev. Clin. Oncol. 2021;18:792–804. doi: 10.1038/s41571-021-00546-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sun H., Wang X., Wang X., Xu M., Sheng W. The role of cancer-associated fibroblasts in tumorigenesis of gastric cancer. Cell Death Dis. 2022;13:874. doi: 10.1038/s41419-022-05320-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bravo González-Blas C., De Winter S., Hulselmans G., Hecker N., Matetovici I., Christiaens V., Poovathingal S., Wouters J., Aibar S., Aerts S. SCENIC+: single-cell multiomic inference of enhancers and gene regulatory networks. Nat. Methods. 2023;20:1355–1367. doi: 10.1038/s41592-023-01938-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Becker W.R., Nevins S.A., Chen D.C., Chiu R., Horning A.M., Guha T.K., Laquindanum R., Mills M., Chaib H., Ladabaum U., et al. Single-cell analyses define a continuum of cell state and composition changes in the malignant transformation of polyps to colorectal cancer. Nat. Genet. 2022;54:985–995. doi: 10.1038/s41588-022-01088-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kim W., Barron D.A., San Martin R., Chan K.S., Tran L.L., Yang F., Ressler S.J., Rowley D.R. RUNX1 is essential for mesenchymal stem cell proliferation and myofibroblast differentiation. Proc. Natl. Acad. Sci. USA. 2014;111:16389–16394. doi: 10.1073/pnas.1407097111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Halperin C., Hey J., Weichenhan D., Stein Y., Mayer S., Lutsik P., Plass C., Scherz-Shouval R. Global DNA Methylation Analysis of Cancer-Associated Fibroblasts Reveals Extensive Epigenetic Rewiring Linked with RUNX1 Upregulation in Breast Cancer Stroma. Cancer Res. 2022;82:4139–4152. doi: 10.1158/0008-5472.CAN-22-0209. [DOI] [PubMed] [Google Scholar]
- 40.Bobowski-Gerard M., Boulet C., Zummo F.P., Dubois-Chevalier J., Gheeraert C., Bou Saleh M., Strub J.M., Farce A., Ploton M., Guille L., et al. Functional genomics uncovers the transcription factor BNC2 as required for myofibroblastic activation in fibrosis. Nat. Commun. 2022;13:5324. doi: 10.1038/s41467-022-33063-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Lee K.-W., Yeo S.Y., Gong J.R., Koo O.J., Sohn I., Lee W.Y., Kim H.C., Yun S.H., Cho Y.B., Choi M.A., et al. PRRX1 is a master transcription factor of stromal fibroblasts for myofibroblastic lineage progression. Nat. Commun. 2022;13:2793. doi: 10.1038/s41467-022-30484-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gavish A., Tyler M., Greenwald A.C., Hoefflin R., Simkin D., Tschernichovsky R., Galili Darnell N., Somech E., Barbolin C., Antman T., et al. Hallmarks of transcriptional intratumour heterogeneity across a thousand tumours. Nature. 2023;618:598–606. doi: 10.1038/s41586-023-06130-4. [DOI] [PubMed] [Google Scholar]
- 43.Rychahou P.G., Kang J., Gulhati P., Doan H.Q., Chen L.A., Xiao S.Y., Chung D.H., Evers B.M. Akt2 overexpression plays a critical role in the establishment of colorectal cancer metastasis. Proc. Natl. Acad. Sci. USA. 2008;105:20315–20320. doi: 10.1073/pnas.0810715105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Uxa S., Castillo-Binder P., Kohler R., Stangner K., Müller G.A., Engeland K. Ki-67 gene expression. Cell Death Differ. 2021;28:3357–3370. doi: 10.1038/s41418-021-00823-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Robert G., Gaggioli C., Bailet O., Chavey C., Abbe P., Aberdam E., Sabatié E., Cano A., Garcia de Herreros A., Ballotti R., Tartare-Deckert S. SPARC represses E-cadherin and induces mesenchymal transition during melanoma development. Cancer Res. 2006;66:7516–7523. doi: 10.1158/0008-5472.CAN-05-3189. [DOI] [PubMed] [Google Scholar]
- 46.Sarrió D., Rodriguez-Pinilla S.M., Hardisson D., Cano A., Moreno-Bueno G., Palacios J. Epithelial-mesenchymal transition in breast cancer relates to the basal-like phenotype. Cancer Res. 2008;68:989–997. doi: 10.1158/0008-5472.CAN-07-2017. [DOI] [PubMed] [Google Scholar]
- 47.Montagner M., Enzo E., Forcato M., Zanconato F., Parenti A., Rampazzo E., Basso G., Leo G., Rosato A., Bicciato S., et al. SHARP1 suppresses breast cancer metastasis by promoting degradation of hypoxia-inducible factors. Nature. 2012;487:380–384. doi: 10.1038/nature11207. [DOI] [PubMed] [Google Scholar]
- 48.Bigot P., Colli L.M., Machiela M.J., Jessop L., Myers T.A., Carrouget J., Wagner S., Roberson D., Eymerit C., Henrion D., et al. Functional characterization of the 12p12.1 renal cancer-susceptibility locus implicates BHLHE41. Nat. Commun. 2016;7 doi: 10.1038/ncomms12098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Duncan R., Carpenter B., Main L.C., Telfer C., Murray G.I. Characterisation and protein expression profiling of annexins in colorectal cancer. Br. J. Cancer. 2008;98:426–433. doi: 10.1038/sj.bjc.6604128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Flavahan W.A., Gaskell E., Bernstein B.E. Epigenetic plasticity and the hallmarks of cancer. Science. 2017;357 doi: 10.1126/science.aal2380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Burdziak C., Alonso-Curbelo D., Walle T., Reyes J., Barriga F.M., Haviv D., Xie Y., Zhao Z., Zhao C.J., Chen H.A., et al. Epigenetic plasticity cooperates with cell-cell interactions to direct pancreatic tumorigenesis. Science. 2023;380 doi: 10.1126/science.add5327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Shipony Z., Mukamel Z., Cohen N.M., Landan G., Chomsky E., Zeliger S.R., Fried Y.C., Ainbinder E., Friedman N., Tanay A. Dynamic and static maintenance of epigenetic memory in pluripotent and somatic cells. Nature. 2014;513:115–119. doi: 10.1038/nature13458. [DOI] [PubMed] [Google Scholar]
- 53.Nashun B., Hill P.W.S., Hajkova P. Reprogramming of cell fate: epigenetic memory and the erasure of memories past. EMBO J. 2015;34:1296–1308. doi: 10.15252/embj.201490649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pastushenko I., Blanpain C. EMT Transition States during Tumor Progression and Metastasis. Trends Cell Biol. 2019;29:212–226. doi: 10.1016/j.tcb.2018.12.001. [DOI] [PubMed] [Google Scholar]
- 55.McDonald O.G., Wu H., Timp W., Doi A., Feinberg A.P. Genome-scale epigenetic reprogramming during epithelial-to-mesenchymal transition. Nat. Struct. Mol. Biol. 2011;18:867–874. doi: 10.1038/nsmb.2084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zalmas L.P., Zhao X., Graham A.L., Fisher R., Reilly C., Coutts A.S., La Thangue N.B. DNA-damage response control of E2F7 and E2F8. EMBO Rep. 2008;9:252–259. doi: 10.1038/sj.embor.7401158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Krebs A.M., Mitschke J., Lasierra Losada M., Schmalhofer O., Boerries M., Busch H., Boettcher M., Mougiakakos D., Reichardt W., Bronsert P., et al. The EMT-activator Zeb1 is a key factor for cell plasticity and promotes metastasis in pancreatic cancer. Nat. Cell Biol. 2017;19:518–529. doi: 10.1038/ncb3513. [DOI] [PubMed] [Google Scholar]
- 58.Caramel J., Ligier M., Puisieux A. Pleiotropic Roles for ZEB1 in Cancer. Cancer Res. 2018;78:30–35. doi: 10.1158/0008-5472.CAN-17-2476. [DOI] [PubMed] [Google Scholar]
- 59.Esfandbod M., Naderi M., Sadatnaseri A., Ahmadi A., Noroozi M., Sadeghi Joni S. Evaluation of the Preventive Effects of Carvedilol on Trastuzumab-Induced Cardiotoxicity in Early-Stage and Locally Advanced HER2-Positive Breast Cancer Patients. Int. J. Hematol. Oncol. Stem Cell Res. 2021;15:206–212. doi: 10.18502/ijhoscr.v15i4.7475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Chen Y., Wang T., Huang M., Liu Q., Hu C., Wang B., Han D., Chen C., Zhang J., Li Z., et al. MAFB Promotes Cancer Stemness and Tumorigenesis in Osteosarcoma through a Sox9-Mediated Positive Feedback Loop. Cancer Res. 2020;80:2472–2483. doi: 10.1158/0008-5472.CAN-19-1764. [DOI] [PubMed] [Google Scholar]
- 61.Eychène A., Rocques N., Pouponnot C. A new MAFia in cancer. Nat. Rev. Cancer. 2008;8:683–693. doi: 10.1038/nrc2460. [DOI] [PubMed] [Google Scholar]
- 62.Calissi G., Lam E.W.-F., Link W. Therapeutic strategies targeting FOXO transcription factors. Nat. Rev. Drug Discov. 2021;20:21–38. doi: 10.1038/s41573-020-0088-2. [DOI] [PubMed] [Google Scholar]
- 63.Xiong F., Liu W., Wang X., Wu G., Wang Q., Guo T., Huang W., Wang B., Chen Y. HOXA5 inhibits the proliferation of extrahepatic cholangiocarcinoma cells by enhancing MXD1 expression and activating the p53 pathway. Cell Death Dis. 2022;13:829. doi: 10.1038/s41419-022-05279-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Cascón A., Robledo M. MAX and MYC: a heritable breakup. Cancer Res. 2012;72:3119–3124. doi: 10.1158/0008-5472.CAN-11-3891. [DOI] [PubMed] [Google Scholar]
- 65.Celià-Terrassa T., Liu D.D., Choudhury A., Hang X., Wei Y., Zamalloa J., Alfaro-Aco R., Chakrabarti R., Jiang Y.Z., Koh B.I., et al. Normal and cancerous mammary stem cells evade interferon-induced constraint through the miR-199a-LCOR axis. Nat. Cell Biol. 2017;19:711–723. doi: 10.1038/ncb3533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Jalaguier S., Teyssier C., Nait Achour T., Lucas A., Bonnet S., Rodriguez C., Elarouci N., Lapierre M., Cavaillès V. Complex regulation of LCoR signaling in breast cancer cells. Oncogene. 2017;36:4790–4801. doi: 10.1038/onc.2017.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Pan Z., Xu T., Bao L., Hu X., Jin T., Chen J., Chen J., Qian Y., Lu X., Li L., et al. CREB3L1 promotes tumor growth and metastasis of anaplastic thyroid carcinoma by remodeling the tumor microenvironment. Mol. Cancer. 2022;21:190. doi: 10.1186/s12943-022-01658-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Feng Y.-X., Jin D.X., Sokol E.S., Reinhardt F., Miller D.H., Gupta P.B. Cancer-specific PERK signaling drives invasion and metastasis through CREB3L1. Nat. Commun. 2017;8:1079. doi: 10.1038/s41467-017-01052-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Saito A., Omura I., Imaizumi K. CREB3L1/OASIS: cell cycle regulator and tumor suppressor. FEBS J. 2024;291:4853–4866. doi: 10.1111/febs.17052. [DOI] [PubMed] [Google Scholar]
- 70.Okita Y., Kimura M., Xie R., Chen C., Shen L.T.W., Kojima Y., Suzuki H., Muratani M., Saitoh M., Semba K., et al. The transcription factor MAFK induces EMT and malignant progression of triple-negative breast cancer cells through its target GPNMB. Sci. Signal. 2017;10 doi: 10.1126/scisignal.aak9397. [DOI] [PubMed] [Google Scholar]
- 71.Liu Y., Deguchi Y., Tian R., Wei D., Wu L., Chen W., Xu W., Xu M., Liu F., Gao S., et al. Pleiotropic Effects of PPARD Accelerate Colorectal Tumorigenesis, Progression, and Invasion. Cancer Res. 2019;79:954–969. doi: 10.1158/0008-5472.CAN-18-1790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Wang X., Wang G., Shi Y., Sun L., Gorczynski R., Li Y.J., Xu Z., Spaner D.E. PPAR-delta promotes survival of breast cancer cells in harsh metabolic conditions. Oncogenesis. 2016;5 doi: 10.1038/oncsis.2016.41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Blum A.E., Ravillah D., Betrnet Consortium. Katabathula R.M., Khalighi S., Varadan V., Guda K. HNF4A Defines Molecular Subtypes and Vulnerability to Transforming Growth Factor β-Pathway Targeted Therapies in Cancers of the Distal Esophagus. Gastroenterology (New York, N. Y., 1943) 2022;163:1457–1460. doi: 10.1053/j.gastro.2022.07.073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Pan J., Silva T.C., Gull N., Yang Q., Plummer J.T., Chen S., Daigo K., Hamakubo T., Gery S., Ding L.W., et al. Lineage-Specific Epigenomic and Genomic Activation of Oncogene HNF4A Promotes Gastrointestinal Adenocarcinomas. Cancer Res. 2020;80:2722–2736. doi: 10.1158/0008-5472.CAN-20-0390. [DOI] [PubMed] [Google Scholar]
- 75.Rogerson C., Britton E., Withey S., Hanley N., Ang Y.S., Sharrocks A.D. Identification of a primitive intestinal transcription factor network shared between esophageal adenocarcinoma and its precancerous precursor state. Genome Res. 2019;29:723–736. doi: 10.1101/gr.243345.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Backx E., Wauters E., Baldan J., Van Bulck M., Michiels E., Heremans Y., De Paep D.L., Kurokawa M., Goyama S., Bouwens L., et al. MECOM permits pancreatic acinar cell dedifferentiation avoiding cell death under stress conditions. Cell Death Differ. 2021;28:2601–2615. doi: 10.1038/s41418-021-00771-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Ma Y., Kang B., Li S., Xie G., Bi J., Li F., An G., Liu B., Li J., Shen Y., et al. CRISPR-mediated MECOM depletion retards tumor growth by reducing cancer stem cell properties in lung squamous cell carcinoma. Mol. Ther. 2022;30:3341–3357. doi: 10.1016/j.ymthe.2022.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Kumar M.S., Armenteros-Monterroso E., East P., Chakravorty P., Matthews N., Winslow M.M., Downward J. HMGA2 functions as a competing endogenous RNA to promote lung cancer progression. Nature. 2014;505:212–217. doi: 10.1038/nature12785. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 79.Wang X., Liu X., Li A.Y.J., Chen L., Lai L., Lin H.H., Hu S., Yao L., Peng J., Loera S., et al. Overexpression of HMGA2 promotes metastasis and impacts survival of colorectal cancers. Clin. Cancer Res. 2011;17:2570–2580. doi: 10.1158/1078-0432.CCR-10-2542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Kleshchevnikov V., Shmatko A., Dann E., Aivazidis A., King H.W., Li T., Elmentaite R., Lomakin A., Kedlian V., Gayoso A., et al. Cell2location maps fine-grained cell types in spatial transcriptomics. Nat. Biotechnol. 2022;40:661–671. doi: 10.1038/s41587-021-01139-4. [DOI] [PubMed] [Google Scholar]
- 81.Varrone M., Tavernari D., Santamaria-Martínez A., Walsh L.A., Ciriello G. CellCharter reveals spatial cell niches associated with tissue remodeling and cell plasticity. Nat. Genet. 2024;56:74–84. doi: 10.1038/s41588-023-01588-4. [DOI] [PubMed] [Google Scholar]
- 82.Ribatti D., Tamma R., Annese T. Epithelial-Mesenchymal Transition in Cancer: A Historical Overview. Transl. Oncol. 2020;13 doi: 10.1016/j.tranon.2020.100773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Ianevski A., Nader K., Driva K., Senkowski W., Bulanova D., Moyano-Galceran L., Ruokoranta T., Kuusanmäki H., Ikonen N., Sergeev P., et al. Single-cell transcriptomes identify patient-tailored therapies for selective co-inhibition of cancer clones. Nat. Commun. 2024;15:8579. doi: 10.1038/s41467-024-52980-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Piva R., Ruggeri B., Williams M., Costa G., Tamagno I., Ferrero D., Giai V., Coscia M., Peola S., Massaia M., et al. CEP-18770: A novel, orally active proteasome inhibitor with a tumor-selective pharmacologic profile competitive with bortezomib. Blood. 2008;111:2765–2775. doi: 10.1182/blood-2007-07-100651. [DOI] [PubMed] [Google Scholar]
- 85.Valdez B.C., Brammer J.E., Li Y., Murray D., Liu Y., Hosing C., Nieto Y., Champlin R.E., Andersson B.S. Romidepsin targets multiple survival signaling pathways in malignant T cells. Blood Cancer J. 2015;5 doi: 10.1038/bcj.2015.83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Burtness B., Gibson M., Egleston B., Mehra R., Thomas L., Sipples R., Quintanilla M., Lacy J., Watkins S., Murren J.R., Forastiere A.A. Phase II trial of docetaxel-irinotecan combination in advanced esophageal cancer. Ann. Oncol. 2009;20:1242–1248. doi: 10.1093/annonc/mdn787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Ahn D.H., Barzi A., Ridinger M., Samuëlsz E., Subramanian R.A., Croucher P.J.P., Smeal T., Kabbinavar F.F., Lenz H.J. Onvansertib in combination with FOLFIRI and bevacizumab in second-line treatment of KRAS-mutant metastatic colorectal cancer: A phase Ib clinical study. Clin. Cancer Res. 2024;30:2039–2047. doi: 10.1158/1078-0432.CCR-23-3053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Wang R., Song S., Qin J., Yoshimura K., Peng F., Chu Y., Li Y., Fan Y., Jin J., Dang M., et al. Evolution of immune and stromal cell states and ecotypes during gastric adenocarcinoma progression. Cancer Cell. 2023;41:1407–1426. doi: 10.1016/j.ccell.2023.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Aggarwal V., Montoya C.A., Donnenberg V.S., Sant S. Interplay between tumor microenvironment and partial EMT as the driver of tumor progression. iScience. 2021;24 doi: 10.1016/j.isci.2021.102113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Su S., Liu Q., Chen J., Chen J., Chen F., He C., Huang D., Wu W., Lin L., Huang W., et al. A positive feedback loop between mesenchymal-like cancer cells and macrophages is essential to breast cancer metastasis. Cancer Cell. 2014;25:605–620. doi: 10.1016/j.ccr.2014.03.021. [DOI] [PubMed] [Google Scholar]
- 91.Wu J., Shen S., Liu T., Ren X., Zhu C., Liang Q., Cui X., Chen L., Cheng P., Cheng W., Wu A. Chemerin enhances mesenchymal features of glioblastoma by establishing autocrine and paracrine networks in a CMKLR1-dependent manner. Oncogene. 2022;41:3024–3036. doi: 10.1038/s41388-022-02295-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Fischer D.S., Schaar A.C., Theis F.J. Modeling intercellular communication in tissues using spatial graphs of cells. Nat. Biotechnol. 2023;41:332–336. doi: 10.1038/s41587-022-01467-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Cang Z., Zhao Y., Almet A.A., Stabell A., Ramos R., Plikus M.V., Atwood S.X., Nie Q. Screening cell-cell communication in spatial transcriptomics via collective optimal transport. Nat. Methods. 2023;20:218–228. doi: 10.1038/s41592-022-01728-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Barker T.H., Hagood J.S. Getting a grip on Thy-1 signaling. Biochim. Biophys. Acta. 2009;1793:921–923. doi: 10.1016/j.bbamcr.2008.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Wetzel A., Chavakis T., Preissner K.T., Sticherling M., Haustein U.F., Anderegg U., Saalbach A. Human Thy-1 (CD90) on activated endothelial cells is a counterreceptor for the leukocyte integrin Mac-1 (CD11b/CD18) J. Immunol. 2004;172:3850–3859. doi: 10.4049/jimmunol.172.6.3850. [DOI] [PubMed] [Google Scholar]
- 96.Lu H., Clauser K.R., Tam W.L., Fröse J., Ye X., Eaton E.N., Reinhardt F., Donnenberg V.S., Bhargava R., Carr S.A., Weinberg R.A. A breast cancer stem cell niche supported by juxtacrine signalling from monocytes and macrophages. Nat. Cell Biol. 2014;16:1105–1117. doi: 10.1038/ncb3041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Rege T.A., Hagood J.S. Thy-1 as a regulator of cell-cell and cell-matrix interactions in axon regeneration, apoptosis, adhesion, migration, cancer, and fibrosis. FASEB J. 2006;20:1045–1054. doi: 10.1096/fj.05-5460rev. [DOI] [PubMed] [Google Scholar]
- 98.Yan P., Jimenez E.R., Li Z., Bui T., Seehawer M., Nishida J., Foidart P., Stevens L.E., Xie Y., Gomez M.M., et al. Midkine as a driver of age-related changes and increase in mammary tumorigenesis. Cancer Cell. 2024;42:1936–1954. doi: 10.1016/j.ccell.2024.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Cerezo-Wallis D., Contreras-Alcalde M., Troulé K., Catena X., Mucientes C., Calvo T.G., Cañón E., Tejedo C., Pennacchi P.C., Hogan S., et al. Midkine rewires the melanoma microenvironment toward a tolerogenic and immune-resistant state. Nat. Med. 2020;26:1865–1877. doi: 10.1038/s41591-020-1073-3. [DOI] [PubMed] [Google Scholar]
- 100.Filippou P.S., Karagiannis G.S., Constantinidou A. Midkine (MDK) growth factor: a key player in cancer progression and a promising therapeutic target. Oncogene. 2020;39:2040–2054. doi: 10.1038/s41388-019-1124-8. [DOI] [PubMed] [Google Scholar]
- 101.Olmeda D., Cerezo-Wallis D., Riveiro-Falkenbach E., Pennacchi P.C., Contreras-Alcalde M., Ibarz N., Cifdaloz M., Catena X., Calvo T.G., Cañón E., et al. Whole-body imaging of lymphovascular niches identifies pre-metastatic roles of midkine. Nature. 2017;546:676–680. doi: 10.1038/nature22977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.He X.-X., Yang J., Ding Y.W., Liu W., Shen Q.Y., Xia H.H.X. Increased epithelial and serum expression of macrophage migration inhibitory factor (MIF) in gastric cancer: potential role of MIF in gastric carcinogenesis. Gut. 2006;55:797–802. doi: 10.1136/gut.2005.078113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.He X.-X., Chen K., Yang J., Li X.Y., Gan H.Y., Liu C.Y., Coleman T.R., Al-Abed Y. Macrophage migration inhibitory factor promotes colorectal cancer. Mol. Med. 2009;15:1–10. doi: 10.2119/molmed.2008.00107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Charan M., Das S., Mishra S., Chatterjee N., Varikuti S., Kaul K., Misri S., Ahirwar D.K., Satoskar A.R., Ganju R.K. Macrophage migration inhibitory factor inhibition as a novel therapeutic approach against triple-negative breast cancer. Cell Death Dis. 2020;11:774. doi: 10.1038/s41419-020-02992-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Balogh K.N., Templeton D.J., Cross J.V. Macrophage Migration Inhibitory Factor protects cancer cells from immunogenic cell death and impairs anti-tumor immune responses. PLoS One. 2018;13 doi: 10.1371/journal.pone.0197702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Bucala R., Donnelly S.C. Macrophage migration inhibitory factor: a probable link between inflammation and cancer. Immunity (Camb., Mass.) 2007;26:281–285. doi: 10.1016/j.immuni.2007.03.005. [DOI] [PubMed] [Google Scholar]
- 107.Whiting F.J.H., Househam J., Baker A.-M., Sottoriva A., Graham T.A. Phenotypic noise and plasticity in cancer evolution. Trends Cell Biol. 2024;34:451–464. doi: 10.1016/j.tcb.2023.10.002. [DOI] [PubMed] [Google Scholar]
- 108.Heide T., Househam J., Cresswell G.D., Spiteri I., Lynn C., Mossner M., Kimberley C., Fernandez-Mateos J., Chen B., Zapata L., et al. The co-evolution of the genome and epigenome in colorectal cancer. Nature. 2022;611:733–743. doi: 10.1038/s41586-022-05202-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Chaligne R., Gaiti F., Silverbush D., Schiffman J.S., Weisman H.R., Kluegel L., Gritsch S., Deochand S.D., Gonzalez Castro L.N., Richman A.R., et al. Epigenetic encoding, heritability and plasticity of glioma transcriptional cell states. Nat. Genet. 2021;53:1469–1479. doi: 10.1038/s41588-021-00927-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Househam J., Heide T., Cresswell G.D., Spiteri I., Kimberley C., Zapata L., Lynn C., James C., Mossner M., Fernandez-Mateos J., et al. Phenotypic plasticity and genetic control in colorectal cancer evolution. Nature. 2022;611:744–753. doi: 10.1038/s41586-022-05311-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.LaFave L.M., Kartha V.K., Ma S., Meli K., Del Priore I., Lareau C., Naranjo S., Westcott P.M.K., Duarte F.M., Sankar V., et al. Epigenomic State Transitions Characterize Tumor Progression in Mouse Lung Adenocarcinoma. Cancer Cell. 2020;38:212–228. doi: 10.1016/j.ccell.2020.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Mo C.-K., Liu J., Chen S., Storrs E., Targino da Costa A.L.N., Houston A., Wendl M.C., Jayasinghe R.G., Iglesia M.D., Ma C., et al. Tumour evolution and microenvironment interactions in 2D and 3D space. Nature. 2024;634:1178–1186. doi: 10.1038/s41586-024-08087-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Son B., Lee S., Youn H., Kim E., Kim W., Youn B. The role of tumor microenvironment in therapeutic resistance. Oncotarget. 2017;8:3933–3945. doi: 10.18632/oncotarget.13907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Martin J.D., Cabral H., Stylianopoulos T., Jain R.K. Improving cancer immunotherapy using nanomedicines: progress, opportunities and challenges. Nat. Rev. Clin. Oncol. 2020;17:251–266. doi: 10.1038/s41571-019-0308-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Shibue T., Weinberg R.A. EMT, CSCs, and drug resistance: the mechanistic link and clinical implications. Nat. Rev. Clin. Oncol. 2017;14:611–629. doi: 10.1038/nrclinonc.2017.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Barkley D., Moncada R., Pour M., Liberman D.A., Dryg I., Werba G., Wang W., Baron M., Rao A., Xia B., et al. Cancer cell states recur across tumor types and form specific interactions with the tumor microenvironment. Nat. Genet. 2022;54:1192–1201. doi: 10.1038/s41588-022-01141-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Barnas J.L., Simpson-Abelson M.R., Brooks S.P., Kelleher R.J., Jr., Bankert R.B. Reciprocal functional modulation of the activation of T lymphocytes and fibroblasts derived from human solid tumors. J. Immunol. 2010;185:2681–2692. doi: 10.4049/jimmunol.1000896. [DOI] [PubMed] [Google Scholar]
- 118.Li M., Wu B., Li L., Lv C., Tian Y. Reprogramming of cancer-associated fibroblasts combined with immune checkpoint inhibitors: A potential therapeutic strategy for cancers. Biochim. Biophys. Acta. Rev. Cancer. 2023;1878 doi: 10.1016/j.bbcan.2023.188945. [DOI] [PubMed] [Google Scholar]
- 119.Barrett R.L., Puré E. Cancer-associated fibroblasts and their influence on tumor immunity and immunotherapy. eLife. 2020;9 doi: 10.7554/eLife.57243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Liu K., Cui Y., Han H., Guo E., Shi X., Xiong K., Zhang N., Zhai S., Sang S., Liu M., et al. Fibroblast atlas: Shared and specific cell types across tissues. Sci. Adv. 2025;11 doi: 10.1126/sciadv.ado0173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Liu Y., Sinjab A., Min J., Han G., Paradiso F., Zhang Y., Wang R., Pei G., Dai Y., Liu Y., et al. Conserved spatial subtypes and cellular neighborhoods of cancer-associated fibroblasts revealed by single-cell spatial multi-omics. Cancer Cell. 2025;43:905–924. doi: 10.1016/j.ccell.2025.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Slyper M., Porter C.B.M., Ashenberg O., Waldman J., Drokhlyansky E., Wakiro I., Smillie C., Smith-Rosario G., Wu J., Dionne D., et al. A single-cell and single-nucleus RNA-Seq toolbox for fresh and frozen human tumors. Nat. Med. 2020;26:792–802. doi: 10.1038/s41591-020-0844-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Fleming S.J., Chaffin M.D., Arduini A., Akkad A.D., Banks E., Marioni J.C., Philippakis A.A., Ellinor P.T., Babadi M. Unsupervised removal of systematic background noise from droplet-based single-cell experiments using CellBender. Nat. Methods. 2023;20:1323–1335. doi: 10.1038/s41592-023-01943-7. [DOI] [PubMed] [Google Scholar]
- 124.Wolock S.L., Lopez R., Klein A.M. Computational Identification of Cell Doublets in Single-Cell Transcriptomic Data. Cell Syst. 2019;8:281–291. doi: 10.1016/j.cels.2018.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Wolf F.A., Angerer P., Theis F.J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19:15. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Subramanian A., Alperovich M., Yang Y., Li B. Biology-inspired data-driven quality control for scientific discovery in single-cell transcriptomics. Genome Biol. 2022;23:267. doi: 10.1186/s13059-022-02820-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Stuart T., Srivastava A., Madad S., Lareau C.A., Satija R. Single-cell chromatin state analysis with Signac. Nat. Methods. 2021;18:1333–1341. doi: 10.1038/s41592-021-01282-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Zhang Y., Liu T., Meyer C.A., Eeckhoute J., Johnson D.S., Bernstein B.E., Nusbaum C., Myers R.M., Brown M., Li W., Liu X.S. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9 doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Kotliar D., Veres A., Nagy M.A., Tabrizi S., Hodis E., Melton D.A., Sabeti P.C. Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq. eLife. 2019;8 doi: 10.7554/eLife.43803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Subramanian A., Tamayo P., Mootha V.K., Mukherjee S., Ebert B.L., Gillette M.A., Paulovich A., Pomeroy S.L., Golub T.R., Lander E.S., Mesirov J.P. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Liberzon A., Birger C., Thorvaldsdóttir H., Ghandi M., Mesirov J.P., Tamayo P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 2015;1:417–425. doi: 10.1016/j.cels.2015.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Palla G., Spitzer H., Klein M., Fischer D., Schaar A.C., Kuemmerle L.B., Rybakov S., Ibarra I.L., Holmberg O., Virshup I., et al. Squidpy: a scalable framework for spatial omics analysis. Nat. Methods. 2022;19:171–178. doi: 10.1038/s41592-021-01358-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Elyanow R., Zeira R., Land M., Raphael B.J. STARCH: copy number and clone inference from spatial transcriptomics data. Phys. Biol. 2021;18 doi: 10.1088/1478-3975/abbe99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Blampey Q., Mulder K., Gardet M., Christodoulidis S., Dutertre C.A., André F., Ginhoux F., Cournède P.H. Sopa: a technology-invariant pipeline for analyses of image-based spatial omics. Nat. Commun. 2024;15:4981. doi: 10.1038/s41467-024-48981-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Stringer C., Pachitariu M. Cellpose3: one-click image restoration for improved cellular segmentation. Nat. Methods. 2025;22:592–599. doi: 10.1038/s41592-025-02595-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Lopez R., Regier J., Cole M.B., Jordan M.I., Yosef N. Deep generative modeling for single-cell transcriptomics. Nat. Methods. 2018;15:1053–1058. doi: 10.1038/s41592-018-0229-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Aibar S., González-Blas C.B., Moerman T., Huynh-Thu V.A., Imrichova H., Hulselmans G., Rambow F., Marine J.C., Geurts P., Aerts J., et al. SCENIC: single-cell regulatory network inference and clustering. Nat. Methods. 2017;14:1083–1086. doi: 10.1038/nmeth.4463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Lambert S.A., Jolma A., Campitelli L.F., Das P.K., Yin Y., Albu M., Chen X., Taipale J., Hughes T.R., Weirauch M.T. The Human Transcription Factors. Cell. 2018;172:650–665. doi: 10.1016/j.cell.2018.01.029. [DOI] [PubMed] [Google Scholar]
- 139.Liu J., Lichtenberg T., Hoadley K.A., Poisson L.M., Lazar A.J., Cherniack A.D., Kovatich A.J., Benz C.C., Levine D.A., Lee A.V., et al. An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell. 2018;173:400–416. doi: 10.1016/j.cell.2018.02.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140.Barbie D.A., Tamayo P., Boehm J.S., Kim S.Y., Moody S.E., Dunn I.F., Schinzel A.C., Sandy P., Meylan E., Scholl C., et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature. 2009;462:108–112. doi: 10.1038/nature08460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Chu T., Wang Z., Pe’er D., Danko C.G. Cell type and gene expression deconvolution with BayesPrism enables Bayesian integrative analysis across bulk and single-cell RNA sequencing in oncology. Nat. Cancer. 2022;3:505–517. doi: 10.1038/s43018-022-00356-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Only OncoPanel tested variants present in at least one sample are indicated.
The numbers correspond to the estimated number of copies of the gene (2 = no event, >4 = amplification, 0 = two copy deletion). Only genes present in at least one sample are indicated.
1 indicates the presence of the SV.
Drugs predicted to be efficacious against cells expressing specific programs are indicated by a 1.
Data Availability Statement
-
•
Raw sequencing data for single-nuclei RNA- and ATAC-seq generated in this study are deposited in the database of Genotypes and Phenotypes (https://www.ncbi.nlm.nih.gov/gap/) with accession number phs003438.v1. Processed data are available at the Broad Single Cell Portal (https://singlecell.broadinstitute.org/single_cell) (SCP: SCP3028 and SCP3029). Raw and processed data for Visium and Xenium ST profiled samples are available on Zenodo (https://zenodo.org/): with DOI https://doi.org/10.5281/zenodo.15341263. Existing single-cell and bulk sequencing data can be downloaded from EGA (EGAS00001006468, EGAS00001006469) for Carroll et al.’s paper. Remaining existing single-cell data can be downloaded from the Gene Expression Omnibus (GEO) website, accession numbers GSE222078 (Croft et al.) and GSE210347 (Luo et al.). Remaining existing bulk data can be downloaded from the GEO website, accession number GSE207527, for Hoefnagel et al.’s data, and from the UCSC Xena browser for the Cancer Genome Atlas ESCA cohort (https://xenabrowser.net/datapages/?cohort=TCGA%20Esophageal%20Cancer%20(ESCA)&removeHub=https%3A%2F%2Fxena.treehouse.gi.ucsc.edu%3A443). Details on how to download the data used for the analysis are detailed on GitHub: https://github.com/vanallenlab/EAC-multiome. Source data are provided with this paper.
-
•
All the code needed to reproduce this analysis is available on Zenodo (DOI: https://doi.org/10.5281/zenodo.15269379) and GitHub at the following address: https://github.com/vanallenlab/EAC-multiome.
-
•
Any additional information required to reanalyze the data reported in this work paper is available from the lead contact upon request.







