Abstract
The heterogeneity and the complex cellular architecture have a crucial effect on breast cancer progression and response to treatment. However, deciphering the neoplastic subtypes and their spatial organization is still challenging. Here, we combine single-nucleus RNA sequencing (snRNA-seq) with a microarray-based spatial transcriptomics (ST) to identify cell populations and their spatial distribution in breast cancer tissues. Malignant cells are clustered into distinct subpopulations. These cell clusters not only have diverse features, origins and functions, but also emerge to the crosstalk within subtypes. Furthermore, we find that these subclusters are mapped in distinct tissue regions, where discrepant enrichment of stromal cell types are observed. We also inferred the abundance of these tumorous subpopulations by deconvolution of large breast cancer RNA-seq cohorts, revealing differential association with patient survival and therapeutic response. Our study provides a novel insight for the cellular architecture of breast cancer and potential therapeutic strategies.
Supplementary Information
The online version contains supplementary material available at 10.1186/s13045-022-01236-0.
Keywords: Single-nucleus RNA sequencing, Spatial transcriptomics, Breast cancer, Tissue architecture, Heterogeneity
To the editor,
A universal feature of breast cancer (BC) is the complex cellular ecosystems, whereby intra- and inter-tumor heterogeneity is crucial for determining malignant progression and response to treatment. In this study, we aimed to decipher the heterogeneity and complex architecture of breast cancer by combining two complementary high-resolution omics tools [1, 2].
We analyzed 2 primary tumors (BC-A and BC-B) by single-nucleus RNA sequencing (snRNA-seq, 10 × genomics) [2, 3]. Fresh samples from BC-A were analyzed in parallel by spatial transcriptomics (ST, 10 × genomics) (Fig. 1a) [4, 5]. By analyzing the transcriptomes, we detected 9 distinct cell clusters in the BC-A tumors by using canonical lineage markers (Fig. 1b). The epithelial cells were further re-clustered, and 6 subclusters were identified, respectively (Fig. 1c). The copy number variation (CNV) profiles of the BC-A tumor cells show a substantial degree of heterogeneity, suggesting that the origin of these subclusters was variable. Chr8q gain in genomic unstable luminal-A cells was found in the BC-A profile (Additional file 1: Figure S1A). Furthermore, the malignant subclusters were annotated with cluster-specific genes mainly including the basal subtype (EGFR+KIT+ basal cells and EGFR+TP63+ normal-like cells) and luminal subtype (cycling luminal-B cells, ERBB4+ luminal-A cells, ERBB4+AREG+ luminal-A cells and genomic unstable luminal-A cells) in the BC-A sample (Fig. 1c–e) (Additional file 2: Result S1). To characterize the functional features of these distinct cancer clusters, we dissected their differentially expressed genes. Cycling luminal-B (LumB) cells were uniquely enriched for hallmarks of the cell cycle and proliferation (e.g., E2F and MYC TARGETS), and this cluster exhibited a high metabolic state with the activation of glycatabolism, fatty acid and cholesterol metabolism potentially regulated by PI3K/AKT/mTOR signaling, hypoxia and MYC (Fig. 1f, Additional file 1: Figure S1B-C). In contrast, genomic unstable luminal-A (LumA) cells only showed an activated KRAS-downregulated signaling (Fig. 1f). Next, we asked whether transcription factors (TFs) contribute to the phenotypic state of these subpopulations (Fig. 1g). First, GATA3, a key mediator during the differentiation from luminal progenitor to mature luminal cells, was active in these luminal clusters. AP-1 transcription factor complexes, including JUN, JUNB, JUND, FOS and FOSB, were enriched in ERBB4+AREG+ LumA, genomic unstable LumA, EGFR+KIT+ basal and cycling LumB clusters. Likewise, ERBB4+AREG+LumA cluster also highly expressed SREBF1/2 which are crucial for controlling cholesterol homeostasis [6]. Pseudotime analysis revealed trajectories of luminal and basal/myoepithelial lineages (Fig. 1h). The six major cell clusters were corroborated by typical mammary lineage marker genes for basal progenitor/myoepithelium (e.g., MYLK and ACTA2), mature luminal cells (e.g., ESR1, FOXA1, PGR and AFF3) and luminal progenitor cells (e.g., EZH2) (Additional file 1: Figure S3C-D) [7]. Moreover, the myoepithelial cells stemmed from basal progenitor cells, while luminal progenitor cells gradually differentiated into mature luminal cells (Fig. 1h). Interestingly, a cluster of luminal cells expressed certain level of basal-like genes, suggesting that they may emerge as a dedifferentiated state or represent invasiveness. EGFR+TP63+ normal-like subtype was fully distributed in the left arm, while cycling LumB cluster was completely situated in the arm of the luminal progenitor (Fig. 1h). Then, we investigated the interactions between ligands and receptors across all neoplastic populations. Interestingly, ERBB4+AREG+ LumA cells secreted increasing EGF and EGF-like cytokine-like AREG, which could combine with EGF receptors such as EGFR and ERBB4 (Fig. 1i, Additional file 1: Figure S1D) (Additional file 2: Result S2).
Fig. 1.
snRNA-seq analysis of tumor from sample BC-A. a Schematic of the single-nucleus RNA-seq and ST experiment and analysis. b UMAP visualization of 4,093 nuclei from BC-A tumor analyzed by snRNA-seq showing nine major cell types. c UMAP visualization of inferred epithelial cells from BC-A tumor analyzed by snRNA-seq. Clusters are colored and labeled according to their inferred cell subtypes. d Feature plot of subcluster-specific marker genes in epithelial cells. e Heatmap of differentially expressed genes in each epithelial subcluster. The color bars above the heatmap reflects the subcluster PAM50 subtype estimated by ‘pseudobulk.’ f Heatmap showing the pathway enrichment of each epithelial subcluster using MSigDB HALLMARK gene sets. Mean score of GSVA was z-score transformed. g Heatmap of the area under the curve (AUC) scores of TF motifs estimated per cell by SCENIC. h Differentiation trajectories of epithelial cells by Monocle 2. i Circos plot showing the interactions between ligands and receptors across cell types
Subsequently, we sought to annotate the intratumoral heterogeneity by integrating the snRNA-seq and ST datasets [5]. The ST dataset was classified into six main areas: the luminal region, basal region, the interfacing area between the luminal region and basal region, stroma and infiltrating lymphocyte areas based on the principal component scores across all ST spots (Fig. 2a, b, Additional file 1: S5C). We found that these luminal subtypes were principally assembled in the luminal region and the interfacing area except that the cycling LumB cells were scattered throughout the tissue. Interestingly, the ERBB4+ LumA type scarcely existed in the interfacing area (Fig. 2a, b, Additional file 1: S5B). Finally, the spatial division of the primary nontumor cells was identified in the ST dataset, showing that neutrophils were enriched in the luminal region, while B cells principally infiltrated into the basal region (Fig. 2c) (Additional file 2: Result S3).
Fig. 2.
ST analysis of BC-A and cell type deconvolution. a Scaled deconvolution values for six epithelial subclusters overlaid onto tissue spots. b Different spatial distribution of six groups as defined in a. c Heatmap of estimated scores of immune cells in each group by MCPcounter. d Heatmap of the ssGSEA score of six gene signatures estimated in each METABRIC sample. e Kaplan–Meier survival curve for METABRIC cohort in three groups. P value was calculated with log-rank test. Log-rank p value < 0.05 was considered as statistically significant. f Box plot of the estimated proportion of six gene signatures in two NAC cohorts using CIBERSORTx. Statistical significance was determined using a two-sided t-test in a pairwise comparison of means between groups, with P values adjusted using the Benjamini–Hochberg procedure. *P < 0.05, **P < 0.005, ***P < 0.005
We deconvoluted breast cancer profiles in the METABRIC cohort [8]. Based on the defined gene signature, all of cases were divided into three discriminated subgroups (DSGs). And DSG2, representing the HER2-overexpressing subtypes based on PAM50 was designated the genomic unstable LumA subtypes (Fig. 2d). For survival analysis, patients representing the DSG2 subtype were associated with the worst survival (Fig. 2e). Through examining public datasets of patients treated with chemotherapy[9], the proportions of the ERBB4+ LumA cells or EGFR+TP63+ normal-like cells were associated with poor response to neoadjuvant chemotherapy (NAC) while the ratio of the cycling LumB cluster significantly increased in patients who response to NAC (Fig. 2f) (Additional file 2: Result S4, Additional file 3).
In summary, we have provided a comprehensive approach to depict the heterogeneity and the architecture of breast cancer. Our study could provide novel insights into the ecosystem of breast cancer and novel therapeutic strategies.
Supplementary Information
Additional file 3. Materials and methods.
Acknowledgements
We thank a professional language editing by American Journal Experts.
Abbreviations
- snRNA-seq
Single-nucleus RNA sequencing
- ST
Spatial transcriptomics
- BC
Breast cancer
- CNV
Copy number variation
- LumA
Luminal-A
- LumB
Luminal-B
- AREG
Amphiregulin
- EGF
Epidermal growth factor
- TGFα
Transforming growth factor alpha
- EMT
Epithelial–mesenchymal transition
- PTN
Pleiotrophin
- EFNA5
Ephrin
- TFs
Transcription factors
- LTF
Lactotransferrin
- ECM
Extracellular matrix
- H&E
Hematoxylin and eosin
- DCIS
Ductal carcinoma in situ
- IHC
Immunohistochemistry
- ER
Estrogen receptor
- PR
Progesterone receptor
- DSG
Discriminated subgroups
- NAC
Neoadjuvant chemotherapy
- pCR
Pathologic complete response
- RD
Residual disease
Authors' contributions
QW contributed to conceptualization; XYM and JW were involved in methodology; SQL, ZJG and JW analyzed the data; QW, SQL, ZJG and XYM wrote and prepared the original draft; SQL and ZJG took part in writing, reviewing and editing; HMZ conducted the supervision; BL participated in project administration; and SS acquired the funding. All authors read and approved the final manuscript.
Funding
This work was supported by a National Natural Science Foundation of China (NSFC) grant (Grant NO: 81471781) to Prof. Sheng-Rong Sun and (Grant NO: 81903166) to Dr. Si Sun. This work was also supported by a Hubei Province health and family planning scientific research project to Dr. Si Sun (Grant No. WJ2019Q044) and a project (WJ2019Q053) to Dr. Hong-Mei Zheng. And this work was supported by a Fundamental Research Funds for the Central Universities of China grant (Grant NO: 413000480) to Dr. Qi Wu. Finally, this work was supported by the Natural Science Foundation of Hubei (Grant No. 2018CKB916) to Dr. Si Sun and a project (2020CFB874) to Dr. Hong-Mei Zheng. Dr. Xiang-Yu Meng was supported by a fellowship from ITMO Cancer AVIESAN within the framework of Cancer Plan.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Si-Qing Liu, Zhi-Jie Gao and Juan Wu have contributed equally to this work.
Contributor Information
Si Sun, Email: 59333173@qq.com.
Xiang-Yu Meng, Email: mengxy_whu@163.com.
Qi Wu, Email: waiwai@whu.edu.cn.
References
- 1.Moncada R, Barkley D, Wagner F, Chiodin M, Devlin JC, Baron M, Hajdu CH, Simeone DM, Yanai I. Integrating microarray-based spatial transcriptomics and single-cell RNA-seq reveals tissue architecture in pancreatic ductal adenocarcinomas. Nat Biotechnol. 2020;38(3):333–342. doi: 10.1038/s41587-019-0392-8. [DOI] [PubMed] [Google Scholar]
- 2.Lei Y, Tang R, Xu J, Wang W, Zhang B, Liu J, Yu X, Shi S. Applications of single-cell sequencing in cancer research: progress and perspectives. J Hematol Oncol. 2021;14(1):91. doi: 10.1186/s13045-021-01105-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wu SZ, Al-Eryani G, Roden DL, Junankar S, Harvey K, Andersson A, Thennavan A, Wang C, Torpy JR, Bartonicek N, Wang T, Larsson L, Kaczorowski D, Weisenfeld NI, Uytingco CR, Chew JG, Bent ZW, Chan CL, Gnanasambandapillai V, Dutertre CA, Gluch L, Hui MN, Beith J, Parker A, Robbins E, Segara D, Cooper C, Mak C, Chan B, Warrier S, Ginhoux F, Millar E, Powell JE, Williams SR, Liu XS, O'Toole S, Lim E, Lundeberg J, Perou CM, Swarbrick A. A single-cell and spatially resolved atlas of human breast cancers. Nat Genet. 2021;53(9):1334–1347. doi: 10.1038/s41588-021-00911-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Stahl PL, Salmen F, Vickovic S, Lundmark A, Navarro JF, Magnusson J, Giacomello S, Asp M, Westholm JO, Huss M, Mollbrink A, Linnarsson S, Codeluppi S, Borg A, Ponten F, Costea PI, Sahlen P, Mulder J, Bergmann O, Lundeberg J, Frisen J. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016;353(6294):78–82. doi: 10.1126/science.aaf2403. [DOI] [PubMed] [Google Scholar]
- 5.Fu T, Dai LJ, Wu SY, Xiao Y, Ma D, Jiang YZ, Shao ZM. Spatial architecture of the immune microenvironment orchestrates tumor immunity and therapeutic response. J Hematol Oncol. 2021;14(1):98. doi: 10.1186/s13045-021-01103-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Xu D, Wang Z, Xia Y, Shao F, Xia W, Wei Y, Li X, Qian X, Lee JH, Du L, Zheng Y, Lv G, Leu JS, Wang H, Xing D, Liang T, Hung MC, Lu Z. The gluconeogenic enzyme PCK1 phosphorylates INSIG1/2 for lipogenesis. Nature. 2020;580(7804):530–535. doi: 10.1038/s41586-020-2183-2. [DOI] [PubMed] [Google Scholar]
- 7.Hu L, Su L, Cheng H, Mo C, Ouyang T, Li J, Wang T, Fan Z, Fan T, Lin B, Zhang J, Xie Y. Single-Cell RNA Sequencing Reveals the Cellular Origin and Evolution of Breast Cancer in BRCA1 Mutation Carriers. Cancer Res. 2021;81(10):2600–2611. doi: 10.1158/0008-5472.CAN-20-2123. [DOI] [PubMed] [Google Scholar]
- 8.C. Curtis, S.P. Shah, S.F. Chin, G. Turashvili, O.M. Rueda, M.J. Dunning, D. Speed, A.G. Lynch, S. Samarajiwa, Y. Yuan, S. Graf, G. Ha, G. Haffari, A. Bashashati, R. Russell, S. McKinney, M. Group, A. Langerod, A. Green, E. Provenzano, G. Wishart, S. Pinder, P. Watson, F. Markowetz, L. Murphy, I. Ellis, A. Purushotham, A.L. Borresen-Dale, J.D. Brenton, S. Tavare, C. Caldas, S. Aparicio The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature. 2012;486(7403):346–352. doi: 10.1038/nature10983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Park YH, Lal S, Lee JE, Choi YL, Wen J, Ram S, Ding Y, Lee SH, Powell E, Lee SK, Yu JH, Ching KA, Nam JY, Kim SW, Nam SJ, Kim JY, Cho SY, Park S, Kim J, Hwang S, Kim YJ, Bonato V, Fernandez D, Deng S, Wang S, Shin H, Kang ES, Park WY, Rejto PA, Bienkowska J, Kan Z. Chemotherapy induces dynamic immune responses in breast cancers that impact treatment outcome. Nat Commun. 2020;11(1):6175. doi: 10.1038/s41467-020-19933-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Additional file 3. Materials and methods.
Data Availability Statement
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.