Skip to main content
Disease Models & Mechanisms logoLink to Disease Models & Mechanisms
. 2022 Jun 23;15(6):dmm049538. doi: 10.1242/dmm.049538

Identification of two cancer stem cell-like populations in triple-negative breast cancer xenografts

Jun Nakayama 1,2,3,*, Hiroko Matsunaga 4, Koji Arikawa 3,4, Takuya Yoda 2,4, Masahito Hosokawa 2,3,4, Haruko Takeyama 2,3,4, Yusuke Yamamoto 1, Kentaro Semba 2,5,*
PMCID: PMC9235877  PMID: 35611554

ABSTRACT

Gene expression analysis at the single-cell level by next-generation sequencing has revealed the existence of clonal dissemination and microheterogeneity in cancer metastasis. The current spatial analysis technologies can elucidate the heterogeneity of cell–cell interactions in situ. To reveal the regional and expressional heterogeneity in primary tumors and metastases, we performed transcriptomic analysis of microtissues dissected from a triple-negative breast cancer (TNBC) cell line MDA-MB-231 xenograft model with our automated tissue microdissection punching technology. This multiple-microtissue transcriptome analysis revealed three cancer cell-type clusters in the primary tumor and axillary lymph node metastasis, two of which were cancer stem cell (CSC)-like clusters (CD44/MYC-high, HMGA1-high). Reanalysis of public single-cell RNA-sequencing datasets confirmed that the two CSC-like populations existed in TNBC xenograft models and in TNBC patients. The diversity of these multiple CSC-like populations could cause differential anticancer drug resistance, increasing the difficulty of curing this cancer.

KEY WORDS: Breast cancer, Cancer, Cancer stem cell, Spatial transcriptome, Xenograft model, scRNA-seq


Summary: We identified two types of cancer stem cell (CSC)-like populations in triple-negative breast cancer xenografts and patients. These CSC-like populations could potentially make tumors more drug resistant and thus more difficult to treat.

INTRODUCTION

Breast cancer cells metastasize to multiple distant organs, such as the axillary lymph nodes, lungs, bone, liver and brain (Nakayama et al., 2021; Obenauf and Massagué, 2015). In particular, metastasis to axillary lymph nodes is an indicator of cancer grade in breast cancer patients (Giuliano et al., 2011). Most breast cancer tissues, including distant metastases, exhibit genetic heterogeneity (Gillies, 2022; McGranahan and Swanton, 2017). Single-cell analyses have revealed that cancer cells evolve through the acquisition of genomic mutations in the primary tumor and metastases (Echeverria et al., 2018; Yates et al., 2017). Most previous analyses have been performed using isolated cancer cells and stromal cells from cancer tissues. Thus, the cell–cell interactions between cancer cells and stromal cells remain to be analyzed. In recent studies, current single-cell analysis and spatial transcriptome technologies have revealed the heterogeneity of cell–cell interactions between cancer cells and stromal cells in situ (Andersson et al., 2021; Rao et al., 2021; Wu et al., 2021b); however, further analysis is needed to elucidate the nature of tumor heterogeneity.

Comprehensive gene expression analysis of metastases harvested from ∼500 specimens of various cancer types and metastatic organs (MET500 cohort) has suggested that metastatic tissues can be divided into several categories [e.g. proliferative or epithelial–mesenchymal transition (EMT)-like/inflammatory] (Robinson et al., 2017). In particular, some samples were found to show signatures of more than one category, suggesting that these samples have micro-intratumor heterogeneity. Heterogeneous tumors contain a small subpopulation of cancer stem cells (CSCs) able to induce anticancer drug resistance and metastasis (Oskarsson et al., 2014; Turdo et al., 2019; Weiss et al., 2022). To clarify such heterogeneity, microtissue sectioning using laser capture microdissection has often been performed (Civita et al., 2019). This method has several disadvantages, including the laborious and time-consuming nature of sample handling and a high risk of RNA degradation. Thus, in previous work, we developed a system involving automated tissue microdissection punching followed by transcriptomic analysis of the tumor microtissue (Yoda et al., 2017). This site-specific rapid sampling method by a hollow punching needle from frozen tissue enables low-cost molecular analyses to be performed on low-resolution spatially resolved tissue specimens. To analyze the expressional heterogeneity in microtissues from the primary tumor and axillary lymph node metastasis, we performed analysis of the spatial microtissue transcriptome in a xenograft model with the triple-negative breast cancer (TNBC) cell line MDA-MB-231. We focused on the expression profiles of known metastasis-promoting genes and CSC markers in dissected microtissues.

RESULTS

Sampling microtissues from primary tumor and axillary lymph node metastasis in MDA-MB-231 xenografts

Primary tumors and axillary lymph node metastases were harvested from NOD-SCID mice with MDA-MB-231-parent-Venus cell line xenografts. We subjected the sliced tissues to microtissue dissection by an automated tissue microdissection punching system (Fig. 1A). RNA was successfully recovered from the microtissues collected at 93 spots in the primary lesion and 44 spots in axillary lymph node metastasis using a microtissue automatic sampling device (Fig. 1B). In samples of this size, although the number of cells present in the tumor tissue varies, it can be inferred that several to ∼10-30 cells are present in each spot (Yoda et al., 2017). RNA-sequencing (RNA-seq) analysis was performed on the total RNA extracted from each spot. We checked the quality of the FASTQ files by FASTQC. Total RNA samples contained RNA from human cancer cell lines and RNA from mouse stromal cells in the tumor microenvironment. Therefore, the obtained sequences were mapped to both the human reference genome and the mouse reference genome by HISAT2 (Kim et al., 2019). Protein-coding genes (human, 19,961 genes; mouse, 22,050 genes) were extracted as transcripts per million (TPM) for spatial transcriptome analysis with Seurat (Fig. 1C; Fig. S1A-C) (Butler et al., 2018; Stuart et al., 2019).

Fig. 1.

Fig. 1.

Microtissue sectioning of the primary tumor and the axillary lymph node metastasis from the TNBC xenograft model. (A) Experimental flowchart of the spatial transcriptomics analysis. The human triple-negative breast cancer (TNBC) cell line MDA-MB-231-parent-Venus was transplanted orthotopically into a female NOD-SCID mouse. After 8 weeks, the primary tumor was harvested, and the center of the tumor was sectioned. After 4 weeks, axially lymph node metastases were harvested and sectioned from the same mouse. Sectioning was performed by an automated tissue microdissection punching system with a 100 μm needle. (B) In total, 93 microspots were sectioned from the primary tumor, and 43 microspots were sectioned from axillary lymph node metastases. RNA was extracted from a total of 137 spots. (C) Flowchart of the transcriptome analysis. A quality check was performed by FASTQC. The reads were mapped to the human reference genome and the mouse reference genome by HISAT2 and RSEM. Protein-coding genes were selected for analysis with Seurat.

Microtissue transcriptomics analysis reveals two types of CSC-like populations

The clustering analysis and uniform manifold approximation and projection (UMAP) plots showed three clusters of cancer cells (transcripts mapped to the human reference genome) and four clusters of stromal cells (transcripts mapped to the mouse reference genome) in the microspots dissected from primary tumors and axillary lymph node metastases (Fig. 2A,B; Table S1). Next, we evaluated the expression of CSC markers to focus on the CSC populations in the primary tumor and metastatic lesion (Oskarsson et al., 2014). We found that human cancer clusters showed specific gene expression patterns for high-mobility group AT-hook 1 (HMGA1) and CD44 (Fig. 2C,E; Table S2). CD44 and HMGA1 are well-known markers of CSCs in breast cancer (Liu et al., 2010; Pegoraro et al., 2013). CD44 was broadly expressed in all human cell clusters, whereas HMGA1 was highly expressed only in HMGA1-high clusters (Fig. 2C). The upstream enrichment analysis using Metascape (Zhou et al., 2019) showed that the differentially expressed genes (DEGs) of the CD44-expressing clusters were enriched in MYC-targeted genes (Table S3). Consequently, these human cell clusters were named HMGA1-high, CD44/MYC-high and marker-low. The DEGs in the human cell clusters were extracted and visualized as a heatmap (Fig. 2D; Table S2). The heatmap showed that the HMGA1-high cluster expressed TMSB10 (Zhang et al., 2017), CTSD (Ashraf et al., 2019) and LGALS1 (Balestrieri et al., 2021; Jung et al., 2007), which are correlated with poor prognosis in breast cancer. The human CD44/MYC-high cluster expressed CENPK (Lee et al., 2015) and CENPN (Wu et al., 2021a), which regulate the cell cycle and cell division in cancer. The human marker-low clusters showed low expression levels of these genes. These results suggested that two types of CSC-like populations existed in the MDA-MB-231 xenografts.

Fig. 2.

Fig. 2.

Transcriptome profiling of the clusters in each microspot using cancer RNA and stromal RNA. (A) Uniform manifold approximation and projection (UMAP) plot of 137 human (cancer cell) spots in the primary tumor and lymph node metastasis. (B) UMAP plot of 137 mouse (stromal cell) spots in the primary tumor and lymph node metastasis. (C) Heatmap of CD44, HMGA1 and MYC expression in each human cell cluster. (D) Heatmap of differentially expressed genes (DEGs; adjusted P-value<0.05 and pct.1−pct.2>0.1) in each human cell cluster. (E) Violin plots of expression levels of cancer marker genes, CD44, HMGA1 and MYC. (F) Violin plots of expression levels of stromal marker genes, Ttr, Cd3d, Ms4a1 and Inhba.

To determine the biological function of DEGs, we performed upstream analysis, gene ontology (GO) enrichment analysis and pathway enrichment analysis using Metascape. We performed pathway and GO enrichment analyses focused on the two CSC-like clusters that had upregulated DEGs (Fig. S2, Table S4). Amide metabolites, VEGFA–VEGFR signaling and ribonucleoprotein complex biogenesis were enriched in both clusters. The DEGs in the CD44/MYC-high clusters were enriched in many terms related to the cell cycle, cell division and ribosomal biogenesis. In the HMGA1-high clusters, the terms ribosome and TRBP complex were significantly enriched.

The expression of CD44 and HMGA1 in transplanted sections was confirmed with fluorescence immunostaining (Fig. 3A). CD44 and HMGA1 were detected in primary tumor sections. There were cells that expressed CD44 and HMGA1 independently and cells that co-expressed both CD44 and HMGA1. To confirm the existence of two CSC-like populations in the MDA-MB-231 xenograft model, we reanalyzed the public single-cell RNA-seq (scRNA-seq) samples of primary tumors and circulating tumor cells (CTCs) from the MDA-MB-231-LM2 xenograft model (Moravec et al., 2021 preprint). We downloaded the GSE163210 dataset from the Gene Expression Omnibus (GEO) database and reanalyzed it with Seurat/R (Fig. 3B). We extracted 8494 cancer cells from the dataset with UMAP visualization (Fig. 3C). The scRNA-seq analysis revealed that there were HMGA1-high, CD44-high and double-positive (HMGA1- and CD44-positive) populations in both the primary tumors and CTCs in the xenograft model (Fig. 3D,E). These results suggested that there were two types of CSC-like populations expressing CSC marker genes in the MDA-MB-231 xenograft model.

Fig. 3.

Fig. 3.

Two CSC-like populations detected by immunostaining and scRNA-seq analysis. (A) Representative images of CD44 and HMGA1 immunostaining in primary tumor sections from the MDA-MB-231 xenograft model. Red, HMGA1; green, CD44; blue, nuclei. Scale bars: 20 μm (left), 10 μm (right). White arrows represent cancer cells with independent expression or co-expression. (B) Flowchart of public single-cell RNA-sequencing (scRNA-seq) reanalysis using the GSE163210 dataset. (C) UMAP plot of MDA-MB-231-LM2 xenograft tumors and circulating tumor cells. (D) Expression of CD44 and HMGA1 in the UMAP plot. (E) Pie chart of the cancer stem cell (CSC)-like population proportion in MDA-MB-231-LM2 xenografts.

The mouse stromal clusters showed specific gene expression patterns for transthyretin (Ttr), Cd3d (T-cell marker), membrane-spanning 4-domains a1 (Ms4a1; B-cell marker) and inhibin subunit beta A (Inhba; a subunit of both activin and inhibin) (Fig. 2F; Fig. S3A). Ttr and Inhba were highly expressed in their respective specific clusters; therefore, these mouse clusters were named ‘Ttr-high’, ‘Tcell-like’, ‘Inhba-high’ and ‘Bcell-like’. Enrichment analysis of the DEGs of the mouse clusters showed that only the Tcell-like clusters had many enriched terms (Fig. S3B, Table S5). The citric acid [tricarboxylic acid (TCA)] cycle, chemical stress response and fatty acid oxidation were enriched in the Tcell-like populations.

Microspot spatial and cell cycle analyses

Next, we performed spot analysis with the spatial information to determine the spatial heterogeneity in the xenografts (Fig. 4A-C). Interestingly, although three human cell clusters were present in both the primary tumors and the lymph node metastases (Fig. S4A), the mouse stromal clusters showed a site-specific pattern (Fig. S4B). Most of the Ttr-high clusters were observed in the primary tumors, whereas most of the Tcell-like clusters and Bcell-like clusters were found in the lymph node metastases. Human CD44/MYC-high cancer cells tended to localize the outside of the primary tumor (Fig. 4A,D).

Fig. 4.

Fig. 4.

Spatial analysis revealed the aggressive proliferation of CSC-like populations in the lymph node metastasis. (A) Spatial transcriptomics analysis of human (cancer cell) clusters in the primary tumor and the lymph node metastasis. (B) Cell cycle phase of cancer cells in the primary tumor and lymph node metastases. (C) Spatial transcriptomics analysis of mouse (stromal cell) clusters in the primary tumor and lymph node metastasis. (D) Spatial analysis of human (cancer cell) clusters with the chi-squared test; *P<0.05; n.s., not significant. (E) Sunburst plot of the cell cycle in human cancer cell clusters. (F) Bar plot of cell cycle phases in the primary tumor and lymph node metastasis. Fisher's exact test; *P<0.05; **P<0.01; n.s., not significant.

Cell cycle analysis of the cancer cells showed that cell proliferation occurred outside of the primary tumor and at sparse sites among the lymph node metastases (Fig. 4B). Approximately 50% of the cells in the CD44/MYC-high clusters and 30% of the cells in the HMGA1-high clusters were actively undergoing cell division (Fig. 4E; Fig. S4C, Table S6). These results suggested that the cells that had originated from the two CSC-like clusters, leading to cancer expressional heterogeneity. Next, a comparative analysis of the cell cycle in the primary tumor and lymph node metastasis showed that the two CSC-like clusters (HMGA1-high and CD44/MYC-high) had an increased cell division index in lymph node metastasis (Fig. 4F). In contrast, the marker-low clusters did not have an altered cell cycle index in either location. These results suggested that the CD44/MYC and HMGA1 CSC-like populations in metastatic tissues proliferated aggressively.

Mouse stromal cell localization assessment showed that most of the Tcell-like clusters were present throughout the entire lymph node metastases, and the Tcell-like clusters also existed outside the primary tumor. Most of the Ttr-high clusters were sparsely present throughout the primary tumor (Fig. 4C). The cells in the mouse Tcell-like clusters and Inhba-high clusters showed an active cell cycle (Fig. S4D-F).

Two CSC-like populations in TNBC patients

To confirm our findings in clinical samples, we reanalyzed three integrated public scRNA-seq datasets (Pal et al., 2021; Wu et al., 2021b; Xu et al., 2021). We analyzed the integrated scRNA-seq dataset of 19 TNBC samples (Fig. 5A) and extracted 48,362 cancer cells from the dataset with UMAP visualization (Fig. 5B; Fig. S5A). Expression analysis detected HMGA1-high cancer cells (HMGA1 expression level>2), CD44-high cancer cells (CD44 expression level>2) and double-positive cancer cells in the integrated TNBC cohorts (Fig. 5C,D; Fig. S5B). Most of the TNBC patients had two CSC-like populations and a double-positive population (Fig. 5E,F; Table S7).

Fig. 5.

Fig. 5.

Reanalysis of clinical scRNA-seq with CSC-like signatures. (A) Flowchart representing the strategy of reanalysis using public scRNA-seq datasets. We downloaded GSE161529, GSE176078 and GSE180286 including scRNA-seq data from 19 TNBC patients. The integrated datasets were analyzed with Seurat. Log normalization, scaling, principal component analysis (PCA) and UMAP visualization were performed following the basic protocol in Seurat. To extract the cancer cells, cells expressing EPCAM/KRT8 (epithelial marker) were filtered. Cancer cells (48,362 cells) were extracted from 107,164 single cells. (B) UMAP plot of cancer cells from 19 TNBC patients with sample IDs. (C,D) Expression analysis of CD44 (C; expression level>2) and HMGA1 (D; expression level>2) with UMAP plots. (E) UMAP plot of CD44-high, HMGA1-high, HMGA1/CD44-high and marker-negative cancer cells. (F) The proportion of cancer cells that expressed CD44 and HMGA1 are represented in a bar plot.

DISCUSSION

Spatial transcriptomics technologies have enabled us to reveal the in situ expressional profiles and microheterogeneity of cancer (Rao et al., 2021). In particular, in a xenograft model, both human-derived RNA and mouse-derived RNA can be analyzed simultaneously and individually by mapping the sequence reads to a human genome reference or mouse genome reference (Bradford et al., 2013; Callari et al., 2018). In this study, by combining microtissue sampling and the isolation of human–mouse gene expression by mapping, we revealed the expressional heterogeneity of cancer cells and stromal cells in MDA-MB-231 primary tumors and axillary lymph node metastases. We observed two types of CSC-like populations in both the primary tumors and lymph node metastases. One of the CSC-like populations expressed CD44 and MYC. The CD44 gene is a well-known CSC marker in breast cancer (Liu et al., 2010; Marotta et al., 2011; Sheridan et al., 2006). The other CSC-like population, the HMGA1-high cluster, was observed in both the primary tumors and lymph node metastases. HMGA1 promotes tumor initiation, cancer stemness and metastasis in TNBC (Huang et al., 2015; Pegoraro et al., 2013; Shah et al., 2013). Enrichment analysis of DEGs showed upregulation of terms related to ribosome processes. The upregulation of translation and ribosomal processes may promote distant metastasis in breast cancer (Ebright et al., 2020). scRNA-seq analysis and immunostaining of MDA-MB-231 xenografts confirmed the two types of CSC-like populations. Spatial transcriptomics of mouse genes showed Inhba-high stromal populations in the xenograft model. Inhba is a member of the TGF-β superfamily (Bloise et al., 2019). INHBA is upregulated in breast tumors and induces EMT, tumor growth and distant metastasis (Bashir et al., 2015; Kalli et al., 2019). Most Inhba-high populations also existed in the primary tumor. Our results suggest that stromal expression of Inhba enhanced tumor growth in the MDA-MB-231 primary tumors. In addition, the punching microdissection system used in this study enables the extraction of only the parts of a section, which is useful for analysis at low cost. This system might be applied in analysis for extraction of fresh RNA samples instead of laser-capture microdissection methods.

Reanalysis of scRNA-seq of the CTCs in MDA-MB-231-LM2 xenografting mouse revealed that the number of CSC-like populations in CTCs was small; however, a large number of CSC-like populations was identified in the lymph node metastasis by our spatial transcriptomics. Our results showed that CSC-like populations significantly proliferated in the metastatic region, suggesting that CSC-like populations were increased by aggressive proliferation in the metastatic regions from a small number of disseminated tumor cells. In the integrated clinical scRNA-seq analysis, both types of CSC-like populations were observed in single-cell analysis of TNBC patients. Our results suggested that the co-existence of these multi-CSC-like populations makes curative treatment difficult and causes anticancer drug resistance in the clinic. Additionally, we extracted the DEGs from these CSC-like populations in TNBC patients, suggesting that some DEGs (CD44, SRGN, CXCR4, TXNIP, ANXA1, HNRNPH1, DDX5 and ITM2B in the CD44/MYC-high cluster; HMGA1, GAPDH and RPLP0 in the HMGA1-high cluster) of CSC-like populations were consistent between the clinical patients and our spatial transcriptomics (Table S8). Several samples from the clinical patients were enriched CSC-like populations. The difference in populations is caused by sampling bias in the GSE176078 dataset. It is considered that the patients' samples with a large number of CSC-like populations might have been sampled from an area enriched with CSC-like cancer cells. This is a methodological limitation of reanalysis using a public dataset.

Our study has limitations. Several spots did not contain enough RNA for analysis or exhibited bias toward either human RNA or mouse RNA (Fig. S1A). Thus, one limitation of this sampling method is that some spots have a biased cell type or no cells. Regarding the read counts (nCount_RNA) of each cluster, there were lower counts for the human marker-low cluster, mouse Ttr-high cluster, mouse Inhba-high cluster and mouse Bcell-like cluster than for other clusters (Fig. S1B,C). The limitations of cell type bias and low RNA extraction efficiency caused these low transcript counts. In particular, the marker-low human cluster and Ttr-high mouse cluster had the lowest nCount_RNA. This result suggests that these clusters may have included dead/dying cells. Tumor tissue has spatial heterogeneity, including in necroptotic/apoptotic areas, which is one of the limitations of spatial transcriptome analysis. We also detected different nCount_RNA and features, so it was difficult to define the exact characteristics and cell types in the tissue at single-cell resolution. This is a technical limitation of spatial transcriptomics. More high-resolution spatial transcriptomics analyses are needed to define the cluster characteristics at single-cell resolution, as has been recently developed (Srivatsan et al., 2021). For reproducibility of these spatial transcriptomics in MDA-MB-231 xenografts, we reanalyzed primary tumors and metastatic lesions from other mice as pilot study samples. The microspots including test trial samples showed three human clusters, which were classified into CD44/MYC, HMGA1 and marker-low clusters (Fig. S6).

In conclusion, our study showed that there are two types of CSC-like populations in MDA-MB-231 xenograft models and TNBC patients. The presence of these CSC-like populations could potentially make tumors more drug resistant and thus more difficult to treat. More effective therapies need to be developed through the elucidation of intratumor heterogeneity. In addition, our spatial transcriptomics methods will be helpful for the diagnosis, further identification of biomarkers and elucidation of the essential characteristics of cancer.

MATERIALS AND METHODS

Cell culture

The MDA-MB-231-parent-Venus cell line was cultured in RPMI-1640 medium (Fujifilm Wako, Osaka, Japan) supplemented with 10% heat-inactivated fetal bovine serum (FBS; Fujifilm Wako), 100 µg/ml streptomycin (Meiji Seika Pharma Co. Ltd., Tokyo, Japan) and 100 U/ml penicillin (Meiji Seika Pharma) at 37°C with 5% CO2.

Animal studies

A breast cancer xenograft model was established in NOD.CB-17-Prkdcscid/J mice (NOD-SCID; Charles River Laboratories Japan, Inc., Kanagawa, Japan) by orthotopic transplantation as previously described (Nakayama et al., 2017). A total of 1.0×106 cells were injected into the fourth fat pad of NOD-SCID mice. The primary tumor was removed 8 weeks after transplantation. An axillary lymph node metastasis was sampled 2 weeks after removing the primary tumor. The growth of the primary tumors and metastases was monitored by bioluminescence using an in vivo imaging system (IVIS-XRMS, PerkinElmer, Waltham, MA, USA). For bioluminescence monitoring by in vivo imaging system (IVIS), mice were anesthetized with 2.5% isoflurane (Fujifilm Wako) and intraperitoneally injected with 3 mg D-luciferin (Gold Biotechnology Inc., Olivette, MO, USA) in 200 µl PBS as previously described (Han et al., 2020; Kuroiwa et al., 2020). The harvested organs were placed in ice-cold PBS (Fujifilm Wako), embedded in Super Cryoembedding Medium (SECTION-LAB, Hiroshima, Japan) using liquid nitrogen and stored at −80°C until sectioning.

Animal experiments in this study were conducted under the approval of the Animal Committee of Waseda University (2017-A043a) and conformed to Animal Research: Reporting of In Vivo Experiments (ARRIVE) guidelines. Approved protocols were strictly adhered to..

Microtissue dissection and RNA-seq analysis

Microtissue sampling was performed by an automated tissue microdissection punching system as previously described (Yoda et al., 2017). Frozen sections were sliced at a thickness of 20 μm and transferred to an LMD film II (SECTION-LAB). Microspots were sampled with a 100 μm needle in the dissection instrument. RNA-seq was performed by Illumina HiSeq as previously described (Yoda et al., 2017).

Mapping and quality check

Transcriptome analysis was performed with HISAT2 version 2.0.5 (Kim et al., 2019) and RSEM version 1.3.0 (Li and Dewey, 2011). The expression of genes in cancer cells was obtained by mapping RNA-sequence reads to the human reference genome or mouse reference genome. We subjected ‘protein_coding’ genes to spatial transcriptome analysis.

Clustering and UMAP visualization

Data mining analyses such as clustering, UMAP analysis and DEG extraction were performed with the functions ‘runPCA’, ‘FindNeighbors’, ‘FindClusters’, ‘runUMAP’ and ‘FindAllMarkers’ in Seurat version 3.2. (Stuart et al., 2019). Cell cycle estimation was performed by the function ‘CellCycleScoring’ using cell cycle marker genes in Seurat. A heatmap of DEGs (adjusted P-value<0.05 and pct.1−pct.2>0.1) (Tables S2 and S4) was drawn using ComplexHeatmap (Gu et al., 2016). These packages and functions were run in R version 3.6.3.

Enrichment analysis using DEGs

Pathway and GO enrichment analyses were performed by Metascape (https://metascape.org/gp/index.html#/main/step1) (Zhou et al., 2019). DEGs from each cluster were subjected to the Metascape interface. Differential enrichment terms were analyzed in multiple gene list mode. The results of the enrichment analysis were visualized as heatmaps.

Immunostaining of tumor sections

Primary tumors generated from transplanted MDA-MB-231 cells were fixed in 4% paraformaldehyde overnight and embedded in paraffin. Primary tumor sections were dewaxed with xylene and rehydrated with ethanol (100% to 70%). Antigen retrieval was performed by boiling the specimens in Immunosaver (Nissin EM, Tokyo, Japan) diluted 1:200 for 45 min at 98°C. The sections were permeabilized with 0.1% Triton X-100 (Sigma-Aldrich, St Louis, MO, USA) for 15 min. After blocking with Dako blocking reagent (Vector Laboratories, Newark, CA, USA) for 30 min, sections were incubated with primary antibodies for 1 h at room temperature. Sections were then incubated with the primary antibodies, anti-CD44 (60224-1-IG, Proteintech Group, Rosemont, IL, USA; diluted 1:100) and anti-HMGA1 (ab252930, Abcam, Cambridge, UK; diluted 1:200), and secondary anti-mouse Alexa Fluor 488-labeled antibody (A-11001, Thermo Fisher Scientific, Waltham, MA, USA; diluted 1:1000) and anti-rabbit Alexa Fluor 594-labeled antibody (A-212-7, Thermo Fisher Scientific; diluted 1:1000). Slides were mounted with VECTASHIELD mounting medium with Hoechst 33342 (Thermo Fisher Scientific). Stained sections were imaged using an FV10i Laser Scanning Microscope (OLYMPUS, Tokyo, Japan).

Analysis of the public scRNA-seq dataset

The scRNA-seq dataset of the MDA-MB-231-LM2 xenograft model (GSE163210) (Moravec et al., 2021 preprint) was downloaded from the GEO database. The scRNA-seq datasets of TNBC patients from the public datasets GSE161529 (Pal et al., 2021), GSE176078 (Wu et al., 2021b) and GSE180286 (Xu et al., 2021) were also downloaded from the GEO database. The scRNA-seq datasets were analyzed with Seurat vignettes, and the integration of datasets was performed by Harmony (Korsunsky et al., 2019). Low-quality single cells (nFeature_RNA<500 and percent.mt>20) were removed. The clinical datasets were analyzed with clinical annotation data.

Statistical analysis

Chi-squared test, Fisher's exact test and log rank test were performed in R. DEG extraction was performed by the ‘FindAllMarkers’ function with the Wilcoxon rank sum test. Significance was defined as P<0.05.

Supplementary Material

Supplementary information
dmm-15-049538-s1.pdf (4.4MB, pdf)

Acknowledgements

We thank Dr Tomofumi Yamamoto, Mr Kiyofumi Takahashi and Ms Chikako Sakanashi for technical support. We thank Prof. Semba's laboratory, Prof. Takeyama's laboratory and CBBD-OIL (AIST) for meaningful discussions.

Footnotes

Competing interests

The authors declare no competing or financial interests.

Author contributions

Conceptualization: J.N., K.S.; Methodology: J.N., H.M., K.A., T.Y., M.H., H.T.; Software: J.N., K.A.; Validation: J.N.; Formal analysis: J.N.; Investigation: J.N.; Resources: J.N., K.A., M.H.; Data curation: J.N., H.M., T.Y., M.H.; Writing - original draft: J.N.; Writing - review & editing: J.N., H.M., K.A., T.Y., M.H., H.T., Y.Y., K.S.; Visualization: J.N., Y.Y.; Supervision: J.N.; Project administration: J.N., H.T., Y.Y., K.S.; Funding acquisition: J.N.

Funding

This study was supported in part by the Japan Society for the Promotion of Science KAKENHI (18K16269, 20J01794 and 21K15562 to J.N.), and Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research) from the Japan Agency for Medical Research and Development (JP19am0101004). The supercomputing resource was provided by the Human Genome Center (University of Tokyo). Open access funding provided by the Japan Society for the Promotion of Science KAKENHI (21K15562 to J.N.). Deposited in PMC for immediate release.

Data availability

The spatial expression data (RNA-seq data of each microspot) were deposited at GEO under accession number GSE184720. The source code used for the spatial transcriptome analysis is available on GitHub (https://github.com/JunNakayama/Spatial-Transcriptomics-of-MDA-MB-231-xenografts).

References

  1. Andersson, A., Larsson, L., Stenbeck, L., Salmén, F., Ehinger, A., Wu, S. Z., Al-Eryani, G., Roden, D., Swarbrick, A., Borg, Å.et al. (2021). Spatial deconvolution of HER2-positive breast cancer delineates tumor-associated cell type interactions. Nat. Commun. 12, 6012. 10.1038/s41467-021-26271-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ashraf, Y., Mansouri, H., Laurent-Matha, V., Alcaraz, L. B., Roger, P., Guiu, S., Derocq, D., Robin, G., Michaud, H. A., Delpech, H.et al. (2019). Immunotherapy of triple-negative breast cancer with cathepsin D-targeting antibodies. J. Immunother. Cancer 7, 29. 10.1186/s40425-019-0498-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Balestrieri, K., Kew, K., McDaniel, M., Ramez, M., Pittman, H. K., Murray, G., Vohra, N. A. and Verbanac, K. M. (2021). Proteomic identification of tumor- and metastasis-associated galectin-1 in claudin-low breast cancer. Biochim. Biophys. Acta Gen. Subj. 1865, 129784. 10.1016/j.bbagen.2020.129784 [DOI] [PubMed] [Google Scholar]
  4. Bashir, M., Damineni, S., Mukherjee, G. and Kondaiah, P. (2015). Activin-A signaling promotes epithelial-mesenchymal transition, invasion, and metastatic growth of breast cancer. NPJ Breast Cancer 1, 15007. 10.1038/npjbcancer.2015.7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bloise, E., Ciarmela, P., Dela Cruz, C., Luisi, S., Petraglia, F. and Reis, F. M. (2019). Activin A in mammalian physiology. Physiol. Rev. 99, 739-780. 10.1152/physrev.00002.2018 [DOI] [PubMed] [Google Scholar]
  6. Bradford, J. R., Farren, M., Powell, S. J., Runswick, S., Weston, S. L., Brown, H., Delpuech, O., Wappett, M., Smith, N. R., Carr, T. H.et al. (2013). RNA-seq differentiates tumour and host mRNA expression changes induced by treatment of human tumour xenografts with the VEGFR tyrosine kinase inhibitor cediranib. PLoS One 8, e66003. 10.1371/journal.pone.0066003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Butler, A., Hoffman, P., Smibert, P., Papalexi, E. and Satija, R. (2018). Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411-420. 10.1038/nbt.4096 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Callari, M., Batra, A. S., Batra, R. N., Sammut, S.-J., Greenwood, W., Clifford, H., Hercus, C., Chin, S.-F., Bruna, A., Rueda, O. M.et al. (2018). Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts. BMC Genomics 19, 19. 10.1186/s12864-017-4414-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Civita, P., Franceschi, S., Aretini, P., Ortenzi, V., Menicagli, M., Lessi, F., Pasqualetti, F., Naccarato, A. G. and Mazzanti, C. M. (2019). Laser capture microdissection and RNA-Seq analysis: high sensitivity approaches to explain histopathological heterogeneity in human glioblastoma FFPE archived tissues. Front. Oncol. 9, 482. 10.3389/fonc.2019.00482 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Ebright, R. Y., Lee, S., Wittner, B. S., Niederhoffer, K. L., Nicholson, B. T., Bardia, A., Truesdell, S., Wiley, D. F., Wesley, B., Li, S.et al. (2020). Deregulation of ribosomal protein expression and translation promotes breast cancer metastasis. Science 367, 1468-1473. 10.1126/science.aay0939 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Echeverria, G. V., Powell, E., Seth, S., Ge, Z., Carugo, A., Bristow, C., Peoples, M., Robinson, F., Qiu, H., Shao, J.et al. (2018). High-resolution clonal mapping of multi-organ metastasis in triple negative breast cancer. Nat. Commun. 9, 5079. 10.1038/s41467-018-07406-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Gillies, R. J. (2022). Cancer heterogeneity and metastasis: life at the edge. Clin. Exp. Metastasis 39, 15-19. 10.1007/s10585-021-10101-2 [DOI] [PubMed] [Google Scholar]
  13. Giuliano, A. E., Hunt, K. K., Ballman, K. V., Beitsch, P. D., Whitworth, P. W., Blumencranz, P. W., Leitch, A. M., Saha, S., McCall, L. M. and Morrow, M. (2011). Axillary dissection vs no axillary dissection in women with invasive breast cancer and sentinel node metastasis: a randomized clinical trial. JAMA 305, 569-575. 10.1001/jama.2011.90 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gu, Z., Eils, R. and Schlesner, M. (2016). Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847-2849. 10.1093/bioinformatics/btw313 [DOI] [PubMed] [Google Scholar]
  15. Han, Y., Nakayama, J., Hayashi, Y., Jeong, S., Futakuchi, M., Ito, E., Watanabe, S. and Semba, K. (2020). Establishment and characterization of highly osteolytic luminal breast cancer cell lines by intracaudal arterial injection. Genes Cells 25, 111-123. 10.1111/gtc.12743 [DOI] [PubMed] [Google Scholar]
  16. Huang, R., Huang, D., Dai, W. and Yang, F. (2015). Overexpression of HMGA1 correlates with the malignant status and prognosis of breast cancer. Mol. Cell. Biochem. 404, 251-257. 10.1007/s11010-015-2384-4 [DOI] [PubMed] [Google Scholar]
  17. Jung, E.-J., Moon, H.-G., Cho, B. I., Jeong, C.-Y., Joo, Y.-T., Lee, Y.-J., Hong, S.-C., Choi, S.-K., Ha, W.-S., Kim, J. W.et al. (2007). Galectin-1 expression in cancer-associated stromal cells correlates tumor invasiveness and tumor progression in breast cancer. Int. J. Cancer 120, 2331-2338. 10.1002/ijc.22434 [DOI] [PubMed] [Google Scholar]
  18. Kalli, M., Mpekris, F., Wong, C. K., Panagi, M., Ozturk, S., Thiagalingam, S., Stylianopoulos, T. and Papageorgis, P. (2019). Activin A signaling regulates IL13Rα2 expression to promote breast cancer metastasis. Front. Oncol. 9, 32. 10.3389/fonc.2019.00032 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Kim, D., Paggi, J. M., Park, C., Bennett, C. and Salzberg, S. L. (2019). Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907-915. 10.1038/s41587-019-0201-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Korsunsky, I., Millard, N., Fan, J., Slowikowski, K., Zhang, F., Wei, K., Baglaenko, Y., Brenner, M., Loh, P.-R. and Raychaudhuri, S. (2019). Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289-1296. 10.1038/s41592-019-0619-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kuroiwa, Y., Nakayama, J., Adachi, C., Inoue, T., Watanabe, S. and Semba, K. (2020). Proliferative classification of intracranially injected HER2-positive breast cancer cell lines. Cancers 12, 1811. 10.3390/cancers12071811 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lee, Y.-C., Huang, C.-C., Lin, D.-Y., Chang, W.-C. and Lee, K.-H. (2015). Overexpression of centromere protein K (CENPK) in ovarian cancer is correlated with poor patient survival and associated with predictive and prognostic relevance. PeerJ 3, e1386. 10.7717/peerj.1386 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Li, B. and Dewey, C. N. (2011). RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 12, 323. 10.1186/1471-2105-12-323 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Liu, H., Patel, M. R., Prescher, J. A., Patsialou, A., Qian, D., Lin, J., Wen, S., Chang, Y.-F., Bachmann, M. H., Shimono, Y.et al. (2010). Cancer stem cells from human breast tumors are involved in spontaneous metastases in orthotopic mouse models. Proc. Natl. Acad. Sci. USA 107, 18115-18120. 10.1073/pnas.1006732107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Marotta, L. L., Almendro, V., Marusyk, A., Shipitsin, M., Schemme, J., Walker, S. R., Bloushtain-Qimron, N., Kim, J. J., Choudhury, S. A., Maruyama, R.et al. (2011). The JAK2/STAT3 signaling pathway is required for growth of CD44+CD24 stem cell-like breast cancer cells in human tumors. J. Clin. Invest. 121, 2723-2735. 10.1172/JCI44745 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. McGranahan, N. and Swanton, C. (2017). Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613-628. 10.1016/j.cell.2017.01.018 [DOI] [PubMed] [Google Scholar]
  27. Moravec, J. C., Lanfear, R., Spector, D. L., Diermeier, S. D. and Gavryushkin, A. (2021). Cancer phylogenetics using single-cell RNA-seq data. bioRxiv 2021.01.07.425804. 10.1101/2021.01.07.425804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Nakayama, J., Ito, E., Fujimoto, J., Watanabe, S. and Semba, K. (2017). Comparative analysis of gene regulatory networks of highly metastatic breast cancer cells established by orthotopic transplantation and intra-circulation injection. Int. J. Oncol. 50, 497-504. 10.3892/ijo.2016.3809 [DOI] [PubMed] [Google Scholar]
  29. Nakayama, J., Han, Y., Kuroiwa, Y., Azuma, K., Yamamoto, Y. and Semba, K. (2021). The in vivo selection method in breast cancer metastasis. Int. J. Mol. Sci. 22, 1886. 10.3390/ijms22041886 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Obenauf, A. C. and Massagué, J. (2015). Surviving at a distance: organ-specific metastasis. Trends Cancer 1, 76-91. 10.1016/j.trecan.2015.07.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Oskarsson, T., Batlle, E. and Massagué, J. (2014). Metastatic stem cells: sources, niches, and vital pathways. Cell Stem Cell 14, 306-321. 10.1016/j.stem.2014.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Pal, B., Chen, Y., Vaillant, F., Capaldo, B. D., Joyce, R., Song, X., Bryant, V. L., Penington, J. S., Di Stefano, L., Tubau Ribera, N.et al. (2021). A single-cell RNA expression atlas of normal, preneoplastic and tumorigenic states in the human breast. EMBO J. 40, e107333. 10.15252/embj.2020107333 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Pegoraro, S., Ros, G., Piazza, S., Sommaggio, R., Ciani, Y., Rosato, A., Sgarra, R., Del Sal, G. and Manfioletti, G. (2013). HMGA1 promotes metastatic processes in basal-like breast cancer regulating EMT and stemness. Oncotarget 4, 1293-1308. 10.18632/oncotarget.1136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rao, A., Barkley, D., França, G. S. and Yanai, I. (2021). Exploring tissue architecture using spatial transcriptomics. Nature 596, 211-220. 10.1038/s41586-021-03634-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Robinson, D. R., Wu, Y.-M., Lonigro, R. J., Vats, P., Cobain, E., Everett, J., Cao, X., Rabban, E., Kumar-Sinha, C., Raymond, V.et al. (2017). Integrative clinical genomics of metastatic cancer. Nature 548, 297-303. 10.1038/nature23306 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shah, S. N., Cope, L., Poh, W., Belton, A., Roy, S., Talbot, C. C., Jr., Sukumar, S., Huso, D. L. and Resar, L. M. (2013). HMGA1: a master regulator of tumor progression in triple-negative breast cancer cells. PLoS One 8, e63419. 10.1371/journal.pone.0063419 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Sheridan, C., Kishimoto, H., Fuchs, R. K., Mehrotra, S., Bhat-Nakshatri, P., Turner, C. H., Goulet, R., Jr, Badve, S. and Nakshatri, H. (2006). CD44+/CD24-breast cancer cells exhibit enhanced invasive properties: an early step necessary for metastasis. Breast Cancer Res. 8, R59. 10.1186/bcr1610 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Srivatsan, S. R., Regier, M. C., Barkan, E., Franks, J. M., Packer, J. S., Grosjean, P., Duran, M., Saxton, S., Ladd, J. J., Spielmann, M.et al. (2021). Embryo-scale, single-cell spatial transcriptomics. Science 373, 111-117. 10.1126/science.abb9536 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Stuart, T., Butler, A., Hoffman, P., Hafemeister, C., Papalexi, E., Mauck, W. M., 3rd, Hao, Y., Stoeckius, M., Smibert, P. and Satija, R. (2019). Comprehensive integration of single-cell data. Cell 177, 1888-1902.e21. 10.1016/j.cell.2019.05.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Turdo, A., Veschi, V., Gaggianesi, M., Chinnici, A., Bianca, P., Todaro, M. and Stassi, G. (2019). Meeting the challenge of targeting cancer stem cells. Front. Cell Dev. Biol. 7, 16. 10.3389/fcell.2019.00016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Weiss, F., Lauffenburger, D. and Friedl, P. (2022). Towards targeting of shared mechanisms of cancer metastasis and therapy resistance. Nat. Rev. Cancer 22, 157-173. 10.1038/s41568-021-00427-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Wu, H., Zhou, Y., Wu, H., Xu, L., Yan, Y., Tong, X. and Yan, H. (2021a). CENPN acts as a novel biomarker that correlates with the malignant phenotypes of glioma cells. Front. Genet. 12, 732376. 10.3389/fgene.2021.732376 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Wu, S. Z., Al-Eryani, G., Roden, D. L., Junankar, S., Harvey, K., Andersson, A., Thennavan, A., Wang, C., Torpy, J. R., Bartonicek, N.et al. (2021b). A single-cell and spatially resolved atlas of human breast cancers. Nat. Genet. 53, 1334-1347. 10.1038/s41588-021-00911-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Xu, K., Wang, R., Xie, H., Hu, L., Wang, C., Xu, J., Zhu, C., Liu, Y., Gao, F., Li, X.et al. (2021). Single-cell RNA sequencing reveals cell heterogeneity and transcriptome profile of breast cancer lymph node metastasis. Oncogenesis 10, 66. 10.1038/s41389-021-00355-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Yates, L. R., Knappskog, S., Wedge, D., Farmery, J. H. R., Gonzalez, S., Martincorena, I., Alexandrov, L. B., Van Loo, P., Haugland, H. K., Lilleng, P. K.et al. (2017). Genomic evolution of breast cancer metastasis and relapse. Cancer Cell 32, 169-184.e7. 10.1016/j.ccell.2017.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Yoda, T., Hosokawa, M., Takahashi, K., Sakanashi, C., Takeyama, H. and Kambara, H. (2017). Site-specific gene expression analysis using an automated tissue micro-dissection punching system. Sci. Rep. 7, 4325. 10.1038/s41598-017-04616-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Zhang, X., Ren, D., Guo, L., Wang, L., Wu, S., Lin, C., Ye, L., Zhu, J., Li, J., Song, L.et al. (2017). Thymosin beta 10 is a key regulator of tumorigenesis and metastasis and a novel serum marker in breast cancer. Breast Cancer Res. 19, 15. 10.1186/s13058-016-0785-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Zhou, Y., Zhou, B., Pache, L., Chang, M., Khodabakhshi, A. H., Tanaseichuk, O., Benner, C. and Chanda, S. K. (2019). Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523. 10.1038/s41467-019-09234-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary information
dmm-15-049538-s1.pdf (4.4MB, pdf)

Articles from Disease Models & Mechanisms are provided here courtesy of Company of Biologists

RESOURCES