Skip to main content
The EMBO Journal logoLink to The EMBO Journal
. 2020 Aug 13;39(19):e104063. doi: 10.15252/embj.2019104063

Stromal cell diversity associated with immune evasion in human triple‐negative breast cancer

Sunny Z Wu 1,2, Daniel L Roden 1,2, Chenfei Wang 3, Holly Holliday 1,2, Kate Harvey 1, Aurélie S Cazet 1,2, Kendelle J Murphy 1,2, Brooke Pereira 1,2, Ghamdan Al‐Eryani 1,2, Nenad Bartonicek 1,2, Rui Hou 4, James R Torpy 1,2, Simon Junankar 1,2, Chia‐Ling Chan 5, Chuan En Lam 5, Mun N Hui 1,6, Laurence Gluch 7, Jane Beith 6, Andrew Parker 8, Elizabeth Robbins 9, Davendra Segara 8, Cindy Mak 6, Caroline Cooper 10,11, Sanjay Warrier 12,13, Alistair Forrest 4,14, Joseph Powell 5,15, Sandra O'Toole 1,2,16, Thomas R Cox 1,2, Paul Timpson 1,2, Elgene Lim 1,2,8, X Shirley Liu 3, Alexander Swarbrick 1,2,
PMCID: PMC7527929  PMID: 32790115

Abstract

The tumour stroma regulates nearly all stages of carcinogenesis. Stromal heterogeneity in human triple‐negative breast cancers (TNBCs) remains poorly understood, limiting the development of stromal‐targeted therapies. Single‐cell RNA sequencing of five TNBCs revealed two cancer‐associated fibroblast (CAF) and two perivascular‐like (PVL) subpopulations. CAFs clustered into two states: the first with features of myofibroblasts and the second characterised by high expression of growth factors and immunomodulatory molecules. PVL cells clustered into two states consistent with a differentiated and immature phenotype. We showed that these stromal states have distinct morphologies, spatial relationships and functional properties in regulating the extracellular matrix. Using cell signalling predictions, we provide evidence that stromal‐immune crosstalk acts via a diverse array of immunoregulatory molecules. Importantly, the investigation of gene signatures from inflammatory‐CAFs and differentiated‐PVL cells in independent TNBC patient cohorts revealed strong associations with cytotoxic T‐cell dysfunction and exclusion, respectively. Such insights present promising candidates to further investigate for new therapeutic strategies in the treatment of TNBCs.

Keywords: cancer‐associated fibroblasts, single‐cell RNA sequencing, stromal heterogeneity, triple‐negative breast cancer, tumour microenvironment

Subject Categories: Cancer, Immunology


Single‐cell profiling of primary breast cancer provides unprecedented insights into cell‐type heterogeneity within the tumor microenvironment.

graphic file with name EMBJ-39-e104063-g013.jpg

Introduction

Heterotypic interactions between stromal, immune and malignant epithelial cells play important roles in solid tumour progression and therapeutic response. Cancer‐associated fibroblasts (CAFs) play an integral part in the tumour microenvironment (TME) and can influence many aspects of carcinogenesis including extracellular matrix (ECM) remodelling, angiogenesis, cancer cell proliferation, invasion, inflammation, metabolic reprogramming and metastasis (Kalluri, 2016). Recent studies have described roles for CAFs in mediating immune suppression and chemoresistance, establishing CAFs as novel and attractive targets for anti‐cancer therapies in advanced breast cancer (Brechbuhl et al, 2017; Cazet et al, 2018; Costa et al, 2018; Givel et al, 2018; Su et al, 2018). Despite their well‐described roles in cancer biology, CAFs remain enigmatic: limited studies suggest phenotypic heterogeneity, plasticity and functional diversity, with both tumour‐promoting and tumour‐suppressive properties (Kalluri, 2016). The multi‐faceted nature of CAFs suggests that they are comprised of diverse subpopulations, and an improved understanding of stromal heterogeneity may explain how CAFs contribute to the dynamic complexity and functional malleability of the tumour ecosystem.

CAFs of the tumour parenchyma are routinely studied using a handful of markers including α‐smooth muscle actin (α‐SMA), fibroblast activation protein (FAP), CD90 (THY‐1), platelet‐derived growth factor receptor α and β (PDGFRα and PDGFRβ), podoplanin (PDPN) and fibroblast‐specific protein 1 (FSP‐1, also named S100A4) (Kawase et al, 2008; Kisselbach et al, 2009; Neri et al, 2015; Kalluri, 2016). However, these markers are not necessarily co‐expressed, nor specific to the fibroblast lineage (Costa et al, 2018). For instance, α‐SMA not only identifies CAFs with a myofibroblast morphology but also serves as a general marker for myoepithelial cells and perivascular cells. α‐SMA+ cells in the breast tumour stroma can also arise from different mesenchymal lineages including resident fibroblasts, smooth muscle cells and pericytes (Ronnov‐Jessen et al, 1995). In addition, FSP‐1 is also expressed in macrophages, other immune cells and even cancer cells (Osterreicher et al, 2011). Thus, a categorical definition of cancer‐associated stromal cells and specific cell surface markers remains challenging and is urgently needed (Kalluri, 2016).

Three broad CAF subtypes have been recently profiled in mouse models of pancreatic ductal adenocarcinoma (PDAC) (Ohlund et al, 2017; Biffi et al, 2018; Elyada et al, 2019). These are characterised by a myofibroblast‐like (myCAFs) phenotype, inflammatory properties (iCAFs) and antigen‐presenting capabilities (apCAFs) (Ohlund et al, 2017; Biffi et al, 2018; Elyada et al, 2019). Although little is known about the mechanistic role and clinical relevance of iCAFs and apCAFs, an accumulation of the myCAF marker α‐SMA has been shown to correlate with poor outcome in breast and pancreatic cancer (Yamashita et al, 2012; Sinn et al, 2014). We have shown that targeting Hedgehog‐activated CAFs, which have a myofibroblast‐like phenotype in ECM regulation, results in markedly improved survival, chemosensitivity and reduced metastatic burden in pre‐clinical models of TNBC (Cazet et al, 2018). In addition, myofibroblast‐like CAFs have been shown to contribute to an immunosuppressive microenvironment by attracting T‐regulatory cells in breast and ovarian cancer (Costa et al, 2018; Givel et al, 2018). While these studies point towards the therapeutic targeting of myofibroblast‐like CAFs, genetic ablation of α‐SMA+ cells in a mouse model of PDAC resulted in more aggressive tumours and reduced mouse overall survival, indicating complex stromal functionalities across distinct tissue sites (Ozdemir et al, 2015).

Recent advances in single‐cell RNA sequencing (scRNA‐Seq) have overcome some of the technical hurdles in the investigation of cellular heterogeneity amongst complex tissues such as carcinomas. Recent patient studies have dissected the TME in head and neck squamous cell carcinomas and lung tumours, revealing new insights into stromal and immune subsets associated with disease progression (Puram et al, 2017; Lambrechts et al, 2018). Single‐cell studies of human breast cancers have been limited to immune cells, while studies in mouse models have revealed four subclasses of CAFs (Bartoschek et al, 2018). Although CAFs from human breast carcinomas have been profiled by flow cytometry and bulk sequencing, comprehensive single‐cell profiling has yet to be performed in TNBC patients (Costa et al, 2018).

TNBC is an aggressive breast cancer subtype, which is lacking in effective targeted therapeutic options. It is clinically defined by negative status for targetable hormone receptors (oestrogen receptor and progesterone receptor) or HER2 amplification. Studies in mice and humans have demonstrated that TNBC progression can be influenced by stromal cells; however, a comprehensive understanding of the stromal hierarchy is yet to be established (Brechbuhl et al, 2017; Cazet et al, 2018; Costa et al, 2018; Givel et al, 2018; Su et al, 2018). To investigate this in more detail, we performed unbiased high‐throughput scRNA‐Seq to profile the TME directly in patient tumour tissues. In addition to CAFs, we identified stromal cells with a perivascular‐like (PVL) profile, which were not necessarily associated with blood vessels. Our study focuses exclusively on CAFs and PVL cells, which we collectively refer to as “stroma”. Using orthogonal methods, we found that functions previously ascribed to CAFs as unitary cell types are actually performed by specialised subsets of stromal cells with distinct morphological, spatial and functional properties (Bartoschek et al, 2018). In addition, by sampling cells from the entire TME, we were able to predict paracrine signalling between stromal and immune cell subsets. From this, we analysed large patient gene expression datasets to show significant association between inflammatory‐like CAFs and differentiated‐PVL cells with immune evasion. Our human TNBC single‐cell datasets provide a new taxonomy of human cancer‐associated stromal cells, which we envisage can be used to further develop TME‐directed therapies.

Results

Composition of triple‐negative breast cancers at cellular resolution

We performed scRNA‐Seq on primary breast tumours collected from five patients (Fig EV1A and B) using a marker‐free approach. Fresh tissues were dissociated into single‐cell suspensions prior to single‐cell capture on the Chromium controller (10× Genomics) and sequencing on the NextSeq 500 (Illumina) (Figs 1A and EV1C). In total, we sequenced 24,271 cells, with an average of 4,854 cells per patient (Fig EV1D). A total of 28,118 genes were detected with an average of 1,658 genes expressed, and 6,215 unique molecular identifiers (UMIs) detected per cell (Fig EV1E–H). Data from individual tumours were integrated and clustered using canonical correlation analysis (CCA) in Seurat (Satija et al, 2015).

Figure EV1. Clinical pathological features and overview of single‐cell RNA sequencing metrics.

Figure EV1

  • A
    Clinical and pathological features of patient age, breast cancer subtype, tumour grade, Ki67 status, treatment history and TIL count of the 5 primary breast carcinoma samples analysed in the study.
  • B
    Representative haematoxylin–eosin (H&E)‐stained sections for each patient analysed by single‐cell RNA sequencing in this study.
  • C
    Quality control metrics as generated by the Cellranger software (10X Genomics).
  • D
    Number of cells that passed quality control and filtering using EmptyDroplets per patient.
  • E
    Number of cells that passed quality control and filtering using EmptyDroplets per cell type and patient.
  • F–H
    Boxplots showing the number of detected genes (F), UMIs (G) and proportion of mitochondrial counts (H) per cell type across all five tumour samples, respectively. Data points represent individual cells. The central band and boxes correspond with the median and lower/upper quartiles, respectively. Whiskers extend to the lowest or largest value no further than 1.5 × the interquartile range, with points outside this range considered outliers.

Figure 1. Cellular composition of five triple‐negative breast carcinomas.

Figure 1

  1. Schematic highlighting the application of our single‐cell RNA sequencing experimental and analytical workflow for primary patient tissue.
  2. UMAP visualisation of 4,986 epithelial cells aligned using canonical correlation analysis in Seurat. Cells are coloured by their cell type annotation (left) and patient of origin (right).
  3. Log‐normalised expression of markers for epithelial (EPCAM), mature luminal epithelial (ESR1), myoepithelial (KRT5, KRT14 and ACTA2) and proliferating cancer cells (MKI67).
  4. UMAP visualisation of 19,285 stromal and immune cells aligned and visualised as represented in (B).
  5. Log‐normalised expression of markers for fibroblasts (PDGFRB, THY1, COL1A1, ITGB1 and S100A4), endothelial cells (PECAM1), T cells (CD3D), CD8 T cells (CD8A), T‐regulatory cells (FOXP3), B cells (MS4A1), myeloid cells (CD68) and plasma cells (JCHAIN).
  6. Proportion of cell types across each patient.

Epithelial cells (Fig 1B and C) and stromal‐immune cells (Fig 1D and E) were first annotated through the expression of canonical cell type gene markers. This revealed four major cell states within the epithelial compartment (Fig 1B and C), including a major cluster of 4,095 cancer cells (16.9% of all cells; EPCAM +, ESR1 ) and a second cluster of 614 cancer cells with high proliferation (2.5%; MKI67 +). The remaining two smaller epithelial clusters had gene expression features consistent with normal luminal (277 cells, 0.9%; EPCAM +, ESR1 +) and myoepithelial cells (212 cells, 0.9%; EPCAM lo, KRT5 + , KRT14 + and ACTA2 +). Neoplastic or normal status of these cell clusters was confirmed by inferring genome copy number alterations over large genomic regions using InferCNV (Appendix Fig S1) (Patel et al, 2014). In addition to marker genes, stromal and immune clusters were further classified through scoring against published cell type signatures from the XCell database with an area under the curve approach (AUCell) (Fig EV2A; Aibar et al, 2017; Aran et al, 2017). In the immune compartment (Fig 1D and E), we identified 7,990 T lymphocytes (32.9%; CD3D), 1,245 B cells (5.1%; MS4A1), 1,955 plasma cells (8.1%; JCHAIN) and 4,606 myeloid cells (19.0%; CD68). Through re‐clustering of the T lymphocytes (Fig EV2B–D), we identified 175 T‐follicular helper cells (2.2%; CXCL13 and CD200), 994 T‐regulatory cells (12.4%; FOXP3 and BATF), 2,003 other CD4+ T cells (25.1% of all T cells; CD4, IL7R and CD40LG), 3,691 CD8+ T cells (46.2%; CD8A and GZMH), 605 proliferating T cells (7.6%; MKI67), 358 NK Cells (4.5%; GNLY, KLRD1, NCR1, XCL1 and NCAM1) and 164 NKT cells (2.1%; GNLY, KLRD1, NCR1 and CD3D ). The remaining cells consisted of 610 endothelial cells (2.5%; PECAM1) and two distinct clusters (with 1,409 and 320 cells, 5.8 and 1.3%, respectively) sharing the expression of common stromal markers including PDGFRB, S100A4 (FSP‐1), ITGB1 (CD29) and THY1 (CD90). These non‐endothelial nor immune cells (collectively referred to as stromal in this study) were enriched for a fibroblast cell type signature from XCell (Fibroblasts_FANTOM_1; Fig EV2A). All annotated cell types were detected in each patient, with varying proportions of cell types between cases, indicating no patient specific subpopulations in our integrated dataset (Fig 1F).

Figure EV2. Scoring of cell type signatures for cluster annotation and re‐clustering of T cells.

Figure EV2

  • A
    Featureplots highlighting the area under the curve (AUC) value for selected cell type signatures derived from various studies collated in the XCELL study. AUC values are calculated on a per cell basis using the AUCell package with default parameters. Selected signatures for fibroblasts (Fantom_1), endothelial cells (Fantom_2), B cells (Fantom_1), Plasma cells (IRIS_2), CD4+ T cells (Fantom_3), CD8+ T cells (HPCA_3), T‐regulatory cells (Fantom_3) and monocytes (Fantom_3).
  • B–D
    Re‐clustering of 7,990 T cells identifies 175 T‐follicular helper cells (2.2%; CXCL13 and CD200), 994 T‐regulatory cells (12.4%; FOXP3 and BATF), 2,003 other CD4+ T cells (25.1% of all T cells; CD4, IL7R and CD40LG), 3,691 CD8+ T cells (46.2%; CD8A and GZMH), 605 proliferating T cells (7.6%; MKI67), 358 NK Cells (4.5%; GNLY, KLRD1, NCR1, XCL1 and NCAM1) and 164 NKT cells (2.1%; GNLY, KLRD1, NCR1 and CD3D ). Shown are t‐SNE representations of re‐clustered T cells coloured by the annotated subsets (B) and patient ID (C). (D) Heatmap of the top 10 DEGs per T‐cell subset.
  • E
    AUC values for all stromal cells scored against published human and mouse pancreatic CAF signatures for the myofibroblast‐like CAF, inflammatory‐like CAF and antigen‐presenting CAF subsets (Ohlund et al, 2017; Biffi et al, 2018; Elyada et al, 2019).
  • F
    Heatmap of the stromal cluster averaged expression of genes from human pancreatic CAF signatures, as in (E).

Re‐clustering stromal cells revealed four distinct sub‐clusters in human TNBCs

Although the stromal clusters shared many common markers used to study CAFs, we further inspected their heterogeneity through re‐clustering each population (Fig 2A). Sub‐clusters were detected across multiple clustering resolutions in the FindClusters function in Seurat (resolutions 0.2, 0.3 and 0.4), with varying proportions from each patient (Fig 2B). The first cluster, which was classified as CAFs through the expression of fibroblast‐specific markers (PDGFRA, COL1A1, FAP and PDPN), formed two sub‐clusters (Fig 2A–C). The first CAF sub‐cluster was comprised of 280 cells (16.2% of all stromal; red cluster) and was classified as myofibroblast‐like CAFs (myCAFs) through the elevated expression of activated fibroblast markers (ACTA2, FAP and PDPN) and collagen‐related genes (COL1A1 and COL1A2) (Fig 2C and D; Ohlund et al, 2017; Biffi et al, 2018; Elyada et al, 2019). The second CAF sub‐cluster comprised of 1,129 cells (65.3%; orange cluster; Fig 2A–C) and resembled inflammatory‐CAFs (iCAFs) through the enrichment of the CAF chemokine marker CXCL12 (also known as SDF‐1) (Fig 2C and D; Ohlund et al, 2017; Biffi et al, 2018; Elyada et al, 2019). We next compared our CAF clusters to the subsets previously reported in pancreatic cancer (Ohlund et al, 2017; Biffi et al, 2018; Elyada et al, 2019). This was performed by scoring published CAF gene signatures across our stromal clusters using the AUCell method (Fig EV2E; Aibar et al, 2017). This revealed the enrichment of pancreatic myCAF and iCAF signatures in our breast myCAF and iCAF clusters, respectively, suggesting similar phenotypes likely exist across both tissue sites (Fig EV2E). While the signatures were largely conserved, a number of human PDAC CAF markers were detected in opposing cell types, for example IL6 was expressed by PVL cells rather than iCAFs (Fig EV2F). No clusters showed any particular enrichment for signatures of antigen‐presenting CAFs, potentially because they are a rare cell type that was not sampled, or are unique to pancreas tumours (Fig EV2E).

Figure 2. Stromal landscape of TNBCs reveals four subpopulations of cancer‐associated fibroblasts and perivascular‐like cells.

Figure 2

  1. t‐SNE representation of the four subclasses of cancer‐associated fibroblasts (CAFs) and perivascular‐like cells (PVL), named myofibroblast‐like CAFs (myCAFs; 280 cells), inflammatory‐like CAFs (iCAFs; 1,129 cells), differentiated‐PVL cells (dPVL cells; 122 cells) and immature‐PVL cells (imPVL cells; 198 cells).
  2. Plot showing the proportion of the four stromal subsets across all five patients.
  3. Log expression of parenchymal gene markers commonly associated with CAFs and perivascular cells.
  4. Cluster averaged log‐normalised expression of the top 300 differentially expressed genes between the four stromal subsets with stromal‐related genes of interest annotated. Expression values are scaled per cluster.
  5. Circle histogram plot of the top gene ontologies enriched in each of the four stromal subsets, with pathways broadly grouped for ECM, development and signalling, muscle contractile features and angiogenesis and adhesion. Scale bar represents the −log10 q‐value for the enrichment of individual GO terms, as determined using ClusterProfiler.

In contrast, the second stromal cluster was enriched for perivascular markers, including genes associated with pericytes and smooth muscle cells (ACTA2, MCAM, CAV1, TAGLN, MYH11, MYLK and RGS5; Fig 2C and D; Hamzah et al, 2008). MCAM (also known as CD146) has shown to be a robust marker to differentiate perivascular cells from fibroblasts in human tissues (Li et al, 2003; Middleton et al, 2005; Covas et al, 2008; Crisan et al, 2008). PVL cells were further classified as either differentiated‐PVL (dPVL; 122 cells in light blue, 7.1%), characterised through the enrichment of myogenic differentiation genes (TAGLN, MYH11 and MYLK), or immature‐PVL (imPVL; 198 cells in dark blue; 11.5%), characterised by the elevated expression of genes associated with an immature phenotype (PDGFRB, CD36 and RGS5) (Fig 2C and D; Song et al, 2005). To our surprise, both PVL subsets were also enriched for the human PDAC myCAF signature, suggesting PVL cells share some similarities in gene expression profile with myCAFs (Fig EV2E and F). All four stromal subsets were detected in all five patients; however, there were differences in the proportions between the patients (Figs 2B, and EV3A and B). The stromal profiles of Patient‐1 (P1) and P2 were predominantly comprised of iCAFs, myCAFs were highest in P3, while PVL cells were highly abundant in P4 and P5 (Figs 2B, and EV3A and B).

Figure EV3. TNBC stromal subsets per patient, pseudotime trajectory and validation of stromal cultures.

Figure EV3

  • A
    t‐SNE representation of the four subclasses of cancer‐associated fibroblasts (CAFs) and perivascular‐like (PVL) cells by patient ID.
  • B
    Differential gene expression heatmaps showing the composition of the four stromal subsets in Patient‐1 to Patient‐5.
  • C, D
    Pseudotime trajectory of CAFs from Patient‐2 (C) and PVL cells from Patient‐1 (D) using the Monocle method annotated by the subsets derived from Seurat based re‐clustering. (C) Increased expression of marker genes such as ACTA2, COL1A1, FAP, TAGLN and THY1 as cells move throughout pseudotime indicate that iCAFs transition towards myCAFs. In contrast, iCAF marker CXCL12 decreases as cells move throughout pseudotime. (D) Increased expression of marker genes such as ACTA2 and MYH11 as cells move throughout pseudotime indicate that imPVL cells transition towards dPVL cells. In contrast, imPVL cell markers CD36 and RGS5 decreases as cells move throughout pseudotime.
  • E
    Four technical replicates of CAF sorting of myCAF and iCAF fractions using FAPHIGH and FAPnegative/LOW, respectively.
  • F
    FACS analysis showing the co‐expression of CD90 (THY1) with FAPHIGH CAFs. This is represented through overlaying CD90 signal over a replicate sample used for FACS as in (E) (top) and through a contour plot of FAP vs. CD90 signal (bottom; n = 4 biological replicates).
  • G
    Quantitative PCR validation of FAP, ACTA2, CXCL12, EGFR, PDGFRA and PDGFRB in FAPHIGH and FAPnegative/LOW CAF sorted fractions (n = 4 biological replicates). Error bars represent standard error of the mean (SEM). The ACTB housekeeping gene was used for normalisation. P‐values were determined using pairwise comparison with Student's t‐test with P‐values denoted by asterisks: *P < 0.05. FAP and ACTA2 are enriched in FAPHIGH sorted myCAF‐like fractions, while CXCL12 and EGFR are enriched in FAPLOW sorted iCAF‐like fractions.

Next, we identified differentially expressed genes (DEGs) between the four subsets using the MAST method, which compares each subset against all other subsets (Finak et al, 2015). This identified a total of 894, 610, 258 and 289 DEGs (log fold change threshold of 0.1, P‐value threshold of 1 × 10−5 and FDR threshold of 0.05) by myCAFs, iCAFs, dPVL and imPVL cells, respectively (Fig 2D; Dataset EV1). We performed gene ontology (GO) analysis using the top 250 DEGs from each subset using the clusterProfiler tool (Fig 2E; Dataset EV2) to determine the pathway level differences driving stromal heterogeneity (Yu et al, 2012). This revealed an enrichment of collagen biosynthesis and ECM regulatory pathways in myCAFs, which included fibrillar collagen genes COL1A1 and COL1A2 and ECM remodelling metalloproteinases MMP1 and MMP11 (Fig 2D and E). We identified the enrichment of developmental signalling pathways and chemotactic regulation in iCAFs, including soluble factors such as IGF1, FIGF and PDGFD, and the chemokines CXCL12 and CXCL13 (Fig 2D and E). Stem cell markers including ALDH1A1 and ID2, and the growth factor receptor EGFR were also upregulated in iCAFs (Fig 2D). Within the PVL cells, the dPVL cluster was enriched for pathways related to the muscle system and contractility, while the imPVL cluster was enriched for pathways related to focal and substrate adhesion, including the integrin molecule ITGA1 (Fig 2D and E). No stromal clusters expressed canonical markers for proliferation, including MKI67 and AURKA. As many of the genes and pathways identified were related to cell activation and contractility, we hypothesised that the stromal sub‐clusters resembled cell differentiation stages rather than distinct subpopulations. Cell trajectories were examined using the Monocle method, which revealed subsets of CAFs and PVL cells distributed across pseudotemporal space (Fig EV3C and D; Qiu et al, 2017). For example, COL1A1, ACTA2 and CXCL12 expression transitioned throughout CAF differentiation (Fig EV3C), while CD36, RGS5 and MYH11 transitioned throughout PVL differentiation (Fig EV3D). Our findings indicate that the stroma in TNBC is comprised of four major transcriptional states related to cell differentiation, which branch from the two major fibroblast and perivascular‐like lineages.

Transcription factor pathways enriched across stromal subclasses

We next sought to investigate if gene regulatory networks could further explain the underlying heterogeneity in stromal subpopulations. To examine the activity of CAF and PVL transcription factors (TFs), we applied the SCENIC method to build gene regulatory networks from scRNA‐Seq data and identify activating cis‐regulatory elements (Aibar et al, 2017; Moerman et al, 2018). Through applying this to the normalised stromal gene expression matrix, SCENIC identified a total of 190 activated TFs, of which 166 were identified to be significantly different across the four stromal subsets (one‐way ANOVA; P‐value threshold of 1 × 10−5). We focused on the top 50 strongest candidates based on their average AUC values (Fig 3; Appendix Fig S2).

Figure 3. Polarised gene regulatory states between cancer‐associated fibroblasts and perivascular‐like subclasses.

Figure 3

  1. Polarised gene regulatory states underlying stromal subclasses. Heatmap shows the averaged regulon activity (area under the curve; AUC) for the top 50 highest TFs regulons as estimated using SCENIC. All regulons are statistically enriched across the four subsets (P < 1 × 10−5 One‐way ANOVA). Heatmap is clustered using Euclidean distance and complete linkage.
  2. Candidate transcriptional drivers of each CAF and PVL subset. Violin plots showing the log‐normalised gene expression (left) of the TF and its respective AUC regulon activity (right). TFs ZEB1 and FOXP1 enriched in myofibroblast‐like CAFs, EGR2 and TCF7L2 enriched in inflammatory‐like CAFs, MEF2C enriched in PVL cells and NR2F2 enriched in immature‐PVL cells.

In examining the top candidate TFs (Fig 3; Appendix Fig S2), ZEB1 and FOXP1 were enriched in myCAFs. A recent study inhibiting stromal ZEB1 in the PyMT mouse model of breast cancer reduced tumour growth, invasion and impaired ECM deposition (Fu et al, 2019). In other tissue contexts, FOXP1 was reported to regulate the fibrotic potential of stromal cells via the Wnt/beta‐catenin pathway, including myCAF marker genes such as ACTA2 and COL1A1 (Shao & Wei, 2018). Known roles of such TFs are consistent with the predicted ECM‐regulating phenotype of myCAFs. The EGR2 and TCF7L2 regulons were enriched in iCAFs (Fig 3). EGR2 is known to regulate the expression of immunomodulatory molecules in mesenchymal stem cells (Tamama & Barbeau, 2012). The TCF family including TCF7L2 (also known as TCF4) are Wnt‐regulated TFs that are highly expressed during early development (Hrckulak et al, 2016). As iCAFs also expressed the stem cell markers ALDH1A1 and ID2, we hypothesised that they resemble a stem or progenitor‐like state.

For PVL cells, MEF2C was a highly enriched driver in both subsets (Fig 3). Myocyte enhancer factor 2 (MEF2) is a well‐defined regulator for the development of vascular smooth muscle cells (Creemers et al, 2006; Gordon et al, 2009). We identified KLF2 enriched in dPVL cells, and NR2F2 enriched in imPVL cells (Fig 3). KLF2 is required for smooth muscle cell migration and maturation in blood vessel formation, consistent with the predicted differentiation state of dPVL cells (Wu et al, 2008). Furthermore, NR2F2, also known as COUP‐TFII, is highly expressed by myogenic precursors and is known to inhibit muscle development, which is consistent with the predicted immature state of imPVL cells (Lee et al, 2017). In summary, we identified unique and novel TF drivers in each of the four stromal subclasses, providing further insights into the transcriptional drivers underlying stromal heterogeneity.

Validation of stromal subsets in primary breast cancer tissue

To validate the existence of the four stromal subclasses described above in TNBC patient tissue (Fig 4A), we first performed fluorescence‐activated cell sorting (FACS) isolation on scRNA‐Seq matched human tissue sections (Fig 4B). Our gating strategy used EPCAM, CD45 and CD31 as negative markers to exclude epithelial, immune and endothelial cells, respectively (Fig 4B). We additionally used PDGFRβ to positively select all stromal populations and avoid contaminations from cancer stem cells and breast myoepithelial cells which have low EPCAM expression (Prater et al, 2014; Hyun et al, 2016). Based on our initial scRNA‐Seq findings, we determined PDGFRα and CD146 (MCAM) as good markers to discriminate CAFs and PVL cells, respectively. Following the initial isolation and culturing of CAFs (PDGFRβ+/PDGFRα+/CD146) and PVL cells (PDGFRβ+/PDGFRα/CD146+), we next performed simultaneous FACS analysis of additional stromal markers to validate the presence of the four stromal subsets in culture. We show that myCAFs and iCAFs could be distinguished by FAPHIGH/CD90HIGH and FAPLOW/CD90LOW expression, respectively (Figs 4B, and EV3E and F), while imPVL cells could be discriminated from dPVL cells by CD36+ expression (Fig 4B). We validated the gene expression of cultured bulk and sorted CAF fractions using quantitative real‐time PCR (qPCR) (Fig EV3G). As controls, PDGFRA and PDGFRB were expressed in both the FAP‐high and FAP‐low populations. Consistent with the FACS sorting strategy and scRNA‐Seq findings, FAP and ACTA2 were enriched in FAPHIGH(myCAF) sorted cells, while CXCL12 and EGFR were enriched in FAPLOW (iCAF) sorted cells (Fig EV3G). We next performed immunofluorescence (IF) to further validate additional markers and explore potential morphological differences. Here, ⍺‐SMA expression was used to identify myCAFs from iCAFs (Fig 4C; Appendix Fig S3), and CD36 to distinguish imPVL from dPVL cells (Fig 4D; Appendix Fig S3). From our observations, myCAFs and dPVL cells had a more elongated morphology in comparison to iCAFs and imPVL cells (Fig 4C; Appendix Fig S3), which is consistent with the predicted differentiation state of each subset. Importantly, we defined a novel gating strategy that allowed us to purify the four stromal subsets for subsequent in vitro functional characterisation.

Figure 4. Morphological, phenotypic and spatial differences underlying stromal heterogeneity.

Figure 4

  • A
    Summary of the markers distinguishing each of the four stromal subpopulations identified in this study.
  • B
    FACS validation in matched patient tissue. Stromal cells are negatively gated for EPCAM (epithelial), CD45 (immune) and CD31 (endothelium) and positively selected for PDGFRβ. Subsequent markers PDGFRα and CD146 (MCAM) are used to distinguish CAFs and PVL cells, respectively. Expression of FAPHIGH, FAPLOW, CD36+ and CD36 is further used to define myofibroblast‐like CAFs, inflammatory‐like CAFs, immature‐PVL cells and differentiated‐PVL cells, respectively.
  • C, D
    Immunofluorescence of cultured human CAFs (C) and PVL cells (D) from passage 8, staining for CD34 (CAFs), ⍺‐SMA (myCAFs and PVL cells), CD146 (PVL cells) and CD36 (imPVL cells).
  • E, F
    Quantitative analysis of collagen abundance (E) and orientation (F) using second harmonic generation (SHG) from cellular derived matrices from stromal subsets and representative images multiphoton SHG images (n = 3 biological replicates). All SHG intensity values within each replicate were normalised to the SHG intensity of the myCAFs. Error bars represent standard deviation. Statistical significance for collagen abundance (E) was determined using unpaired two‐tailed Student's t‐test with equal standard deviation with P‐values denoted by asterisks: *P < 0.05, **P < 0.01 and ***P < 0.001. After normalisation of the orientation peak distributions (F), statistical significance was determined using a Kruskal–Wallis test with Dunn's post hoc multiple comparisons test (P‐value < 0.0001).
  • G, H
    Immunohistochemical staining of PDGFRβ, ⍺‐SMA, CD34 and CD146 in serial sections cut 4 μm apart from matched cases; Patient‐2 (G) and Patient‐4 (H). Images were aligned using FIJI. Tumour (T) regions are annotated by the solid yellow line. Co‐localisation of CD34 and CD146 was used to distinguish blood vessels (BV), where their differential staining was used to identify CAFs and PVL cells. (G) MyCAFs were found to be localised at the invasive stromal interface (insert A), while iCAFs were located at distal regions (insert B) with a high abundance of tumour‐infiltrating lymphocytes (TILs). (H) Case with a high abundance of perivascular‐like (PVL) cells in regions surrounded by blood vessels.
  • I
    Validation of disseminated PVL cells from blood vessels using co‐immunofluorescence of CD31 (red), CD146 (green) and DAPI (blue). Representative images from Patient‐4 is shown.

Source data are available online for this figure.

Myofibroblast‐like CAFs have elevated capabilities for collagen secretion and alignment

From the above results, we predicted myCAFs to be the predominant subset synthesising ECM components. To investigate this, we generated cell‐derived matrices (CDMs) to compare the ability of each human stromal subset to lay down collagen, as previously described (Cukierman et al, 2001). Purified stromal subsets were seeded and cultured onto glass for 7 days. To assess Collagen I deposition, we used second harmonic generation (SHG) microscopy, which is a sensitive method for quantifying fibrillar collagen density and orientation in an unlabelled manner. This revealed FAPHIGH myCAFs had a significant increase in SHG signal intensity compared to FAPLOW iCAFs, while PVL cells had a significantly lower SHG signal compared to both CAF subsets (Fig 4E). Higher densities of stromal collagen are a hallmark of breast tumour growth, invasiveness and risk of disease development (McCormack & dos Santos Silva, 2006; Levental et al, 2009; Huo et al, 2015). Our findings also indicate that PVL cells do not adopt fibroblast‐like traits in contributing to the collagenous TME. Further analyses of collagen fibre orientation also revealed that in addition to increased amounts, the orientation of the collagen fibres deposited by myCAFs was more uniformly aligned compared to iCAFs and PVL cells (indicated by the higher, narrow peak in Fig 4F). It has been previously shown that tumour‐associated collagen signatures (TACs), characterised by the alignment of collagen fibres, is a good factor for predicting breast cancer survival (Conklin et al, 2011). In further parallels to pancreatic cancers, FAP‐overexpressing fibroblasts have been shown to produce more parallel aligned fibres, enhancing the directionality and velocity of cancer cell invasion (Lee et al, 2011). Importantly, these data highlight that the regulation of the ECM, namely in collagen density and orientation, is mainly regulated by the specialised myCAF subsets. In summary, our findings demonstrate that the stromal subclasses described here are functionally distinct and provide a novel strategy for their purification from breast cancers.

Stromal subclasses are spatially distinct

To investigate the spatial localisation of CAFs and PVL cells, we performed immunohistochemistry (IHC) with markers identified by scRNA‐Seq on data matched patient tissues. We also wanted to validate that CAFs and PVL cells localise to the intratumoural regions of tumour specimens and are not from adjacent normal tissue or blood vessels. We stained serial 4 μm sections and identified stromal cell types using a combination of markers identified previously by scRNA‐Seq and DGE (Fig 2C): pan‐stromal (PDGFRβ+), myCAFs (PDGFRβ+, ⍺‐SMAHIGH and CD146), iCAFs (PDGFRβ+, ⍺‐SMA, CD34HIGH and CD146) and PVL cells (PDGFRβ+, ⍺‐SMAHIGH, CD34 and CD146+). As CD34 and CD146 are commonly used markers of the endothelium but are mutually exclusive in CAFs and PVL cells, we used their co‐localisation in combination with PDGFRβ staining and morphology (rings surrounding lumen) to identify endothelial cells (Middleton et al, 2005). This IHC strategy revealed regions where myCAFs (⍺‐SMAHIGH) were located in close proximity to the invasive tumour interface, while iCAFs (CD34HIGH) were relatively distal to this interface (Fig 4G). In these particular cases, no PVL cells were present in these regions and CD146 was completely restricted to blood vessels (Fig 4G). In distal regions which were enriched for iCAFs, we also identified a high co‐localisation of tumour‐infiltrating lymphocytes as identified by morphology (Fig 4G).

By definition, vascular smooth muscle cells (vSMCs) and pericytes should be localised around arteries and veins to facilitate vascular development and stability. To examine whether PVL cells are vessel‐associated, we used co‐IF staining for CD31 and CD146 to mark endothelial cells and PVL cells, respectively. We readily detected PVL cells at non‐blood vessel regions in the stroma of 4 out of 5 matched patient tissue sections (all cases except P3), including P4 where it was highly abundant (Figs 4H and I, and EV4A). Consistent with the cell proportions identified by scRNA‐Seq, PVL cells were highly abundant in P4, and lowly detected in P3 (Fig 2B). PVL cells were highly dispersed throughout the tumour stroma with no obvious co‐localisation to the invasive malignant borders. Importantly, our findings suggest that these smooth muscle‐like cells, like CAFs, can be readily identified disseminated throughout the stroma, independent of blood vessels.

Figure EV4. Immunohistochemistry and immunofluorescence of human breast cancers and normal breast tissue.

Figure EV4

  • A
    Validation of PVL cells detached from blood vessels using co‐immunofluorescence of CD31 (red), CD146 (green) and DAPI (blue) for sections from all five patients analysed in this study.
  • B, C
    H&E images (B) and immunohistochemical staining of PDGFRβ, ⍺‐SMA, CD34 and CD146 in serial sections (C) cut 4 μm apart from normal breast tissues collected from four women (N3B, N3D, N3E and N3G). Images were aligned using FIJI.
  • D
    Co‐immunofluorescence of CD31 (red), CD146 (green) and DAPI (blue) from normal breast tissue samples. CD146 is completely colocalised with CD31 at blood vessel (BV) regions, indicating that no detached PVL cells are present in normal breast tissues.

Source data are available online for this figure.

To understand how the four stromal subpopulations correspond to their normal tissue counterparts, we repeated the staining of PDGFRβ, CD34, ⍺‐SMA and CD146 on healthy breast tissue collected from four women. This revealed a high abundance of iCAF‐like fibroblasts (PDGFRβ+, ⍺‐SMA, CD34HIGH and CD146) surrounding ductal regions, while myCAF‐like fibroblasts (PDGFRβ+, ⍺‐SMAHIGH and CD146) were sparsely detected across all four cases (Fig EV4B and C). While this small panel of markers do not highlight the large transcriptional changes that may occur upon CAF activation, it does suggest that the broad iCAF‐like and myCAF‐like fibroblast subsets are resident cell types which are reactivated during carcinogenesis. For PVL cells, IHC staining of CD146 was completely restricted to blood vessels (Fig EV4C). This further confirmed using co‐IF staining for CD31 and CD146 on the normal tissue cases, where CD146 was completely restricted to CD31‐positive blood vessels (Fig EV4D). Our findings suggest that disseminated PVL cells are a distinct feature in a subset of TNBCs.

Distinct ligand–receptor expression predicts diverse stromal crosstalk to the tumour microenvironment

We next sought to investigate how spatially distinct stromal subclasses may interact with other cells within the TME. Here, we annotated our scRNA‐Seq dataset using a published set of curated human ligand–receptor pairs (Ramilowski et al, 2015). We used these annotations to construct a cell‐to‐cell communication network and predict intratumoural signalling between the four stromal clusters, and the surrounding neoplastic, immune and endothelial microenvironment. This revealed diverse stromal signalling profiles (Fig 5A), with myCAFs and iCAFs having the highest overall predicted ligand activity out of all the cell types (Fig 5B). The “interaction strength”, or the weight of each edge, was defined as the product of expression levels of the corresponding ligand and receptor. All ligand–receptor pairs with an arbitrary “interaction strength” cut‐off of 0.1 were classified as candidate signalling molecules, which revealed a total of 570, 482, 437 and 357 unique predicted interactions between stromal clusters with cancer epithelial cells, endothelial cells, myeloid cells (Appendix Fig S4A–C) and T‐cell subpopulations, respectively (Appendix Fig S4D; Dataset EV3).

Figure 5. Predicted stromal crosstalk to cancer and immune cells.

Figure 5

Overview of the predicted stromal paracrine signalling conserved across the five TNBC patients. The scRNA‐Seq dataset was annotated by ligand–receptor pairs as curated in Ramilowski et al (2015).
  • A
    Circos plot summary of the stromal ligand–receptor interactions. Outer sectors are weighted according to the number of annotated ligand–receptor interactions per cell type. Links between sectors are weighted according to the “Interaction Strength”, calculated as a product of ligand and receptor expression. Links are coloured by the respective stromal subsets; myCAFs (red), iCAFs (orange), dPVL cells (blue) and imPVL cells (light blue).
  • B
    Summary of the total ligands and receptors annotated per cell type.
  • C–E
    Imputed gene expression of selected candidate signalling molecules identified between the four stromal subsets and malignant (C) epithelial, (D) myeloid and (E) T cells. Expression of ligands in stromal clusters is represented on the left, with cognate receptors on target cells on the right.

Consistent with the enrichment of growth factor signalling gene ontologies in iCAFs (Fig 2E), we identified a strong upregulation of crosstalk via the FGF (FGF7 and FGF10), BMP (BMP4 and BMP7), HGF and IGF1 pathways to their cognate receptors across cancer cells and endothelial cells (Fig 5C; Appendix Fig S4A and B). These factors are known to be highly expressed in breast tumours and associated with breast cancer proliferation, invasion and inducing cancer stem cell (CSC) phenotypes (Palmieri et al, 2003; Alarmo et al, 2007; de Ostrovich et al, 2008; Kuang et al, 2017). Different ligands from these pathways were also identified from myCAFs and dPVL cells, suggesting that neoplastic phenotypes could also be influenced by different stromal cells (Appendix Fig S4A). As we identified iCAFs to be located more distal to the invasive tumour interface, we hypothesise that these secreted factors function from a distance. For signalling to the endothelial compartment, iCAFs and PVL cells were both enriched for well‐characterised growth factors involved in angiogenesis (Appendix Fig S4B). Classical angiogenic pathways including VEGFs (FIGF, also known as VEGFD), PDGFs (PDGFC), IGFs (IGF1 and IGF2) and Notch signalling (DLK1) were enriched in signals emanating from iCAFs (Appendix Fig S4B). These pathways suggest that the inflammatory‐CAF phenotype is also associated with tumour neovascularisation (Samani et al, 2007; Wang et al, 2019). In addition, PVL‐derived signals were enriched for the canonical ANGPT1/ANGPT2‐TIE1 pathway, which are known stimuli that can induce the sprouting of new vessels during the formation of new endothelial tubes (Fagiani & Christofori, 2013).

Given the reported immunoregulatory properties of mesenchymal cells (Costa et al, 2018; Givel et al, 2018), we next focused on the signalling of stromal cytokines and checkpoint molecules to immune populations. Here, we identified an enriched interaction between iCAFs and myeloid cells via the complement cascade activation interaction C5‐C5AR1 (Fig 5D; Appendix Fig S4C). C5 activation in the TME acts as a chemotactic factor for the recruitment of immunosuppressive myeloid cells to suppress T‐cell activities (Markiewski et al, 2008). In addition, myCAFs and iCAFs were enriched for TGFB1‐TGFBR1 and TGFB2‐TGFBR1 interactions with myeloid cells, respectively (Fig 5D; Appendix Fig S4C). As TGFβ‐activated myeloid cells have been shown to enhance breast cancer progression and metastasis in vivo, it suggests that both CAF subsets could influence myeloid phenotypes (Li et al, 2012). While the TGFBR1 receptor was predominantly enriched on myeloid clusters, it is worth noting that its expression was also detected by cancer and endothelial clusters (Fig 5D). Although PVL cells had lower ligand expression profiles compared to CAFs, several immunomodulatory cytokine interactions were predicted between PVL cells and myeloid cells, including an enrichment of the CCL8‐CCR1, IL6‐IL6R and CCL2‐CCR1 pathways (Fig 5D; Appendix Fig S4C). CCL2 produced by the microenvironment in other cancers has been shown to be essential for the recruitment of T‐Regs and tumour‐associated macrophages, supporting an additional role of PVL cells in recruiting immunosuppressive cells (Chang et al, 2016).

For the signalling to the lymphocyte compartment, iCAFs had a strong upregulation of the chemo‐attractant pathways CXCL12‐CXCR4 and CXCL13‐CXCR5 with T‐ and B cells (Fig 5E; Appendix Fig S4D). CAF‐derived CXCL12 has been shown to recruit and regulate the activity of CD4+/CD25+ T‐Regs in breast cancers, suggesting iCAFs may have a direct role in recruiting immunosuppressive populations (Costa et al, 2018; Givel et al, 2018). CXCL12 and CXCL13 signalling axes have also been shown to mediate lymphocyte recruitment to tertiary lymphoid structures (TLS) (Sautes‐Fridman et al, 2016). MyCAFs were also enriched for secreted immunoregulatory molecules and checkpoints including CXCL9‐CXCR3, CXCL11‐CXCR3 and CD274PDCD1 (PDL1‐PD1) with T cells (Fig 5E; Appendix Fig S4D). Lastly, only few candidates were identified between PVL cells with T cells, including the enrichment of CCL21‐CCR7, which is associated with immune tolerance in favour of tumour progression (Fig 5E; Appendix Fig S4D) (Shields et al, 2010). It is evident from our signalling predictions that diverse immunoregulatory molecules are expressed in the stroma, highlighting that immune evasion can be regulated by distinct stromal subpopulations in TNBC.

Inflammatory‐CAFs associated with cytotoxic T‐lymphocyte dysfunction

To further investigate the influence of stromal subsets on immune evasion, we explored the association between distinct stromal gene signatures and immune content in three large independent TNBC patient cohorts with associated bulk gene expression data (METABRIC, GSE8812 and GSE21653) (Sabatier et al, 2011; Curtis et al, 2012; Jezequel et al, 2015). Using a computational model called tumour immune dysfunction and exclusion (TIDE), we examined two primary mechanisms of immune evasion. The first examines factors driving the “dysfunction” of cytotoxic T lymphocytes (CTLs), while the second examines factors preventing the infiltration of CTLs to the tumour, known as “exclusion” (described below) (Jiang et al, 2018). TIDE first estimates CTL levels in each sample within a bulk sequencing cohort using the averaged expression of CTL‐specific genes (See Materials and Methods). Patients are then stratified into high and low CTL groups based on comparisons to the mean CTL level within the cohort. For dysfunction, we then evaluated whether gene signatures from each of the stromal subsets influence the beneficial effect of CTL levels on patient prognosis (Jiang et al, 2018). This analysis revealed a strong enrichment of genes from the iCAF signature that were significantly associated with CTL dysfunction in all three bulk tumour cohorts (Fig 6A). In patients with a low iCAF dysfunction signature level, a significant survival benefit was associated with high CTL levels (Figs 6B and EV5A). This is consistent with previous clinical observations in TNBCs where lymphocyte infiltration is a robust prognostic factor for improved disease‐free survival and overall survival benefit (Loi et al, 2013). Remarkably, in patients with a high iCAF dysfunction signature level, CTL levels were not associated with prognosis in any of the three cohorts (Figs 6B and EV5A), suggesting a role for stromal iCAFs in driving dysfunctional CTLs in TNBC. Other stromal subset signatures did not show a significant enrichment of prognostic genes in the context of CTL dysfunction.

Figure 6. Inflammatory‐CAFs and differentiated‐PVL cells associated with immune evasion in TNBC patient cohorts.

Figure 6

Significant associations between iCAF and dPVL gene signatures with cytotoxic T‐lymphocyte (CTL) dysfunction and exclusion in multiple TNBC patient cohorts, respectively, as determined using the tumour immune dysfunction and evasion (TIDE) method.
  • A
    iCAF T‐cell dysfunction gene signature highlighting genes significantly associated with CTL dysfunction in two out of three independent patient cohorts (METABRIC, GSE21653 and GSE58812).
  • B
    Representative cohort (METABRIC) showing the prognostic value of iCAF T‐cell dysfunction signature in the context of CTLs for a total of 233 patients. Kaplan–Meier present two groups of patients, “low CTL” (blue line) and “high CTL” (red line), as estimated according to the average expression of CTL‐specific genes and stratified as compared to the mean. Tumours with low iCAF T‐cell dysfunction signatures (top) show patients with high CTL levels have a better survival outcome. In contrast, this survival benefit is lost in tumours with a high iCAF T‐cell dysfunction signature (bottom). P‐values were defined from the Cox proportional hazard (Cox‐PH) model.
  • C
    Dysfunctional CTLs detected in all five TNBC patients determined through scoring a T‐cell exhaustion signature. UMAP featureplot of the exhaustion signature across all stromal and immune cells as in Fig 1D.
  • D
    Bulk stromal signature associates with CTL exclusion. Pearson correlation was computed between all inferred CTL levels (y axis) and the respective correlation between the bulk sample and the single‐cell cluster (x axis). Signature of all stromal cells divided over all cells correlated negatively with CTL levels, while control CD4+ and CD8+ gene signatures show a positive correlation. P‐values were computed using a two‐sided t‐test for correlation and were adjusted using the Benjamini–Hochberg procedure.
  • E
    dPVL cells associated with CTL exclusion. Repeated analysis in the same manner as in (D), instead with myCAF, iCAF, dPVL and imPVL clusters divided over all stromal cells independently, highlighting that CTL exclusion is mainly driven by dPVL cells. Representative cohort GSE58812 is shown.
  • F–H
    dPVL profiles and CTL exclusion consistent in our study. (F) Patients with the highest dPVL profiles by scRNA‐Seq (P4 and P5) show the lowest tumour‐infiltrating lymphocyte (TIL) pathology counts. (G, H) Accurate quantification of CTLs (G) and representative immunohistochemistry staining for CD8 on matched patient tumour sections (H). = 5 stromal 1 mm2 regions were counted per tumour. The central band, boxes and whiskers represent the median, lower/upper quartile and min/max CTL counts per 1 mm2, respectively. P3 is shown as an example of a low dPVL profile with high CTLs. In contrast, P4 has a high dPVL profile with low CTLs. Statistical significance was determined using pairwise comparison with Student's t‐test with P‐values denoted by asterisks: *P < 0.05, **P < 0.01, ***P < 0.001 and ****P < 0.0001.
Source data are available online for this figure.

Figure EV5. Influence of inflammatory‐CAF and differentiated‐PVL subclasses on T‐cell dysfunction in TNBC patient cohorts.

Figure EV5

The association between stromal gene signatures, cytotoxic T‐cell levels and overall patient survival in all three TNBC patient cohorts examined in this study (METABRIC—233 patients, GSE21653— 84 patients and GSE58812—107 patients). METABRIC data from Fig 6B are re‐displayed here. Using the TIDE method, we show significant associations between iCAF and dPVL gene signatures with cytotoxic T‐lymphocyte (CTL) dysfunction and exclusion.
  1. Prognostic value of iCAF T‐cell dysfunction signature in three independent cohorts. Kaplan–Meier present two groups of patients, “low CTL” and “high CTL,” as estimated according to the average expression of CD8A, CD8B, GZMA, GZMB and PRF1, and stratified as compared to the mean. The top and bottom panels show tumours with low and high iCAF T‐cell dysfunction signature, respectively. Sample divided according to iCAF T‐cell dysfunction signature shows significant association with CTL levels and survival outcome. P‐values were defined from the Cox proportional hazard (Cox‐PH) model.
  2. Bulk stromal signature associates with CTL exclusion in TNBC cohorts from the METABRIC, GSE21653, GSE58812 and The Cancer Genome Atlas (TCGA) studies. Pearson correlation was computed between all inferred CTL levels (y axis) and the respective correlation between the bulk sample and the single‐cell cluster of interest (x axis). Signature of all stromal cells divided over all cells correlated negatively with CTL levels, while control CD4+ and CD8+ gene signatures show a positive correlation. P‐values were computed using a two‐sided t‐test for correlation and were further adjusted using the Benjamini–Hochberg procedure.
  3. dPVL cells associated with CTL exclusion. Repeated analysis in the same manner as in (B), instead with the averaged expression signature of each stromal subset over all stromal cells, highlighting that CTL exclusion is mainly driven by dPVL cells in three out of four cohorts.

To investigate whether CTLs in each patient were indeed dysfunctional, we scored a published T‐cell exhaustion gene signature in our CD8+ T‐cell populations from each patient using an AUC approach (Fig 6C; Blackburn et al, 2009). This gene set includes canonical markers of exhausted T cells including PDCD1 (PD‐1), LAG3, TIGIT and CTLA4 (Blackburn et al, 2009). This revealed heterogeneity for exhausted CD8+ T‐cell populations in all five patients (Fig 6C), with P2 and P4 having the highest average exhausted gene signature score. In contrast, the exhaustion signature was not enriched in any other cell population with the exception of the myeloid cell cluster (Fig 6C). Myeloid cells, which can include tumour‐associated macrophages and myeloid derived suppressor cells, are known to hold immunosuppressive properties and can also express inhibitory molecules associated with T‐cell suppression (Jiang et al, 2015).

Differentiated‐PVL cells associated with cytotoxic T‐lymphocyte exclusion

We next explored whether particular stromal subsets were associated with CTL exclusion, a cold “immune‐desert” phenotype with “low CTL” activity. This was examined using the Pearson correlations between all CTL levels and the respective correlation score between the bulk tumour sample and the single‐cell cluster of interest. The averaged expression of all genes from the single‐cell cluster are referred to as a signature in this section. Previous studies have reported an association between CAFs and CTL exclusion (Jiang et al, 2018). Consistent with this, the collective bulk signature from all stromal cells correlated negatively with CTL levels in four TNBC patient cohorts (Figs 6D and EV5B). As a positive control, CD4+ and CD8+ T‐cell signatures from our dataset positively correlated with CTL levels as expected (Figs 6D and EV5B). To investigate if this was predominantly driven by one stromal subset, we repeated this analysis with the averaged gene expression of myCAFs, iCAFs, dPVL and imPVL clusters independently (Figs 6E and EV5C). This revealed that dPVL cells were the only subset with a significant negative correlation with CTL level in three of four cohorts, suggesting they are the primary subset associated with T‐cell exclusion (Figs 6E and EV5C). To further explore this correlation in our five patients, tumour‐infiltrating lymphocytes (TILs) and CTLs were scored in matched tumour sections by a specialist breast pathologist. Total TILs were estimated using standard H&E‐based assessment (Fig 6F), while stromal CTLs were accurately quantified by CD8 staining and scored as previously described (Fig 6G; Salgado et al, 2015). The latter measurements were performed as TILs can also be comprised of non‐CTL populations including CD4+ T cells, T‐Regs and B cells. TILs and CTL scoring revealed that 2 out of 5 patients (P4 and P5) had very low CTL infiltration (< 5% TILs and < 50 CD8+ T cells per 1 mm2), whereas P3 had a very high infiltration (> 70% TILs and > 200 CD8+ T cells per 1 mm2) (Fig 6F–H). In support of dPVL cells as drivers of T‐cell exclusion, only 4% of stromal cells from P3 were annotated as dPVL cells, while P4 and P5 had the two largest proportions of dPVL profiles with 35.5 and 26.8%, respectively (Fig 2B). Furthermore, no disseminated PVL cells could be readily detected in P3 using co‐IF (Fig EV4A). While small numbers, our findings are consistent with the proposal that specialised stromal subclasses are associated with immune evasion.

Discussion

Our study describes a detailed taxonomy of human stromal subclasses in TNBC at cellular resolution. The activated tumour stroma is classically described using a broad ‘CAF’ classification. Here, we provide evidence that it is also comprised of functionally distinct perivascular‐like cells which are not necessarily associated with the endothelium. We show that stromal heterogeneity diverges to four distinct states: myofibroblast‐like, inflammatory‐like CAFs and differentiated and immature‐PVL cells. Similar to CAFs described in pancreatic ductal adenocarcinoma, we find stromal subclasses are spatially distinct, with myCAFs localised to the invasive tumour front, while iCAFs are located distal to this interface (Ohlund et al, 2017). From our systematic scRNA‐Seq of the TME, we used receptor expression on other cell types to predict diverse stromal‐immune crosstalk via an array of immunoregulatory molecules to immune populations. We go on to show that iCAF and dPVL subsets are highly associated with immune evasion in multiple independent TNBC cohorts, suggesting a clinical relevance for unique stromal subsets (Sabatier et al, 2011; Curtis et al, 2012; Jezequel et al, 2015).

Few studies have investigated the functional heterogeneity of the cancer stroma. A recent scRNA‐Seq study profiled CAFs in a mouse model of breast cancer and defined matrix, vascular‐like, cycling and developmental CAF subsets (Bartoschek et al, 2018). We did not find a cycling‐CAF cluster driven by proliferation markers (Appendix Fig S5A), likely reflecting unique features of animal models. In addition, the authors proposed mouse “developmental CAFs” to be of epithelial to mesenchymal transition origin (Bartoschek et al, 2018). In contrast, we found the expression of proposed developmental CAF markers Scrg1, Sox9 and Sox10 exclusively in cancer epithelial clusters, which are classified based on the expression of epithelial lineage exclusive keratins (Appendix Fig S5B and C). Our comparisons suggest that developmental CAFs are either unique to mouse models or are cancer cells whose expression of EPCAM is down‐regulated, which was a negative marker used for CAF isolation in these studies (Prater et al, 2014; Hyun et al, 2016).

Despite well‐characterised roles in cancer progression, the cellular origins of CAFs remain poorly understood. Our results support the notion that dispersed stromal cells can also arise from perivascular cells, likely delaminated from vascular structures. Although PVL cells clustered distinctly from CAFs and express perivascular markers including MCAM (CD146), CAV1, RGS5, MYH11 and TAGLN (SM‐22‐Alpha), they also expressed an array of markers commonly used to classify CAFs, including ACTA2 (⍺‐SMA), PDGFRB, THY1 (CD90), S100A4 (FSP‐1) and ITGB1 (CD29) (Hamzah et al, 2008; Costa et al, 2018). Similar PVL subsets were identified in a previous mouse model of breast cancer (Bartoschek et al, 2018). The authors defined these cells as “vascular‐like CAFs” through the expression of vessel development markers such as CD146 (Bartoschek et al, 2018). Although the authors hypothesised that vascular‐like CAFs are derived from perivascular cells such as pericytes, the concept of pericyte‐to‐fibroblast transition has been debated (Hosaka et al, 2016). Our findings from functional assays suggest PVL cells do not possess the defining fibroblast trait of collagen deposition and remain phenotypically distinct from the fibroblast lineage (Fig 4E).

The functional role of perivascular cells in breast cancer is poorly understood. A very early study found that 4 out of 10 breast tumours showed substantial infiltration of vascular smooth muscle cells based on staining for markers including ⍺‐SMA, smooth muscle myosin and calponin (Ronnov‐Jessen et al, 1995). This finding went without further exploration until this manuscript, where we validate their existence using state‐of‐the‐art scRNA‐Seq and staining of CD146 in matched patient tissue. From our TF analysis, we predict the MEF2 regulon to be a strong activating TF of the PVL subclass. MEF2C is a well‐defined regulator for establishing vSMCs during development, highlighting a likely vSMC origin of the described PVL cells (Creemers et al, 2006; Gordon et al, 2009). As observed during wound healing, we hypothesise that vSMCs could be stimulated by malignant factors or mitogens, allowing them to migrate from the vessel basement membrane into the stroma (Banerjee et al, 2006; Louis & Zahradka, 2010). This is further supported by in vitro studies showing that breast cancer derived PDGFs can induce the recruitment and migration of vSMCs (Banerjee et al, 2006). As perivascular cells play an important part during angiogenesis and blood vessel stability, it is also possible that their displacement in tumours is stimulated by, or a driver of, dysregulated angiogenesis or hypoxia. Although it is yet to be studied in the context of perivascular cells, studies have reported that the imPVL marker CD36 is enriched in normal tissue regions and is associated with good survival outcome in breast cancer (DeFilippis et al, 2012). However, the origin and functional role of PVL subpopulations remain to be defined by future studies. The staining of CD146 exclusively associated with blood vessels of normal breast tissue suggests that detached PVL cells are a distinct feature of breast cancers.

Importantly, our findings suggest that previous studies characterising CAFs with a small number of markers have likely also studied PVL cells. For example, subsets discriminated by CD146 have been characterised in endocrine‐resistant breast cancers (Brechbuhl et al, 2017). Patients with a CD146+ stroma demonstrated good responses to tamoxifen therapy through the maintenance of oestrogen receptor (ER)‐dependent proliferation in cancer cells. Our findings suggest that PVL cells rather than CAFs are a biomarker for ER‐directed therapeutic response in ER‐positive breast cancers (Brechbuhl et al, 2017), a prediction that requires more detailed validation. Another elegant study reported a subset of chemoresistance‐promoting CAFs, marked by ⍺‐SMA+, GPR77+ and CD10+ expression (Su et al, 2018). Due to the shared expression of ⍺‐SMA between myCAFs and PVL cells, our findings also raise the question whether PVL cells could also contribute to chemoresistance in a subset of patients (Su et al, 2018). Although we did not find an enrichment of GPR77+ CD10+ ⍺‐SMA+ cells in any CAF subclasses, this may be explained by the treatment status of our samples.

Lastly, we found a strong enrichment of immunomodulatory pathways in the predicted signalling between stromal cells and immune cells. We identified an array of important candidates in patient tissue for future experimental studies for functional relevance. It is important to acknowledge; however, that transcript signalling predictions are not always concordant with protein expression. Although no CAF subsets in previous mouse studies were distinguishable by immunomodulatory properties (Bartoschek et al, 2018), there are several reports of predicted CAF‐immune interactions in human tissue. We found that iCAFs expressed an array of immunomodulatory molecules to cognate receptors on T cells. In other studies, CAFs have been implicated in the recruitment and activity of T‐Regs through the regulatory molecules CXCL12, CD40, B7H3, DPP4 and CD73 (Costa et al, 2018). In addition, iCAFs also expressed several molecules known to regulate myeloid cells, including complement C5, IL6 and TGFβ (Markiewski et al, 2008; Li et al, 2012). Myeloid cells, including tumour‐associated macrophages and myeloid derived suppressor cells, are well characterised in contributing to an immunosuppressive TME. Most importantly, gene signatures generated from iCAFs were strongly associated with CTL dysfunction in TNBC patient cohorts. We also report a novel dPVL stromal subset strongly associated with CTL exclusion. We identified an enrichment of dysfunctional/exhausted T cells which correlated with their respective stromal profiles, though we acknowledge that our study consists of small patient numbers. In patients with the highest dPVL profile, we found consistently low TIL and CD8 counts in matched pathology. Considering the proposed origin of detached PVL cells from the vascular structure, we hypothesise that this may be related to reduced lymphocyte extravasation from dysregulated tumour blood vessels. In support of this, previous studies restoring vascular integrity in tumours through vessel normalisation and increased perivascular coverage find an influx of CD8+ T cells in tumour tissue (Hamzah et al, 2008; Johansson‐Percival et al, 2015). In addition, signalling between CD4+ T cells and pericytes have also been reported to play a reciprocal role in tumour vessel normalisation (Tian et al, 2017). It is possible that the association between dPVL cells and CTL exclusion in patient cohorts reflect tumours with low vascular integrity and may act as a biomarker for patients suitable for vessel normalisation therapeutic strategies.

While our findings point to the targeting of stromal cells, future work investigating the transcriptional changes in stromal cells between healthy breast tissue and cancer is required to understand the stromal states that are cancer‐specific vs. reactivated resident cell types. In support of the latter possibility, a recent study showed that there are minimal proteomic differences between normal fibroblasts and CAFs in prostate cancer models (Nguyen et al, 2019). We find that iCAF‐ and myCAF‐like fibroblasts exist in cancer‐free normal breast tissue. It is important to note that desmoplasia is often observed in cancer‐free tissues, particularly in high risk women with high mammographic density (Li et al, 2005). This can be influenced by several physiological factors such as weight, pregnancy and menopausal status, highlighting important factors that need to be considered in future projects examining the normal breast tissue microenvironment such as the human cell atlas project (Li et al, 2005). These differences may exist from distinct epigenetic states between CAFs and normal fibroblasts, indicating another layer of complexity that remains to be explored in the four breast cancer stromal subsets identified in our study (Pidsley et al, 2018). The integration of future assays combining scRNA‐Seq with chromatin states will be important in elucidating the epigenetic regulation of cancer‐associated stromal cells. Identifying specific activation markers in comparison to healthy tissue is also an important prerequisite for the development of precise cancer therapeutic strategies with low toxicities. Future in vitro and in vivo studies will be important in understanding how stromal cells are dynamically reprogrammed and how the subclasses described here restrain or promote tumour growth and invasion.

Clinical trials for mainstream immune checkpoint therapies including anti‐PDL1 have shown limited efficacy in the treatment of advanced TNBC. This hints at alternate mechanisms of immune evasion and novel therapeutic strategies are desperately needed to improve immunotherapy for TNBC. Our findings suggest that co‐targeting stromal subpopulations could elicit a more effective immune response in a subset of patients through inhibiting CTL dysfunction and exclusion. This remains to be experimentally tested. In conclusion, we have comprehensively profiled four functionally distinct stromal subclasses in human TNBC, not previously described in breast cancer, mouse models or other cancer types. Importantly, we described subsets of CAFs and PVL cells with clinical relevance, presenting as candidates to further investigate. While our dataset captures a majority of the expected cell types from the TME, certain cell types such as adipocytes are under‐represented due to biases from standard tissue dissociation protocols. Integration of alternative methods such as single‐nuclei sequencing and spatial transcriptomics in future cancer cell atlas studies will be crucial for a comprehensive understanding of the TME. Our findings in only five patients also highlight the potential of applying scRNA‐Seq methods to larger scale patient cohorts for the identification of new disease relevant cell states and their gene expression features.

Materials and Methods

Ethics approval and consent for publication

Patient tissues used in this work were collected under protocols x13‐0133, x16‐018 and x17‐155. HREC approval was obtained through the SLHD (Sydney Local Health District) Ethics Committee; RPAH (Royal Prince Alfred Hospital) zone, and the St Vincent's hospital Ethics Committee. Site‐specific approvals were obtained for all additional sites. Written consent was obtained from all patients prior to collection of tissue and clinical data stored in a de‐identified manner, following pre‐approved protocols. Consent into the study included the agreement to the use of all patient tissue and data for publication.

Tissue dissociation

Fresh surgically resected tissue was washed with RPMI 1640 (Thermo Fisher Scientific) and minced with scissors. Samples were enzymatically dissociated using Human Tumor Dissociation Kit (Miltenyi Biotec) according to manufacturer's protocol (https://www.miltenyibiotec.com/AU-en/products/macs-sample-preparation/tissue-dissociation-kits/tumor-dissociation-kit-human.html#gref). Following incubation, the sample was then resuspended in RPMI 1640 and filtered through MACS® SmartStrainers (70 μM; Miltenyi Biotec), and the resulting single‐cell suspension was centrifuged at 300 × g for 5 min. Red blood cells were lysed with Lysing Buffer (Becton Dickinson) for 5 min, and the resulting suspension was centrifuged at 300 × g for 5 min. Viability was assessed to be > 80% using Trypan Blue (Thermo Fisher). Viability enrichment was performed using the EasySep Dead Cell Removal (Annexin V) Kit (StemCell Technologies) as per manufacturers protocol. Dissociated cells were resuspended in a final solution of PBS with 10% foetal calf serum solution prior to loading on the 10× Chromium platform. Previously reported stress pathways associated with tissue dissociation, such as FOS/JUN and reactive oxygen species, were not identified as a part of the candidate genes, pathways and transcription factor networks differentiating the stromal subsets in our study (O'Flanagan et al, 2019).

Single‐cell RNA sequencing on the 10× Chromium platform

High‐throughput droplet‐based scRNA‐Seq was performed on the single‐cell suspensions using the Chromium Single Cell 3’ v2 Library, Gel Bead and Multiplex Kit and Chip Kit (10× Genomics) according the to manufacturer's instructions, with a target of 5,000 cells per lane. scRNA‐Seq libraries were sequenced on the Illumina NextSeq 500 platform with pair‐end sequencing and dual indexing according to the recommended Chromium platform protocol; 26 cycles for Read 1, 8 cycles for i7 index and 98 cycles for Read 2.

Data processing

Sample demultiplexing, reference mapping, barcode processing and gene counting were performed using the Cell Ranger Single Cell Software v2.0 (10× Genomics). Reads were aligned to the GRCh38 human reference genome. Raw count matrices were exported and filtered using the EmptyDrops package in R (Lun et al, 2019). EmptyDrops distinguishes “real” barcodes from “noise” by calculating deviations of each cell against a generated ambient background RNA profile. Filtered barcodes were then processed using the Seurat v2.0 package in R (Satija et al, 2015). Additional conservative cut‐offs were further applied based on the number of genes detected per cell (> 200), and the percentage of mitochondrial unique molecular identifier (UMI) counts (< 10%). Individual Seurat objects were then integrated using the canonical correlation analysis (CCA) function RunMultiCCA according the developer guidelines (Butler et al, 2018). The top 2,000 most variable genes from each sample were combined for CCA vector identification. The first 20 CC dimensions were used for the alignment of subspaces and UMAP projection.

Cluster annotation

Cell clusters were annotated using canonical cell type markers for epithelial (EPCAM), myoepithelial (EPCAM LO, ACTA2, KRT5 and KRT14), basal (KRT5 and KRT14), mature luminal (ESR1), endothelial (PECAM1), immune (PTPRC), T cells (CD3D, CD8A and CD4), T‐regulatory cells (FOXP3), B cells (MS4A1), plasmablasts (JCHAIN), myeloid cells (CD68) and stromal cells (PDGFRB and COL1A1). Malignant epithelial cells were distinguished from entrapped normal epithelial cells by inferring copy number variations using the inferCNV package as previously described (Patel et al, 2014). In addition, an area under the curve (AUC) approach using published cell type signatures from the XCELL database was performed using AUCell (Aibar et al, 2017; Aran et al, 2017). AUCell scores single cells with input gene signatures and analyses its activity and distribution across the entire dataset to explore the relative expression of the gene set of interest. AUCell utilises raw gene counts and thus is independent of normalisation bias. CAFs, PVL cells and T cells were independently re‐clustered using the Seurat v3 method. Re‐clustering was performed across resolutions 0.2, 0.3, 0.4 and 0.5 to identify stable clusters.

Differential gene expression and pathway enrichment

The MAST method was used to perform differential gene expression through the FindAllMarkers function in Seurat (log fold change threshold of 0.1, P‐value threshold of 1 × 10−5 and FDR threshold of 0.05) (Finak et al, 2015). The top 250 DEGs from each cluster were then passed on to the ClusterProfiler package for functional enrichment (Yu et al, 2012). The compareCluster function was used with the enrichGO databases CC, MF and BP sub‐ontologies using the human org.Hs.eg.db database.

Pseudotime cell trajectory analysis

The Monocle 2 method was applied to infer cell trajectories for CAFs and PVL cells using default parameters, as recommended by developers’ (Qiu et al, 2017). CAFs from Patient‐2 and PVL cells from Patient‐1 were extracted for Monocle analysis due to adequate cell numbers and representations of each respective subset. Gene expression matrices from each cell type were first exported from Seurat into Monocle 2 to construct a CellDataSet. Variable genes defined by the differentialGeneTest function (q‐val cut‐off < 0.001) were used for cell ordering and dimensionality reduction with the setOrderingFilter and reduceDimension functions, respectively.

Gene regulatory analysis using SCENIC

Investigation of gene regulatory networks using SCENIC was performed using a faster python implementation of the tool (pySCENIC) as described by the developers on the 1,729 stromal cells (Aibar et al, 2017; Moerman et al, 2018). SCENIC explores gene regulatory networks by identifying TF co‐expression modules and binding motif enrichment. The normalised expression matrix generated from Seurat was first filtered for genes as previously described (sum of gene expression > 3 × 0.005 × 1,729) (Lambrechts et al, 2018). Genes detected in at least 0.5% of cells were kept. This resulted in 12,100 genes for pySCENIC input (Lambrechts et al, 2018). Analysis was performed using the hg38 mc9nr motif collection with a TSS ± 10 kB (hg38__refseq‐r80__10kb_up_and_down_tss.mc9nr) for the arboreto and RcisTarget steps. Gene regulons were clustered and plotted using the pheatmap function in R.

Flow cytometry and FACS isolation of stromal cells

Cell sorting and flow cytometry experiments were performed at the Garvan‐Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research. Flow cytometry was performed on a Becton Dickinson CantoII or LSRII SORP flow cytometer using BD FACSDIVA software, and the results were analysed using FlowJo software (Tree Star Inc.). FACS experiments were performed on a FACS AriaIII sorter using the BD FACSorter software. All antibody details used in this study can be found in Appendix Table S1. Cryopreserved single‐cell suspensions from Patient‐4 were thawed, washed with RPMI and incubated with an anti‐CD16/CD32 antibody (1:200, BD Biosciences #564220) in FACS buffer (PBS containing salts, 2% FBS) for 10 min to block nonspecific antibody binding. For the isolation of the different stromal subpopulations for subsequent experiments, cells were pelleted and resuspended in FACS buffer containing the following antibodies: anti‐EPCAM (1:100; BioLegend #324203), anti‐CD31 (1:100; BioLegend #303103), anti‐CD45 (1:100; BioLegend #304005), anti‐PDGFRβ (1:100; BioLegend #323605), anti‐PDGFRα (1:100; BioLegend #323507) and anti‐CD146 (1:100; BioLegend #342011) for 20 min on ice. All epithelial, immune and endothelial cells were excluded together on the FITC channel marking EPCAM, CD45 and CD31. In addition, we performed positive selection using PDGFRβ. CAFs and PVL cells were discriminated using PDGFRα and CD146, respectively. CAFs and PVL cells were isolated and cultured into dishes (Corning® LifeSciences) coated with collagen (0.15 mg/ml) in RPMI 1640 supplemented with 20% (v/v) FBS, 50 μg/ml gentamycin and 1× antibiotic/antimycotic (15‐240‐096, Gibco®) in a 5% O2, 5% CO2 incubator at 37°C. Cell sorting was repeated on cultured CAFs and PVL cells using the previously described experimental conditions with anti‐PDGFRα (1:100; BioLegend #323507), anti‐CD146 (1:100; BioLegend #342011), anti‐FAP (1:100; R&D Systems #FAB3715P‐025), anti‐CD90 (1:100; BioLegend #328113) and anti‐CD36 (1:100; BioLegend #336221). FAPHIGH expression was used to discriminate myCAFs from FAPLOW iCAFs, while CD36 expression was used to identify imPVL cells from dPVL cells.

Immunofluorescence

Primary cells were grown on glass coverslips coated with collagen in the same manner as the CDMs as described below. Media was removed and cells were rinsed with PBS for 5 min. Cells were fixed in 4% paraformaldehyde (ProSciTech) diluted in PBS for 15 min at room temperature then washed three times with PBS for 5 min. Cells were permeabilised with ice cold methanol for 10 min at −20°C followed by three 5 min PBS washes. Cells were blocked in blocking buffer (3% BSA + 0.1% Tween‐20 in PBS) for 1 hr at room temperature. Primary antibody was diluted in blocking buffer at the following dilutions: anti‐CD34 (1:100; Abcam #MA1‐10202), anti‐FAPα (1:200; Abcam #ab53066), anti‐αSMA (1:500; Abcam #ab21027), anti‐CD36 (1:100; BioLegend #336203), anti‐CD146 (1:200; Abcam #ab75769) and anti‐PDGFRβ (1:250; Abcam #ab32570). Coverslips were inverted and incubated on droplets of diluted primary antibody on parafilm in a humidified chamber overnight at 4°C. The following day cells were washed three times for 5 min in PBS. Cells were incubated with fluorescent secondary antibody (Jackson ImmunoResearch) diluted 1:500 in blocking buffer for 1 h at room temperature in a light proof container then washed two times with PBS for 5 min. Nuclei were stained with 1 μg/ml Hoechst 33342 (Sigma) in PBS for 5 min at room temperature followed by two 2 min PBS rinses. Coverslips were mounted with Prolong Diamond antifade mountant (Thermo Fisher Scientific) and allowed to dry overnight at room temperature. Fluorescent images were captured using a Leica DMI Sp8 confocal microscope.

Immunofluorescence was performed on 4 μm FFPE tissue sections prepared as described below for IHC. Antigen retrieval was performed for 20 min in a 100°C water bath in target retrieval buffer, pH 9 (Agilent Technologies). Slides were blocked for 1 h at room temperature in PBS containing 3% BSA and 5% goat serum. Slides were incubated with primary antibodies diluted in blocking buffer: anti‐CD31 (1:50; Agilent Technologies #M0823) and anti‐CD146 (1:600; Abcam #ab75769). Secondary antibody staining, nuclear counterstaining and microscopy were performed as described above.

Quantitative real‐time PCR analysis

RNA was extracted from sorted CAF cells using the Qiagen miRNeasy Mini Kit (Qiagen) and was reverse transcribed using the Transcriptor First Strand cDNA synthesis kit (Roche). TaqMan assays (Thermo Fisher Scientific) were used to analyse mRNA expression levels using a QuantStudio 7 Flex RT–PCR machine (Thermo Fisher Scientific). TaqMan probes used were FAP (Hs00990791_m1), ACTA2 (Hs00426835_g1), CXCL12 (Hs00171022_m1), EGFR (Hs01076078_M1), PDGFRA (Hs00998018_m1), PDGFRB (Hs01019589_m1) and ACTB (Hs99999903_M1). Relative gene expression was calculated using the ΔΔC t method.

Cell‐derived matrices

Cell‐derived matrices (CDMs) were established as previously described (Cukierman et al, 2001). A total of 1.5 × 105 cells/well were allowed to expand until confluent and ascorbic acid (50 mg/ml) was added to culture medium on days 1, 3 and 5. To maintain the structure interact of the matrix architecture, CDMs were imaged using second harmonic generation (SHG) at day 7 with cells still present in the matrix.

Second harmonic generation imaging

Second harmonic generation (SHG) imaging was achieved using an inverted Leica DMS 6000 SP8 confocal microscope with a Ti‐Sapphire femtosecond laser cavity (Coherent Chameleon Ultra II) excitation source, operating at 80 MHz and tuned to a wavelength of 880 nm, as previously described (Timpson et al, 2011; Conway et al, 2017; Vennin et al, 2017). SHG intensity was detected using a 440/20 nm RLD HyD detectors. For CDMs, three representative fields of view (512 μm × 512 μm) were imaged over a 3D z‐stack (80 μm with a 2.52 μm step size, and 30 μm with a 1.51 μm step size, respectively), with a line average of 4 at 25× magnification. Rotation images were acquired on the z‐level of maximum intensity with a line average of 64 at 63× magnification.

Collagen fibre orientation analysis

Collagen fibre orientation analysis in SHG images from CDMs was carried out as previously described (Mayorca‐Guiliani et al, 2017; Cazet et al, 2018). Briefly, the distribution of orientation of collagen within images was assessed based on methodology published by Rezakhaniha et al (2012). The local orientation and isotropic properties of individual pixels making up collagen fibres were derived from structure tensors evaluated by computing the continuous spatial derivatives in the x and y directions using a cubic B‐spline interpolation to obtain the local predominant orientation. Graphical outputs show a hue–saturation–brightness (HSB) colour‐coded map indicating the angles of the oriented structures within the image. Orientation distribution peaks were then aligned. The shape of the distribution indicates the degree of alignment within the image, where wide and broad shapes suggested little coherency in alignment, and tight peaks with small standard deviations implied aligned structures.

Immunohistochemistry and image alignment

In‐house FFPE blocks were made of patient tissues by fixing in 10% neutral buffered formalin for 24 h and processing for paraffin embedding. Where tissue was limited, diagnostic tumour FFPE blocks were accessed for analysis. FFPE blocks were sectioned at 4 μm. These were used for histological analysis, using a standard haematoxylin and eosin stain, and for immunohistochemical analysis on the Leica BOND RX Autostainer. Details of antibodies and staining conditions are described in Appendix Table S1. H&E and IHC slides were imaged using the Aperio CS2 Digital Pathology Slide Scanner. IHC images were imported into FIJI as a virtual stack. Each layer was then aligned using least squared mode (linear feature correspondences), propagating to the first and last layers for rigid transformation. All other parameters were set to default in FIJI.

Cell signalling predictions using ligand–receptor annotation

Genes from the scRNA‐Seq data were annotated based on a published set of human ligand–receptor pairs derived from supporting literature (Ramilowski et al, 2015). We used this knowledge to construct a cell‐to‐cell communication network between the four stromal clusters and other epithelial, immune and endothelial clusters. To investigate conserved signalling modules in TNBCs, we applied this to the cluster averaged expression levels of all ligands and receptors in the integrated dataset of five patients. The “interaction strength,” or the weight of edges between two clusters, was defined as the product of expression values from the ligand and its cognate receptor. All “interaction strengths” greater than an arbitrary cut‐off of 0.1 were considered as cell signalling candidates and kept for subsequent analyses (Dataset EV3). The total number of interaction pairs identified per cluster was used to generate summaries of this data (Fig 5A and B). The top 100 candidates between the four stromal subsets and each target population were clustered using hierarchical clustering (complete and Euclidean distance) and rescaled for visualisation in ggplot2. For the visualisation purposes only, the ligand and receptor expression values in Fig 5C–E were imputed using the MAGIC method to better represent the structure of genes with low expression and dropout (van Dijk et al, 2018). Raw count matrices and cluster IDs identified by Seurat (as previously described) were used as input to MAGIC and run with default parameters as recommended by the developers.

T‐cell dysfunction and exclusion analysis

To investigate the immunomodulatory roles of different stromal subsets, we performed T‐cell dysfunction and exclusion analysis using similar strategy from TIDE (Jiang et al, 2018). We first used the average expression level of CD8A, CD8B, GZMA, GZMB and PRF1 to estimate the cytotoxic T‐lymphocyte (CTL) level in each sample from the bulk sequencing cohort. Patients with a higher and lower CTL level compared to the mean CTL level within the cohort were stratified into high and low CTL groups, respectively. For CTL dysfunction analysis, TIDE evaluates whether gene signatures from each of the stromal subsets influences the beneficial effect of CTL levels on patient prognosis. This is performed using the interaction coefficient d from Cox proportional hazard (Cox‐PH) model to evaluate how the interaction between a candidate gene and the CTL affects the death hazard. Genes with a higher TIDE dysfunction score suggest antagonistic interactions with regard to CTL levels, where the survival benefit of patients with high CTL is lost, and thus suggesting an association with CTL dysfunction. This method was used to calculate the TIDE T‐cell dysfunction score from the differentially expressed genes across the four stromal subsets in TNBC patients from the METARBRIC cohort (Curtis et al, 2012) and two independent TBNC cohorts (Sabatier et al, 2011; Jezequel et al, 2015). A total of 233, 84 and 107 patients were evaluated for the METABRIC, GSE21653 and GSE58812 cohorts, respectively.

For T‐cell exclusion analysis, we examined Pearson correlations between all CTL levels (indicated on the y axis in Fig 6E) and the respective correlation score between the bulk tumour sample and single‐cell cluster of interest (indicated on the x axis in Fig 6E). Here, gene signatures for the single‐cell cluster of interest were defined by the averaged gene expression of all single cells in the cluster, divided over the averaged gene expression of all cells detected in the dataset. This method was used to define signatures in this section, as opposed to a DEG list in the previous CTL dysfunction analysis. This was first performed for all stromal cells, CD4+ and CD8+ T‐cell clusters divided over all detected cells independently, as shown in Fig 6D. We next repeated this for the myCAF, iCAF, dPVL and imPVL clusters divided over all stromal cells independently, as shown in Fig 6E. In each of the breast cancer cohorts, a higher correlation suggests a positive association between the single‐cell cluster of interest and CTL levels (as observed in CD4+ and CD8+ T cells shown in Fig 6D), while a negative correlation suggest a negative association (as observed in dPVL cells shown in Fig 6E). This correlation indicates a potential affluence of each stromal subset on T‐cell infiltration in tumours. For T‐cell exclusion analysis, we examined the three aforementioned TNBC cohorts, as well as the TNBC cohort from The Cancer Genome Atlas (https://www.cancer.gov/tcga) (Cancer Genome Atlas, 2012).

Statistical analysis

For the identification of significant differentially expressed genes in scRNA‐Seq, the MAST method was applied with a P‐value threshold of 1 × 10−5 and FDR threshold of 0.05 (Finak et al, 2015). Comparisons for enriched AUCell regulon scores from the SCENIC pipeline were made using a one‐way analysis of variance (ANOVA) with a P‐value threshold of 1 × 10−5 (Aibar et al, 2017; Moerman et al, 2018). Quantitative analysis of collagen abundance from SHG was determined using unpaired two‐tailed Student's t‐test with equal standard deviation after normalisation of the orientation peak distributions. P‐values were denoted by asterisks: *P < 0.05, **P < 0.01 and ***P < 0.001. For collagen orientation, statistical significant was determined using a Kruskal–Wallis test with Dunn's post hoc multiple comparisons test with a P‐value threshold of 1 × 10−4. For TIDE analysis of CTL dysfunction, P‐value significance was determined using the Cox proportional hazard (Cox‐PH) model and was assigned from the P(> |z|) value from the prognostic differences between CTL high and CTL low groups. For TIDE analysis of CTL exclusion, P‐values were derived from a two‐sided t‐test for each correlation. The Benjamini–Hochberg procedure was used for adjusting P‐values to correct for multiple testing. Statistical significance of CD8 IHC staining was determined using pairwise comparison of each patient against all others (i.e., base‐mean) using a Student's t‐test.

Author contributions

AS conceived the project and directed the study with input from all authors. SZW and AS wrote the manuscript with input from all authors. Clinical collaborators EL, SW, MNH, CC, CM, DS, ER, AP, JB, and LG organised the access to patient tissue. SZW, GA‐E, and KH optimised the tumour dissociation for scRNA‐Seq. C‐LC helped perform the next‐generation sequencing of the scRNA‐Seq libraries. SZW interpreted and performed the pre‐processing and downstream analysis of the scRNA‐Seq data. DLR supervised the scRNA‐Seq analysis. JRT helped with the inferCNV analysis. NB helped with the IHC image analysis and alignment. KH helped performed the IHC staining experiments. SZW, CEL, ASC, KH, and GA‐E helped perform flow sorting experiments. KJM and BP performed the collagen assays and imaging. TRC helped analyse the collagen fibre orientation data. HH performed the IF experiments. CW and XSL performed and helped interpret the T‐cell dysfunction and exclusion and patient survival analysis. SOT independently scored the H&E and IHC stains. AF and RH assisted in the analysis of the cell signalling data. TRC and PT provided intellectual input and helped with the interpretation of the ECM assay data. EL, SJ, and JP provided intellectual input.

Conflict of interest

The authors declare that they have no conflict of interest.

Supporting information

Appendix

Expanded View Figures PDF

Dataset EV1

Dataset EV2

Dataset EV3

Source Data Expanded View and Appendix

Review Process File

Source Data for Figure 4

Source Data for Figure 6

Acknowledgements

This work was supported by funding from John and Deborah McMurtrie, the National Breast Cancer Foundation (NBCF) of Australia, The Petre Foundation and The Sydney Breast Cancer Foundation. A.S. is the recipient of a Senior Research Fellowship from the National Health and Medical Research Council of Australia. S.Z.W. is supported by the Australian Government Research Training Program Scholarship. S.O.T. is supported by the NBCF (PRAC 16‐006), the IIRS 19 084 and the Sydney Breast Cancer Foundation and the Family and Friends of Michael O'Sullivan. T.R.C. is supported by an NHMRC RD Wright Biomedical Career Development Fellowship, a Susan G Komen Career Catalyst Award and a Cancer Institute NSW (CINSW) fellowship. S.J. is supported by a research fellowship from the NBCF. X.S.L. is a supported by the Breast Cancer Research Foundation (BCRF‐19‐100) and the National Institute of Health of the United States (R01CA234018). We would like to thanks the following people for their assistance in the experimental part of this manuscript; Ms. Gillian Lehrbach from the Garvan Tissue Culture Facility; Ms. Anaiis Zaratzian from the Garvan Histopathology Facility for tissue processing and IHC staining; The Garvan‐Weizmann Centre for Cellular Genomics, including Mr. Eric Lam, Ms. Hira Saeed and Ms. Melissa Armstrong for the expertise in flow sorting, and Mr. Dominik Kaczorowski for his help in next‐generation sequencing.

The EMBO Journal (2020) 39: e104063

See also: F Brod & R Fässler (October 2020)

Data availability

The scRNA‐Seq data from this study have been deposited in the European Nucleotide Archive (ENA) under the accession code PRJEB35405 (http://www.ebi.ac.uk/ena/data/view/PRJEB35405). This depository includes the demultiplexed paired ended reads (R1 and R2), Illumina indices and bam files processed using the Cellranger software. The scRNA‐Seq analysis scripts can be found on the website: https://github.com/sunnyzwu/stromal_subclasses. All relevant data are available from the authors upon request.

References

  1. Aibar S, Gonzalez‐Blas CB, Moerman T, Huynh‐Thu VA, Imrichova H, Hulselmans G, Rambow F, Marine JC, Geurts P, Aerts J et al (2017) SCENIC: single‐cell regulatory network inference and clustering. Nat Methods 14: 1083–1086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alarmo EL, Kuukasjarvi T, Karhu R, Kallioniemi A (2007) A comprehensive expression survey of bone morphogenetic proteins in breast cancer highlights the importance of BMP4 and BMP7. Breast Cancer Res Treat 103: 239–246 [DOI] [PubMed] [Google Scholar]
  3. Aran D, Hu Z, Butte AJ (2017) xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol 18: 220 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Banerjee S, Sengupta K, Dhar K, Mehta S, D'Amore PA, Dhar G, Banerjee SK (2006) Breast cancer cells secreted platelet‐derived growth factor‐induced motility of vascular smooth muscle cells is mediated through neuropilin‐1. Mol Carcinog 45: 871–880 [DOI] [PubMed] [Google Scholar]
  5. Bartoschek M, Oskolkov N, Bocci M, Lovrot J, Larsson C, Sommarin M, Madsen CD, Lindgren D, Pekar G, Karlsson G et al (2018) Spatially and functionally distinct subclasses of breast cancer‐associated fibroblasts revealed by single cell RNA sequencing. Nat Commun 9: 5150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Biffi G, Oni TE, Spielman B, Hao Y, Elyada E, Park Y, Preall J, Tuveson DA (2018) IL1‐induced JAK/STAT signaling is antagonized by TGFbeta to shape CAF heterogeneity in pancreatic ductal adenocarcinoma. Cancer Discov 9: 282–301 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Blackburn SD, Shin H, Haining WN, Zou T, Workman CJ, Polley A, Betts MR, Freeman GJ, Vignali DA, Wherry EJ (2009) Coregulation of CD8 + T cell exhaustion by multiple inhibitory receptors during chronic viral infection. Nat Immunol 10: 29–37 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Brechbuhl HM, Finlay‐Schultz J, Yamamoto TM, Gillen AE, Cittelly DM, Tan AC, Sams SB, Pillai MM, Elias AD, Robinson WA et al (2017) Fibroblast subtypes regulate responsiveness of luminal breast cancer to estrogen. Clin Cancer Res 23: 1710–1721 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Butler A, Hoffman P, Smibert P, Papalexi E, Satija R (2018) Integrating single‐cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol 36: 411–420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cancer Genome Atlas N (2012) Comprehensive molecular portraits of human breast tumours. Nature 490: 61–70 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cazet AS, Hui MN, Elsworth BL, Wu SZ, Roden D, Chan CL, Skhinas JN, Collot R, Yang J, Harvey K et al (2018) Targeting stromal remodeling and cancer stem cell plasticity overcomes chemoresistance in triple negative breast cancer. Nat Commun 9: 2897 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chang AL, Miska J, Wainwright DA, Dey M, Rivetta CV, Yu D, Kanojia D, Pituch KC, Qiao J, Pytel P et al (2016) CCL2 produced by the glioma microenvironment is essential for the recruitment of regulatory T cells and myeloid‐derived suppressor cells. Cancer Res 76: 5671–5682 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Conklin MW, Eickhoff JC, Riching KM, Pehlke CA, Eliceiri KW, Provenzano PP, Friedl A, Keely PJ (2011) Aligned collagen is a prognostic signature for survival in human breast carcinoma. Am J Pathol 178: 1221–1232 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Conway JRW, Vennin C, Cazet AS, Herrmann D, Murphy KJ, Warren SC, Wullkopf L, Boulghourjian A, Zaratzian A, Da Silva AM et al (2017) Three‐dimensional organotypic matrices from alternative collagen sources as pre‐clinical models for cell biology. Sci Rep 7: 16887 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Costa A, Kieffer Y, Scholer‐Dahirel A, Pelon F, Bourachot B, Cardon M, Sirven P, Magagna I, Fuhrmann L, Bernard C et al (2018) Fibroblast heterogeneity and immunosuppressive environment in human breast cancer. Cancer Cell 33: 463–479 [DOI] [PubMed] [Google Scholar]
  16. Covas DT, Panepucci RA, Fontes AM, Silva WA Jr, Orellana MD, Freitas MC, Neder L, Santos AR, Peres LC, Jamur MC et al (2008) Multipotent mesenchymal stromal cells obtained from diverse human tissues share functional properties and gene‐expression profile with CD146 + perivascular cells and fibroblasts. Exp Hematol 36: 642–654 [DOI] [PubMed] [Google Scholar]
  17. Creemers EE, Sutherland LB, McAnally J, Richardson JA, Olson EN (2006) Myocardin is a direct transcriptional target of Mef2, Tead and Foxo proteins during cardiovascular development. Development 133: 4245–4256 [DOI] [PubMed] [Google Scholar]
  18. Crisan M, Yap S, Casteilla L, Chen CW, Corselli M, Park TS, Andriolo G, Sun B, Zheng B, Zhang L et al (2008) A perivascular origin for mesenchymal stem cells in multiple human organs. Cell Stem Cell 3: 301–313 [DOI] [PubMed] [Google Scholar]
  19. Cukierman E, Pankov R, Stevens DR, Yamada KM (2001) Taking cell‐matrix adhesions to the third dimension. Science 294: 1708–1712 [DOI] [PubMed] [Google Scholar]
  20. Curtis C, Shah SP, Chin SF, Turashvili G, Rueda OM, Dunning MJ, Speed D, Lynch AG, Samarajiwa S, Yuan Y et al (2012) The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486: 346–352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. DeFilippis RA, Chang H, Dumont N, Rabban JT, Chen YY, Fontenay GV, Berman HK, Gauthier ML, Zhao J, Hu D et al (2012) CD36 repression activates a multicellular stromal program shared by high mammographic density and tumor tissues. Cancer Discov 2: 826–839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. van Dijk D, Sharma R, Nainys J, Yim K, Kathail P, Carr AJ, Burdziak C, Moon KR, Chaffer CL, Pattabiraman D et al (2018) Recovering gene interactions from single‐cell data using data diffusion. Cell 174: 716–729 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Elyada E, Bolisetty M, Laise P, Flynn WF, Courtois ET, Burkhart RA, Teinor JA, Belleau P, Biffi G, Lucito MS et al (2019) Cross‐species single‐cell analysis of pancreatic ductal adenocarcinoma reveals antigen‐presenting cancer‐associated fibroblasts. Cancer Discov 9: 1102–1123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fagiani E, Christofori G (2013) Angiopoietins in angiogenesis. Cancer Lett 328: 18–26 [DOI] [PubMed] [Google Scholar]
  25. Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, Slichter CK, Miller HW, McElrath MJ, Prlic M et al (2015) MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single‐cell RNA sequencing data. Genome Biol 16: 278 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Fu R, Han CF, Ni T, Di L, Liu LJ, Lv WC, Bi YR, Jiang N, He Y, Li HM et al (2019) A ZEB1/p53 signaling axis in stromal fibroblasts promotes mammary epithelial tumours. Nat Commun 10: 3210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Givel AM, Kieffer Y, Scholer‐Dahirel A, Sirven P, Cardon M, Pelon F, Magagna I, Gentric G, Costa A, Bonneau C et al (2018) miR200‐regulated CXCL12beta promotes fibroblast heterogeneity and immunosuppression in ovarian cancers. Nat Commun 9: 1056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Gordon JW, Pagiatakis C, Salma J, Du M, Andreucci JJ, Zhao J, Hou G, Perry RL, Dan Q, Courtman D et al (2009) Protein kinase A‐regulated assembly of a MEF2{middle dot}HDAC4 repressor complex controls c‐Jun expression in vascular smooth muscle cells. J Biol Chem 284: 19027–19042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hamzah J, Jugold M, Kiessling F, Rigby P, Manzur M, Marti HH, Rabie T, Kaden S, Grone HJ, Hammerling GJ et al (2008) Vascular normalization in Rgs5‐deficient tumours promotes immune destruction. Nature 453: 410–414 [DOI] [PubMed] [Google Scholar]
  30. Hosaka K, Yang Y, Seki T, Fischer C, Dubey O, Fredlund E, Hartman J, Religa P, Morikawa H, Ishii Y et al (2016) Pericyte‐fibroblast transition promotes tumor growth and metastasis. Proc Natl Acad Sci USA 113: E5618–E5627 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hrckulak D, Kolar M, Strnad H, Korinek V (2016) TCF/LEF transcription factors: an update from the internet resources. Cancers 8: 70 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Huo CW, Chew G, Hill P, Huang D, Ingman W, Hodson L, Brown KA, Magenau A, Allam AH, McGhee E et al (2015) High mammographic density is associated with an increase in stromal collagen and immune cells within the mammary epithelium. Breast Cancer Res 17: 79 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Hyun KA, Koo GB, Han H, Sohn J, Choi W, Kim SI, Jung HI, Kim YS (2016) Epithelial‐to‐mesenchymal transition leads to loss of EpCAM and different physical properties in circulating tumor cells from metastatic breast cancer. Oncotarget 7: 24677–24687 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Jezequel P, Loussouarn D, Guerin‐Charbonnel C, Campion L, Vanier A, Gouraud W, Lasla H, Guette C, Valo I, Verriele V et al (2015) Gene‐expression molecular subtyping of triple‐negative breast cancer tumours: importance of immune response. Breast Cancer Res 17: 43 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Jiang Y, Li Y, Zhu B (2015) T‐cell exhaustion in the tumor microenvironment. Cell Death Dis 6: e1792 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Jiang P, Gu S, Pan D, Fu J, Sahu A, Hu X, Li Z, Traugh N, Bu X, Li B et al (2018) Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat Med 24: 1550–1558 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Johansson‐Percival A, Li ZJ, Lakhiani DD, He B, Wang X, Hamzah J, Ganss R (2015) Intratumoral LIGHT restores pericyte contractile properties and vessel integrity. Cell Rep 13: 2687–2698 [DOI] [PubMed] [Google Scholar]
  38. Kalluri R (2016) The biology and function of fibroblasts in cancer. Nat Rev Cancer 16: 582–598 [DOI] [PubMed] [Google Scholar]
  39. Kawase A, Ishii G, Nagai K, Ito T, Nagano T, Murata Y, Hishida T, Nishimura M, Yoshida J, Suzuki K et al (2008) Podoplanin expression by cancer associated fibroblasts predicts poor prognosis of lung adenocarcinoma. Int J Cancer 123: 1053–1059 [DOI] [PubMed] [Google Scholar]
  40. Kisselbach L, Merges M, Bossie A, Boyd A (2009) CD90 expression on human primary cells and elimination of contaminating fibroblasts from cell cultures. Cytotechnology 59: 31–44 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kuang W, Deng Q, Deng C, Li W, Shu S, Zhou M (2017) Hepatocyte growth factor induces breast cancer cell invasion via the PI3K/Akt and p38 MAPK signaling pathways to up‐regulate the expression of COX2. Am J Transl Res 9: 3816–3826 [PMC free article] [PubMed] [Google Scholar]
  42. Lambrechts D, Wauters E, Boeckx B, Aibar S, Nittner D, Burton O, Bassez A, Decaluwe H, Pircher A, Van den Eynde K et al (2018) Phenotype molding of stromal cells in the lung tumor microenvironment. Nat Med 24: 1277–1289 [DOI] [PubMed] [Google Scholar]
  43. Lee HO, Mullins SR, Franco‐Barraza J, Valianou M, Cukierman E, Cheng JD (2011) FAP‐overexpressing fibroblasts produce an extracellular matrix that enhances invasive velocity and directionality of pancreatic cancer cells. BMC Cancer 11: 245 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Lee HJ, Kao CY, Lin SC, Xu M, Xie X, Tsai SY, Tsai MJ (2017) Dysregulation of nuclear receptor COUP‐TFII impairs skeletal muscle development. Sci Rep 7: 3136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Levental KR, Yu H, Kass L, Lakins JN, Egeblad M, Erler JT, Fong SF, Csiszar K, Giaccia A, Weninger W et al (2009) Matrix crosslinking forces tumor progression by enhancing integrin signaling. Cell 139: 891–906 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Li Q, Yu Y, Bischoff J, Mulliken JB, Olsen BR (2003) Differential expression of CD146 in tissues and endothelial cells derived from infantile haemangioma and normal human skin. J Pathol 201: 296–302 [DOI] [PubMed] [Google Scholar]
  47. Li T, Sun L, Miller N, Nicklee T, Woo J, Hulse‐Smith L, Tsao MS, Khokha R, Martin L, Boyd N (2005) The association of measured breast tissue characteristics with mammographic density and other risk factors for breast cancer. Cancer Epidemiol Biomarkers Prev 14: 343–349 [DOI] [PubMed] [Google Scholar]
  48. Li Z, Pang Y, Gara SK, Achyut BR, Heger C, Goldsmith PK, Lonning S, Yang L (2012) Gr‐1 + CD11b+ cells are responsible for tumor promoting effect of TGF‐beta in breast cancer progression. Int J Cancer 131: 2584–2595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Loi S, Sirtaine N, Piette F, Salgado R, Viale G, Van Eenoo F, Rouas G, Francis P, Crown JP, Hitre E et al (2013) Prognostic and predictive value of tumor‐infiltrating lymphocytes in a phase III randomized adjuvant breast cancer trial in node‐positive breast cancer comparing the addition of docetaxel to doxorubicin with doxorubicin‐based chemotherapy: BIG 02‐98. J Clin Oncol 31: 860–867 [DOI] [PubMed] [Google Scholar]
  50. Louis SF, Zahradka P (2010) Vascular smooth muscle cell motility: from migration to invasion. Exp Clin Cardiol 15: e75–e85 [PMC free article] [PubMed] [Google Scholar]
  51. Lun A, Riesenfeld S, Andrews T, Dao TP, Gomes T, Marioni J (2019) EmptyDrops: distinguishing cells from empty droplets in droplet‐based single‐cell RNA sequencing data. Genome Biol 20: 63 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Markiewski MM, DeAngelis RA, Benencia F, Ricklin‐Lichtsteiner SK, Koutoulaki A, Gerard C, Coukos G, Lambris JD (2008) Modulation of the antitumor immune response by complement. Nat Immunol 9: 1225–1235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Mayorca‐Guiliani AE, Madsen CD, Cox TR, Horton ER, Venning FA, Erler JT (2017) ISDoT: in situ decellularization of tissues for high‐resolution imaging and proteomic analysis of native extracellular matrix. Nat Med 23: 890–898 [DOI] [PubMed] [Google Scholar]
  54. McCormack VA, dos Santos Silva I (2006) Breast density and parenchymal patterns as markers of breast cancer risk: a meta‐analysis. Cancer Epidemiol Biomarkers Prev 15: 1159–1169 [DOI] [PubMed] [Google Scholar]
  55. Middleton J, Americh L, Gayon R, Julien D, Mansat M, Mansat P, Anract P, Cantagrel A, Cattan P, Reimund JM et al (2005) A comparative study of endothelial cell markers expressed in chronically inflamed human tissues: MECA‐79, Duffy antigen receptor for chemokines, von Willebrand factor, CD31, CD34, CD105 and CD146. J Pathol 206: 260–268 [DOI] [PubMed] [Google Scholar]
  56. Moerman T, Aibar S, Gonzalez‐Blas CB, Simm J, Moreau Y, Aerts J, Aerts S (2018) GRNBoost2 and Arboreto: efficient and scalable inference of gene regulatory networks. Bioinformatics 35: 2159–2161 [DOI] [PubMed] [Google Scholar]
  57. Neri S, Ishii G, Hashimoto H, Kuwata T, Nagai K, Date H, Ochiai A (2015) Podoplanin‐expressing cancer‐associated fibroblasts lead and enhance the local invasion of cancer cells in lung adenocarcinoma. Int J Cancer 137: 784–796 [DOI] [PubMed] [Google Scholar]
  58. Nguyen EV, Pereira BA, Lawrence MG, Ma X, Rebello RJ, Chan H, Niranjan B, Wu Y, Ellem S, Guan X et al (2019) Proteomic profiling of human prostate cancer‐associated fibroblasts (CAF) reveals LOXL2‐dependent regulation of the tumor microenvironment. Mol Cell Proteomics 18: 1410–1427 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. O'Flanagan CH, Campbell KR, Zhang AW, Kabeer F, Lim JLP, Biele J, Eirew P, Lai D, McPherson A, Kong E et al (2019) Dissociation of solid tumor tissues with cold active protease for single‐cell RNA‐seq minimizes conserved collagenase‐associated stress responses. Genome Biol 20: 210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Ohlund D, Handly‐Santana A, Biffi G, Elyada E, Almeida AS, Ponz‐Sarvise M, Corbo V, Oni TE, Hearn SA, Lee EJ et al (2017) Distinct populations of inflammatory fibroblasts and myofibroblasts in pancreatic cancer. J Exp Med 214: 579–596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Osterreicher CH, Penz‐Osterreicher M, Grivennikov SI, Guma M, Koltsova EK, Datz C, Sasik R, Hardiman G, Karin M, Brenner DA (2011) Fibroblast‐specific protein 1 identifies an inflammatory subpopulation of macrophages in the liver. Proc Natl Acad Sci USA 108: 308–313 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. de Ostrovich KK, Lambertz I, Colby JK, Tian J, Rundhaug JE, Johnston D, Conti CJ, DiGiovanni J, Fuchs‐Young R (2008) Paracrine overexpression of insulin‐like growth factor‐1 enhances mammary tumorigenesis in vivo . Am J Pathol 173: 824–834 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Ozdemir BC, Pentcheva‐Hoang T, Carstens JL, Zheng X, Wu CC, Simpson TR, Laklai H, Sugimoto H, Kahlert C, Novitskiy SV et al (2015) Depletion of carcinoma‐associated fibroblasts and fibrosis induces immunosuppression and accelerates pancreas cancer with reduced survival. Cancer Cell 28: 831–833 [DOI] [PubMed] [Google Scholar]
  64. Palmieri C, Roberts‐Clark D, Assadi‐Sabet A, Coope RC, O'Hare M, Sunters A, Hanby A, Slade MJ, Gomm JJ, Lam EW et al (2003) Fibroblast growth factor 7, secreted by breast fibroblasts, is an interleukin‐1beta‐induced paracrine growth factor for human breast cells. J Endocrinol 177: 65–81 [DOI] [PubMed] [Google Scholar]
  65. Patel AP, Tirosh I, Trombetta JJ, Shalek AK, Gillespie SM, Wakimoto H, Cahill DP, Nahed BV, Curry WT, Martuza RL et al (2014) Single‐cell RNA‐seq highlights intratumoral heterogeneity in primary glioblastoma. Science 344: 1396–1401 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Pidsley R, Lawrence MG, Zotenko E, Niranjan B, Statham A, Song J, Chabanon RM, Qu W, Wang H, Richards M et al (2018) Enduring epigenetic landmarks define the cancer microenvironment. Genome Res 28: 625–638 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Prater MD, Petit V, Alasdair Russell I, Giraddi RR, Shehata M, Menon S, Schulte R, Kalajzic I, Rath N, Olson MF et al (2014) Mammary stem cells have myoepithelial cell properties. Nat Cell Biol 16: 942–950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Puram SV, Tirosh I, Parikh AS, Patel AP, Yizhak K, Gillespie S, Rodman C, Luo CL, Mroz EA, Emerick KS et al (2017) Single‐cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell 171: 1611–1624 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Qiu X, Hill A, Packer J, Lin D, Ma YA, Trapnell C (2017) Single‐cell mRNA quantification and differential analysis with Census. Nat Methods 14: 309–315 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Ramilowski JA, Goldberg T, Harshbarger J, Kloppmann E, Lizio M, Satagopam VP, Itoh M, Kawaji H, Carninci P, Rost B et al (2015) A draft network of ligand‐receptor‐mediated multicellular signalling in human. Nat Commun 6: 7866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Rezakhaniha R, Agianniotis A, Schrauwen JT, Griffa A, Sage D, Bouten CV, van de Vosse FN, Unser M, Stergiopulos N (2012) Experimental investigation of collagen waviness and orientation in the arterial adventitia using confocal laser scanning microscopy. Biomech Model Mechanobiol 11: 461–473 [DOI] [PubMed] [Google Scholar]
  72. Ronnov‐Jessen L, Petersen OW, Koteliansky VE, Bissell MJ (1995) The origin of the myofibroblasts in breast cancer. Recapitulation of tumor environment in culture unravels diversity and implicates converted fibroblasts and recruited smooth muscle cells. J Clin Invest 95: 859–873 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Sabatier R, Finetti P, Cervera N, Lambaudie E, Esterni B, Mamessier E, Tallet A, Chabannon C, Extra JM, Jacquemier J et al (2011) A gene expression signature identifies two prognostic subgroups of basal breast cancer. Breast Cancer Res Treat 126: 407–420 [DOI] [PubMed] [Google Scholar]
  74. Salgado R, Denkert C, Demaria S, Sirtaine N, Klauschen F, Pruneri G, Wienert S, Van den Eynden G, Baehner FL, Penault‐Llorca F et al (2015) The evaluation of tumor‐infiltrating lymphocytes (TILs) in breast cancer: recommendations by an International TILs Working Group 2014. Ann Oncol 26: 259–271 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Samani AA, Yakar S, LeRoith D, Brodt P (2007) The role of the IGF system in cancer growth and metastasis: overview and recent insights. Endocr Rev 28: 20–47 [DOI] [PubMed] [Google Scholar]
  76. Satija R, Farrell JA, Gennert D, Schier AF, Regev A (2015) Spatial reconstruction of single‐cell gene expression data. Nat Biotechnol 33: 495–502 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Sautes‐Fridman C, Lawand M, Giraldo NA, Kaplon H, Germain C, Fridman WH, Dieu‐Nosjean MC (2016) Tertiary lymphoid structures in cancers: prognostic value, regulation, and manipulation for therapeutic intervention. Front Immunol 7: 407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Shao X, Wei X (2018) FOXP1 enhances fibrosis via activating Wnt/beta‐catenin signaling pathway in endometriosis. Am J Transl Res 10: 3610–3618 [PMC free article] [PubMed] [Google Scholar]
  79. Shields JD, Kourtis IC, Tomei AA, Roberts JM, Swartz MA (2010) Induction of lymphoidlike stroma and immune escape by tumors that express the chemokine CCL21. Science 328: 749–752 [DOI] [PubMed] [Google Scholar]
  80. Sinn M, Denkert C, Striefler JK, Pelzer U, Stieler JM, Bahra M, Lohneis P, Dorken B, Oettle H, Riess H et al (2014) alpha‐Smooth muscle actin expression and desmoplastic stromal reaction in pancreatic cancer: results from the CONKO‐001 study. Br J Cancer 111: 1917–1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Song S, Ewald AJ, Stallcup W, Werb Z, Bergers G (2005) PDGFRbeta+ perivascular progenitor cells in tumours regulate pericyte differentiation and vascular survival. Nat Cell Biol 7: 870–879 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Su S, Chen J, Yao H, Liu J, Yu S, Lao L, Wang M, Luo M, Xing Y, Chen F et al (2018) CD10(+)GPR77(+) cancer‐associated fibroblasts promote cancer formation and chemoresistance by sustaining cancer stemness. Cell 172: 841–856 [DOI] [PubMed] [Google Scholar]
  83. Tamama K, Barbeau DJ (2012) Early growth response genes signaling supports strong paracrine capability of mesenchymal stem cells. Stem Cells Int 2012: 428403 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Tian L, Goldstein A, Wang H, Ching Lo H, Sun Kim I, Welte T, Sheng K, Dobrolecki LE, Zhang X, Putluri N et al (2017) Mutual regulation of tumour vessel normalization and immunostimulatory reprogramming. Nature 544: 250–254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Timpson P, McGhee EJ, Erami Z, Nobis M, Quinn JA, Edward M, Anderson KI (2011) Organotypic collagen I assay: a malleable platform to assess cell behaviour in a 3‐dimensional context. J Vis Exp 56: e3089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Vennin C, Chin VT, Warren SC, Lucas MC, Herrmann D, Magenau A, Melenec P, Walters SN, Del Monte‐Nieto G, Conway JR et al (2017) Transient tissue priming via ROCK inhibition uncouples pancreatic cancer progression, sensitivity to chemotherapy, and metastasis. Sci Transl Med 9: eaai8504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Wang FT, Sun W, Zhang JT, Fan YZ (2019) Cancer‐associated fibroblast regulation of tumor neo‐angiogenesis as a therapeutic target in cancer. Oncol Lett 17: 3055–3065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Wu J, Bohanan CS, Neumann JC, Lingrel JB (2008) KLF2 transcription factor modulates blood vessel maturation through smooth muscle cell migration. J Biol Chem 283: 3942–3950 [DOI] [PubMed] [Google Scholar]
  89. Yamashita M, Ogawa T, Zhang X, Hanamura N, Kashikura Y, Takamura M, Yoneda M, Shiraishi T (2012) Role of stromal myofibroblasts in invasive breast cancer: stromal expression of alpha‐smooth muscle actin correlates with worse clinical outcome. Breast Cancer 19: 170–176 [DOI] [PubMed] [Google Scholar]
  90. Yu G, Wang LG, Han Y, He QY (2012) clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16: 284–287 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix

Expanded View Figures PDF

Dataset EV1

Dataset EV2

Dataset EV3

Source Data Expanded View and Appendix

Review Process File

Source Data for Figure 4

Source Data for Figure 6

Data Availability Statement

The scRNA‐Seq data from this study have been deposited in the European Nucleotide Archive (ENA) under the accession code PRJEB35405 (http://www.ebi.ac.uk/ena/data/view/PRJEB35405). This depository includes the demultiplexed paired ended reads (R1 and R2), Illumina indices and bam files processed using the Cellranger software. The scRNA‐Seq analysis scripts can be found on the website: https://github.com/sunnyzwu/stromal_subclasses. All relevant data are available from the authors upon request.


Articles from The EMBO Journal are provided here courtesy of Nature Publishing Group

RESOURCES