Abstract
Understanding the complexity and heterogeneity of mammary cell subpopulations is vital to delineate the mechanisms behind breast cancer development, progression and prevention. Increasingly sophisticated tools for investigating these cell subtypes has led to the development of a greater understanding of these cell subtypes, complex interplay of certain subtypes and their developmental potential. Of note, increasing accessibility and affordability of single cell technologies has led to a plethora of studies being published containing data from mammary cell subtypes and their differentiation potential in both mice and human data sets. Here, we review the different types of single cell technologies and how they have been used to improve our understanding of mammary gland development.
Keywords: scRNA-seq, Single cell technologies, Mammary gland, scATAC-seq, mass cytometry, scDNA-seq, Breast cancer
1. Introduction
Whilst the key motivating factor to study the development and function of the mammary gland is to better understand the tissue of origin for the deadliest cancer in females worldwide [1], under normal homoeostatic conditions it is the only secretory organ to mature in adulthood. Fundamentally, the organ is a bilayer branched ductal tree tipped by alveoli consisting of inner luminal cells and outer contractile myoepithelial basal cells which interface directly with surrounding supportive stromal cells. The main stages of mammary gland development occur during embryogenesis, puberty and in the adult in the case of pregnancy, lactation and subsequent involution [2]. Dramatic restructuring of the mammary gland occurs at each of these developmental stages, where only during lactation is the mammary gland considered functional and able to fulfil its purpose of milk production. Another source of dramatic reconstruction occurs in the case of breast cancer.
Historically, breast cancer patients were typically stratified for treatment based on the clinical parameters of age, node status, tumour stage and histological grade together with the presence or absence of key hormone receptors for oestrogen (ER), progesterone (PR) and human epidermal growth factor receptor 2 (HER2) [3]. However, since then, molecular portraits of human breast cancers have been identified [4] and have been developed to separate tumours into six major subtypes: Luminal subtype A, Luminal subtype B, Luminal subtype C, ERBB2+, basal-like and normal-breast like [5], [6]. Due to the clear differences between these cell types, questions subsequently arose as to whether these diverse breast cancer phenotypes arose from the same “cell of origin” or from different subpopulations.
The molecular profile of these tumour types were compared to normal human mammary subpopulations which can be enriched for using fluorescence activated cell sorting (FACS) and the markers integrin alpha-6 (CD49f) and epithelial cell adhesion marker (EpCAM) [7]. Following stromal depletion of CD45+ hematopoietic and CD31+ endothelial cells, enrichment and transcriptomic profiling of basal/mammary stem cells (EpCAM-/CD49f+), mature luminal cells (EpCAM+/CD49f-) and luminal progenitor cells (EpCAM+/CD49f+) revealed that each normal cell type resembled different breast cancer subtypes [7]. Despite these findings and the fact that the molecular subtypes can be used to help predict patient outcomes, we still cannot make accurate predictions as to patients’ response to treatment and indeed still do not know conclusively from which mammary cell subpopulation breast cancer is derived [8]. The major hindrance to answering these questions is that studying the human mammary gland is a highly static ex vivo process, where only opportunistic samples from donated tissue can be studied. This means that in any given study, it is only possible to determine cell types and possible cellular hierarchies from the single developmental time point at which the genetically unique individual donates their tissue.
Examining the murine mammary gland offers the possibility to not only examine different stages of development at selected time points in genetically identical animals but also provides the opportunity to conduct extensive in vivo studies. Previous mammary repopulation studies, which involve the transplantation of one or more cells into a cleared murine mammary fat pad, identified a bipotent stem cell within the basal compartment of the mammary gland which was able to generate a fully functional mammary gland [9], [10], [11]. However, due to the nature of this experiment that involves radical changes in the microenvironment of the cell, it has been suggested that this bipotent mammary cell plasticity may be an induced effect that would not necessarily exist in normal biology [12]. Lineage tracing on the other hand allows for the tracking of individual cells and their progeny in vivo, by inducing cells expressing a specific marker to concurrently express a fluorescent and/or colorimetric protein that can be easily identified using a fluorescence microscope [13]. Whilst this technique would seemingly shed light on whether bipotent or unipotent lineage-restricted progenitors drive glandular tissue expansion, this technique is not without its flaws. Many of the lineage tracing studies conducted in the mammary gland have produced contradictory findings where some suggest the presence of multipotent progenitors [14], [15], [16], [17], whilst others find only unipotent progenitors contributing to mammary gland development postpartum [18], [19], [20], [21], [22], [23], [24], [25]. Discrepancies in these studies may have arisen from a number of factors that could be attributed to the lineage tracing technique or non-specific lineage marker selection. The result of these studies is that the evidence suggesting the presence of mammary multipotent stem cells postpartum remains inconclusive and better tools to identify different cell populations in both the mouse and the human are required to answer this question. With the rise of single cell technologies, we are able to discern cell subpopulations and state from different tissues on an unprecedented level.
2. Single cell technologies
Technology to molecularly characterise single cells on a protein, transcriptomic, genomic or epigenomic level has rapidly developed over the past decade and allows for a better understanding of cell subpopulations and differentiation potential. With the development of single cell RNA-sequencing (scRNA-seq) and single cell ATAC-seq (scATAC-seq), cells dissociated from healthy or diseased tissue can be grouped by similar transcriptomes or epigenomes to determine cell subpopulations and maturation states in an unbiased fashion. On the other hand, single cell technologies such as single cell DNA-sequencing (scDNA-seq) can be used to determine copy number variations which provide important insights into cancer cell heterogeneity and tracking of tumour cell evolution [26].
2.1. Single cell DNA-sequencing
Clonal diversity is an important feature of human tumours and can be explored through the use of scDNA-sequencing. After initial isolation of single nuclei from single cells, whole genome amplification is conducted before next generation sequencing is used to determine genome wide copy number profiles of single cells. Initial methodologies allowed for ~10% physical coverage of a single cell genome allowing for identification of copy number aberrations alone [27]. Methodologies such as BGI [28], nuc-seq [29], and SNES [30] that use a Phi29 enzyme to perform multiple-displacement amplifications, generated a greater than 90% coverage of the single cell genome allowing for mutations at a base pair level to be detected [31]. Another early amplification-based scDNA-seq technique, multiple annealing and looping based amplification cycles (MALBAC), also allows for the identification of single-nucleotide variations (SNVs) due to the high genome coverage (93%) offered by this method [32]. More recent transposition-based methodologies direct library preparation (DLP) [33] and DLP+ [34], do not require preamplification of DNA and therefore overcome disadvantages of coverage and polymerase bias. Careful extraction of single cells using laser-capture microdissection and catapulting, combined with adaption of highly multiplexed single nucleus sequencing (HM-SNS) [35] and tissue section image analysis, allows for topographic single cell sequencing (TSCS). TSCS allows for the capture of the genomic profile of single cells whilst preserving their cellular position and morphology within the tissue of interest [36]. This technique, along with the others described, are integral to better understanding tumour cell copy number mutations and breast cancer development, progression and evolution. However, scDNA-seq analysis alone cannot provide information on cell state or type and hence other single techniques are required to comprehensively understand mammary gland development and pathogenesis.
2.2. Flow and mass cytometry
Flow cytometry is one of the most widely used techniques of single cell analysis, using fluorescently labelled antibodies to conduct multiparameter profiling of individual cells. This technique uses polychromatic lasers and filters to detect cell protein expression profiles and is generally limited to 10 simultaneous measurements that can be made at once [37]. Building on similar concepts of flow cytometry, mass cytometry was developed to measure cells that have been labelled with isotope conjugated antibodies which can then be examined with mass spectrometry [38]. Each labelled cell is sprayed in a single droplet into inductively coupled argon plasma which vaporises the cell. During this process the cell’s atomic constituents are ionised and the resulting elemental ions can be sampled using time-of-flight (TOF) mass spectrometry and quantified. The result is that the cell is permanently destroyed, however this technique allows for characterisation of a large number of proteins; where up to 100 parameters per cell can be measured [37], [38], [39]. Whilst this technique is well suited to cost-effective high throughput use to investigate rare cell subpopulations, it relies on a priori knowledge of cell markers, limiting its use for novel cell type discoveries.
2.3. Single cell RNA-sequencing
Another single cell tool that has been widely adopted is scRNA-sequencing which has been predominantly developed to discover and unbiasedly characterise cell subpopulations from a large number of tissue dissociated cells. Since the development of the first scRNA-seq protocol in 2009 [40], many different protocols have been published, where major differences between them include: how the cells are separated and whether mRNA transcripts are amplified to generate full length cDNA or partial coverage. Many of these techniques, along with bioinformatic tools required to analyse them, have been previously expertly reviewed [41], [42], [43], [44]. Hence, for the purposes of this review, we will focus on the major techniques that have been adopted in the field of mammary gland biology. These techniques include SMART-Seq 2 and the Fluidigm C1 platform as well as Drop-seq utilised by the Chromium platform from 10x Genomics.
Switching mechanism at the 5’ end of the RNA template (SMART-Seq) was first developed in 2012 [45] and subsequent chemical refinements of the system led to the development of SMART-Seq2 [46], which has an increased yield and length of cDNA libraries generated for each individual cell. Cells tagged with multi fluorophore antibodies are sorted using a FACS machine into individual wells of a 96- or 384-well plate, allowing for optional index sorting of the cells. Confirmation of single cells per individual well can be subsequently conducted with a microscope before proceeding with cell lysis and downstream library preparation. cDNA is then generated from the full length of the mRNA using specially designed primers which also add sequences including a cell barcode and Illumina sequence, which allow for subsequent amplifications and analysis including multiplexing of up to 96 samples [47]. Library preparation is then conducted before pooling samples. Generation of full-length cDNA provides good read coverage across the entire transcript and allows for detection of gene isoforms or allele-specific expression by determining single nucleotide polymorphisms (SNPs). SMART-Seq 2 provides a cost-effective method to analyse hundreds of cells with high transcript coverage, however it is a laborious process involving many pipetting steps which limits the number of cells that can be easily sequenced. SMART-Seq technology has been incorporated into an automated microfluidic system, Fluidigm C1, which allows for the capture, lysis and reverse transcription of up to 96 cells on an integrated fluidic chip. As with the manual method of SMART-Seq, one can verify the capture of a single cell on the chip using a microscope. However, in this case the user is restricted to only preparing 96 cells per cartridge on the machine, limiting the number of cells that can be analysed. This problem has been somewhat overcome by the introduction of C1 mRNA Seq HT allowing for up to 800 cells to be captured in a single run. This method, however, compromises on read coverage across the entire script due to utilising 3’ end counting mRNA sequencing [41].
The technique of droplet barcoding (or Drop-seq) revolutionised the field of scRNA-seq and allowed this technology to become a more accessible tool for molecular biology. The first protocols captured single cells in nanolitre droplets together with DNA-barcoded beads using a microfluidic system [48], [49]. Within these droplets, cells can be lysed, barcoded and cDNA generated. Subsequently, the cDNA can then be pooled for PCR amplification and sheared to allow for short-read sequencing permitting partial coverage of cDNA sequences that can be aligned to infer gene expression profiles. After development of this technique, a commercially available version was provided by 10x Genomics called the Chromium controller platform. This platform allows for the capture and analysis of thousands of cells on a single cell level from 8 different samples [50] and is a cost-effective technique when calculated per cell compared to many other scRNA-seq techniques. Evidently, it is not possible to check the morphology of the cells prior to sequencing them, nor verify single cell rather than doublet capture, although this has been optimised previously [50]. With the development of cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq), detection of multiplexed protein markers can be done in conjunction with unbiased transcriptome profiling of the cells using this platform among others [51].
2.4. Single-cell ATAC-Seq
Whilst scRNA-sequencing provides in depth information of cell subpopulation gene expression profiles at a snapshot in time, only through examining gene promoter and enhancer chromatin accessibility profiles can we understand a cell’s potential for different cell states. Many of the techniques of single cell capture including array-based technologies, droplet microfluidics and combinatorial indexing through split pooling have allowed for the development of multiple single-cell Assay for Transposase Accessible Chromatin using sequencing (scATAC-seq) protocols [52]. The first method of scATAC-seq involved microfluidic capture of single cells using the Fluidigm system, and subsequent transposition using Tn5 transposases to tag regulatory regions by inserting sequencing adaptors into accessible regions of the genome [53]. Combinatorial cellular indexing through split pooling is a technique involving two rounds of nuclei barcoding, which allows for scATAC-seq analysis to be conducted on pooled single cells, without the need to first physically separate them [54]. An alternative method of scATAC-seq, which allows for the profiling of an even greater number of cells has also been developed by 10x Genomics [55]. Nuclei of cells are isolated from a bulk single cell suspension and transposed before being loaded onto the microfluidic chip. Within the chip, gel beads in emulsion (GEMs) are generated and contain a single nucleus and a bead. The bead contains: the reagents to lyse the nuclei, oligonucleotide sequencing adaptor and a priming sequence which during the linear amplification reaction incorporates a unique barcode into the transposed DNA [55]. Barcoded DNA can then be amplified before sequencing. A major disadvantage of scATAC-seq is inherent sparsity of the generated data, which arises due to the fact that diploid cells have low copy numbers. Despite the powerful computational tools developed to analyse such sparse data, it is thought that only 1–10% of the total accessible peaks are detected [52], where much of the scATAC-seq analysis heavily relies on single cell transcriptomic data to infer cell types.
2.5. Limitations of single cell tools
Evidently for each of the above single cell analysis techniques, it is possible to gain invaluable insights into cellular behaviour or differentiation potential, however no one technique is omnipotent. Whilst scDNA-seq provides information about mutational states of cells through copy number variations or SNVs, it may be limited in its coverage and does not provide any information on cell type, function or state. On the other hand, techniques that aim to characterise immediate cell subpopulations either by examining the direct transcriptional profiles (scRNA-seq) or gene expression products (flow/mass cytometry), are not always in agreement due to post-transcriptional modifications. Mass cytometry allows for identification of lowly expressed proteins which otherwise might be undetected in scRNA-seq data (particularly droplet-based platforms) due to dropout of low-level transcripts. Unlike scRNA-seq which unbiasedly classifies cells based on their whole measured transcriptional profile, mass cytometry relies on appropriate marker selection and antibody availability, without which cells might be misidentified. As mentioned above, only a fraction of the total chromatin accessible sites can be extracted using scATAC-seq, where identification of different cell subpopulations relies on mapping this limited data onto available transcriptomic information. To overcome the limitations of these different single cell techniques, multi-omic approaches aiming to combine different single cell technologies (such as CITE-seq) are emerging and have recently been expertly reviewed [56]. Along with emerging techniques to collect data from different single cell profiling methods, it is important that methods to analyse the increasing amounts of data involving machine learning and novel algorithms continue to evolve.
2.6. Single cell analysis tools
Development of machine learning bioinformatic tools to analyse single cell data is as important as improving the techniques themselves to advance our understanding of tissue subpopulations and differentiation trajectories. As demonstrated above each single cell technology is unique and often requires its own analysis pipeline and tools. Many of these have been previously expertly reviewed for analysis of: scDNA-seq [57], mass cytometry [58], [59], scRNA-seq [60], [61], [62] and scATAC-seq [52] data. Visualisation of single cell datasets has been improved as dimension reduction tools have evolved from: principle component analysis to t-distributed stochastic neighbour embedding (tSNE) [63] to uniform manifold approximation and projection for dimension reduction (UMAP) [64] to diffusion map graphical representation [65]. Tools such as diffusion pseudotime analysis use input from different single cell techniques and construct a temporal order of differentiating cells by measuring transitions between cells using diffusion-like random walks [66] and allow for cellular hierarchies to be inferred. Other interesting tools for scRNA-seq data include: scGen which uses machine learning to model perturbation and infection response of cells across cell types, studies and species [67] and Markov affinity-based graph imputation of cells (MAGIC) which determines gene interactions [68]. On the other hand, ChromVar uses scATAC-seq data to predict transcription factor associated accessibility and allow clustering of epigenomically profiled cells [69], whereas more recent tools such as Model-based AnalysEs of Transcriptome and RegulOme (MAESTRO) directly integrate data from both scRNA-seq and scATAC-seq experiments [70]. These are a few of the hundreds of tools available that can be used to aid in the biological interpretation of the big data generated by single cell analysis.
3. Using single cell technologies to understand mammary gland biology
Since the publication of the first single cell manuscripts characterising the postnatal mammary gland [71], [72], an increasing number of studies (Table 1) have focused on comparing mammary cells across different stages of normal development (Fig. 1). The focus of these studies is to comprehensively characterise mammary cell subpopulations and differentiation trajectories using mass cytometry, scRNA-seq and scATAC-seq, to answer key question such as whether a rare population of bipotent progenitors exist past embryonic development.
Table 1.
Reference | Lab | Species | Tissue type (normal/ cancer/ both) | Sample time points | Mammary fraction (epithelial/stroma-immune/both) | Preparation of cells prior to single cell sequencing | Technique + technology | Number of cells/nuclei profiled |
---|---|---|---|---|---|---|---|---|
Andersson et al. bioRxiv 2020 (preprint) | Lundeberg | Human | Cancer | 8x HER2 + breast tumours | Both | Cells spatially selected from fixed tissue sections | Spatial transcriptomics developed by Ståhl et al. Science 2016 | 1007 spots (where spot refers to a small neighbourhood populated by multiple cells) |
Azizi et al. Cell 2018 | Pe’er and Rudensky | Human | Both | 8x primary breast carcinomas tumour and matched normal breast tissue, peripheral blood and lymph nodes | Stroma-imune | CD45+ FACS sorted cells | scRNA-seq using the inDrop platform | 47,016 cells |
Bach et al. Nature Comm 2017 | Marioni and Khaled | Mouse | Normal | Mammary cells taken during pregnancy (day 14.5), lactation (day 6), involution (day 11) and from virgin 8 week old C57BL/6N mice | Epithelial | Live lineage negative EpCAM+ cells sorted using FACS | scRNA-seq using Chromium 10x Drop-Seq platform | 23,184 cells |
Bach and Pensa et al. Nature Comm 2021 | Khaled and Marioni | Mouse | Both | Tumourogenesis data set: 13 x Blg-Cre; Brca1f/f;p53 + /- mice (aged 30–48 weeks, nulliparous) and 2x C57BL/6N mice (aged 36–40 weeks old) | Both | Viable cells were isolated using MACS Dead Cell Removal Kit | scRNA-seq using Chromium 10x Drop-Seq platform | 102,829 cells |
Pregnancy data set: 9x C57BL/6N mice at 4.5/9.5/14.5 days gestation as well as 3x C57BL/6N mice (aged 12 weeks old) | ||||||||
Bartoschek et al. Nature Comm 2018 | Pietras | Mouse | Cancer | Tumours from 14 week old MMTV-PyMT mice | Stroma-imune | EpCAM-/CD45-/CD31-/NG2- FACS sorted mesenchymal cells | scRNA-seq using Smart-Seq2 | 768 cells |
Baslan et al. eLife 2020 | Hicks | Human | Cancer | Fresh pre-treatment core biopsies were taken from 16 patients’ enroled in phase II clinical trials conducted by the Brown University Oncology Group (BrUOG) | Both | Nuclei was isolated and FACS sorted for ploidy | Single-nuclei copy number analysis developed by Baslan et al. Nature Protocols 2012 | Mean of 116 single-nuclei per tumour |
Carli et al. J Mammary Gland Biol Neoplasia 2020 | Rudolph | Human | Normal | Human milk cells from two participants | Both | Viable cells were isolated using FACS | scRNA-seq using Chromium 10x Drop-Seq platform | 3740 cells |
Casasent et al. Cell 2018 | Edgerton and Navin | Human | Cancer | 10x ductal carcinoma in situ (DCIS) and invasive ductal carcinoma (IDC) tumour samples | Both | Cells selected using single cell laser dissection | Topographic single cell sequencing (TSCS, scDNA sequencing) | On average 129 cells per patient |
Chung et al. Nature Comm 2017 | Park | Human | Cancer | 11x tumour cells from different breast cancer subtypes | Both | Dead cells were removed using Ficoll-Paque PLUS | scATAC-seq using the Fluidigm C1 platform | 515 cells |
Chung et al. Cell Reports 2019 | Wahl | Mouse | Normal | Mammary cells from E18 foetal and 8-week old adult CD1 mice | Epithelial | EpCAM+/Lin- FACS sorted cells | snATAC-seq | 7846 high quality single nuclei |
Engelbrecht and Twigger et al. bioRxiv 2020 (preprint) | Scheel and Khaled | Human | Normal | Mammary tissue from one participant was digested 3 ways (3 h vs. 16 h digest and 10 rpm vs. 100 rpm. shaking speed). | Both | Viable cells were isolated using MACS Dead Cell Removal Kit | scRNA-seq using Chromium 10x Drop-Seq | 11,191 cells |
Gao et al. Nature Comm 2017 | Navin | Human | Cancer | Breast cancer cell lines for technique validation and a single triple negative breast tumour sample | Both | Nuclei isolated from frozen tumour and cell lines | Nanogrid snRNA-seq | 796 primary cell nuclei |
Giraddi et al. Cell Rep 2019 | Wahl and Spike | Mouse | Normal | Mammary cells isolated from day 16/18 embryonic (E16, E18), postnatal day 10 (P10) and 10–16-week-old adult C57BL/6 mice | Epithelial | EpCAM+ FACS sorted cells | scRNA-seq using Chromium 10x Drop-Seq and Fluidigm C1 platforms | 6060 cells using 10x and 262 cells using Fluidigm C1 |
Gkountela et al. Cell 2019 | Aceto | Human | Cancer | Blood samples containing circulating tumour cells (CTCs) from 43 patients with progressive breast cancer | Both | Live CTSs were stained for EpCAM, HER2 and EGFR and sorted for CTCs | Single Cell Whole-genome Bisulfite Sequencing developed by Farlik et al. Cell Rep. 2015 | 89 single CTCs and 71 CTC clusters from patients and xenographs |
Grosselin et al. Nature Genetics 2019 | Griffiths, Vallot and Gérard | Mouse and Human | Cancer | Patient derived xenogrft mouse models of luminal and treatment resistant breast cancers | Both | Viable cells were isolated using MACS Dead Cell Removal Kit | scChIP-seq and scRNA-seq | 2728 cells profiled by scRNA-seq |
Han et al. Cell 2018 | Guo | Mouse | Normal | Mammary cells taken during pregnancy, lactation, involution and from virgin C57BL/6 mice | Both* | Cells from all major mice organs including mammary gland. *However, no mention was made of removal of the mammary lymph nodes prior to mammary gland dissociation. | scRNA-seq using microwell-seq | 61,196 mammary gland cells |
Kanaya et al. Commun Biol 2019 | Chen | Mouse | Normal | Mammary cells from 9-week-old BALB/cj mice which underwent ovariectomy (surgical menopause) and were treated with vehicle, E2 and/or PBDE | Both | Dead cells were removed using microbeads | scRNA-seq using Chromium 10x Drop-Seq platform | 14,856 cells |
Karaayvaz et al. Nature Comm 2018 | Ellisen | Human | Cancer | 6x primary triple negative breast tumours | Both | Viable tumour cells were isolated by FACS sorting | scRNA-seq using Smart-seq2 | 1189 cells |
Knapp et al. Cell Reports 2017 | Eaves | Human | Both | 7x breast cancer cell lines and 8x normal primary breast tissue samples | Both | N/A | Mass cytometry using 35 markers (16 extracellular and 19 intracellular) | Not disclosed |
Li et al. Cell Reports 2020 | Brugge | Mouse | Normal | Mammary cells from young (3–4 month, n = 3) and aged (13–14 month, n = 4) old virgin C57BL/6 J mice | Both | None | scRNA-seq using Chromium 10x Drop-Seq platform | 13,684 cells |
Lo et al. Cancers 2020 | Zhou | Mouse | Normal | Mammary cells were isolated from normal and high fat diet (HFD) fed 2 month old C57BL/6 Brca1-/-;p53+/- mice | Stroma-imune | EpCAM- FACs sorted viable stromal and immune cells | scRNA-seq using Chromium 10x Drop-Seq platform | 3892 cells |
Murrow et al. bioRxiv 2020 (preprint v4) | Gartner | Human | Normal | Mammary cells isolated from 28x premenopausal women of varying BMI and age | Both | Live (DAPI-) cells were sorted using CD31-/CD45-/EpCAM+/-/ CD49f+/- and CD45+ | scRNA-seq using Chromium 10x Drop-Seq platform | 87,793 cells |
Nguyen et al. Nature Comms 2018 | Kessenbrock | Human | Normal | Mammary cells isolated from 7x individuals whom underwent reduction mammoplasties | Epithelial | CD31-/CD45- epithelial cells enriched by FACs sorting using CD49f and EpCAM | scRNA-seq using Chromium 10x Drop-Seq and Fluidigm C1 platforms | 24,646 cells using 10x and 868 cells using Fluidigm C1 |
Pal et al. Nature Comms 2017 | Visvader | Mouse | Normal | Mammary cells were isolated from early postnatal (2 week old), mid-pubertal (5 weeks old) and mature adult (10 weeks, either from virgin or pregnant) FVB/NJ mice | Epithelial | CD45-/CD31-/CD24+ epithelial cells were FACS enriched | scRNA-seq using Chromium 10x Drop-Seq and Fluidigm C1 platforms | 3308 cells using 10x and 460 cells using Fluidigm C1 |
Pervolarakis et al. Cell Reports 2020 | Watanabe and Kessenbrock | Mouse | Normal | Mammary cells were isolated from 10-week-old FVB/NJ mice | Both | FACS sorting using markers CD31, CD45, EpCAM and CD49f separated live cells for scRNA-seq and basal/luminal cells for scATAC-seq | scRNA-seq and scATAC-seq was performed using 10x Genomics Chromium platforms | 26,859 cells were profiled using scRNA-seq and 23,338 cells were profiled using scATAC-seq |
Salmén et al. bioRxiv 2018 (preprint) | Lundeberg | Human | Cancer | Tumour tissue sections from 10 patients diagnosed with HER2 + breast cancer | Both | N/A | Spatial Transcriptomics developed by Ståhl et al. Science 2016 and others | 1007 spatial spots |
Sebastian et al. Cancers 2020 | Loots | Mouse | Cancer | Mammary cells were extracted from 10-week-old BALB/c mice with 4T1 derived mammary tumours (a synergistic model for triple negative breast cancer) | Stroma | Using magnetic cell separation, CD45 depleted or CD45/CD90.1 depleted cells were sequenced, as well as bulk cells and CD140a+/EpCAM-/CD45-/7AAD- FACS sorted fibroblasts. | scRNA-seq using Chromium 10x Drop-Seq platform | 6420 cells |
Sun et al. J. Biol. Chem. 2018 | Deng | Mouse | Normal | Mammary cells were isolated from 3 to 4 month-old virgin or day 12.5 pregnant FVB mice | Epithelial | Lineage positive (Lin+, endothelial or immune) cells were excluded using the EasySeq mouse epithelial cell enrichment kit. Luminal and basal cells were FACS sorted for using CD24 and CD29 | scRNA-seq using Fluidigm C1 platform | 239 cells |
Thong et al. Front. Cell Dev. Biol. 2020 | Colacino | Human | Normal | Mammary cells were isolated from 3x normal mammoplasty tissue samples. Additionally, mammary cells from the 3 patients were conditionally reprogrammed in vitro and also sequenced. | Both | None | scRNA-seq using drop-seq protocols developed by Macosko et al. 2015. | Not disclosed |
The Tabula Muris Consortium Nature 2020 | – | Mouse | Normal | Mammary cells taken from at 3, 18 and 21 months from C57BL/6JN mice | Both* | Cells from all major mice organs including mammary gland. *However, no mention was made of removal of the mammary lymph nodes prior to mammary gland dissociation. | scRNA-seq using Chromium 10x Drop-Seq platform | 15,577 mammary cells |
Tognetti et al. bioRxiv 2020 (preprint) | Bodenmiller | Human cell lines | Both | 62x breast cancer cell lines and 5x normal cell lines | N/A | N/A | Mass cytometry using 34 markers | > 80 million cells |
Twigger et al. bioRxiv 2020 (preprint) | Khaled and Scheel | Human | Normal | 4x samples from participants donating mammoplasty tissue and 4x human milk samples from lactating women | Both | None | scRNA-seq using Chromium 10x Drop-Seq platform | 24,666 non-lactating breast cells and 27,023 human milk cells |
Vatter et al., Cell Reports 2018 | Bodenmiller, LaBarge and Lorens | Human | Normal | 57 samples were profiled consisting of cultured cells from 44 women and 13 uncultured breast epithelia samples. | Epithelia | N/A | Mass cytometry using 29 markers | 880,000 cells |
Wagner at al. Cell 2019 | Bodenmiller | Human | Both | 144x human breast tumours and 50x normal breast tissue samples | Both | N/A | Mass cytometry using 73 markers | 26 million cells |
Wang et al. eLife 2020 | Schwertfeger | Mouse | Normal | Mammary cells isolated from 10-week-old dioestrus FVB/NJ mice | Stroma-imune | CD45+ cells were enriched for using magnetic cell separation | scRNA-seq using Chromium 10x Drop-Seq platform | 13,000 cells were targeted |
Wu et al. EMBO J 2020 | Swarbrick | Human | Cancer | 5x triple negative breast tumours | Both | Viable cells were enriched for using EasySep Dead Cell Removal Kit | scRNA-seq using Chromium 10x Drop-Seq platform | 24,271 cells |
Wuidart et al. Nat Cell Biol 2018 | Blanpain | Mouse | Normal | Mammary cells were isolated from embryonic day 14 (E14) and > 8-week-old adult mice | Epithelial | Embryonic CD49fHi/Lgr5Hi cells, adult CD24+/CD29Hi basal and adult CD24+/CD29Lo luminal cells were FACS enriched | scRNA-seq using Smart-seq2 | 193 cells |
3.1. Embryonic mammary gland composition
Recent studies have compared embryonic and post birth mammary cell subpopulations (Fig. 1) using scRNA-seq and ATAC-seq (Table 1) in an attempt to answer whether bipotent progenitors giving rise to both luminal and basal cells exist postnatally. Using multicolour lineage tracing and scRNA-seq, Wuidart et al. identified that at embryonic day 14 (E14) a single population of embryonic multipotent progenitors (EMPs) exist in the mouse mammary gland. These cells exhibit a hybrid transcriptional signature comprising of marker genes from both the luminal and basal lineages [23]. Within this study it was found that p63 is a master regulator of basal cells, where overexpression in luminal cells is enough to convert them into a basal phenotype. One of the limitations of this study is that only a small number of cells were analysed by single cell analysis and that comparisons between different developmental time points were limited (69 EMPs compared to 51 adult basal cells and 73 adult luminal cells). However, within the same year, another study was published by Giraddi et al. which compared the single cell profile of cells during embryogenesis (E16 and E18), postnatally at day 4 (P4) and from the adult mouse mammary gland. As was observed in the previous study, mammary cells from E16 and E18 generated single separate clusters of foetal cells, whereas postnatally both luminal and basal cells could be identified which was further separated in the adult gland into mature and alveolar luminal clusters and a separate basal cluster [73]. Original bulk ATAC-seq analysis by Dravis et al., referenced in Girraddi et al., showed that E18 foetal mammary stem cells (fMaSCs) presented open features at distal enhancer and proximal promoter regions of both luminal and basal genes [74]. Using single nucleus ATAC-seq (snATAC-seq), follow up studies confirmed that during late embryogenesis (E18), individual cells displayed either a basal-like (Krt5, Acta2) or pan luminal/luminal progenitor-like (Krt8, Krt18, Kit) chromatin accessibility profile together with foetal-enriched genes such as Sox10 and Sox21 [75]. snATAC-seq and scRNA-seq data from Giraddi et al. was integrated to examine the developmental trajectory which found that both E16 and E18 foetal cells remain tightly clustered. Whilst clustered cells at E18 were found to be indistinguishable on a transcriptomic level, lineage priming was found using snATAC-seq, where cells displayed accessibility either to luminal or basal cell fate [75]. Subsequently, only postnatal lineage restricted luminal or basal cells were present, which upon onset of puberty the luminal cells further split into luminal alveolar and hormone responsive luminal cells (see Table 2 for a summary of the commonly used nomenclature for these populations).
Table 2.
Basal |
Luminal |
||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Reference | Organism | Technique | Basal | Luminal progenitor | Secretory alveolar | Hormone responsive | |||||||||||
Bach et al. 2017 | Mouse | scRNA-seq | Basal | Luminal progenitor | Differentiated secretory alveolar | Hormone sensing | |||||||||||
Chung et al. 2019 | Mouse | snATAC-seq | Basal | Luminal progenitor | N.A.a | Mature luminal | |||||||||||
Engelbrecht et al. 2020 | Human | scRNA-seq | Basal/myoepithelial (BA) | Luminal hormone receptor negative progenitors (LHR-) | N.A.a | Luminal hormone-receptor positive mature cells (LHR+) | |||||||||||
Giraddi et al. 2018 | Mouse | scRNA-seq | Basal | Alveolar precursor | N.A.a | Mature luminal | |||||||||||
Han et al. 2018 | Mouse | scRNA-seq | Myoepithelial cells | Luminal progenitor | Ductal luminal | Secretory alveoli | N.A. | ||||||||||
Knapp et al. 2017 | Human | Mass cytometry | Basal cells (BCs) | Luminal progenitors (LPs) | Luminal cells (LCs) | ||||||||||||
Li et al. 2020 | Mouse | scRNA-seq | Myoepithelial | HS-AV | Alveolar (AV) | N.A.a | Hormone sensing (HS) | ||||||||||
Murrow et al. 2020 | Human | scRNA-seq | Basal/Myoepithelial | Secretory luminal | N.A.a | Hormone responsive (HR+) | |||||||||||
Nguyen et al. 2017 | Human | scRNA-seq | Basal | Myoepithelial | Secretory L1-type | N.A.a | Hormone responsive L2-type | ||||||||||
Pal et al. 2017 | Mouse | scRNA-seq | Basal/Myoepithelial | Mammary stem cells (MaSCs) | Luminal progenitor | Luminal intermediate (Lum Int) | Alveolar | Mature luminal/Ductal | |||||||||
Pervolarakis et al. 2020 | Mouse | scRNA-seq/scATAC-seq | Myoepithelial | Luminal secretory (L-sec) | Hormone responsive | ||||||||||||
Luminal progenitor | Lactation progenitor/precursor | Secretory alveolar | |||||||||||||||
Regan and Smalley 2020 | N.A. | N.A. | Myoepithelial cells | Oestrogen receptor negative (ER-) ductal cells | Secretory alveolar cells | Hormone-responsive oestrogen receptor-positive (ER+) cells | |||||||||||
Sun et al. 2018 | Mouse | scRNA-seq | Myoepithelial/Basal | Proliferative luminal cells (PLCs) | Keratinised luminal cells (KLCs) | Mature luminal cells (Mature LCs) | Lipid biosynthetic luminal cells (LBLCs) | Stimulus-responsive luminal cells (SRLCs) | |||||||||
Proliferative basal cells (PBs) | Wnt signalling -responsive basal cells (WRBCs) | Mammary stem cells (MaSCs) | |||||||||||||||
Thong et al. 2020 | Human | scRNA-seq | Myoepithelial | Luminal 1 (L1) | N.A.a | Luminal 2 (L2) | |||||||||||
Twigger et al. 2020 | Human | scRNA-seq | Basal cells (BA) | Luminal progenitor (LP) | Secretory alveolar | Hormone responsive (HR) | |||||||||||
Luminal 1 (LC1) | Luminal 2 (LC2) | ||||||||||||||||
Vatter et al. 2018 | Human | Mass cytometry | Basal myoepithelium (MEP) | Luminal epithelium (LEP) | |||||||||||||
Wuidart et al. 2018 | Mouse | scRNA-seq | Basal cells (BCs) | Luminal cells (LCs) |
N.A. pregnancy/lactation time point wasn’t included in this study, hence secretory alveolar cells not described.
3.2. Adult mammary gland epithelial cell composition
Even through the dramatic changes occurring during pregnancy, lactation and involution in the adult mammary gland, scRNA-seq findings from Bach et al. support the findings that only lineage restricted progenitors exist postnatally. Epithelial cells taken from virgin, pregnant, lactating and involuting mice were characterised to find distinct subpopulations of cells from either the luminal or basal lineages [71]. These cell clusters were ordered along a pseudotime trajectory and it was found that both luminal and basal cells did not have a common progenitor [71]. The luminal cells however were found to have a continuum of differentiation stemming from Aldh1a3+ luminal progenitors and extend to the two major lineages of secretory alveolar and hormone responsive. Clearly replicating such experiments in humans would prove difficult, hence studies conducting scRNA-seq of human milk cells have provided novel findings on the maturation of functional human mammary cells during lactation [76], [77]. Recent work from our lab examining milk cells from 4 donors and 4 non-lactating breast tissue samples found two populations of secretory luminal milk derived cells which are transcriptionally similar to luminal progenitor cells from non-lactating breast [76]. This work suggests that luminal progenitors may give rise to secretory alveolar cells in humans, as well as in mice. Furthermore, another recently published study has found that aberrant differentiation of luminal progenitor cells, resulting in expression of key milk proteins Lalba, Csn2 and Wap, may provide an indication of early tumorigenesis in Brca/p53 mouse models [78]. This highlights the importance of resolving the role luminal progenitors play in development to better understand breast cancer development. Findings of a common luminal progenitor in the adult mammary gland is in agreement with findings from Giraddi et al. but are in contrast to the recent lineage tracing studies which suggest that these two luminal cell lineages are unipotent in adult mice [22], [79], [80], [81], [82]. It is possible that the bipotent luminal cells identified in these scRNA-seq studies are active during early embryonic and prepubertal stages and that later in development, the two luminal lineages are maintained by their respective progenitors. Interesting, it was found that parity induced Aldh1a3+ luminal progenitor cells exist post involution, that were seemingly primed towards the alveolar lineage and expressed many lactation associated genes such as Lipa, Xdh and casein genes such as Csn2 and Csn3 [71]. Findings from a recent study that combined scATAC-Seq and scRNA-seq to study 10-week-old adult mice corroborated findings from the Bach study, identifying a single myoepithelial population as well as a Foxa1+ hormone responsive luminal population and Elf5+/Kit+ luminal cells (termed L-sec for luminal secretory, see Table 2) [83]. The L-sec cells were further divided into Rspo1+/Aldh1a3+ luminal progenitor and Lalba+/Csn2+/Lipa+ lactation progenitor cells which according to pseudo temporal analysis suggests that the former gives rise to the latter cell subtype [83]. A limitation of this study was that analysis was only conducted at one time point, where comparisons between the gene expression profiles of the discovered lactation progenitor and previously described secretory alveolar cells [71] would have developmentally contextualised these findings.
Similar cell populations were reported in the adult human mammary gland at a single time point [84], [85]. Mass cytometry of normal primary mammary tissue identified epithelial subsets of basal, luminal (corresponding to mature luminal, see Table 2) and luminal progenitor cells [85]. Strikingly, within the latter population, a small subset of cells displayed an elevated content of active caspase-3 whilst maintaining clonogenicity, suggesting potential greater genomic instability and increased risk of oncogenic transformation in this set of luminal progenitor cells [85]. Nguyen et al. also identified 3 major epithelial cell types using scRNA-seq which they termed myoepithelial, luminal 1 (L1) and luminal 2 (L2) cells (see Table 2). The SLPI+ L1 population was split into ELF5+/KIT+ L1.2 (resembling luminal progenitor cells, Table 2) and LTF+ L1.1 cells (resembling secretory alveolar cells, Table 2). L1.2 luminal progenitor cells were found to sit above the other luminal cell types on the differentiation hierarchy. L2 cells expressed the marker ANKRD30A together with hormone receptors ESR1, PGR and AR indicative of a hormone responsive subpopulation (Table 2). Interestingly, upon closer examination of the basal cluster identified using 10x Genomics scRNA-seq, a subcluster of cells highly expressed contractile genes (ACTA2, TGLN, KRT14) which the authors subsequently identified as specific myoepithelial cells [84]. However, data from a recent pre-print suggests that differential expression of adhesion markers may be an artefact of long digestion duration rather than due to distinct subclusters in human mammary cell subtypes [86]. Within the Nguyen paper, the authors manually set the start of the pseudotime to be within the basal cell type and found that the resulting trajectory differentiated from the basal cells to a myoepithelial branch and a bifurcated luminal branch, topped by L1.2 luminal progenitors [84]. Whilst the studies described thus far appear to represent a concordant view of mammary cell subpopulations in the embryonic and postnatal mammary gland of both mice and humans, contradictory findings have been published [72], [87], [88], [89] and reviewed [90], [91] suggesting bipotent progenitors existing postnatally.
3.3. Multipotent lineage progenitors in the adult mammary gland
Similar to Bach et al., another study published at the same time [72] investigated postnatal mammary cell subpopulations in mice at slightly different developmental time points (pre-puberty, puberty, adult and pregnancy as well as different stages of the oestrus cycle) however came to different conclusions. Findings from this study suggest that prior to puberty a majority of epithelial cells consist of a basal phenotype, where a rare basal-like population expressing some luminal features is positive for the marker Cd55 [72]. During puberty this population seemingly expands, where after puberty the Cd55+ cells predominantly reside in the luminal cell compartment. Further luminal subpopulations were identified including intermediate (sharing markers of both luminal progenitor and hormone responsive cells) and mixed (expressing both Kit/Elf5 luminal markers and Acta/Krt14 basal markers) cell subpopulations [72]. Detailed discussion on this study, together with other early mammary scRNA-seq studies that classified new cell types based on expression of markers such as c-Kit (Kit), can be found in [92]. A more recent study in mice also found a hybrid luminal cell subpopulation, which expressed genes characteristic of both hormone sensing (HS) and alveolar progenitors (AV) termed HS-AV cells, which decreased with age [93]. Both the described “intermediate” [72] and “HS-AV” [93] luminal cells expressed higher levels of Prlr and Cited than luminal progenitor/alveolar cells, and higher levels of Csn3 and Trf than hormone responsive cells. Li et al. went further and confirmed using immunofluorescence these double luminal lineage cells by illustrating co-staining of progesterone receptor with milk fat globule-EGF factor 8 (PR+/MFGE8+) and oestrogen receptor with lactoferrin (ER+/LTF+) [93]. Despite the identification of new luminal cell types, Pal et al. did not observe a separate rare population of mammary stem cells positive for Epcam and Procr or Lgr5 and Tspan8 that was expected according to data from previous studies [72]. Similarly, Sun et al. examined epithelial cells taken from the adult mouse in the virgin and pregnant state and also noted an absence of a unique population of cells that expressed Lgr5 and Tspan8 [87]. Findings from this study observed a Cdh5+/Procr+ population of cells that resided in the basal compartment, which proliferated in vitro and had in vivo mammary repopulation potential [87]. Analysis of developmental trajectories suggested that this cell type was at the top of a mammary epithelial cell hierarchy during pregnancy, however this was not the case in the virgin gland.
Follow up studies utilised novel bioinformatic tools to compare across different data sets in an attempt to reconcile the differences found between the single cell studies to date. One such study developed a model called Landscape of Single Cell Entropy (LandSCENT) which provides a potency score for single cells independent of using lineage markers which they then used to define root states for lineage trajectory algorithms [88]. Using this method, this study reanalysed an adult human data set [84], where they identified high potency cells mapping to the periphery of the basal and immature alveolar clusters. Cells that sit in this bipotent state overexpressed transcription factors YBX1 and ENO1 which have been implicated in basal breast cancer risk and were suggested as potential markers identifying multipotent progenitors [88]. However, throughout this paper it was noted that data from the Nguyen et al. study was not of a sufficient quality to identify stem cell populations without this technique and that future studies should investigate more cells to a greater depth. A more recent study also conducting retrospective comparisons between many data sets and, in addition, produced a novel dataset characterising normal human mammoplasty cells from 3 donors together with cultured cells [89]. Whilst this study’s main focus was on the epithelial compartment of mammary cells, they captured all noted epithelial populations together with stromal and immune cell subtypes [89]. Primary mammary cells were cultured in 2D on top of irradiated fibroblasts to conditionally reprogramme the cells [94]. After which, the authors identified different “hybrid” epithelial cell populations emerging including basal/luminal (KRT14+/KRT18+), epithelial/mesenchymal (EPCAM+/VIM+) and some that expressed all 4 markers (quadruple positive cells, KRT14+/KRT18+/EPCAM+/VIM+). In particular, they found by comparing their data with previous studies [71], [73], [84] that normal human mammary tissue mapped to the adult mammary tissue of mice and conditionally reprogrammed cells map closer to the stem-like population of embryonic mouse cells. After comparison with all these time points, they noted that the “hybrid” cells existed at different time points predominantly during the in utero period, gestation and lactation. This paper provides a first look at all the mammary cell subpopulations (including stroma) that exist in the adult human mammary gland but does not take into account differences in marker expression profiles between mice and human cells, namely that KRT14 expression has been routinely observed by pathologists in both luminal and basal cells in the adult human [95]. Accurate use of markers to define different cell types is important when describing whether cell subpopulations contain markers from both luminal and basal cells. In addition, it is important to remember that cells captured in these studies are done so at a single snapshot in time and that their temporal trajectories are inferred only from these single time points. Therefore conclusive statements of biopotency must be further verified using in vivo models, as discussed by Watson and Khaled [96], and may take advantage of emerging technologies such as scGASTALT/LINNAEUS [97], [98] that combine unbiased lineage tracing with scRNA-seq-detectable genetic scaring.
3.4. Challenges remaining in the face of generating a unified mammary cell atlas
Before we attempt to understand the cells driving breast cancer formation, it is becoming increasingly clear that we need to unify findings from mammary development studies to be able to describe which mammary cell subpopulations universally exist, which species-specific markers they express and define commonly used nomenclature for each of these cell subtypes. To this end, we have attempted to summarise the major mammary single cell studies (Table 1) and common nomenclature used for the four major adult epithelial subtypes (Table 2). Not all studies identify the same mammary subtypes, however most studies agree that cell subpopulations such as adipocytes or secretory alveolar cells during lactation (see Bach et al.) have been incompletely profiled due to difficulties in isolating intact cells during dissociation. Future efforts to build a mammary cell atlas must explore different methods to dissociate tissue/isolate cells, such as modifying digestion duration [86] or temperature of enzymes [99], to overcome certain cell types being disproportionately enriched or depleted during sample preparation.
As described above, some subclusters classified as epithelial cells were inimitably observed, including Cd55+ [72] and Cdh5+ [87] subclusters or ZEB1+/TCF4+ [84], VIM+/EPCAM+ and KRT18+/KRT14+ cells [89]. Cd55+ and Cdh5+ clusters highly express genes such as Axl and Sparc or Cd36 and Pecam1, that have been associated to fibroblasts and endothelial stromal subpopulations [100]. This suggests the utilised CD24/CD29 sorting strategy may not have been sufficient to exclude contaminating stromal cells. Similarly, a cluster of Procr+ cells classified as basal cells also expressed some but not all markers of pericytes and hence the authors conceded that these cells may be contaminating non-epithelial cells [71]. Further rare epithelial cell subtypes such as ZEB1+/TCF4+ cells [84], VIM+/EPCAM+ cells and KRT18+/KRT14+ cells [89] were identified by marker expression alone, where cells did not form separate clusters arising from clustering or dimension reduction analysis. As such, without orthogonal validation (such as tissue staining) it is difficult to determine the biological relevance of these results due to potential technical error such as doublet artefacts or detection of spurious transcripts. Clearly, it is remarkably difficult to characterise mammary cell subpopulations based on a limited number of markers. Hence, moving forward, cells profiled using single cell technologies should be classified into subpopulations using the cell’s global transcriptome, preferentially utilising cell signatures made up of an abundance of different validated genes.
Indeed, the issue of unifying diverse cell types across different organs is a universal problem of cell biology and various consortiums have begun to catalogue different cell types using single cell technologies. A recent study by Han et al. characterised a range of different tissues including: the mammary gland, lung, kidney, testis and placenta using scRNA-seq as a resource in developing the mouse cell atlas [101]. Specifically, in regard to the mammary gland, tissue was examined from virgin, pregnant, lactating and involution tissue. Whilst many of the briefly described findings fit what has been described in studies above [101], due to the lymph nodes not being removed prior to tissue digestion it is hard to distinguish whether the reported immune cells are lymph node-resident or mammary tissue-resident. Another similar study conducted by the Tabula Muris consortium attempted to discern how many different organs aged differently [102]. Cells of different organs were extracted and sequenced from mice between 1 month to 2.5 years. Interestingly, it was noted that for the mammary gland, there was a significant decline in T cells (which could be confounded by the lymph nodes potentially not being removed prior to gland dissociation), and overall there was an upregulation of API transcription factor family (Junb, Jund, Fos) with age [102]. Findings from these studies highlight that it is important to consider factors outside of normal development such as age that might influence mammary cell subpopulations and gene expression profiles.
3.5. The impact aging, menopause, parity and obesity have on the adult mammary gland
The effect ageing, menopause, parity and obesity have on mammary cell subpopulation proportions has been investigated across a number of studies [93], [100], [103], [104]. Findings from an article examining primary mammary cells of healthy women using mass cytometry observed age-related increase in certain luminal cell subpopulations with a corresponding decrease in myoepithelial cell proportions [104]. Similarly, a recent article examining the mammary glands of young compared to aged mice has found that the gland contains altered proportions and gene expression profiles of both epithelial and stromal cells [93]. Interestingly, they find an increase in the proportion of epithelial cells as well as identifying an increase specifically in the numbers of alveolar cells. It was also noted that in myoepithelial cells, there was a decrease in the expression levels of genes associated with basement membrane protein synthesis as well as an increase in inflammatory cytokine production by the myoepithelial, vascular endothelial and macrophage cells [93]. These findings suggest dramatic changes occurring in the stroma in aging cells that may provide a pro-tumourigenic microenvironment.
As a result of aging, menopause is a risk factor for breast cancer development and similarly various changes in the mammary cell subpopulations may provide a mechanism behind this phenomenon. In the context of surgical menopause (through an ovariectomy), the impact oestrogen (17beta-estadiol, E2) supplementation with or without exposure to harmful endocrine disrupting environmental contaminant PBDE (polybrominated diphenyl ethers) has on mammary cell subpopulation proportions was investigated [100]. This study finds in the menopausal vehicle mouse, there is an abundance of fibroblasts and stomal cells with a lower proportion of epithelial cells. However, the use of E2 induces the regrowth of terminal end bud like structures which is enhanced in the presence of PBDE. Administering E2 and PBDE together induced expression of progesterone receptor (PR) independent of oestrogen receptor (ERα, Esr1-) cells in two luminal cell populations as well as an increase in M2 macrophage populations which likely contribute to the mammary tissue remodelling and production of a pro-tumour microenvironment.
Another set of factors thought to change the risk of developing breast cancer include parity and obesity. Murrow et al. in a recently updated preprint explored differences in the human mammary gland composition as a result of either obesity or parity [103]. It was found that parity increased the number of myoepithelial cells and transcriptional response of hormone responsive cells in the mammary gland, as well as altering the size of alveoli. Interestingly, they found that luminal progenitor proportions do not seem to correlate with parity and that hormone responsive cells decrease in number in obese women. Coming from a slightly different perspective, another study examined the impact high fat diet (HFD)-induced obesity had on the stomal compartment of the mammary glands of Brca1-/-, p53+/- breast cancer model mice [105]. This study found that HFD induced an increase in the expression of extracellular matrix genes (Col3a1, Col6a3, Eln and Sparc) and pro-tumourigenic M2 macrophage markers in monocytes suggesting an alteration in stromal cell function and microenvironment in obese breast-cancer prone mice. From these studies, it is clear that certain risk factors such as age, menopause and obesity alter both the cellular states and proportions of epithelial and stromal subtypes and suggest that careful characterisation of stomal cells is essential to understand breast cancer development.
3.6. Understanding mammary stroma diversity is integral to understanding breast cancer development
Theories surrounding the mammary stroma inducing a wound healing phenotype elicited in breasts with a high mammographic density, during obesity and in post-lactational remodelling have been put forward as a mechanism for breast cancer development [90]. As such, it is important to carefully characterise normal stromal cell compositions and how these might change as a mammary gland acquires a tumourigenic phenotype. Indeed, an early scRNA-seq study that profiled cells from breast tumours found that many of the cells with a normal phenotype (as identified by estimates of copy number variations) were immune cells [106]. Increasing interest in the macrophage proportion of mammary immune cells led one group to investigate the CD45+ population of 10-week-old murine mammary glands. Using scRNA-seq they identified a Lyve-1+ subpopulation of tissue resident macrophages which expressed high levels of extracellular matrix remodelling associated genes [107]. In human tissue, arising from either breast tumour or matched normal breast tissue, scRNA-seq analysis of the CD45+ compartment found that immune cell subpopulations identified in normal tissue were only a small subset of those identified in tumours [108]. Interestingly they often found M1 and M2 macrophage associated genes in the same cell which were positively correlated, suggesting that each state might not be as mutually exclusive as previously thought and cell state might exist instead along a continuum.
Another key stomal cell type of interest are mammary fibroblasts and in particular their similarity to cancer associated fibroblasts (CAFs). Focusing on sorted fibroblasts extracted from a precancer MMTV-PyMT mouse mammary cells, three major populations of CAFs have been identified including: vascular (vCAF, with a subset of cycling CAFs, cCAFs), matrix (mCAF) and developmental (dCAF) associated CAFs [109]. These cells were found to be transcriptionally and spatially distinct, where vCAFs were found to come from a perivascular location, resident fibroblasts gave rise to mCAFs and dCAFs are malignant cells that have undergone epithelial to mesenchymal transition [109]. On the other hand, a subsequent study investigating CAFs and normal fibroblasts in triple-negative breast cancer (TNBC)-modelling BALB/c-derived 4T1 mammary tumours, identified 6 CAF subpopulations [110]. These subpopulations included: Ly6c1high, -SMAhigh, Cd53high, Crabp1high, Cd74high and cycling CAFs, which were identified in the above study. This study further compared their findings to those of Bartoschek et al. and found that mCAFs were enriched for Crabp1, and other markers expressed by Crabp1high cells, suggesting that these cells may represent similar subpopulations. A recent study by Wu et al. examined the epithelial and stromal compartment of primary TNBC tumours from 6 patients and found aside from the expected normal mammary subpopulations, a basal epithelial cancer cluster, two CAF subpopulations and two perivascular-like (PVL) subpopulations [111]. The noted subpopulations included: myofibroblast-like CAFs (myCAFs), inflammatory CAFs (iCAFs) and mature and immature PVL cells. Here the authors suggest that the cells described as vCAFs in the Bartoschek et al. study, may be PVL cells identified by this study [111]. Not all CAF subsets were identified across studies, highlighting the need to synchronise findings as much as possible to determine translatable cell subpopulations.
3.7. Profiling breast cancer tumours
Many single cell techniques have been utilised to examine human mammary tumours and may be considered the beginning of a breast cancer cell atlas. A recent study by Wagner et al. used mass cytometry to examine the expression of 73 proteins across 144 human breast tumours and 50 non-tumour tissue samples to determine different cell subpopulations [112]. It was found that despite tumour subtype, there was a significant amount of cellular variation within individual tumours. Common to all breast cancer subtypes however, were PD-1+ T-cells and PD-L1+ tumour associated macrophages (TAMs). Another recent mass cytometry study also generated a cell atlas but instead examined 62 breast cancer cell lines and five lines from healthy tissue [113].
Comparisons between breast cancer cell lines and primary breast cancer tumours were made by Gao et al. using a novel technique of single-nucleus RNA-sequencing [114]. Similar to Wagner et al., different subpopulations of cells could be found within a single tumour. However, through examining all populations together, a rare subpopulation of highly proliferative cells was identified that upregulated breast cancer associated genes [114]. Focusing more closely on six triple negative breast cancer tumours, Karaayvaz et al. found that although most clusters arose from separate tumours from different patients, one cluster was contributed by all tumour samples [115]. This subcluster had a gene expression profile associated with multiple signatures of treatment resistance and metastasis and was characterised by activated glycosphingolipid metabolism and associated innate immunity pathways [115].
Using spatial transcriptomics, Salmén et al. instead examined another breast cancer subtype, HER-2 positive tumours and found that by integrating topographical information from the tumour section images and spatial spots, immune cells infiltrated into the invasive regions of the cancers [116]. In contrast, Andersson et al. who also used spatial transcriptomics but instead integrated image analysis with scRNA-seq data, found that epithelial cells remained segregated from the cross-talking stromal cells [117]. In many tumours, plasma cells were spatially segregated from B-cells, which were found to rather colocalize near T-cells in several patients [117]. Spatial information has also been integrated with scDNA-sequencing, where analysis of breast tissue from patients with ductal carcinoma in situ (DCIS) with invasive ductal carcinoma (IDC), found that genome evolution occurs in the ducts before tumour cells escape the basement membrane and begin generation of the IDC tissue [36].
Analysis of tumour cell genome and epigenome is essential to identifying how cancer cells acquire mutations and progress to become metastatic. Use of HM-SNS (see single cell DNA-sequencing section), found that copy number aberrations (CNA) were acquired in the earliest stages of triple negative breast cancer evolution in short, punctuated bursts rather than gradual evolution, as was previously thought [35]. Interestingly, Wang et al. found that triple-negative tumour cells had an increased mutation rate compared to ER+ cells [29]. Further to findings of Gao et al. [35], Wang found that aneuploid arrangements arose early in tumour development and remained stable across clonal expansion [29]. A more recent study using single-cell copy number analysis found that pseudo-diploid single cells existed across different PAM50 subtypes and contained cancer specific alterations (such as 1q gain and 16q loss), suggesting intrinsic genomic stability and providing a potential identifying feature that could be used in early disease stage risk assessment [118]. Analysis of chromatin states in patient-derived treatment-resistant xenograft models found that a subset of cells across different tumours shared a common chromatin signature consisting of a loss of H3K27me3 chromatin mark [119]. Another study examining the epigenome of breast cancer cells focused particularly on the DNA methylation patterns of circulating tumour cells and found that binding sites for stemness and proliferation-associated transcripts are specifically hypomethylated [120]. Utilisation of many different single cell tools provides new understandings about breast tumour cell diversity and growth potential; however, in many cases normal mammary cells may not have been profiled in the same way and thus cannot be compared. Where similar technologies have compared tumour and normal mammary cells, it is often the case that they have vastly different profiles, making it difficult to identify intermediate cells and discern the cell of origin for cancer. Increasing numbers of studies are now focused on examining normal mammary tissue from individuals with high breast cancer risk that may assist in identifying intermediate pre-clinical cells that can be targeted for early detection of disease.
4. Conclusions
As is evident from the increasing number of studies utilising these tools, single cell technologies is playing an increasingly important role in understanding the cellular composition of the mammary gland. Here we presented an overview of many of the findings from the emerging studies, which together are beginning to bring us a clearer picture of the different mammary subpopulations that exist in both the human and murine gland and how they change in normal development and cancer. Increasingly, there is a need to better unify findings from different studies through the development of consortiums such as the human cell atlas. Important to remember, is that disparate isolation techniques may influence findings of different cell subpopulations, and computational tools may be required to correct for batch or indeed cellular isolation effects that may have provided discordant views of cell subpopulations in previous literature. Whilst the power of computational tools to overcome limitations of different cell isolation techniques may be required, it is important to remember that in silico modelling also has its limitations. All single cell technologies to date only allow for the capture and examination of single cell profiles at a snap-shot in time, where any differentiation trajectories in the cell populations have only been inferred in silico. Hence, it is important that other in vitro and in vivo techniques such as organoid modelling or lineage tracing experiments are used to validate findings inferred from single cell analysis. This is, however, an exciting era for molecular biology where future studies should take full advantage of current and emerging techniques. Topographical/spatial transcriptomics such as Slide-seq [121] permits scRNA-seq of cells from known positions in ex vivo tissue sections. Techniques such as these allow for verification of cellular identity determined by transcriptomic profile by examining the cell’s position within a tissue, as well as interaction partners such as surrounding cells or extracellular matrix. Techniques that combine two single cell technologies, such as CITE-seq, allows for verification of cell subpopulation identity by combining cell surface marker analysis with scRNA-seq generated transcriptomic profiles. Similarly, SHARE-seq allows for the prediction of future cell state by integrating scRNA-seq and scATAC-seq in the same cells. This technique provides information on cell identity and function through transcriptomic profiling and cell states through chromatin accessibility. Together, techniques such as these will allow us to answer some of the most fundamental and interesting questions of mammary gland biology research, such as how the mammary gland is arranged, which definitive mammary subpopulations exist and the differentiation potential of different cell types.
Acknowledgment
Our thanks are extended to our reviewers whose insightful comments allowed us to develop a more comprehensive review. Apologies are extended for any studies we did not include due to space limitations. AJT is funded by a Biotechnology and Biological Sciences Research Council (BBSRC, United Kingdom) project grant (BB/S006745/1) to WTK. WTK is funded by Cancer Research United Kingdom (CRUK) grant (C56829/A22053).
Contributor Information
Alecia-Jane Twigger, Email: ajt215@cam.ac.uk.
Walid T. Khaled, Email: wtk22@cam.ac.uk.
References
- 1.Azamjah N., Soltan-Zadeh Y., Zayeri F. Global trend of breast cancer mortality rate: a 25-year study. Asian Pac. J. Cancer Prev. 2019;20(7):2015–2020. doi: 10.31557/APJCP.2019.20.7.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Watson C.J., Khaled W.T. Mammary development in the embryo and adult: a journey of morphogenesis and commitment. Development. 2008;135(6):995–1003. doi: 10.1242/dev.005439. [DOI] [PubMed] [Google Scholar]
- 3.Prat A., Perou C.M. Deconstructing the molecular portraits of breast cancer. Mol. Oncol. 2011;5(1):5–23. doi: 10.1016/j.molonc.2010.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Perou C.M., Sorlie T., Eisen M.B., van de Rijn M., Jeffrey S.S., Rees C.A., Pollack J.R., Ross D.T., Johnsen H., Akslen L.A., Fluge O., Pergamenschikov A., Williams C., Zhu S.X., Lonning P.E., Borresen-Dale A.L., Brown P.O., Botstein D. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–752. doi: 10.1038/35021093. [DOI] [PubMed] [Google Scholar]
- 5.Prat A., Perou C.M. Mammary development meets cancer genomics. Nat. Med. 2009;15(8):842–844. doi: 10.1038/nm0809-842. [DOI] [PubMed] [Google Scholar]
- 6.Sorlie T., Perou C.M., Tibshirani R., Aas T., Geisler S., Johnsen H., Hastie T., Eisen M.B., van de Rijn M., Jeffrey S.S., Thorsen T., Quist H., Matese J.C., Brown P.O., Botstein D., Lonning P.E., Borresen-Dale A.L. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc. Natl. Acad. Sci. USA. 2001;98(19):10869–10874. doi: 10.1073/pnas.191367098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lim E., Vaillant F., Wu D., Forrest N.C., Pal B., Hart A.H., Asselin-Labat M.L., Gyorki D.E., Ward T., Partanen A., Feleppa F., Huschtscha L.I., Thorne H.J., kConFab, Fox S.B., Yan M., French J.D., Brown M.A., Smyth G.K., Visvader J.E., Lindeman G.J. Aberrant luminal progenitors as the candidate target population for basal tumor development in BRCA1 mutation carriers. Nat. Med. 2009;15(8):907–913. doi: 10.1038/nm.2000. [DOI] [PubMed] [Google Scholar]
- 8.Visvader J.E. Keeping abreast of the mammary epithelial hierarchy and breast tumorigenesis. Genes Dev. 2009;23(22):2563–2577. doi: 10.1101/gad.1849509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Shackleton M., Vaillant F., Simpson K.J., Stingl J., Smyth G.K., Asselin-Labat M.L., Wu L., Lindeman G.J., Visvader J.E. Generation of a functional mammary gland from a single stem cell. Nature. 2006;439(7072):84–88. doi: 10.1038/nature04372. [DOI] [PubMed] [Google Scholar]
- 10.Stingl J., Eirew P., Ricketson I., Shackleton M., Vaillant F., Choi D., Li H.I., Eaves C.J. Purification and unique properties of mammary epithelial stem cells. Nature. 2006;439(7079):993–997. doi: 10.1038/nature04496. [DOI] [PubMed] [Google Scholar]
- 11.Sleeman K.E., Kendrick H., Ashworth A., Isacke C.M., Smalley M.J. CD24 staining of mouse mammary gland cells defines luminal epithelial, myoepithelial/basal and non-epithelial cells. Breast Cancer Res. 2005;8(1):R7. doi: 10.1186/bcr1371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Visvader J.E., Stingl J. Mammary stem cells and the differentiation hierarchy: current status and perspectives. Genes Dev. 2014;28(11):1143–1158. doi: 10.1101/gad.242511.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wu S.S., Lee J.H., Koo B.K. Lineage tracing: computational reconstruction goes beyond the limit of imaging. Mol. Cells. 2019;42(2):104–112. doi: 10.14348/molcells.2019.0006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wang D., Cai C., Dong X., Yu Q.C., Zhang X.O., Yang L., Zeng Y.A. Identification of multipotent mammary stem cells by protein C receptor expression. Nature. 2015;517(7532):81–84. doi: 10.1038/nature13851. [DOI] [PubMed] [Google Scholar]
- 15.Rios A.C., Fu N.Y., Lindeman G.J., Visvader J.E. In situ identification of bipotent stem cells in the mammary gland. Nature. 2014;506(7488):322–327. doi: 10.1038/nature12948. [DOI] [PubMed] [Google Scholar]
- 16.van Amerongen R., Bowman A.N., Nusse R. Developmental stage and time dictate the fate of Wnt/beta-catenin-responsive stem cells in the mammary gland. Cell Stem Cell. 2012;11(3):387–400. doi: 10.1016/j.stem.2012.05.023. [DOI] [PubMed] [Google Scholar]
- 17.Song W., Wang R., Jiang W., Yin Q., Peng G., Yang R., Yu Q.C., Chen J., Li J., Cheung T.H., Jing N., Zeng Y.A. Hormones induce the formation of luminal-derived basal cells in the mammary gland. Cell Res. 2019;29(3):206–220. doi: 10.1038/s41422-018-0137-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Davis F.M., Lloyd-Lewis B., Harris O.B., Kozar S., Winton D.J., Muresan L., Watson C.J. Single-cell lineage tracing in the mammary gland reveals stochastic clonal dispersion of stem/progenitor cell progeny. Nat. Commun. 2016;7:13053. doi: 10.1038/ncomms13053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Van Keymeulen A., Rocha A.S., Ousset M., Beck B., Bouvencourt G., Rock J., Sharma N., Dekoninck S., Blanpain C. Distinct stem cells contribute to mammary gland development and maintenance. Nature. 2011;479(7372):189–193. doi: 10.1038/nature10573. [DOI] [PubMed] [Google Scholar]
- 20.Wuidart A., Ousset M., Rulands S., Simons B.D., Van Keymeulen A., Blanpain C. Quantitative lineage tracing strategies to resolve multipotency in tissue-specific stem cells. Genes Dev. 2016;30(11):1261–1277. doi: 10.1101/gad.280057.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Elias S., Morgan M.A., Bikoff E.K., Robertson E.J. Long-lived unipotent Blimp1-positive luminal stem cells drive mammary gland organogenesis throughout adult life. Nat. Commun. 2017;8(1):1714. doi: 10.1038/s41467-017-01971-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Lilja A.M., Rodilla V., Huyghe M., Hannezo E., Landragin C., Renaud O., Leroy O., Rulands S., Simons B.D., Fre S. Clonal analysis of Notch1-expressing cells reveals the existence of unipotent stem cells that retain long-term plasticity in the embryonic mammary gland. Nat. Cell Biol. 2018;20(6):677–687. doi: 10.1038/s41556-018-0108-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wuidart A., Sifrim A., Fioramonti M., Matsumura S., Brisebarre A., Brown D., Centonze A., Dannau A., Dubois C., Van Keymeulen A., Voet T., Blanpain C. Early lineage segregation of multipotent embryonic mammary gland progenitors. Nat. Cell Biol. 2018;20(6):666–676. doi: 10.1038/s41556-018-0095-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Prater M.D., Petit V., Alasdair Russell I., Giraddi R.R., Shehata M., Menon S., Schulte R., Kalajzic I., Rath N., Olson M.F., Metzger D., Faraldo M.M., Deugnier M.A., Glukhova M.A., Stingl J. Mammary stem cells have myoepithelial cell properties. Nat. Cell Biol. 2014;16(10):1–7. doi: 10.1038/ncb3025. 942-50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Scheele C.L., Hannezo E., Muraro M.J., Zomer A., Langedijk N.S., van Oudenaarden A., Simons B.D., van Rheenen J. Identity and dynamics of mammary stem cells during branching morphogenesis. Nature. 2017;542(7641):313–317. doi: 10.1038/nature21046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Shlien A., Malkin D. Copy number variations and cancer. Genome Med. 2009;1(6):62. doi: 10.1186/gm62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Navin N., Kendall J., Troge J., Andrews P., Rodgers L., McIndoo J., Cook K., Stepansky A., Levy D., Esposito D., Muthuswamy L., Krasnitz A., McCombie W.R., Hicks J., Wigler M. Tumour evolution inferred by single-cell sequencing. Nature. 2011;472(7341):90–94. doi: 10.1038/nature09807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Xu X., Hou Y., Yin X., Bao L., Tang A., Song L., Li F., Tsang S., Wu K., Wu H., He W., Zeng L., Xing M., Wu R., Jiang H., Liu X., Cao D., Guo G., Hu X., Gui Y., Li Z., Xie W., Sun X., Shi M., Cai Z., Wang B., Zhong M., Li J., Lu Z., Gu N., Zhang X., Goodman L., Bolund L., Wang J., Yang H., Kristiansen K., Dean M., Li Y., Wang J. Single-cell exome sequencing reveals single-nucleotide mutation characteristics of a kidney tumor. Cell. 2012;148(5):886–895. doi: 10.1016/j.cell.2012.02.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wang Y., Waters J., Leung M.L., Unruh A., Roh W., Shi X., Chen K., Scheet P., Vattathil S., Liang H., Multani A., Zhang H., Zhao R., Michor F., Meric-Bernstam F., Navin N.E. Clonal evolution in breast cancer revealed by single nucleus genome sequencing. Nature. 2014;512(7513):155–160. doi: 10.1038/nature13600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Leung M., Wang Y., Waters J., Navin N.E. SNES: single nucleus exome sequencing. Genome Biol. 2015;16:55. doi: 10.1186/s13059-015-0616-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Navin N. The first five years of single-cell cancer genomics and beyond. Genome Res. 2015;25(10):1499–1507. doi: 10.1101/gr.191098.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zong C., Lu S., Chapman A.R., Xie X.S. Genome-wide detection of single-nucleotide and copy-number variations of a single human cell. Science. 2012;338(6114):1622–1626. doi: 10.1126/science.1229164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zahn H., Steif A., Laks E., Eirew P., VanInsberghe M., Shah S.P., Aparicio S., Hansen C.L. Scalable whole-genome single-cell library preparation without preamplification. Nat. Methods. 2017;14(2):167–173. doi: 10.1038/nmeth.4140. [DOI] [PubMed] [Google Scholar]
- 34.Laks E., McPherson A., Zahn H., Lai D., Steif A., Brimhall J., Biele J., Wang B., Masud T., Ting J., Grewal D., Nielsen C., Leung S., Bojilova V., Smith M., Golovko O., Poon S., Eirew P., Kabeer F., Ruiz de Algara T., Lee S.R., Taghiyar M.J., Huebner C., Ngo J., Chan T., Vatrt-Watts S., Walters P., Abrar N., Chan S., Wiens M., Martin L., Scott R.W., Underhill T.M., Chavez E., Steidl C., Da Costa D., Ma Y., Coope R.J.N., Corbett R., Pleasance S., Moore R., Mungall A.J., Mar C., Cafferty F., Gelmon K., Chia S., Team C.I.G.C., Marra M.A., Hansen C., Shah S.P., Aparicio S. Clonal decomposition and DNA replication states defined by scaled single-cell genome sequencing. Cell. 2019;179(5):1207–1221. doi: 10.1016/j.cell.2019.10.026. e22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gao R., Davis A., McDonald T.O., Sei E., Shi X., Wang Y., Tsai P.C., Casasent A., Waters J., Zhang H., Meric-Bernstam F., Michor F., Navin N.E. Punctuated copy number evolution and clonal stasis in triple-negative breast cancer. Nat. Genet. 2016;48(10):1119–1130. doi: 10.1038/ng.3641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Casasent A.K., Schalck A., Gao R., Sei E., Long A., Pangburn W., Casasent T., Meric-Bernstam F., Edgerton M.E., Navin N.E. Multiclonal invasion in breast tumors identified by topographic single cell sequencing. Cell. 2018;172(1–2):205–217. doi: 10.1016/j.cell.2017.12.007. e12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ornatsky O., Bandura D., Baranov V., Nitz M., Winnik M.A., Tanner S. Highly multiparametric analysis by mass cytometry. J. Immunol. Methods. 2010;361(1–2):1–20. doi: 10.1016/j.jim.2010.07.002. [DOI] [PubMed] [Google Scholar]
- 38.Bandura D.R., Baranov V.I., Ornatsky O.I., Antonov A., Kinach R., Lou X., Pavlov S., Vorobiev S., Dick J.E., Tanner S.D. Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry. Anal. Chem. 2009;81(16):6813–6822. doi: 10.1021/ac901049w. [DOI] [PubMed] [Google Scholar]
- 39.Bendall S.C., Simonds E.F., Qiu P., Amir el A.D., Krutzik P.O., Finck R., Bruggner R.V., Melamed R., Trejo A., Ornatsky O.I., Balderas R.S., Plevritis S.K., Sachs K., Pe’er D., Tanner S.D., Nolan G.P. Single-cell mass cytometry of differential immune and drug responses across a human hematopoietic continuum. Science. 2011;332(6030):687–696. doi: 10.1126/science.1198704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Tang F., Barbacioru C., Wang Y., Nordman E., Lee C., Xu N., Wang X., Bodeau J., Tuch B.B., Siddiqui A., Lao K., Surani M.A. mRNA-Seq whole-transcriptome analysis of a single cell. Nat. Methods. 2009;6(5):377–382. doi: 10.1038/nmeth.1315. [DOI] [PubMed] [Google Scholar]
- 41.See P., Lum J., Chen J., Ginhoux F., Single-Cell A. A single-cell sequencing guide for immunologists. Front. Immunol. 2018;9:2425. doi: 10.3389/fimmu.2018.02425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Valihrach L., Androvic P., Kubista M. Platforms for single-cell collection and analysis. Int. J. Mol. Sci. 2018;19(3):807. doi: 10.3390/ijms19030807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hwang B., Lee J.H., Bang D. Single-cell RNA sequencing technologies and bioinformatics pipelines. Exp. Mol. Med. 2018;50(8):1–14. doi: 10.1038/s12276-018-0071-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chen G., Ning B., Shi T. Single-cell RNA-seq technologies and related computational data analysis. Front. Genet. 2019;10:317. doi: 10.3389/fgene.2019.00317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ramskold D., Luo S., Wang Y.C., Li R., Deng Q., Faridani O.R., Daniels G.A., Khrebtukova I., Loring J.F., Laurent L.C., Schroth G.P., Sandberg R. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat. Biotechnol. 2012;30(8):777–782. doi: 10.1038/nbt.2282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Picelli S., Bjorklund A.K., Faridani O.R., Sagasser S., Winberg G., Sandberg R. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nat. Methods. 2013;10(11):1096–1098. doi: 10.1038/nmeth.2639. [DOI] [PubMed] [Google Scholar]
- 47.Picelli S., Faridani O.R., Bjorklund A.K., Winberg G., Sagasser S., Sandberg R. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protoc. 2014;9(1):171–181. doi: 10.1038/nprot.2014.006. [DOI] [PubMed] [Google Scholar]
- 48.Klein A.M., Mazutis L., Akartuna I., Tallapragada N., Veres A., Li V., Peshkin L., Weitz D.A., Kirschner M.W. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell. 2015;161(5):1187–1201. doi: 10.1016/j.cell.2015.04.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Macosko E.Z., Basu A., Satija R., Nemesh J., Shekhar K., Goldman M., Tirosh I., Bialas A.R., Kamitaki N., Martersteck E.M., Trombetta J.J., Weitz D.A., Sanes J.R., Shalek A.K., Regev A., McCarroll S.A. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161(5):1202–1214. doi: 10.1016/j.cell.2015.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zheng G.X., Terry J.M., Belgrader P., Ryvkin P., Bent Z.W., Wilson R., Ziraldo S.B., Wheeler T.D., McDermott G.P., Zhu J., Gregory M.T., Shuga J., Montesclaros L., Underwood J.G., Masquelier D.A., Nishimura S.Y., Schnall-Levin M., Wyatt P.W., Hindson C.M., Bharadwaj R., Wong A., Ness K.D., Beppu L.W., Deeg H.J., McFarland C., Loeb K.R., Valente W.J., Ericson N.G., Stevens E.A., Radich J.P., Mikkelsen T.S., Hindson B.J., Bielas J.H. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 2017;8:14049. doi: 10.1038/ncomms14049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Stoeckius M., Hafemeister C., Stephenson W., Houck-Loomis B., Chattopadhyay P.K., Swerdlow H., Satija R., Smibert P. Simultaneous epitope and transcriptome measurement in single cells. Nat. Methods. 2017;14(9):865–868. doi: 10.1038/nmeth.4380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Chen H., Lareau C., Andreani T., Vinyard M.E., Garcia S.P., Clement K., Andrade-Navarro M.A., Buenrostro J.D., Pinello L. Assessment of computational methods for the analysis of single-cell ATAC-seq data. Genome Biol. 2019;20(1):241. doi: 10.1186/s13059-019-1854-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Buenrostro J.D., Wu B., Litzenburger U.M., Ruff D., Gonzales M.L., Snyder M.P., Chang H.Y., Greenleaf W.J. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature. 2015;523(7561):486–490. doi: 10.1038/nature14590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Cusanovich D.A., Daza R., Adey A., Pliner H.A., Christiansen L., Gunderson K.L., Steemers F.J., Trapnell C., Shendure J. Multiplex single cell profiling of chromatin accessibility by combinatorial cellular indexing. Science. 2015;348(6237):910–914. doi: 10.1126/science.aab1601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Satpathy A.T., Granja J.M., Yost K.E., Qi Y., Meschi F., McDermott G.P., Olsen B.N., Mumbach M.R., Pierce S.E., Corces M.R., Shah P., Bell J.C., Jhutty D., Nemec C.M., Wang J., Wang L., Yin Y., Giresi P.G., Chang A.L.S., Zheng G.X.Y., Greenleaf W.J., Chang H.Y. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nat. Biotechnol. 2019;37(8):925–936. doi: 10.1038/s41587-019-0206-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Lee J., Hyeon D.Y., Hwang D. Single-cell multiomics: technologies and data analysis methods. Exp. Mol. Med. 2020;52(9):1428–1442. doi: 10.1038/s12276-020-0420-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Mallory X.F., Edrisi M., Navin N., Nakhleh L. Methods for copy number aberration detection from single-cell DNA-sequencing data. Genome Biol. 2020;21(1):208. doi: 10.1186/s13059-020-02119-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kimball A.K., Oko L.M., Bullock B.L., Nemenoff R.A., van Dyk L.F., Clambey E.T. A beginner’s guide to analyzing and visualizing mass cytometry data. J. Immunol. 2018;200(1):3–22. doi: 10.4049/jimmunol.1701494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Liu X., Song W., Wong B.Y., Zhang T., Yu S., Lin G.N., Ding X. A comparison framework and guideline of clustering methods for mass cytometry data. Genome Biol. 2019;20(1):297. doi: 10.1186/s13059-019-1917-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Huber W., Carey V.J., Gentleman R., Anders S., Carlson M., Carvalho B.S., Bravo H.C., Davis S., Gatto L., Girke T., Gottardo R., Hahne F., Hansen K.D., Irizarry R.A., Lawrence M., Love M.I., MacDonald J., Obenchain V., Oles A.K., Pages H., Reyes A., Shannon P., Smyth G.K., Tenenbaum D., Waldron L., Morgan M. Orchestrating high-throughput genomic analysis with bioconductor. Nat. Methods. 2015;12(2):115–121. doi: 10.1038/nmeth.3252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Luecken M.D., Theis F.J. Current best practices in single-cell RNA-seq analysis: a tutorial. Mol. Syst. Biol. 2019;15(6) doi: 10.15252/msb.20188746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Lun A.T., McCarthy D.J., Marioni J.C. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. F1000Res. 2016;5:2122. doi: 10.12688/f1000research.9501.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Maaten L.V.D., Hinton G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008;9:2579–2605. [Google Scholar]
- 64.L. McInnes, J. Healy, J. Melville, UMAP: Uniform manifold approximation and projection for dimension reduction, arxiv, 2018.
- 65.Haghverdi L., Buettner F., Theis F.J. Diffusion maps for high-dimensional single-cell analysis of differentiation data. Bioinformatics. 2015;31(18):2989–2998. doi: 10.1093/bioinformatics/btv325. [DOI] [PubMed] [Google Scholar]
- 66.Haghverdi L., Büttner M., Wolf Af, Buettner F., Theis F.J. Diffusion pseudotime robustly reconstructs lineage branching. Nat. Methods. 2016;13(10):845–848. doi: 10.1038/nmeth.3971. [DOI] [PubMed] [Google Scholar]
- 67.Lotfollahi M., Wolf F.A., Theis F.J. scGen predicts single-cell perturbation responses. Nat. Methods. 2019;16(8):715–721. doi: 10.1038/s41592-019-0494-8. [DOI] [PubMed] [Google Scholar]
- 68.van Dijk D., Sharma R., Nainys J., Yim K., Kathail P., Carr A.J., Burdziak C., Moon K.R., Chaffer C.L., Pattabiraman D., Bierie B., Mazutis L., Wolf G., Krishnaswamy S., Pe’er D. Recovering gene interactions from single-cell data using data diffusion. Cell. 2018;174(3):716–729. doi: 10.1016/j.cell.2018.05.061. e27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Schep A.N., Wu B., Buenrostro J.D., Greenleaf W.J. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods. 2017;14(10):975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Wang C., Sun D., Huang X., Wan C., Li Z., Han Y., Qin Q., Fan J., Qiu X., Xie Y., Meyer C.A., Brown M., Tang M., Long H., Liu T., Liu X.S. Integrative analyses of single-cell transcriptome and regulome using MAESTRO. Genome Biol. 2020;21(1):198. doi: 10.1186/s13059-020-02116-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Bach K., Pensa S., Grzelak M., Hadfield J., Adams D.J., Marioni J.C., Khaled W.T. Differentiation dynamics of mammary epithelial cells revealed by single-cell RNA sequencing. Nat. Commun. 2017;8(1):2128. doi: 10.1038/s41467-017-02001-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Pal B., Chen Y., Vaillant F., Jamieson P., Gordon L., Rios A.C., Wilcox S., Fu N., Liu K.H., Jackling F.C., Davis M.J., Lindeman G.J., Smyth G.K., Visvader J.E. Construction of developmental lineage relationships in the mouse mammary gland by single-cell RNA profiling. Nat. Commun. 2017;8(1):1627. doi: 10.1038/s41467-017-01560-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Giraddi R.R., Chung C.Y., Heinz R.E., Balcioglu O., Novotny M., Trejo C.L., Dravis C., Hagos B.M., Mehrabad E.M., Rodewald L.W., Hwang J.Y., Fan C., Lasken R., Varley K.E., Perou C.M., Wahl G.M., Spike B.T. Single-cell transcriptomes distinguish stem cell state changes and lineage specification programs in early mammary gland development. Cell Rep. 2018;24(6):1653–1666. doi: 10.1016/j.celrep.2018.07.025. e7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Dravis C., Chung C.Y., Lytle N.K., Herrera-Valdez J., Luna G., Trejo C.L., Reya T., Wahl G.M. Epigenetic and transcriptomic profiling of mammary gland development and tumor models disclose regulators of cell state plasticity. Cancer Cell. 2018;34(3):466–482. doi: 10.1016/j.ccell.2018.08.001. e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Chung C.Y., Ma Z., Dravis C., Preissl S., Poirion O., Luna G., Hou X., Giraddi R.R., Ren B., Wahl G.M. Single-cell chromatin analysis of mammary gland development reveals cell-state transcriptional regulators and lineage relationships. Cell Rep. 2019;29(2):495–510. doi: 10.1016/j.celrep.2019.08.089. e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.A.-J. Twigger, L.K. Engelbrecht, K. Bach, I. Schultz-Pernice, S. Petricca, C.H. Scheel, W. Khaled, Transcriptional changes in the mammary gland during lactation revealed by single cell sequencing of cells from human milk, bioRxiv, 2020. [DOI] [PMC free article] [PubMed]
- 77.Martin Carli J.F., Trahan G.D., Jones K.L., Hirsch N., Rolloff K.P., Dunn E.Z., Friedman J.E., Barbour L.A., Hernandez T.L., MacLean P.S., Monks J., McManaman J.L., Rudolph M.C. Single cell RNA sequencing of human milk-derived cells reveals sub-populations of mammary epithelial cells with molecular signatures of progenitor and mature states: a novel, non-invasive framework for investigating human lactation physiology. J. Mammary Gland Biol. Neoplasia. 2020;25:367–387. doi: 10.1007/s10911-020-09466-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Bach K., Pensa S., Zarocsinceva M., Kania K., Stockis J., Pinaud S., Lazarus K.A., Shehata M., Simoes B.M., Greenhalgh A.R., Howell S.J., Clarke R.B., Caldas C., Halim T.Y.F., Marioni J.C., Khaled W.T. Time-resolved single-cell analysis of Brca1 associated mammary tumourigenesis reveals aberrant differentiation of luminal progenitors!Abstract. Nat. Commun. 2021;12:1502. doi: 10.1038/s41467-021-21783-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Wang C., Christin J.R., Oktay M.H., Guo W. Lineage-biased stem cells maintain estrogen-receptor-positive and -negative mouse mammary luminal lineages. Cell Rep. 2017;18(12):2825–2835. doi: 10.1016/j.celrep.2017.02.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Van Keymeulen A., Fioramonti M., Centonze A., Bouvencourt G., Achouri Y., Blanpain C. Lineage-restricted mammary stem cells sustain the development, homeostasis, and regeneration of the estrogen receptor positive lineage. Cell Rep. 2017;20(7):1525–1532. doi: 10.1016/j.celrep.2017.07.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Rodilla V., Dasti A., Huyghe M., Lafkas D., Laurent C., Reyal F., Fre S. Luminal progenitors restrict their lineage potential during mammary gland development. PLoS Biol. 2015;13(2) doi: 10.1371/journal.pbio.1002069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Chang T.H., Kunasegaran K., Tarulli G.A., De Silva D., Voorhoeve P.M., Pietersen A.M. New insights into lineage restriction of mammary gland epithelium using parity-identified mammary epithelial cells. Breast Cancer Res. 2014;16(1):R1. doi: 10.1186/bcr3593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Pervolarakis N., Nguyen Q.H., Williams J., Gong Y., Gutierrez G., Sun P., Jhutty D., Zheng G.X.Y., Nemec C.M., Dai X., Watanabe K., Kessenbrock K. Integrated single-cell transcriptomics and chromatin accessibility analysis reveals regulators of mammary epithelial cell identity. Cell Rep. 2020;33(3) doi: 10.1016/j.celrep.2020.108273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Nguyen Q.H., Pervolarakis N., Blake K., Ma D., Davis R.T., James N., Phung A.T., Willey E., Kumar R., Jabart E., Driver I., Rock J., Goga A., Khan S.A., Lawson D.A., Werb Z., Kessenbrock K. Profiling human breast epithelial cells using single cell RNA sequencing identifies cell diversity. Nat. Commun. 2018;9(1):2028. doi: 10.1038/s41467-018-04334-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Knapp D., Kannan N., Pellacani D., Eaves C.J. Mass cytometric analysis reveals viable activated caspase-3(+) luminal progenitors in the normal adult human mammary gland. Cell Rep. 2017;21(4):1116–1126. doi: 10.1016/j.celrep.2017.09.096. [DOI] [PubMed] [Google Scholar]
- 86.L.K. Engelbrecht, A.-J. Twigger, H.M. Ganz, C.J. Gabka, A.R. Bausch, H. Lickert, M. Sterr, I. Kunze, W.T. Khaled, C.H. Scheel, A strategy to address dissociation-induced compositional and transcriptional bias for single-cell analysis of the human mammary gland, bioRxiv, 2021.
- 87.Sun H., Miao Z., Zhang X., Chan U.I., Su S.M., Guo S., Wong C.K.H., Xu X., Deng C.X. Single-cell RNA-Seq reveals cell heterogeneity and hierarchy within mouse mammary epithelia. J. Biol. Chem. 2018;293(22):8315–8329. doi: 10.1074/jbc.RA118.002297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Chen W., Morabito S.J., Kessenbrock K., Enver T., Meyer K.B., Teschendorff A.E. Single-cell landscape in mammary epithelium reveals bipotent-like cells associated with breast cancer risk and outcome. Commun. Biol. 2019;2:306. doi: 10.1038/s42003-019-0554-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Thong T., Wang Y., Brooks M.D., Lee C.T., Scott C., Balzano L., Wicha M.S., Colacino J.A. Hybrid stem cell states: insights into the relationship between mammary development and breast cancer using single-cell transcriptomics. Front. Cell Dev. Biol. 2020;8:288. doi: 10.3389/fcell.2020.00288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Anstine L.J., Keri R. A new view of the mammary epithelial hierarchy and its implications for breast cancer initiation and metastasis. J. Cancer Metastasis Treat. 2019;2019 doi: 10.20517/2394-4722.2019.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Cristea S., Polyak K. Dissecting the mammary gland one cell at a time. Nat. Commun. 2018;9(1):2473. doi: 10.1038/s41467-018-04905-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Regan J.L., Smalley M.J. Integrating single-cell RNA-sequencing and functional assays to decipher mammary cell states and lineage hierarchies. NPJ Breast Cancer. 2020;6(1):32. doi: 10.1038/s41523-020-00175-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Li C.M., Shapiro H., Tsiobikas C., Selfors L.M., Chen H., Rosenbluth J., Moore K., Gupta K.P., Gray G.K., Oren Y., Steinbaugh M.J., Guerriero J.L., Pinello L., Regev A., Brugge J.S. Aging-associated alterations in mammary epithelia and stroma revealed by single-cell RNA sequencing. Cell Rep. 2020;33(13) doi: 10.1016/j.celrep.2020.108566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Liu X., Ory V., Chapman S., Yuan H., Albanese C., Kallakury B., Timofeeva O.A., Nealon C., Dakic A., Simic V., Haddad B.R., Rhim J.S., Dritschilo A., Riegel A., McBride A., Schlegel R. ROCK inhibitor and feeder cells induce the conditional reprogramming of epithelial cells. Am. J. Pathol. 2012;180(2):599–607. doi: 10.1016/j.ajpath.2011.10.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Dontu G., Ince T.A. Of mice and women: a comparative tissue biology perspective of breast stem cells and differentiation. J. Mammary Gland Biol. Neoplasia. 2015;20(1–2):51–62. doi: 10.1007/s10911-015-9341-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Watson C.J., Khaled W.T. Mammary development in the embryo and adult: new insights into the journey of morphogenesis and commitment. Development. 2020;147(22) doi: 10.1242/dev.169862. [DOI] [PubMed] [Google Scholar]
- 97.Spanjaard B., Hu B., Mitic N., Olivares-Chauvet P., Janjuha S., Ninov N., Junker J.P. Simultaneous lineage tracing and cell-type identification using CRISPR-Cas9-induced genetic scars. Nat. Biotechnol. 2018;36(5):469–473. doi: 10.1038/nbt.4124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Raj B., Gagnon J.A., Schier A.F. Large-scale reconstruction of cell lineages using single-cell readout of transcriptomes and CRISPR-Cas9 barcodes by scGESTALT. Nat. Protoc. 2018;13(11):2685–2713. doi: 10.1038/s41596-018-0058-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.O’Flanagan C.H., Campbell K.R., Zhang A.W., Kabeer F., Lim J.L.P., Biele J., Eirew P., Lai D., McPherson A., Kong E., Bates C., Borkowski K., Wiens M., Hewitson B., Hopkins J., Pham J., Ceglia N., Moore R., Mungall A.J., McAlpine J.N., Team C.I.G.C., Shah S.P., Aparicio S. Dissociation of solid tumor tissues with cold active protease for single-cell RNA-seq minimizes conserved collagenase-associated stress responses. Genome Biol. 2019;20(1):210. doi: 10.1186/s13059-019-1830-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Kanaya N., Chang G., Wu X., Saeki K., Bernal L., Shim H.J., Wang J., Warden C., Yamamoto T., Li J., Park J.S., Synold T., Vonderfecht S., Rakoff M., Neuhausen S.L., Chen S. Single-cell RNA-sequencing analysis of estrogen- and endocrine-disrupting chemical-induced reorganization of mouse mammary gland. Commun. Biol. 2019;2:406. doi: 10.1038/s42003-019-0618-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Han X., Wang R., Zhou Y., Fei L., Sun H., Lai S., Saadatpour A., Zhou Z., Chen H., Ye F., Huang D., Xu Y., Huang W., Jiang M., Jiang X., Mao J., Chen Y., Lu C., Xie J., Fang Q., Wang Y., Yue R., Li T., Huang H., Orkin S.H., Yuan G.C., Chen M., Guo G. Mapping the mouse cell atlas by microwell-Seq. Cell. 2018;172(5):1091–1107. doi: 10.1016/j.cell.2018.02.001. e17. [DOI] [PubMed] [Google Scholar]
- 102.Tabula Muris C. A single-cell transcriptomic atlas characterizes ageing tissues in the mouse. Nature. 2020;583(7817):590–595. doi: 10.1038/s41586-020-2496-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.L.M. Murrow, R.J. Weber, J. Caruso, C.S. McGinnis, K. Phong, P. Gascard, A.D. Borowsky, T.A. Desai, M. Thomson, T. Tlsty, Z.J. Gartner, Changes in epithelial proportions and transcriptional state underlie major premenopausal breast cancer risks, bioRxiv, 2020.
- 104.Pelissier Vatter F.A., Schapiro D., Chang H., Borowsky A.D., Lee J.K., Parvin B., Stampfer M.R., LaBarge M.A., Bodenmiller B., Lorens J.B. High-dimensional phenotyping identifies age-emergent cells in human mammary epithelia. Cell Rep. 2018;23(4):1205–1219. doi: 10.1016/j.celrep.2018.03.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Lo P.K., Yao Y., Zhou Q. Single-cell RNA-seq reveals obesity-induced alterations in the Brca1-mutated mammary gland microenvironment. Cancers. 2020;12(8):2235. doi: 10.3390/cancers12082235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Chung W., Eum H.H., Lee H.O., Lee K.M., Lee H.B., Kim K.T., Ryu H.S., Kim S., Lee J.E., Park Y.H., Kan Z., Han W., Park W.Y. Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nat. Commun. 2017;8:15081. doi: 10.1038/ncomms15081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Wang Y., Chaffee T.S., LaRue R.S., Huggins D.N., Witschen P.M., Ibrahim A.M., Nelson A.C., Machado H.L., Schwertfeger K.L. Tissue-resident macrophages promote extracellular matrix homeostasis in the mammary gland stroma of nulliparous mice. Elife. 2020;9 doi: 10.7554/eLife.57438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Azizi E., Carr A.J., Plitas G., Cornish A.E., Konopacki C., Prabhakaran S., Nainys J., Wu K., Kiseliovas V., Setty M., Choi K., Fromme R.M., Dao P., McKenney P.T., Wasti R.C., Kadaveru K., Mazutis L., Rudensky A.Y., Pe’er D. Single-cell map of diverse immune phenotypes in the breast tumor microenvironment. Cell. 2018;174(5):1293–1308. doi: 10.1016/j.cell.2018.05.060. e36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Bartoschek M., Oskolkov N., Bocci M., Lovrot J., Larsson C., Sommarin M., Madsen C.D., Lindgren D., Pekar G., Karlsson G., Ringner M., Bergh J., Bjorklund A., Pietras K. Spatially and functionally distinct subclasses of breast cancer-associated fibroblasts revealed by single cell RNA sequencing. Nat. Commun. 2018;9(1):5150. doi: 10.1038/s41467-018-07582-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Sebastian A., Hum N.R., Martin K.A., Gilmore S.F., Peran I., Byers S.W., Wheeler E.K., Coleman M.A., Loots G.G. Single-cell transcriptomic analysis of tumor-derived fibroblasts and normal tissue-resident fibroblasts reveals fibroblast heterogeneity in breast cancer. Cancers. 2020;12(5):1307. doi: 10.3390/cancers12051307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Wu S.Z., Roden D.L., Wang C., Holliday H., Harvey K., Cazet A.S., Murphy K.J., Pereira B., Al-Eryani G., Bartonicek N., Hou R., Torpy J.R., Junankar S., Chan C.L., Lam C.E., Hui M.N., Gluch L., Beith J., Parker A., Robbins E., Segara D., Mak C., Cooper C., Warrier S., Forrest A., Powell J., O’Toole S., Cox T.R., Timpson P., Lim E., Liu X.S., Swarbrick A. Stromal cell diversity associated with immune evasion in human triple-negative breast cancer. EMBO J. 2020;39 doi: 10.15252/embj.2019104063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Wagner J., Rapsomaniki M.A., Chevrier S., Anzeneder T., Langwieder C., Dykgers A., Rees M., Ramaswamy A., Muenst S., Soysal S.D., Jacobs A., Windhager J., Silina K., van den Broek M., Dedes K.J., Rodriguez Martinez M., Weber W.P., Bodenmiller B. A single-cell atlas of the tumor and immune ecosystem of human breast cancer. Cell. 2019;177(5):1330–1345. doi: 10.1016/j.cell.2019.03.005. e18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.M. Tognetti, A. Gabor, M. Yang, V. Cappelletti, J. Windhager, K. Charmpi, N. de Souza, A. Beyer, P. Picotti, J. Saez-Rodriguez, B. Bodenmiller, Deciphering the signaling network landscape of breast cancer improves drug sensitvity prediction, bioRxiv, 2020.
- 114.Gao R., Kim C., Sei E., Foukakis T., Crosetto N., Chan L.K., Srinivasan M., Zhang H., Meric-Bernstam F., Navin N. Nanogrid single-nucleus RNA sequencing reveals phenotypic diversity in breast cancer. Nat. Commun. 2017;8(1):228. doi: 10.1038/s41467-017-00244-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Karaayvaz M., Cristea S., Gillespie S.M., Patel A.P., Mylvaganam R., Luo C.C., Specht M.C., Bernstein B.E., Michor F., Ellisen L.W. Unravelling subclonal heterogeneity and aggressive disease states in TNBC through single-cell RNA-seq. Nat. Commun. 2018;9(1):3588. doi: 10.1038/s41467-018-06052-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.F. Salmén, S. Vickovic, L. Larsson, L. Stenbeck, J. Vallon-Christersson, A. Ehinger, J. Häkkinen, Å. Borg, J. Frisén, P.L. Ståhl, J. Lundeberg, Multidimensional transcriptomics provides detailed information about immune cell distribution and identity in HER2+ breast tumors, bioRxiv, 2018.
- 117.A. Andersson, L. Larsson, L. Stenbeck, F. Salmén, A. Ehinger, S. Wu, G. Al-Eryani, D. Roden, A. Swarbrick, Å. Borg, J. Frisén, C. Engblom, J. Lundeberg, Spatial deconvolution of HER2-positive breast tumors reveals novel intercellular relationships, bioRxiv, 2020. [DOI] [PMC free article] [PubMed]
- 118.Baslan T., Kendall J., Volyanskyy K., McNamara K., Cox H., D'Italia S., Ambrosio F., Riggs M., Rodgers L., Leotta A., Song J., Mao Y., Wu J., Shah R., Gularte-Merida R., Chadalavada K., Nanjangud G., Varadan V., Gordon A., Curtis C., Krasnitz A., Dimitrova N., Harris L., Wigler M., Hicks J. Novel insights into breast cancer copy number genetic heterogeneity revealed by single-cell genome sequencing. Elife. 2020;9 doi: 10.7554/eLife.51480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Grosselin K., Durand A., Marsolier J., Poitou A., Marangoni E., Nemati F., Dahmani A., Lameiras S., Reyal F., Frenoy O., Pousse Y., Reichen M., Woolfe A., Brenan C., Griffiths A.D., Vallot C., Gerard A. High-throughput single-cell ChIP-seq identifies heterogeneity of chromatin states in breast cancer. Nat. Genet. 2019;51(6):1060–1066. doi: 10.1038/s41588-019-0424-9. [DOI] [PubMed] [Google Scholar]
- 120.Gkountela S., Castro-Giner F., Szczerba B.M., Vetter M., Landin J., Scherrer R., Krol I., Scheidmann M.C., Beisel C., Stirnimann C.U., Kurzeder C., Heinzelmann-Schwarz V., Rochlitz C., Weber W.P., Aceto N. Circulating tumor cell clustering shapes DNA methylation to enable metastasis seeding. Cell. 2019;176(1–2):98–112. doi: 10.1016/j.cell.2018.11.046. e14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Rodriques S.G., Stickels R.R., Goeva A., Martin C.A., Murray E., Vanderburg C.R., Welch J., Chen L.M., Chen F., Macosko E.Z. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science. 2019;363(6434):1463–1467. doi: 10.1126/science.aaw1219. [DOI] [PMC free article] [PubMed] [Google Scholar]