Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Sep 10;15(9):e0238594. doi: 10.1371/journal.pone.0238594

High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue

Warren Porter 1,#, Eileen Snowden 1,#, Friedrich Hahn 1, Mitchell Ferguson 1, Frances Tong 1, W Shannon Dillmore 1, Rainer Blaesius 1,*
Editor: Randall J Kimple2
PMCID: PMC7482927  PMID: 32911489

Abstract

Intratumor Heterogeneity (ITH) is a functionally important property of tumor tissue and may be involved in drug resistance mechanisms. Although descriptions of ITH can be traced back to very early reports about cancer tissue, mechanistic investigations are still limited by the precision of analysis methods and access to relevant tissue sources. PDX models have provided a reproducible source of tissue with at least a partial representation of naturally occurring ITH. We investigated the properties of phenotypically distinct cell populations by Fluorescence activated cell sorting (FACS) tissue derived cells from multiple tumors from a triple negative breast cancer patient derived xenograft (PDX) model. We subsequently subjected each population to in depth gene expression analysis. Our findings suggest that process related gene expression changes (caused by tissue dissociation and FACS sorting) are restricted to Immediate Early Genes (IEGs). This allowed us to discover highly reproducible gene expression profiles of distinct cellular compartments identifiable by cell surface markers in this particular tumor model. Within the context of data from a previously published model our work suggests that gene expression profiles associated with hypoxia, stemness and drug resistance may reside in tumor subpopulations predictably growing in PDX models. This approach provides a novel opportunity for prospective mechanistic studies of ITH.

Introduction

Phenotypic differences between patients with the same tumor type (intertumor heterogeneity) and among cancer cells within the same patient (intratumor heterogeneity/ITH) have always been integral to descriptions of tumor appearance [1], yet they were neglected as a source of useful information for a long time. While intertumor heterogeneity has been integrated into clinical practice, particularly in breast cancer [2], ITH remains to be included as a standard part of cancer tissue analysis. The technical difficulty to assess ITH has hindered efficient ways to build on observations of inherent complexity described in the 1970s and 80s and arguably a general resistance to this concept [3] prevented an unbiased appreciation. The past 10–15 years have seen a “re-discovery” of ITH as a mainstream topic. Models of “tumor as an ecosystem” [4] or “tumor as an organ” [5] explain ITH not as incidental phenomenon but as a foundational property. Focusing on transcriptional heterogeneity in particular has become an effective way to link ITH to clinically meaningful phenomena [6]. Maximizing and understanding the accuracy of this approach will be an important factor in its efficient translation into the clinic.

scRNASeq has arguably been the main tool to accelerate discovery of transcriptional heterogeneity in tumors and healthy tissues alike. The results have prompted models which transcend the limits set by traditional cell type definitions and replace the artificially fixed categories with continuous gene expression spaces of individual cells [7, 8]. This is likely a much more accurate depiction of cell biological reality since each organism develops from a single cell in a continuous process rather than discrete “jumps”. The drawback, however, lies mainly in a technical limitation: scRNASeq shows a ‘dropout’ effect due to inefficient mRNA capture [9] which makes it very difficult to create a high resolution model for a given sample. For elucidation of organisms (especially with high genetic homogeneity such as in inbred strains) this can be counteracted by sequencing large numbers of cells; maybe the most impressive example is represented in the “Tabula Muris”, a high resolution analysis of 20 murine organs generated from over 100,000 cells [10]. This publication highlighted the enormous resources required but also showed that scRNASeq can be powerfully enhanced by combining it with cell sorting. Our investigations cause us to argue that the two methods are complementary rather than competing and demonstrate that this approach enables uncovering of highly reproducible gene expression patterns with striking functional implications.

Patient derived xenograft models allow propagation of human tumor tissue in immune compromised mice in ways that seem much superior to tissue culture conditions, particularly with regard to maintaining ITH [11]. Previously we had established methods in our lab that clearly demonstrate the presence of distinct subpopulations within breast cancer tissue emerging reproducibly in separate host animals implanted with tissue from the identical primary tumor [12]. Here we addressed the possible influence of fluorescence activated cell sorting (FACS) on transcriptional profiling and investigated the presence of multiple different subpopulations within the same breast cancer model tissue. As in the previous work we chose surface markers identifying cell compartments with sufficient numbers for molecular analysis of the subpopulations, for this model CD49f and CD133. Those two markers as well as CD184 which we had employed for the previous tumor model have been shown in numerous studies to allow identification of cancer cells with distinct roles in tumorigenesis and have frequently been used in the search of cancer stem cells [1214]. Using quadruplicate samples we were able to demonstrate the potential this approach provides for detailed studies of specific cellular compartments within a complex tumor tissue. In addition, our data indicates that RNASeq of sorted PDX tumor cell populations complements scRNASeq through a) a much higher accuracy by avoiding the drop-out effects of single cell sequencing and b) providing a concept for replenishable sources of cell subpopulations with predictable gene expression signatures.

Methods

All experimental procedures were essentially as previously described [12], any differences are noted below.

PDX tumor extraction

The protocol was approved by the BD Institutional IACUC committee. BD is licensed by the USDA and is accredited by AALAC. All methods were carried out in accordance to best practices outlined in the Guide for the care and use of Laboratory Animals [15].

Mice were sourced from The Jackson Laboratory. The properties of the mouse model used in the study was a triple negative breast cancer model (BRC13, Jackson Lab designation TM00999) previously characterized for surface marker expression in our lab [12]. Female NOD/SCID/ILIIrg-/- (NSG) mice age 6 to 7 weeks were purchased from The Jackson Laboratory with tumor material already implanted. All animal studies were performed under an Institutional Animal Care and Use Committee–approved protocol. A dedicated room to safely house immune compromised mice was used. Positive air flow and filtration was maintained though a BioBubble system. Mice were socially housed in groups of five within a micro isolation cage rack. Food and water were available ad lib. Each cage contained environmental enrichments, including nestlets and igloos. Tumors were assessed weekly for growth and measured with a digital caliper. Tumor volume was estimated by the formula 0.5 x (length x width2). Animals were monitored daily with additional measurements taken if needed. Animals were euthanized if grafted tumor tissue showed signs of necrosis. Additional health signs were monitored with the following termination points: “rapid” weight loss of >10%, specific clinical signs of severe organ involvement (e.g. circling, head-tilt, respiratory rates, paralysis). Also, Body Condition Score (BCS) was monitored, with a score of 2 or below serving as a humane endpoint. Once tumor size reached >500 mm3 (several weeks to several months after inoculation), they were harvested for analysis. Mice were humanely euthanized via exsanguination under anesthesia at the time of tumor extraction, and the tumor was immediately removed via blunt dissection and placed in cold PBS for dissociation and analysis.

Preparation of single-cell suspensions from tumor tissue

Single cell suspensions were prepared as previously described. Briefly, excised tissue was weighed and then minced using a scalpel or scissors to a size not exceeding 1 to 2 mm3. Tissue was then enzymatically dissociated into single cells using Horizon™ Dri Tumor & Tissue Dissociation Reagent (BD Biosciences, San Jose, CA) in a 37°C water bath for 30 minutes with frequent agitation. The enzymatic reaction was stopped by rinsing in PBS (Cellgro) and 1% BSA (Sigma-Aldrich). Dissociated tissue was filtered through a 70 mm sieve and treated with ACK Buffer (Life Technologies) to remove contaminating red blood cells. Cells were resuspended in PBS, without magnesium and calcium.

Cell staining for flow cytometry and analysis

Before staining, single-cell suspensions were blocked for 30 minutes, on ice in FcR block (Miltenyi Biotec,). For surface marker characterization, cells were diluted in PBS and distributed into wells of a 96-well plate containing viability dye (LIVE/ DEAD Fixable Near-IR or Aqua, Invitrogen) and Hoechst 33342 (Invitrogen; at 0.2 mg/mL) for viable nucleated cell identification. The basic sorting strategy was as described previously [12]. Monoclonal antibodies for CD49f (clone G0H3) were purchased from BD Biosciences, for CD133 from Miltenyi (Bergisch Gladbach, Germany). A complete list of reagents is provided in S1 Table. Cells were incubated with antibody solutions for 30 minutes at 4°C, in the dark, and then rinsed twice in PBS before acquisition on BD LSRII flow cytometer. For sorting, cells were stained using a four-color panel combining amine viability reagent, anti-mouse cocktail, and markers identified from the primary characterization plate. After staining, cells were rinsed, and resuspended in BD FACS Pre-Sort Buffer, filtered again through a 70 μ m sieve, and populations of interest were sorted on the BD FACSAria II (BD Biosciences) at 20 p.s.i. using a 100- μm nozzle. Cell doublets or clumps were removed with electronic doublet discrimination gating. Post-sorting analysis of sorted subpopulations regularly showed purity >95%. Analysis of results was performed using FACSDiva software (version 6.1.3 LSRll and version 8.0 FACSAria II). Graphical representations of the data were performed using R or MS Excel.

RNA isolation, RNASeq and sequence data analysis

20–50,000 cells stained for a specific surface marker were collected during FACS sorting and processed in bulk for RNA extraction. Quality assessments were determined using Agilent BioAnalyzer and NanoDrop readings. RNA specimens underwent QC analysis to determine their suitability for target labeling. QC analyses included determination of concentration and volume, A260/A280 ratio, and 28S/18S rRNA ratio (where applicable), and an RNA integrity summary score (Agilent RIN or Caliper RQS). RNASeq was performed using the Illumina RNASeq TruSeq Stranded mRNA SQ905 Kit. All samples were sequenced in one run across four lanes on an Illumina HiSeq with 2 x 50 paired end reads. The primary sequencing analysis was done through a 3rd party (EA Genomics, Morrisville, NC) using STAR v2.4 for sequence alignment and RSEM v1.2.14 for transcription quantification.

RSEM counts from the sequencing results were used for the secondary analysis performed in R. Non-human genes were filtered out of the data set. Differential expression was calculated using the edgeR v3.14.0 package:

1.) Groups were created for each of the sorted populations ("Tissue", "Unsorted_0hr", "Unsorted_3hr", Human_Only", "CD49f_Low", CD49f_High", CD133_Low", CD133_High"). 2.) Normalization factors were calculated with calcNormFactors using the default settings, to convert observed library sizes to effective library sizes. 3.Counts per million (CPM) and log CPM were calculated from the normalized data. 4. glmQLFit was used to model the fit of the normalized data. 5. glmQLFTest was used to model pairwise comparisons between the groups. 6. topTags was used to extract the most differentially expressed genes from each comparison object created from step 5 (glmQLFTest). 7. The resulting differential expression data was used for comparison to 3rd party gene lists.

Statistical significance was calculated and displayed in all figures using false discovery rate (FDR).

To assess the biological interpretation of gene expression patterns all DEGs with FDR<0.1 and x-fold ≥2 were analyzed using the Enrichr tool [16, 17].

Results

Influence of tissue dissociation and FACS sorting on gene expression profiles

Our study was based on pairwise comparisons of gene expression profiles for (8) different cell populations from a triple negative breast cancer PDX model. The first (4) samples were used to validate our sample preparation and gene expression analysis processes, while subsequent samples were FACS sorted using (2) surface markers to compare biological similarities and differences in gene expression (Figs 1 and S1). Variables investigated in the sample preparation included tissue dissociation, duration of the method and flow sorting as possible sources of bias. RNA isolated from freshly collected tissue using a standard method served as a reference.

Fig 1. Schematic of experimental setup.

Fig 1

Tumor tissue was extracted and gene expression profiles from 8 different samples per tumor were generated from 4 mice (i.e. total of 8x4 = 32 RNASeq profiles) of the same breast cancer PDX model, BRC13. The left part addressed the sample handling aspects of the procedure while the samples marked by the grey box were generated to investigate ITH present in this model. See text for details.

To control for the technical variability as well as biological variability between individual host mice, we carried out the experiment in quadruplicate, (i.e. each of the 8 conditions shown in Fig 1 represents 4 separate tumors from the same model grown in individual host animals harvested on different days) resulting in a total of 32 samples. Three samples did not pass QC due to limited amounts of RNA and/or poor RNA integrity.

Out of the 3 major variables (tissue dissociation, protocol duration and flow sorting) tissue dissociation was the only sample processing variable introducing significant changes in expression patterns (Fig 2). Among 24 differentially expressed genes (DEGs) after dissociation, 23 were upregulated above a cutoff of FDR<0.05/ 4-fold and only a single one was downregulated, suggesting de novo RNA synthesis rather than RNA degradation as the main effect triggered by this sample preparation step.

Fig 2. Pairwise comparisons between gene expression profiles generated through bulk RNASeq for 4 different conditions.

Fig 2

0h time point represents cells immediately after dissociation; 3h time point represents cells left undisturbed for 3 hours after dissociation while sorted cells underwent flow sorting which generally took up to 3 hours. Volcano plots show the log2 fold-change (x-axis) and the statistical significance as–log(FDR) (y-axis). Lines indicate the cutoff values chosen (4-fold and FDR<0.05). FDR values were calculated across the entire data set.

Investigation of the characteristics of the significantly changed genes implicated a specific mechanism causing these observations. A database query of the significantly changed genes (Table 1) revealed that a great majority (20/24) have been described as immediate early genes (IEG). Of the 4 genes not identified in the IEG literature, 2 (AL832891 and AK024092) have 99% homology to other IEGs (Cyr61 and CHD2) and the remaining 2 are non-coding RNAs. Similar trends were seen with gene sets analyzed with lower stringency cutoff values (S2 Table).

Table 1. List of genes identified as DE before and immediately after tissue dissociation (FDR<0.05, x-fold>4).

Gene FDR x-fold change Reference identifying gene as IEG
1 FOS 2.3E-15 263.0 [18]
2 EGR1 2.2E-13 83.5 [18]
3 FOSB 1.2E-12 437.0 [18]
4 DUSP1 2.0E-12 33.5 [18]
5 IER2 3.5E-12 16.3 [19]
6 JUN 1.5E-10 14.7 [18]
7 JUNB 1.0E-09 11.1 [18]
8 ZFP36 1.6E-09 20.6 [18]
9 NR4A2 2.2E-09 23.7 [18]
10 RHOB 2.2E-09 14.7 [20]
11 KLF2 1.2E-08 10.6 [21]
12 GADD45B 1.1E-07 6.2 [18]
13 EGR2 1.1E-07 36.7 [22]
14 NR4A1 1.8E-07 25.0 [18]
15 ATF3 1.2E-06 17.1 [18]
16 SOCS3 8.3E-06 6.5 [23]
17 FAM71A 4.9E-05 100.9 [24]
18 AL832891 4.6E-03 5.0 *
19 LOC284454 6.8E-03 5.9 ncRNA
20 BTG2 1.9E-02 9.1 [25]
21 CTD-2311B13.7:ENSG00000257931 2.2E-02 -640.1 ncRNA
22 CXCL2 2.4E-02 4.9 [26]
23 AK024092 2.9E-02 4.8 *
24 DUSP2 4.3E-02 4.4 [27]

* no reference found

Gene expression profiles of cell subpopulations identified by CD49f and CD133

The second half of the experiments addressed ITH and whether subpopulations identified by surface markers differed in their gene expression profiles. Previously, we identified surface marker ITH in nine triple negative breast cancer models [12]. Each model had a unique and reproducible marker profile. Indication of ITH (i.e. presence of both positive and negative cells for a given marker) existed with respect to many of the markers but also differed among the models. Investigation of CD184hi and CD184lo sorted cells in one model revealed major differences in gene expression profiles between marker-high and low populations [12]. In the current study we focused on another PDX triple negative breast cancer model exhibiting varying expression levels of the surface markers CD49f and CD133. Both markers have been described as indicators of functionally distinct roles in cancer tissue such as drug resistance, metastasis, EMT and stemness [2830]. For our investigation their choice was also opportunistic since they allowed isolation of sufficient cell numbers for both marker positive and negative phenotypes. The previous model as well as the one investigated here contained 15% of mouse cells which were excluded from the subsequent gene expression analysis [12].

We FACS sorted the dissociated tumor cell population into CD49fhi and CD49flo as well as CD133hi and CD133lo, and performed (whole transcriptome) RNASeq on each pool. Both sorting approaches identified a number of DEGs, even at very stringent statistical significance levels (Fig 3 and Table 2), suggesting significant differences between the respective subpopulations. The finding that gene expression correlated with protein expression (i.e. ITGA6 for CD49f and PROM1 for CD133 were significantly upregulated in the marker positive population) suggests gene variance is due to biological changes and not due technical noise. Moreover, the statistical significance for both genes was highest among upregulated genes in the marker positive population in both experiments (S2 Fig).

Fig 3. Volcano plots of DEGs identified between tumor subpopulations sorted with either CD49f (left) or CD133 (right).

Fig 3

Cutoff values shown (black lines, for illustration only) are >4-fold and FDR <0.05.

Table 2. Number of differentially expressed genes at different cutoff levels.

Cutoff CD49fhi CD49flo CD133hi CD133lo
0.1 668 620 432 274
0.05 423 369 233 161
0.01 150 129 57 33

Among all genes identified in both experiments the majority were unique to one arm but among the ones in common (ca. 1/3 of the CD133 gene set overlapped with 1/8 of the CD49f set) an unexpected inverse correlation was detected (Fig 4). 120 of the 121 genes in common were inversely correlated at a qualitative level (i.e. identical direction of change) and at a quantitative level the correlation coefficient was R = - 0.85, strongly suggesting that this was not caused by random noise.

Fig 4.

Fig 4

Flow cytometric analysis of sorted populations using CD49f and CD133 (a) and correlations between DEGs identified by using either CD49f or CD133 as sorting criteria (b and c, FDR<0.05).

A priori an inverse correlation could be a sign of a simple negative correlation between the two markers. However both the flow analysis (Fig 4A) as well as the finding that more DE genes are unique to one arm of the experiment than common in both arms argue for a more complex relationship between the populations.

To investigate the possible roles for the respective subpopulations we determined GO terms for each of the 6 upregulated gene sets (S5S10 Tables). Possibly the best defined functional indicator could be determined for the BRC13/CD133hi population which–very much like the BRC12/CD184hi population contained many GO terms associated with cell proliferation (S6 and S10 Tables). The BRC13/CD49lo population shows a variety of signs consistent with organization of the microenvironment, such as ECM organization, angiogenesis and regulation of cell migration which all appear as cellular processes indicated for the BRC12/CD184lo subpopulation (S7 and S9 Tables). The genes upregulated in BRC13/CD49fhi and BRC13/CD133lo populations did not provide GO terms which would indicate a clear consistent function but–consistent with the inverse correlation described above- 8 out of the top 50 GO terms in those two lists are identical (S5 and S8 Tables). While neither BRC13 DEG list provided evidence for a strong and unambiguous functional interpretation of the gene profiles, the CD133hi and CD49flo lists mirror a number of gene sets identified in breast cancer as well as other solid cancers, indicating the possibility of functional relevance of the subpopulations described. Particularly, the BRC12/CD184lo and BRC13/CD49flo populations contained many overexpressed genes in common with profiles described under conditions of hypoxia (Fig 5A, addt’l examples in supplementary data). In contrast, the BRC12/CD184hi and BRC13/CD133hi populations contained equally pronounced similarities to profiles identified in connection with high proliferation in healthy breast tissue, regulation by the E2F transcription factor family, DNA damage response and stem cell properties (Fig 5B and supplementary data). Such parallels to previously documented results could also be identified within functional gene families; especially within the keratin family of genes 8 out of 9 members DE with FDR<0.1 in both BRC12/CD184lo and BRC13/CD49flo cells tracked in almost identical patterns (S3 Table).

Fig 5. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations.

Fig 5

A hypoxia gene signature composed of 3 published signatures [3032] was found in BRC12 CD184lo and BRC13 CD49flo subpopulations (a) while a proliferative gene signature identified in histologically normal breast tissue from cancer patients [41] was present in BRC12 CD184hi and BRC13 CD133hi subpopulations (b). Genes with lowest FDR and highest x-fold values are labeled in volcano plots and marked with arrows in heatmaps, respectively.

Discussion

Tissue transcriptional profiling is dependent on the accurate reflection of cells in their physiological environment. In other words, distortion caused by sample handling or signal amplification are a frequent hurdle to identifying genes that are consistently expressed in a differential state that reflects physiology [31]. We established a workflow that accurately identifies distinct subpopulations in tissue from breast cancer PDX models [12]. In this study, we set out to determine if sample handling or flow cytometry introduced bias in transcriptional profiling and show different subpopulation profiles indicating complex ITH in tumor tissue from a breast cancer PDX model. The surface markers were chosen based on their expression levels within this model rather than for known biological roles.

To test accuracy we compared expression profiles assessed by whole genome RNASeq from dissociated tissue cells before and after FACS sorting with the “gold standard” profile of bulk fresh tissue. Using quadruplicate samples (i.e. 4 separate tumors from the same tumor model) allowed us to statistically assess random fluctuations in individual samples from consistent gene expression differences caused by the steps investigated. We found no evidence of compromised mRNA integrity in any of the steps including flow sorting. Our results indicate that tissue dissociation caused significant changes in a limited set of genes. However, outside of this specific group (IEGs representing less than 0.5% of the human genome) no significant expression differences were detected when compared to fresh tissue. In contrast, transcriptional profiling of sorted subpopulations derived from this tissue identified distinct gene clusters differentially expressed in cells identified by specific surface markers. Furthermore, our data provides evidence that those differences can be detected with extremely high precision.

IEGs have been extensively described in the literature as fast responders to multiple stimuli. It’s been shown that IEGs in a variety of cell types are activated within minutes after exposure to various perturbations, including mechanical stress [32]. Their ability to rapidly activate transcription likely represents part of a cascade of events that are involved in tight gene regulation. The cascade is initiated with a distinct set of immediate early genes, followed by a set of delayed early genes [18, 33], a nomenclature derived from the sequence of viral gene expression after successful entry of the virus into its host cell. Due to this cascade of early events, it’s unlikely any published tissue dissociation protocol (or detachment of adherent tissue culture cells) will prevent activation of IEGs. However, our data suggests artefactual changes to the rest of the transcriptome can be avoided, presumably by limiting the ability of cells to execute the full cascade. The specific list of IEGs involved in a given experiment is context dependent [32]. Our survey of published literature revealed that many IEGs overlap between different experiments and their total number is unlikely to exceed 150–200 genes. As a consequence, we have shown that our protocol allows us to effectively limit process related bias and accurately detect gene expression changes in >99% of genes. Bioinformatic filtering of all known IEGs and inclusion of control samples offer additional measures to mitigate artefacts when processing cells derived from tissue. It is also remarkable that aside from the IEGs discussed above, fewer than 200 genes would be categorized as DE even up to a FDR <0.9 cutoff. This strongly argues that no systematic distortion is caused by the methods used here and experimental noise is minimal (see S3 Fig). In conclusion, our data suggests that our methods detect DEGs between different experimental conditions with high accuracy and reproducibility.

The exploratory segment of our experiment targeted the discovery of gene expression profiles of interest in tumor tissue subpopulations identified by various surface markers. A previously published example from our lab had validated this approach [12] and we aimed to uncover whether using different surface markers as well as a different tumor model may result in similar or different subpopulation profiles.

We successfully identified distinct DEG sets with each marker containing hundreds of DEGs even at stringent statistical cutoff values (Table 2). Several arguments derived from our results would indicate that RNASeq performed on FACS sorted cell populations enables sensitive and precise identification of genes unique to these cells: 1) The genes for both surface markers used in this study (ITGA6/CD49f and Prom1/CD133) as well as our previous study (CXCR4/CD184) were among the top 3 DEGs (ranked by FDR, S2 Fig). 2) Among the range of differentially expressed genes with very high statistical significance (FDR<0.01) a number were detected at lower than 2-fold differential expression (8 for CD133 and 39 for CD49f sorting). 3) The DEGs common to both CD133 sorting as well as CD49f sorting were strongly correlated; the 121 genes in common between each sort (at FDR<0.05) have a coefficient of R = -0.85 (Fig 4). Such a clear correlation is much more consistent with an underlying biological control of those genes as opposed to a random occurrence and further supports the notion of a very low noise level. All of these observations are consistent with “clean” separation of expression profiles in subpopulations isolated through FACS sorting.

When we compared the two DEG lists from this work with our previously published list (using a different TNBC model labeled BRC12) and with gene profiles from other investigators we found a number of surprising parallels. For the two cancer models used in our lab there was substantial overlap between DEGs in the BRC12/CD184lo and BRC13/CD49flo populations on one hand and BRC12/CD184hi and BRC13/CD133hi on the other (Figs 5 and S4S13). This was not only far more than would be expected by chance but each overlap also aligned with gene expression patterns described by other investigators related to biological functions important to tumorigenesis. BRC12/CD184lo and BRC13/CD49flo cells expressed many genes related to hypoxia and drug resistance [3436], (Figs 5A and S10S13) while BRC12/CD184hi and BRC/CD133hi cells had high expression in genes indicating proliferation, DNA Damage Response and stemness (Figs 5B and S4S9).

Focusing on specific gene families we found further evidence of co-regulation of genes in a manner consistent with functional relevance. The best such example is presented in the keratin gene family. The KRT genes 5,6, 14, 16 and 17 which appear to be co-regulated and have been used as markers for basal breast cancer [3740] are all upregulated in BRC12/CD184lo and BRC13/CD49flo cells with remarkable quantitative consistency (S3 Table). Their upregulation is also consistent with the activity of Hypoxia Inducible Factor (HIF), i.e. the presence of hypoxia [41, 42]. Furthermore, especially some genes for which evidence for functional roles in hypoxia has been demonstrated repeatedly and in cancer cells (e.g. L1CAM, ANGPTL4, CA9, VEGFA, LDHA, P4HA1, LOX, LOXL2, LOXL4) are almost all significantly upregulated in both BRC12/CD184lo and BRC13/CD49flo cells (S3 Table). Hypoxia is often correlated with therapeutic resistance [43, 44]. In accordance with this, gene expression signatures of drug resistance can also be found in those same populations (S8 Fig).

While functional tests clearly are required to confirm phenotypic relevance of our results we believe that these distinct distributions of overexpressed genes in separate compartments point to some very intriguing possibilities. The two mouse models we investigated showed a high degree of similarities, especially considering the functions the gene patterns are implicated in. scRNASeq has led to a much better appreciation of subtle gene expression differences between otherwise similar tissue cells and a number of those studies with TNBC result in similar gene patterns as we describe here (S9S11 Figs). Often gradients are found across different locations within the tissue but they don’t necessarily exist across straight lines which is one among multiple challenges when investigating their functional relevance. It may be useful to expand on the historical notion of what a “cell type” is by considering “cell states” with distinct functional properties [45]. In a recent publication Uri Alon and colleagues proposed a model where cells exist along different axes within a multidimensional space of functionality and each end of a given axis is defined by the presence of “specialist” cells with respect to the function in question [8]. scRNASeq is highly efficient at detecting such gradients but gene dropout effects limit its resolution and function can only be deduced from correlations. A variety of spatial methods have been developed to locate cells and reveal the tissue architecture but similar limitations apply and those studies are highly resource intensive. We believe our approach is complementing those tools by providing direct access to such “specialist” cells in quantities sufficient to carry out in depth functional investigations. Acquired cancer drug resistance might be a primary example of where this could be of major clinical use. Together with the findings of other investigators our data argue that such “division of labor” exists in breast cancer tissue where hypoxia/drug resistance may reside in one compartment while proliferation/stemness/DNA damage response may be driven by another. We show for the first time that such compartments emerge spontaneously and reproducibly in different PDX models. Some overlaps of our profiles with drug resistance signatures in other cancer types (e.g. lung, melanoma; see S8 Fig) suggests that this type of ITH may be common across solid cancer tissues. This provides powerful options for setting up prospective studies in vivo. Recent developments of PDX models with humanized immune systems [46] will likely further improve those options. Studies targeting drug response variations have been conducted in cell culture [44] and resulted in changes of many of the genes related to hypoxia and TGFβ activity described in our study.

A number of studies have described ITH regarding both phenotype as well as genotype and suggested interactions between the different compartments [4749]. The successive engraftment of heterogeneous patient derived cancer tissue through multiple passages [47] showed continued co-existence of distinct genotypic clones growing in each new generation of host mice. The implausibility of perfectly synchronized independent genotypes suggests that clones are interdependent which is consistent with numerous reports of non-cell autonomous proliferation [50, 51]. Our data do not include DNA sequencing and it is thus unclear whether the surface marker identified subpopulations are genotypically distinct, identical or mixed themselves. The overlapping gene expression patterns we observed with the different subpopulations identified in BRC13 (Fig 4) are more consistent with a model of gradients within multidimensional gene expression space as the one pioneered by Uri Alon and colleagues [7] than with a mixture of uniform clones. In our experience surface marker profiles across different mice from the same model, including BRC12 and BRC13 remain quite stable [12]. Although this could be interpreted as a sign of stable relative ratios but it is important to keep in mind that for technical reasons most PDX tissue is analyzed within a fairly narrow range of tumor sizes. Some investigators reported observations that both the relative clonal ratios and total combined cell numbers influence growth kinetics [48] and it is certainly possible that the relative ratios might differ if the same tumors are harvested at a different stage of their growth. Studies combining our approach with DNA sequencing, lineage barcoding and propagation of clonal mixtures through multiple passages as well as analysis at different stages of tumor development provide many opportunities to clarify the respective contribution of each category of ITH described. One of the two PDX models investigated by Jeff Chuang and colleagues [47] happens to be the same one we used in our previous study (BRC12 aka TM00096) and thus would offer a particularly rich data set as background for further work. The use of short-term cultures of defined mixtures of cells as described by Bruna et al. [49] offers an additional efficient way towards such investigations. We believe our work describes a near optimal source of cells for this purpose since reproducibly emerging subpopulations with well defined gene expression patterns grown in the same tumor eliminate the confounding factors afflicting cells grown on plastic or derived from multiple patients. The two most important implications of our findings are technical and biological in nature, respectively: 1) our approach is an efficient option to investigate subpopulations at a functional level due to their accessibility through well established methods of surface marker directed FACS sorting. 2) subpopulations with distinct gene expression signatures related to hypoxia/drug response and proliferation/DNA damage response may be common across different breast cancer models prior to drug challenges. We have successfully applied FACS analysis based on these methods to patient derived tissues [52] raising the possibility that once the functional implications of the various gene profiles can be better understood optimized assays targeted to clinical samples could likely be developed.

Supporting information

S1 File

(DOCX)

S1 Fig. Flow cytometric analysis of representative BRC 13 sample.

Plots represent sample before (left) and after (right, overlay of 4 samples) sorting with CD49f and CD133, resp. These plots are representative of all replicate tumors.

(PDF)

S2 Fig. Surface markers used for sorting and respective gene expressions.

Genes encoding the surface markers which were used to sort the populations (CD184/CXCR4, CD49f/ITGA6, CD133/Prom1) were found to be differentially expressed with high statistical relevance in each experiment. CD184 (left) data is from a prior publication [12]. The horizontal line marks FDR = 0.1.

(PDF)

S3 Fig. Number of genes on the y-axis shown as a function of the FDR.

Even up to very high FDR values the number of genes assessed to be different is very low for all comparisons between samples during the sample prep phase while the number of genes differentially expressed between sorted subpopulations grows continuously with increasing FDR.

(PDF)

S4 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for breast tissue proliferation.

Overlap of a proliferative gene signature from histologically normal breast tissue (30 genes, [53], top; same as in Fig 5) and "hallmark E2F target genes" (34 genes, [54], bottom) with resp. gene sets within this publication are shown in volcano plots Numbers provided are genes above FDR cutoff of 0.1 up (right) or down (left) regulated.

(PDF)

S5 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for breast tissue proliferation.

Overlap of a proliferative gene signature from histologically normal breast tissue (30 genes, [53], left; same as in Fig 5) and "hallmark E2F target genes" (34 genes, [54], right) with resp. gene sets within this publication are shown in heatmaps. 2 genes (TOP2A and MCM2, see arrows) overlap between the signatures.

(PDF)

S6 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for stemness and genomic instability.

Overlap of a stem cell gene signature [55], (top/) and a genomic instability signature identified in glioma [56] (bottom) are shown in volcano plots.

(PDF)

S7 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for stemness and genomic instability.

Overlap of a stem cell gene signature [55], (left) and a genomic instability signature identified in glioma [56] (right) are shown in heatmaps (b).

(PDF)

S8 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for poor outcome and DNA damage response.

Overlap of a subpopulation signature associated with poor outcome in lung adenocarcinoma [57] (top) and a signature associated with DNA damage response (DDR) and resistance to PARP inhibition [58] (bottom) are shown in volcano plots.

(PDF)

S9 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for poor outcome and DNA damage response.

Overlap of a subpopulation signature associated with poor outcome in lung adenocarcinoma [57] (left) and a signature associated with DNA damage response (DDR) and resistance to PARP inhibition [58] (right) are shown in heatmaps (b).

(PDF)

S10 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for hypoxia.

Overlap of a hypoxia signature composed from 3 publications (see Fig 5a, [3436], top) and a larger hypoxia gene set described in [59], (bottom) are shown in volcano plots.

(PDF)

S11 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for hypoxia.

Overlap of a hypoxia signature composed from 3 publications (see Fig 5a, [3436], left) and a larger hypoxia gene set described in [59], (right) are shown in heatmaps (b).

(PDF)

S12 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for drug resistance.

Overlap of a drug resistance signature described in lung carcinoma and melanoma [60] are shown in a volcano plot.

(PDF)

S13 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for drug resistance.

Overlap of a drug resistance signature described in lung carcinoma and melanoma [60] are shown in a heatmap.

(PDF)

S14 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations with overlap of clusters identified in scRNASeq experiments on TNBC samples.

Overlap of 19 genes associated with proliferation in a sscRNASeq analysis of TNBC tissue [61].

(PDF)

S15 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations with overlap of clusters identified in scRNASeq experiments on TNBC samples.

Overlap of 18 genes associated with an epithelial phenotype in a scRNASeq study on TNBC patient samples [62].

(PDF)

S16 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations with overlap of clusters identified in scRNASeq experiments on TNBC samples.

Overlap genes identified in different clusters within single cells from the same tissue in PDX models of TNBC [63] A) 24 combined genes most upregulated in clusters 1,3,6 B) 20 combined genes most upregulated in clusters 2,4.

(PDF)

S1 Table. List of reagents used in flow sorting.

(XLSX)

S2 Table. IEGs identified with statistical cutoff of FDR<0.1.

(XLSX)

S3 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD133lo subpopulation.

(XLSX)

S4 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD133hi subpopulation.

(XLSX)

S5 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD49flo subpopulation.

(XLSX)

S6 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD49fhi subpopulation.

(XLSX)

S7 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC12/CD184lo subpopulation.

(XLSX)

S8 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC12/CD184hi subpopulation.

(XLSX)

S9 Table. Select genes differentially expressed in BRC12/CD184lo and BRC13/CD49flo.

A number of members of the keratin family of genes (a) as well as functionally relevant hypoxia genes (b) were found to be regulated in a very similar fashion in BRC12 and BRC13 subpopulations.

(XLSX)

S10 Table. Gene lists with expression profiles selected from literature.

(XLSX)

S11 Table. DEG list from previous experiment using BRC12 and CD184.

(XLSX)

S12 Table. DEG list from 2nd experiment using BRC13 and CD49f.

(XLSX)

S13 Table. DEG list from 2nd experiment using BRC13 and CD133.

(XLSX)

Acknowledgments

The authors wish to thank Drs. Jeff Baker, Eric Dixon, BDTI and Ben Stanger, University of Pennsylvania for helpful comments.

Data Availability

All data are within the manuscript and supporting information files. The complete RNASeq raw data can be accessed at the EMBL-EBI ArrayExpress server under the accession number “E-MTAB-9428” with the title “High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue” (https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-9428/).

Funding Statement

This study was completely funded by BD Technologies and Innovation. The funder provided support in the form of salaries for authors WP, ES, FT, WSD, FH, MF and RB, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

References

  • 1.Marusyk A, Almendro V, Polyak K. Intra-tumour heterogeneity: a looking glass for cancer? Nat Rev Cancer. 2012;12(5):323–34. [DOI] [PubMed] [Google Scholar]
  • 2.Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–52. Epub 2000/08/30. 10.1038/35021093 . [DOI] [PubMed] [Google Scholar]
  • 3.Heppner GH. Tumor heterogeneity. Cancer Res. 1984;44(6):2259–65. Epub 1984/06/01. . [PubMed] [Google Scholar]
  • 4.Heppner GH, Shekhar M. Tumor heterogeneity is fundamental to the tumor ecosystem. Oncology (Williston Park, NY). 2014;28(9):780–1. Epub 2014/09/17. . [PubMed] [Google Scholar]
  • 5.Egeblad M, Nakasone ES, Werb Z. Tumors as organs: complex tissues that interface with the entire organism. Dev Cell. 2010;18(6):884–901. 10.1016/j.devcel.2010.05.012 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Hinohara K, Polyak K. Intratumoral Heterogeneity: More Than Just Mutations. Trends in cell biology. 2019;29(7):569–79. 10.1016/j.tcb.2019.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Korem Y, Szekely P, Hart Y, Sheftel H, Hausser J, Mayo A, et al. Geometry of the Gene Expression Space of Individual Cells. PLoS computational biology. 2015;11(7):e1004224 Epub 2015/07/15. 10.1371/journal.pcbi.1004224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Adler M, Korem Kohanim Y, Tendler A, Mayo A, Alon U. Continuum of Gene-Expression Profiles Provides Spatial Division of Labor within a Differentiated Cell Type. Cell Systems. 2019;8(1):43-52.e5. 10.1016/j.cels.2018.12.008 [DOI] [PubMed] [Google Scholar]
  • 9.van Dijk D, Sharma R, Nainys J, Yim K, Kathail P, Carr AJ, et al. Recovering Gene Interactions from Single-Cell Data Using Data Diffusion. Cell. 2018;174(3):716-29.e27. Epub 2018/07/03. 10.1016/j.cell.2018.05.061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Schaum N, Karkanias J, Neff NF, May AP, Quake SR, Wyss-Coray T, et al. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Nature. 2018;562(7727):367–72. Epub 2018/10/05. 10.1038/s41586-018-0590-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Byrne AT, Alferez DG, Amant F, Annibali D, Arribas J, Biankin AV, et al. Interrogating open issues in cancer precision medicine with patient-derived xenografts. Nat Rev Cancer. 2017;17(4):254–68. Epub 2017/01/21. 10.1038/nrc.2016.140 . [DOI] [PubMed] [Google Scholar]
  • 12.Snowden E, Porter W, Hahn F, Ferguson M, Tong F, Parker JS, et al. Immunophenotyping and Transcriptomic Outcomes in PDX-derived TNBC Tissue. Molecular Cancer Research. 2017;15(4):429–38. Epub 12/30/2016. [DOI] [PubMed] [Google Scholar]
  • 13.Krebsbach PH, Villa-Diaz LG. The Role of Integrin α6 (CD49f) in Stem Cells: More than a Conserved Biomarker. Stem cells and development. 2017;26(15):1090–9. Epub 2017/05/13. 10.1089/scd.2016.0319 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Brugnoli F, Grassilli S, Al-Qassab Y, Capitani S, Bertagnolo V. CD133 in Breast Cancer Cells: More than a Stem Cell Marker. Journal of oncology. 2019;2019:7512632 Epub 2019/10/23. 10.1155/2019/7512632 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.National Research Council (US) Committee for the Update of the Guide for the Care And Use of Laboratory Animals. Guide for the Care and Use of Laboratory Animals. Guide for the Care and Use of Laboratory Animals. 8 ed. Washington (DC): National Academies Press (US) National Academy of Sciences.; 2011.
  • 16.Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic acids research. 2016;44(W1):W90–7. Epub 2016/05/05. 10.1093/nar/gkw377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics. 2013;14:128 Epub 2013/04/17. 10.1186/1471-2105-14-128 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Tullai JW, Schaffer ME, Mullenbrock S, Sholder G, Kasif S, Cooper GM. Immediate-early and delayed primary response genes are distinct in function and genomic architecture. The Journal of biological chemistry. 2007;282(33):23981–95. Epub 2007/06/19. 10.1074/jbc.M702044200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Moriya S, Chourasia D, Ng KW, Khel NB, Parhar IS. Cloning and localization of immediate early response 2 (ier2) gene in the brain of medaka. Journal of chemical neuroanatomy. 2016;77:24–9. Epub 2016/05/03. 10.1016/j.jchemneu.2016.04.005 . [DOI] [PubMed] [Google Scholar]
  • 20.Jahner D, Hunter T. The ras-related gene rhoB is an immediate-early gene inducible by v-Fps, epidermal growth factor, and platelet-derived growth factor in rat fibroblasts. Molecular and cellular biology. 1991;11(7):3682–90. Epub 1991/07/01. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Cullingford TE, Butler MJ, Marshall AK, Tham el L, Sugden PH, Clerk A. Differential regulation of Kruppel-like factor family transcription factor expression in neonatal rat cardiac myocytes: effects of endothelin-1, oxidative stress and cytokines. Biochim Biophys Acta. 2008;1783(6):1229–36. Epub 2008/04/15. 10.1016/j.bbamcr.2008.03.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Rossi SL, Lumpkin CJ, Harris AW, Holbrook J, Gentillon C, McCahan SM, et al. Identification of early gene expression changes in primary cultured neurons treated with topoisomerase I poisons. Biochemical and biophysical research communications. 2016;479(2):319–24. Epub 2016/09/20. 10.1016/j.bbrc.2016.09.068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Waelput W, Verhee A, Broekaert D, Eyckerman S, Vandekerckhove J, Beattie JH, et al. Identification and expression analysis of leptin-regulated immediate early response and late target genes. The Biochemical journal. 2000;348 Pt 1:55–61. Epub 2000/05/05. [PMC free article] [PubMed] [Google Scholar]
  • 24.Pirola L, Balcerczyk A, Tothill RW, Haviv I, Kaspi A, Lunke S, et al. Genome-wide analysis distinguishes hyperglycemia regulated epigenetic signatures of primary vascular cells. Genome Res. 2011;21(10):1601–15. Epub 2011/09/06. 10.1101/gr.116095.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Qiao X, Lu JY, Hofmann SL. Gene expression profiling in a mouse model of infantile neuronal ceroid lipofuscinosis reveals upregulation of immediate early genes and mediators of the inflammatory response. BMC neuroscience. 2007;8:95 Epub 2007/11/21. 10.1186/1471-2202-8-95 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Qiu LQ, Lai WS, Bradbury A, Zeldin DC, Blackshear PJ. Tristetraprolin (TTP) coordinately regulates primary and secondary cellular responses to proinflammatory stimuli. Journal of leukocyte biology. 2015;97(4):723–36. Epub 2015/02/07. 10.1189/jlb.3A0214-106R [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kelly K, Siebenlist U. Immediate-early genes induced by antigen receptor stimulation. Current opinion in immunology. 1995;7(3):327–32. Epub 1995/06/01. . [DOI] [PubMed] [Google Scholar]
  • 28.Sansone P, Berishaj M, Rajasekhar VK, Ceccarelli C, Chang Q, Strillacci A, et al. Evolution of Cancer Stem-like Cells in Endocrine-Resistant Metastatic Breast Cancers Is Mediated by Stromal Microvesicles. Cancer Res. 2017;77(8):1927–41. Epub 2017/02/17. 10.1158/0008-5472.can-16-2129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gomez-Miragaya J, Palafox M, Pare L, Yoldi G, Ferrer I, Vila S, et al. Resistance to Taxanes in Triple-Negative Breast Cancer Associates with the Dynamics of a CD49f+ Tumor-Initiating Population. Stem cell reports. 2017;8(5):1392–407. Epub 2017/05/02. 10.1016/j.stemcr.2017.03.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Vassilopoulos A, Chisholm C, Lahusen T, Zheng H, Deng CX. A critical role of CD29 and CD49f in mediating metastasis for cancer-initiating cells isolated from a Brca1-associated mouse model of breast cancer. Oncogene. 2013;33(47). Epub 2013/12/10. 10.1038/onc.2013.516 [DOI] [PubMed] [Google Scholar]
  • 31.Wang E. RNA amplification for successful gene profiling analysis. Journal of translational medicine. 2005;3:28-. 10.1186/1479-5876-3-28 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bahrami S, Drablos F. Gene regulation in the immediate-early response process. Advances in biological regulation. 2016;62:37–49. Epub 2016/05/26. 10.1016/j.jbior.2016.05.001 . [DOI] [PubMed] [Google Scholar]
  • 33.Sironen RK, Karjalainen HM, Elo MA, Kaarniranta K, Torronen K, Takigawa M, et al. cDNA array reveals mechanosensitive genes in chondrocytic cells under hydrostatic pressure. Biochim Biophys Acta. 2002;1591(1–3):45–54. Epub 2002/08/17. . [DOI] [PubMed] [Google Scholar]
  • 34.Casillas AL, Toth RK, Sainz AG, Singh N, Desai AA, Kraft AS, et al. Hypoxia-Inducible PIM Kinase Expression Promotes Resistance to Antiangiogenic Agents. Clin Cancer Res. 2018;24(1):169–80. Epub 2017/11/01. 10.1158/1078-0432.ccr-17-1318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Amelio I, Mancini M, Petrova V, Cairns RA, Vikhreva P, Nicolai S, et al. p53 mutants cooperate with HIF-1 in transcriptional regulation of extracellular matrix components to promote tumor progression. Proc Natl Acad Sci U S A. 2018;115(46):E10869–e78. Epub 2018/11/02. 10.1073/pnas.1808314115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Lu H, Tran L, Park Y, Chen I, Lan J, Xie Y, et al. Reciprocal Regulation of DUSP9 and DUSP16 Expression by HIF1 Controls ERK and p38 MAP Kinase Activity and Mediates Chemotherapy-Induced Breast Cancer Stem Cell Enrichment. Cancer Res. 2018;78(15):4191–202. Epub 2018/06/09. 10.1158/0008-5472.can-18-0270 . [DOI] [PubMed] [Google Scholar]
  • 37.Teschendorff AE, Miremadi A, Pinder SE, Ellis IO, Caldas C. An immune response gene expression module identifies a good prognosis subtype in estrogen receptor negative breast cancer. Genome biology. 2007;8(8):R157-R. 10.1186/gb-2007-8-8-r157 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hoggarth ZE, Osowski DB, Freeberg BA, Garrett SH, Sens DA, Sens MA, et al. The urothelial cell line UROtsa transformed by arsenite and cadmium display basal characteristics associated with muscle invasive urothelial cancers. PLoS One. 2018;13(12):e0207877 Epub 2018/12/15. 10.1371/journal.pone.0207877 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Pan X, Hobbs RP, Coulombe PA. The expanding significance of keratin intermediate filaments in normal and diseased epithelia. Current opinion in cell biology. 2013;25(1):47–56. Epub 2012/12/29. 10.1016/j.ceb.2012.10.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Loschke F, Seltmann K, Bouameur JE, Magin TM. Regulation of keratin network organization. Current opinion in cell biology. 2015;32:56–64. Epub 2015/01/17. 10.1016/j.ceb.2014.12.006 . [DOI] [PubMed] [Google Scholar]
  • 41.Xing D, Liu L, Marti GP, Zhang X, Reinblatt M, Milner SM, et al. Hypoxia and hypoxia-inducible factor in the burn wound. Wound Repair Regen. 2011;19(2):205–13. 10.1111/j.1524-475X.2010.00656.x . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Krishnamachary B, Berg-Dixon S, Kelly B, Agani F, Feldser D, Ferreira G, et al. Regulation of colon carcinoma cell invasion by hypoxia-inducible factor 1. Cancer Res. 2003;63(5):1138–43. Epub 2003/03/05. . [PubMed] [Google Scholar]
  • 43.Sullivan R, Pare GC, Frederiksen LJ, Semenza GL, Graham CH. Hypoxia-induced resistance to anticancer drugs is associated with decreased senescence and requires hypoxia-inducible factor-1 activity. Molecular cancer therapeutics. 2008;7(7):1961–73. Epub 2008/07/23. 10.1158/1535-7163.mct-08-0198 . [DOI] [PubMed] [Google Scholar]
  • 44.O'Brien SK, Chen L, Zhong W, Armellino D, Yu J, Loreth C, et al. Breast cancer cells respond differentially to modulation of TGFbeta2 signaling after exposure to chemotherapy or hypoxia. Cancer Res. 2015;75(21):4605–16. Epub 2015/09/06. 10.1158/0008-5472.can-15-0650 . [DOI] [PubMed] [Google Scholar]
  • 45.Su Y, Wei W, Robert L, Xue M, Tsoi J, Garcia-Diaz A, et al. Single-cell analysis resolves the cell state transition and signaling dynamics associated with melanoma drug-induced resistance. Proc Natl Acad Sci U S A. 2017;114(52):13679–84. Epub 2017/12/13. 10.1073/pnas.1712064115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wang M, Yao LC, Cheng M, Cai D, Martinek J, Pan CX, et al. Humanized mice in studying efficacy and mechanisms of PD-1-targeted cancer immunotherapy. FASEB journal: official publication of the Federation of American Societies for Experimental Biology. 2018;32(3):1537–49. Epub 2017/11/18. 10.1096/fj.201700740R [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kim H, Kumar P, Menghi F, Noorbakhsh J, Cerveira E, Ryan M, et al. High-resolution deconstruction of evolution induced by chemotherapy treatments in breast cancer xenografts. Scientific reports. 2018;8(1):17937 Epub 2018/12/19. 10.1038/s41598-018-36184-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Nguyen LV, Cox CL, Eirew P, Knapp DJ, Pellacani D, Kannan N, et al. DNA barcoding reveals diverse growth kinetics of human breast tumour subclones in serially passaged xenografts. Nature communications. 2014;5:5871 Epub 2014/12/24. 10.1038/ncomms6871 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Bruna A, Rueda OM, Greenwood W, Batra AS, Callari M, Batra RN, et al. A Biobank of Breast Cancer Explants with Preserved Intra-tumor Heterogeneity to Screen Anticancer Compounds. Cell. 2016;167(1):260-74.e22. Epub 2016/09/20. 10.1016/j.cell.2016.08.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Janiszewska M, Tabassum DP, Castano Z, Cristea S, Yamamoto KN, Kingston NL, et al. Subclonal cooperation drives metastasis by modulating local and systemic immune microenvironments. Nature cell biology. 2019;21(7):879–88. Epub 2019/07/03. 10.1038/s41556-019-0346-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Weber RJ, Desai TA, Gartner ZJ. Non-autonomous cell proliferation in the mammary gland and cancer. Current opinion in cell biology. 2017;45:55–61. Epub 2017/03/18. 10.1016/j.ceb.2017.02.009 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sen M, Hahn F, Black TA, DeMarshall M, Porter W, Snowden E, et al. Flow based single cell analysis of the immune landscape distinguishes Barrett's esophagus from adjacent normal tissue. Oncotarget. 2019;10(38):3592–604. Epub 2019/06/21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chen DT, Nasir A, Culhane A, Venkataramu C, Fulp W, Rubio R, et al. Proliferative genes dominate malignancy-risk gene signature in histologically-normal breast tissue. Breast cancer research and treatment. 2010;119(2):335–46. Epub 2009/03/07. 10.1007/s10549-009-0344-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Smith BA, Sokolov A, Uzunangelov V, Baertsch R, Newton Y, Graim K, et al. A basal stem cell signature identifies aggressive prostate cancer phenotypes. Proc Natl Acad Sci U S A. 2015;112(47):E6544–52. Epub 2015/10/16. 10.1073/pnas.1518007112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Shats I, Gatza ML, Chang JT, Mori S, Wang J, Rich J, et al. Using a stem cell-based signature to guide therapeutic selection in cancer. Cancer Res. 2011;71(5):1772–80. Epub 2010/12/21. 10.1158/0008-5472.can-10-1735 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Zhang Y, Li J, Yi K, Feng J, Cong Z, Wang Z, et al. Elevated signature of a gene module coexpressed with CDC20 marks genomic instability in glioma. Proc Natl Acad Sci U S A. 2019;116(14):6975–84. Epub 2019/03/17. 10.1073/pnas.1814060116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Min JW, Kim WJ, Han JA, Jung YJ, Kim KT, Park WY, et al. Identification of Distinct Tumor Subpopulations in Lung Adenocarcinoma via Single-Cell RNA-seq. PLoS One. 2015;10(8):e0135817 Epub 2015/08/26. 10.1371/journal.pone.0135817 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Hassan S, Esch A, Liby T, Gray JW, Heiser LM. Pathway-Enriched Gene Signature Associated with 53BP1 Response to PARP Inhibition in Triple-Negative Breast Cancer. Molecular cancer therapeutics. 2017;16(12):2892–901. Epub 2017/09/30. 10.1158/1535-7163.mct-17-0170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Harris AL. Hypoxia—a key regulatory factor in tumour growth. Nat Rev Cancer. 2002;2(1):38–47. Epub 2002/03/21. 10.1038/nrc704 . [DOI] [PubMed] [Google Scholar]
  • 60.Tirosh I, Izar B, Prakadan SM, Wadsworth MH, Treacy D, Trombetta JJ, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science. 2016;352(6282):189–96. 10.1126/science.aad0501 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Yang B, Chou J, Tao Y, Wu D, Wu X, Li X, et al. An assessment of prognostic immunity markers in breast cancer. NPJ Breast Cancer. 2018;4:35-. 10.1038/s41523-018-0088-0 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Karaayvaz M, Cristea S, Gillespie SM, Patel AP, Mylvaganam R, Luo CC, et al. Unravelling subclonal heterogeneity and aggressive disease states in TNBC through single-cell RNA-seq. Nature communications. 2018;9(1):3588 Epub 2018/09/06. 10.1038/s41467-018-06052-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Savage P, Blanchet-Cohen A, Revil T, Badescu D, Saleh SMI, Wang YC, et al. A Targetable EGFR-Dependent Tumor-Initiating Program in Breast Cancer. Cell reports. 2017;21(5):1140–9. Epub 2017/11/02. 10.1016/j.celrep.2017.10.015 . [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Randall J Kimple

15 Jun 2020

PONE-D-20-11376

High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue

PLOS ONE

Dear Dr. Blaesius,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Thank you for your patience with the review process. We apologize for the length of time required durign the COVID-19 pandemic. 

Please submit your revised manuscript by Jul 30 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Randall J. Kimple

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. At this time, we request that you  please report additional details in your Methods section regarding animal care, as per our editorial guidelines:

(1) Please provide details of animal welfare (e.g., shelter, food, water, environmental enrichment)

(2) Please include the method of euthanasia  

(3) Please describe the care received by the animals, including the frequency of monitoring and the criteria used to assess animal health and well-being during the course of the tumor study.

Thank you for your attention to these requests.

3. Thank you for including your ethics statement: 'All animal studies were performed under an Institutional Animal Care and Use Committee–approved protocol.'  

(a) Please amend your current ethics statement to include the full name of the ethics committee/institutional review board(s) that approved your specific study.  

(b) Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

For additional information about PLOS ONE ethical requirements for human subjects research, please refer to http://journals.plos.org/plosone/s/submission-guidelines#loc-human-subjects-research.

4. Please note that PLOS does not permit references to “data not shown.” Authors should provide the relevant data within the manuscript, the Supporting Information files, or in a public repository. If the data are not a core part of the research study being presented, we ask that authors remove any references to these data.

5. Please provide the product number and any lot numbers of the primary antibodies purchased from chemical companies for your study.

6. To comply with PLOS ONE submission guidelines, in your Methods section, please provide additional information regarding your statistical analyses. For more information on PLOS ONE's expectations for statistical reporting, please see https://journals.plos.org/plosone/s/submission-guidelines.#loc-statistical-reporting.

7. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

8. Thank you for stating the following in the Competing Interests section:

WP, ES, FT, WSD and RB are employees of BD and FH and MF were BD employees

at the time the work was carried out.

We note that one or more of the authors are employed by a commercial company: BD Technologies and Innovation,

a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.  

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and  there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In “High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue”, Porter et al study the influence of fluorescence activated cell sorting (FACS) on transcriptional profiling and investigate the presence of multiple different subpopulations within the same breast cancer model tissue.” The manuscript seeks to understand whether there are consistent expression features of cell subpopulations in tumors, by using a xenograft model system. This is an important question for the field. While the techniques used are not at the forefront (scRNA-seq would be preferable), the concept of the work is sound. I have a few major concerns and some minor critiques.

Major

Line 180: The authors should provide a formal gene set analysis of the genes that were DE in both experiments, as well as for the DE genes unique to individual experiments. The analysis provided in lines 192-203 is relevant to this but is too inexact to be evaluated.

The authors have a sizable number of RNA-seq replicates (line 132 suggests there are 32 RNA-seq datasets, though it is difficult to tell from the text) both across and within xenografts. However, there is little discussion of the robustness of the DE analysis across and within mice. This high number of replicates is the major strength of the work, so comparisons of replicates should be explicit.

The authors should describe how many cells were in each of the sorted populations. The proportions of cell populations that appear within xenograft ecosystems has been studied in pubs such as [Cell 167, 260–274.e22 (2016); Nat Comms 5, 5871 (2014); Sci Rep 8, 17937 (2018)] and may be relevant for tumor growth. Comments on this topic would strengthen the work. On a related point, are there published scRNA TNBC datasets available that the authors can compare to? Comparisons to such data would clarify the robustness of the observed effects.

Minor

Line 146: Please clarify that the references indicate the genes’ IEG status. This is not apparent from the table 1 caption.

Line 182: “120 of the 121 genes in common were qualitatively correlated (i.e. identical direction of change) and on a quantitative level the correlation coefficient was R= - 0.85” : The first and second parts of the sentence are stated inconsistently. If the direction of change is identical then the correlation coefficient should be positive. Rephrase.

Some aspects of the article are out of date. E.g. Ref 12 lists a publication from 2017 as “in press.”

Reviewer #2: Porter et al. have submitted a will written report investigating intratumoral heterogeneity of the transcriptome within solid human tumors by isolation of distinct cancer cell populations by FACS. Across tumor cases and replicates the authors identified immediate early gene changes, primarily due to tumor dissociation, that agreed to a high degree with previous publications. Cell populations were sorted based on high/low expression of the cell surface markers CD49f, CD133, and CD184, which were then analyzed by RNAseq. Differentially expressed genes were identified between populations indicating that each population may have distinct roles within the tumors. Overall, the experiments were designed well with proper controls, and this research will likely pave the way for more functional assays in the future (e.g. therapeutic response). I have no major concerns with the manuscript and have only a few minor comments/suggestions.

- In the Introduction I would suggest adding a brief description of the biological roles of CD49f, CD133, and CD184 and/or the types of cancer cell populations they have previously been used to isolate and define.

- For whole fresh tumor and the “bulk” dissociated cell population, what was the degree to mouse stromal (fibroblast, endothelial, and immune cell) cell infiltration? This can often be 10-50% in solid tumors. Did this have any effect on the RNAseq analysis for these samples?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Sep 10;15(9):e0238594. doi: 10.1371/journal.pone.0238594.r002

Author response to Decision Letter 0


30 Jul 2020

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

We have tried to adhere to the specifications for the figure files but as some metrics are not completely unambiguous would request some guidance or assistance in case there are discrepancies.

2. At this time, we request that you please report additional details in your Methods section regarding animal care, as per our editorial guidelines:

(1) Please provide details of animal welfare (e.g., shelter, food, water, environmental enrichment)

A dedicated room to safely house immune compromised mice was used. Positive air flow and filtration was maintained though a BioBubble system. Mice were socially housed in groups of five within a micro isolation cage rack. Food and water were available ad lib. Each cage contained environmental enrichments, including nestlets and igloos.

(2) Please include the method of euthanasia

Euthanasia was performed via exsanguination under anesthesia at the time of tumor extraction.

(3) Please describe the care received by the animals, including the frequency of monitoring and the criteria used to assess animal health and well-being during the course of the tumor study.

Tumor measurements and animal weights were collected on at least a weekly basis following the development of visible tumors. Animals were monitored daily with additional measurements taken if needed. Animals were euthanized if grafted tumor tissue shows signs of necrosis. Additional health signs were monitored with the following termination points: “rapid” weight loss of >10%, specific clinical signs of severe organ involvement (e.g. circling, head-tilt, respiratory rates, paralysis). Also, Body Condition Score (BCS) was monitored, with a score of 2 or below serving as a humane endpoint.

The following clinical signs are indications of morbidity:

1. Persistent anorexia or dehydration

2. Consistent or rapid weight loss of 20% maintained for 72 hours.

3. Unable to maintain an upright posture or to move.

4. Muscle atrophy or emaciation.

5. Lethargy or failure to respond to gentle stimuli.

6. Hyperthermia.

7. Unconsciousness or coma.

8. Bloodstained or mucropurulent discharge from any orifice.

9. Labored respiration – particularly if accompanied by nasal discharge or cyanosis.

10. Anemia.

11. Significant abdominal distension.

12. Incontinence or prolonged diarrhea.

13. Dragging a limb or having trouble ambulating.

Animals were observed daily for signs or pain/distress. Animals that exhibited any of these behaviors were humanely euthanized.Thank you for your attention to these requests.

Changes were made to the methods section to include the details requested. We opted to omit the list of clinical signs of indications of morbidity we use (below) for conciseness but would be happy to include this as well if requested:

The following clinical signs are indications of morbidity:

1. Persistent anorexia or dehydration

2. Consistent or rapid weight loss of 20% maintained for 72 hours.

3. Unable to maintain an upright posture or to move.

4. Muscle atrophy or emaciation.

5. Lethargy or failure to respond to gentle stimuli.

6. Hyperthermia.

7. Unconsciousness or coma.

8. Bloodstained or mucropurulent discharge from any orifice.

9. Labored respiration – particularly if accompanied by nasal discharge or cyanosis.

10. Anemia.

11. Significant abdominal distension.

12. Incontinence or prolonged diarrhea.

13. Dragging a limb or having trouble ambulating.

3. Thank you for including your ethics statement: 'All animal studies were performed under an Institutional Animal Care and Use Committee–approved protocol.'

(a) Please amend your current ethics statement to include the full name of the ethics committee/institutional review board(s) that approved your specific study.

(b) Once you have amended this/these statement(s) in the Methods section of the manuscript, please add the same text to the “Ethics Statement” field of the submission form (via “Edit Submission”).

For additional information about PLOS ONE ethical requirements for human subjects research, please refer to http://journals.plos.org/plosone/s/submission-guidelines#loc-human-subjects-research.

We believe this does not apply to our study since we used PDX models commercially available from Jackson Labs and an IRB was not required. Please advise if this is in conflict with your interpretation of this requirement.

4. Please note that PLOS does not permit references to “data not shown.” Authors should provide the relevant data within the manuscript, the Supporting Information files, or in a public repository. If the data are not a core part of the research study being presented, we ask that authors remove any references to these data. Done

5. Please provide the product number and any lot numbers of the primary antibodies purchased from chemical companies for your study.

A table has been added in the supplemental material listing this information.

6. To comply with PLOS ONE submission guidelines, in your Methods section, please provide additional information regarding your statistical analyses. For more information on PLOS ONE's expectations for statistical reporting, please see https://journals.plos.org/plosone/s/submission-guidelines.#loc-statistical-reporting.

The Methods section has been augmented with the respective information

7. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

The respective data will be made public by uploading the entire RNASeq data set to an appropriate public server (see below).

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

N/A

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

In process; we will submit confirmation as soon as available.

We will update your Data Availability statement on your behalf to reflect the information you provide.

8. Thank you for stating the following in the Competing Interests section:

WP, ES, FT, WSD and RB are employees of BD and FH and MF were BD employees

at the time the work was carried out.

We note that one or more of the authors are employed by a commercial company: BD Technologies and Innovation,

(I have corrected the exact wording)

a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

(I have added an amended Funding Statement with the Cover Letter using other company papers as guidance for the wording)

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

(I have completed using other company papers as guidance)

b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

(I have added an amended Competing Interest Statement with the Cover Letter using other company papers as guidance for the wording)

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests

Response to Reviewers

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In “High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue”, Porter et al study the influence of fluorescence activated cell sorting (FACS) on transcriptional profiling and investigate the presence of multiple different subpopulations within the same breast cancer model tissue.” The manuscript seeks to understand whether there are consistent expression features of cell subpopulations in tumors, by using a xenograft model system. This is an important question for the field. While the techniques used are not at the forefront (scRNA-seq would be preferable), the concept of the work is sound. I have a few major concerns and some minor critiques.

Major

Line 180: The authors should provide a formal gene set analysis of the genes that were DE in both experiments, as well as for the DE genes unique to individual experiments. The analysis provided in lines 192-203 is relevant to this but is too inexact to be evaluated.

A gene set analysis in Enrichr was performed and supplied in the supplemental material along with a description in the results section.

The authors have a sizable number of RNA-seq replicates (line 132 suggests there are 32 RNA-seq datasets, though it is difficult to tell from the text) both across and within xenografts. However, there is little discussion of the robustness of the DE analysis across and within mice. This high number of replicates is the major strength of the work, so comparisons of replicates should be explicit.

Clarifying language was added in line 132 (caption Fig.1) and 137.

The authors should describe how many cells were in each of the sorted populations. The proportions of cell populations that appear within xenograft ecosystems has been studied in pubs such as [Cell 167, 260–274.e22 (2016); Nat Comms 5, 5871 (2014); Sci Rep 8, 17937 (2018)] and may be relevant for tumor growth. Comments on this topic would strengthen the work.

We appreciate this helpful suggestion. We provided the ratio of the respective supopulations in the supplemental data (S2) and added a paragraph about various forms of ITH (genotypic and phenotypic) and reports of outcomes of mixing cells with defined genotypic and or phenotypic characteristics. Since the primary aim of our work is in providing the most efficient path towards future functional studies of subpopulation interactions we refrain from drawing conclusions on the dynamic involved in our work for which we believe more than one time point would be required.

On a related point, are there published scRNA TNBC datasets available that the authors can compare to? Comparisons to such data would clarify the robustness of the observed effects.

We surveyed the literature specifically to address this point. There is no single repository with standard search protocols without which an exhaustive search would exceed our resources. We were however able to identify multiple examples of significant overlap between DEGs detected in scRNASeq data from TNBC and our gene lists. While we cannot exclude the possibility that additional relevant profiles exist we added three representative examples as supplemental figures (S9-S11) and edited the discussion accordingly.

Additionally we would like to point out that the gene set identified by Hassan et al. 2017 (see Fig. S6) in our original manuscript which correlates strongly with our data was identified in TNBC, albeit based on regulation within whole populations of cells.

Minor

Line 146: Please clarify that the references indicate the genes’ IEG status. This is not apparent from the table 1 caption.

The header of the table was changed to state “Reference identifying gene as IEG” as clarification.

Line 182: “120 of the 121 genes in common were qualitatively correlated (i.e. identical direction of change) and on a quantitative level the correlation coefficient was R= - 0.85” : The first and second parts of the sentence are stated inconsistently. If the direction of change is identical then the correlation coefficient should be positive. Rephrase.

We appreciate pointing out this error and have changed the wording to clearly state that it was an inverse correlation.

Some aspects of the article are out of date. E.g. Ref 12 lists a publication from 2017 as “in press.”

We have corrected the identified publication. Additionally we have replaced previous reference #9 with a peer reviewed publication by the same authors on the topic (previous reference was a pre-publication in Biorxiv).

Reviewer #2: Porter et al. have submitted a will written report investigating intratumoral heterogeneity of the transcriptome within solid human tumors by isolation of distinct cancer cell populations by FACS. Across tumor cases and replicates the authors identified immediate early gene changes, primarily due to tumor dissociation, that agreed to a high degree with previous publications. Cell populations were sorted based on high/low expression of the cell surface markers CD49f, CD133, and CD184, which were then analyzed by RNAseq. Differentially expressed genes were identified between populations indicating that each population may have distinct roles within the tumors. Overall, the experiments were designed well with proper controls, and this research will likely pave the way for more functional assays in the future (e.g. therapeutic response). I have no major concerns with the manuscript and have only a few minor comments/suggestions.

- In the Introduction I would suggest adding a brief description of the biological roles of CD49f, CD133, and CD184 and/or the types of cancer cell populations they have previously been used to isolate and define.

A passage has been added in the introduction including additional references to address this.

- For whole fresh tumor and the “bulk” dissociated cell population, what was the degree to mouse stromal (fibroblast, endothelial, and immune cell) cell infiltration? This can often be 10-50% in solid tumors. Did this have any effect on the RNAseq analysis for these samples?

We added the information in the results. Both models investigated by us so far contained 15% mouse cells which were excluded from the RNASeq analysis (in silico from the bulkk analysis, physically for “human only” and marker sorted cell populations) and to date we have not investigated differing models which would allow us to draw any conclusions in that regard.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Randall J Kimple

20 Aug 2020

High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue

PONE-D-20-11376R1

Dear Dr. Blaesius,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Randall J. Kimple

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I am satisfied with the revisions. All points have been addressed and I would recommend acceptance of the article.

Reviewer #2: All of my comments and concerns were addressed by the authors. However, I would suggest having the name of the repository of the uploaded NGS data in the final draft of the manuscript. Thank you.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Randall J Kimple

26 Aug 2020

PONE-D-20-11376R1

High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue

Dear Dr. Blaesius:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Randall J. Kimple

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    (DOCX)

    S1 Fig. Flow cytometric analysis of representative BRC 13 sample.

    Plots represent sample before (left) and after (right, overlay of 4 samples) sorting with CD49f and CD133, resp. These plots are representative of all replicate tumors.

    (PDF)

    S2 Fig. Surface markers used for sorting and respective gene expressions.

    Genes encoding the surface markers which were used to sort the populations (CD184/CXCR4, CD49f/ITGA6, CD133/Prom1) were found to be differentially expressed with high statistical relevance in each experiment. CD184 (left) data is from a prior publication [12]. The horizontal line marks FDR = 0.1.

    (PDF)

    S3 Fig. Number of genes on the y-axis shown as a function of the FDR.

    Even up to very high FDR values the number of genes assessed to be different is very low for all comparisons between samples during the sample prep phase while the number of genes differentially expressed between sorted subpopulations grows continuously with increasing FDR.

    (PDF)

    S4 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for breast tissue proliferation.

    Overlap of a proliferative gene signature from histologically normal breast tissue (30 genes, [53], top; same as in Fig 5) and "hallmark E2F target genes" (34 genes, [54], bottom) with resp. gene sets within this publication are shown in volcano plots Numbers provided are genes above FDR cutoff of 0.1 up (right) or down (left) regulated.

    (PDF)

    S5 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for breast tissue proliferation.

    Overlap of a proliferative gene signature from histologically normal breast tissue (30 genes, [53], left; same as in Fig 5) and "hallmark E2F target genes" (34 genes, [54], right) with resp. gene sets within this publication are shown in heatmaps. 2 genes (TOP2A and MCM2, see arrows) overlap between the signatures.

    (PDF)

    S6 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for stemness and genomic instability.

    Overlap of a stem cell gene signature [55], (top/) and a genomic instability signature identified in glioma [56] (bottom) are shown in volcano plots.

    (PDF)

    S7 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for stemness and genomic instability.

    Overlap of a stem cell gene signature [55], (left) and a genomic instability signature identified in glioma [56] (right) are shown in heatmaps (b).

    (PDF)

    S8 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for poor outcome and DNA damage response.

    Overlap of a subpopulation signature associated with poor outcome in lung adenocarcinoma [57] (top) and a signature associated with DNA damage response (DDR) and resistance to PARP inhibition [58] (bottom) are shown in volcano plots.

    (PDF)

    S9 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for poor outcome and DNA damage response.

    Overlap of a subpopulation signature associated with poor outcome in lung adenocarcinoma [57] (left) and a signature associated with DNA damage response (DDR) and resistance to PARP inhibition [58] (right) are shown in heatmaps (b).

    (PDF)

    S10 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for hypoxia.

    Overlap of a hypoxia signature composed from 3 publications (see Fig 5a, [3436], top) and a larger hypoxia gene set described in [59], (bottom) are shown in volcano plots.

    (PDF)

    S11 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for hypoxia.

    Overlap of a hypoxia signature composed from 3 publications (see Fig 5a, [3436], left) and a larger hypoxia gene set described in [59], (right) are shown in heatmaps (b).

    (PDF)

    S12 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for drug resistance.

    Overlap of a drug resistance signature described in lung carcinoma and melanoma [60] are shown in a volcano plot.

    (PDF)

    S13 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations for drug resistance.

    Overlap of a drug resistance signature described in lung carcinoma and melanoma [60] are shown in a heatmap.

    (PDF)

    S14 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations with overlap of clusters identified in scRNASeq experiments on TNBC samples.

    Overlap of 19 genes associated with proliferation in a sscRNASeq analysis of TNBC tissue [61].

    (PDF)

    S15 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations with overlap of clusters identified in scRNASeq experiments on TNBC samples.

    Overlap of 18 genes associated with an epithelial phenotype in a scRNASeq study on TNBC patient samples [62].

    (PDF)

    S16 Fig. Gene expression signatures identified in BRC12 and BRC13 tumor cell subpopulations with overlap of clusters identified in scRNASeq experiments on TNBC samples.

    Overlap genes identified in different clusters within single cells from the same tissue in PDX models of TNBC [63] A) 24 combined genes most upregulated in clusters 1,3,6 B) 20 combined genes most upregulated in clusters 2,4.

    (PDF)

    S1 Table. List of reagents used in flow sorting.

    (XLSX)

    S2 Table. IEGs identified with statistical cutoff of FDR<0.1.

    (XLSX)

    S3 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD133lo subpopulation.

    (XLSX)

    S4 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD133hi subpopulation.

    (XLSX)

    S5 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD49flo subpopulation.

    (XLSX)

    S6 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC13/CD49fhi subpopulation.

    (XLSX)

    S7 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC12/CD184lo subpopulation.

    (XLSX)

    S8 Table. List of GO terms generated in Enrichr for DEGs upregulated in the BRC12/CD184hi subpopulation.

    (XLSX)

    S9 Table. Select genes differentially expressed in BRC12/CD184lo and BRC13/CD49flo.

    A number of members of the keratin family of genes (a) as well as functionally relevant hypoxia genes (b) were found to be regulated in a very similar fashion in BRC12 and BRC13 subpopulations.

    (XLSX)

    S10 Table. Gene lists with expression profiles selected from literature.

    (XLSX)

    S11 Table. DEG list from previous experiment using BRC12 and CD184.

    (XLSX)

    S12 Table. DEG list from 2nd experiment using BRC13 and CD49f.

    (XLSX)

    S13 Table. DEG list from 2nd experiment using BRC13 and CD133.

    (XLSX)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    All data are within the manuscript and supporting information files. The complete RNASeq raw data can be accessed at the EMBL-EBI ArrayExpress server under the accession number “E-MTAB-9428” with the title “High accuracy gene expression profiling of sorted cell subpopulations from breast cancer PDX model tissue” (https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-9428/).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES