Abstract
Although thousands of breast cancer cells disseminate and home to bone marrow until primary surgery, usually less than a handful will succeed in establishing manifest metastases months to years later. To identify signals that support survival or outgrowth in patients, we profile rare bone marrow-derived disseminated cancer cells (DCCs) long before manifestation of metastasis and identify IL6/PI3K-signaling as candidate pathway for DCC activation. Surprisingly, and similar to mammary epithelial cells, DCCs lack membranous IL6 receptor expression and mechanistic dissection reveals IL6 trans-signaling to regulate a stem-like state of mammary epithelial cells via gp130. Responsiveness to IL6 trans-signals is found to be niche-dependent as bone marrow stromal and endosteal cells down-regulate gp130 in premalignant mammary epithelial cells as opposed to vascular niche cells. PIK3CA activation renders cells independent from IL6 trans-signaling. Consistent with a bottleneck function of microenvironmental DCC control, we find PIK3CA mutations highly associated with late-stage metastatic cells while being extremely rare in early DCCs. Our data suggest that the initial steps of metastasis formation are often not cancer cell-autonomous, but also depend on microenvironmental signals.
Subject terms: Breast cancer, Cancer
Metastatic dissemination in breast cancer patients occurs early in malignant transformation, raising questions about how disseminated cancer cells (DCC) progress at distant sites. Here, the authors show that DCCs in bone marrow are activated via IL6-trans-signaling and thereby acquire stemness traits relevant for metastasis formation.
Introduction
In breast cancer, dissemination to distant sites precedes the clinical manifestation of metastasis by 6–8 years in the median, ranging from <1 year to >40 years1–3. These clinical data derived from breast cancer growth kinetics and imaging studies are strongly supported by recent experimental evidence. Whereas dissemination from the primary site occurs preferentially in early tumor stages4–6, specific mechanisms reduce cancer cell dissemination in anatomically and molecularly advanced stages5. Furthermore, analysis of cancer growth kinetics suggests that not only cancer cells disseminate early but also that micrometastatic colony formation is initiated early. However, its manifestation may take considerable time1,3,7. Such data are consistent with the observation that early disseminated cancer cells (DCCs) often lack critical genetic and genomic alterations, which they need to acquire at the distant site in breast and other cancers. This process could explain the much longer clinical latency periods observed in humans as compared to mouse models5,8,9 and be particularly relevant for cancers displaying late relapses, such as hormone receptor-positive (HR+) breast cancer. However, early dissemination and prolonged clinical latency at distant sites raise questions about the identity and nature of signals conferring survival, genomic progression, and outgrowth of DCCs over extended periods of time.
DCCs are extremely rare. They are detected at very low frequencies (1–2 DCCs per 106 BM cells10,11) in bone marrow (BM) of ~30% of breast cancer patients with no evidence of manifest metastasis. Besides genomic studies, the assessment of the DCC phenotype has been limited to testing for selected antigens12 and to anecdotal transcriptomic studies13,14. Moreover, spontaneous or transgenic mouse models, such as the PyMT- or Her2-driven models, do not generate bone metastases. Hence, there is no in vivo model available to study the spontaneous progression and genomic evolution from early BM infiltration to manifestation of bone metastasis.
To unravel mechanisms operative during clinical latency periods, we interrogate transcript-derived pathway information from DCCs isolated from BM of breast cancer patients. Since early breast cancer DCCs often display close-to-normal genomes5,15, we use mammary epithelial cells isolated from reduction mammoplasties and available immortalized premalignant breast cancer cell lines as cellular models for functional testing of candidate mechanisms in vitro. We identified interleukin-6 (IL-6) trans-signaling as a pathway that (i) activates normal and premalignant cells, (ii) induces a proliferative stem/progenitor-like phenotype in mammary epithelial cells, and (iii) whose activation in DCCs depends on regulatory niche cells in BM. These data shed light onto the so far dark stage of early metastasis formation in patients. Moreover, it may inform about future ways to delay or prevent metachronous metastasis in patients whose breast cancer is diagnosed to be locally confined by standard clinical means.
Results
Early DCCs do not engraft in immunodeficient mice
For decades attempts to culture early DCCs, that is, DCCs from nonmetastasized M0-stage patients, have failed. Only anecdotal reports have been published that were not reproduced since then16–18. We recently observed in melanoma that early DCCs failing to generate xenografts differ genomically from DCCs that successfully engrafted, indicating a causative role of genomic maturation for metastasis and xenograft formation. Specific alterations were identified that are closely linked to colonization success in mice and patients9. We repeated these experiments for BM-derived breast and prostate cancer DCCs from early (M0) and advanced (M1) stages. Given the very low frequency of BM-DCCs (<10−6), we either injected CD45-depleted or EpCAM-enriched BM cells or generated and transplanted spheres as these have a higher engraftment likelihood9. In total, we tested 46 patient samples and different routes of application, including subcutaneous, orthotopic (site of origin), intrafemoral and intravenous injection. We then assessed tumor formation at the cutaneous injection sites and metastatic spread to lungs or BM. BM-derived DCCs from M1-stage patients engrafted in two out of four cases. In contrast, early DCCs from 42 M0-stage patients did not establish xenografts (Fig. 1a, b), neither at the injection sites nor in the lungs (p = 0.006; Fisher’s exact test). We also explored the presence of minimal systemic cancer by testing for human cytokeratin (CK) or EpCAM+ cells in murine BM. Interestingly, albeit DCCs of nonmetastatic patients did not expand in mice, they survived in murine BM in 4 out of 42 cases. We detected human EpCAM+ or CK+ DCCs at a frequency of 1–5 DCCs/million BM cells 4–14 weeks after injection of CD45-depleted human BM cells (Fig. 1c). For one of these rare events, we could not only prove human and epithelial, but also malignant origin by single-cell copy number alteration (CNA) analysis (Fig. 1d).
In summary and consistent with our findings in melanoma, early DCCs from patients without manifest metastasis failed to generate xenografts. Besides lower absolute cell numbers and fewer genetic alterations (see below), microenvironmental dependence of early DCCs could account for these results. We therefore decided to retrieve candidate interactions of early DCCs with the microenvironment via direct molecular analysis of early DCCs from breast cancer patients and implement these results into surrogate in vitro models.
Pathway activation in mammary stem and progenitor cells
We hypothesized that stemness traits are necessary for the ability to survive and progress in a hostile environment and to initiate metastasis. Therefore, we tested for pathways activated in cells with progenitor or stem-like traits using our highly sensitive whole transcriptome amplification (WTA) method14,19. To identify these cells, we labeled freshly isolated primary human mammary epithelial cells (HMECs) from reduction mammoplasties of healthy patients with the membrane dye PKH26. Labeled cells were then cultured under nonadherent mammosphere conditions, which support the expansion of stem/early progenitor cells and formation of multicellular spheroids of clonal origin with self-renewing capacity20. Cell divisions during mammosphere formation diluted the dye until only a few label-retaining cells (LRCs) were visible under the microscope (Fig. 2a). Isolating LRCs and non-LRCs (nLRCs) from disaggregated PKH26-labeled HMEC spheres and plating them as single cell per well confirmed that the sphere-forming ability was solely confined to LRCs (Fig. 2b, Fisher’s exact test p = 0.02), which is consistent with previous findings21. For transcriptome analysis, we isolated: (i) LRCs, (ii) nLRCs from disaggregated spheres, and (iii) LRCs that had not divided or formed spheres over time, but remained as single cells and were therefore termed quiescent single cells (QSCs). From each group, we isolated single cells (LRC, n = 8; nLRC, n = 5; QSC, n = 10; Supplementary Table 1), and performed WTA and microarray analysis as previously described19,22. Bioinformatics analysis indicated most variable gene expression between LRCs and QSCs/nLRCs (Fig. 2c and Supplementary Tables 2 and 3), and we found 12 pathways significantly enriched in LRCs over QSCs/nLRCs (Fig. 2d and Supplementary Table 2).
Identification of EpCAM+ DCCs in BM
In order to test whether any of these pathways were enriched in DCCs isolated from BM of breast cancer patients, we aimed to isolate DCCs with confirmed malignant origin14. We followed the reasoning that epithelial cell identity in BM of a cancer patient plus the presence of genetic alterations is sufficient to claim malignant epithelial origin or malignant potential of a cell. DCCs were detected by screening diagnostic BM aspirates from 246 M0-stage and 18 M1-stage patients for cells that stained positively for the epithelial marker EpCAM (Supplementary Fig. 1a). Forty percent of M0-stage and 72% of M1-stage patients harbored EpCAM+ cells. However, EpCAM is a surface marker that is not as specific for DCCs as the diagnostically used CKs14 in BM because it is expressed by cells from the B cell lineage (own unpublished data and ref. 23). Therefore, we sought to differentiate between EpCAM+ cells of breast cancer patients and noncancer patients.
CNAs are found in <5% of nonmalignant cells with a median of 1.8%24–26 and are diagnostically used to differentiate normal and malignant cells27. We therefore performed combined genome and transcriptome analysis and isolated genomic DNA and mRNA from the same single cell5,14. Although this approach fails in 10–50%13,14,28 and a CNA profile cannot be obtained for every cell, we found that 50% and 88% of successfully analyzed EpCAM+ cells from M0- and M1-stage patients, respectively, harbored CNAs (Fig. 3a and Supplementary Fig. 1a, b). We selected these DCCs for single-cell RNA-sequencing (RNA-Seq) analysis (M0: n = 30 DCCs, 21 patients; M1: n = 11 DCCs, 5 patients). To provide additional evidence that the aberrant EpCAM+ cells are derived from a nonhematopoietic cell lineage, we compared them with autochthonous EpCAM+ BM cells. The latter were isolated from patients without known malignant disease undergoing hip replacement surgery. Of note, cancer patient-derived and noncancer patient-derived EpCAM+ cells could be clearly separated using the overall gene expression as well as epithelial and B cell-annotated genes—with the exception of one M0-stage cell, which was therefore excluded from further analysis (Fig. 3b). Moreover, many cells from M0-stage breast cancer patients strongly expressed the marker gene KIT (Fig. 3b and Supplementary Fig. 1c), characteristic for mammary luminal progenitor (LumProg) cells29 (see below). KIT expression was significantly higher in M0-stage DCCs compared to M1-stage DCCs and absent in EpCAM+ cells from noncancer patients. KIT transcript expression in M0-stage DCCs was comparable to EPCAM transcript expression (Supplementary Fig. 1c). Together, CNAs and the epithelial, nonhematopoietic phenotype of cells isolated from patients with breast cancer provided compelling evidence that the selected cells were true DCCs.
IL-6 pathway activation in DCCs
We then tested whether any of the pathways enriched in mammary stem cells (LRCs; Fig. 2d) were also enriched in DCCs using the pathway membership enrichment analysis. We found 4 out of the 12 pathways to be significantly enriched in DCCs (Fig. 3c, d and Supplementary Table 4), including the pathway “IL-6-mediated signaling events,” the “TCPTP” pathway, the “VEGF-VEGFR3,” and “angiopoietin-Tie2 receptor” pathways. We decided to experimentally follow-up on the pathway “IL-6-mediated signaling events” for several reasons: (i) IL-6 signaling was previously found to be relevant for stemness maintenance, that is, mammosphere formation of ductal breast carcinoma and normal mammary gland;30 (ii) the TCPTP pathway, a negative regulator of IL-6 signaling31, was also enriched; and (iii) assessment of individual genes expressed in these pathways (Supplementary Fig. 1d) revealed a substantial overlap of the four pathways (Fig. 3e and Supplementary Table 5), indicating that related signaling modules had been triggered. We also tested the expression of the extracellular signal receptors and found that neither the receptor VEGFR3 nor Tie2 was expressed by DCCs. In contrast, while the mRNA of IL6RA (the IL-6 binding receptor unit) was also absent in DCCs, the IL-6 signal-transducing unit gp130 was expressed (IL6ST; Fig. 3d), indicating the amenability of DCCs to solely IL-6 trans-signaling (see below) and thereby to microenvironmental control. Given the hints for a role of IL-6 signaling for stemness maintenance and the restricted expression of IL-6 signaling molecules, we decided to explore the activation of the IL-6 pathway in normal and premalignant mammary cells in detail.
IL-6 trans-signaling activates sphere-forming ability
The IL-6 pathway can be activated directly or in trans. Direct or classical IL-6 signaling involves IL-6 binding to the heterodimeric receptor consisting of the ubiquitously expressed signal-transducing receptor subunit gp130 (gp130) and the membrane-bound IL-6 receptor alpha chain CD12632. In contrast, trans-signaling does not involve the membranous IL-6R alpha chain (mIL-6RA), but binding of IL-6 to the soluble IL-6RA (sIL-6RA) prior to binding to gp130 on the cell surface. sIL-6RA can be generated by alternative splicing or limited proteolysis of the membrane-bound receptor and provided via autocrine and paracrine secretion. To explore the impact of the different modes of IL-6 signaling on stemness or early progenitor traits, we used the premalignant human mammary epithelial cell lines MCF-10A and hTERT-HME1 and primary HMECs as models for early, genetically immature DCCs. Since metastasis founder cells are thought to display stem-like features33,34, cells were cultured under mammosphere conditions. The analysis of expression and secretion of IL-6 signaling molecules by MCF-10A and primary HMECs using enzyme-linked immunosorbent assay (ELISA), flow cytometry, and single-cell PCR of LRCs and nLRCs (Supplementary Fig. 2a–d) indicated that (i) mIL-6RA is expressed only in a fraction of LRCs and nLRCs; (ii) expression of IL-6 signaling molecules (IL-6RA, IL-6, gp130) does not significantly differ between LRCs and nLRCs, (iii) coexpression of all signaling molecules in individual cells is extremely rare, and (iv) sIL-6RA is generated by shedding of mIL-6RA and not by splicing. Therefore, and in line with our results for breast cancer DCCs, IL-6 trans-signaling via binding of IL-6 to sIL-6RA and subsequent binding to gp130 is much more likely involved in pathway activation than classical IL-6 signaling. We therefore asked if stemness or early progenitor traits in mammary epithelial cells or DCCs are activated by a paracrine mode via classical signaling or trans-signaling. As a model for endogenous trans-signaling activation, we identified normal mammary cell-derived hTERT-HME1 cells with a knock-in of constitutively active epidermal growth factor receptor (EGFR) (hTERT-HME1-EGFR∆746–750)35. This genetic change resulted in significantly increased amounts of IL-6 trans-signaling components in the culture supernatant (Supplementary Fig. 2e).
We then treated mammosphere cultures of MCF-10A, hTERT-HME1, hTERT-HME1-EGFR∆746–750, and primary HMECs with activators or inhibitors of both pathways: (i) an anti-IL-6 antibody to inhibit IL-6 classical and trans-signaling, (ii) IL-6 to activate classical and trans-signaling, and (iii) hyper-IL-6 (HIL6) to selectively activate trans-signaling. HIL6 is a fusion protein consisting of sIL-6RA, a linker chain, and IL-6 and is used as a molecular model of the IL-6/sIL-6RA complex36,37. Adding IL-6 or HIL6 to MCF-10A, hTERT-HME1 cells or HMEC cultures significantly increased sphere formation (Fig. 4a–c, Student’s t test p < 0.01 or one-way analysis of variance (ANOVA)/Dunnett’s test p < 0.0001). Interestingly, primary HMECs responded only to HIL6, but not IL-6 (Fig. 4c), indicating that (i) the increase in sphere number was due to IL-6 trans-signaling, (ii) spheres originated from cells without mIL-6RA expression, and (iii) endogenous sIL-6RA is a limiting factor (Supplementary Fig. 2c). Of note, hTERT-HME1-EGFR∆746–750 could only marginally be stimulated by the addition of HIL6, suggesting that it added little to the already available IL-6/sIL-6RA complexes (Fig. 4d).
To dissect the impact of classical and trans-signaling on the observed increase in sphere formation and hence the number of cells with stem-like activity, we specifically inhibited IL-6 trans-signaling, but not classical signaling by adding the soluble form of gp130 (sgp130-Fc) to IL-6-stimulated MCF-10A sphere cultures38,39. At both concentrations of sgp130-Fc tested, IL-6-induced sphere formation was abolished (Fig. 4e, one-way ANOVA/Dunnett’s test p < 0.0001), demonstrating that cells devoid of mIL-6RA accounted for the increase in sphere numbers by acquiring or activating stem-like functions in response to IL-6 trans-signaling. Consistently, blocking of endogenous classical and trans-signaling in hTERT-HME1-EGFR∆746–750 by anti-IL-6 did not reduce sphere formation to a greater extent than blocking IL-6 trans-signaling only (Fig. 4f, one-way ANOVA/Dunnett’s test p < 0.05 and p < 0.01).
IL-6 trans-signaling converts progenitor into stem-like cells
We noted that IL-6- and HIL6-stimulated mammosphere cultures showed an increase in the relative abundance of CD44high/CD24low cells (Fig. 5a, b), a phenotype that has been ascribed to neoplastic and nontumorigenic mammary cells enriched in tumor-initiating and sphere-forming cells, respectively33,40. Here, HIL6-stimulated cultures displayed the highest increase (Fig. 5a, b, one-way ANOVA/Dunnett’s test ctrl. vs. IL-6+/− sgp130-Fc, p < 0.01; ctrl. vs. HIL6, p < 0.0001). The increase in CD44high/CD24low cells was not the result of increased proliferation of any CD24/CD44 subpopulation, but seemed to be caused by the conversion of non-stem-like CD44high/CD24high/intermediate into CD44high/CD24low stem-like cells (Supplementary Fig. 3a–c). To corroborate these findings, we compared IL-6/HIL6-induced differential gene expression in MCF-10A cells to differences in gene expression between mammary stem cell-enriched (MaSC), LumProg, and mature luminal (MatLum) cells as published by Lim et al.41. The overlap between the respective differentially expressed genes was highly significant in almost all comparisons (Fig. 5c, Supplementary Fig. 3d, and Supplementary Table 6) and the observed expression fold changes were consistent with the notion that IL-6/HIL6 stimulation recruits progenitor populations from both more differentiated and more stem-like populations, with the dedifferentiation branch (MatLum → LumProg) being more consistent than differentiation (MaSC → LumProg). It should be noted that these in vitro generated data are fully consistent with the strong expression of the LumProg marker KIT in DCCs (Fig. 3b and Supplementary Fig. 1c) that we found to be activated via IL-6 signaling.
To confirm these findings, we tested if ex vivo derived primary HMECs converted to stem-like cells by HIL6 activation. We isolated nLRCs from non-IL-6-stimulated HMEC mammospheres (Supplementary Fig. 3e) and replated them as single cell per well with or without HIL6. Whereas in the absence of HIL6 nLRCs were unable to form spheres, the proportion of sphere-forming cells induced from nLRCs in the presence of HIL6 was similar to that of non-HIL6-stimulated LRCs (3% vs. 5% sphere formation; Figs. 2b and 5d). Moreover, replacing HIL6 by IL-6 in the first or second week of a 2-week mammosphere assay showed that continuous IL-6 trans-signaling is needed to induce and maintain the number of cells with stem-like activity (Supplementary Fig. 3f). Finally, we confirmed that primary HIL6-treated HMEC spheres retain their ability to form acinar and tubular structures in vitro (Fig. 5e) and mammary ducts in immunodeficient NSG mice (Fig. 5f). As MCF-10A cells do not form tumors in immunodeficient mice, we selected the LumProg-derived MDA-MB-231 cells to test whether the HIL6-induced increase in sphere formation translates into higher malignancy in vivo. Like MCF-10A cells, MDA-MB-231 cells show increased sphere formation in response to HIL6 (Fig. 5g). Upon xenotransplantation of an equal number of MDA-MB-231 cells pretreated with phosphate-buffered saline (PBS), anti-IL-6 or HIL6 for 3 h, tumors in the HIL6-group were significantly larger than in the control group (Fig. 5h). This was not caused by increased proliferation or decreased apoptosis as the percentage of Ki-67+ tumor cells did not differ significantly between the groups (Fig. 5i), and caspase-3+ cells were not detected in any of the tumors. While this is consistent with an elevated stemness of MDA-MB-231 cells, limiting dilution experiments, ideally performed with additional cell lines, would be needed to fully establish the stemness-conferring activity of IL-6 trans-signaling to human cell line models in xenotransplantation.
BM niche cells regulate responsiveness to IL-6 trans-signaling
As gp130 expression is essential for IL-6 signaling, we tested whether BM stromal cells modulate the ability of mammary epithelial cells to receive IL-6 signals. We isolated primary human mesenchymal stem cells (MSCs) from diagnostic BM aspirates of nonmetastasized breast cancer patients or healthy volunteers and confirmed their ability to differentiate into adipocytes and osteoblasts (OBs) in vitro (Fig. 6a and Supplementary Fig. 4a). We then cocultured MCF-10A cells with (i) MSCs, (ii) in vitro differentiated OBs, or (iii) human umbilical vein endothelial cells (E4ORF1 HUVECs42) under nonsphere conditions. Interestingly, flow cytometric analysis revealed cell surface downregulation of gp130 on MCF-10A cells cocultured with MSCs and OBs, but not with HUVECs (Fig. 6b). Separation of MCF-10A and MSCs by a transwell or using MSC-conditioned medium (CM) showed gp130 cell surface downregulation to be independent of cell–cell contact (Supplementary Fig. 4b). Moreover, downregulation was not immediate but observed between 6 and 14 h after initiation of the coculture with MSCs, OBs, or MSC-CM from healthy donors (HDs) or breast cancer patients (Fig. 6c and Supplementary Fig. 4c). This kinetic is consistent with the known independency of gp130 internalization from ligand binding43 and points towards a transcriptional regulation of gp130 surface expression. To test this, we determined gp130 gene expression levels in single cells isolated from MCF-10A/MSC and MCF-7/MSC cocultures. Interestingly, both cell lines decreased their gp130 gene expression in response to MSCs (Fig. 6d), which is consistent with transcriptional regulation and demonstrates that early DCCs as well as more advanced cancer cells can respond to signals from neighboring cells.
To explore the functional impact of gp130 downregulation induced by MSCs, we tested MCF-10A cells pretreated for 14 h with MSC-CM for their sphere-forming ability. Pretreated MCF-10A showed a significant decrease in sphere number and an increase in single, nonsphere-forming cells in the presence of both endogenously produced IL-6/sIL-6RA and exogenously added HIL6 (Fig. 6e, f, Student’s t test, both p < 0.001). The data indicate that the microenvironment in which early DCCs reside determines (i) their responsiveness to IL-6 trans-signaling, with stromal and osteoblastic niches disabling IL-6 trans-signaling in DCCs, and as a consequence, (ii) the number of DCCs with stem-like phenotype and function, that is, metastasis-initiating ability.
Fully malignant DCCs escape IL-6 trans-signaling dependence by oncogenic pathway activation
Cancer progression is driven by genetic and epigenetic evolution overriding microenvironmental control mechanisms. Although consensus about the nature of the metastatic niche is still lacking, MSC/OB-rich endosteal or vascular niches are believed to regulate the fate of DCCs42,44. Our experiments indicate that endosteal niches, although being rich in IL-6 and sIL-6RA molecules45–48, render DCCs unresponsive to IL-6 trans-signals. However, gp130+ DCCs in vascular niches would respond to IL-6/sIL-6RA complexes. Therefore, in both niches pathway activation by mutation would provide a selection advantage for DCCs that otherwise might depend on microenvironmental signals. We consequently sought for corroborating evidence that genetically variant DCCs may evade the need for IL-6 trans-signaling and become selected. We considered the phosphatidylinositol-3-kinase (PI3K) pathway a strong candidate for such a selected oncogenically activated pathway49 as (i) IL-6 signaling activates not only the Janus kinase/signal transducer and activator of transcription (JAK/STAT) pathway, but also the PI3K/AKT pathway50, (ii) early DCCs expressed PIK3CA as core element of the four identified stemness-associated pathways (Fig. 3e), (iii) activating PIK3CA mutations in exons 9 and 20 are among the most frequent mutations occurring in human breast cancer, and (iv) constitutive activation of the PI3K pathway has been shown to evoke cell dedifferentiation of mammary gland cells into a multipotent stem-like state51. To test PI3K signaling, we analyzed sphere formation in response to HIL6 in premalignant MCF-10A and malignant breast cancer cell lines with or without a PIK3CA-activating mutation. Interestingly, HIL6 increased sphere formation only in cells with wild-type PIK3CA (Fig. 7a), but not in cells with an activating PIK3CAE545K/+ mutation. Using isogenic MCF-10A cells with (knock-in of PIK3CAE545K/+) and without PIK3CA mutation, we noted that HIL6 activated the STAT3 pathway in mutant and wild-type cells (Fig. 7b). In contrast, IL-6 trans-signaling increased the activation level of the ERK and AKT pathway only in wild-type cells as these pathways were already highly activated in PIK3CA-mutant cells (Fig. 7b). Consistently, untreated PIK3CA-mutant cells formed significantly more spheres than wild-type PIK3CA cells (Fig. 7c, Student’s t test, p < 0.0001). MSC-induced gp130 downregulation was unaffected by the PIK3CA mutational status (Supplementary Fig. 4d). In summary, these data show that PIK3CA activation overrides regulation of stemness traits by IL-6 trans-signaling and renders cancer cells more independent from microenvironment control.
Based on these findings, we tested for direct evidence of genetic selection in the PIK3CA pathway during cancer progression by analyzing the PIK3CA gene for mutations in exons 9 and 20 in BM-derived DCCs from nonmetastasized (M0-stage) and in circulating tumor cells (CTCs) from metastasized (M1-stage) breast cancer patients mostly displaying bone metastases. Both groups of CK+ cancer cells had previously been shown to display CNAs5,52. As HR+ breast cancer is the most frequent breast cancer type, we focused on this disease type to obtain meaningful sample numbers. Consistent with previous data, PIK3CA hotspot mutations were found in DCCs/CTCs of patients with HR+ tumors (Fig. 7d). Strikingly, only 4.4% of M0-stage patients harbored PIK3CA mutations, whereas manifest HR+ breast cancer patients with metastasis displayed them in 34.3% of cases (Fisher’s exact test, p < 0.0001; Fig. 7d). These data are fully consistent with a scenario in which early DCCs depend on IL-6 trans-signaling and become increasingly independent thereof during cancer evolution.
Discussion
In this study, we provide evidence for a role of the niche microenvironment to enable and drive the earliest stages of human metastasis formation. We identified IL-6 trans-signaling as an activator of stem-like and progenitor traits underlying epithelial colony formation, a mechanism that is characterized by dependence on both IL-6 and sIL-6RA, in contrast to IL-6 alone, when a cell is equipped with mIL-6RA. Our finding questions the concept of fully malignant and autonomous cancer cells as founders of metastasis. However, during subsequent malignant evolution, DCCs may evade microenvironmental control by acquiring IL-6 independence. Our data indicate that this could occur via mutational activation of the PI3K pathway.
Several observations characterize early metastatic BM colonization. First, breast cancer dissemination in humans often starts from lesions often measuring <1–4 mm in diameter2,5. Second, initially DCCs often do not display the typical karyotypic changes of breast cancer tumors or metastases, but are detected long before manifestation of clinical metastasis5,8,15. Third, metastasis formation in BM usually takes years to decades53, indicating that evolutionary mechanisms and slow growth kinetics dictate the pace. This framework precludes the use of transgenic mouse models that are too short-lived to mimic the process in patients and rarely form bone metastases. The same applies to in vitro and in vivo studies involving commonly used and genomically highly aberrant breast cancer cell lines derived from manifest metastases or primary tumors. All of these do not represent the biology under investigation.
We therefore aimed at expanding early DCCs in immune-deficient mice, but failed. Reasons for this can be manifold: DCCs are extremely rare and we estimate that often <10 cells/mouse were injected. Also, species barriers may preclude engraftment of cancer cells that may critically depend on certain microenvironmental signals, among them the species-specific IL-6 signaling54. Our failure is also consistent with data from melanoma where we had identified that engraftment of DCCs requires activation of specific oncogenic pathways55—which in case of PIK3CA mutations early (M0-stage) breast cancer DCCs commonly lack. Consistently, M1-stage DCCs formed xenografts in two out of four cases. We therefore based our approach on two starting points: first, we analyzed transcriptomic data of early human breast cancer DCCs to identify candidate pathways, and second, we used genomically close-to-normal and normal mammary epithelial cells as models to functionally interrogate the transcriptome data.
We focused our analysis on EpCAM+ cells from BM. We followed the rationale that cells with an epithelial phenotype and concomitant genetic alterations that are isolated from mesenchymal BM of a patient with known carcinoma are bona fide cancer cells. We are aware that this does not provide a formal proof of their origin from the individual cancer. However, given the enormous intratumoral heterogeneity not only of the cancer at diagnosis, but also of its complete evolutionary history, such a formal proof may be difficult to establish. Currently, EpCAM is the best marker to identify and isolate viable epithelial cells from BM. However, it is neither fully specific14 as cells of the B cell lineage may express EpCAM23 nor does it identify all breast cancer-initiating cells56. However, EpCAM+ mammary progenitor cells give rise to chromosomally unstable cancers, such as triple-negative basal-like/postepithelial mesenchymal transition and HR+ cancers57. Since we used chromosomal instability as an inclusion criterion to identify DCCs, our findings do not apply to claudin-low cancers, which are derived from EpCAM−, chromosomally stable cells. Interestingly, early M0-stage DCCs isolated from BM highly expressed KIT, which is characteristic for LumProg cells. LumProg cells are the last common precursor of triple-negative, basal-like and HR+ cancers58. Since HR+breast cancers can experimentally be normalized by functionally impairing their LumProg cells59, cells with this phenotype are likely to comprise metastasis founder cells of EpCAM+ triple-negative and HR+ cancers.
This reasoning is fully consistent with our observed effects of IL-6 on normal mammary stem and progenitor cells. We identified an activated IL-6 pathway in DCCs and our in vitro models revealed that IL-6 trans-signaling, but not classical signaling, induces dedifferentiation of mammary epithelial cells and endows them with stemness and progenitor traits. IL-6 trans-signaling makes cells dependent on the microenvironment; however, BM represents an IL-6-rich environment47. There, IL-6/sIL-6RA is important for the regulation of hematopoiesis60 and for the generation of hematopoiesis-supporting BM stromal cells61 using autocrine/paracrine feedback loops48. In the context of breast cancer, it is noteworthy that serum levels of IL-6 and sIL-6R and their local production in BM by OBs depend on sex steroids, which change with the menstrual cycle, and are negatively regulated by estrogen and hormone replacement therapy and increase up to tenfold during menopause48,62,63. Therefore, systemic microenvironmental changes may provide a mechanism by which DCCs become activated in postmenopausal breast cancer patients. Consistently, bone-only metastasis is significantly associated with higher age at primary diagnosis in HR+ breast cancer patients64, known for very late relapses.
We have shown that IL-6 trans-signaling equips normal and transformed mammary cells with stemness and progenitor traits likely to be crucial for DCCs. Although ~6000–27,000 DCCs may lodge in the BM compartment65, only a few of them will progress to manifest metastasis. Our data are in line with this observation as coculture with BM niche cells like OBs or stromal MSCs resulted in the loss of gp130 expression, rendering only DCCs in vascular niches possibly responsive to IL-6 trans-signaling. Moreover, only ~3–5% of mammary cells acquired mammosphere formation ability upon HIL6 stimulation. Genetic analysis of DCCs revealed that DCCs from HR+ breast cancer progressing to manifest metastasis often acquired PIK3CA mutations, possibly because the activated PI3K/AKT pathway rendered mammary epithelial cells independent of IL-6 trans-signaling.
We are aware of data indicating that some metastases may be derived from late-disseminating cells66,67. One study showed that in three out of six cases tested CK+ cells displayed alteration profiles very similar to their matched primary tumors66. Future studies will need to carefully address the role of tumor (sub)type, disease stage and duration, growth kinetics, and a possible interdependence of early and late DCCs for metastasis formation. However, the findings presented here demonstrate that early DCCs are able to interpret signals from the BM environment that activate them and might drive their progression. Thereby, our study may lay the groundwork for novel adjuvant therapies. Early DCCs striving to form metastasis during yearlong evolutionary processes may be sensitive to different drugs than fully malignant cancer cells. Their Achilles’ heel may consist of microenvironmental signals supporting their survival and genomic progression. Since there is hope that oncogene- or tumor-suppressor-gene-associated drug resistance mechanisms are not yet operative, depriving early DCCs of microenvironmental support pathways could render them vulnerable to nongenotoxic drugs.
Methods
Patient material
Human noncancerous mammary tissue was obtained from female patients undergoing reduction mammoplasty surgeries at the University Center of Plastic-, Aesthetic, Hand-, and Reconstructive Surgery, University of Regensburg, Germany after informed, written consent of patients was obtained (ethics vote number 07/043, ethics committee of the University Regensburg). After verification of the noncancerous origin of the tissue by a pathologist, mammary glands were dissociated and primary HMECs isolated.
Human disseminated cancer cells were obtained from BM aspirates of breast or prostate cancer patients without and with distant metastases. EpCAM+ cells were obtained from BM of patients without known malignant disease undergoing hip replacement surgery (HD, healthy donor). Human MSCs were obtained from BM aspirates of breast cancer patients or HDs. Written informed consent of cancer and control patients was obtained and the ethics committee of the University of Regensburg (ethics vote number 07/79) approved BM sampling and analysis of isolated cells.
Enrichment and detection of CTCs was performed within the SUCCESS (EUDRA-CT number 2005-000490-21) and DETECT (EUDRA-CT number 2010-024238-46)68 studies using the CellSearch® system69. Written informed consent for CTC analysis and characterization was obtained for all patients included. All experiments conformed to the principles set out in the WMA Declaration of Helsinki and were approved by the ethical committees responsible for the corresponding studies (Universities of Munich, Dusseldorf, Tuebingen, and Ulm). Isolation and molecular analysis of CTCs was approved by the ethics committee of Regensburg (ethics vote number 07/079).
Mice
NOD.Cg-Prkdcscid IL2rgtmWjl/Sz (also termed NSG) or NOD.Cg-Prkdcscid mice were purchased from the Jackson Laboratory, USA and maintained under specific pathogen-free conditions, with acidified water and food ad libitum in the research animal facilities of the University of Regensburg, Germany. All approved experimental animal procedures were conducted according to German federal and state government of Upper Palatinate, 54-2531.1-10/07, 54-2532.1-34/11; 54-2532.1-17/11, 54-2532.1-1/12, 54-2532.4-7/12).
Cell lines
MCF-7 and MDA-MB-231 breast cancer cell lines were obtained from the German Collection of Microorganisms and Cell Cultures (DSMZ) and Cell Lines Service (CLS), respectively. MCF-10A (CRL-10317), a nontumorigenic mammary epithelial cell line, was obtained from the American Type Culture Collection70. The isogenic cell line MCF-10A PIK3CAE545K/+ (HD 101-002) together with its parental cell line MCF-10A-parental (HD PAR-003) were purchased from Horizon Discovery, United Kingdom. MCF-10A-GFP cells were generated by transducing MCF-10A cells with pRRL.sin.cPPT.hCMV-GFP.WPRE (generously provided by Luigi Naldini, Italy). hTERT-HME1-derived cell lines, E4ORF1-transduced primary HUVECs, and murine embryonic fibroblasts C3H10T1/2 were generously provided by Alberto Bardelli, (University of Turin, Italy), Cyrus Ghajar (Fred Hutchinson Cancer Research Center, USA), and Max Wicha (University of Michigan, USA), respectively. The identity of all cell lines was confirmed by DNA fingerprinting analysis utilizing the GenePrint 10 System (Promega).
All MCF-10A-derived cell lines were cultured in Ham’s Dulbecco’s modified Eagle’s/F12 (DMEM/F12) medium (Pan-Biotech, Germany) supplemented with 5% horse serum (Sigma-Aldrich, Germany), 2 mM l-glutamine (Pan-Biotech, Germany), 1% penicillin/streptomycin (Pan-Biotech, Germany), 20 ng/ml EGF (Sigma-Aldrich, Germany), 0.5 μg/ml hydrocortisone (Sigma-Aldrich, Germany), 10 μg/ml insulin (Sigma-Aldrich, Germany), and 0.1 µg/ml cholera toxin (Sigma-Aldrich, Germany). All hTERT-HME1-derived cells lines were maintained in DMEM/F12 medium (Pan-Biotech, Germany) supplemented with 10% fetal calf serum (FCS) (Sigma-Aldrich, Germany), 2 mM l-glutamine (Pan-Biotech, Germany), 1% penicillin/streptomycin (Pan-Biotech, Germany), 20 ng/ml EGF (Sigma-Aldrich, Germany), 0.5 μg/ml hydrocortisone (Sigma-Aldrich, Germany), and 10 μg/ml insulin (Sigma-Aldrich, Germany). MDA-MB-231 cells were cultured in DMEM medium (Pan-Biotech, Germany) supplemented with 10% FCS (Sigma-Aldrich, Germany), 2 mM l-glutamine (Pan-Biotech, Germany), and 1% penicillin/streptomycin (Pan-Biotech, Germany). MCF-7 cells were propagated in RPMI-1640 medium (Pan-Biotech, Germany) supplemented with 10% FCS, 2 mM l-glutamine, and 1% penicillin/streptomycin. Murine embryonic fibroblasts C3H10T1/2 were grown in DMEM (Pan-Biotech, Germany) medium supplemented with 5% FCS (Pan-Biotech, Germany), 2 mM l-glutamine (Pan-Biotech, Germany), and 1% penicillin/streptomycin (Pan-Biotech, Germany). E4ORF1-transduced primary HUVECs were cultured using the EGM-2 Bullet Kit (Lonza, Germany). All cell lines were kept at 37 °C and 5% CO2 in a fully humidified incubator and negatively tested for mycoplasma by PCR.
Isolation of disseminated cancer cells from BM
Mononuclear cells from BM of nonmetastasized breast cancer patients were plated on adhesive slides (Thermo Fisher) at a density of 0.5–1 × 106 cells/slide. Slides were stored at −20 °C. From each patient, 1–2 106 BM cells were stained with the monoclonal antibody A45-B/B3 (AS Diagnostik, Germany) against CK8/18/19 and developed with the anti-mouse AB-Polymer (Zytomed Systems, Germany). Unspecific binding was blocked using PBS/10% AB serum (Bio-Rad, Germany). Alkaline phosphatase was developed with 5-bromo-4-chloro-3-indolyl phosphate and nitroblue tetrazolium (Bio-Rad, Germany) as substrate. Slides were covered with PBS under a cover glass and assessed by bright-field microscopy. An identical number of cells served as a control for staining with mouse IgG1 kappa (MOPC-21) without known binding specificity. After removal of the cover glass, positive cells were isolated from the slide with a micromanipulator (Eppendorf PatchMan NP2) and subjected to whole-genome amplification (WGA) for subsequent PIK3CA mutation analysis.
For isolation of both RNA and DNA from the same disseminated cancer cells, mononuclear cells from BM of nonmetastasized breast cancer patients were subjected to immunofluorescent staining for EpCAM (Ber-EP4-FITC, Agilent or HEA-125-PE, Miltenyi Biotec, Germany). Positive cells were isolated with a micromanipulator (Eppendorf PatchMan NP2, Eppendorf, Germany) and cells were subjected to WTA to isolate RNA for subsequent PCR analyses, transcriptome microarrays or RNA-Seq, and WGA for isolation of genomic DNA for subsequent analysis of CNAs.
Isolation of CTCs
Up to three 7.5 ml blood samples per patient were collected into CellSave® tubes (Menarini Silicon Biosystems, Italy). The CellSearch® Epithelial Cell Test (Menarini Silicon Biosystems, Italy) was applied for CTC enrichment and enumeration according to the manufacturer’s instruction. Samples from the SUCCESS study were prepared using a slightly modified protocol, pooling three separate CellSave® tubes (30 ml) as described elsewhere52. CTC+ cartridges were sent from clinical centers to the Chair of Experimental Medicine and Therapy Research, Regensburg for cell isolation and molecular analysis. Cells were extracted from CellSearch® cartridges and isolated using the DEPArrayTM system (Menarini Silicon Biosystems, Italy) and single-cell DNA was amplified by WGA for subsequent PIK3CA mutation analysis.
Isolation of human primary mammary epithelial cells
Primary human noncancerous mammary tissue was mechanically dissociated using scalpels and then subjected to enzymatic digestion overnight at 37 °C in DMEM/F12 (Pan-Biotech, Germany) supplemented with 10 mM HEPES (Sigma-Aldrich, Germany), 2% bovine serum albumin (Sigma-Aldrich, Germany), 5 μg/ml insulin, 0.5 μg/ml hydrocortisone, 10 ng/ml cholera toxin (Sigma-Aldrich, Germany), 300 U/ml collagenase, and 100 U/ml hyaluronidase (all from Sigma-Aldrich, Germany). After removal of organoids and adipocytes by centrifugation at 210 × g for 2 min, the cell suspension was passed over a 100 and 40 μm cell strainer to obtain a single-cell suspension. Separation of fibroblasts from epithelial cells was accomplished by centrifugation at 350 × g for 4 min and epithelial cells from the cell pellet were cultured as mammospheres.
Mammosphere culture
Cell lines and primary HMECs were seeded at a density of 10,000 cells/ml and 50,000 cells/ml, respectively, in 3-, 6-, or 10-cm cell culture dishes or 96-well flat-bottom plates (Thermo Fisher Scientific, Germany; Sigma-Aldrich, Germany; TPP AG, Switzerland). For analyses using the Operetta high-content imaging system cells were plated with 10,000–50,000 cells/ml in 96-well µClear plates (Greiner Bio-One, Germany). To prevent attachment of cells all dishes/plates were coated with polyhydroxyethylmethacrylate (PolyHEMA) (12 mg/ml in 95% ethanol, Sigma-Aldrich, Germany) overnight. PolyHEMA-coated dishes/plates were ultraviolet sterilized for 30 min. Cells were cultured in mammosphere medium consisting of MEBM (mammary epithelial cell basal medium) (Lonza, Germany) supplemented with 1% penicillin/streptomycin (Sigma-Aldrich, Germany), 1×B27 (Life Technologies, Germany), 10 ng/ml EGF (Sigma-Aldrich, Germany), 10 ng/ml basic fibroblast growth factor (bFGF) (Sigma-Aldrich, Germany), 4 µg/ml heparin (Sigma-Aldrich, Germany), and 1% methylcellulose, if the Operetta CLS high-content imaging system was used. For some analyses, mammosphere media were supplemented additionally with 10 ng/ml IL-6 (Sigma-Aldrich, Germany), 1.5 µg/ml anti-IL-6 antibody (Sigma-Aldrich, Germany), 20 ng/ml HIL6, and 0.1 or 10 ng/ml recombinant human sgp130-Fc (R&D Systems, Germany). Mammospheres were cultured in a humidified atmosphere with 5.5% CO2 and 7% O2 at 37 °C for 4 or 7 days.
For setting up of secondary mammosphere cultures, conducting flow cytometric or single-cell expression analyses, first-generation mammospheres were collected on day 7 by gentle centrifugation (100 × g), dissociated into single-cell suspension with trypsin-EDTA (Pan-Biotech, Germany) for 3 min, followed by trypsin-neutralizing solution (Lonza, Germany). Single-cell suspensions of secondary mammospheres were obtained as described for day 7 first-generation mammospheres.
Mammosphere counting
The number of spheres with a diameter ≥50 µm was determined by manually counting of a complete plate/dish at day 7 using an inverted microscope (Olympus, ×10 air objective). Alternatively, spheres were stained and imaged using the Operetta CLS high-content imaging system (PerkinElmer, Hamburg, Germany) by adding CyTRAK Orange (BioStatus Ltd, UK) at day 4 to the wells at a final concentration of 10 µM. After 60 min incubation, fluorescence imaging of the plates was performed using a ×5 air objective and imaging of nine regions per well that were stitched to cover the entire well surface. Harmony high-content analysis software was used to analyze the images and to count the formation of spheres with a diameter ≥50 µm (Version 4.8; PerkinElmer, Hamburg, Germany).
Isolation of LRCs, nLRCs, and QSCs from mammosphere cultures
HMECs were labeled with the PKH26 Red Fluorescent Cell Linker Kit (Sigma-Aldrich, Germany) at 40 nM for 2 min at room temperature (RT), the reaction was stopped with 10% FBS containing medium, and cells were washed three times before plating into primary or secondary mammosphere cultures. LRCs and nLRCs were isolated from spheres of secondary mammosphere cultures that were dissociated with trypsin-EDTA (Pan-Biotech, Germany) for 3 min, neutralized with trypsin-neutralizing solution (Lonza, Germany), and stained with 4′,6-diamidino-2-phenylindole (DAPI) (Roche Diagnostics, Germany) for live/dead cell discrimination. Single LRCs and nLRCs were isolated as single PKH+/DAPI− and PKH−/DAPI− cells using a micromanipulator (Eppendorf PatchMan NP2, Eppendorf, Germany) or flow cytometric activated cell sorting. QSCs were isolated at day 7 from primary mammosphere cultures as single, DAPI−/EpCAM+/PKH+ cells that did not form spheres and using the micromanipulator.
For flow cytometric assessment of the proliferation of MCF-10A-LRCs and MCF-10A-nLRCs, MCF-10A cells were labeled with carboxyfluorescein diacetate succinimidyl ester (ebioscience, Germany) at 2 µM for 10 min at 37 °C in PBS/1% FBS, washed twice after stopping of the reaction with 10% FBS containing medium, and cultured as mammospheres. At day 4, single-cell suspensions were obtained as described above and analyzed by flow cytometry.
In vitro differentiation of HMECs
Matrigel (growth factor reduced, without phenol red, BD Biosciences, Germany) was diluted 1:1 with a differentiation medium (Ham’s DMEM/F12 medium (Pan-Biotech, Germany), 5% FCS (Pan-Biotech, Germany), 5 μg/ml insulin (Sigma-Aldrich, Germany), 1 μg/ml hydrocortisone (Sigma-Aldrich, Germany), 10 μg/ml cholera toxin (Sigma-Aldrich, Germany), 10 ng/ml EGF (Sigma-Aldrich, Germany), 1× penicillin/streptomycin/fungizone (Lonza, Germany)), smeared in 2-well slides and incubated for 15 min at 37 °C. On top, 50,000 cells from disaggregated secondary mammospheres were added. After incubation for 30 min at 37 °C, cells were covered with an additional Matrigel layer and incubated for additional 15 min at 37 °C. The differentiation medium was added at the end of the embedding procedure and exchanged every 2 days. Cultures were examined 3–4 weeks post embedding for the development of tubular and acinar structures.
Culture of primary human MSCs and generation of OBs
Mononuclear cells from BM aspirates were cultured at a density of 2 × 106 cells in a T75 flask (Sarstedt, Germany) in DMEM with 1 g/l glucose, 4 mM glutamine, and 1 mM sodium pyruvate (all from Life Technologies, Germany), supplemented with 10% MSC-qualified FBS (WKS Diagnostik, Germany), 1% penicillin/streptomycin (Life Technologies, Germany), and 1 ng/ml bFGF (Peprotech, Germany). Adherent cells were cultured for 3 weeks and cryo-conserved. Before cryo-conservation MSCs were tested for the expression of CD45, CD34, CD90, CD105, CD44, and Nestin by flow cytometry (Supplementary Fig. 4). In addition, the ability of MSCs to differentiate into adipocytes and OBs were tested71: OBs for coculture experiments were generated by culturing confluent MSC cultures in DMEM with high glucose (Life Technologies, Germany) supplemented with 10% MSC-qualified FBS (WKS Diagnostik, Frankfurt, Germany) 1% penicillin/streptomycin (Life Technologies, Germany), 10−7 M dexamethasone, 25 µg/ml l-ascorbic acid, and 3 mM sodium dihydrogen phosphate (all from Sigma-Aldrich, Germany) for 21 days with the medium being changed every other day.
Cocultures of MCF-10A with MSCs, OBs, and HUVECs
MSCs and HUVECs were plated at a density of 4 × 105 cells/well of a 6-well plate (Corning, Germany) in their respective growth medium. The next day, the medium was exchanged to MCF-10A growth medium and 1 × 105 MCF-10A-GFP cells were added to each well. In case of cocultures with OBs, 4 × 105 MSCs per 9.6 cm2 surface of a 6-well plate (Corning, Germany) were plated and differentiated into OBs for 21 days. On day 22, the medium was exchanged to MCF-10A growth medium and 1 × 105 MCF-10A-GFP cells were added to each well. For cultures with transwells, MSCs were plated at 1.75 × 105 cells per 4.2cm2 of a 6-well transwell insert (Falcon 353090, VWR, Germany).
Xenotransplantations of DCCs, HMECs, and MDA-MB-231 cells
For xenotransplantations of DCCs, mononuclear cells from BM aspirates of nonmetastasized or metastasized breast or prostate cancer patients were enriched for human EpCAM or depleted of human CD45+CD33+CD11b+ cells and erythrocytes using a mix of CD45, CD33, CD11b, and glycophorin A microbeads according to the manufacturer’s instructions (Miltenyi Biotec, Germany). Each sample was then split into halves: one half was subjected to DCC enumeration by staining for CK8/18/19 or EpCAM. The other half of the cell suspension was transplanted without ex vivo expansion into NSG mice using one to two injection routes (nonmetastasized patients) or three to four different injection routes (metastasized patients). In some cases, mononuclear cells were cultured as mammospheres in 6-cm culture plates coated with PolyHEMA (12 mg/ml, Sigma), under hypoxic conditions (7% O2) at 37 °C and in mammosphere medium containing 10 nM HEPES (Sigma-Aldrich, Germany), 10 µg/ml insulin (all from PAN-Biotech, Germany), 5 ng/ml growth-regulated oncogene-α (R&D Systems, Germany), 20 ng/ml HIL6 (kindly provided by S. Rose-John), and 0.2% methylcellulose (Sigma-Aldrich, Germany). Cultures were monitored weekly for sphere growth.
To transplant spheres or EpCAM-enriched or CD45/CD11b/erythrocyte-depleted BM, cells/spheres were collected in a microwell (volume 10–15 µl, Terasaki, Greiner Bio-One, Germany) precoated with PolyHEMA (12 mg/ml, Sigma-Aldrich, Germany). Cells or spheres were transplanted in a final volume of 30 µl and 25% high-concentration Matrigel (BD Biosciences, Germany) as published before9. Cells were injected with an insulin syringe (Microfine, 29G, U-50, BD Biosciences, Germany) subcutaneously, intravenously, intrafemorally, or subrenally in 4–8-week-old male or female NSG or NOD.Cg-Prkdcscid mice. Mammary fat pad injections were performed in the fourth precleared mammary fat pad of 3-week-old female mice in 50% Matrigel (BD Biosciences, Germany). Breast or prostate cancer origin of xenografts was verified by a pathologist.
To assess the differentiation ability of HMEC spheres in vivo, secondary mammospheres were dissociated and 200,000 cells were mixed with 225,000 preirradiated (15 Gy) C3H10T1/2 mouse fibroblasts. The cell suspension was then mixed 1:1 with Matrigel (growth factor reduced without phenol red, BD Bioscience, Germany) and injected in the fourth precleared mammary fat pad of 3-week-old female NSG mice. Mice were euthanized 8 weeks after transplantation and analyzed for the presence of human mammary gland tissue.
MDA-MB-231 cells grown under adherent conditions were pretreated with PBS, an anti-IL-6 antibody (1.5 µg/ml, Sigma-Aldrich, Germany) or HIL6 (20 ng/ml, a kind gift of Stefan Rose-John, Christian-Albrechts-University, Germany) for 3 h and 20,000 cells were transplanted into the mammary fat pad of NSG mice as 1:1 mixture with Matrigel (BD Biosciences, Germany) in the fourth precleared mammary fat pad of 3-week-old female NSG mice. All mice were analyzed when the first tumors reached a diameter of ~10 mm.
Detection of human DCCs and mammary gland in NSG mice
Lung and BM of mice transplanted with human DCCs were analyzed for the presence of DCCs or metastasis. Lungs were examined by a pathologist. For identification of disseminated cancer cells in the mouse BM, mononuclear cells were screened accordingly to the method for human DCCs using immunofluorescent staining of the cell suspension with anti-human EpCAM (Ber-EP4-FITC, Agilent, or HEA-125-PE, Miltenyi Biotec Germany) or adhesive slides and staining anti-CK8/18/19 (A45-B/B3, AS Diagnostik, Germany). Unspecific binding was blocked by using PBS with 5% human AB serum (Bio-Rad, Germany) and 5% mouse serum (Agilent, Germany). Positive cells were isolated with the micromanipulator and subjected to WGA.
Mammary glands of mice transplanted with HMECs were dissected, fixed, and stained with an anti-human CK18 antibody (20 µg/ml, clone CK2, Millipore, Germany). Cells expressing human CK18 were laser microdissected (PALM Microbeam System, Bernried, Germany) and subjected to WGA.
The human origin of DCCs or CK18+ cells isolated from mouse BM or laser microdissected from mammary glands of NSG mice was confirmed by a PCR discriminating between the human and mouse CK19 gene: forward primer, 5′-TTC ATG CTC AGC TGT GAC TG-3′ and reverse primer, 5′-GAA GAT CCG CGA CTG GTA C-3′, with an annealing temperature of 58 °C and an amplicon size of 621 bp for the human sequence.
Quantification of HER2 and PGR staining in tissue sections by TissueFAX cytometry
Tissue sections were stained with an automated staining machine (Ventana: BenchMark ULTRA). Tissue sections used for the analysis were stained within the same run. Images of stained tissue sections were scanned with the TissueFAXSi-plus imaging system (TissueGnostics, Vienna, Austria; acquisition software: TissueFAXS version 3.5.129) equipped with a digital Pixelink color camera (PCO AG, Kelheim, Germany). Images for the analysis of Ki-67 staining were analyzed with the HistoQuest software version 6.0.1.130 (TissueGnostics, Vienna, Austria). Using that software, two markers were created: hematoxylin as “master marker” (nucleus) and Ki-67 as “nonmaster marker.” To achieve optimal cell detection, the following parameters were adjusted: (i) nuclei size; (ii) discrimination by area; (iii) discrimination by gray; and (iv) background threshold. For the evaluation of the percentage of Ki-67-expressing cells, scatter plots were created, allowing the visualization of corresponding cells in the source region of interest using the real-time backgating feature. The cut-off discriminated between false events and specific signals according to cell size and intensity of Ki-67 staining. For the PBS group 112,098 cells (33 regions, 8.25 mm2), anti-IL-6 group 161,279 cells (41 regions, 10.25 mm2), and HIL6-group 98,812 cells (26 regions, 6.50 mm2) were analyzed.
WGA and analysis of CNAs
Single-cell genomic DNA was subjected to WGA using the previously described72,73 or the commercially available version (Ampli1TM WGA, Menarini Silicon Biosystems). DCCs isolated from BM of patients or NSG mice were subjected to CNA analysis (mCGH)72,73 or using the Ampli1TM LowPass Kit (Menarini Silicon Biosystems) according to the manufacturer’s instructions.
PIK3CA sequencing of single cells
PIK3CA mutation in CTCs and DCCs was assessed using the Ampli1TM PIK3CA Seq Kit (Menarini Silicon Biosystems) or amplicon-based sequencing on single cells following Ampli1TM WGA. For amplicon-based sequencing, the following primers were used: exon 9 forward primer, 5′-AAG CAA TTT CTA CAC GAG A-3′ and reverse primer, 5′-CC TTA TTT ATT TCG TCT TAA ATG-3′, with an annealing temperature of 58 °C and an amplicon size of 189 bp; exon 20 forward primer, 5′-TCT AGC TAT TCG ACA GCA TGC-3′ and reverse primer, 5′-T ACC TAA CCT AGA AGG TGT GTT-3′, with an annealing temperature of 58 °C and an amplicon size of 221 bp. For each exon, 1 µl of WGA product of CTCs or DCCs was used for the PCR. Resulting products were loaded on a 1.5% agarose gel and negative PCR results were considered dropouts for PIK3CA analysis. Positive CTC samples were purified using the QIAquick Purification Kit (Qiagen, Germany) according to the manufacturer’s protocol with the exception that elution at the end of the protocol was in 25 µl water. Purified CTC samples were sent to a sequencing provider (Sequiserve, Germany). PCR products from positive DCC samples were purified by the sequencing provider (GATC, Germany).
WTA of single spheres and cells
Single cells or spheres were placed in 5 μl lysis buffer (Active Motif, Belgium) supplemented with 1 μg protease (Active Motif, Belgium), 1 μl biotinylated oligo-dT peptide nucleic acids (PNAs) (Midi-Kit, Active Motif, dissolved in 400 μl water), and 10 ng tRNA (Roche, Germany). The proteolytic digest was performed at 45 °C for 10 min, followed by 1 min at 75 °C and 15 min at 22 °C for PNA annealing. mRNA was isolated with 4 μl streptavidin beads (Active Motif, Belgium) during 45 min rotation at room temperature. Ten microliters of cDNA wash buffer 1 (50 mM Tris-HCl, pH 8.3, 75 mM KCl, 3 mM MgCl2, 10 mM dithiothreitol (DTT), and 0.25% Igepal) were added and the tubes placed into a magnetic rack. The supernatant containing the genomic DNA was collected and the beads were washed using 20 μl of cDNA wash buffer 2 (50 mM Tris-HCl, pH 8.3, 75 mM KCl, 3 mM MgCl2, 10 mM DTT, 0.5% Tween-20). The supernatant was removed and the step repeated with 20 μl cDNA wash buffer 1. Reverse transcription was carried out under rotation for 45 min at 44 °C in a mix containing 0.5 mM dNTPs (VWR, Germany), 200 U Superscript II (Thermo Fisher Scientific, Germany), 30 μM CFL15CN8 primer (C15GTCTAGAN8), 15 μM CFL15CT24 primer (C15GTCTAGAT24VN), 0.25% Igepal, 10 mM DTT (Invitrogen, Germany), and the buffer supplied by the manufacturer in a final volume of 20 μl. Primers were annealed at room temperature for 10–15 min before the addition of the enzyme. After completing the reverse transcription, beads were washed in 20 μl tailing wash buffer (50 mM KH2PO4, pH 7, 1 mM DTT, 0.25% Igepal) and resuspended in 10 μl tailing buffer (10 mM KH2PO4, pH 7, 4 mM MgCl2, 0.1 mM DTT, 200 μM dGTP). After denaturation of the cDNA–mRNA hybrids at 94 °C for 4 min, 10 U terminal deoxynucleotide transferase (Affymetrix, Germany) were added and the samples were incubated at 37 °C for 60 min for the G-tailing reaction. After inactivation of the tailing enzyme (70 °C, 5 min), PCR mix I [4 μl buffer 1 (Expand Long Template, Roche, Germany), 3% deionized formamide] was added to each sample. A hotstart PCR was performed adding 5.5 μl PCR mix II [350 mM dNTPs, 1.2 μM CP2 primer (TCAGAATTCATGC15) and 5 U Pol Mix (Expand Long Template)]. Forty cycles were run: 20 cycles of 15 s at 94 °C, 30 s at 65 °C, 2 min at 68 °C, and 20 cycles with an elongation of the extension time of 10 s and a final elongation step of 7 min at 68 °C.
The quality of WTA products was assessed by expression analysis of three housekeeping genes: EE1A1 forward primer, 5′-CTG TGT CGG GGT TGT AGC CA-3′ and reverse primer, 5′-TGC CCC AGG ACA CAG AGA CT-3′, with an annealing temperature of 58 °C and an amplicon size of 290 bp; ACTB forward primer, 5′-GCG TGA CAT TAA GGA GAA GCT G-3′ and reverse primer, 5′-CGC TCA GGA GGA GCA ATG AT-3′, with an annealing temperature of 58 °C and an amplicon size of 378 bp; GAPDH forward primer, 5′-CCA TCT TCC AGG AGC GAG AT-3′ and reverse primer, 5′-CAG TGG GGA CAC GGA AGG-3′, with an annealing temperature of 58 °C and an amplicon size of 489 bp. Only samples positive for all three markers were used for downstream analyses.
mRNA microarray experiments
MCF-10A cells were cultured as mammospheres in the presence or absence of 10 ng/ml IL-6 (Sigma-Aldrich, Germany), 10 ng/ml IL-6 + 0.1 ng/ml recombinant human sgp130-Fc (R&D Systems, Germany) or 20 ng/ml HIL6 (a kind gift of S. Rose-John, Christian-Albrechts-University, Germany) for 12 and 24 h. Cells were seed in triplicates for all conditions and time points. After 12 and 24 h, cells were collected by centrifugation (5 min at 500 × g) and RNA was isolated using RNeasy Mini Kit (Qiagen, Germany) according to the manufacturer’s protocol. Microarray analysis was performed using the Whole Human Genome Microarray Kit, 4x44K (G4112F, Agilent Technologies, Germany).
For transcriptome analysis of LRCs, nLRCs, and QSCs, HMECs were cultured and cells isolated as described above and cDNA was obtained from manually isolated single cells using WTA.
Labeling of cDNA was performed by PCR with Cy5-labeled primers. Reaction mix contained 5 μl of buffer I (Expand Long Template, Roche, Germany), 3% (v/v) deionized formamide, 0.35 mM each dNTP, 2.5 μM 5′-U*CAGAAU*TCAUGCCC*CCCC*CCCC*C-3′ primer (*denotes nucleotides conjugated with Cy5 fluorophore; Metabion), 3.75 U of Pol Mix (Expand Long Template, Roche, Germany) and 1 μl of WTA product or 100 ng cDNA from bulk RNA preparations of MCF-10A cells in a final volume of 49 μl. PCR parameters were: one cycle with 1 min at 95 °C, 11 cycles with 15 s at 94 °C, 1 min at 60 °C, and 3 min 30 s at 65 °C, three cycles where the elongation time was increased 10 s per cycle, and finally one cycle with an elongation time of 7 min. Labeled products were purified using a PCR Purification Kit (Qiagen, Germany) according to the instructions of the vendor. Purified Cy5-labeled DNA was denatured by incubation for 5 min at 95 °C, followed by incubation on ice. Hybridization solution was prepared by mixing 42 μl of denatured Cy5-labeled DNA, 55 μl of 2× HiRPM hybridization buffer (Agilent, Germany), 11 μl of 10× GE Blocking Agent (Agilent, Germany), 4 μl of 25% (v/v) Tween-20, and 4 μl of 25% (v/v) Igepal. Four 100 μl samples of hybridization mix were overlaid on four hybridization fields of Agilent Whole Human Genome Microarray Kit (4x44K) with SurePrint microarray slides and incubated for 17 h at 65 °C under constant rotation. After hybridization, slides were washed in Agilent Wash Buffer 1 for 1 min on a shaker in the dark and incubation continued in Agilent Wash buffer 2 prewarmed to 37 °C. Slides were dried by washing for 30 s in acetonitrile and scanned on a GenePix 4400A scanner (Molecular Devices, Germany). Numerical readouts of fluorescence intensities (GPR files) were generated using GenePixPro 7 (Molecular Devices, Germany).
NGS mRNA library preparation and sequencing
The majority of the (TTT)7 and (CCC)5 nucleotides forming the ends of cDNA products were removed by a limited-cycle PCR with primers introducing BpuEI and BglI restriction sites (Protocol 1; Supplementary Table 1) or a BglI restriction site (Protocol 2; Supplementary Table 1), followed by a restriction enzyme digestion. For Protocol 1, 1 µl each of two separate 1/5 dilutions of the original WTA sample was used in two sets of five separate PCR reactions (total of ten reactions) with a total volume of 20 µl per reaction with 1.44 µM final concentration of primer CP2-BpuEI (5′-TCA GAA TTC ATG (CCC)5 GTC TTG AGT TTT TT-3′) and 1.44 µM final concentration of primer Cp2-BglI-13C (5′-TCA GAA TTC ATG (CCC)2 CGG (CCC)2-3′) for amplification. After an initial denaturation at 95 °C for 1 min, five cycles of 94 °C for 15 s, 55 °C for 1 min, and 65 °C for 180 s and three cycles of 95 °C for 15 s, 55 °C for 1 min, and 65 °C for 210 s (+10 s/cycle) were carried out, followed by a final extension step of 7 min. For Protocol 2, 1 µl of a 1/5 dilution of the original WTA sample was used for five separate PCR reactions with a total volume of 20 µl per reaction with 2.4 µM final concentration of primer Cp2-BglI-13C (sequence above) for amplification. After an initial denaturation at 94 °C for 2 min, eight cycles of 94 °C for 15 s, 68 °C for 1 min, and 68 °C for 240 were carried out, followed by a final extension step of 7 min. Following the PCR, the single PCR reactions were pooled and the remaining steps were identical for both protocols. Resulting cDNA products were purified with 1.8 volume of Ampure XP beads (Beckman Coulter, USA) according to the manufacturer’s instructions and eluted in 40 µl of distilled water. Next, 5 µl of EcoRI buffer supplemented with 80 µM S-adenosyl methionine (New England Biolabs, Germany), 2.5 µl distilled water, and 2.5 µl BpuEI (5 U/µl) were added for a total volume of 50 µl and incubated at 37 °C for 1 h, followed by heat inactivation of the enzyme for 20 min at 65 °C. Subsequently, 1 µl of EcoRI buffer supplemented with 80 µM S-adenosyl methionine, 6.5 µl distilled water, and 2.5 µl BglI (10 U/µl) were added for a final volume of 60 µl and incubated for 3 h at 37 °C, followed by heat inactivation of the enzyme for 20 min at 65 °C. The complete restriction digest was purified with 1.8 volume of Ampure XP beads according to the manufacturer’s instructions and eluted in 16 µl of 10 mM Tris-Cl, pH 8.5 (Elution buffer, Qiagen, Germany). The length distribution of purified cDNA populations was determined on the Bioanalyzer 2100 (Agilent Technologies, USA). Optimal Covaris settings for fragmentation of each purified cDNA sample to 350 bp insert size were determined on the basis of the average length distribution. Subsequently, sequencing libraries were prepared according to the TruSeq DNA PCR-Free Library Prep Kit (Illumina, USA). Resulting libraries were quantified with KAPA Library Quantification Kit for Illumina Platforms (Kapa Biosystems, RSA), pooled in equal molar ratios, and sequenced on Illumina NovaSeq 6000 platforms.
IL-6, IL-6RA, and gp130 mRNA expression analysis in single cells
IL-6, membrane IL-6 receptor, spliced IL-6 receptor, and gp130 expression was assessed by PCR using the MJ Research Peltier Thermal Cycler Tetrad (Bio-Rad, Germany) with the following primers: IL-6 (forward primer, 5′-GAG AAA GGA GAC ATG TAA CAA GAG T-3′ and reverse primer, 5′-GCG CAG AAT GAG ATG AGT TGT-3′, with an annealing temperature of 62 °C and an amplicon size of 388 bp), membrane vs. spliced IL-6RA (forward primer, 5′-CTG CAA ATG CGA CAA GCC TC-3′ and reverse primer, 5′-GTG CCA CCC AGC CAG CTA TC-3′, with an annealing temperature of 62 °C). The spliced and membrane-bound IL-6 receptor can be distinguished according to their PCR product size: mIL-6RA 380 bp and spliced IL-6RA 286 bp. Gp130 forward primer was 5′-GGA CCA AAG ATG CCT CAA CT-3′ and reverse primer was 5′-GGC AAT GTC TTC CAC ACG A-3′, with an annealing temperature of 58 °C and an amplicon size of 280 bp.
gp130 expression was assessed by quantitative PCR (qPCR) on reamplified and purified (Qiagen PCR Purification Kit, Qiagen, Germany) WTA products of single cells74. To normalize for the template input, quantification of yields in the individual samples was spectrophotometrically conducted using the NanoDrop 2000 instrument. The DNA input for each qPCR of a single cell was normalized to 2.5 ng and the qPCR run74 with the following primers: gp130 forward primer, 5′-ATA TTG CCC AGT GGT CAC CT-3′ and reverse primer, 5′-AGG CTT TTT GTC ATT TGC TTC T-3′, with an annealing temperature of 58 °C and an amplicon size of 125 bp. Fold changes in gp130 expression were calculated from the delta Cp values between MCF-10A or MCF-7 cultured with and without MSCs.
Flow cytometry
Spheres or adherent cells were trypsinized with trypsin/EDTA (Pan-Biotech, Germany) for 3 min, if not stated otherwise. MSC monocultures and cocultures of MCF-10A-GFP cells with MSCs, OBs, and HUVECs were harvested by trypsin/EDTA (Pan-Biotech, Germany) for 5 min and using cell-scrapers. To reduce nonspecific binding single-cell suspensions were incubated for 5 min at 4 °C with PBS/10% AB serum (Bio-Rad, Germany), subsequently stained with fluorescence-labeled or biotinylated antibodies for 15 min at 4 °C and washed once with PBS/2% FCS/0.01% NaN3. In case of biotinylated primary antibodies, phycoerythrin (PE)-labeled streptavidin (Dianova, Germany) was used as a secondary staining reagent. Cells were stained using the following antibodies: anti-human CD24-APC (ML5), anti-human CD34-PE (581), anti-human CD44-V450 (G44-26), anti-human CD45-FITC, APC, or PerCP-Cy5.5 (HI30), anti-human CD90 Alexa Flour 700 (5E10), anti-human CD105-FITC (43A3), anti-human CD130-APC (2E1B02), anti-human Nestin-PE (10C2), biotinylated anti-human IL6R (UV4), isotype control mouse IgG2a-APC (MOPC-21), isotype control mouse IgG2b-V450 (MOPC-21), isotype control IgG1-biotin (MOPC-21) (all purchased from BioLegend, Germany), and anti-human EpCAM (HEA-125, Miltenyi Biotech, Germany). Viability dye eFlour 780 (ebioscience, Germany) was used for live/dead cell discrimination. Cells were analyzed on an LSR II machine equipped with the FACS DIVA 5.03 software (BD Bioscience, Germany) and data were analyzed with FlowJo 8.8.6, 10.1, or 10.5.3 (Treestar, USA). Sorting of PKH26-labeled LRC and nLRCs was performed with a FACSAria cell sorter (BD Bioscience, Germany).
IL-6 and sIL-6RA detection by ELISA
IL-6 and sIL-6RA concentrations were assessed in 100 µl cultured media obtained from HMECs or MCF-10A cells propagated under anchorage-dependent or anchorage-independent conditions with the Human IL-6 DuoSet or Human sIL-6R alpha DuoSet ELISA Kit (R&D Systems, Germany) following the manufacturer’s recommendations.
Inhibition of ADAM proteases
MCF-10A cells were treated for 48 h with 20 µM TAPI-2 (TNF protease inhibitor 2) acetate salt (Sigma-Aldrich, Germany). The culture supernatant was tested for the presence of IL-6 and sIL-6RA by ELISA.
Immuno-(western) blotting
Cell lysates were prepared using ice-cold RIPA buffer (radioimmunoprecipitation assay buffer) supplemented with cOmplete, EDTA-free Protease Inhibitor Cocktail and PhosSTOPTM (all from Sigma-Aldrich, USA). The protein concentration of lysates was determined with PierceTM BCA Protein Assay Kit (Thermo Fisher Scientific, USA). Cell lysates were mixed with 4× Laemmli Sample Buffer (Bio-Rad, USA) containing 10% 2-mercaptoethanol (Sigma-Aldrich, USA) and denatured for 5 min at 95 °C. Ten micrograms of protein/lane were loaded on 12% Mini-PROTEAN® TGXTM Gels (Bio-Rad, USA) and protein separation was performed with sodium dodecyl sulfate (SDS)-polyacrylamide gel electrophoresis running buffer (25 mM Tris, 192 mM glycine, 0.1% SDS). Proteins were blotted onto Immobilon-P PVDF Membranes (Millipore, USA). For washing of membranes TBS-T (137 mM NaCl, 20 mM Tris, 0.05% (w/v) Tween-20, pH 7.6) was used. To detect signaling protein, the following primary antibodies (all from Cell Signaling Technology, USA) were used at dilutions according to the manufacturer’ s instructions: anti-phospho-STAT3Tyr705 (clone D3A7, 1:2000), anti-phospho-AKTSer473 (clone D9E, 1:2000), anti-phospho-ERK1/2Thr202/Tyr204 (clone E10, 1:2000), anti-STAT3 (clone 124H6, 1:1000), anti-AKT (clone 40D4, 1:2000), and anti-ERK1/2 (clone 137F5, 1:1000). As a loading control, an anti-α-tubulin antibody (Sigma-Aldrich, USA, clone DM1A; 1:5000) was used. This was followed by incubation with horseradish peroxidase-conjugated goat anti-rabbit IgG (anti-phospho-STAT3Tyr705, anti-phospho-AKTSer473, anti-ERK1/2) or goat anti-mouse IgG (anti-STAT3, anti-AKT, anti-phospho-ERK1/2Thr202/Tyr204, anti-α-Tubulin). Secondary antibodies were used at 1:10000 (both from Sigma-Aldrich, USA). Protein bands were visualized using SuperSignalTM West Pico PLUS Chemiluminescent Substrate (Thermo Fisher Scientific, USA). Chemiluminescence was recorded by a ChemiDocTM MP Imaging System (Bio-Rad, USA) and analyzed with Image LabTM (version 6.1, Bio-Rad, USA). Membranes were stripped for reprobing using with anti-AKT, anti-ERK1/2, anti-STAT or anti-α-Tubulin using Restore™ Plus Western Blot Stripping Buffer (Thermo Fisher Scientific, USA) for 15 min at RT. Removal of the enzyme conjugate was tested by incubating the membrane with new chemiluminescent substrate working solution. We noted that remnant anti-phospho-AKT and anti-phospho-ERK1/2 antibodies reduced anti-AKT and anti-ERK1/2 antibody binding.
MCF-10A HIL6/IL-6 stimulation mRNA microarray data
Gene expression data were obtained using the Agilent Whole Human Genome Microarray Kit (4x44K) and quality assessed by inspection of chip raw images and gene expression frequency distributions. All 24 expression profiles (three biological replicates, four treatment groups, two time points) were of sufficiently high quality for further bioinformatic analysis. Raw gene expression data were background corrected (limma Bioconductor package75, version 3.36.5, normexp method), log2-transformed, and normalized by quantile normalization. Replicated probes (identical Agilent IDs) were replaced by their median per sample. Gene ranking was performed using empirical array quality weights76 and linear models from the limma Bioconductor package (version 3.36.5) using standard treatment vs. control contrasts. Gene annotation (Supplementary Table 3) was obtained by aligning Agilent oligo sequences to NCBI RefSeq genes (August 8, 2019) using BLAST77 (version 2.9.0) requiring 100% identical matches, a maximum length difference between oligo and target sequence of one, and <100 hits per oligo. In addition, ensembl annotation78 (version 97) was retrieved and used as a secondary information source (e.g., for oligos that were unannotated by NCBI RefSeq). GENCODE metadata (version 32)79 were used as complementary annotation. For gene lists, graphical display, and functional annotation, probes targeting the same gene were disambiguated by retaining only the probe with the lowest p value. Differential gene expression was defined by a maximum false discovery rate (FDR)-adjusted p value of 0.05 and a minimum absolute log2fold change of log2(1.5) = 0.58. Computations were performed using R version 3.5.180.
Mammary cell subpopulation mRNA microarray data
Human mRNA expression data from Lim et al.41 based on Illumina HumanWG-6 v3.0 BeadChip microarrays were downloaded from the Gene Expression Omnibus (series GSE16997). Data preprocessing, analysis, and annotation was performed analogous to the procedure detailed above for MCF-10A cells, except that the linear model included all pairwise contrasts between the three cell types.
Fold-change analysis of the MCF-10A and mammary cell subpopulation data were performed by first selecting a pairwise comparison from the MCF-10A data (e.g., classical IL-6 stimulation vs. control) and another from the mammary subpopulation data (e.g., LumProg vs. MatLum), each performed according to moderated t-testing (limma Bioconductor package, version 3.36.5). The differential gene lists of both comparisons were intersected and the randomness of their overlap quantified using hypergeometric testing (Supplementary Table 6). Second, the log fold changes of both comparisons were correlated without centering (i.e., without subtracting the respective group means) because reference to zero log fold was intended. Correlation p values were calculated according to centered Pearson’s correlation.
LRC/QSC/nLRC mRNA microarray data
Gene expression data were obtained using the Agilent Whole Human Genome Microarray Kit (4x44K). All chips passed quality assessment and were preprocessed and annotated as described for MCF-10A cells above, except that no fold-change limit (originally used in the data of Lim et al.41 and thus also employed for MCF-10A cells) was applied. The data showed patient effects that were accounted for by including patient IDs as second covariate in the linear model after safeguarding independence between patients and sample groups: Cramer’s V with bias correction = 0; R-package rcompanion version 2.3.781. For graphical display, these effects were compensated by using the function removeBatchEffect from the Bioconductor package limma (version 3.36.5). Dimension reduction to two-dimensional according to t-SNE (Rtsne R-package, version 0.1582; Fig. 2c) and pairwise differential expression analysis (number of differentially expressed genes was: 35 for nLRC vs. QSC, 127 for LRC vs. nLRC, and 163 for LRC vs. QSC; FDR-adjusted p value <0.05; Supplementary Table 2) revealed that nLRC and QSC were much more similar to each other as compared to LRC. To concentrate on the main effects, nLRC and QSC were pooled resulting in 216 differentially expressed genes for LRC vs. (nLRC + QSC). Enrichment analysis was aimed at the NCI-Nature Pathway Interaction Database83 for its focus on cancer research and treatment and conducted using the R-package enrichR84 (version 2.1).
DCC and HD mRNA sequencing data
The sequencing quality was evaluated per sample with FastQC (version 0.11.8)85 and in a multisample comparison with MultiQC (version 1.8)86 before and after adapter trimming and contamination screening. Briefly, raw sequencing data of single cells (30 M0-stage DCCs, 11 M1-stage DCCs, and 15 EpCAM+ cells from HDs from 21, 5, and 7 patients, respectively) were trimmed, and remaining adapter sequences as well as low sequencing quality bases at the end of each read were removed using BBDuk87. In order to increase the mapping quality (lowering false-positive alignments), read decontamination was performed using BioBloom Tools88 with filters for the genomes of Homo sapiens (hg38), Mus musculus (mm10), Escherichia coli (BL21), Mycoplasma pneumoniae (M129), Sphingobium sp. (SYK-6), Bradyrhizobium japonicum (USDA 110), Pichia pastoris (GS115), Malessia globosa (CBS 7966), Aspergillus fumigatus (Af293), and a set of viral genomes (RefSeq, 5k+ genomes). All reads that did not map exclusively to hg38 (GENCODE version 27, GRCh38.p10) or did not map at all were defined as likely contaminations and discarded from downstream processing. Subsequently, the cleaned sample reads were aligned to the reference genome hg38 with STAR (version 2.5.1b)89. Uniquely mapped reads were counted per gene per sample using featureCounts from Subread90. We performed quality control and checked for outlier samples with the Bioconductor package scater (version 1.12.2)91 using the functions calculateQCMetrics and plotPCA for QC metrics with outlier detection enabled. The results showed that none of the samples was an outlier. Thus, we kept all samples for further analysis. Samples were sequenced in two batches with only very little association between batches and phenotype (M0/M1/HD): Cramer’s V with bias correction = 0.11; R-package rcompanion81 (version 2.3.7). We applied the multiBatchNorm (Bioconductor package: batchelor 1.0.192) to all cells and further rescaleBatches (Bioconductor package: batchelor 1.0.1) to DCCs to remove batch effects and get the log2 normalized counts. After batch correction, we obtained 8626 and 7359 expressed genes for HD cells and DCCs on average, respectively.
The top 500 most variable genes were analyzed using principal component analysis (PCA). From the genes annotated by GO terms containing “B cell,” “epithelial,” or “epithelium,” the top 100 most variable were subjected to PCA. PCAs were calculated using prcomp (R stats package).
For pathway enrichment analysis, we filtered for protein coding genes and compiled 2 × 2 contingency tables for each sample and each pathway according to whether genes were expressed (log2 (normalized counts) > 0) and present in the pathway. Contingency tables were subsequently evaluated according to one-tailed Fisher’s exact test (R stats package). Calculations were performed using R version 3.6.0.
Analysis of CNAs
To enable the combined analysis of mCGH- and LowPass-Seq-derived CNA profiles, the genomic coordinates obtained with the LowPass bioinformatics analysis pipeline (Menarini Silicon Biosystems, Italy) were converted to cytoband information using a custom script for R80 and the UCSC Goldenpath reference (version hg3893). Afterwards, the aberrations were manually screened and compared to the respective CNA profile images before being annotated according to the specifications of the International System for Human Cytogenetic Nomenclature (ISCN)94. Small aberrations <1 Mb as well as recurring technical artifacts in chromosome 1p and centromeric and telomeric regions were excluded. Finally, the combined ISCN-annotated aberration data (mCGH and LowPass-Seq) were stratified into M0 and M1 groups and submitted to the Progenetix user data tool95 to generate individual frequency plots for M0 and M1 cells.
Statistical analysis
Statistical analysis was performed using the GraphPad Prism 6.0 software (GraphPad Software, Inc., USA). Differences in mean values between groups were analyzed by Student’s t test, Mann–Whitney test, or one-way ANOVA, followed by post hoc statistical testing, where appropriate. Time dependencies were analyzed by regression analysis (F test). Independence in contingency tables was assessed by Fisher’s exact test. All tests were realized two-sided. A p value < 0.05 was considered statistically significant.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank Irene Nebeja, Irina Hartmann, Michaela Becker, and Silvia Materna-Reichelt for excellent technical assistance. We are grateful to Alberto Bardelli for hTERT-HME1 cell lines and Cyrus Ghajar for E4ORF1 HUVECs. This work was supported by grants to C.A.K. from the Deutsche Krebshilfe (TransLUMINAL-B 111536, to C.A.K. and BP DETECT CTC Project 6, 70112504), the Dr. Josef Steiner foundation, the ERC (grant 322602), the Bavarian ministry of economic affairs, energy and technology (AZ 20-3410.1-1-1), the Deutsche Forschungsgemeinschaft (DFG; KL 1233/10-2), and the Bavarian Research Foundation (Bayerische Forschungsstiftung, DOK-165-13). The work of M.W.-K. has been funded by the DFG (WE 4632/1-1, WE 4632-4/1. and WE 4632/5-1 within the FOR2127). TissueFAX was supported by the DFG (INST 89/341-1 FUGG). The work of S.R.-J. has been funded by grants from DFG in the SFB841 (project C1), the SFB 877 (projects A1 and A14), by the Deutsche Krebshilfe, and by the German-Israeli Foundation for Scientific Research and Development.
Source data
Author contributions
Conception and design: C.A.K. and M.W.-K. Development of methodology: M.W.-K., A.G., M.O., M.H., S.K., S.T., J.W., K.H., and S.R.-J. Acquisition of data: M.W.-K., A.G., C.I., M.O., S.T., C.K., C.B., K.W., C.W., S.K., M.G., K.H., G.F., I.B., S.G., E.S., G.H., N.P., S.G., S.H., B.R., N.H., S.B., P.R., and N.H. Analysis and interpretation of data: M.W.-K., A.G., M.H., X.L., H.K.-Q., B.P., J.W., K.H., Z.C., C.A.K. Writing of the manuscript: M.W.-K., M.H., and C.A.K. Review and/or revision of the manuscript: all authors.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
The microarray (Figs. 2c and 5c) and RNA-Seq (Fig. 3b and Supplementary Fig. 1c) data have been deposited in the European Genome-phenome Archive under the accession code EGAS00001004597. All the other data supporting the findings of this study are available within the article and its Supplementary information files and from the corresponding author upon reasonable request. A reporting summary for this article is available as a Supplementary information file. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Melanie Werner-Klein, Ana Grujovic.
Contributor Information
Melanie Werner-Klein, Email: melanie.werner-klein@ukr.de.
Christoph A. Klein, Email: Christoph.klein@ukr.de
Supplementary information
Supplementary information is available for this paper at 10.1038/s41467-020-18701-4.
References
- 1.Engel J, et al. The process of metastasisation for breast cancer. Eur. J. Cancer. 2003;39:1794–1806. doi: 10.1016/S0959-8049(03)00422-2. [DOI] [PubMed] [Google Scholar]
- 2.Klein CA. Parallel progression of primary tumours and metastases. Nat. Rev. Cancer. 2009;9:302–312. doi: 10.1038/nrc2627. [DOI] [PubMed] [Google Scholar]
- 3.Klein CA. Framework models of tumor dormancy from patient-derived observations. Curr. Opin. Genet. Dev. 2011;21:42–49. doi: 10.1016/j.gde.2010.10.011. [DOI] [PubMed] [Google Scholar]
- 4.Harper KL, et al. Mechanism of early dissemination and metastasis in Her2(+) mammary cancer. Nature. 2016;540:588–592. doi: 10.1038/nature20609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hosseini H, et al. Early dissemination seeds metastasis in breast cancer. Nature. 2016;540:552–558. doi: 10.1038/nature20785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Husemann Y, et al. Systemic spread is an early step in breast cancer. Cancer Cell. 2008;13:58–68. doi: 10.1016/j.ccr.2007.12.003. [DOI] [PubMed] [Google Scholar]
- 7.Weedon-Fekjaer H, Lindqvist BH, Vatten LJ, Aalen OO, Tretli S. Breast cancer tumor growth estimated through mammography screening data. Breast Cancer Res. 2008;10:R41. doi: 10.1186/bcr2092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Schmidt-Kittler O, et al. From latent disseminated cells to overt metastasis: genetic analysis of systemic breast cancer progression. Proc. Natl Acad. Sci. USA. 2003;100:7737–7742. doi: 10.1073/pnas.1331931100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Werner-Klein M, et al. Genetic alterations driving metastatic colony formation are acquired outside of the primary tumour in melanoma. Nat. Commun. 2018;9:595. doi: 10.1038/s41467-017-02674-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Braun S, et al. A pooled analysis of bone marrow micrometastasis in breast cancer. N. Engl. J. Med. 2005;353:793–802. doi: 10.1056/NEJMoa050434. [DOI] [PubMed] [Google Scholar]
- 11.Riethdorf S, Wikman H, Pantel K. Review: biological relevance of disseminated tumor cells in cancer patients. J. Int. Cancer. 2008;123:1991–2006. doi: 10.1002/ijc.23825. [DOI] [PubMed] [Google Scholar]
- 12.Pantel K, Brakenhoff RH. Dissecting the metastatic cascade. Nat. Rev. Cancer. 2004;4:448–456. doi: 10.1038/nrc1370. [DOI] [PubMed] [Google Scholar]
- 13.Guzvic M, et al. Combined genome and transcriptome analysis of single disseminated cancer cells from bone marrow of prostate cancer patients reveals unexpected transcriptomes. Cancer Res. 2014;74:7383–7394. doi: 10.1158/0008-5472.CAN-14-0934. [DOI] [PubMed] [Google Scholar]
- 14.Klein CA, et al. Combined transcriptome and genome analysis of single micrometastatic cells. Nat. Biotechnol. 2002;20:387–392. doi: 10.1038/nbt0402-387. [DOI] [PubMed] [Google Scholar]
- 15.Schardt JA, et al. Genomic analysis of single cytokeratin-positive cells from bone marrow reveals early mutational events in breast cancer. Cancer Cell. 2005;8:227–239. doi: 10.1016/j.ccr.2005.08.003. [DOI] [PubMed] [Google Scholar]
- 16.Pantel K, et al. Establishment of micrometastatic carcinoma cell lines: a novel source of tumor cell vaccines. J. Natl Cancer Inst. 1995;87:1162–1168. doi: 10.1093/jnci/87.15.1162. [DOI] [PubMed] [Google Scholar]
- 17.Hosch S, et al. Malignant potential and cytogenetic characteristics of occult disseminated tumor cells in esophageal cancer. Cancer Res. 2000;60:6836–6840. [PubMed] [Google Scholar]
- 18.O’Sullivan GC, et al. Micrometastases in esophagogastric cancer: high detection rate in resected rib segments. Gastroenterology. 1999;116:543–548. doi: 10.1016/S0016-5085(99)70175-7. [DOI] [PubMed] [Google Scholar]
- 19.Hartmann CH, Klein CA. Gene expression profiling of single cells on large-scale oligonucleotide arrays. Nucleic Acids Res. 2006;34:e143. doi: 10.1093/nar/gkl740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dontu G, et al. In vitro propagation and transcriptional profiling of human mammary stem/progenitor cells. Genes Dev. 2003;17:1253–1270. doi: 10.1101/gad.1061803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Pece S, et al. Biological and molecular heterogeneity of breast cancers correlates with their cancer stem cell content. Cell. 2010;140:62–73. doi: 10.1016/j.cell.2009.12.007. [DOI] [PubMed] [Google Scholar]
- 22.Suzuki T, et al. Mice produced by mitotic reprogramming of sperm injected into haploid parthenogenotes. Nat. Commun. 2016;7:12676. doi: 10.1038/ncomms12676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Cackowski FC, et al. Detection and isolation of disseminated tumor cells in bone marrow of patients with clinically localized prostate cancer. Prostate. 2019;79:1715–1727. doi: 10.1002/pros.23896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Jakubek YA, et al. Large-scale analysis of acquired chromosomal alterations in non-tumor samples from patients with cancer. Nat. Biotechnol. 2020;38:90–96. doi: 10.1038/s41587-019-0297-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Loh PR, et al. Insights into clonal haematopoiesis from 8,342 mosaic chromosomal alterations. Nature. 2018;559:350–355. doi: 10.1038/s41586-018-0321-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Vattathil S, Scheet P. Extensive hidden genomic mosaicism revealed in normal tissue. Am. J. Hum. Genet. 2016;98:571–578. doi: 10.1016/j.ajhg.2016.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Bauer J, Bastian BC. Distinguishing melanocytic nevi from melanoma by DNA copy number changes: comparative genomic hybridization as a research and diagnostic tool. Dermatol. Ther. 2006;19:40–49. doi: 10.1111/j.1529-8019.2005.00055.x. [DOI] [PubMed] [Google Scholar]
- 28.Klein CA, et al. The hematopoietic system-specific minor histocompatibility antigen HA-1 shows aberrant expression in epithelial cancer cells. J. Exp. Med. 2002;196:359–368. doi: 10.1084/jem.20011838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lim E, et al. Aberrant luminal progenitors as the candidate target population for basal tumor development in BRCA1 mutation carriers. Nat. Med. 2009;15:907–913. doi: 10.1038/nm.2000. [DOI] [PubMed] [Google Scholar]
- 30.Sansone P, et al. IL-6 triggers malignant features in mammospheres from human ductal breast carcinoma and normal mammary gland. J. Clin. Investig. 2007;117:3988–4002. doi: 10.1172/JCI32533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Schaper F, Rose-John S. Interleukin-6: biology, signaling and strategies of blockade. Cytokine Growth Factor Rev. 2015;26:475–487. doi: 10.1016/j.cytogfr.2015.07.004. [DOI] [PubMed] [Google Scholar]
- 32.Scheller J, Chalaris A, Schmidt-Arras D, Rose-John S. The pro- and anti-inflammatory properties of the cytokine interleukin-6. Biochim. Biophys. Acta. 1813;878–888:2011. doi: 10.1016/j.bbamcr.2011.01.034. [DOI] [PubMed] [Google Scholar]
- 33.Al-Hajj M, Wicha MS, Benito-Hernandez A, Morrison SJ, Clarke MF. Prospective identification of tumorigenic breast cancer cells. Proc. Natl Acad. Sci. USA. 2003;100:3983–3988. doi: 10.1073/pnas.0530291100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ginestier C, et al. ALDH1 is a marker of normal and malignant human mammary stem cells and a predictor of poor clinical outcome. Cell Stem Cell. 2007;1:555–567. doi: 10.1016/j.stem.2007.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Di Nicolantonio F, et al. Replacement of normal with mutant alleles in the genome of normal human cells unveils mutation-specific drug responses. Proc. Natl Acad. Sci. USA. 2008;105:20864–20869. doi: 10.1073/pnas.0808757105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Rose-John S. IL-6 trans-signaling via the soluble IL-6 receptor: importance for the pro-inflammatory activities of IL-6. Int. J. Biol. Sci. 2012;8:1237–1247. doi: 10.7150/ijbs.4989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Fischer M, et al. I. A bioactive designer cytokine for human hematopoietic progenitor cell expansion. Nat. Biotechnol. 1997;15:142–145. doi: 10.1038/nbt0297-142. [DOI] [PubMed] [Google Scholar]
- 38.Jostock T, et al. Soluble gp130 is the natural inhibitor of soluble interleukin-6 receptor transsignaling responses. Eur. J. Biochem. 2001;268:160–167. doi: 10.1046/j.1432-1327.2001.01867.x. [DOI] [PubMed] [Google Scholar]
- 39.Scheller J, Schuster B, Holscher C, Yoshimoto T, Rose-John S. No inhibition of IL-27 signaling by soluble gp130. Biochem. Biophys. Res. Commun. 2005;326:724–728. doi: 10.1016/j.bbrc.2004.11.098. [DOI] [PubMed] [Google Scholar]
- 40.Mani SA, et al. The epithelial–mesenchymal transition generates cells with properties of stem cells. Cell. 2008;133:704–715. doi: 10.1016/j.cell.2008.03.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Lim E, et al. Transcriptome analyses of mouse and human mammary cell subpopulations reveal multiple conserved genes and pathways. Breast Cancer Res. 2010;12:R21. doi: 10.1186/bcr2560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ghajar CM, et al. The perivascular niche regulates breast tumour dormancy. Nat. Cell Biol. 2013;15:807–817. doi: 10.1038/ncb2767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Thiel S, et al. Constitutive internalization and association with adaptor protein-2 of the interleukin-6 signal transducer gp130. FEBS Lett. 1998;441:231–234. doi: 10.1016/S0014-5793(98)01559-2. [DOI] [PubMed] [Google Scholar]
- 44.Shiozawa Y, et al. Human prostate cancer metastases target the hematopoietic stem cell niche to establish footholds in mouse bone marrow. J. Clin. Investig. 2011;121:1298–1312. doi: 10.1172/JCI43414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Xie Z, et al. Interleukin-6/interleukin-6 receptor complex promotes osteogenic differentiation of bone marrow-derived mesenchymal stem cells. Stem Cell Res. Ther. 2018;9:13. doi: 10.1186/s13287-017-0766-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Vermes C, et al. Shedding of the interleukin-6 (IL-6) receptor (gp80) determines the ability of IL-6 to induce gp130 phosphorylation in human osteoblasts. J. Biol. Chem. 2002;277:16879–16887. doi: 10.1074/jbc.M200546200. [DOI] [PubMed] [Google Scholar]
- 47.Erices A, Conget P, Rojas C, Minguell JJ. Gp130 activation by soluble interleukin-6 receptor/interleukin-6 enhances osteoblastic differentiation of human bone marrow-derived mesenchymal stem cells. Exp. Cell Res. 2002;280:24–32. doi: 10.1006/excr.2002.5627. [DOI] [PubMed] [Google Scholar]
- 48.Girasole G, Giuliani N, Modena AB, Passeri G, Pedrazzoni M. Oestrogens prevent the increase of human serum soluble interleukin-6 receptor induced by ovariectomy in vivo and decrease its release in human osteoblastic cells in vitro. Clin. Endocrinol. (Oxf.) 1999;51:801–807. doi: 10.1046/j.1365-2265.1999.00896.x. [DOI] [PubMed] [Google Scholar]
- 49.Schafer ZT, Brugge JS. IL-6 involvement in epithelial cancers. J. Clin. Investig. 2007;117:3660–3663. doi: 10.1172/JCI34237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Heinrich PC, et al. Principles of interleukin (IL)-6-type cytokine signalling and its regulation. Biochem. J. 2003;374:1–20. doi: 10.1042/bj20030407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Koren S, et al. PIK3CA(H1047R) induces multipotency and multi-lineage mammary tumours. Nature. 2015;525:114–118. doi: 10.1038/nature14669. [DOI] [PubMed] [Google Scholar]
- 52.Polzer B, et al. Molecular profiling of single circulating tumor cells with diagnostic intention. EMBO Mol. Med. 2014;6:1371–1386. doi: 10.15252/emmm.201404033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kennecke H, et al. Metastatic behavior of breast cancer subtypes. J. Clin. Oncol. 2010;28:3271–3277. doi: 10.1200/JCO.2009.25.9820. [DOI] [PubMed] [Google Scholar]
- 54.Peters M, et al. The function of the soluble interleukin 6 (IL-6) receptor in vivo: sensitization of human soluble IL-6 receptor transgenic mice towards IL-6 and prolongation of the plasma half-life of IL-6. J. Exp. Med. 1996;183:1399–1406. doi: 10.1084/jem.183.4.1399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Werner-Klein M, et al. Immune humanization of immunodeficient mice using diagnostic bone marrow aspirates from carcinoma patients. PLoS ONE. 2014;9:e97860. doi: 10.1371/journal.pone.0097860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Keller PJ, et al. Defining the cellular precursors to human breast cancer. Proc. Natl Acad. Sci. USA. 2012;109:2772–2777. doi: 10.1073/pnas.1017626108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Morel AP, et al. A stemness-related ZEB1-MSRB3 axis governs cellular pliancy and breast cancer genome stability. Nat. Med. 2017;23:568–578. doi: 10.1038/nm.4323. [DOI] [PubMed] [Google Scholar]
- 58.Fu NY, Nolan E, Lindeman GJ, Visvader JE. Stem cells and the differentiation hierarchy in mammary gland development. Physiol. Rev. 2020;100:489–523. doi: 10.1152/physrev.00040.2018. [DOI] [PubMed] [Google Scholar]
- 59.Abu-Tayeh H, et al. ‘Normalizing’ the malignant phenotype of luminal breast cancer cells via alpha(v)beta(3)-integrin. Cell Death Dis. 2016;7:e2491. doi: 10.1038/cddis.2016.387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Bernad A, et al. Interleukin-6 is required in vivo for the regulation of stem cells and committed progenitors of the hematopoietic system. Immunity. 1994;1:725–731. doi: 10.1016/S1074-7613(94)80014-6. [DOI] [PubMed] [Google Scholar]
- 61.Rodriguez Mdel C, Bernad A, Aracil M. Interleukin-6 deficiency affects bone marrow stromal precursors, resulting in defective hematopoietic support. Blood. 2004;103:3349–3354. doi: 10.1182/blood-2003-10-3438. [DOI] [PubMed] [Google Scholar]
- 62.Abrahamsen B, Bonnevie-Nielsen V, Ebbesen EN, Gram J, Beck-Nielsen H. Cytokines and bone loss in a 5-year longitudinal study-hormone replacement therapy suppresses serum soluble interleukin-6 receptor and increases interleukin-1-receptor antagonist: the Danish Osteoporosis Prevention Study. J. Bone Miner. Res. 2000;15:1545–1554. doi: 10.1359/jbmr.2000.15.8.1545. [DOI] [PubMed] [Google Scholar]
- 63.Giuliani N, et al. Serum interleukin-6, soluble interleukin-6 receptor and soluble gp130 exhibit different patterns of age- and menopause-related changes. Exp. Gerontol. 2001;36:547–557. doi: 10.1016/S0531-5565(00)00220-5. [DOI] [PubMed] [Google Scholar]
- 64.Diessner J, et al. Evaluation of clinical parameters influencing the development of bone metastasis in breast cancer. BMC Cancer. 2016;16:307. doi: 10.1186/s12885-016-2345-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Klein CA, Holzel D. Systemic cancer progression and tumor dormancy: mathematical models meet single cell genomics. Cell Cycle. 2006;5:1788–1798. doi: 10.4161/cc.5.16.3097. [DOI] [PubMed] [Google Scholar]
- 66.Demeulemeester J, et al. Tracing the origin of disseminated tumor cells in breast cancer using single-cell sequencing. Genome Biol. 2016;17:250. doi: 10.1186/s13059-016-1109-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Yates LR, et al. Genomic evolution of breast cancer metastasis and relapse. Cancer Cell. 2017;32:169–184 e167. doi: 10.1016/j.ccell.2017.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Fehm T, et al. HER2 status of circulating tumor cells in patients with metastatic breast cancer: a prospective, multicenter trial. Breast Cancer Res. Treat. 2010;124:403–412. doi: 10.1007/s10549-010-1163-x. [DOI] [PubMed] [Google Scholar]
- 69.Riethdorf S, et al. Detection of circulating tumor cells in peripheral blood of patients with metastatic breast cancer: a validation study of the CellSearch system. Clin. Cancer Res. 2007;13:920–928. doi: 10.1158/1078-0432.CCR-06-1695. [DOI] [PubMed] [Google Scholar]
- 70.Soule HD, et al. Isolation and characterization of a spontaneously immortalized human breast epithelial cell line, MCF-10. Cancer Res. 1990;50:6075–6086. [PubMed] [Google Scholar]
- 71.Delorme B, Charbord P. Culture and characterization of human bone marrow mesenchymal stem cells. Methods Mol. Med. 2007;140:67–81. doi: 10.1007/978-1-59745-443-8_4. [DOI] [PubMed] [Google Scholar]
- 72.Klein CA, et al. Genetic heterogeneity of single disseminated tumour cells in minimal residual cancer. Lancet. 2002;360:683–689. doi: 10.1016/S0140-6736(02)09838-0. [DOI] [PubMed] [Google Scholar]
- 73.Klein CA, et al. Comparative genomic hybridization, loss of heterozygosity, and DNA sequence analysis of single cells. Proc. Natl Acad. Sci. USA. 1999;96:4494–4499. doi: 10.1073/pnas.96.8.4494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Durst FC, et al. Targeted transcript quantification in single disseminated cancer cells after whole transcriptome amplification. PLoS ONE. 2019;14:e0216442. doi: 10.1371/journal.pone.0216442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Ritchie ME, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Ritchie ME, et al. Empirical array quality weights in the analysis of microarray data. BMC Bioinform. 2006;7:261. doi: 10.1186/1471-2105-7-261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 78.Zerbino DR, et al. Ensembl 2018. Nucleic Acids Res. 2018;46:D754–D761. doi: 10.1093/nar/gkx1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Frankish A, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019;47:D766–D773. doi: 10.1093/nar/gky955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, 2017).
- 81.Mangiafico, S. S. Summary and Analysis of Extension Program Evaluation in R, Version 1.18.1 (Rutgers Cooperative Extension, New Brunswick, 2016).
- 82.van der Maaten LHG. Visualizing data using t-SNe. J. Mach. Learn. Res. 2008;9:2579–2605. [Google Scholar]
- 83.Schaefer CF, et al. PID: the Pathway Interaction Database. Nucleic Acids Res. 2009;37:D674–D679. doi: 10.1093/nar/gkn653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Kuleshov MV, et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44:W90–W97. doi: 10.1093/nar/gkw377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Babraham Bioinformatics. FastQC. https://www.bioinformatics.babraham.ac.uk/projects/fastqc (2019).
- 86.Ewels P, Magnusson M, Lundin S, Kaller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016;32:3047–3048. doi: 10.1093/bioinformatics/btw354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.JGI DataScience. BBtools software suite. https://jgi.doe.gov/data-and-tools/bb-tools (2019).
- 88.Chu J, et al. BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters. Bioinformatics. 2014;30:3402–3404. doi: 10.1093/bioinformatics/btu558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Liao Y, Smyth GK, Shi W. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res. 2013;41:e108. doi: 10.1093/nar/gkt214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.McCarthy DJ, Campbell KR, Lun AT, Wills QF. Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R. Bioinformatics. 2017;33:1179–1186. doi: 10.1093/bioinformatics/btw777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Haghverdi L, Lun ATL, Morgan MD, Marioni JC. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat. Biotechnol. 2018;36:421–427. doi: 10.1038/nbt.4091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.UCSC Genome Annotation Database. Goldenpath. http://hgdownload.cse.ucsc.edu/goldenpath/hg38/database (2019).
- 94.ISCN. ISCN rules for listing chromosomal rearrangements. Curr. Protoc. Hum. Genet.10.1002/0471142905.hga04cs17 (2001). [DOI] [PubMed]
- 95.Baudis M, Cleary ML. Progenetix.net: an online repository for molecular cytogenetic aberration data. Bioinformatics. 2001;17:1228–1229. doi: 10.1093/bioinformatics/17.12.1228. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The microarray (Figs. 2c and 5c) and RNA-Seq (Fig. 3b and Supplementary Fig. 1c) data have been deposited in the European Genome-phenome Archive under the accession code EGAS00001004597. All the other data supporting the findings of this study are available within the article and its Supplementary information files and from the corresponding author upon reasonable request. A reporting summary for this article is available as a Supplementary information file. Source data are provided with this paper.