Abstract
Paediatric rhabdomyosarcoma (RMS) is a soft tissue malignancy of mesenchymal origin that is thought to arise as a consequence of derailed myogenic differentiation. Despite intensive treatment regimens, the prognosis for high-risk patients remains dismal. The cellular differentiation states underlying RMS and how these relate to patient outcomes remain largely elusive. Here, we use single-cell mRNA sequencing to generate a transcriptomic atlas of RMS. Analysis of the RMS tumour niche reveals evidence of an immunosuppressive microenvironment. We also identify a putative interaction between NECTIN3 and TIGIT, specific to the more aggressive fusion-positive (FP) RMS subtype, as a potential cause of tumour-induced T-cell dysfunction. In malignant RMS cells, we define transcriptional programs reflective of normal myogenic differentiation and show that these cellular differentiation states are predictive of patient outcomes in both FP RMS and the less aggressive fusion-negative subtype. Our study reveals the potential of therapies targeting the immune microenvironment of RMS and suggests that assessing tumour differentiation states may enable a more refined risk stratification.
Subject terms: Cancer genomics, Paediatric cancer, Cancer microenvironment
The cellular differentiation states of paediatric rhabdomyosarcoma (RMS) remain to be explored. Here, single-cell RNA sequencing analysis of RMS tumours reveals an immunosuppressive microenvironment and distinct transcriptional programs predictive of patient outcomes.
Introduction
Rhabdomyosarcoma (RMS) is the most commonly diagnosed soft tissue sarcoma (STS) in children and adolescents, accounting for ~3.5% of all paediatric malignancies1. Several characteristics, including expression of the myogenic regulatory transcription factors MYOD1 and MYOG2 and the presence of rhabdomyoblasts3 (cells reminiscent of terminally differentiating myocytes), point to RMS being the result of impaired skeletal muscle myogenesis. However, the disease may also arise at body sites devoid of skeletal muscle, and RMS models of non-myogenic origin have been described4. Despite intense, multimodal treatment strategies, outcomes remain dismal for patients with high-risk or metastatic disease, the latter of which exhibits a long-term overall survival rate (OS) of ~30%5. This emphasises the need to improve our understanding of RMS tumour biology to enable the development of novel therapeutic approaches.
Historically, RMS has been divided into two main subtypes, alveolar and embryonal, based on the histological features of tumours6. However, recent work has shown that the molecular classification as either fusion-positive (FP) or fusion-negative (FN) is a more powerful prognostic indicator7,8. FP RMS is characterised by recurrent chromosomal translocations resulting in the expression of a chimeric fusion protein containing the DNA binding domains of either PAX3 or PAX7, both key transcriptional regulators of normal myogenesis9, coupled to a strong transactivation domain, most often that of FOXO110,11. The genetic lesions driving FN RMS, on the other hand, are diverse and may include mutations in signal transduction pathways (especially RAS and PI3K), cell cycle regulators and the P53 pathway, among others12. Notably, FP RMS carries a significantly worse prognosis than FN RMS and is more often metastatic at diagnosis8.
In addition to the inter-tumoral genetic heterogeneity characteristic of RMS, it has been recognised that there exists a degree of heterogeneity within tumours, as exemplified by the diversity in cellular morphology13 and variation in immunohistochemical staining for myogenic markers14. However, the characteristics and clinical implications of this heterogeneity remain unclear. In addition, the composition of the tumour microenvironment (TME) and the interplay between malignant cells and the TME have not been comprehensively profiled.
Here, we compile a single-cell transcriptomic atlas comprising both FN and FP RMS and find distinct differences in cellular composition and differentiation states between and within subtypes that relate to clinical outcomes and suggest potential immunotherapeutic interventions.
Results
A single-cell atlas of paediatric RMS tumours
We implemented a protocol for performing plate-based single-cell mRNA-sequencing15 (SORT-seq) on viably frozen primary RMS tumour samples and recently established patient-derived tumour organoid models16 (Fig. 1a and Supplementary Fig. 1a,c). Opting for a plate-based method allowed for the generation of high-quality single-cell transcriptomes from primary samples with low viability (including pre-treated samples) or where limited material was available (e.g. small needle biopsies). From our cohort of 27 RMS samples (19 primary samples and 8 previously established tumour organoid models16), encompassing the major molecular and histological subtypes (FP, FN, alveolar and embryonal, Fig. 1b,c, Supplementary Fig. 1b and Supplementary Data 1), we obtained 10,216 high-quality single-cell transcriptomes which passed quality thresholds (median of 420 per primary sample and 319 per tumour organoid).
To distinguish RMS cells in primary samples from non-malignant cell types comprising the TME, two complementary approaches were employed. First, the similarity between each single-cell transcriptome and a reference collection of bulk transcriptomes derived from healthy cell types and RMS tumours was assessed using SingleR17 (see Methods). Clustering of the resulting similarity scores revealed a clear distinction between cells with a high correspondence to bulk RMS tumours (malignant cells) and those which resembled one of several immune or stromal cell types (Supplementary Fig. 1d). Second, single-cell copy number variant (CNV) profiles were inferred and clustered on a per tumour basis. In all tumours, cells harbouring coherent whole and sub-chromosomal CNVs (malignant cells) could be distinguished from those which appear to be copy neutral (Supplementary Fig. 2). In general, single-cell derived CNV profiles were highly similar to those defined by whole genome sequencing of bulk tumour samples16 (Supplementary Fig. 2). Cells classified as “malignant” or “normal” using both methods were retained, while divergently classified cells were excluded from further analysis ( < 2% of total primary cells, Supplementary Fig. 1e). The median percentage of malignant cells per primary sample was 56%, though this varied widely (2%-97%) and did not differ significantly between molecular subtypes (Fig. 1f and Supplementary Fig. 1e). As expected, putative malignant cells expressed high levels of classical RMS marker genes DES, MYOD1 and MYOG (Fig. 1e). SingleR cell-type similarity scores and the expression of known marker genes were then used to discern the identities of non-malignant cells (Fig. 1e and Supplementary Fig. 1b). As with the overall percentage of malignant cells, the proportion of each non-malignant cell type varied extensively between tumours but did not differ significantly based on fusion status (Fig. 1f and Supplementary Fig. 1e). Projecting the classified primary single-cell transcriptomes in Uniform Manifold Approximation and Projection (UMAP) space revealed that inter-tumoral heterogeneity and molecular subtype classification (FN or FP) drove the clustering of malignant cells, while non-malignant cells clustered by cell type (Fig. 1d), as has previously been described for other tumour entities18–21. Clustering based on molecular subtype and inter-tumoral heterogeneity was also observed for the tumour organoid cells, supporting the presence of subtype-specific transcriptomic differences that are retained after in vitro expansion (Supplementary Fig. 1g).
Characterisation of the RMS microenvironment reveals general and subtype-specific immune dysfunction
To explore the composition and functional characteristics of tumour-infiltrating immune cells, graph-based clustering was performed on the myeloid and T/NK compartments of primary RMS samples (Fig. 2a,e). Examination of marker gene expression across myeloid clusters revealed the presence of undifferentiated (M0) and differentiated (Mq) macrophages, as well as conventional (cDC) and plasmacytoid (pDC) dendritic cells (Fig. 2b and Supplementary Fig. 3a). Scoring differentiated macrophages for M1/M2-specific gene signatures22 (listed in Supplementary Table 1) indicated that they existed predominantly in the M2 polarisation state (Fig. 2c), which is associated with several pro-tumorigenic functions including the suppression of inflammation and promotion of angiogenesis23. This finding was supported by immunofluorescence (IF) microscopy on primary tumour tissue showing infiltration of CD206+ and CD163+ cells across multiple patient samples (Fig. 2d and Supplementary Fig. 3b).
Among the T/NK cell clusters, several subtypes could be discerned including naïve and gamma delta (GD) T cells, regulatory T cells (Tregs), cytotoxic (CD8 + ) T cells and multiple subtypes of CD4 + T helper cells (IL7R+ and ISG + ) (Fig. 2f and Supplementary Fig. 3c). IHC for immune cell markers confirmed the presence of infiltrating T cells in RMS tissues (Supplementary Fig. 1f). Interestingly, interferon-stimulated T helper cells (ISG + ) were found almost exclusively in FN tumours, which may reflect a higher degree of immunogenicity (Supplementary Fig. 3b). Within T cell subgroups, the expression of several genes encoding molecules associated with immune dysfunction and the suppression of immune responses24 was observed, including LAG3 and PDCD1 (PD1) in CD8 + T cells, CTLA4 and TIGIT in Tregs and HAVCR2 in NK cells (Supplementary Fig. 3c). Strikingly, gene set enrichment analysis (GSEA) comparing CD8 + T cells between RMS subtypes indicated that dysfunction was more prevalent in FP samples, which were enriched for gene sets related to PD-1 signalling, oxidative phosphorylation and T cell exhaustion, while cells from FN tumours were enriched for interferon response and stimulation signatures (Fig. 2g). To define putative cell-cell interactions regulating immune dysfunction, we used CellChat25 to model ligand-receptor interactions between malignant cells, per subtype, and cell types within the TME. This analysis highlighted a putative interaction, specific to FP tumours, between NECTIN3 expressed on malignant cells and the TIGIT receptor on Tregs and CD8 + T cells (Supplementary Fig. 3e). The specificity of this interaction was due to the significantly higher expression of NECTIN3 in FP tumour cells, while the expression of TIGIT in Tregs and CD8 + T cells was comparable between subtypes (Supplementary Fig. 3f). Supporting this finding, IF microscopy on primary tumour tissue revealed the presence of TIGIT-positive cells in both subtypes, while a consistently more prevalent staining pattern of NECTIN3 was observed in FP RMS (Fig. 2h). Taken together, analysis of the TME in RMS highlighted evidence of general immune dysfunction, as indicated by the prevalence of M2 polarised macrophages, as well as a putative FP-specific T-cell exhaustion phenotype which may in part be regulated by the interaction between NECTIN3 and TIGIT.
Malignant cell states in RMS mirror normal myogenic differentiation
While it has been proposed that RMS tumours arise as a result of myogenic differentiation gone awry, the identification of the precise developmental origin(s) of RMS remains an active area of investigation26. To place primary RMS tumour cells within the context of normal myogenic differentiation, a series of logistic regression models were trained, as previously described27, to predict the similarity of malignant single-cell transcriptomes to the main cell types defined by a recently published single-cell atlas of human pre- and post-natal myogenesis28. This analysis showed that, on average, FN RMS cells resembled both myogenic progenitors and myogenic mesenchymal cells, while FP cells most closely corresponded to committed myoblasts (Supplementary Fig. 4a). This is in line with the notion that FN tumours often exhibit an undifferentiated “embryonal” histology, while FP more widely express the key myogenic regulatory factors responsible for orchestrating terminal differentiation, MYOD1 and MYOG6 (Fig. 1b,c and Supplementary Fig. 4b). However, when analysing at single-cell resolution we found that individual cells from each subtype and tumour spanned the spectrum of myogenic differentiation, indicating that there exists large-scale intra-, as well as inter-tumoral heterogeneity in cellular differentiation states (Supplementary Fig. 4a).
NMF-defined differentiation trajectories in FN RMS reflect early myogenesis
To probe the prospective sources of heterogeneity, non-negative matrix factorisation (NMF) was applied, independently per molecular subtype, to define the underlying transcriptional programs active in malignant cells from each of the primary tumours in our RMS scRNA-seq cohort (see Methods). In FN RMS samples, this analysis revealed three clusters of highly correlated transcriptional programs, which we merged into three meta-programs (Fig. 3a-left panel). Notably, the constituent programs underlying each meta-program were derived from several tumour samples, indicating that clustering was not driven by inter-tumoral differences. To interpret the biological relevance of each meta-program, we assessed the expression of their top-weighted genes (Fig. 3a-right panel, Supplementary Table 2 and Supplementary Data 2). The first program, which we termed “mesenchymal-like”, was enriched for genes related to extracellular matrix (ECM) organization, including FN1, TGFBI and several collagen-encoding genes, among others (Supplementary Fig. 5a). The second program, referred to as the “progenitor-like” program, included genes expressed during early myogenesis28, such as FGFR4 and GPC3, as well as markers of proliferation, including MKI67 and TOP2A. As this suggests, cells with high expression of progenitor-like program genes were inferred to have high cell cycle activity (Fig. 3a-right panel). Finally, the “myoblast-like” program was characterised by genes involved in the regulation of terminal myogenic differentiation, including MYOD1, MYOG, MEF2C and CDH15 as well as genes encoding structural and functional components of terminally differentiated striated muscle, such as TTN and CKM. Scoring FN cells for each meta-program revealed that expression of the myoblast-like and mesenchymal-like programs was mutually exclusive, while expression of the highly proliferative progenitor-like state was restricted to cells that scored low for the mesenchymal-like as well as the myoblast-like programs (Fig. 3b). These associations were corroborated using a dataset from a recently published independent single-nucleus RNA-seq cohort of RMS tumours29 (Supplementary Fig. 5b).
Meta-program scores were then used to define the discrete “state” of each cell. This analysis revealed a high degree of variation between tumours in the distribution of cell states (Fig. 3c). Interestingly, some tumours (e.g. RMS012 and RMS000HEI) were dominated by mesenchymal-like cells, while others (e.g. RMS444 and RMS000FWE) almost exclusively contained progenitor-like cells. RNA fluorescence in-situ hybridisation (RNA-FISH) was used to validate the presence of each cell state and the distribution of the progenitor-like and mesenchymal-like states within individual tumours (Fig. 3d and Supplementary Fig. 5c). To investigate the hierarchy of cell states in FN RMS, the data were modelled as a differentiation trajectory by projecting single-cell transcriptomes in diffusion maps space and using pseudotime and RNA velocity to assess directionality (Fig. 3e and Methods). This analysis suggested that cells transition from the highly proliferative progenitor-like state into the more differentiated mesenchymal-like or myoblast-like states. Variation in differentiation status was also evident when comparing the malignant cell-state scores with the similarity scores to normal myogenic cell types. This showed that the progenitor-like score correlated strongly with undifferentiated myogenic progenitors, while the mesenchymal-like and myoblast-like scores with more differentiated normal cell types, namely myogenic mesenchymal cells, and myoblasts/myocytes, respectively (Fig. 3f). We then leveraged the SCENIC pipeline30 to investigate the relationship between transcription factor (TF) activity and trajectory-specific pseudotime (see Methods). This analysis showed that as cells progress towards a mesenchymal-like state, the activity of several TFs known to play a role in the epithelial-to-mesenchymal transition (EMT), including TWIST1 and ZEB1 increased in activity (Supplementary Fig. 5d,e). Conversely, cells progressing towards a myoblast-like state exhibited upregulation in the activity of key regulators of terminal myogenic differentiation, including MYOD1, MYOG and MYF6 (MRF4). Finally, the highly proliferative progenitor-like cells (early along pseudotime in both trajectories) displayed an increase in the activity of cell cycle progression regulators, including several E2F TFs, as well as the myogenic progenitor marker SOX8. Together, these data show that transcriptional cell states in FN RMS cells can be organised in a differentiation trajectory mirroring that of early myogenic differentiation, where progenitor-like cells can give rise to cells resembling terminally differentiating myoblasts, or those progressing towards a mesenchymal-like state reminiscent of myogenic mesenchymal cells.
Differentiation states in FP RMS mirror skeletal muscle regeneration
Extending the NMF analysis revealed three meta-programs specific to primary FP RMS, as defined by correlating transcriptional programs across tumour samples (Fig. 4a-left panel). The proliferative program consisted almost entirely of genes involved in mitotic cell processes, including MKI67, TOP2A and CENPE, among others (Supplementary Fig. 6a). As expected, nearly all cells inferred to be in S or G2/M phases scored high for this meta-program (Fig. 4a-right panel, Supplementary Table 3 and Supplementary Data 3). The myoblast-like program, like that found in FN RMS tumours, was marked by the expression of terminal myogenic differentiation genes, such as MYOG, TTN and CKB. Finally, the program termed “satellite cell-like” (SC-like) was characterised by the expression of the NOTCH3 receptor gene, Notch pathway targets, including HEY1 and HES1, and genes encoding type V and VI collagens. These genes are known to play roles in the context-specific regulation of quiescence, self-renewal, and activation in muscle-resident satellite cells31,32. Scoring single cells for each meta-program revealed a mutually exclusive relationship between the myoblast-like and SC-like programs, while the proliferative program did not correlate with either and was, in general, restricted to cells scoring low for the two former programs (Fig. 4b). Again, the relationship between meta-program scores was confirmed in an independent dataset29 (Supplementary Fig 6b).
As with the FN samples, there was a high degree of variation in discrete cell-state proportions between tumours, particularly among the proliferative and SC-like states (Fig. 4c). The expression of each meta-program, as well as the mutual exclusivity of the myoblast-like and SC-like programs, was validated using RNA-FISH (Fig. 4d). Comparisons between meta-program scores and the logistic regression-defined cell similarity scores showed that the myoblast-like program correlated strongly with cell types undergoing terminal differentiation (myoblasts and myocytes) while the SC-like program was most comparable to post-natal satellite-cells (Fig. 4e). The proliferative program score did not strongly correlate with any of the normal myogenic cell types, supporting the notion that this program was indicative only of cell cycle activity. Trajectory inference indicated that cells scoring high for the myoblast -like or SC-like programs lay at opposite ends of the differentiation continuum, while the proliferative cells appeared as an undifferentiated intermediate state (Fig. 4f). In this case, however, the RNA velocity results did not definitively imply a strict directionality of the trajectory (Fig. 4e). Finally, TF activity analysis showed that cells progressing towards a myoblast-like state had high activity of MYOG and MYOD1, while cells in the proliferative state were marked by high E2F TF activity (Supplementary Fig. 6c, d). Notably, in addition to high activity for the NOTCH pathway effector TF HEY1, the SC-like state was associated with an upregulation in the activity of the key satellite cell regulator PAX7, further supporting the resemblance between the SC-like cell state in FP RMS and normal satellite cells. Altogether, these data showed that the shared cell-state heterogeneity in FP RMS forms a differentiation trajectory reminiscent of that underlying skeletal muscle regeneration, where SC-like cells connect to cells resembling terminally differentiating myoblasts through a proliferative, undifferentiated cell state.
Validation of subtype-specific malignant cell states in tumour organoid models of RMS
To determine whether the subtype-specific cell states defined above and validated in an independent cohort of RMS samples29 are tumour cell intrinsic, we sought to decouple the influence of the TME from malignant cells. It was previously demonstrated that RMS tumour organoid models, shown above to display the molecular-subtype-specific clustering observed in primary malignant cells (Supplementary Fig. 1g), faithfully recapitulate RMS tumours16. As RMS tumour organoids are composed solely of malignant cells, we reasoned that if the intra-tumoral heterogeneity observed in the primary tumours is tumour cell intrinsic, it would be reflected in these models. To investigate this, we scored tumour organoid cells for each of the subtype-specific meta-programs defined using NMF. Among FN tumour organoid cells, the relationship between the 3 programs resembled what was observed in primary RMS tumours: expression of the mesenchymal-like and myoblast-like programs was mutually exclusive, while cells scoring highest for the progenitor-like program scored, in general, low for the other two programs (Supplementary Fig. 7a,b). Similarly, there was a strong negative association between the myoblast-like and SC-like programs in FP tumour organoid cells, whereas high scores for the proliferative program were found more frequently in cells scoring low for the two former (Supplementary Fig. 7c,d). Next, to assess whether patient-specific cell state heterogeneity was retained, we directly compared the meta-program scores between cells from primary tumours and derived tumour organoids for the 4 patients with matched samples. Overall, subtype-specific meta-program scores overlapped between the primary and tumour organoid samples (Supplementary Fig. 7e–h). In RMS012, cells from the tumour organoid generally scored lower for the mesenchymal-like program than primary tumour cells, which may be the result of in-vitro selection against the less proliferative mesenchymal-like state cells. Together, these data show that the subtype-specific cell states present in primary tumours are also reflected in RMS tumour organoid models, suggesting that this heterogeneity is indeed largely intrinsic to malignant cells and not induced by the TME.
Malignant cell states are predictive of patient outcomes
Taken together, results from the analysis of NMF-defined transcriptional programs allowed us to propose a unified model of cell states and differentiation trajectories in FN and FP RMS tumours (Fig. 5a,b). In FN tumours, highly proliferative cells with characteristics of early myogenic progenitors (progenitor-like) seem to give rise to cells which resemble either of two more differentiated types: myogenic mesenchymal cells (mesenchymal-like) or terminally differentiating myoblasts/myocytes (myoblast-like). In FP tumours, on the other hand, highly proliferative cells (proliferative) are an intermediate between cells closely resembling differentiating myocytes (myoblast-like), or post-natal skeletal muscle-resident satellite cells (SC-like). To investigate whether the differentiation state of RMS tumours affects their clinical behaviour, a published cohort of bulk tumour gene expression profiles8 was scored for each meta-program. Strikingly, FN RMS patients whose tumours had a high differentiation score (mesenchymal-like + myoblast-like) exhibited a significantly better OS probability than those with a low score (p = 0.00069, Fig. 5c-left panel). This result was particularly intriguing, as neither cell-state program was predictive of outcomes on its own (Supplementary Fig. 8a,b). Conversely, a high score for the undifferentiated progenitor-like program was indicative of significantly worse OS than FN tumours with a low score (p = 0.035, Fig. 5c-right panel). We compared the predictive power of the differentiation and progenitor-like scores with a previously validated meta-gene score for stratifying FN RMS patients (MG5.FN)7. While the distinction between high- and low-risk cases using the MG5.FN score was slightly more significant than with the progenitor-like score, the differentiation score predicted a group of patients with a significantly worse prognosis (Fig. 5c and Supplementary Fig. 8c). Notably, none of the genes present in the MG5.FN score overlapped with genes comprising any of the meta-program scores. In FP RMS patients, high expression of the SC-like program was associated with prolonged OS (p = 0.017), while a high proliferative score was indicative of shorter OS (p = 0.029, Fig. 5d). Differential expression of the myoblast-like program in FP tumours was not predictive of patient survival (Supplementary Fig. 8d). In summary, these data show that, in both RMS subtypes, tumours with higher proportions of cells in more differentiated states exhibit better outcomes than those with high levels of proliferative, less differentiated cells.
Discussion
We generated a single-cell transcriptomic atlas of primary RMS tumours and patient-derived tumour organoid models, detailing cell states of both malignant cells and those constituting the TME. In our investigation of the TME, we found that, among differentiated macrophages in both RMS subtypes, the immunosuppressive, pro-tumour M2 polarisation state was predominant. Future work should focus on identifying the mechanism by which RMS tumour-associated macrophages are induced to an M2 polarisation, e.g. through cytokine secretion or signalling through direct cell-cell contact between RMS cells or other cell types in the TME. In addition, it will be important to understand the functional implications of tumour-associated M2 macrophages in RMS and determine whether polarisation can be impeded to encourage tumour-associated macrophages to take on a less immunosuppressive phenotype or exclude them from tumours entirely. Similar approaches are currently under clinical evaluation in other tumour entities33. We also described a putative interaction between FP tumour cells expressing NECTIN3 and the TIGIT receptor on Tregs and NK cells, which was supported by immunofluorescence staining of patient tissue samples. Interactions between TIGIT and several ligands, including NECTIN3, have been shown to suppress anti-tumour immune responses through several mechanisms34. As such, targeting this interaction may represent an opportunity to sensitise FP RMS tumours to immune-mediated killing using checkpoint inhibitors to block the TIGIT receptor34. Overall, we observed a higher proportion of T/NK cells, relative to myeloid cells (~1:1), than has been previously described in studies utilising immunohistochemistry35,36 or scRNA-seq29. Furthermore, beyond endothelial cells, we were unable to detect any other non-immune cell types in the TME, such as cancer-associated fibroblasts (CAFs) or non-malignant skeletal muscle cells. We ascribe these inconsistencies to biases introduced by the sample preparation protocol (e.g. the freeze/thaw cycle tumour samples were subjected to). However, including a larger number of T and NK cells had the benefit of allowing us to resolve and characterise functional subtypes not previously identified in RMS tumours.
In analysing NMF-defined transcriptional meta-programs and the similarity of RMS single-cells to normal myogenic differentiation, we defined subtype-specific hierarchies of malignant cell states. While directionality of the trajectories defined for FN RMS cells could be inferred, it was unclear in FP RMS whether SC-like cells derive from those in the proliferative state or “de-differentiate” and give rise to other cell states. It will be important, therefore, in future studies to examine the dynamic relationships between cell states in FP RMS using, for instance, phylogenetic analyses37,38 or functional assays in pre-clinical RMS models. Notably, our results illustrate the utility of RMS tumour organoid models in addressing questions such as these, as they share the presence of subtype-specific cell states observed in vivo. Our model of RMS differentiation trajectories has several clinical and biological implications. First, the observation that high levels of cells in more differentiated states are associated with better patient outcomes suggests the use of “differentiation therapy”, where tumour cells are pharmacologically induced to undergo differentiation39, would be a useful treatment strategy for RMS. In support of this, several studies using pre-clinical models of RMS have demonstrated that inhibiting critical pathways or regulators of tumorigenesis, including MEK in mutant RAS-driven FN RMS40 and BAF complexes in FP tumours41, leads to the induction of terminal myogenic differentiation. This approach could be expanded upon in future studies through the systematic elucidation of key regulators of RMS cell states which could be targeted to induce differentiation. On the other hand, the observation that high levels of proliferation are associated with worse outcomes supports the potential utility of compounds targeting key cell cycle regulators, including WEE1, PLK1 or CDK4/6 inhibitors, all of which are being investigated as therapeutic additions to RMS treatment regimens42. In addition to informing future treatment strategies, our results suggest that the differentiation state of RMS tumours could be a valuable metric for patient stratification, particularly in FN RMS. The translation of this finding could help advance a key goal of RMS clinical research: the de-intensification of treatment, where possible, to reduce toxicity and treatment-induced late effects43. However, these results first need to be validated in independent patient cohorts, an effort which is complicated by the overall lack of publicly available data sets combining gene expression of RMS tumours with clinical follow-up information.
Importantly, several of the cell states defined here using NMF are confirmed by findings detailed in other recent studies of RMS intra-tumoral heterogeneity. Patel et al.29 and Wei and Qin et al.44 also found the presence of highly proliferative cell states, as well as cells appearing to undergo terminal myogenic differentiation in both RMS subtypes. Likewise, both detected the presence of a largely quiescent population of cells, unique to FN RMS tumours, exhibiting mesenchymal characteristics (termed “mesoderm” cells in Patel et al.29), with Wei and Qin et al.44 also detailing their similarity to the recently identified SkM. Mesen cells (referred to here as myogenic mesenchymal cells). In Wei and Qin et al.29, these “mesenchymal-like” cells arose from proliferative or “ground” state cells in-vitro, consistent with the differentiation trajectory presented here, but could re-enter the cell cycle and give rise to other cell states under stress conditions. Their further finding that enriching for these cells improved the efficiency of tumour growth in mice, as well as data from Patel et al.29 showing that “mesoderm” cells were treatment resistant, was intriguing considering our finding that a high mesenchymal-like score (in combination with a high myoblast-like score) was predictive of better OS in FN RMS patients. Additional work will be required to decouple how the mesenchymal-like state influences outcomes in FN RMS.
In our comprehensive analysis of single-cell transcriptomes from paediatric RMS, we characterised the immune component of the TME and defined cell-states mirroring normal myogenic differentiation trajectories. Based on these findings, we propose that targeting immune checkpoint molecules and suppressive M2 macrophages are promising therapeutic approaches for RMS that merit further investigation. Furthermore, the validation and clinical implementation of differentiation state as a prognostic indicator should be a priority, given its potential to improve patient risk stratification.
Methods
Tumour sample acquisition
RMS tumour samples were obtained via an established sample acquisition route as part of the biobank initiative of the Princess Máxima Center for Pediatric Oncology, Utrecht, Netherlands (remaining tumour samples). Ethics approval was granted for the biobanking initiative by the Medical Research Ethics Committee (METC) of the University Medical Center Utrecht, and the Maxima biobank committee granted approval for the present project. All patients and/or their legal representatives signed informed consent to have tumour samples taken for biobank usage. Experiments conformed to the principles set out in the WMA Declaration of Helsinki and the Department of Health and Human Services Belmont Report.
Tumour organoid samples
Tumour organoid models used in this study were established previously, as described in Meister et al.16.
Sample processing and single-cell RNA-sequencing
Viably frozen primary tumour samples were rapidly thawed in a water bath, minced using a scalpel and then transferred to a tube containing 4.5 ml of BM1* medium (Advanced DMEM/F12 [Gibco, cat no. 12634010] supplemented with 1% Glutamax [Gibco, cat no. 35050061], 1% Penicillin/Streptomycin [Gibco, cat no. 15140122], 2% B27 minus vitamin A [Gibco, cat no. 12587010], 1% N2 [Gibco, cat no. 17502048], 0.25% N-acetylcysteine [500 mM, Sigma, cat no. A9165], 1% MEM non-essential amino acids [Gibco, cat no. 11140035], 1% sodium pyruvate [100 mM, Gibco, cat no. 11360070], 0.01% heparin [5000 U/ml, Sigma, cat no. H3149-10KU], 1% hEGF [2 µg/ml, Peprotech, cat no. AF-100-15], 0.1% hFGF-basic [40 µg/ml, Peprotech, cat no. 100-18B], 0.02% hIGF1 [100 µg/ml, Peprotech, cat no. 100-11], 0.01% Rho kinase inhibitor [Y-27632, 100 mM, AbMole Bioscience, cat no. M1817] and 0.1% A83-01 [5 mM, Tocris Bioscience, cat no. 2939]). To this, 0.5 ml of Collagenase D (Roche, #11088866001, 1:10 dilution) and DNAseI (Stemcell #07900, stock diluted 1:40 in PBS, further 1:100 diluted in the BM1* mixture) were added, and samples were allowed to dissociate in a shaker set to 250 rpm for 30 min at 37 °C. Following digestion, samples were passed through a 70 µm strainer which was subsequently flushed with an additional 5 ml of BM1* (supplemented with DNAseI) to increase the yield. Samples were then washed twice with 5 ml of washing medium (Advanced DMEM/F12 supplemented with 1% Glutamax, 1% Penicilin/Streptomycin and 1% HEPES [1 M, Gibco, cat no. 15630049]), centrifuging at 300 g for 5 min (at 4 °C) in between steps. After the final washing step samples were resuspended in BM1* (supplemented with DNAseI) to a final concentration of <1 × 106 cells per ml. Viably frozen tumour organoid samples, generated previously in our lab16, were rapidly thawed in a water bath and immediately resuspended in BM1* (supplemented with DNAseI) to a final concentration of <1 × 106 cells per ml after a washing step to remove the DMSO necessary for freezing. Prior to sorting, 4′,6-diamidino-2-phenylindole (DAPI, Sigma-Aldrich, #D9542) and DRAQ5 (Thermo Fisher, #65-0880-92) were added to single-cell suspensions up to final concentrations of 1 µM and 5 µM, respectively. Viable single-cells (DAPI−, DRAQ + ) were then sorted into 384-well plates containing 10 µl of mineral oil (Sigma, #M5310) and 50 nl of barcoded RT primers using a SONY SH800S Cell Sorter (SONY SH800S system software v2.1). Libraries were prepared according to the SORT-seq15 protocol and sequenced on an Illumina NextSeq500 (paired-end, 75 bp read chemistry) by Single Cell Discoveries B.V.
Immunohistochemistry and H&E staining
Immunohistochemistry (IHC) and haematoxylin and eosin (H&E) staining experiments were performed on 4 µm thick formalin fixed and paraffin embedded (FFPE) tissue sections using a Ventana automated tissue staining system (BenchMark Ultra, Roche). For IHC, the antibodies used were anti-CD3 clone LN10 (Leica, PA0533), anti-CD8 4B11 (Leica, PA0183) and anti-CD68 514H12 (Leica, PA0273).
Immunofluorescence microscopy
Mounted tumour sections (5 µm thick FFPE) were baked at 60 °C for 1 h, then deparaffinized and rehydrated using sequential washes of Xylene (2 × 100%), Ethanol (2 × 100%, 2 × 95%, 1 × 75%, 1 × 50% and 1 × 25%) and demineralised H2O (2 × 1 min, 1 × 5 min). Antigen retrieval was then performed by boiling slides in either Tris-EDTA, pH 9 (anti-TIGIT and anti-NECTIN3) or Sodium citrate, pH 6 (anti-CD206 and anti-CD163) for 20 min in a benchtop autoclave. Slides were then washed 3 × 5 min in PBST (PBS + 0.1% Tween 20) and incubated with blocking solution (PBST + 1% BSA) for 1 h at room temperature. After blocking slides were incubated with primary antibody diluted in blocking solution overnight at 4 °C. The following day, slides were washed 3 × 5 min with PBS and then incubated with secondary antibody, diluted in PBST, in the dark for 1 h at room temperature. Slides were washed an additional 3 × 5 min with PBS before adding mounting medium containing DAPI counterstain (Vector labs, H-1200) and applying glass coverslips. Images were acquired on a Leica SP8 confocal microscope (40×/1.3NA oil immersion objective), and maximum projections of Z-stacks were obtained using the FIJI software (v2.0.0-rc-69/1.52i)45. Primary antibodies used: anti-TIGIT (Cell Signaling, #99567, 1:500 dilution), anti-NECTIN3 (R&D systems, AF3064, 1:200 dilution of a 0.2 µg/µl solution in PBS), anti-CD206 (Cell Signaling, #91992, 1:200) and anti-CD163 (Abcam, ab182422, 1:200 dilution). Secondary antibodies used: Donkey anti-Goat Alexa 647 (Abcam, ab150131, diluted 1:1000) and Donkey anti-Rabbit Alexa 568 (Abcam, ab175470, diluted 1:1000).
RNA fluorescence in-situ hybridisation (RNAscope)
RNA-FISH experiments were performed on 5 µm FFPE tissue sections using the RNAscopeTM Multiplex Fluorescent v2 kit (ACD bio), according to the manufacturer’s instructions. The following probes were used for hybridisation: Hs-MKI67-C3 (591771-C3), Hs-TTN (550361), Hs-NOTCH3-C2 (558991-C2), Hs-FGFR4-no-XMm-C2 (443431-C2) and Hs-TGFBI (478491). In addition, the following fluorescent dyes were used for detection (diluted 1:1500): Opal 520 (FP1487001KT), Opal 570 (FP1488001KT) and Opal 690 (FP1497001KT). Images were acquired on a Leica SP8 confocal microscope (40x/1.3NA oil immersion objective), and maximum projections of Z-stacks were obtained using the FIJI software (v2.0.0-rc-69/1.52i).
Data processing and quality control
Sequencing reads were demultiplexed, mapped to the GRCh38v2020-A genome, available from 10× genomics (https://support.10xgenomics.com/single-cell-gene-expression/software/release-notes/build), and transcript counts were generated using the zUMIs pipeline (v5.6)46. Using the Seurat R package (v4.1.0)47, count tables (per plate) were then loaded in R (v4.1.0), merged and metadata fields were compiled. Single cells were excluded if they expressed <500 unique genes, <800 or >50,000 unique transcripts, contained a percentage of mitochondrial transcripts >50%, contained >1% haemoglobin gene transcripts or contained a ratio of intergenic to genic transcripts >2. The data were then log normalised using a scale factor of 10,000 transcripts, and the normalised data were scaled and centered. The top 2000 most variably expressed genes were defined using the FindVariableFeatures function in Seurat (default parameters), and their expression was used as input for principal component analysis (PCA). Finally, the first 50 principal components were used to project single-cell transcriptomes in 2-dimensional space using uniform manifold approximation and projection (UMAP, default parameters). The top variable features and scaled gene expression values were recalculated to run PCA for the tumour organoid samples, and the top 30 principal components were used to project in UMAP space. The cell cycle phase of each cell was inferred using the CellCycleScoring function implemented in Seurat, using the built-in gene lists.
Module scoring
Module scores were calculated as implemented in the Seurat function AddModuleScore, taking into account 25 expression bins and 100 control genes per query gene.
Cell type classification
To discriminate between malignant and healthy cells, we first used the SingleR R package (v1.6.1)17 to annotate single cells based on their similarity to reference bulk transcriptomes of healthy cells (Human Primary Cell Atlas data48) and RMS tumours (EGAD00001008467). We then used the InferCNV R package (v1.8.0)49 to define and cluster single-cell copy number variant profiles per tumour sample (default parameters, using an average expression threshold of 0.3 and standard deviation filter of 2). A SORT-seq dataset of cord blood mononuclear cells (CBMC’s) and other normal cell types was used as reference. CNV profiles derived from bulk DNA sequencing were plotted for comparison (see Supplementary Fig. 2), and single-cell clusters containing CNVs were manually selected and annotated as “malignant”. In the final annotation, cells were called malignant when they were classified as such using both approaches and cells which were divergently classified (labelled ambiguous) were excluded from further analyses. The broad cell-type of non-malignant cells was inferred from the hierarchical clustering of the similarity scores.
Analysis of the immune microenvironment
To reach sharper biological distinctions between immune cell subsets, SCTransform50 normalisation was performed on the full dataset to normalise and scale the data for unbiased clustering. To further improve detailed immune cell sub-clustering, sample-specific gene expression was removed to reduce technical effects and enhance biological variation. Sample-specific genes were identified by differential gene expression analyses among tumour cells and immune cells separately and comparing the individual samples. Genes that were differentially expressed in both the tumour cells and immune cells of a specific samples were considered sample-specific noise and were removed from the variable gene list. To avoid clustering of cells based on specific cell processes, genes associated with sex (XIST, TSIX, and Y chromosome-specific genes), cell cycle phase, dissociation stress (heat shock proteins; GO:0006986), and activity (ribosomal protein genes; GO:0022626), were also removed from the variable gene list.
Healthy clusters were subset and clustered using 40 principal components and a resolution of 0.3 (Louvain algorithm) was used to define clusters of the main cell types. For in-depth analysis of the T and NK cells, the respective clusters were subset, and UMAP was re-run using 40 PCs and a resolution of 0.5 was used to define subclusters. For in-depth analysis of the myeloid compartment, SCTransfrom normalisation was re-run, sample-specific and cell process-specific genes were removed from the variable gene list and 6 PCs and a resolution of 0.3 was used to define subclusters.
Immune cell type identification
Cluster annotation was performed using SingleR, using the Human Primary Cell Atlas reference dataset to annotate main cell types, and additionally using the Novershtern Hematopoietic Data51 and Monaco Immune Data52 reference datasets to annotate the immune cell (sub)clusters. Cell annotations were further refined by consulting cluster-specific (up-regulated) differentially expressed marker genes using Seurat’s FindAllMarkers function. The output genes were compared to known cell-type specific marker genes from previous studies53–56.
Gene set enrichment analysis (GSEA)
For GSEA, differential expression analysis between two groups was performed using the FindMarkers Seurat function, using the following adjusted parameters: logfc.threshold = 0, min.pct = 0, min.cells.feature = 0, min.cells.group = 0. Genes were pre‐ranked by their Fold Change and GSEA was performed using the R package fgsea (version 1.20.0). Gene sets with an FDR < 0.25 were considered significantly enriched. Gene sets were obtained for MSigDB version 7.2 using the msigdbr R package (version 7.4.1).
Ligand-receptor interaction analysis
The CellChat algorithm was applied as implemented in the CellChat R package (v1.0.0) to perform an unbiased ligand‐receptor interaction analysis, using the curated ligand‐receptor database of CellPhoneDB (RRID: SCR_017054)57.
Logistic regression analysis
Determination of the similarity between RMS single-cells and normal myogenic cell types (given as a probability value) was estimated as previously described in ref. 27. Briefly, we obtained the data described in ref. 28 from the gene expression omnibus (GSE147457) and trained logistic regression models using the main myogenic cell type labels. Correlations between meta-program scores and normal myogenic cell types used the logit-transformed probability values.
Non-negative matrix factorisation
Non-negative matrix factorisation (NMF) was carried out using the NMF R package (v0.23)58. For each RMS subtype (FN or FP), a list of shared variable features (n = 2000) was compiled using the SelectIntegrationFeatures function in Seurat. The expression of these genes was then scaled, per tumour, and used as input to determine the appropriate NMF rank, by running 50 iterations (Brunet algorithm) for ranks between 2 and 10 (default settings). The optimal rank was determined, per tumour, by manually assessing in cophenetic coefficients, dispersion values and silhouette scores between rank values. We then re-performed NMF at 250 iterations using the optimal rank value. Per subtype, pairwise Pearson correlation coefficients were calculated between NMF-defined transcriptional programs (across all tumours) and hierarchical clustering was used to determine groups. Highly correlated groups of programs were merged into meta-programs by averaging gene weights. Cell-state scores were by using the top 30 weighted genes per meta-program to calculate module scores. Discrete cell-states were determined through manual inspection of the distribution of cell-state module scores.
Gene list enrichment analysis
Functional enrichment of gene lists was performed using the enrichR R package (v3.0)59 (default settings) using the Reactome 2016 database.
Comparison with data from Patel et al.
Single-nucleus RNA-seq data from the manuscript of Patel et al.29, was downloaded from the Single-Cell Pediatric Cancer Atlas Portal (https://scpca.alexslemonade.org/projects/SCPCP000005) and loaded into Seurat. We inferred malignant cells using SingleR, as described for the data presented in this study and applied an additional cut-off of >800 unique transcripts for a cell to be considered valid. Data were then split by molecular subtype and module scores were calculated, as described above, using the top 30 genes per meta-program.
Differentiation trajectory modelling
Modelling of differentiation trajectories was done, per subtype, by projecting cells in DiffusionMaps space using expression of the top 30 meta-program-specific genes (destiny R package v3.1.1)60. The top 3 diffusion components were then used as input for trajectory modelling and cell lineage inference using Slingshot (v2.0.0)61. RNA velocity analysis was performed using the scVelo python package (v0.2.2, python v3.7)62. Briefly, input data per subtype, was filtered to include only genes with 20 shared (spliced and un-spliced) counts and log normalised. First and second order moments were calculated per cell using expression of the top 30 meta-program-specific genes and 30 nearest neighbours. RNA velocity was then estimated using the stochastic model and vectors were overlaid on the DiffusionMaps projections.
Transcription factor activity analysis
The estimation of transcription factor activity was performed as previously described using the pySCENIC implementation (v0.12.1) of the SCENIC pipeline30. For each RMS subtype, 10 SCENIC runs were performed using the unnormalized count data as input, along with the auxiliary input databases “motifs-v9-nr.hgnc-m0.001-o0.0”, “hs_hgnc_tfs” and “hg38__refseq-r80__10kb_up_and_down_tss.mc9nr” (accessed from https://resources.aertslab.org/cistarget/). We considered transcription factors only if they were identified in ≥8 independent runs and calculated activity per cell as the mean AUCell values across runs. We then used the tradeSeq R package (v1.10.0)63 to fit a negative binomial generalised additive model (NB-GAM) for each transcription factor (setting the number of knots to 5).
Survival analysis
Microarray gene expression profiles and the accompanying clinical follow-up information for the ITCC RMS cohort8 was downloaded from the R2 genomics platform (R2: Genomics Analysis and Visualisation Platform (http://r2.amc.nl). Samples which did not exhibit either of the two main RMS histological classifications (alveolar or embryonal) were excluded. The data were divided based on fusion transcript status and Z-scores were calculated per gene. To generate meta-program scores, the average Z-score of the top 30 genes per meta-program (in the appropriate dataset) was calculated per tumour. Based on the distribution of scores, the “high” scoring groups (and vice versa) were defined as either the top 25% or 75% of tumours. The MG5.FN score was applied as described in Missiaglia et al.7 Briefly, the per patient (FN) score was calculated as the sum of the weighted expression (signed Z-score) of the 5 genes. The cohort was then divided into tertiles, with the top tertile corresponding to high MG5.FN expression and the bottom two with low expression. Survival models were generated using the survival R package (v3.2-11) and p values were calculated using a Log-Rank test.
Statistics and reproducibility
Immunohistochemical, immunofluorescence and RNA-FISH stainings were carried out once per patient due to limited material availability. When interpreting imaging results, at least 3 randomly chosen fields were surveyed.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank the patients and their families who made this research possible by consenting to this study. We also thank the many clinicians in our institute with whom we work closely, as well as the Princess Máxima Center’s flow cytometry, single-cell genomics and imaging core facilities. This research was supported by the Foundation Children Cancer-free (Kika) and a European Research Council (ERC) starting grant #850571 (J.D.). M.M. received financial support from the Deutsche Forschungsgemeinschaft (#408083583).
Source data
Author contributions
J.DeMartino, M.M., L.V., A.W., F.H., J.Drost and T.M. conceived and designed the experiments and analyses. J.DeMartino and L.V. analysed the data with contributions from J.Drost, T.S. and T.M. J.DeMartino, M.M. and M.G.K. processed tumour samples and performed FACS sorting for SORT-seq with contributions from T.M. L.H-J. provided detailed technical advice and acquired samples for the imaging experiments. J.DeMartino and M.B performed the imaging experiments. J.DeMartino prepared the paper with contributions from L.V., J.Drost, T.M., J.M. and F.H. J.Drost, T.M., A.R. and F.H. supervised the work. All authors read and approved the paper.
Peer review
Peer review information
Nature Communications thanks Takayuki Ohkuri, Beat Schäfer and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Data availability
The raw sequencing data generated in this study have been deposited in the European genome-phenome archive. The accession number for the single-cell RNA-sequencing data is EGAD00001009385 (“Single-cell mRNA-sequencing to generate a transcriptomic atlas of RMS”). To protect patient privacy, as required by law, access to the sequencing data deposited in the EGA is controlled by the Data Access Committee (DAC) of the Princess Maxima Center. All researchers can obtain access by submitting a project proposal to the DAC (biobank-2 [at] prinsesmaximacentrum [dot] nl). Requests will be handled within ~2 weeks. The DAC will also determine the length of permitted access. The publicly available whole genome sequencing data generated by Meister et al.16 and used in Supplementary Fig. 2 can be accessed with the identifier EGAD00001008466 (“WGS soft tissue sarcoma tumoroid biobank”)16. The publicly available bulk RNA-sequencing data generated by Meister et al.16 and used in Supplementary Fig. 1d can be accessed with the identifier EGAD00001008467 (“RNA-Seq soft tissue sarcoma tumoroid biobank”)16. The publicly available single-nucleus RNA-sequencing data of RMS tumours generated by Patel et al.29 and used in this study are deposited in the single-cell paediatric cancer atlas portal under accession number SCPCP00000529 The publicly available single-cell RNA-sequencing data of normal myogenic differentiation used in this study are deposited in the GEO repository under accession number GSE14745728 and were accessed from https://skeletal-muscle.cells.ucsc.edu. The publicly available ITCC RMS microarray dataset is deposited in the gene expression omnibus (GEO) under accession number GSE92689 and were accessed from the R2 Genomics Analysis and Visualisation Platform (http://r2.amc.nl). The GRCh38v2020-A reference genome used to map the single-cell RNA-seq data was downloaded from the 10× genomics website (https://support.10xgenomics.com/single-cell-gene-expression/software/release-notes/build). The publicly available Human Primary Cell Atlas data, Novershtern Hematopoietic data and Monaco Immune data were accessed using the celldex R package (v1.2.0, https://github.com/LTLA/celldex). The publicly available CellPhoneDB ligand-receptor interaction data was accessed from https://www.cellphonedb.org (RRID: SCR_017054). The publicly available auxiliary input databases for the SCENIC analysis were accessed from https://resources.aertslab.org/cistarget/. The processed data used in and generated by this study, including compiled count tables and processed R objects, have been made publicly available via Zenodo (10.5281/zenodo.7928694)64. The remaining data are available within the Article, Supplementary Information or Source Data file. Source data are provided with this paper.
Code availability
All data analysis scripts have been made publicly available via Zenodo (10.5281/zenodo.7928694)64.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Jeff DeMartino, Michael T. Meister, Lindy L. Visser.
Contributor Information
Thanasis Margaritis, Email: A.Margaritis-2@prinsesmaximacentrum.nl.
Jarno Drost, Email: J.Drost@prinsesmaximacentrum.nl.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-38886-8.
References
- 1.Li J, Thompson TD, Miller JW, Pollack LA, Stewart SL. Cancer incidence among children and adolescents in the United States, 2001-2003. Pediatrics. 2008;121:e1470–e1477. doi: 10.1542/peds.2007-2964. [DOI] [PubMed] [Google Scholar]
- 2.Sebire NJ, Malone M. Myogenin and MyoD1 expression in paediatric rhabdomyosarcomas. J. Clin. Pathol. 2003;56:412–416. doi: 10.1136/jcp.56.6.412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Stiller D, Holzhausen HJ. Ultrastructural organization of cell characteristics relevant to diagnosis in rhabdomyosarcomas. Zentralblatt f.ür. Allg. Pathologie Pathologische Anat. 1988;134:449–466. [PubMed] [Google Scholar]
- 4.Drummond CJ, et al. Hedgehog Pathway Drives Fusion-Negative Rhabdomyosarcoma Initiated From Non-myogenic Endothelial Progenitors. Cancer Cell. 2018;33:108–124.e5. doi: 10.1016/j.ccell.2017.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Oberlin O, et al. Prognostic factors in metastatic rhabdomyosarcomas: Results of a pooled analysis from United States and European Cooperative Groups. J. Clin. Oncol. 2008;26:2384–2389. doi: 10.1200/JCO.2007.14.7207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Siegal G. P., Bloem J. L., Cates J. M. M., H M. WHO Classification of Tumors 5th edition Soft Tissue and Bone Tumors. vol. 3 (WHO Iarc., 2020).
- 7.Missiaglia E, et al. PAX3/FOXO1 fusion gene status is the key prognostic molecular marker in rhabdomyosarcoma and significantly improves current risk stratification. J. Clin. Oncol. 2012;30:1670–1677. doi: 10.1200/JCO.2011.38.5591. [DOI] [PubMed] [Google Scholar]
- 8.Williamson D, et al. Fusion gene-negative alveolar rhabdomyosarcoma is clinically and molecularly indistinguishable from embryonal rhabdomyosarcoma. J. Clin. Oncol. 2010;28:2151–2158. doi: 10.1200/JCO.2009.26.3814. [DOI] [PubMed] [Google Scholar]
- 9.Buckingham M, Relaix F. The role of Pax genes in the development of tissues and organs: Pax3 and Pax7 regulate muscle progenitor cell functions. Annu. Rev. Cell Dev. Biol. 2007;23:645–673. doi: 10.1146/annurev.cellbio.23.090506.123438. [DOI] [PubMed] [Google Scholar]
- 10.Galili N, et al. Fusion of a fork head domain gene to PAX3 in the solid tumour alveolar rhabdomyosarcoma. Nat. Genet. 1993;5:230–235. doi: 10.1038/ng1193-230. [DOI] [PubMed] [Google Scholar]
- 11.Barr FG, et al. In vivo amplification of the PAX3-FKHR and PAX7-FKHR fusion genes in alveolar rhabdomyosarcoma. Hum. Mol. Genet. 1996;5:15–21. doi: 10.1093/hmg/5.1.15. [DOI] [PubMed] [Google Scholar]
- 12.Shern JF, et al. Comprehensive genomic analysis of rhabdomyosarcoma reveals a landscape of alterations affecting a common genetic axis in fusion-positive and fusion-negative tumors. Cancer Discov. 2014;4:216–231. doi: 10.1158/2159-8290.CD-13-0639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Patton RB, Horn RG. Rhabdomyosarcoma: Clinical and pathological features and comparison with human fetal and embryonal skeletal muscle. Surgery. 1962;52:572–584. [PubMed] [Google Scholar]
- 14.Morotti RA, et al. An immunohistochemical algorithm to facilitate diagnosis and subtyping of rhabdomyosarcoma: The children’s oncology group experience. Am. J. Surg. Pathol. 2006;30:962–968. doi: 10.1097/00000478-200608000-00005. [DOI] [PubMed] [Google Scholar]
- 15.Muraro MJ, et al. A Single-Cell Transcriptome Atlas of the Human Pancreas. Cell Syst. 2016;3:385–394.e3. doi: 10.1016/j.cels.2016.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Meister MT, et al. Mesenchymal tumor organoid models recapitulate rhabdomyosarcoma subtypes. EMBO Mol. Med. 2022;14:16001. doi: 10.15252/emmm.202216001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Aran D, et al. Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat. Immunol. 2019;20:163–172. doi: 10.1038/s41590-018-0276-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Venteicher AS, et al. Decoupling genetics, lineages, and microenvironment in IDH-mutant gliomas by single-cell RNA-seq. Science (1979) 2017;355:eaai8478. doi: 10.1126/science.aai8478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Puram SV, et al. Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer. Cell. 2017;171:1611–1624.e24. doi: 10.1016/j.cell.2017.10.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tirosh I, et al. Single-cell RNA-seq supports a developmental hierarchy in human oligodendroglioma. Nature. 2016;539:309–313. doi: 10.1038/nature20123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tirosh I, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science (1979) 2016;352:189–196. doi: 10.1126/science.aad0501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cheng S, et al. A pan-cancer single-cell transcriptional atlas of tumor infiltrating myeloid cells. Cell. 2021;184:792–809.e23. doi: 10.1016/j.cell.2021.01.010. [DOI] [PubMed] [Google Scholar]
- 23.Liu J, Geng X, Hou J, Wu G. New insights into M1/M2 macrophages: key modulators in cancer progression. Cancer Cell Int. 2021;21:389. doi: 10.1186/s12935-021-02089-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Li H, et al. Dysfunctional CD8 T Cells Form a Proliferative, Dynamically Regulated Compartment within Human Melanoma. Cell. 2019;176:775–789.e18. doi: 10.1016/j.cell.2018.11.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jin S, et al. Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 2021;12:1088. doi: 10.1038/s41467-021-21246-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kashi VP, Hatley ME, Galindo RL. Probing for a deeper understanding of rhabdomyosarcoma: Insights from complementary model systems. Nat. Rev. Cancer. 2015;15:426–439. doi: 10.1038/nrc3961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Young MD, et al. Single-cell transcriptomes from human kidneys reveal the cellular identity of renal tumors. Science (1979) 2018;361:594–599. doi: 10.1126/science.aat1699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Xi H, et al. A Human Skeletal Muscle Atlas Identifies the Trajectories of Stem and Progenitor Cells across Development and from Human Pluripotent Stem Cells. Cell Stem Cell. 2020;27:158–176.e10. doi: 10.1016/j.stem.2020.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Patel AG, et al. The myogenesis program drives clonal selection and drug resistance in rhabdomyosarcoma. Dev. Cell. 2022;57:1226–1240.e8. doi: 10.1016/j.devcel.2022.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.van de Sande B, et al. A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nat. Protoc. 2020;15:2247–2276. doi: 10.1038/s41596-020-0336-2. [DOI] [PubMed] [Google Scholar]
- 31.Gioftsidi S, Relaix F, Mourikis P. The Notch signaling network in muscle stem cells during development, homeostasis, and disease. Skelet. Muscle. 2022;12:9. doi: 10.1186/s13395-022-00293-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Urciuolo A, et al. Collagen VI regulates satellite cell self-renewal and muscle regeneration. Nat. Commun. 2013;4:1964. doi: 10.1038/ncomms2964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pathria P, Louis TL, Varner JA. Targeting Tumor-Associated Macrophages in Cancer. Trends Immunol. 2019;40:310–327. doi: 10.1016/j.it.2019.02.003. [DOI] [PubMed] [Google Scholar]
- 34.Harjunpää H, Guillerey C. TIGIT as an emerging immune checkpoint. Clin. Exp. Immunol. 2020;200:108–119. doi: 10.1111/cei.13407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Dancsok AR, et al. Tumor-associated macrophages and macrophage-related immune checkpoint expression in sarcomas. Oncoimmunology. 2020;9:1747340. doi: 10.1080/2162402X.2020.1747340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kather JN, et al. CD163+ immune cell infiltrates and presence of CD54+ microvessels are prognostic markers for patients with embryonal rhabdomyosarcoma. Sci. Rep. 2019;9:9211. doi: 10.1038/s41598-019-45551-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Custers L, et al. Somatic mutations and single-cell transcriptomes reveal the root of malignant rhabdoid tumours. Nat. Commun. 2021;12:1407. doi: 10.1038/s41467-021-21675-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Coorens THH, et al. Lineage-Independent Tumors in Bilateral Neuroblastoma. N. Engl. J. Med. 2020;383:1860–1865. doi: 10.1056/NEJMoa2000962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.de Thé H. Differentiation therapy revisited. Nat. Rev. Cancer. 2018;18:117–127. doi: 10.1038/nrc.2017.103. [DOI] [PubMed] [Google Scholar]
- 40.Yohe ME, et al. MEK inhibition induces MYOG and remodels super-enhancers in RAS-driven rhabdomyosarcoma. Sci. Transl. Med. 2018;10:eaan4470. doi: 10.1126/scitranslmed.aan4470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Laubscher D, et al. BAF complexes drive proliferation and block myogenic differentiation in fusion-positive rhabdomyosarcoma. Nat. Commun. 2021;12:6924. doi: 10.1038/s41467-021-27176-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.van Erp AEM, Versleijen-Jonkers YMH, van der Graaf WTA, Fleuren EDG. Targeted therapy-based combination treatment in rhabdomyosarcoma. Mol. Cancer Ther. 2018;17:1365–1380. doi: 10.1158/1535-7163.MCT-17-1131. [DOI] [PubMed] [Google Scholar]
- 43.Chen C, Dorado Garcia H, Scheer M, Henssen AG. Current and Future Treatment Strategies for Rhabdomyosarcoma. Front. Oncol. 2019;9:1458. doi: 10.3389/fonc.2019.01458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wei Y, et al. Single-cell analysis and functional characterization uncover the stem cell hierarchies and developmental origins of rhabdomyosarcoma. Nat. Cancer. 2022;3:961–975. doi: 10.1038/s43018-022-00414-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Schindelin J, et al. Fiji: An open-source platform for biological-image analysis. Nat. Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Parekh S, Ziegenhain C, Vieth B, Enard W, Hellmann I. zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs. GigaScience. 2018;7:giy059. doi: 10.1093/gigascience/giy059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hao Y, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184:3573–3587.e29. doi: 10.1016/j.cell.2021.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Mabbott NA, Baillie JK, Brown H, Freeman TC, Hume DA. An expression atlas of human primary cells: Inference of gene function from coexpression networks. BMC Genom. 2013;14:632. doi: 10.1186/1471-2164-14-632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Tickle, T., Tirosh, I., Georgescu, C., Brown, M. & Haas, B. inferCNV of the Trinity CTAT Project (Klarman Cell Observatory, Broad Institute of MIT and Harvard, 2019).
- 50.Hafemeister C, Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 2019;20:296. doi: 10.1186/s13059-019-1874-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Novershtern N, et al. Densely interconnected transcriptional circuits control cell states in human hematopoiesis. Cell. 2011;144:296–309. doi: 10.1016/j.cell.2011.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Monaco G, et al. RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types. Cell Rep. 2019;26:1627–1640.e7. doi: 10.1016/j.celrep.2019.01.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Tang-Huau TL, et al. Human in vivo-generated monocyte-derived dendritic cells and macrophages cross-present antigens through a vacuolar pathway. Nat. Commun. 2018;9:2570. doi: 10.1038/s41467-018-04985-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Szabo PA, et al. Single-cell transcriptomics of human T cells reveals tissue and activation signatures in health and disease. Nat. Commun. 2019;10:4706. doi: 10.1038/s41467-019-12464-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zavidij O, et al. Single-cell RNA sequencing reveals compromised immune microenvironment in precursor stages of multiple myeloma. Nat. Cancer. 2020;1:493–506. doi: 10.1038/s43018-020-0053-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zhao J, et al. Single cell RNA-seq reveals the landscape of tumor and infiltrating immune cells in nasopharyngeal carcinoma. Cancer Lett. 2020;477:131–143. doi: 10.1016/j.canlet.2020.02.010. [DOI] [PubMed] [Google Scholar]
- 57.Efremova M, Vento-Tormo M, Teichmann SA, Vento-Tormo R. CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes. Nat. Protoc. 2020;15:1484–1506. doi: 10.1038/s41596-020-0292-x. [DOI] [PubMed] [Google Scholar]
- 58.Gaujoux R, Seoighe C. A flexible R package for nonnegative matrix factorization. BMC Bioinforma. 2010;11:367. doi: 10.1186/1471-2105-11-367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Chen EY, et al. Enrichr: Interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinforma. 2013;14:128. doi: 10.1186/1471-2105-14-128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Angerer P, et al. Destiny: Diffusion maps for large-scale single-cell data in R. Bioinformatics. 2016;32:1241–1243. doi: 10.1093/bioinformatics/btv715. [DOI] [PubMed] [Google Scholar]
- 61.Street K, et al. Slingshot: Cell lineage and pseudotime inference for single-cell transcriptomics. BMC Genom. 2018;19:477. doi: 10.1186/s12864-018-4772-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Bergen V, Lange M, Peidli S, Wolf FA, Theis FJ. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 2020;38:1408–1414. doi: 10.1038/s41587-020-0591-3. [DOI] [PubMed] [Google Scholar]
- 63.van den Berge K, et al. Trajectory-based differential expression analysis for single-cell sequencing data. Nat. Commun. 2020;11:1201. doi: 10.1038/s41467-020-14766-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.DeMartino, J., Visser, L., Meister, M., Margaritis, T. & Drost, J. Processed data supporting the manuscript “Single-cell transcriptomics reveals immune suppression and cell states predictive of patient outcomes in rhabdomyosarcoma”. 10.5281/ZENODO.7928694 (2023). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequencing data generated in this study have been deposited in the European genome-phenome archive. The accession number for the single-cell RNA-sequencing data is EGAD00001009385 (“Single-cell mRNA-sequencing to generate a transcriptomic atlas of RMS”). To protect patient privacy, as required by law, access to the sequencing data deposited in the EGA is controlled by the Data Access Committee (DAC) of the Princess Maxima Center. All researchers can obtain access by submitting a project proposal to the DAC (biobank-2 [at] prinsesmaximacentrum [dot] nl). Requests will be handled within ~2 weeks. The DAC will also determine the length of permitted access. The publicly available whole genome sequencing data generated by Meister et al.16 and used in Supplementary Fig. 2 can be accessed with the identifier EGAD00001008466 (“WGS soft tissue sarcoma tumoroid biobank”)16. The publicly available bulk RNA-sequencing data generated by Meister et al.16 and used in Supplementary Fig. 1d can be accessed with the identifier EGAD00001008467 (“RNA-Seq soft tissue sarcoma tumoroid biobank”)16. The publicly available single-nucleus RNA-sequencing data of RMS tumours generated by Patel et al.29 and used in this study are deposited in the single-cell paediatric cancer atlas portal under accession number SCPCP00000529 The publicly available single-cell RNA-sequencing data of normal myogenic differentiation used in this study are deposited in the GEO repository under accession number GSE14745728 and were accessed from https://skeletal-muscle.cells.ucsc.edu. The publicly available ITCC RMS microarray dataset is deposited in the gene expression omnibus (GEO) under accession number GSE92689 and were accessed from the R2 Genomics Analysis and Visualisation Platform (http://r2.amc.nl). The GRCh38v2020-A reference genome used to map the single-cell RNA-seq data was downloaded from the 10× genomics website (https://support.10xgenomics.com/single-cell-gene-expression/software/release-notes/build). The publicly available Human Primary Cell Atlas data, Novershtern Hematopoietic data and Monaco Immune data were accessed using the celldex R package (v1.2.0, https://github.com/LTLA/celldex). The publicly available CellPhoneDB ligand-receptor interaction data was accessed from https://www.cellphonedb.org (RRID: SCR_017054). The publicly available auxiliary input databases for the SCENIC analysis were accessed from https://resources.aertslab.org/cistarget/. The processed data used in and generated by this study, including compiled count tables and processed R objects, have been made publicly available via Zenodo (10.5281/zenodo.7928694)64. The remaining data are available within the Article, Supplementary Information or Source Data file. Source data are provided with this paper.
All data analysis scripts have been made publicly available via Zenodo (10.5281/zenodo.7928694)64.