Abstract
Cutaneous T cell lymphoma (CTCL) represents a heterogeneous group of non-Hodgkin lymphoma distinguished by the presence of clonal malignant T cells. The heterogeneity of malignant T cells and the complex tumor microenvironment remain poorly characterized. With single-cell RNA analysis and bulk whole-exome sequencing on 19 skin lesions from 15 CTCL patients, we decipher the intra-tumor and inter-lesion diversity of CTCL patients and propose a multi-step tumor evolution model. We further establish a subtyping scheme based on the molecular features of malignant T cells and their pro-tumorigenic microenvironments: the TCyEM group, demonstrating a cytotoxic effector memory T cell phenotype, shows more M2 macrophages infiltration, while the TCM group, featured by a central memory T cell phenotype and adverse patient outcome, is infiltrated by highly exhausted CD8+ reactive T cells, B cells and Tregs with suppressive activities. Our results establish a solid basis for understanding the nature of CTCL and pave the way for future precision medicine for CTCL patients.
Subject terms: Tumour immunology, Skin cancer, Tumour heterogeneity, Oncogenesis, T-cell lymphoma
Cutaneous T cell lymphomas (CTCL) are still poorly characterised at the molecular and single-cell level. Here, the authors analyse CTCL patient samples using single-cell RNA-seq, TCR and whole-exome sequencing, revealing the molecular profiles of malignant T cells and their association with the microenvironment and clinical outcomes.
Introduction
Cutaneous T cell lymphomas (CTCL) is a heterogeneous group of extranodal non-Hodgkin’s lymphomas characterized by cutaneous infiltration of clonal malignant T cells. Mycosis fungoides (MF), Sézary syndrome and primary cutaneous anaplastic large cell lymphoma (pcALCL) constitute the majority of CTCLs and are believed to be derived from skin-homing mature T cells. MF begins with multiple patches, plaques and tumors in the skin, and it may involve lymph nodes, peripheral blood and viscera in the advanced stage, leading to a poor prognosis1. Sézary syndrome, the leukemic form of CTCL, is characterized by generalized skin erythema with leukemic malignant T cells in the blood, while pcALCL is characterized by single or multiple skin tumors and a tendency for systemic involvement in the late stage. These three CTCL variants are highly related and have overlapping clinical and immunophenotypic features2,3. These entities frequently coexist in the same patient, whether they develop simultaneously or as a secondary lymphoma4. Given the numerous shared and disparate properties of CTCL variants, it remains unclear whether these entities are distinct disorders or a continuum with a degree of genetic diversity and varied microenvironments.
Despite tremendous progression in the understanding of CTCL in recent decades, the origin and nature of malignant T cells remain to be defined. Unlike solid tumors, CTCL starts with multi-focal skin lesions, which leads to controversies regarding the tumor origin and evolution pattern of CTCL. Previous studies suggested that neoplastic T cells in CTCL originated from mature, monoclonal, skin-resident memory T cells5. However, recent evidence indicates that malignant T cells are derived from circulating immature precursor cells, which seed the skin and evolve into clonally heterogeneous lymphomas6–8. This discrepancy is largely due to the difficulty in discriminating true malignant T cells from benign reactive T cells in CTCL skin lesions, since there are no specific markers to define malignant T cells, which also results in delayed diagnosis and poor prognosis for CTCLs.
Owing to the poor understanding of disease pathogenesis, effective treatments for CTCL are limited, and the treatment response varies greatly among patients. CTCL generally exhibits an indolent course, but a portion of patients progresses rapidly and exhibit widespread disease beyond the skin despite aggressive treatment, highlighting the underlying heterogeneity of this disease. Moreover, CTCL patients frequently show inter-lesional diversity in treatment responses, adding to the complexity of the clinical challenge of the disease and often creating obstacles to achieving effective therapy9. However, the molecular mechanisms underlying this inter-lesional diversity remain to be defined.
The difficulty in distinguishing malignant T cells from reactive T cells presents an obstacle to understand the immune landscape of the tumor microenvironment (TME) in CTCL. Although studies have demonstrated that tumor infiltrating immune cells, including B cells and M2 macrophages, participate in the pathogenesis and persistence of CTCL through diverse cytokines and chemokines, the complex interplay between malignant T cells and TME compartments remains poorly understood10,11. In particular, malignant T cells preserve certain immunological features of mature T cells, and thus are active in cytokine production. Elucidating how malignant T cells redefine and alter the TME, as well as clarifying the landscapes and properties of the multicellular ecosystem in CTCL, will create new opportunities for the development and application of immunotherapies.
Here, we employed single-cell RNA sequencing (scRNA-seq) and paired single-cell T cell receptor (TCR) sequencing on CTCL skin tumors. Malignant T cells were defined by copy number variations (CNVs) and matched TCRα and TCRβ clonotypes at the single-cell level. With this approach, we determined the mono-clonal nature of CTCL and characterized the temporal and topological subclonal evolutionary process of malignant T cells, revealing a high degree of tumoral cellular heterogeneity. The genetic basis of this intratumor heterogeneity (ITH) was defined by paired whole exome sequencing (WES). We generated a molecular subtyping scheme based on the intrinsic identity of malignant T cells across CTCL variants, and we deciphered the complex crosstalk among immune cells within the TME for each subtype.
Results
Single-cell transcriptome analysis defines malignant T cells and reveals inter-tumor transcriptional heterogeneity in malignant T cells
To systematically examine the cellular profiles of CTCL, we performed scRNA-seq on cells from 19 freshly dissociated CTCL skin samples (n = 15 patients, including 11 CD4+ MF, 2 CD8+ MF, and 2 pcALCLs) (Fig. 1a and Supplementary Data File 1). For 4 CD4+ MF patients (patients MF21, MF27, MF28 and MF30), two biopsies at different anatomic sites were obtained. All diagnoses were verified by two board-certified dermatopathologists and all biopsies were taken from thick plaques or tumors. Fluorescence-activated cell sorting (FACS) was utilized to enrich for several fractions; (i) CD45+CD3+ T cells, (ii) CD45+CD3− immune cells and (iii) CD45− non-immune cells (Fig. 1a and Supplementary Fig. 1a). To balance the gene coverage and cell productivity, we analyzed 3 samples (patients MF4, MF6 and MF7) using 5′ unique molecular identifier (UMI)-based Smart-Seq2 method, and 16 samples were subjected to a droplet-based 10× Genomics platform. For the 10× Genomics method, the single-cell 5′ reagent kit coupled with TCR V(D)J analyses was used to profile TCR clonotypes at single-cell resolution for 14 samples, while a single-cell 3′ reagent kit was used for 2 samples (Supplementary Data File 2). Whole exome sequencing was also performed on 13 CTCL skin samples from 11 patients.
Following stringent quality control (see Methods), a total of 60,324 cells were retained for subsequent analysis (Supplementary Data File 2). We performed unsupervised clustering and projected cells in two dimensions using uniform manifold approximation and projection (UMAP). We identified 14 major cell types based on the expression of canonical marker genes, including T cells, B cells, macrophages, plasma cells, natural killer (NK) cells, type 1 innate lymphoid cells (ILC1s), dendritic cells (DCs), plasmacytoid dendritic cells (pDCs), mast cells and skin-resident non-immune cells (including epithelial cells, endothelial cells, fibroblasts, myofibroblasts and melanocytes), from a total of 58, 926 cells in the 10× Genomics dataset (Fig. 1b, c)12, demonstrating a multicellular ecosystem in CTCL skin lesions. A similar pipeline was applied to the 5′ UMI Smart-Seq2 dataset (Supplementary Fig. 1b, c). We were able to capture more cells per skin sample using 10× Genomics (median of 2,143 cells versus 453 cells using Smart-Seq2) with both methods providing comparable median number of genes per cell (1,483 genes using 10× Genomics versus 1,578 genes using Smart-seq2) (Supplementary Fig. 1e and Supplementary Data File 2).
To differentiate malignant T cells from reactive T cells, TCR repertoires and large-scale chromosomal CNVs inferred from transcriptome sequencing were examined9,12,13. Malignant T cells were defined as presenting TCR clonal expansion coupled with apparent CNVs (Fig. 1d, e and Supplementary Fig. 1d, f). The CNV patterns of malignant T cell subsets were highly consistent with those generated from paired bulk whole exome sequencing (Supplementary Fig. 1g). As visualized in the UMAP plot of all T cells, reactive T cells, including conventional CD4+ T cells (Tconvs), CD4+ regulatory T cells (Tregs) and CD8+ T cells, were clustered by cell type, and the transcriptome profiles from each sample showed a high degree of overlap, whereas malignant T cells grouped into clear patient-specific clusters, demonstrating the remarkable inter-tumor heterogeneity of malignant T cells (Fig. 1f and Supplementary Fig. 1d and 2a).
The proportion of malignant T cells within the total T cell population was highly variable across samples, even between samples from the same patient, adding to the complexity of inter-tumor heterogeneity among CTCL lesions (Fig. 1g). After excluding MF18 and MF27, which had a limited number of malignant T cells (<80), we selected the remaining 16 samples from 13 patients for further malignant T cell analysis. In addition, among the benign reactive T cells without apparent CNVs and presenting a polyclonal phenotype, cells without expression of either CD4 or CD8B were excluded from subsequent analysis (Fig. 1f).
By interrogating the paired TCRα and TCRβ repertories all T cells (n = 35,754), we showed that 79.7% (28,511/35,754) contained at least one productive TCRα or β chain and 73.6% (26318/35754) had both α and β chains (Supplementary Fig. 1h). We observed that malignant T cells from each sample harbored a unique clonotype (clonotype frequency > 210 in all samples), and biopsies from different anatomical compartments of the same patient shared the same TCR clonotype (Fig. 1h). In patients MF17, MF26 and MF28, there were two TCRα chains coupled with a single TCRβ chain in the malignant clone. Approximately 30% of mature αβ T cells express dual TCRA chains due to the absence of transcriptional allelic exclusion14. These results strongly supported the hypothesis of a monoclonal origin for CTCL5. Interestingly, in patient pcALCL1, in which CD4+ and CD8+ T cells were both predominant according to histological analysis, CD8+ T cells were defined as malignant T cells with clonal expansion and aberrant CNVs. Thus, pcALCL1 was reclassified as a CD8+ pcALCL, which has rarely been reported9.
Notably, we also observed a fraction of reactive T cells (no apparent CNVs) exhibited low-level clonal expansions in all samples (clonotype frequency 2–49), indicating an active T cell-mediated immune reaction in CTCL lesions (Fig. 1h).
To summarize, scRNA-seq analysis with TCR profiling unambiguously identified malignant T cells in CTCL samples and revealed a remarkable inter-tumor heterogeneity of malignant T cells.
The constitutive activation/proliferation programs of malignant T cells in each patient determined the inter- and intra-tumor heterogeneity of CTCL
Concordant with the characteristics of neoplastic cells previously described in advanced CTCL, a lack of CD7 expression was observed in malignant T cell subsets15 (Supplementary Fig. 2a). Malignant T cells showed variable expression of SELPLG (encoding CLA), which was indicative of a skin-homing T cell phenotype5. All CD4+ T cells, including both reactive and malignant T cells from all samples, presented a T-helper (Th)-2 skewing phenotype (defined by GATA3 expression), consistent with previous findings that Th2 cytokines produced by malignant T cells repressed the Th1 immune response, fostered a Th2-biased phenotype in normal T helper cells, and impaired antitumor immunity in advanced CTCL16. In MF17 and MF22, malignant T cells displayed a Th1 (with TBX21 expression) and Th2 phenotypes simultaneously. A concurrent Th17 phenotype (defined by RORC expression) was identified in pcALCL2, which was consistent with a previous report demonstrating a dual Th2-Th17 phenotype in a portion of pcALCLs17.
In the malignant T cell subsets, a cell cluster characterized by a loss of TCR expression was identified in five of the patients (patients MF17, MF26, MF28, MF30 and pcALCL1) (Fig. 1e and Fig. 2a, b). In this cluster, only less than 20% of cells harbored productive TCRA and TCRB pairs, which showed the same clonotype with the TCR-competent malignant T cell cluster. This TCR-loss cluster existed exclusively in malignant T cell populations, but not in reactive T cells, and it was grouped individually in each sample instead of mixed into malignant T cell subsets as a “dropout” event, pointing to a distinct transcriptional identity of these cells. These TCR-loss clusters showed lower CD3E expression and TOX expression compared with other malignant T cells, as well as slightly different CNV patterns (Fig. 2c and Supplementary Fig. 2b). To confirm the existence of these clusters of cells, we performed immunofluorescence in corresponding MF skin lesions, which revealed scattered TCRαlow CD4+ T cells in clinical samples (Fig. 2d).
Down-regulation of TCR expression has been reported in aggressive T-cell lymphomas, including primary epitheliotropic intestinal T-cell lymphoma and primary cutaneous gamma-delta T cell lymphoma (PCGDTL)18,19. Some recent studies suggested the existence of a special TCR-silent type of lymphoma, while others proposed that this phenotype represented a common phenomenon of TCR instability18,19. T cell activation could trigger TCR complex downregulation in normal mature human T cells. Although such downregulation is mainly due to post-translational degradation of the TCR complex, down-regulation of TCRα and TCRβ mRNA expression does during this process20,21. To further confirm our finding, we measured TCR gene expression in a variety of CTCL cell lines using RNA sequencing. In comparison with normal peripheral CD4+ T cells from healthy volunteers, all CTCL cell lines showed remarkably decreased expression of TRAC, TRBC1 and TRBC2 (Fig. 2e), coinciding with our findings of transcriptional down-regulation of TCRs in a portion of CTCLs.
Malignant T cells in CTCL are unusual tumor cells because they possess the features of immune cells as well as neoplastic cells. To clarify the activation/proliferation states of malignant T cells, all T cells were plotted by previously defined gene signatures (see Methods). As expected, reactive T cells showed a range of T cell activation statuses with a low G2M score, while a subset of malignant T cells showed a high G2M score with an attenuated activation status, and another subset retained the activation status and showed a low G2M score (Fig. 2f). The discrete activation/proliferation states observed among malignant T cells indicated a high level of heterogeneity in CTCL.
To further explore intratumor transcriptional diversity, a non-negative matrix factorization (NMF) analysis was applied to malignant T cells from each sample (see Methods). On average, five programs from each sample were extracted to generate a total of 65 gene signatures, revealing high intratumor heterogeneity (ITH) in each sample (Supplementary Fig. 2c). Hierarchical clustering of these signatures revealed four main meta-programs, which indicated the collective behaviors of malignant T cells across the heterogeneous transcriptional spectrum of all tumors (Fig. 2g). The four meta-programs represented distinct functional signatures annotated by the top-ranking genes, including T cell signaling and activation (meta-program 1: HLA-DRB1, CD69 and MYC; and meta-program 4: ITK, FYN and CBLB), cell cycle (meta-program 2: MCM7, PCNA and BIRC5) and cell metabolism (meta-program 3: GAPDH, BUA52 and RPS3) (Fig. 2h). Each meta-program existed in multiple samples and represented a biological process operating in a subset of malignant T cells in each sample. Interestingly, meta-program 4 consisted of the TCR-loss clusters in each sample, except for MF28 and MF30. This phenomenon may indicate that the activation-induced TCR complex down-modulation frequently occurs in malignant T cells, and the molecular features of TCR-loss clusters demonstrate heterogeneity across patients. We also noticed that meta-program 3 was enriched with genes encoding ribosomal proteins. It may reflect the scRNA-seq preferentiality of detecting highly expressed genes, or suggesting a dysregulated ribosomal protein expression in malignant T cells22.
Next, we determined whether these four meta-programs were related to patient prognosis by evaluating the top-ranking genes in each meta-program in an independent cohort from our group of 49 tumor-stage MF patients with bulk RNA-seq data on skin tumors23. Notably, a high T cell activation signature (meta-program 1 and meta-program 4) was associated with a favorable prognosis, while a high proliferation signature (meta-program 2) predicted a poor patient outcome (Fig. 2i). The adverse prognostic value of meta-program 2 was further validated in an independent pcALCL cohort with 15 patients, while meta-program 1 and 4 showed no significant effect on clinical outcomes in this pcALCL cohort, which may be associated with the limited sample sizes (Supplementary Fig. 2d and Supplementary Data File 3). These results indicate that malignant T cells in CTCL showed high intra-tumor heterogeneity with regard to transcription and function, and the composition of activation/proliferation programs in each sample determined the clinical behavior of the particular case and patient prognosis.
Inter-lesion divergence in CTCL is the result of a multi-step seeding process by monoclonal malignant T cells with parallel subclonal evolution
ITH provides a diverse genetic and epigenetic background for selection and cancer evolution. However, CTCL presents as multiple skin lesions at the very early stage of the disease. The manner in which the high transcriptional ITH of CTCL is related to the evolution and spreading of skin lesions remains undetermined. To explore the temporal and spatial spreading patterns of CTCL, we analyzed paired tumors obtained from different anatomical sites from 3 CD4+ MF patients (patients MF21, MF28 and MF30). Interestingly, the transcriptome heterogeneity increased along with the time interval of lesion development and the anatomical distance between lesions in individual patients.
In patient MF30 with a ten-year history of MF, in which the two tumors had the longest time interval and distance (MF30-1 developed on the right forearm 12 months prior to biopsy, whereas MF30-2 developed on the left cheek two months prior to biopsy), malignant T cells from two biopsies showed disparate transcriptome profiles, although they shared the same TCR clonotype, while the transcriptomes of benign T cells overlapped, as visualized by UMAP plot (Fig. 3a, b). Consistently, trajectory inference revealed that the evolutionary pathways of the malignant T cells from two separate lesions overlapped minimally along pseudotime (see Methods and Fig. 3c). To reveal the genetic background of this inter-lesion diversity, the single nucleotide variant (SNV) patterns of two samples were analyzed. As expected, the two tumors shared less than 10% non-synonymous mutations, and >90% mutations were private (Fig. 3j). The substantial degree of genetic divergence suggested that these two tumors arose from an early clone that was seeded to the skin of the entire body in the early stage of the disease, possibly before clinically detected lesions appeared, and subclonal tumor evolution subsequently occurred in parallel in the skin.
In contrast, in patient MF21 with a fifteen-year history of MF, whose samples were obtained from two tumors within the same anatomical compartment (MF21-1 developed above the left knee 6 months prior to biopsy, whereas MF21-2 developed below the left knee two months prior to biopsy), the transcriptome profiles of malignant T cells from the two lesions were closely related (Fig. 3d, e). Trajectory analysis revealed similar evolutionary dynamics, indicating the resemblance of transcriptional patterns between adjacent tumors (Fig. 3f). The phylogenetic relationship showed that >60% mutations were on the stem of the tree, illustrating a high degree of shared mutations (Fig. 3k). These results suggested a model in which the subclonal malignant T cells in a well-developed tumor may re-circulate and seed into adjacent skin areas and generate late-arising lesions.
This model was further confirmed by patient MF28 with a three-year history of MF, in which two newly developed tumors (less than 1 month old) at adjacent sites (MF28-1 was on the neck, whereas MF28-2 was between the eyebrows) exhibited highly overlapping transcriptional patterns and almost identical evolutionary trends (Fig. 3g–i). These two tumors might have arisen from the same subclone of an adjacent tumor. Consistently, this patient had folliculotropic MF and showed multiple skin tumors in the head and neck area. Interestingly, a similar pattern of ITH was observed in the two lesions, including the existence of a TCR-loss cluster in each sample (Fig. 3h). This phenomenon suggested that the late-arising lesions were seeded by a cluster of cells, which transmitted the genetic diversity for early-arising lesions via a mechanism analogous to consecutive seeding during metastasis of solid tumors24.
Our results suggested that tumor evolution was a multi-step seeding process. This dynamic evolution leads to the accumulation of ITH within each skin lesion, as well as inter-lesion diversity, and contributes to the heterogeneous clinical features and treatment response observed among different skin lesions in the same patient.
The intrinsic features of malignant T cells suggest distinct CTCL tumor origins and determine diverse clinical outcomes
The complexity of CTCL in clinical settings lies in its various subtypes and classifications. The current classification scheme depends largely on clinical characteristics (e.g., folliculotropic, depigmented, or poikilodermic), cell morphology (e.g., large anaplastic cells or small-medium cells) and T cell types (e.g., CD4, CD8, and CD30), rather than on the intrinsic molecular features of malignant T cells. Since we have defined malignant T cells in each sample, we aimed to subtype the CTCL samples according to the transcriptome patterns of malignant T cells.
We profiled the transcriptional expression patterns of all 25,919 malignant T cells and identified the differentially expressed genes (DEGs) between malignant T cells and their respective reactive CD4+ or CD8+ T cells in each sample from 13 patients. Unsupervised clustering analysis of these DEGs separated all samples from the 10× Genomic dataset into two groups independent of disease subtype (Fig. 4a). In one group, which contained CD4+ MF, CD8+ MF, CD4+ pcALCL, and CD8+ pcALCL, the malignant T cells exhibited an effector memory T phenotype, defined as CD45RO+, CD27−, CD62L (encoded by SELL)- and CCR7−, except for one CD8+ MF (MF14), which exhibited a CD45RA+CD45RO−CD27−CD62L−CCR7− phenotype, consistent with the effector memory T cells re-expressing CD45RA (TEMRA) as previously described (Fig. 4a, b and Supplementary Fig. 3a-b)25,26. Interestingly, malignant T cells in this group expressed a core set of cytotoxic markers, including GZMA, GZMB, PRF1, GNLY and IFNG, especially in all CD4+ MFs (Fig. 4a, c and Supplementary Fig. 3a).
In contrast, the other group, consisting of CD4+ MF samples, uniformly expressed high levels of CD45RO, CD62L, CD27 and CCR7, canonical markers of central memory T cells, whereas cytotoxic makers were absent (Fig. 4a, b and Supplementary Fig. 3a). A similar grouping pattern was seen in the 3 samples in the 5′ UMI Smart-Seq2 dataset (Supplementary Fig. 3c). Accordingly, we divided the 13 patients into a cytotoxic effector memory T cell (TCyEM) group and a central memory T cell (TCM) group. The TCyEM group included four CD4+ MF patients, one CD4+ pcALCL patient, two CD8+ MF patients and one CD8+ pcALCL patient, while all five TCM cases were CD4+ MF patients. Samples from different anatomical sites from one patient were included in the sample group, as expected.
To define the gene expression signature of each group, DEGs common to all samples in each group were identified. To maximize the representativeness of these gene signatures, we excluded samples with a limited number of DEGs. Thereby, 13-gene and 27-gene signatures were identified in the TCyEM and TCM groups, respectively (Fig. 4d). Malignant T cells in the TCyEM group showed upregulated expression of a series of cytotoxic molecules and chemokines, including GZMA, GZMH, NKG7 and CCL4, while the gene signature of the TCM group included several genes previously reported in studies of advanced-stage CTCL, including TOX, KIR3DL2, CD40LG, GTSF1, PTTG1, GAPDH, PPIA and HSPD127–32. To eliminate the effect of the cytotoxic nature of CD8+ MF and pcALCL on the gene signature of the TCyEM group, a refined analysis including only CD4+ MFs in the TCyEM group identified a 19-gene signature. This 19-gene list was highly consistent with the 13-gene signature, including multiple cytotoxic molecules (GZMA, GZMH and NKG7), which confirmed the cytotoxic phenotype of the malignant TCyEM cells (Supplementary Fig. 3d). Within the gene signature of the TCM group, TOX is a crucial T cell development regulator and has been shown as an adjunctive diagnostic marker in CTCL28. KIR3DL2 participates in the regulation of immune response of T cells and has been identified in leukemic CTCL33,34. CD40L (encoded by CD40LG) expression in MF has been related to increased cell growth and homing of neoplastic cells to the skin35. GTSF1 is one of cancer testis (CT) genes which are aberrantly expressed in CTCL32. High expression of GAPDH, indicating a switch to aerobic glycolysis to meet the metabolic demand for tumor proliferation, has been shown to promote aggressive T cell lymphomas36,37. HSPD1 was found to be upregulated in advanced CTCL38. PTTG1 and PPIA were described in previous single-cell analyses in CTCL27,31.
To better understand the characteristics of the two groups, we evaluated the clinical relevance and functional profiles of malignant T cells in each group (see Methods). Clinically, all five TCM patients were at late stage with large cell transformation (LCT) (Fig. 4a), indicating that patients in this TCM group were related to aggressive clinical courses and poor prognoses1. Consistently, malignant T cells in the TCyEM group showed a low exhaustion/high cytotoxicity state, while the TCM group was highly exhausted, supporting the previous finding of high expression levels of exhaustion markers in a portion of CTCL cases39, including PDCD1, CTLA4, TIGIT, and TCF7, a marker for precursor exhausted T cells40 (Fig. 4e and Supplementary Fig. 3e, f). Moreover, discrete activation/proliferation states were identified in different subtypes. Malignant TCyEM cells were highly activated and had low proliferation capacity, while malignant TCM cells were highly proliferative, indicating an increased tendency for progression in this group (Fig. 4f). These phenomena highly recapitulate the features of normal effector memory T cells and central memory T cells26.
We further explored the expression patterns of the four meta-programs in the two groups (Supplementary Fig. 3g). In accordance with the two main profiles, malignant TCyEM cells showed higher scores of meta-program 1 and 4 (T cell signaling and activation), while malignant TCM cells exhibited higher scores of meta-program 2 (cell cycle) and meta-program 3 (cell metabolism). To validate this finding, we analyzed the prognostic values of the gene signatures from the two groups in our independent MF cohort. The TCM gene signature was associated with shorter progression-free survival, whereas the TCyEM signature had no significant effect on prognosis, confirming that TCM features were associated with adverse patient outcomes (Fig. 4g and Supplementary Fig. 3h). Therefore, comparing to the TCyEM group, the molecular features of TCM patients were related to more advanced disease stages, aggressive behavior, and adverse prognosis.
These results raise the question of whether TCyEM is a feature restricted to the early stages of MF, so that phenotype switching to TCM occurs when the disease progresses. To address this possibility, we performed trajectory analysis on all CD4+ T cells in all samples (Fig. 4h). Starting as a benign naïve T cell subset, malignant TCyEM and TCM cells moved towards opposing divergent branches. Notably, the cellular trajectories of Tregs developing from naïve CD4+ T cell were closer to those of the malignant TCM cells. A recent single-cell study identified FOXP3 as a driver of clonal evolution in Sézary cells31, which raised the question of whether CTCL with central memory T cell phenotype was Treg origin. However, Tregs did not show apparent CNVs (Fig. 1d) and had no overlapping clonotypes with the malignant cells in the TCM group (data not shown). Therefore, there was no evidence for Treg-derived malignant T cells in our study.
This finding strongly suggests that malignant TCyEM and TCM T cells represented two distinct tumor origins rather than a phenotypic transition in the process of tumor progression. This hypothesis was further validated by patient MF7, a TCM group patient with more than ten years of follow-up assessment. This patient was diagnosed with early-stage MF (T1N1M0B0) ten years ago. The patch/plaque-stage skin lesions obtained from MF7 ten years ago exhibited a TCM phenotype (coexpressing CD45RO and CD27), confirming that the phenotype of the malignant T cells was determined at the outset (Supplementary Fig. 3i).
We also note that TCR-loss clusters were located at the far end of the trajectories of both of the two distinct malignant developmental pathways (Fig. 4h). In each group, the gene expression profiles of TCR-loss clusters were distinct from their TCR-competent counterparts in each group. In particular, the TCR-loss cluster in the TCyEM group expressed even higher levels of T cell activation-related genes in comparison with the TCR-competent cells, including PTPRC, AHNAK, ITK, ZAP70 and FYN (Supplementary Fig. 3j). In the TCM group, increased expression of cellular metabolic genes, including GAPDH, GTSF1 and PTTG1 were seen in TCR-loss cluster (Supplementary Fig. 3k). These findings were highly consistent with our NMF analysis, and confirmed that the TCR-loss clusters represent a terminal state of malignant T cells. Loss of TCR expression in CTCL may be a result of clonal evolution to avoid deleterious external stimulation in the process of tumor progression.
Collectively, we established a molecular subtyping scheme in CTCL based on the transcriptome of malignant T cells. We defined a TCyEM group, in which the malignant T cells showed an activated cytotoxic effector memory T cell phenotype, as well as a TCM group, in which malignant T cells were central memory T cell-like cells with high proliferation and exhaustion status. This subtyping may represent the distinct origins of malignant T cells, and are related to patient outcome as assessed in a larger cohort (n = 49).
CD8+ TILs are major antitumor effector cells in CTCL and are more exhausted in the TCM group
Tumor progression is mediated by reciprocal interaction between tumor cells and their surrounding TME41. The TME influences cancer cells, and in turn cancer cells have been shown to dictate and modify the surrounding TME13. The manner in which differences in malignant T cell phenotypes influence the TME in CTCL lesions is not well understood. To this end, we analyzed and compared cells in the TME from the TCyEM and TCM groups, with a focus on tumor infiltrating lymphocytes (TILs). We first investigated the TCR repertoires of reactive T cells in all samples (Fig. 5a). As expected, the reactive CD8+ T cell subset harbored the most frequent clonal expansion, and 44.3% of these cells were clonally expanded, suggesting that CD8+ TILs are major anti-tumor effector T cells in CTCL. Tregs exhibited a lower level of clonal expansion (less than 20% cells), while CD4+ Tconv cells were least expanded (2.1%) (Fig. 5a), in accordance with previous findings that malignant T cells conferred an impaired antitumor immunity upon normal T helper cells in CTCL16.
We re-clustered all CD8+ TILs and identified two subsets with distinct exhaustion status (Exlow subset and Exhigh subset)., defined by the expression of exhaustion markers PDCD1, HAVCR2, CTLA4, LAG3, and TIGIT (Fig. 5b). Exlow CD8+ TILs exhibited a relatively higher TCF7 expression and lower PDCD1, HAVCR2, CTLA4, LAG3 and TIGIT, indicative of a precursor exhausted T cell phenotype40. Exhigh CD8+ TILs showed a low TCF7 expression, with higher levels of the other exhaustion markers, suggesting a terminally differentiated exhausted T cell phenotype. Notably, the TCyEM group harbored more Exlow CD8+ TILs with more frequent clonal expansion, whereas the TCM group was enriched with more Exhigh CD8+ TILs featured by a restricted TCR repertoire (Fig. 5c). CD8+ TILs are antigen-specific and require class I major histocompatibility complex (MHC-I) restriction. Therefore, MHC-I expression by malignant T cells was explored. Significant down-regulation of MHC-I molecules was observed in the TCM group, which may impair the immune recognition of CD8+ TIL cells in this group (Fig. 5d). Moreover, malignant T cells in the TCM group expressed a higher level of Galectin-9, encoded by LGALS9, a canonical ligand of exhaustion marker Tim-3 (encoded by HAVCR2) (Fig. 5d, e)42. The Galectin-9-Tim-3 interaction was shown to be involved in immune escape by human cancers42. Therefore, we proposed that global down-regulation of MHC-I molecules and upregulation of galectin 9 collaboratively contributed to tumor evasion from cytolytic CD8+ TIL-mediated antitumor immunity in the TCM group.
The cell origin of malignant T cells defines TME landscapes
Emerging data have highlighted the complexity and heterogeneous composition of the TME in T cell lymphoma43. To gain more insight into the potential roles of other immune cells in the TME, we next re-clustered non-T cells from all samples to reveal the global immune landscape (Fig. 6a). Among the diverse cellular components, B cells and macrophages were the dominant cell populations. Interestingly, the TCyEM group showed a relatively higher fraction of macrophages, and the TCM group had remarkably higher abundance of B cells and plasma cells (Fig. 6a and Supplementary Fig. 4a).
A subset of NK cells, an important component of cytotoxic immune cells, existed in all samples. The NK subset displayed a CD16high CD56low (encoded by FCGR3A and NCAM1, respectively) phenotype, considered as a mature and highly cytotoxic state44 (Supplementary Fig. 4a). In addition, a subset of ILC1s exhibiting NK-like gene markers (NKG7, NCR1 and TBX21) with an IL7Rhigh/EOMESlow molecular profile was identified as previously described45. NK cells demonstrated cytolytic capacity greater than that of ILC1s, as suggested by the expression levels of cytotoxic markers (Supplementary Fig. 4a). The NK/ILC1 ratios of the TCM and TCyEM groups were comparable, suggesting that NK cells were universally activated in all CTCL cases and may serve as a therapeutic agent for anti-tumor immunotherapy in CTCLs.
Extensive interactions between malignant T cells and various immune cells in both groups were predicted by immune-associated ligand-receptor pairs using the CellPhoneDB repository (see Methods and Fig. 6b). The reciprocal tumor/macrophage and tumor/B cell interaction patterns were distinct between the two groups, prompting us to explore their respective roles in the two CTCL subtypes.
M1 and M2 scores for tumor associated macrophages (TAMs) were calculated according to previously reported markers46. Notably, a higher M2 score was observed in TAMs in the TCyEM group (Fig. 6c). CD163, a classic M2 macrophage marker, was highly expressed in the TCyEM group (Fig. 6d and Supplementary Fig. 4b, c). The presence of CD163-expressing M2 TAMs has been associated with poor clinical outcomes in CTCL, and depletion of M2 macrophage was found to suppress CTCL development in vivo10,47,48. CD163+ M2 macrophages could foster an immunosuppressive and pro-tumorigenic inflammatory microenvironment by inducing T cell tolerance and producing pro-angiogenic cytokines such as vascular endothelial growth factor (VEGF)10,49. Accordingly, we observed high expression levels of VEGFA and MAF (encoding a key transcriptional factor of IL10) in the macrophage population in TCyEM patients (Fig. 6d).
To determine the interplay between M2 TAMs and malignant T cells in TCyEM patients, further analysis revealed significantly upregulated CCL5 and CCL4 in malignant T cells, coupled with higher expression of their classical receptors, CCR1 and CCR5, in the macrophage cluster (Fig. 6d, e). CCL5 and CCL4 are involved in regulating macrophage trafficking, and CCL5/CCR5 interaction was reported to induce the polarization of CD163+ M2 in cancers50. Therefore, we proposed that malignant TCyEM cells appeared to produce high levels of CCL5 and CCL4 to recruit and polarize CD163-expressing M2 TAMs, forming a tumor-supporting environment in TCyEM cases.
On the other hand, the B cell cluster, which was highly enriched in TCM patients, expressed a high level of CD40, which is associated with B cell activation and Breg differentiation under CD40L stimulation (Supplementary Fig. 4a)51. Since CD40LG was included in the gene signature of malignant T cells in the TCM group, we hypothesized that malignant T cells may employ the CD40L/CD40 axis to activate B cells in TCM cases. We confirmed the presence of B cells with high tumor infiltration capacity in this group by immunostaining (Supplementary Fig. 4b, c). CD40L-expressing malignant T cells were located in proximity to CD40-expressing B cells in TCM lesions (Fig. 6f, h and Supplementary Fig. 4g). We further validated the close relationship between B cells and TCM cells in our independent MF cohort, and we identified a positive correlation between malignant TCM marker CD40LG and mature B cell marker CD20 (encoded by MS4A1) (Supplementary Fig. 4d).
B cells are abundant in many human cancers and play cancer-specific roles in antitumor immunity52. B cells have been reported to be upregulated in a portion of MF patients53. A recent study reported that depletion of mature B cells in an aggressive folliculotropic MF patient achieved improved disease control, suggesting a deleterious effect of B cells in CTCL11. Studies have revealed that B cells, especially Bregs, could promote tumor progression by inducing resting CD4+ T cells to transform into Tregs in various cancers51. Although the TCM and TCyEM groups showed a similar number of Tregs and similar expression levels of FOXP3, the Treg population of TCM cases showed higher expression levels of LAG3 and TIGIT, suggesting enhanced suppressor activity in this subset of cells54,55(Fig. 6g and Supplementary Fig. 4e, f). Specifically, the LAG3+ Treg has been recognized as a subset of cells endowed with potent immunosuppressive capacity, and this type of Treg has been found to preferentially expand in tumors56. Therefore, we proposed that malignant T cells activated B cells in TCM patients via a CD40L/CD40 axis, and highly infiltrated B cells may promote tumor progression by activating Tregs.
Taken together, our results demonstrate the roles of M2 macrophages and the B cell-Treg axis in the TMEs of the TCyEM and TCM groups, respectively. To corroborate and illustrate the intricate interactions between malignant T cells and the TME, multicolor IHC staining of tumor tissues was performed. The interplay between M2 macrophages/TCyEM cells and B cells/LAG3+ Tregs/TCM cells was demonstrated in the TCyEM and TCM groups (Fig. 6h and Supplementary Fig. 4g). Our results suggest that the diversity of malignant T cell origins determines the heterogeneity of TME, which may contribute to patient outcomes. Thus, anti-CTCL immunotherapy should be tailored according to each patient’s molecular subtype.
Discussion
High-throughput multi-dimensional single-cell analysis provides a powerful tool to reveal the dynamics of tumor heterogeneity. In our study, using scRNA-seq with paired TCR V(D)J sequencing, we carried out a comprehensive transcriptional analysis on 19 skin lesions from 15 patients across two major CTCL variants MF and pcALCL, and depicted the nature of malignant T cells and their multicellular ecosystems. We put forward a tumor evolution model underlying the pathogenesis of CTCL and present a dimension of molecular subtyping based on tumor origins.
We adopted a reliable approach to precisely distinguish malignant T cells from reactive T cells, providing a solid basis for achieving a deep understanding of the pathobiology of CTCL. Our data lend strong support for a monoclonal origin for malignant T cells derived from mature skin-homing T cells, in contrast with recent studies demonstrating an oligoclonal or polyclonal nature of CTCL8. This discrepancy might be caused by contamination of reactive T cells, because laser-capture microdissection based on cell morphology has been widely used to dissect tumor cells8. We found that reactive T cells, especially CD8+ TILs, displayed a substantial amount of clonal expansion, which could result in the designation of polyclonality in CTCL lesions. To a lesser extent, the absence of allelic exclusion of TCRA might be another reason for this discrepancy14. The bi-allelic TCRA expression has been previously reported on Sézary cells by WES study57. Our study was able to pair the bi-allelic TCRA to TCRB sequence in the single-cell dimension to demonstrate the monoclonal nature of malignant T cells. Therefore, the single-cell approach appears to be a superior method for tracing the development of tumor progression in future studies.
Interestingly, malignant T cells in a portion of patients lacked TCR expression, which was previously reported in Sézary cells57. These TCR-loss clusters exist in both TCyEM and TCM groups, representing a universal phenomenon in CTCL, although the clinical significance regarding patient prognosis was not clear due to the limited sample size of our scRNA-seq cohort. Down-regulation of TCR expression represents a negative feedback mechanism for constraining T cell effector function to avoid excess inflammatory damage in reactive T cells21. Whether malignant T cells induced a similar mechanism to prevent harmful signal transduction during tumor progression warrants further exploration.
Our data revealed diverse transcriptome programs in the malignant T cells within each sample, demonstrating remarkable subclonal evolution in CTCL. This finding was consistent with a recent study on Sézary syndrome, in which malignant T cells from peripheral blood developed multiple subclones according to culture conditions58. We found that distinct activation/proliferation status of the malignant T cells determined the properties of each subclone. Malignant T cells demonstrated either a high proliferation score or a high activation score, suggesting diverse underlying genetic backgrounds. These results are consistent with a previous genomic analysis revealing that putative driver oncogenes in CTCL were varied and involved multiple pathways, including T cell activation, chromatin modification, and cell cycle control59. Our prognosis analyses on each meta-program across all samples also demonstrated that the intra-tumor and inter-tumor heterogeneity in the composition of the activation/proliferation cell clusters determined the clinical behavior of each skin lesion and the patient prognosis.
Our data confirmed the monoclonal origin of malignant T cells across skin lesions in individual CTCL patients. Because each skin tumor is identifiable once fully established, and accessible to biopsy, we are able to show a multi-step and parallel transformation model for malignant T cells: a single mutated ancestor of the malignant T cell clonally expands and seeds into skin niches at different anatomical sites long before clinically detectable skin lesions occur. Parallel subclonal evolution occurs independently over time and results in the development of new skin rashes. As the skin tumor evolves, multiple waves of dissemination of subclones occur and colonize adjacent anatomical sites, developing new lesions (Fig. 7a). This model explains the intra-lesion heterogeneity and inter-lesion diversity of CTCL patients, as well as differences in the therapeutic vulnerabilities of lesions at different anatomical sites.
In contrast with solid tumors which initiate from a primary site, or the skin involvement of hematological malignancies which demonstrate homogenous skin lesions60, CTCL, especially MF, generally begins with multiple skin lesions appearing in succession across long time intervals, even decades, and the lesions on different anatomical sites are varied61. Patches, plaques, and tumor-stage lesions may exist simultaneously on the same patient, and they may show distinct responses to treatment9. Our findings suggest that skin lesion spreading is an evolutionary process with temporal and spatial dynamics that highly resemble those of the multi-step process of metastatic colonization by solid tumors24,62.The seeding events take place across the whole course of the disease development and may depend on the characteristics of malignant T cells and the microenvironment of different skin niches.
The route of malignant T cell dissemination is still debated. It has been reported that skin-resident T-cells do not recirculate; they survive and proliferate in the skin without migrating to the lymph nodes63,64. More recent studies have shown that human skin CD4+ resident memory T cells can exit the skin, reenter the circulation, and travel to secondary human skin sites65,66, supporting our model of the spreading of CTCL lesions. Previous studies with TCR-sequencing also supported this model by demonstrating low but detectable malignant clones in adjacent non-lesional skin and peripheral blood in patients with skin-limited CTCL67,68. Noting that our model is based on a limited sampling of skin lesions from a limited number of patients, it is impossible to conclude when the cells first disseminate and whether the temporal or the spatial aspect determines the differences in the trajectories. Factors other than time and anatomical proximity may also play vital roles in the process of tumor evolution. Future large‐scale studies that involve repeated biopsies of multiple skin samples from the same patient combined with longitudinal and spatial sampling will help to corroborate this model. Robust bioinformatic analysis will be required for reliable documentation of malignant T cell dissemination patterns in CTCL.
We proposed a molecular subtyping scheme for CTCL based on the transcriptomic features of malignant T cells. Our findings expand our understanding of CTCL cellular origin of CTCL, which was first elucidated by a seminal study demonstrating that MF arises from a skin-resident effector memory T cell subset, while Sézary syndrome arises from central memory T cells with variable expression of skin addressin CLA5. We excluded the possibility that the five patients in the TCM group were Sézary syndrome cases, because none of them were in B2 stage. Therefore, our data showed varied T cell subset origins within MF patients. This finding was supported by a recent study describing discrete CD4+ T-cell subsets in the neoplastic cells of MF, including naive T cells (TN), TCM, TEM and TEMRA subsets25. We identified distinct molecular signatures in the TCM and TCyEM subtypes. The TCyEM group showed high expression of cytotoxic molecules. Accordingly, natural cytotoxicity receptor NKp46 and granzyme B have been reported to be expressed by atypical cells in some CD4+ CTCL cases, including MF and pcALCL patients69–71. We also observed that CD28 was downregulated in this group (Supplementary Fig. 3a), which was consistent with a previous study demonstrating that loss of CD28 expression was a hallmark of CD4+ cytotoxic T cells72. This group contains CD4+ MF, CD8+ MF, and pcALCL cases, suggesting that a portion of MF are closely related to pcALCL. These results may also explain the highly overlapping histology features between pcALCL and MF, and the frequent concurrence of pcALCL and MF in the same patient39. The gene signatures of the TCM group include TOX, KIR3DL2 and GTSF1, which were previously reported in tumor-stage MF and Sézary syndrome32,34,73,74. TOX was shown to be highly expressed in both MF and Sézary syndrome, and its expression is related to poor patient prognosis28,73. KIR3DL2 has been as a diagnostic marker for Sézary syndrome34. KIR3DL2-targeted therapy shows potent antitumor activity against CTCL in preclinical and clinical studies33. The TCM group consists of pure CD4+ MFs, and its signature is associated with an adverse effect on patient survival, suggesting that some CD4+ MF lesions may share a cellular origin with Sézary syndrome and could require more aggressive treatment. Targeted treatment for Sézary syndrome, including anti-KIR3DL2 antibodies, may be applicable in this group of MF patients. Therefore, MF shows a diversified T cell subset origin, which underlies its varied biological and clinical characteristics.
We showed that the origins of malignant T cells determined the particular characteristics of their distinct TMEs (Fig. 7b, c), which could be exploited to develop tailored immunotherapy strategies for different subtypes of patients. Tumor-associated macrophages, especially M2 TAMs, promote angiogenesis, tumor cell invasion, intravasation, and tumor metastasis, and macrophages have been shown to be good targets for anticancer therapy49. Bexarotene, an FDA-approved drug for advanced CTCL, has been shown to reduce production of CCL22 from macrophages75. Thus, bexarotene, macrophage-targeted therapy, or drugs repolarizing macrophages, e.g. interferon α76, may benefit patients in the TCyEM group. Alternatively, malignant TCM cells may evade anti-lymphoma immunity by an orchestrated interaction among malignant T cells, exhausted CD8+ TILs, B cells, and Tregs. Administration of immune checkpoint blockers or B cell depletion treatment may show enhanced efficacy in patients in the TCM group. The major obstacle to conducting clinical trials in CTCL patients is that the response rates of almost all available treatments are highly variable. Therefore, molecular subtyping is critical to stratify CTCL patients. Future precision medicine for CTCL patients relies on further elucidation of the distinct characteristics of the tumor ecosystem of each patient subset.
This study, however, is subject to several limitations. First, our study is mainly descriptive, with limited sample size. Further validation studies with a large cohort in CTCL will be required. Second, our sorting strategy in scRNA-seq aimed to maximize the sequencing of T-cell populations; thus, non-T cells were sorted disproportionally in the process of FACS, which may limit the interpretation of the tumor microenvironment analysis. Although we have validated the abundances of the non-T immune cells in each sample, especially macrophages and B cells, with immunohistochemistry, this sorting strategy still influenced the relative number of CD45- non-immune cell population, especially in the Smart-seq samples, which were densely infiltrated by atypical T cells in histology and only a few non-T cells were recovered. Third, due to the technical limitation of scRNA-seq with 10× Genomics platform, the numbers of detected genes from scRNA-seq data are lower than the bulk RNA-seq method77. This drawback mainly affects the detection of low-abundance genes, e.g., cytokines and chemokines, limiting the interpretation of T cell functions and cell-cell interactions in this study.
Collectively, our work establishes a conceptual foundation for understanding CTCL tumorigenesis and proposes a molecular subtyping strategy incorporating the phenotypes of malignant T cells and the multicellular immune landscape. Our findings may create therapeutic benefits for CTCL patients and serve as a useful resource for further studies.
Methods
Patient sample
Fifteen CTCL patients with informed consent were recruited from the Skin Lymphoma Clinic of Peking University First Hospital in the period from 2018 to 2019. All patients were either treatment naïve or did not receive anti-lymphoma therapy at least 6 months prior to the biopsy. All diagnoses were verified by at least two dermatopathologists according to previously described criteria9. For scRNA-seq, thick plaques or tumors were selected and resected from 15 patients corresponding to 19 samples. Each freshly dissociated sample was transported to laboratory immediately. All participants provided written consent for specimen collection and analysis under the study protocol approved by the Peking University First College Hospital Ethics Committee. We have complied with all relevant ethical regulations for work with human participants.
Tissue dissociation and single-cell suspension preparation
After immediate transportation, each fresh tissue was minced into pieces (<1 mm3) on ice and digested with collagenase. To obtain single-cell suspensions with a high viability rate, the solution was transferred into a gentleMACS C Tube (Miltenyi Biotec, 130-093-237) and placed into a gentleMACS octo dissociator (Miltenyi Biotec, 130-095-937), in which it was incubated for 45 min at 37 °C. After digestion, the suspension was filtered using a Falcon 40-μm cell strainer (Corning, 352340). Next, 10 μL cell suspension was used to confirm the cell viability with trypan blue staining (Solarbio, C0040). The sample was then centrifuged and resuspended with serum-free phosphate-buffered saline (PBS) to prepare it for cell staining and flow cytometry.
Single cell isolation
Each single-cell suspension was stained with CD45-PE and CD3-BV421 for 15 min on ice in the dark. 7-AAD (BD Pharmingen) was added prior to fluorescence-activated cell sorting (FACS). Single cell isolation was performed using FlowJo (v10.7.1, BD Inc, USA). We sorted all T cell populations within the CD45+CD3+ gate. To elucidate the tumor microenvironment, we also sorted other subtypes disproportionally, including the non-T immune cell population (CD45+CD3−) and non-immune cell population (CD4−). Sorted single cells were collected into 96-well plates for downstream 5′ UMI SmartSeq2-seq (with 10 μL lysis buffer in each well) or mixed into a FACS tube containing 10 μL lysis buffer for downstream drop-based scRNA-seq.
Immunohistochemistry and immunofluorescence
Paraffin-embedded tissue sections were deparaffinized and rehydrated. After heat antigen retrieval, endogenous peroxidase activity was blocked (only for immunohistochemistry). For immunohistochemistry, the tissue sections were visualized by NDP View2 Viewing software. Staining results were evaluated by Image-Pro Plus 6.0 (Media Cybernetics, USA). For Immunofluorescence, images were captured by fluorescence confocal microscopy (Leica Confocal) and digitalized by Leica software.
Primary antibodies used in immunohistochemistry and immunofluorescence are listed as follows: CD4 (1:100, Abcam, ab133616, clone EPR6855), TCRα (1:100, H-1, santa, sc-515719), CD45RO (1:400, CST, 55618 s, clone UCHL1), CD27 (1:500, Abcam, ab131254, clone EPR8569), granzyme A (1:100, Abcam, ab209205, clone EPR20161), granzyme B (1:800, CST, 17215 s, clone D2H2F), perforin (1:10, eBioscience, 14-9994-82, clone deltaG9), granulysin (1:250, Abcam, ab241333, clone EPR22110-101), TOX (1:300, Abcam, ab237009, clone NAN448B), LGALS9 (1:100, Abcam, ab227046, clone EPR22214), CD40 (1:250, Abcam, ab224639, clone EPR20735), CD40L (1:100, Abcam, ab257319, clone CD40LG/2761), CD20 (1:50, Abcam, ab78237, clone EP459Y), FOXP3 (1:100, Abcam, ab20034, clone 236 A/E7), LAG3 (1:1000, Abcam, ab209236, clone EPR20261), CD163 (1:500, Abcam, ab182422, clone EPR19518), CD8 (1:1000, Proteintech, 66868-1-1 g, clone 1G2B10) and CD45RA (1:300, Millipore, 05-1413, clone MEM 56), TIGIT (1:500, CST, 99567 T, clone E5Y1W).
Multicolor immunohistochemistry
To evaluate the multicellular ecosystem and the spatial distributions of different cell types within the TMEs of the TCyEM and TCM groups, we performed multicolor immunohistochemistry using FFPE tissue sections of 7 patients from our cohort in this study. The PANO Mutiplex IHC Kit (Panovue) was used for panel 1: TOX (1:200, Abcam, ab237009, clone NAN448B), CD20 (1:50, Abcam, ab78237, clone EP459Y), FOXP3 (1:100, Abcam, ab20034, clone 236 A/E7) and LAG3 (1:100, Abcam, ab209236, clone EPR20261) or Panel 2: GZMA (1:200, Abcam, ab20034, clone EPR20161), CD4 (1:100, Abcam, ab133616, clone EPR6855), CD8 (1:1000, Proteintech, 66868-1-1 g, clone 1G2B10) and CD163 (1:100, Abcam, ab182422, clone EPR19518). The primary antibodies in each panel were sequentially applied, followed by secondary antibody incubation and tyramide signal amplification (TSA). Between each round of staining, antigen retrieval was performed on the slides with heat-treatment. Finally, DAPI was applied to stain nuclei. Each slide was scanned using a Mantra System (PerkinElmer) by capturing the fluorescent spectrum at 20-nm wavelength intervals (420–720 nm) with uniform exposure time, after which the spectra were combined into a single stack image in each panel. The spectrum of each fluorophore and tissue autofluorescence were extracted from the images of single-stained and unstained tissue samples and analyzed by InForm software (PerkinElmer).
Pre-processing and quality control of scRNA-seq data (10× genomics)
Raw sequencing data from the 10× Genomics platform were converted to fastq format by ‘CellRanger mkfastq’ (v.3.0.2). Next, scRNA-seq reads were aligned to the hg19 reference genome using ‘CellRanger count’ (v3.0.2), and scRNA-TCR data were aligned to the vdj-GRCh38 reference genome. We filtered mitochondrial and ribosomal genes for further analysis. All cellranger output was combined using ‘CellRanger aggr’ (v3.0.2).
To further aggregate and analyze results from the above pipeline using ‘CellRanger aggr’, a stringent data quality control procedure was conducted in the downstream analysis. Only genes detected in at least 10 cells were retained. We filtered out cells with fewer than 500 or more than 5000 detected genes and those with a high mitochondrial content (>10%)78.
After discarding poor-quality cells, a total of 58,926 cells were retained for downstream analysis. To normalize the library size effect in each cell, we scaled UMI counts using scale.factor = 10,000. Following log transformation of the data, other factors, including “S.Score”, “G2M.Score”, “percent.mt”, “nCount_RNA” and “nFeature_RNA”, were corrected for variation regression using the ScaleData function in Seurat (v3.0.2).
The corrected-normalized data metrics were applied to the standard analysis as described in the Seurat R package. The top 3,000 variable genes were extracted for principal component analysis (PCA). The top 30 principal components were kept for UMAP visualization and clustering. We performed cell clustering using the FindClusters function (resolution = 0.2) implemented in the Seurat R package.
Processing of scRNA-seq data based on 96-well plates
First, raw reads data were demultiplexed based on the cell barcode sequence from Read 2 of the paired-end reads. Then, we trimmed the sequences of read 1 to remove low-quality bases (N > 10%), the TSO and polyA tail sequences, and sequences contaminated with adapters. The filtered sequence of read 1 was aligned to the hg19 human genome reference using TopHat (v2.0.12). All the uniquely mapping reads were counted using the htseq-count function from R HTseq package. The duplicated transcripts with identical UMIs in each gene were discarded. Finally, we quantified the transcript number based on the distinct UMIs of each gene in each cell. To visualize the expression matrices of scRNA-seq data based on 96-well plates, the Seurat R package was also applied with a pipeline similar to that described above.
Cell type annotation and cell state definition
To annotate the cell type for each single cell within all clusters, differential expression genes (DEGs) of each cluster from the 10× Genomics dataset were identified using the FindAllMarkers analysis in the Seurat R package. The top 50 DEGs of each cluster were carefully reviewed. The markers used to define cell types included: PTPRC, CD3E, CD3D, and CD3G for T cells; CD19, CD79A, MS4A1, and CD79B for B cells; LYZ, CD14, CD68, and C1QA for macrophages; ACTA2, TAGLN, MYLK, and AOC3 for myofibroblasts; PLVAP, CLDN5, ENG, and AQP1 for endothelial cells; IGLL5, MZB1, DERL3, and FKBP11 for plasma cells; DCN, COL1A1, COL3A1, and COL1A2 for fibroblasts; GNLY, KLRB1, IL7R, GZMB, NKG7, PRF1, FCGR3A, and KLRD1 for ILC1s and NKs; KRT14, KRT5, KRT6A, and KRT17 for epithelial cells; IRF7, LILRA4, PLD4, and PLAC8 for pDCs; FSCN1, LAMP3, CD1E, and CCR7 for DCs; CPA3, HPGD, CTSG, and GATA2 for mast cells; PMEL, TYRP1, DCT, and MLANA for melanocytes46,79. In parallel, heatmap plots were generated used the top five genes in each cell type. Each cluster from the 5′ UMI scRNA-seq data was annotated following the process described above.
Cell subclustering analysis
For the subclustering analysis, T cell clusters, non-T cell clusters and CD4+ T cells from individual patients (MF21, MF28 and MF30) were extracted from the overall integrated dataset and integrated for further subclustering. After integration, genes were scaled to unit variance. Scaling, PCA, and clustering (using a resolution of 0.6 for T cell clusters, 0.2 for non-T cell clusters and 0.2 for CD4+ T cells from individual patients) analyses were performed as described above.
Copy number variations (CNVs) based on WES Data
We obtained WES data from a subset of samples (n = 13) corresponding to 11 patients. DNA was isolated from the CTCL biopsies using the TIANamp genomic DNA Kit (TIANGEN, DP304). Granulocyte DNA isolated from matched blood was used as a germline control sample80. We performed paired end, 150 bp read-length sequencing using Illumina NovaSeq 6000 platform. The mean sequencing depths of skin samples and blood control samples were 190.0× (range 101.4× to 256.4×) and 105.9× (range 89.9× to 132.2×), respectively (Supplementary Table 2). For the WES data, the sequenza R package (v3.0.0) was used to call CNVs81. Briefly, we used BAM files from the WES data for tumors and matched blood samples as input to calculate the depth ratio, which was normalized based on the GC content bias and data ratio. To acquire segmented copy numbers and estimate tumor purity, the following parameters were used: breaks.method = ‘full’, gamma = 40, kmin = 5, gamma.pcf = 200, kmin.pcf = 200 and assembly = “hg38”. For each tumor sample, the copy numbers of segments were then divided by ploidy following log2 transformation. The tumor purity of each sample was in the range of 0.21–0.73 (Supplementary Table 2).
Inferring CNVs based on scRNA-seq data
To differentiate malignant T cells from reactive T cells, large-scale CNVs were inferred from the scRNA-seq data using the R infercnv package (v1.7.1). Reactive T cells and other cell types were used as controls for the CNV analysis. We estimated the CNV patterns of all cells in each sample using default parameters12. For the WES data, the CNV patterns were called by the sequenza R package (v3.0.0) as described above. The true CNV was used as a positive control to identify differences in the two CNV patterns. Malignant T cells were distinguished from reactive T cells mainly based on inferred large-scale CNVs, aneuploid status, feature marker gene expression and cell cluster distribution.
Signature score analysis based on gene sets
We used a core gene set to perform cell cycle analysis based on 43 G1/S genes and 54 G2/M genes12. A cell cycle score was calculated for each cell using the CellCycleScoring R function. Finally, each cell was assigned with a prediction classification in either the S, G2M or G1 phase based on its S and G2M scores.
To evaluate the status of T cell clusters, we evaluated cell states using datasets described in Jin et al. 12 and Gene Ontology (GO): a cytotoxicity gene set (GZMA, GZMB, GZMK, IFNG, NKG7, PRF1, CST7 and CCL4), an exhaustion gene set (PDCD1, LAG3, TIGIT, HAVCR2 and CTLA4) and an activation gene set (GO:0002286). We calculated the gene signature scores for these gene sets across all subgroups and samples using the R ggplot2 package (v3.3.2).
Pseudotime analysis
To construct developmental trajectories along pseudotime for CD4+ T cells, including malignant T cells and reactive T cells, the monocle R package (v2.16.0) was applied to the transcriptional expression patterns of the cell subtypes in all samples and individual patients (MF21, MF28 and MF30) from the 10× Genomics dataset. The expression matrices of the scRNA-seq data were analyzed with default settings following the vignette of Monocle 2. We plotted CD4+ reactive T cells and malignant cells along the inferred cell trajectories.
Expression programs of intratumor heterogeneity
To reveal the intratumor heterogeneity of CTCL, a non-negative matrix factorization algorithm documented by the R NMF package (v0.23.0) was applied to malignant T cells from each patient (including the 5′ UMI and 10× Genomics datasets)82. For the expression matrix, we filtered genes with standard deviation of expression <0.5 within each patient. NMF was applied to the relative expression values, with all negative values replaced by zero. We successfully extracted five potential programs from each patient, which generated 65 intratumor expression programs among the 13 patients. To investigate common programs among the 65 signatures, the programs were clustered into meta-programs by hierarchical clustering, using overlapping genes (among the 50 top-scoring genes of each program) as a similarity metric. Four meta-programs (two T cell signaling and activation, one cell cycle and one cell metabolism) were identified and further analyzed. For meta-programs, all genes within the programs were defined as the meta-signature. The gene lists of the 4 meta-programs are provided in Source Data file.
Cell-cell interaction analysis
We conducted cell-cell interactions analysis utilizing the cellphonedb function curated by the CellPhoneDB v.2.0 database (www.cellphonedb.org)83. ScRNA-seq counts files and cell type markers were used as input data. According to the expression of a receptor by one cell type and a ligand by another cell type, enriched ligand-receptor interactions between pairs of cell types were calculated. The P value for the likelihood of cell-type specificity of a given ligand-receptor complex was calculated based on the proportion of means that were as high as or higher than the actual mean. The significant cell-cell interactions were selected with a P value threshold of <0.05. We used the R ggplot2 package (v3.3.2) to visualize the results.
Mutation calling with WES data
For the WES data, somatic variants were detected using the GATK (v4.1.8.1) Best Practices Pipeline (GATK broadinstitute.org). Paired-end reads were aligned to human genome hg38 (UCSC) using the Burrows-Wheeler Aligner (BWA) (v0.7.17) with default parameters84. Samtools (v0.1.19) was used to convert SAM files to compressed BAM files and sort the BAM files by chromosomal coordinates85. Then, GAKT MarkDuplicates was used to mark PCR duplicates. Next, base quality recalibration was performed with GAKT BaseRecalibrator and ApplyBQSR. Granulocyte DNA isolated from matched blood was used as a germline control sample80. We used GAKT Mutect2 to call somatic SNVs and indels, after which we used GATK FilterMutectCalls to filter out false callings.
Phylogenetic tree analysis
To reveal the clonal relationship between paired tumors from patients MF21 and MF30, phylogenetic trees were constructed using MEGA-X (v10.2.2)86. Sequences 20 bp in length surrounding the non-synonymous mutations (including SNVs and INDELs) were extracted to construct the phylogenetic trees of each patient based on the maximum-parsimony algorithm. All phylogenetic trees were further optimized using Adobe Illustrator. We labeled potential driver events for each patient on each tree.
Quantification and statistical analysis
In this study, R (v 4.0.3) and GraphPad Prism (v6.1) software were used to conduct statistical analysis. The Mann–Whitney U test, Spearman rank correlation test, log-rank test and Pearson’s chi-square test were used. Detailed statistical analysis methods are described in the Figure Legends or main text.
Illustration tool
Some of the elements of Figs.1a, 3a, d, g and 7a–c were created with BioRender.com (https://biorender.com).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This study was supported by the National Nature Science Foundation of China (81922058, Y.W.), National Science and Technology Major Project (2019YFC1315702, F.B.), Guangdong Province Key Research and Development Program (2019B020226002, F. B.), National Youth Top-Notch Talent Support Program (283812, Y.W.), and Peking University Clinical Medicine plus X Youth Project (PKU2019LCXQ012, Y.W. and F.B.). We appreciate the generous gifts of CTCL lines MJ, PB2B, Myla, Hut78 and SZ4 from Youwen Zhou (Department of Dermatology and Skin Science, University of British Columbia) and Mac-1 and Mac-2A from Marshall E. Kadin (School of Medicine, Boston University).
Source data
Author contributions
Y.W. and F.B. conceived the study, directed and supervised the research. F.B. was responsible for bioinformatics investigation. Y.W. and P.T. reviewed the histopathologic diagnoses. Y.W., P.T., X.L., H.P., P.L., J.S., Y.G., F.L., Y.X., Yy.L., Yj.W. Z.C., B.X. and Y.L. contributed to the enrollment of CTCL patients and the collection of patient information. M.R., Q.L., S.Y. and H.L. contributed to the preparation of patient clinical samples. X.L., S.J. and S.H. performed the tissue dissociation and single-cell suspension preparation and flow cytometry analysis. S.A.T. and M.H. helped with the design of the study as well as provided advice on the statistical analysis of scRNA data and WES data. S.J., S.H., X.L., R.L., Y.L., D.X., Z.L. and Y.Z. contributed to data processing and Figures generation. X.L., H.P., P.L., Y.X. and Yy.L. conducted experiments and interpreted the data. All the authors contributed to manuscript preparation. X.L. wrote the original manuscript. Y.W., F.B., S.A.T. and M.H. reviewed and edited the manuscript. Y.W. and F.B. are the corresponding authors of this manuscript.
Peer review
Peer review information
Nature Communications thanks Anton Langerak and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Data availability
All relevant data are available in publicly accessible repositories. The raw data from the scRNA-seq, TCR-seq, bulk exome sequencing and some bulk RNA-seq data have been deposited in the Genome Sequence Archive for human (GSA-Human) under accession number HRA000166. The remaining raw bulk RNA-seq data have been deposited in the Gene Expression Omnibus database under accession numbers GSE168508 and GSE10962023,87. To comply with the “Guidance of the Ministry of Science and Technology (MOST) for the Review and Approval of Human Genetic Resources”, we are required to deposit the genomic data of Chinese patients under a controlled access at the GSA in Beijing Institute of Genomics Data Center. To gain access to the raw data under accession number HRA000166, please submit requests to the GSA-Human online page for this study [https://ngdc.cncb.ac.cn]. For scientific research purposes, the access will be granted and the data can be downloaded in a typical one month time window. The remaining data are available within the Article, Supplementary Information or Source Data file. The source data underlying Fig. 1c, g, 2e, g, 3j, k, 4d, 5c, 6a, c and Supplementary Fig. 3d are provided in the Source Data file. Source data are provided with this paper.
Code availability
The computer code for marker selection, clustering, and differential expression is available at https://github.com/husimeng0717/scRNAseq-for-CTCL.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Xiangjun Liu, Shanzhao Jin, Simeng Hu.
Contributor Information
Fan Bai, Email: fbai@pku.edu.cn.
Yang Wang, Email: yangwang_dr@bjmu.edu.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-28799-3.
References
- 1.Arulogun SO, et al. Long-term outcomes of patients with advanced-stage cutaneous T-cell lymphoma and large cell transformation. Blood. 2008;112:3082–3087. doi: 10.1182/blood-2008-05-154609. [DOI] [PubMed] [Google Scholar]
- 2.Davis TH, Morton CC, Miller-Cassman R, Balk SP, Kadin ME. Hodgkin’s disease, lymphomatoid papulosis, and cutaneous T-cell lymphoma derived from a common T-cell clone. N. Engl. J. Med. 1992;326:1115–1122. doi: 10.1056/NEJM199204233261704. [DOI] [PubMed] [Google Scholar]
- 3.Fauconneau A, et al. Assessment of diagnostic criteria between primary cutaneous anaplastic large-cell lymphoma and CD30-rich transformed mycosis fungoides; a study of 66 cases. Br. J. Dermatol. 2015;172:1547–1554. doi: 10.1111/bjd.13690. [DOI] [PubMed] [Google Scholar]
- 4.Gao C, et al. The importance of differentiating between mycosis fungoides with CD30-positive large cell transformation and mycosis fungoides with coexistent primary cutaneous anaplastic large cell lymphoma. J. Am. Acad. Dermatol. 2021;84:185–187. doi: 10.1016/j.jaad.2020.04.073. [DOI] [PubMed] [Google Scholar]
- 5.Campbell JJ, Clark RA, Watanabe R, Kupper TS. Sezary syndrome and mycosis fungoides arise from distinct T-cell subsets: a biologic rationale for their distinct clinical behaviors. Blood. 2010;116:767–771. doi: 10.1182/blood-2009-11-251926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hamrouni A, Fogh H, Zak Z, Ødum N, Gniadecki R. Clonotypic diversity of the T-cell receptor corroborates the immature precursor origin of cutaneous T-cell lymphoma. Clin. Cancer Res. 2019;25:3104–3114. doi: 10.1158/1078-0432.CCR-18-4099. [DOI] [PubMed] [Google Scholar]
- 7.Iyer A, et al. Branched evolution and genomic intratumor heterogeneity in the pathogenesis of cutaneous T-cell lymphoma. Blood Adv. 2020;4:2489–2500. doi: 10.1182/bloodadvances.2020001441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Iyer A, et al. Skin colonization by circulating neoplastic clones in cutaneous T-cell lymphoma. Blood. 2019;134:1517–1527. doi: 10.1182/blood.2019002516. [DOI] [PubMed] [Google Scholar]
- 9.Willemze R, et al. WHO-EORTC classification for cutaneous lymphomas. Blood. 2005;105:3768–3785. doi: 10.1182/blood-2004-09-3502. [DOI] [PubMed] [Google Scholar]
- 10.Sugaya M, et al. Association of the numbers of CD163(+) cells in lesional skin and serum levels of soluble CD163 with disease progression of cutaneous T cell lymphoma. J. Dermatol Sci. 2012;68:45–51. doi: 10.1016/j.jdermsci.2012.07.007. [DOI] [PubMed] [Google Scholar]
- 11.Theurich S, et al. Targeting tumor-infiltrating B cells in cutaneous T-cell lymphoma. J. Clin. Oncol. 2016;34:e110–e116. doi: 10.1200/JCO.2013.50.9471. [DOI] [PubMed] [Google Scholar]
- 12.Jin S, et al. Single-cell transcriptomic analysis defines the interplay between tumor cells, viral infection, and the microenvironment in nasopharyngeal carcinoma. Cell Res. 2020;30:950–965. doi: 10.1038/s41422-020-00402-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ma L, et al. Tumor cell biodiversity drives microenvironmental reprogramming in liver cancer. Cancer Cell. 2019;36:418–430 e416. doi: 10.1016/j.ccell.2019.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schuldt NJ, Binstadt BA. Dual TCR T cells: identity crisis or multitaskers? J. Immunol. 2019;202:637–644. doi: 10.4049/jimmunol.1800904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Scala E, et al. Skewed expression of activation, differentiation and homing-related antigens in circulating cells from patients with cutaneous T cell lymphoma associated with CD7- T helper lymphocytes expansion. J. Invest Dermatol. 1999;113:622–627. doi: 10.1046/j.1523-1747.1999.00718.x. [DOI] [PubMed] [Google Scholar]
- 16.Guenova E, et al. TH2 cytokines from malignant cells suppress TH1 responses and enforce a global TH2 bias in leukemic cutaneous T-cell lymphoma. Clin. Cancer Res. 2013;19:3755–3763. doi: 10.1158/1078-0432.CCR-12-3488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wen Y. et al. IL-13 Signaling in CD30(+) Cutaneous Lymphoproliferative Disorders. J Invest Dermatol, (2020). [DOI] [PubMed]
- 18.Tan SY, et al. Type II EATL (epitheliotropic intestinal T-cell lymphoma): a neoplasm of intra-epithelial T-cells with predominant CD8αα phenotype. Leukemia. 2013;27:1688–1696. doi: 10.1038/leu.2013.41. [DOI] [PubMed] [Google Scholar]
- 19.Caballero A, et al. Silent T-cell receptor cutaneous T-cell lymphoma associated to a clonal plasma cell proliferation. Hematol. Rep. 2019;11:7841. doi: 10.4081/hr.2019.7841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Paillard F, Sterkers G, Vaquero C. Transcriptional and post-transcriptional regulation of TcR, CD4 and CD8 gene expression during activation of normal human T lymphocytes. Embo j. 1990;9:1867–1872. doi: 10.1002/j.1460-2075.1990.tb08312.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gallegos AM, et al. Control of T cell antigen reactivity via programmed TCR downregulation. Nat. Immunol. 2016;17:379–386. doi: 10.1038/ni.3386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Guimaraes JC, Zavolan M. Patterns of ribosomal protein expression specify normal and malignant human cells. Genome Biol. 2016;17:236. doi: 10.1186/s13059-016-1104-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Liu F. et al. PEG10 amplification at 7q21.3 potentiates large-cell transformation in cutaneous T-cell lymphoma. Blood, (2021). [DOI] [PMC free article] [PubMed]
- 24.Heyde A, Reiter JG, Naxerova K, Nowak MA. Consecutive seeding and transfer of genetic diversity in metastasis. Proc. Natl Acad. Sci. 2019;116:14129–14137. doi: 10.1073/pnas.1819408116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Horna P, Moscinski LC, Sokol L, Shao H. Naive/memory T-cell phenotypes in leukemic cutaneous T-cell lymphoma: Putative cell of origin overlaps disease classification. Cytom. B Clin. Cytom. 2019;96:234–241. doi: 10.1002/cyto.b.21738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Chang JT, Wherry EJ, Goldrath AW. Molecular regulation of effector and memory T cell differentiation. Nat. Immunol. 2014;15:1104–1115. doi: 10.1038/ni.3031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Gaydosik AM, et al. Single-cell lymphocyte heterogeneity in advanced cutaneous T-cell lymphoma skin tumors. Clin. Cancer Res. 2019;25:4443–4454. doi: 10.1158/1078-0432.CCR-19-0148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhang Y, et al. Molecular markers of early-stage mycosis fungoides. J. Invest Dermatol. 2012;132:1698–1706. doi: 10.1038/jid.2012.13. [DOI] [PubMed] [Google Scholar]
- 29.Vieyra-Garcia P., et al. Benign T cells drive clinical skin inflammation in cutaneous T cell lymphoma. JCI Insight4, e124233 (2019). [DOI] [PMC free article] [PubMed]
- 30.Hashikawa K, et al. Microarray analysis of gene expression by microdissected epidermis and dermis in mycosis fungoides and adult T-cell leukemia/lymphoma. Int J. Oncol. 2014;45:1200–1208. doi: 10.3892/ijo.2014.2524. [DOI] [PubMed] [Google Scholar]
- 31.Borcherding N, et al. Single-cell profiling of cutaneous T-cell lymphoma reveals underlying heterogeneity associated with disease progression. Clin. Cancer Res. 2019;25:2996–3005. doi: 10.1158/1078-0432.CCR-18-3309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.van Kester MS, et al. A meta-analysis of gene expression data identifies a molecular signature characteristic for tumor-stage mycosis fungoides. J. Investigative Dermatol. 2012;132:2050–2059. doi: 10.1038/jid.2012.117. [DOI] [PubMed] [Google Scholar]
- 33.Bagot M, et al. IPH4102, a first-in-class anti-KIR3DL2 monoclonal antibody, in patients with relapsed or refractory cutaneous T-cell lymphoma: an international, first-in-human, open-label, phase 1 trial. Lancet Oncol. 2019;20:1160–1170. doi: 10.1016/S1470-2045(19)30320-1. [DOI] [PubMed] [Google Scholar]
- 34.Battistella M, et al. KIR3DL2 expression in cutaneous T-cell lymphomas: expanding the spectrum for KIR3DL2 targeting. Blood. 2017;130:2900–2902. doi: 10.1182/blood-2017-06-792382. [DOI] [PubMed] [Google Scholar]
- 35.Storz M, et al. Coexpression of CD40 and CD40 ligand in cutaneous T-cell lymphoma (mycosis fungoides) Cancer Res. 2001;61:452–454. [PubMed] [Google Scholar]
- 36.Chiche J, et al. GAPDH expression predicts the response to R-CHOP, the tumor metabolic status, and the response of DLBCL patients to metabolic inhibitors. Cell Metab. 2019;29:1243–1257.e1210. doi: 10.1016/j.cmet.2019.02.002. [DOI] [PubMed] [Google Scholar]
- 37.Mondragon L, et al. GAPDH overexpression in the T cell lineage promotes angioimmunoblastic T cell lymphoma through an NF-kappaB-dependent mechanism. Cancer Cell. 2019;36:268–287 e210. doi: 10.1016/j.ccell.2019.07.008. [DOI] [PubMed] [Google Scholar]
- 38.Shin J, et al. Lesional gene expression profiling in cutaneous T-cell lymphoma reveals natural clusters associated with disease outcome. Blood. 2007;110:3015–3027. doi: 10.1182/blood-2006-12-061507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gao Y., et al. Differential SATB1 expression reveals heterogeneity of cutaneous T-cell lymphoma. J Invest Dermatol, (2020). [DOI] [PubMed]
- 40.Blank CU, et al. Defining ‘T cell exhaustion’. Nat. Rev. Immunol. 2019;19:665–674. doi: 10.1038/s41577-019-0221-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Binnewies M, et al. Understanding the tumor immune microenvironment (TIME) for effective therapy. Nat. Med. 2018;24:541–550. doi: 10.1038/s41591-018-0014-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Holderried TAW, et al. Molecular and immune correlates of TIM-3 (HAVCR2) and galectin 9 (LGALS9) mRNA expression and DNA methylation in melanoma. Clin. Epigenetics. 2019;11:161. doi: 10.1186/s13148-019-0752-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Berger CL, et al. The growth of cutaneous T-cell lymphoma is stimulated by immature dendritic cells. Blood. 2002;99:2929–2939. [PubMed] [Google Scholar]
- 44.Zimmer CL, et al. NK cells are activated and primed for skin-homing during acute dengue virus infection in humans. Nat. Commun. 2019;10:3897. doi: 10.1038/s41467-019-11878-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Weizman OE, et al. ILC1 Confer Early Host Protection at Initial Sites of Viral Infection. Cell. 2017;171:795–808 e712. doi: 10.1016/j.cell.2017.09.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Azizi E, et al. Single-Cell Map of Diverse Immune Phenotypes in the Breast Tumor Microenvironment. Cell. 2018;174:1293–1308 e1236. doi: 10.1016/j.cell.2018.05.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wu X, et al. Depletion of M2-like tumor-associated macrophages delays cutaneous T-cell lymphoma development in vivo. J. Invest Dermatol. 2014;134:2814–2822. doi: 10.1038/jid.2014.206. [DOI] [PubMed] [Google Scholar]
- 48.Assaf C, Hwang ST. Mac attack: macrophages as key drivers of cutaneous T-cell lymphoma pathogenesis. Exp. Dermatol. 2016;25:105–106. doi: 10.1111/exd.12894. [DOI] [PubMed] [Google Scholar]
- 49.Coussens LM, Zitvogel L, Palucka AK. Neutralizing tumor-promoting chronic inflammation: a magic bullet? Science. 2013;339:286–291. doi: 10.1126/science.1232227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Halama N, et al. Tumoral immune cell exploitation in colorectal cancer metastases can be targeted effectively by anti-CCR5 therapy in cancer patients. Cancer Cell. 2016;29:587–601. doi: 10.1016/j.ccell.2016.03.005. [DOI] [PubMed] [Google Scholar]
- 51.Zhou X, Su YX, Lao XM, Liang YJ, Liao GQ. CD19(+)IL-10(+) regulatory B cells affect survival of tongue squamous cell carcinoma patients and induce resting CD4(+) T cells to CD4(+)Foxp3(+) regulatory T cells. Oral. Oncol. 2016;53:27–35. doi: 10.1016/j.oraloncology.2015.11.003. [DOI] [PubMed] [Google Scholar]
- 52.Yuen GJ, Demissie E, Pillai S. B lymphocytes and cancer: a love-hate relationship. Trends Cancer. 2016;2:747–757. doi: 10.1016/j.trecan.2016.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Nielsen PR, et al. Role of B-cells in mycosis fungoides. Acta Derm. Venereol. 2021;101:adv00413. doi: 10.2340/00015555-3775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Joller N, et al. Treg cells expressing the coinhibitory molecule TIGIT selectively inhibit proinflammatory Th1 and Th17 cell responses. Immunity. 2014;40:569–581. doi: 10.1016/j.immuni.2014.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Huang CT, et al. Role of LAG-3 in regulatory T cells. Immunity. 2004;21:503–513. doi: 10.1016/j.immuni.2004.08.010. [DOI] [PubMed] [Google Scholar]
- 56.Aoki T, et al. Single-cell transcriptome analysis reveals disease-defining T-cell subsets in the tumor microenvironment of classic hodgkin lymphoma. Cancer Disco. 2020;10:406–421. doi: 10.1158/2159-8290.CD-19-0680. [DOI] [PubMed] [Google Scholar]
- 57.Wang L, et al. Genomic profiling of Sezary syndrome identifies alterations of key T cell signaling and differentiation genes. Nat. Genet. 2015;47:1426–1434. doi: 10.1038/ng.3444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Poglio S, et al. Xenograft and cell culture models of Sézary syndrome reveal cell of origin diversity and subclonal heterogeneity. Leukemia. 2021;35:1696–1709. doi: 10.1038/s41375-020-01068-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Park J, et al. Genomic analysis of 220 CTCLs identifies a novel recurrent gain-of-function alteration in RLTPR (p.Q575E) Blood. 2017;130:1430–1440. doi: 10.1182/blood-2017-02-768234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Kaddis N, Fisher D, Jacobsen ED. Cutaneous involvement of hematologic malignancies. Hematol. Oncol. Clin. North Am. 2019;33:163–172. doi: 10.1016/j.hoc.2018.08.005. [DOI] [PubMed] [Google Scholar]
- 61.Jawed SI, Myskowski PL, Horwitz S, Moskowitz A, Querfeld C. Primary cutaneous T-cell lymphoma (mycosis fungoides and Sezary syndrome): part I. Diagnosis: clinical and histopathologic features and new molecular and biologic markers. J. Am. Acad. Dermatol. 2014;70:205 e201–205 e216. doi: 10.1016/j.jaad.2013.07.049. [DOI] [PubMed] [Google Scholar]
- 62.Turajlic S, Swanton C. Metastasis as an evolutionary process. Science. 2016;352:169–175. doi: 10.1126/science.aaf2784. [DOI] [PubMed] [Google Scholar]
- 63.Clark RA, et al. Skin effector memory T cells do not recirculate and provide immune protection in alemtuzumab-treated CTCL patients. Sci. Transl. Med. 2012;4:117ra117. doi: 10.1126/scitranslmed.3003008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Watanabe R, et al. Human skin is protected by four functionally and phenotypically discrete populations of resident and recirculating memory T cells. Sci. Transl. Med. 2015;7:279ra239. doi: 10.1126/scitranslmed.3010302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Klicznik M. M. et al. Human CD4(+)CD103(+) cutaneous resident memory T cells are found in the circulation of healthy individuals. Sci Immunol4, eaav8995 (2019). [DOI] [PMC free article] [PubMed]
- 66.Piconese S, Campello S, Natalini A. Recirculation and residency of T cells and Tregs: lessons learnt in anacapri. Front Immunol. 2020;11:682. doi: 10.3389/fimmu.2020.00682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Rindler K, et al. Single-cell RNA sequencing reveals tissue compartment-specific plasticity of mycosis fungoides tumor cells. Front. Immunol. 2021;12:666935. doi: 10.3389/fimmu.2021.666935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Kirsch IR, et al. TCR sequencing facilitates diagnosis and identifies mature T cells as the cell of origin in CTCL. Sci. Transl. Med. 2015;7:308ra158. doi: 10.1126/scitranslmed.aaa9122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Ortonne N, et al. CD158k/KIR3DL2 and NKp46 are frequently expressed in transformed mycosis fungoides. Exp. Dermatol. 2012;21:461–463. doi: 10.1111/j.1600-0625.2012.01489.x. [DOI] [PubMed] [Google Scholar]
- 70.Kummer JA, Vermeer MH, Dukers D, Meijer CJ, Willemze R. Most primary cutaneous CD30-positive lymphoproliferative disorders have a CD4-positive cytotoxic T-cell phenotype. J. Invest Dermatol. 1997;109:636–640. doi: 10.1111/1523-1747.ep12337633. [DOI] [PubMed] [Google Scholar]
- 71.Vermeer MH, Geelen FAMJ, Kummer JA, Meijer CJLM, Willemze R. Expression of cytotoxic proteins by neoplastic T cells in mycosis fungoides increases with progression from plaque stage to tumor stage disease. Am. J. Pathol. 1999;154:1203–1210. doi: 10.1016/S0002-9440(10)65372-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Hoeks C. et al. Treg-resistant cytotoxic CD4(+) T cells dictate T helper cells in their vicinity: TH17 skewing and modulation of proliferation. Int J Mol Sci22, 5660 (2021). [DOI] [PMC free article] [PubMed]
- 73.Huang Y, Su MW, Jiang X, Zhou Y. Evidence of an oncogenic role of aberrant TOX activation in cutaneous T-cell lymphoma. Blood. 2015;125:1435–1443. doi: 10.1182/blood-2014-05-571778. [DOI] [PubMed] [Google Scholar]
- 74.Litvinov IV, et al. Ectopic expression of cancer-testis antigens in cutaneous T-cell lymphoma patients. Clin. Cancer Res. 2014;20:3799–3808. doi: 10.1158/1078-0432.CCR-14-0307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Tanita K, et al. Bexarotene reduces production of CCL22 from tumor-associated macrophages in cutaneous T-cell lymphoma. Front Oncol. 2019;9:907. doi: 10.3389/fonc.2019.00907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Muller E, et al. Both type I and type II interferons can activate antitumor M1 macrophages when combined With TLR stimulation. Front Immunol. 2018;9:2520. doi: 10.3389/fimmu.2018.02520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Haque A, Engel J, Teichmann SA, Lonnberg T. A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications. Genome Med. 2017;9:75. doi: 10.1186/s13073-017-0467-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Dong R, et al. Single-cell characterization of malignant phenotypes and developmental trajectories of adrenal neuroblastoma. Cancer Cell. 2020;38:716–733.e716. doi: 10.1016/j.ccell.2020.08.014. [DOI] [PubMed] [Google Scholar]
- 79.Zhang Q, et al. Landscape and dynamics of single immune cells in hepatocellular carcinoma. Cell. 2019;179:829–845 e820. doi: 10.1016/j.cell.2019.10.003. [DOI] [PubMed] [Google Scholar]
- 80.Prasad A, et al. Identification of gene mutations and fusion genes in patients with sezary syndrome. J. Invest Dermatol. 2016;136:1490–1499. doi: 10.1016/j.jid.2016.03.024. [DOI] [PubMed] [Google Scholar]
- 81.Favero F, et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 2015;26:64–70. doi: 10.1093/annonc/mdu479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Izar B, et al. A single-cell landscape of high-grade serous ovarian cancer. Nat. Med. 2020;26:1271–1279. doi: 10.1038/s41591-020-0926-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Efremova M, Vento-Tormo M, Teichmann SA, Vento-Tormo R. CellPhoneDB: inferring cell-cell communication from combined expression of multi-subunit ligand-receptor complexes. Nat. Protoc. 2020;15:1484–1506. doi: 10.1038/s41596-020-0292-x. [DOI] [PubMed] [Google Scholar]
- 84.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018;35:1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Sun J, et al. SATB1 defines a subtype of cutaneous CD30+ lymphoproliferative disorders associated with a T-helper 17 cytokine profile. J. Investigative Dermatol. 2018;138:1795–1804. doi: 10.1016/j.jid.2018.02.028. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are available in publicly accessible repositories. The raw data from the scRNA-seq, TCR-seq, bulk exome sequencing and some bulk RNA-seq data have been deposited in the Genome Sequence Archive for human (GSA-Human) under accession number HRA000166. The remaining raw bulk RNA-seq data have been deposited in the Gene Expression Omnibus database under accession numbers GSE168508 and GSE10962023,87. To comply with the “Guidance of the Ministry of Science and Technology (MOST) for the Review and Approval of Human Genetic Resources”, we are required to deposit the genomic data of Chinese patients under a controlled access at the GSA in Beijing Institute of Genomics Data Center. To gain access to the raw data under accession number HRA000166, please submit requests to the GSA-Human online page for this study [https://ngdc.cncb.ac.cn]. For scientific research purposes, the access will be granted and the data can be downloaded in a typical one month time window. The remaining data are available within the Article, Supplementary Information or Source Data file. The source data underlying Fig. 1c, g, 2e, g, 3j, k, 4d, 5c, 6a, c and Supplementary Fig. 3d are provided in the Source Data file. Source data are provided with this paper.
The computer code for marker selection, clustering, and differential expression is available at https://github.com/husimeng0717/scRNAseq-for-CTCL.