Abstract
Pancreatic ductal adenocarcinoma (PDAC) is the most frequent and aggressive pancreatic tumor characterized by high metastatic risk and special tumor microenvironment. To comprehensively delineate the complex intra-tumoral heterogeneity and the underlying mechanism during metastatic lesions malignant progression, single-cell RNA sequencing (scRNA-seq) was employed. PCA and TSNE were used for dimensionality reduction analysis and cell clustering. Find All Markers function was used to calculate differential genes in each cluster, and Do Heatmap function was used to plot the distribution of differential genes in each cluster. GSVA was employed to assign pathway activity estimates to individual cells. Lineage trajectory progression was inferred by monocle. CNV status was inferred to compare the heterogeneity among patients and subtypes by infercnv. Ligand-receptor interactions were identified by CellPhoneDB, and regulons network of cells was analyzed by SCENIC. Through RNA-sequencing of 6236 individual cells from 5 liver metastatic PDAC lesions, 10 major cell clusters are identified by using unbiased clustering analysis of expression profiling and well-known cell markers. Cells with high CNV level were considered as malignant cells and pathway analyses were carried out to highlight intratumor heterogeneity in PDAC. Pseudotime trajectory analysis revealed that components of multiple tumor-related pathways and transcription factors (TFs) were differentially expressed along PDAC progression. The complex cellular communication suggested potential immunotherapeutic targets in PDAC. Regulon network identified multiple candidates for promising cell-specific transcriptional factors. Finally, metastatic-related genes expression levels and signaling pathways were validated in bulk RNA Sequencing data. This study contributed a comprehensive single-cell transcriptome atlas and contributed into novel insight of intratumor heterogeneity and molecular mechanism in metastatic PDAC.
Subject terms: Pancreatic cancer, Tumour heterogeneity
Introduction
Pancreatic ductal adenocarcinoma (PDAC) is one of the most malignant tumors with a 5-year survival rate of 8% according to annual cancer statistical reports [1, 2]. This plight is mainly attributed to the lack of reliable markers for early diagnosis, low radical excision rate, and chemoradiotherapy resistance of PDAC, and only 15–20% of newly diagnosed patients are eligible for surgical resection [3, 4].
The intrinsic transcriptomic characteristics and dynamic immunological heterogeneity significantly contributed to the tumor progression and sensitivity toward treatment. Mounting genomic research have revealed crucial gene mutations, such as KRAS driver mutation and frequent inactivation of TP53 tumor suppressors, were the main drivers of tumorigenicity of PDAC [5]. Mutations of these genes participated in specific biological processes, including Wnt/TGF-β pathways, DNA repair processes, and chromatin remodeling mechanisms [6]. It was well established that PDAC is one of the malignant cancers characterized by diverse immune microenvironment comprised of abundant subpopulations of immune cells. A primary reason for the high mortality rate of PDAC is the abundant extracellular matrixes, which challenged against drugs delivery leading to the therapeutic resistance [7]. Thus, it is highly desirable to reveal the mechanisms underlying the tumorigenesis and metastasis of PDAC, and to determine robust and promising molecular targets for individualised therapeutic strategy.
In our work, singe cell RNA-sequencing analysis was employed to present a comprehensive landscape of the transcriptomic profiles of 6236 qualified single cells from 5 liver metastatic PDAC lesions, further dissecting intra-tumoral heterogeneity and identify crucial factors during PDAC metastatic progression. A total of ten major cell subpopulations were determined and the molecular properties of malignant cells were characterized. In addition, PDAC metastatic lesion constituted of diverse malignant cell subtypes was characterized by highly heterogeneous. We further identified a list of novel gene expression changes that affected several known cancerous pathways. With trajectory analysis of tumor progression, the distant dissemination function becomes increased. The comprehensive atlas of the multicellular ecosystem of TME were delineated and potential transcriptional regulators network underlying tumor progression was established. In addition, metastasis-related genes expression and signaling pathways were further confirmed in bulk RNA Sequencing data. Thus, this single-cell study will contribute novel insight into cellular landscape for deciphering intra-tumoral heterogeneity and understanding the molecular mechanism in tumor metastasis, with valuable significance for therapeutic management of PDAC.
Results
Single-cell transcriptome atlas in PDAC metastatic lesions
To investigate the cellular diversity in PDAC metastatic lesions, scRNA-seq analysis was implemented on metastatic tumor samples from 5 PDAC patients. After carrying out the initial quality control assessment described previously (Supplementary Fig. 1A and B), a total of 6,236 cells were profiled for subsequent analysis, of which single-cell transcriptomes were obtained. Following gene expression normalization, principal component analysis was performed on variably expressed genes (Supplementary Fig. 1C) across all cell samples. By using unbiased clustering based on t-distributed stochastic neighbor embedding (t-SNE) analyses, 10 main segregated cell clusters were identified in parallel (Supplementary Fig. 2A, Supplementary Table 1). Next, representative genes were developed by employing differential gene expression analysis to identify of each cell cluster according to expression profiles and well-known cell markers (Fig. 1A). And the clustering distribution of single cell from five patients is presented in Supplementary Fig. 2B and 2C. The top 5 differentially expressed genes (DEGs) for the cellular subclusters were presented in the heatmap (Fig. 1B), with more details extended in the following sections.
In particular, these cell lineages were as follows: (1) the Cancer stem cells characterized with high ALDH1A1 [8] and NES [9] expression; (2) the Ductal cells specifically expressing cell markers MMP7, TSPAN8, SOX9, and LCN2; [10, 11] (3) the Monocyte highly expressing BASP1 [12] and TNF; [13] (4) the T cells with high expression of CD3D [14] and CD27; [15] (5) the Macrophage specifically express the markers HLA − DRB5 and NUPR1; [16] (6) the NK cells expressing KLRC1 [17] and TMIGD2 [12]. The expression patterns of the cluster-specific marker genes in the cellular populations were demonstrated (Fig. 1C, Supplementary Fig. 3A). Notably, five distinct types of ductal cells were identified. Both types of ductal cells showed a high level of ductal markers: MMP7, TSPAN8, SOX9, and LCN2 [10, 11]. To characterize these ductal clusters, the expression profiles of subtype-specific marker genes were examined respectively (Fig. 1D). Results showed that CAPN12 and CA12 were highly expressed in type 1 ductal cells compared with other subtypes, suggesting suitable markers of CAPN12 and CAP12 for type 1 ductal cells. Type 2 ductal cells exhibited the highest expression levels of HMMR and TPX2, indicating that HMMR and TPX2 may identify these cells. SCGB2A1 and IGFBP3 were uniquely expressed in type 3 ductal cells, prompting reliable markers of SCGB2A1 and IGFBP3 for type 3 ductal cells. CTGF and HSPA6 were specifically expressed in type 4 ductal cells, thus, they could be used as specific markers for the detection of these cells. Type 5 ductal cells experienced unique expression of REG4 and CEACAM5, supporting specific markers of REG4 and CEACAM5 for type 5 ductal cells. Moreover, all ductal cells exhibited significant higher expression of reported poor prognosis and distant metastasis PDAC markers, such as CEACAM1/5/6 [18] and KRT19 [19], suggesting that these ductal subtypes might be malignant ductal cells (Supplementary Fig. 3B and C).
The dot plots illustrated the proportion of cells expressing representative markers and their scaled relative expression values in different cell clusters (Fig. 1E). The proportions of the cellular subpopulations diverse well among the tumor lesions (Fig. 1F), suggesting the intertumoral heterogeneity as well as the consistency among the lesions.
Copy-number alterations in PDACs
To distinguish malignant cells, we calculated and identified large-scale chromosomal copy number variation (CNV) by inferCNV for each sample based on transcriptomes [20]. An inferCNV clustered heatmap was created per sample, which corresponds to the normalized expression values of immune cells plotted in the top panel and ductal cells in the bottom panel. In the resultant CNA heatmap, the regions of gain are depicted in red and regions of loss in blue. Most non-stromal cells presented relatively low CNV levels except for monocyte and T cells, which showed moderate CNV levels. We discovered that these five ductal cell subtypes experienced remarkable CNV events compared with cells in control samples (Fig. 2A–E). Notably, type 1 ductal cells presented significantly higher CNV levels than other types of ductal cells (Fig. 2F). These results suggested that five subtypes of ductal cells are all malignant cells in metastatic lesions.
Transcriptional heterogeneity of malignant stromal cells
Through the t-SNE analysis on malignant stromal cells, six cellular clusters in total, of which five belonged to ductal lineage and 1 was of the cancer stem lineage (Fig. 3A). The distribution of each cluster of single cells from five different patients was shown (Fig. 3B). The expression profiling in distinct subclusters of the malignant ductal and cancer stem cells were presented (Fig. 3C). As demonstrated in the top DEGs, specifically cancer stem lineage clusters expressed high levels of stemness feature genes including ALDH1A1, PROM1, and NES, while ductal lineage cluster expressed high levels of ductal markers such as MMP7, TSPAN8, and SOX9 (Fig. 3D).
To further reveal the biological roles of each cellular cluster in tumorigenicity and progression, competitive gene set variation analysis (GSVA) was performed (Fig. 3E, F). Type 1 ductal cells showed heightened activities of heterologous metabolism and bile acid metabolism processes and adipocytokine signaling pathway, indicating that type 1 ductal cells were identified as cells of relatively low malignancy. Most genes with high expression levels in type 2 ductal cells were enriched in cell cycle and DNA replication, suggesting that type 2 ductal cells were associated to tumor proliferation and growth. Upregulated genes expressed in type 3 ductal cells were mainly mediated in tumor migration-related processes like Wnt/β-catenin and KRAS signaling pathways, indicating that type 3 ductal cells were identified as cells of relatively high malignancy. Type 4 ductal cells experienced high enrichment involved in tumor progressions, such as p53, KRAS, and apoptosis-related signaling pathways, highlighting that type 3 ductal cells were also malignant cells. Notably, type 5 ductal cells mainly enriched with IL6/JAK/STAT3, PI3K/AKT/MTOR and TGF-β signaling pathways and Epithelial-Mesenchymal Transformation, supporting that type 5 ductal cells were the major factor of malignant cells in metastatic lesions.
Gene expression patterns in ductal cells during PDAC progression
There is not a consensus yet regarding which pancreatic cell type being responsible for the point of origin of tumor cells. To further elucidate the origins of neoplastic cells in the carcinogenesis of PDAC, pseudotime trajectory analysis was performed using cancer stem cells and five types of ductal cells. R package monocle was then used to sort individual cells by these genes and construct the tree-like structure of the entire lineage differentiation trajectory (Fig. 4A). Along with trajectory progression, cells experienced three states: starting point of branching (pre-branch) and the two other branches (Fate1 and Fate2; Fig. 4B). Cancer stem cells and type 3 ductal cells were mainly shown in the beginning of the trajectory. Type 1 and 4 ductal cells predominately appeared at the end of the trajectory branch 1 and type 5 ductal cells were mostly at the end of the trajectory branch 2 (Fig. 4C, D). Notably, type 2 ductal cells were appeared at the ends of different branches, suggesting transitional form of type 2 ductal cells for tumor progression. Our findings indicated the possibility that both cancer stem cells and type 3 ductal cells could be the origin of malignant cells and may transform to two distinct types of neoplastic ductal cells during the PDAC progression. Along with the trajectory, epithelial cell marker EPCAM and ductal marker KRT19 exhibited sustained high expression levels during PDAC progression, while genes previously reported to be involved in tumor progression, such as MUC1 and CEACAM6, gradually upregulated along with the transition (Fig. 4E).
Moreover, the gene expression patterns mediated in the cell state transition along with progression trajectory were dissected and 7603 dynamical genes with significant expression changes were determined (Supplementary Table 2). Particularly, the alteration of multiple crucial drivers participated in the PDACs cacogenesis (Fig. 4F), including several regulators potentially mediated in progression of PDAC, such as VEGF/VEGFR, HIF2, and P53 signaling pathway. Meanwhile, the transcriptional factors involved in immune cell differentiation and proliferation, such as MEF2C, HMGB1, HMGB2, CREM, LITAF, ID1, and ID3, etc. (Fig. 4G), were remarkably dysregulated with the trajectory transition process.
Subsequently, these genes were further subclustered into initial expression patterns and terminal patterns with specific expression patterns (Fig. 5A). In addition, the potential key genes associated with distant metastasis of PDAC, such as MMP7, TSPAN8, MSLN, LAMC2, KLK6, and LY6D (Fig. 5B), were discovered to significantly change along with progression trajectory. Next, branched heatmap was employed to present the gene pattern of distinct cell fate branches, genes are assigned into two different clusters based on the expression dynamics (Fig. 5C; Supplementary Table 3). Genes most significantly diverted between two branches were recognized: Alpha-enolase (ENO1), Interferon-α inducible protein 6 (IFI6), Interferon-stimulated gene product 15 (ISG15), receptor activity modifying protein 1 (RAMP1), Retinoblastoma binding protein-1 (RBP1), and specific glucose transporter GLUT1 (SLC2A1; Fig. 5D).
Complex cell–cell communication networks in the TME
To characterize the tumor microenvironment of PDAC metastatic lesion, CellPhoneDB was applied to detected the intercellular communication among malignant cell types and immune cells. In the output of the results of the ligand receptor, the heatmap plot function is used to analyze the interaction between immune cells. The results showed that macrophages and monocytes were significantly active and interacted with a variety of cells (Supplementary Fig. 4A and B). Broadcast ligand–receptor pairs were identified demonstrated extensive molecular interactions among the major cell types (Fig. 6A, B).
Previous results suggested that type 5 ductal cells may be the main source of malignant cells, thus, the ligand-receptor complex among type 5 ductal cells and other major cell types were investigated. Notably, inhibitory receptor–ligand pairs between type 5 ductal cells and immune cells were widely identified. In addition to CD47-SIRPG and CD47-SIRB1 complex, other cell–cell interactions that have rarely been reported in PDAC (e.g., PVR-TIGIT, PDCD1-FAM3C, and LGALS9-CD47) were also observed (Fig. 6C). Interestingly, costimulatory interactions between type 5 ductal cells and immune cells were also widely discovered (Fig. 6D), from which specific ligand–receptor complexes (e.g., TNFRSF1A-GRN, MIF-TNFRSF14 and LGALS9-CD44) were identified. Of note, macrophage expressed relatively high levels of SPP1, and the ligands of SPP1 (CD44) were overexpressed by type 5 ductal cells, indicating the presence of functional interactions between macrophage and type 5 ductal cells. Macrophage participated in the pancreatic cancer metastasis, which was demonstrated by previous reports [21]. In addition, chemokines were relatively higher expressed in immune cells (e.g., CCL4, and CXCL2), while the type 5 ductal cells commonly expressed high levels of the corresponding receptors, indicating that these chemokines served as critical players in regulating diverse types of immune cells infiltration in PDAC tumor microenvironment (Fig. 6E). The previously reported T helper (Th) 1 immune response could be used as a biological marker for immunotherapy in metastatic lymph nodes of PDAC [22]. Specifically, the interactions of Th1-mediated responses between immune cells and type 5 ductal cells (Fig. 6F) suggested that LTBR and its ligands may be the crucial drivers in aggressiveness of PDAC. Moreover, the putative ligands for which cognate receptors of Th17-mediated responses were also identified, such as the intercellular communications produced by CCR6-CCL20 (Fig. 6G). corresponding to the recent study supporting pathogenic TH17 responses are be demonstrated to be responsible for neoantigen-induced tumor progression in PDAC [7].
Mapping malignancy-specific regulon networks by SCENIC
In addition, single-cell regulatory network inference and clustering (SCENIC) was employed to further explore the difference of potential regulons (i.e., transcriptional factors (TFs) and their target genes) activity of between type 5 ductal cell-specific and other cellular clusters.
Based on 337 regulons activity with 35,684 filtered genes with default filter parameters (Supplementary Fig. 5A), the regulon activity could also distinguish subclones of the different ductal subtypes (Fig. 7A). Accordingly, type 5 ductal cell-specific regulon activity was binarized and matched with cellular clusters (Fig. 7B). Next, TFAP2A, GATA6, ZBTB7A, and HOXB9 were identified as candidate TFs with specifically regulated expressions in type 5 ductal cell clusters, in addition, corresponding motifs were listed (Fig. 7C). The expression patterns of the representative TFs in the cellular populations were demonstrated (Fig. 7D and Supplementary Fig. 5B–E).
Validation of metastatic-related hub genes and signaling pathways in bulk RNA-sequencing data
To validate the metastasis-associated genes and signaling pathways discovered in the aforementioned results, PDAC cohort (ICGC-PACA-AC) were used in the subsequent analyses. Firstly, expression distribution of metastasis-related genes in primary and metastatic tumor samples was plotted (Fig. 8A). Then, expression levels of metastasis-related genes were compared between primary tumors and metastatic lesions. As shown in Fig. 8B, most metastasis-related genes experienced significantly dysregulation in metastatic tumors, consistent with previous findings. The HPA database was used to explore protein expression levels in PDAC samples. The results showed that relative to normal samples, proteins (LAMC2, HMGA1, CSTB) were significantly upregulated in tumor tissues (Fig. 9A–F). In addition, their mRNA expression levels were also investigated in pancreatic cancer cell lines (Fig. 9G, H and I). Our results indicated that LAMC2, HMGA1, and CSTB showed significantly higher trends in CFPAC-1 cell lines.
Moreover, representative hub genes expression levels were significantly elevated in metastatic lesions than primary samples (Fig. 9J). Moreover, GSVA analysis between primary and metastatic tumors was performed (Fig. 9K). The results of GSVA showed that enrichment pathways of metastatic lesions prominently associated with tumorigenesis-relevant processes, including VEGF/VEGFR, HIF2, E2F, FOXM1, and P53 regulation pathways, which was in accordance with the previous results (Fig. 4F).
Discussion
PDAC was considered as malignant tumor with high intra-tumoral heterogeneity and aggressive features which brings great challenges to both early precision diagnosis and effective personalized treatment. Therefore, there was an urgent need to reveal the underlying molecular mechanism of intra-tumoral heterogeneity and tumor metastasis characteristics with high cellular resolution in the TME of metastatic PDAC.
Herein, 10 predominant populations of cells with t-SNE clustering were identified in the metastatic PDAC biopsies that included 6 stromal cellular types and 4 immune cell clusters. Their molecular and cellular features were also characterized with regard to their role in the tumorigenesis of PDAC. Notably, five types of ductal cells with distinct transcriptomic patterns, were recognized. According to subsequent CNV level analysis, all five types ductal cells presented in metastatic lesions were demonstrated to be malignant cells and type 1 ductal cells experienced the highest CNV scores. GSVA on each subgroup of malignant cells demonstrated the heterogeneity of underlying biological mechanism in distinct types of cells. Interestingly, the unique functions of type 5 ductal cells were significantly related to Epithelial-Mesenchymal Transformation (EMT), which contributed into the capability to invade surrounding tissues and travel through the peripheral circulation. Since EMT+ cancer cell subpopulations were correlated with poorer patient prognosis and more aggressive disease [23], our findings suggested pivotal roles of type 5 ductal cells for distant metastasis in PDAC.
In addition, the activation EMT- and TGF-β-related pathways participated in the formation of peritumoral stroma [24]. Targetable TGF-β molecular blockaders have been demonstrated to reprogram the contexture of TME and reshape the anti-cancer immunology [25, 26]. Thus, it recommended that the type 5 ductal cellular populations might be suitable for the administration of TGF-β inhibitors.
It is well-established that cancer stem cells as the main drivers of tumorigenicity were the origins of neoplastic cells in PDAC [27]. However, mounting studies demonstrated the origin of malignant cells derived from ductal cells [28, 29]. Herein, pseudotime trajectory analysis using Monocle also indicated that tumor cells may derive from both cancer stem cells and type 3 ductal cells transdifferentiation.
Notably, various canonical oncogenic pathways, such as VEGF/VEGFR [30], HIF2 [31], E2F [32], FOXM1 [33], and P53 regulation pathways [34], were activated during trajectory progression. Moreover, these results were validated between primary and metastatic cancer samples in bulk RNA sequencing data. Our findings offer novel insight to support that these signaling pathways could potentially serve as underlying molecular mechanism of metastasis in PDAC. Nonetheless, our findings should be confirmed in future studies. In addition, LAMC2, HMGA1 and CSTB were discovered to be upregulated significantly along with trajectory transition and elevated in the metastatic tumor lesions, which was consistent with published researches focusing on biological roles of LAMC2/ HMGA1 [35, 36]. However, it was little to known the CSTB-mediated underlying mechanism in invasiveness of PDAC and CSTB-targeting precision interventions might represent promising therapeutic strategies, which required further investigation in future studies. Notably, the expression level of MET was increased during tumor progression. This finding is consistent with metastasis theory in which EMT-tumor cells regained its epithelial ability to form metastatic sites, suggesting the metastatic lesions were originated from cancer cells that have undergone EMT [37]. We also discovered some unidentified regulators such as TM9SF3, LYZ, and ARL4C, which are potentially engaged in the cellular transition from the cancer stem cells into metastatic malignant cells. The above results indicate that both cancer stem cell as well as type 3 ductal cells might be the origin of pancreatic neoplastic cells lineages, which facilitated enhancing our understanding of PDAC carcinogenesis.
Moreover, the intimate cell–cell communication among type5 ductal cell and major cell types were identified in the TME. High expression of inhibitory receptor–ligand complex, like CD47-SIRPG, may be crucial role of immunotherapeutic resistance in PDAC. Moreover, upregulated activity of chemokine, such as CCL4, and CXCL2, between tumor cancer and immune might be the hub driver of recruitment of infiltrating immune cells. Macrophage with high expression of costimulatory ligands exhibited the most complex interaction with other cell types, indicating potential anti-cancer effect of these macrophages in PDAC.
Meanwhile, a network of regulons was established using the SCENIC analysis, that can be identified as potential candidates for type 5 ductal cell-specific TF programs. By employing this algorithm, several specific TFs in type 5 ductal cells were determined, such as TFAP2A, GATA6, HOXB9, and ZBTB7A. Overexpression of TFAP2C was demonstrated to resensitize tumor cells to gemcitabine in PDAC [38]. The biological function of GATA6 in PDAC remains controversial yet. A previous study reported that GATA6 inhibits the epithelial-mesenchymal transition (EMT) in vitro, however, high GATA6 levels are related with better prognosis and are found in well-differentiated tumors [39]. Downregulation of HOXB9 mediated by TGF-β1-induced Kindlin-2 expression was correlated with PDAC progression [40]. However, the potential role of ZBTB7A is still elusive in PDAC, which needs further exploration.
Conclusions
In our work, the intratumoral heterogeneity of PDAC cells were demonstrated with corresponding molecular characteristics and transcriptome features identified. In addition, the cellular lineage transition of malignant cells from original cells in metastatic lesions. Moreover, complex interactions of major cell types were deciphered in the TIME of PDAC. In addition, regulons networks were established to determine the candidates of cell type-specific transcriptional factors (TFs) and their target genes. Moreover, metastatic-related genes expression and signaling pathways activity were further confirmed in bulk RNA Sequencing data. In conclusion, this study lays a new foundation for the identification of therapeutic targets to enhance precision therapeutic strategy in metastatic PDAC.
Materials and methods
Sources of datasets
The single-cell RNA sequencing information of GSE154778 were obtained from Gene Expression Omnibus (GEO) database, which contained 6,236 cell samples from 5 liver metastatic PDAC biopsies. The scRNA-seq gene-barcode matrix, features data and UMI count tables of barcodes has been described by Lin et al. [41]. The RNA sequencing profile of the patients from ICGC-PACA-CA dataset, which contains 195 primary tumor samples and 13 metastatic cancer lesions, was obtained from ICGC portal (https://dcc.icgc.org/). There was no necessity to obtain Ethics Committee approval, owing to all information were publicly available and open-access. The Human Protein Atlas (http://www.proteinatlas.org) was used to investigate the protein levels of metastatic-related genes.
Quality control and the dimensionality reduction
The Seurat object with gene expression data was imported into the Seurat (v2.3.0) R toolkit with the Read10× () function [42]. Gene-cell matrixes were filtered to exclude cells (<500 transcripts/cell, >5% mitochondrial genes) and genes (<1000 cells/gene and >200,00 cells/gene). For each sample, the gene expression was represented as the fraction of the gene and multiplied by 10,000, which were converted into natural logarithm and normalized after adding 1 to avoid taking the log of 0. The top 1000 highly variable genes (HVGs) from the normalized expression matrix were generated to perform the principal component analysis (PCA) based on these HVGs. Significant principal components were determined using Jackstraw analysis and visualization of heatmaps focusing on PCs 1 to 40. PCs 1 to 13 were employed for graph-based clustering (at res = 0.5) to identify distinct groups of cells. The following Seurat functions (FindNeighbors and FindClusters) were used to calculate the dimension-reduction coordinates. Finally, single-cell clustering was visualized by t-SNE (t-Distributed Stochastic Neighbor Embedding).
Cell-clustering and annotation
The cluster-specific marker genes were identified by running the FindAllMarkers function in the Seurat package using the default non-parametric Wilcoxon rank sum test with Bonferroni correction. To identify differentially expressed genes between two clusters, we used the ‘find.markers’ function. The cell groups were annotated based on the DEGs and the well-known cellular markers from the literature. Detailed information of the cell markers was recorded in Supplementary Table 1.
Single-cell copy-number variation (CNV) evaluation
The CNV evaluation of each cell was conducted by infercnv R package (version 1.4.0; https://github.com/broadinstitute/inferCNV/wiki) [20]. The CNVs of Epithelial cells were calculated and the immune cells were applied as the reference. The inferCNV analysis was performed with parameters including “denoise”, default hidden Markov model (HMM) settings, and a value of 0.1 for “cutoff”. To reduce the false-positive CNV calls, the default Bayesian latent mixture model was implemented to identify the posterior probabilities of the CNV alterations in each cell with the default value of 0.5 as the threshold.
DEGs identification and GSVA
We identified the differentially over-expressed genes in the specific cluster when compared to other remaining clusters with the Wilcoxon Rank-Sum Test with the FindMarkers function in Seurat (adjusted P-value < 0.05, only.pos = TRUE and logfc.threshold = 0.25). Predominantly, pathway analyses were carried out to evaluate activation of hallmark pathways and metabolic pathways, which were described in the MSigDB databases (https://www.gsea-msigdb.org/gsea/msigdb) [43]. Then, we applied GSVA [44] in the GSVA package (version 1.36.3) to assign pathway activity estimates to assess the relative pathway activities in the Cancer stem cells and epithelial cells.
Constructing single-cell trajectories in PDAC
The R package Monocle2 (v2.16.0) was applied to conduct single-cell trajectory analysis with the assumption that one-dimensional ‘time’ can describe the high-dimensional expression values to discover the cell-state transitions [45]. The clusters identified as Cancer stem cell and Epithelial cells were loaded into R environment. The newCellDataSet function was applied to create an object with the parameter expressionFamily = negbinomial.size. In the trajectory analysis, we used genes meeting the thresholds that mean_expression ≥ 0.1 and dispersion_empirical ≥ 1 * dispersion_fit identified by Monocle2 to sort cells in pseudo-time order. The reduceDimension() function using the parameters reduction_method = “DDRTree” and max_components = 2 was applied to reduce dimensions and the visualization functions ‘plot_cell_trajectory’ were used to plot the minimum spanning tree on cells. Genes that changed along with the pseudotime were calculated (q-val < 10 − 5) by the “differentialGeneTest” function and visualized with the plot_pseudotime_heatmap and the genes were clustered into subgroups according to the gene expression patterns. Detailed information of the cell markers was recorded in Supplementary Table 2. To identify the genes that separate cells into branches, the branch expression analysis modeling (BEAM) analysis were performed and genes resulting from the BEAM analysis with a q-value < 10 − 10 were separated into groups and visualized with the plot_genes_branched_heatmap() function. Detailed information of the cell markers was recorded in Supplementary Table 3.
Cell–cell communication analysis
To investigate potential interactions across different cell types in the TME, cell–cell communication analysis was performed using CellPhoneDB, which is a publicly available repository of curated receptors and ligands and their interactions [46]. CellPhoneDB analysis was performed using the CellPhoneDB Python package (2.1.7). Single-cell transcriptomic data of cells annotated as Cancer stem cells, ductal cells type 1-5, macrophages, T cells, NK cells and monocyte were input into CellPhoneDB for cell–cell interaction analysis. Enriched receptor–ligand interactions between two cell types were derived based on the expression of a receptor by one cell type and the expression of the corresponding ligand by another cell type. Then, we identified the most relevant cell type-specific interactions between ligands and receptors, and only receptors and ligands expressed in more than 10% of the cells in the corresponding subclusters were considered.
Pairwise comparisons were performed between the included cell types. We first randomly permuted the cluster labels of all cells 1000 times to determine the mean of the average receptor and ligand expression levels of the interacting clusters. This generated a null distribution for each receptor–ligand pair. By calculating the proportion of the means that were higher than the actual mean, a P value for the likelihood of the cell-type specificity of the corresponding receptor–ligand complex was obtained. We then selected interactions that were biologically relevant.
The regulon activity of TFs with SCENIC
The SCENIC algorithm had been developed to assess the regulatory network analysis regard to TFs and discover regulons (that is, TFs and their target genes) in individual cells. Log-normalized expression matrix with gene names in rows and cells in columns generated using Seurat was input to SCENIC (version 1.2.4) [47]. Then, motif dataset (hg19-tss-centered-10kb-7species.mc9nr.feather) was used to construct regulons for each TF in SCENIC. The co-expressed genes for each TF were constructed with GENIE3 software, followed by Spearman’s correlation between the TF and the potential targets, and then the “runSCENIC” procedure assisted to generate the GRNs (also termed regulons). Finally, regulon activity was analyzed by AUCell (Area Under the Curve) software, where a default threshold was applied to binarize the specific regulons (“0” present “off” of TFs, and “1” refer to “on”). Detailed information of the cell markers was recorded in Supplementary Table 4.
Experimental validation
CFPAC-1 (metastatic pancreatic cell line) and three primary pancreatic cancer cell lines (BxPC-3 cells, MiaPaCa-2 cells, and PANC-1 cells) were purchased from the Cell Bank of the Type Culture Collection of the Chinese Academy of Sciences, Shanghai Institute of Biochemistry and Cell Biology. All cells utilized were free of mycoplasma contamination and regularly tested for mycoplasma and purity of the culture by Short-tandem repeat polymorphism analysis (STR) profiling. The cell lines were all cultured in Roswell Park Memorial Institute (RPMI-1640) medium plus 10% fetal bovine serum (FBS; Invitrogen, Carlsbad, CA, USA). All cell lines were grown without antibiotics in a humidified atmosphere of 5% CO2 and 99% relative humidity at 37 °C. Three different cell lines were subjected to quantitative real-time polymerase chain reaction (qRT-PCR). Quantitative real-time PCR was analyzed as described previously [48]. All samples were analyzed in triplicates. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) levels were used as the endogenous control and relative expression of LAMC2/CSTB/HMGA1 was calculated using the 2-ΔΔCt method. The sequences of primers used for PCR were as follows: LAMC2, 5′- TCGGGATACTCACAGGCTCATCAC-3′ (forward) and 5′- GTGGTCTGAGGCAGGAATGTTAGTG-3′ (reverse); CSTB, 5′- GTGAGGTCCCAGCTTGAAGAGAAAG-3′ (forward) and 5′- GCGACCACCTGGCTCTTGAATG-3′ (reverse); HMGA1, 5′- CGAAGTGCCAACACCTAAGAGACC-3′ (forward) and 5′-GATGCCCTCCTCTTCCTCCTTCTC-3′ (reverse); and GAPDH, 5′- CAGGAGGCATTGCTGATGAT-3′ (forward) and 5′-GAAGGCTGGGGCTCATTT-3′ (reverse).
Statistical analysis
All statistical analyses were performed using R (http://www.r-project.org). We do not display each data point in all box and violin plots because a large number of data points would obscure the overall distribution. A two-sided paired or unpaired Student’s t-test and unpaired Wilcoxon rank-sum test was used where indicated. P < 0.05 was considered to indicate statistical significance.
Supplementary information
Acknowledgements
The authors would like to give our sincere appreciation to the reviewers for their helpful comments on this article and research groups for the GSE154778 and ICGC-PACA-AC, which provided data for this collection.
Author contributions
HW designed the overall study and revised the paper, XQH drafted the manuscript, and performed data analysis. CSH and HYB participated in data collection. All authors read and approved the final manuscript.
Funding statement
This study was supported by Funding of Wenzhou Municipal Science and Technology Bureau (Grant No. Y2020971).
Data availability
The datasets generated for this study can be found in the GEO database (https://www.ncbi.nlm.nih.gov/geo/) and the ICGC dataset (https://dcc.icgc.org/). All the data generated or analyzed during this study are included in this article and its supplementary information files or available from the author upon reasonable request.
Code availability
R and other custom scripts for analyzing data are available upon request.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Qianhui Xu, Shaohuai Chen, Yuanbo Hu.
Supplementary information
The online version contains supplementary material available at 10.1038/s41420-021-00663-1.
References
- 1.Rahib L, Smith BD, Aizenberg R, Rosenzweig AB, Fleshman JM, Matrisian LM. Projecting cancer incidence and deaths to 2030: the unexpected burden of thyroid, liver, and pancreas cancers in the United States. Cancer Res. 2014;74:2913–21. doi: 10.1158/0008-5472.CAN-14-0155. [DOI] [PubMed] [Google Scholar]
- 2.Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA. 2018;68:7–30. doi: 10.3322/caac.21442. [DOI] [PubMed] [Google Scholar]
- 3.Ryan DP, Hong TS, Bardeesy N. Pancreatic adenocarcinoma. N Engl J Med. 2014;371:1039–49. doi: 10.1056/NEJMra1404198. [DOI] [PubMed] [Google Scholar]
- 4.Vincent A, Herman J, Schulick R, Hruban RH, Goggins M. Pancreatic cancer. Lancet. 2011;378:607–20. doi: 10.1016/S0140-6736(10)62307-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Raphael BJ, Hruban RH, Aguirre AJ. The Cancer Genome Atlas Research Network. Integrated genomic characterization of pancreatic ductal adenocarcinoma. Cancer Cell. 2017;32:203.e13. doi: 10.1016/j.ccell.2017.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Qian Y, Gong Y, Fan Z, Luo G, Huang Q, Deng S, et al. Molecular alterations and targeted therapy in pancreatic ductal adenocarcinoma. J Hematol Oncol. 2020;13:130. doi: 10.1186/s13045-020-00958-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hegde S, Krisnawan VE, Herzog BH, Zuo C, Breden MA, Knolhoff BL, et al. Dendritic cell paucity leads to dysfunctional immune surveillance in pancreatic cancer. Cancer Cell. 2020;37:289–307. doi: 10.1016/j.ccell.2020.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dorado J, Lonardo E, Miranda-Lorenzo I, Heeschen C. Pancreatic cancer stem cells: new insights and perspectives. J Gastroenterol. 2011;46:966–73. doi: 10.1007/s00535-011-0422-x. [DOI] [PubMed] [Google Scholar]
- 9.Matsuda Y, Kure S, Ishiwata T. Nestin and other putative cancer stem cell markers in pancreatic cancer. Med Mol Morphol. 2012;45:59–65. doi: 10.1007/s00795-012-0571-x. [DOI] [PubMed] [Google Scholar]
- 10.Segerstolpe Å, Palasantza A, Eliasson P, Andersson EM, Andréasson AC, Sun X, et al. Single-cell transcriptome profiling of human pancreatic islets in health and type 2 diabetes. Cell Metab. 2016;24:593–607. doi: 10.1016/j.cmet.2016.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Reichert M, Rustgi AK. Pancreatic ductal cells in development, regeneration, and neoplasia. J Clin Investig. 2011;121:4572–8. doi: 10.1172/JCI57131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Young MD, Mitchell TJ, Vieira Braga FA, Tran MGB, Stewart BJ, Ferdinand JR, et al. Single-cell transcriptomes from human kidneys reveal the cellular identity of renal tumors. Science. 2018;361:594–9. doi: 10.1126/science.aat1699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Jaitin DA, Weiner A, Yofe I, Lara-Astiaso D, Keren-Shaul H, David E, et al. Dissecting immune circuits by linking CRISPR-pooled screens with single-cell RNA-seq. Cell. 2016;167:1883–96. doi: 10.1016/j.cell.2016.11.039. [DOI] [PubMed] [Google Scholar]
- 14.Schelker M, Feau S, Du J, Ranu N, Klipp E, MacBeath G, et al. Estimation of immune cell content in tumour tissue using single-cell RNA-seq data. Nat Commun. 2017;8:2032. doi: 10.1038/s41467-017-02289-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Palmer DR, Krzych U. Cellular and molecular requirements for the recall of IL-4-producing memory CD4(+)CD45RO(+)CD27(-) T cells during protection induced by attenuated Plasmodium falciparum sporozoites. Eur J Immunol. 2002;32:652–61. doi: 10.1002/1521-4141(200203)32:3<652::AID-IMMU652>3.0.CO;2-9. [DOI] [PubMed] [Google Scholar]
- 16.Li L, Dong J, Yan L, Yong J, Liu X, Hu Y, et al. Single-cell RNA-seq analysis maps development of human germline cells and gonadal niche interactions. Cell Stem Cell. 2017;20:858–73. doi: 10.1016/j.stem.2017.03.007. [DOI] [PubMed] [Google Scholar]
- 17.Sugita S, Makabe K, Iwasaki Y, Fujii S, Takahashi M. Natural killer cell inhibition by HLA-E molecules on induced pluripotent stem cell-derived retinal pigment epithelial cells. Investigative Ophthalmol Vis Sci. 2018;59:1719–31. doi: 10.1167/iovs.17-22703. [DOI] [PubMed] [Google Scholar]
- 18.Gebauer F, Wicklein D, Horst J, Sundermann P, Maar H, Streichert T, et al. Carcinoembryonic antigen-related cell adhesion molecules (CEACAM) 1, 5 and 6 as biomarkers in pancreatic cancer. PLoS ONE. 2014;9:e113023. doi: 10.1371/journal.pone.0113023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yao H, Yang Z, Liu Z, Miao X, Yang L, Li D, et al. Glypican-3 and KRT19 are markers associating with metastasis and poor prognosis of pancreatic ductal adenocarcinoma. Cancer Biomarkers: Sect A Dis Markers. 2016;17:397–404. doi: 10.3233/CBM-160655. [DOI] [PubMed] [Google Scholar]
- 20.Patel AP, Tirosh I, Trombetta JJ, Shalek AK, Gillespie SM, Wakimoto H, et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science. 2014;344:1396–401. doi: 10.1126/science.1254257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Nielsen SR, Quaranta V, Linford A, Emeagi P, Rainer C, Santos A, et al. Macrophage-secreted granulin supports pancreatic cancer metastasis by inducing liver fibrosis. Nat Cell Biol. 2016;18:549–60. doi: 10.1038/ncb3340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nizri E, Sternbach N, Bar-David S, Ben-Yehuda A, Gerstenhaber F, Ofir T, et al. T-helper 1 immune response in metastatic lymph nodes of pancreatic ductal adenocarcinoma: a marker for prolonged survival. Ann Surgical Oncol. 2018;25:475–81. doi: 10.1245/s10434-017-6237-0. [DOI] [PubMed] [Google Scholar]
- 23.Hosein AN, Huang H, Wang Z, Parmar K, Du W, Huang J, et al. Cellular heterogeneity during mouse pancreatic ductal adenocarcinoma progression at single-cell resolution. JCI Insight. 2019;5:e129212. doi: 10.1172/jci.insight.129212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Tauriello DVF, Palomo-Ponce S, Stork D, Berenguer-Llergo A, Badia-Ramentol J, Iglesias M, et al. TGFβ drives immune evasion in genetically reconstituted colon cancer metastasis. Nature. 2018;554:538–43. doi: 10.1038/nature25492. [DOI] [PubMed] [Google Scholar]
- 25.Mariathasan S, Turley SJ, Nickles D, Castiglioni A, Yuen K, Wang Y, et al. TGFβ attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature. 2018;554:544–8. doi: 10.1038/nature25501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Panagi M, Voutouri C, Mpekris F, Papageorgis P, Martin MR, Martin JD, et al. TGF-β inhibition combined with cytotoxic nanomedicine normalizes triple negative breast cancer microenvironment towards anti-tumor immunity. Theranostics. 2020;10:1910–22. doi: 10.7150/thno.36936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wang VM, Ferreira RMM, Almagro J, Evan T, Legrave N, Zaw Thin M, et al. CD9 identifies pancreatic cancer stem cells and modulates glutamine metabolism to fuel tumour growth. Nat Cell Biol. 2019;21:1425–35. doi: 10.1038/s41556-019-0407-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Pylayeva-Gupta Y, Lee KE, Hajdu CH, Miller G, Bar-Sagi D. Oncogenic Kras-induced GM-CSF production promotes the development of pancreatic neoplasia. Cancer Cell. 2012;21:836–47. doi: 10.1016/j.ccr.2012.04.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.von Figura G, Fukuda A, Roy N, Liku ME, Morris Iv JP, Kim GE, et al. The chromatin regulator Brg1 suppresses formation of intraductal papillary mucinous neoplasm and pancreatic ductal adenocarcinoma. Nat Cell Biol. 2014;16:255–67. doi: 10.1038/ncb2916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Korc M. Pathways for aberrant angiogenesis in pancreatic cancer. Mol Cancer. 2003;2:8. doi: 10.1186/1476-4598-2-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Choueiri TK, Kaelin WG., Jr Targeting the HIF2-VEGF axis in renal cell carcinoma. Nat Med. 2020;26:1519–30. doi: 10.1038/s41591-020-1093-z. [DOI] [PubMed] [Google Scholar]
- 32.Kent LN, Leone G. The broken cycle: E2F dysfunction in cancer. Nat Rev Cancer. 2019;19:326–38. doi: 10.1038/s41568-019-0143-7. [DOI] [PubMed] [Google Scholar]
- 33.Gartel AL. FOXM1 in cancer: interactions and Vulnerabilities. Cancer Res. 2017;77:3135–9. doi: 10.1158/0008-5472.CAN-16-3566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Donehower LA, Soussi T, Korkut A, Liu Y, Schultz A, Cardenas M, et al. Integrated analysis of TP53 gene and pathway alterations in The Cancer Genome Atlas. Cell Rep. 2019;28:1370–84. doi: 10.1016/j.celrep.2019.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Okada Y, Takahashi N, Takayama T, Goel A. LAMC2 promotes cancer progression and gemcitabine resistance through modulation of EMT and ATP-binding cassette transporters in pancreatic ductal adenocarcinoma. Carcinogenesis. 2021;42:546–56. doi: 10.1093/carcin/bgab011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Liau SS, Jazag A, Whang EE. HMGA1 is a determinant of cellular invasiveness and in vivo metastatic potential in pancreatic adenocarcinoma. Cancer Res. 2006;66:11613–22. doi: 10.1158/0008-5472.CAN-06-1460. [DOI] [PubMed] [Google Scholar]
- 37.Yao D, Dai C, Peng S. Mechanism of the mesenchymal-epithelial transition and its relationship with metastatic tumor formation. Mol Cancer Res. 2011;9:1608–20. doi: 10.1158/1541-7786.MCR-10-0568. [DOI] [PubMed] [Google Scholar]
- 38.Xiong G, Huang H, Feng M, Yang G, Zheng S, You L, et al. MiR-10a-5p targets TFAP2C to promote gemcitabine resistance in pancreatic ductal adenocarcinoma. J Exp Clin Cancer Res. 2018;37:76. doi: 10.1186/s13046-018-0739-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Martinelli P, Carrillo-de Santa Pau E, Cox T, Sainz B, Jr., Dusetti N, Greenhalf W, et al. GATA6 regulates EMT and tumour dissemination, and is a marker of response to adjuvant chemotherapy in pancreatic cancer. Gut. 2017;66:1665–76. doi: 10.1136/gutjnl-2015-311256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Zhan J, Song J, Wang P, Chi X, Wang Y, Guo Y, et al. Kindlin-2 induced by TGF-β signaling promotes pancreatic ductal adenocarcinoma progression through downregulation of transcriptional factor HOXB9. Cancer Lett. 2015;361:75–85. doi: 10.1016/j.canlet.2015.02.039. [DOI] [PubMed] [Google Scholar]
- 41.Lin W, Noel P, Borazanci EH, Lee J, Amini A, Han IW, et al. Single-cell transcriptome analysis of tumor and stromal compartments of pancreatic ductal adenocarcinoma primary tumors and metastatic lesions. Genome Med. 2020;12:80. doi: 10.1186/s13073-020-00776-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33:495–502. doi: 10.1038/nbt.3192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27:1739–40. doi: 10.1093/bioinformatics/btr260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma. 2013;14:7. doi: 10.1186/1471-2105-14-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32:381–6. doi: 10.1038/nbt.2859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Vento-Tormo R, Efremova M, Botting RA, Turco MY, Vento-Tormo M, Meyer KB, et al. Single-cell reconstruction of the early maternal-fetal interface in humans. Nature. 2018;563:347–53. doi: 10.1038/s41586-018-0698-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Aibar S, González-Blas CB, Moerman T, Huynh-Thu VA, Imrichova H, Hulselmans G, et al. SCENIC: single-cell regulatory network inference and clustering. Nat Methods. 2017;14:1083–6. doi: 10.1038/nmeth.4463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Xu Q, Xu H, Deng R, Li N, Mu R, Qi Z, et al. Immunological significance of prognostic alternative splicing signature in hepatocellular carcinoma. Cancer Cell Int. 2021;21:190. doi: 10.1186/s12935-021-01894-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated for this study can be found in the GEO database (https://www.ncbi.nlm.nih.gov/geo/) and the ICGC dataset (https://dcc.icgc.org/). All the data generated or analyzed during this study are included in this article and its supplementary information files or available from the author upon reasonable request.
R and other custom scripts for analyzing data are available upon request.