Abstract
Background
Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive malignant tumor of the digestive system. Its grim prognosis is mainly attributed to the lack of means for early diagnosis and poor response to treatments. Genomic instability is shown to be an important cancer feature and prognostic factor, and its pattern and extent may be associated with poor treatment outcomes in PDAC. Recently, it has been reported that long non-coding RNAs (lncRNAs) play a key role in maintaining genomic instability. However, the identification and clinical significance of genomic instability-related lncRNAs in PDAC have not been fully elucidated.
Methods
Genomic instability-derived lncRNA signature (GILncSig) was constructed based on the results of multiple regression analysis combined with genomic instability-associated lncRNAs and its predictive power was verified by the Kaplan-Meier method. And real-time quantitative polymerase chain reaction (qRT-PCR) was used for simple validation in human cancers and their adjacent non-cancerous tissues. In addition, the correlation between GILncSig and tumor microenvironment (TME) and epithelial-mesenchymal transition (EMT) was investigated by Pearson correlation analysis.
Results
The computational framework identified 206 lncRNAs associated with genomic instability in PDAC and was subsequently used to construct a genome instability-derived five lncRNA-based gene signature. Afterwards, we successfully validated its prognostic capacity in The Cancer Genome Atlas (TCGA) cohort. In addition, via careful examination of the transcriptome expression profile of PDAC patients, we discovered that GILncSig is associated with EMT and an adaptive immunity deficient immune profile within TME.
Conclusions
Our study established a genomic instability-associated lncRNAs-derived model (GILncSig) for prognosis prediction in patients with PDAC, and revealed the potential functional regulatory role of GILncSig.
Keywords: long non-coding RNAs, genome instability, tumor microenvironment, pancreatic ductal adenocarcinoma, epithelial-mesenchymal transition
Highlights
We established a mutational hypothesis-derived computational framework for identifying genomic instability-associated lncRNAs in PDAC and constructed lncRNA signatures to better predict the prognosis of PDAC patients.
In addition, we investigated the immune profile characteristics and potential functional regulatory effects associated with lncRNA signatures, and we found that GILncSig is associated with EMT as well as adaptive immune deficiency immune profiles within the TME.
Our findings may improve prognostic prediction methods for PDAC and provide potential guidance for precise immunotherapy in the future.
1 Introduction
Pancreatic ductal adenocarcinoma is one of the aggressive solid malignancies, and the rising incidence of PDAC is expected to be the second leading cause of cancer-related mortality by 2030 (1, 2). However, only 10% to 20% of pancreatic cancer patients have the chance of surgery as most patients have distant metastasis of the lesion at the time of diagnosis (3). Moreover, even for patients with the chance of surgery, they still possessed a rather low 5-year survival rate and over 80% postoperative recurrence rate (4). Despite recent advances in pancreatic cancer research, there has been no significant reduction in overall mortality and morbidity (5), because of the lack of specific symptoms and reliable biomarkers for early diagnosis, as well as poor response to treatment due to tumor dissemination. Therefore, there is an urgent need to develop new and effective strategies that can predict prognosis and improve therapeutic targeting to achieve personalized treatment.
Genomic instability is shown to be an important cancer characteristic as well as a prognostic factor, and the pattern and degree of which is associated with tumor progression and recurrence (6, 7). Although the specific molecular mechanisms affecting genomic instability are not fully understood yet. Recently, long non-coding RNAs, a group of non-coding RNAs with more than 200 nucleotides in length (8, 9), are considered to have the potential to quantitatively measure genomic instability (10–16). Interestingly, quite a few previous studies have reported a variety of lncRNAs that may contribute to the carcinogenesis and development of PDAC (17–19). Therefore, we believe that lncRNAs may represent a new class of PDAC biomarkers and therapeutic targets. Likewise, genomic-instability related lncRNAs were successfully used to build prognostic models for other types of cancer, including breast cancer, gastric cancer, and glioblastoma (20–22).
Besides, it is widely believed that studies targeting the crosstalk between tumor cells and the TME will shed light on the novel treatment measures for pancreatic cancer. Immune checkpoint inhibitors have been reported to show durable clinical benefits in many malignancies (23). However, we found that the effect of this class of drugs was not satisfactory in PDAC, which may be ascribed to the distinct TME profile. Cancer often creates a favorable TME for its successful growth by disrupting the immune, vascular, and connective tissue components of the stroma that counter the physiological responses to damage. Among them, the intensive interstitial and highly immunosuppressive environment is a special weapon for PDAC (24–26). At the same time, the degree of T-cell infiltration in PDAC patients correlates with disease progression, and patients with a higher level of T-cell infiltration are generally more sensitive to immunotherapy (27, 28).
In this study, we employed bioinformatics and statistics methods, combined with the lncRNA expression profile of tumor genomes, the somatic mutation profile of tumor genomes, and the clinical features of PDCA patients to establish a genomic-instability associated lncRNA-derived signature called GILncSig. The GILncSig risk score was calculated as a surrogate tool for assessing the likelihood of survival in patients with PDCA, and its prognostic value was further validated by survival analysis. Moreover, to understand the concomitant functional regulatory effects of GILncSig on the transcriptomic expression profiles, we conducted graph-based clustering analysis, differential expression analysis and functional enrichment analysis to in-depth analyze transcriptome expression characteristics associated with GILncSig, revealing that GILncSig is related to EMT. In addition, we investigated the lncRNA signature associated immune profile characteristics and potential functional regulatory effects. We believe that our findings may improve the prognostic prediction method of PDAC and provide potential guidance for future precise immunotherapy.
2 Materials and methods
2.1 Data collection
RNA-seq expression data, clinical features, clinicopathological characteristics, survival information, and somatic mutation information of patients with pancreatic ductal adenocarcinoma were collected from TCGA database (https://portal.gdc.cancer.gov/). LncRNA expression data were downloaded from the TANRIC database (http://bioinformatics.mdanderson.org/main/TANRIC: Overview, version 1.0.6). Due to the missing values in the follow-up dataset, 7 patients from the TCGA cohort were excluded, and the remaining 171 samples were retained for further study. A flow chart of the inclusion and exclusion criteria for patient data is presented in Supplementary Figure 1 . The patients with PDAC used in this study were randomly divided into the following two patient sets after matching for gender, age, and tumor stage: training set (n = 87) and test set (n = 84), with no significant differences in clinical features between these two sets. The training set was used to identify prognostic lncRNA signature and build prognostic risk model, while the testing set was used to independently validate the performance of the prognostic risk model.
2.2 Identification of genome instability-associated lncRNAs
As described in the previous study, a mutator hypothesis-derived computational frame combining lncRNA expression profiles and somatic mutation profiles in a tumor genome was used to identify genome instability-associated lncRNAs (29). In brief, the cumulative number of somatic mutations was first calculated for each pancreatic cancer patient and sorted in descending order. Next, the first 25% and last 25% of patients were defined as genomic instability (GU) and genomic stability (GS) sample groups, respectively. Finally, the expression profiles of lncRNAs of the GU and GS groups were compared using the significance analysis of microarrays (SAM) method, and the differentially expressed lncRNAs between these two groups (fold change > 1.5 or<0.67 and false discovery rate (FDR) adjusted P< 0.05) were defined as genome instability-associated lncRNAs (29).
2.3 Functional enrichment analysis
To understand the potential functional regulation exerted by the identified genome instability-associated lncRNAs, an exploratory graph-based clustering analysis using the Louvain clustering algorithm was performed on the entire TCGA dataset, identifying three distinct clusters in all pancreatic ductal adenocarcinoma patients. Functional enrichment analysis of the extracted top 50 differentially expressed genes and top 30 differentially expressed transcription factors (TFs) from each cluster was conducted using the Metascape webtool (www.metascape.org) to determine significantly enriched Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (30). As a result, we found that cluster 1 was implicated in EMT.
2.4 Statistical analysis
We used univariate and multivariate Cox proportional hazard regression analysis to evaluate the association between the expression level of genome instability-associated lncRNA and overall survival. According to the coefficients from the multivariate regression analysis and the expression levels of prognostic genome instability-associated lncRNAs, a genome instability-derived lncRNA signature for outcome prediction was constructed as follows:
GILncSig = , (i=1,2,3…n)
In this equation, GILncSig, the established prognostic risk score for patients with pancreatic ductal adenocarcinoma, is calculated by adding up the product of the coefficient of each lncRNA derived from the multivariate regression analysis and its expression level. The median score of the patients in the training set was used as the risk cutoff value to classify patients into either a high-risk group with high GILncSig or a low-risk group with low GILncSig.
Median survival and survival rates were calculated for each prognostic risk group using the Kaplan-Meier method, and the log-rank test was used to assess the survival difference between the high-risk and low-risk groups at the 5% significance level. The independence of GILncSig from other key clinical factors was clarified by multivariate Cox regression and stratification analysis. The performance of GILncSig was also assessed by receiver operating characteristic (ROC) curves. Unsupervised hierarchical clustering analysis for the identification of GU-like and GS-like groups was performed using Euclidean distance and Ward’s linkage method, whereas unsupervised hierarchical clustering analysis for the investigation of the potential functional regulatory role of GILncSig was performed using Louvain clustering algorithm and graph-based clustering method. The association between GILncSig risk groups and cluster 1 was consolidated via Pearson correlation analysis between GILncSig risk score and EMT signature score. All statistical analyses were performed using R-version 3.6.
2.5 Gene set enrichment analysis
Gene set enrichment analysis was carried out using two EMT signatures constructed from previously reported literature (31, 32). Enriched-ness of gene expression in each gene set of patients was defined by a signature score calculated through R VISION package.
2.6 Estimated the immune profile of tumor microenvironment
To better understand the GILncSig-related immune landscape, we used the CIBERSORT algorithm (https://cibersort.stanford.edu/index.php) combined with LM22 to estimate the abundances of immune cell subsets within the TME in each patient, as designed in the previous study (33). An empirical P-value for the deconvolution using Monte Carlo sampling was thereby produced, and cases with a resulting P-value< 0.05 were available for further analysis. We then estimated the Immune Score, namely the ratio of immune matrix components in the TME of each sample, by the ESTIMATE algorithm (34). The higher the Immune Score, the larger the ratio of the immune components in TME. Furthermore, the correlation between GILncSig risk scores and Immune Score was validated by Pearson correlation analysis.
2.7 Tissue specimens
Five formalin-fixed paraffin-embedded pancreatic ductal adenocarcinoma tumor specimens and their adjacent tissues were obtained from the pathology department of the hospital, and qRT-PCR was performed on these tissues. These patients met the following inclusion criteria: (1) Adult patients aged ≥ 18 years and ≤ 75 years, histologically (non-cytologically) diagnosed with PDAC; (2) Patients with stage I-III according to the 8th edition of the American Joint Committee on Cancer (AJCC) classification; (3) Patients with a life expectancy ≥3 months. The exclusion criteria were: (1) Patients diagnosed with other types of pancreatic malignant tumor or malignant tumor(s) of other tissues; (2) Patients with other severe concomitant disease or disorder such as heart, liver, or renal failure; (3) Patients having no adjacent non-cancerous tissue in the paraffin-embedded pancreatic tissue. This study was approved by the local Ethics Committee (Second Xiangya Hospital Ethics Committee) (approved no. 2020-465). The requirement for written informed consent was waived for the retrospective tissue samples included in the pancreatic ductal adenocarcinoma tumor specimens.
2.8 RNA isolation and qRT-PCR analysis
Total RNA was prepared using The ReliaPrep™ FFPE Total RNA Miniprep System (Promega) according to the manufacturer’s instructions. The concentration of the total RNA was detected by NanoDrop 2000 (Thermo Scientific™). Total RNA (1000 ng) was reverse transcribed into cDNA using RevertAid First Strand cDNA Synthesis Kit (Thermo Scientific™). The relative expression of target genes to the housekeeping gene GAPDH was determined by qRT-PCR using GoTaq® qPCR Master Mix (Promega). All primer sequences used in this study were listed in Supplementary Table 1 . Analysis between the two groups was performed by an unpaired t-test; P< 0.05 was considered statistically significant.
3 Results
3.1 Identification of genomic instability-related lncRNAs in pancreatic cancer patients
3.1.1 Identification of genomic instability-related lncRNAs
The flowchart of the study is described in Supplementary Figure 2 . To identify lncRNAs associated with genomic instability, we calculated the cumulative number of somatic mutations per patient and sorted them in ascending order. The top 25% (n = 40) and the last 25% (n = 43) of patients were assigned to GS-like and GU-like groups based on the cumulative number of somatic mutations. Next, a total number of 206 differentially expressed lncRNAs (labeled as DE lncRNAs) between GU-like and GS-like groups were identified based on the comparison of their lncRNA expression profiles (with the absolute value of logFC greater than 1 and FDR-adjusted P-value less than 0.05). Among them, 95 lncRNAs were upregulated and 111 lncRNAs were downregulated in the GU-like group.
3.1.2 Clustering analysis for patient classification
To classify all 178 TCGA patients into either GU-like or GS-like groups, unsupervised hierarchical clustering analysis was performed using 206 DE lncRNAs. As shown in Figure 1A , all 178 samples were clustered into two groups based on the expression levels of the 206 DE lncRNAs. The GU-like group has significantly higher cumulative somatic mutation counts compared with the GS-like group (P< 0.001, Mann–Whitney U test; Figure 1B ). Comparison of the expression level of UBQLN4 gene, a newly identified biomarker for genomic instability, reveals that the GU-like group has a significantly higher expression level of UBQLN4 compared with the GS-like group (P< 0.001, Mann–Whitney U test; Figure 1C ).
3.2 Establishment of genomic instability-derived lncRNA signature and outcome prediction
3.2.1 Screening of prognostic-related lncRNAs using Cox proportional hazard regression analysis in the training set
To explore the potential prognostic values of 206 DE lncRNAs, 7 pancreatic cancer patients from the TCGA cohort were excluded due to missing values in their follow-up dataset. The remaining 171 pancreatic cancer patients were randomly split into 2 sets: training set (n = 87) and testing set (n = 84). Statistical comparison of key clinical features between the patients within the training set and the testing set revealed that no significant differences exist between patients in these 2 sets ( Table 1 ). The training set will be used for the following procedures. To screen for lncRNAs that can be used as prognostic factors, univariate Cox proportional hazard regression analysis was first performed to determine if the expression level of each DE lncRNA is significantly associated with the prognosis, i.e, the overall survival (OS) of pancreatic cancer patients. As a result, 22 DE lncRNAs were identified (P< 0.05, Table 2 ). Next, these 22 candidate DE lncRNAs were subjected to multivariate Cox proportional hazards regression analysis with common clinical features such as age, gender, and tumor grade to further screen lncRNAs with prognostic ability independent of other lncRNAs. Subsequently, five lncRNAs (TM4SF1-AS1, CASC8, PRDM16-DT, LINC00996, AP000892.3, labeled siglncRNAs) were identified (P< 0.1, Table 3 ) as independent prognostic factors.
Table 1.
Covariates | Type | TCGA set | Training set | Testing set | P value |
---|---|---|---|---|---|
Age | <=65 | 90 (52.63%) | 50 (57.47%) | 40 (47.62%) | 0.2556 |
>65 | 81 (47.37%) | 37 (42.53%) | 44 (52.38%) | ||
Gender | FEMALE | 78 (45.61%) | 40 (45.98%) | 38 (45.24%) | 1 |
MALE | 93 (54.39%) | 47 (54.02%) | 46 (54.76%) | ||
Grade | G1-2 | 120 (70.18%) | 63 (72.41%) | 57 (67.86%) | 0.467 |
G3-4 | 49 (28.65%) | 22 (25.29%) | 27 (32.14%) | ||
Unknown | 2 (1.17%) | 2 (2.3%) | 0 (0%) | ||
Stage | Stage I-II | 161 (94.15%) | 82 (94.25%) | 79 (94.05%) | 0.9743 |
Stage III-IV | 7 (4.09%) | 3 (3.45%) | 4 (4.76%) | ||
unknown | 3 (1.75%) | 2 (2.3%) | 1 (1.19%) | ||
T | T1-2 | 28 (16.37%) | 14 (16.09%) | 14 (16.67%) | 1 |
T3-4 | 141 (82.46%) | 72 (82.76%) | 69 (82.14%) | ||
Unknown | 2 (1.17%) | 1 (1.15%) | 1 (1.19%) | ||
M | M0 | 77 (45.03%) | 39 (44.83%) | 38 (45.24%) | 0.662 |
M1 | 4 (2.34%) | 3 (3.45%) | 1 (1.19%) | ||
Unknown | 90 (52.63%) | 45 (51.72%) | 45 (53.57%) | ||
N | N0 | 47 (27.49%) | 19 (21.84%) | 28 (33.33%) | 0.1155 |
N1-3 | 119 (69.59%) | 66 (75.86%) | 53 (63.1%) | ||
Unknown | 5 (2.92%) | 2 (2.3%) | 3 (3.57%) |
Table 2.
lncRNA | Hazard Ratio | P Value |
---|---|---|
PRDM16-DT | 0.72247472 | 0.01877915 |
BX640514.2 | 1.31449287 | 0.00069123 |
AC008969.1 | 0.29543606 | 0.03292011 |
LINC00996 | 0.30959956 | 0.04539756 |
AL121929.3 | 0.61423882 | 0.0292435 |
AC120049.1 | 0.24663223 | 0.01804794 |
LINC02716 | 0.17449104 | 0.04206901 |
LINC02577 | 1.35591727 | 0.04757554 |
AC132938.2 | 0.23067713 | 0.0080087 |
SOCS2-AS1 | 0.22994143 | 0.02242125 |
TM4SF1-AS1 | 1.45664584 | 0.00014889 |
AL355803.1 | 0.39480085 | 0.02972008 |
AP000892.3 | 0.35868049 | 0.01728944 |
LINC01133 | 1.01990993 | 0.00016229 |
AP000757.2 | 0.78413922 | 0.02385144 |
LINC02041 | 1.13331718 | 0.00108853 |
AC087752.3 | 0.28291938 | 0.04381474 |
AL359504.1 | 0.1520814 | 0.00430568 |
AC104695.4 | 1.12588936 | 0.04270116 |
CASC8 | 1.2432148 | 0.00063661 |
SH3PXD2A-AS1 | 1.1224769 | 0.03158918 |
AC015911.3 | 0.3222307 | 0.03580903 |
Table 3.
lncRNA | Coefficient | Hazard ratio | P value |
---|---|---|---|
PRDM16-DT | -0.2782435 | 0.75711245 | 0.05078589 |
LINC00996 | -1.1024105 | 0.33206967 | 0.09393344 |
TM4SF1-AS1 | 0.29167955 | 1.33867397 | 0.00544925 |
AP000892.3 | -1.042062 | 0.35272661 | 0.02984887 |
CASC8 | 0.16646916 | 1.18112711 | 0.02519788 |
3.2.2 Construction of GILncSig and outcome prediction for the training set, testing set, and combined TCGA set
Afterward, a genomic instability-derived lncRNA signature was constructed based on the coefficients of the aforementioned multivariate Cox proportional hazard regression model and the expression level of siglncRNAs. The formula is as follows: GILncSig score = (0.2917 × expression level of TM4SF1-AS1) + (0.1665 × expression level of CASC8) + (-0.2782 × expression level of PRDM16-DT) + (-1.1024 × expression level of LINC00996) + (-1.0421 × expression level of AP000892.3). Of the GILncSig, the coefficient of lncRNA TM4SF1-AS1 and CASC8 were positive, suggesting that they are risk factors as their expressions were correlated with a poor prognosis, whereas the coefficient of lncRNA PRDM16-DT, LINC00996 and AP000892.3 were negative, suggesting that they are protective factors as their expressions were correlated with a better outcome. To predict the survival of pancreatic patients, the risk score for each patient in the training set was obtained through GILncSig. Using the median risk score (1.146) as the cut-off value, these patients were classified into two prognostic groups—either high-risk or low-risk groups. Kaplan-Meier analysis revealed that the survival outcomes of patients in the high-risk group are significantly worse than those in the low-risk group (median OS 1.46 years versus 4.12 years, P< 0.01, log-rank test; Figure 2A ). The survival rate of the high-risk group was 18.6% at 3 years and that of the low-risk group was 54.3%. The area under curve (AUC) yielded by the time-dependent ROC curves analysis of GILncSig was 0.725 ( Figure 2B ). Similarly, the survival analysis and time-dependent ROC curves analysis were applied to the testing set and the combined TCGA cohort. For the testing set, Kaplan-Meier analysis revealed that the survival outcomes of patients in the high-risk group are significantly worse than those in the low-risk group (median OS 1.08 years versus 1.92 years, P = 0.04, log-rank test; Figure 2C ). The survival rate of the high-risk group was 22.5% at 3 years and that of the low-risk group was 48.3%. The AUC yielded by the time-dependent ROC curves analysis of GILncSig was 0.727 ( Figure 2D ). For the combined TCGA cohort, Kaplan-Meier analysis revealed that the survival outcomes of patients in the high-risk group are significantly worse than those in the low-risk group (median OS 1.30 years versus 3.65 years, P< 0.01, log-rank test; Figure 2E ). The survival rate of the high-risk group was 18.8% at 3 years and that of the low-risk group was 51.2%. The AUC yielded by the time-dependent ROC curves analysis of GILncSig was 0.721 ( Figure 2F )
3.2.3 Verification of GILncSig as a valid prognostic factor independent of key clinical features
To clarify the independence of GILncSig from other clinical features, both univariate and multivariate Cox proportional hazard regression analysis were utilized. First, we preprocessed the data of patients in TCGA and excluded data with missing grades, stages, or ages. Next, univariate Cox proportional hazard regression analysis was carried out on GILncSig score and clinical features including score, age, gender, pathological stage, and tumor grade. As a result, GILncSig score, age, and tumor grade were identified as significant prognostic factors (P< 0.05, Table 4 ). Later, multivariate Cox proportional hazard regression analysis was performed among GILncSig score, age, and tumor grade. Finally, GILncSig and age retained their prognostic significance (P< 0.05, Table 5 ), which demonstrated that GILncSig could act as an independent prognostic factor. As age is significantly correlated with the overall survival of pancreatic patients, a stratification analysis is in need to reassure that the significantly different predicted survival outcomes of the high-risk and low-risk groups determined by GILncSig were not attributed to age difference. To do so, we stratified patients in the TCGA set into a young-patient group (n = 90) and an old-patient group (n = 81) according to the median age (age = 65) of the whole TCGA cohort. Then, the GILncSig risk score for patients in each age group was calculated to further divide them into high-risk or low-risk groups. As shown in Figure 2 , the high-risk group has significantly worse overall survival compared with the low-risk group in both the young-patient group (P< 0.001, log-rank test; Figure 2G ) and old-patient group (P = 0.027, log-rank test; Figure 2H ).
Table 4.
Variables | Hazard ratio | P value |
---|---|---|
Age | 1.02720658 | 0.01218871 |
Gender | 0.87372328 | 0.52335853 |
Grade | 1.39198899 | 0.02575897 |
Stage | 1.36518243 | 0.105923 |
GILncSig risk score | 1.01201713 | 0.01771823 |
Table 5.
Variables | Hazard ratio | P value |
---|---|---|
Age | 1.02728148 | 0.0151126 |
Grade | 1.3172337 | 0.06937243 |
GILncSig risk score | 1.01540857 | 0.00362892 |
3.2.4 Alignment of GILncSig scores with somatic mutation and UBQLN4 gene expression patterns
We further explored the variation patterns of somatic mutation counts and UBQLN4 gene expression levels with increasing GILncSig scores to consolidate GILncSig’s association with genomic instability. As can be seen in the heatmap of Figure 3A , patients in the training set, testing set, and combined TCGA set were sorted from left to right based on the values of their computed GILncSig scores on the X axis, while the expression levels of 5 siglncRNAs were shown on the Y axis. The corresponding somatic mutation counts and UBQLN4 gene expression levels of these patients were acquired and plotted accordingly in Figures 3B, C . Upregulated expression levels of risky lncRNAs (TM4SF1-AS1, CASC8) and downregulated expression levels of protective lncRNAs (PRDM16-DT, LINC00996, AP000892.3) were detected in patients with high GILncSig scores, whereas the opposite expression patterns were detected in patients with low GILncSig scores. Statistical comparison of somatic mutation counts and UBQLN4 expression levels between high-risk and low-risk groups within the training set, testing set, and combined TCGA set was performed. It is revealed that high-risk groups have significantly higher somatic mutation counts compared with low-risk groups in all three patient sets (from left to right: P< 0.001, P = 0.002, P< 0.001, Mann–Whitney U test; Figure 3D ) and that high-risk groups have significantly higher UBQLN4 expression levels compared with low-risk groups in the training set and combined TCGA set. Even though the difference did not reach significance in the testing set, there is still a discernible trend from the box and dot plot that the high-risk group possessed a higher expression level of UBQLN4 versus the low-risk group (from left to right: P = 0.0017, P = 0.14, P< 0.001, Mann–Whitney U test; Figure 3E ).
3.3 GILncSig adds value to the current literature field of prognostic pancreatic cancer biomarker
To investigate whether GILncSig can stand as a solid prognostic biomarker for pancreatic cancer, we tested its correlation with some mutated genes in pancreatic cancer and compared their survival outcome predicting capability. KRAS is a classic oncogene that is actively involved in the pathogenesis of pancreatic cancer (35–37). In addition, growing evidence revealed that KRAS is firmly implicated in the diagnosis and prognosis of pancreatic cancer and is heralded as a potential therapeutic target (35, 37). First of all, we compared the proportion of patients with KRAS mutation within the high-risk and low-risk groups, and as can be seen in Figure 4A , high-risk groups occupied a higher percentage of patients with KRAS mutation compared with low-risk groups in the training set, testing set, and combined TCGA set. Next, we categorized patients from the TCGA set into four different sub-groups based on their KRAS mutation status and GILncSig risk group membership. In other words, the following four groups were classified: KRAS Mutation/GU-like group, KRAS Mutation/GS-like group, KRAS Wild/GU-like group and KRAS Wild/GS-like group. Since there was only 1 patient who belonged to the KRAS Mutation/GS-like group, this group was removed from the following analysis. Later, survival analysis was performed. As can be inferred from Figure 4B , the survival outcome was significantly different among these three groups (P = 0.015). KRAS Mutation/GU-like group was predicted to have the worst outcome (median survival time: 1.46 years, 3-year survival rate: 28.0%), KRAS Wild/GS-like group was predicted to have the best outcome (3-year survival rate: 61.2%), whereas KRAS Wild/GU-like group was in between (median survival time: 1.72 years, 3-year survival rate: 28.7%). Our data indicated that GILncSig is able to identify a sub-population of pancreatic cancer patients who might be at a higher mortality rate and thus deserve a more radical treatment regimen that could otherwise go unnoticed due to their KRAS wild type status. Results for other pancreatic cancer-associated mutated genes were also consistent with KRAS, and results for TP53 are presented in Supplementary Figure 3 . Therefore, we believe GILncSig can be an asset to the current literature field of prognostic pancreatic cancer biomarkers.
3.4 Performance comparison of GILncSig with existing lncRNA-related signatures in survival prediction
Finally, we compared the prediction performance of GILncSig with two recently published lncRNA signatures: 3-lncRNA signature obtained from Wu’s study (hereinafter referred as WuLncSig) (38) and 3-lncRNA signature derived from Shi’s study (hereinafter referred as ShiLncSig) (39) using our TCGA patient cohort. As shown in Figure 4C , the AUC at 3 years of OS for our GILncSig is 0.721, which is significantly higher than WuLncSig (AUC = 0.639) and significantly higher than ShiLncSig as well (AUC = 0.651). Even though both WuLncSig and ShiLncSig used a smaller number of lncRNAs (n = 3) than GILncSig (n = 5), we still maintained that our signature should be considered the better model since it can offer a more accurate prediction.
3.5 Functional regulation of GILncSig in pancreatic cancer is potentially associated with EMT and lack of adaptive immunity participation within the TME
3.5.1 Clustering analysis of TCGA dataset yields three different clusters of pancreatic cancer patients
To further understand the potential functional regulatory effects of GILncSig, an explorational graph-based clustering analysis was performed for the whole TCGA dataset. As a result, three distinct clusters were identified among all pancreatic cancer patients and projected onto the UMAP coordinate shown in Figure 5A . The clusters were labeled as cluster 0, cluster 1, and cluster 2 respectively. Top 50 differentially expressed genes and top 30 differentially expressed transcription factors for each cluster were extracted and shown in the heatmaps ( Figures 5B, C ).
3.5.2 Cluster 1 is associated with EMT
Via close examination of the differentially expressed genes and transcription factors derived from each cluster, it can be inferred that cluster 1 is related to EMT, as SNAI2 and ZBED2, which are pivotal EMT-inducing transcription factors, are differentially upregulated within cluster 1 (40, 41). To further confirm this association, we carried out a gene set enrichment analysis using two EMT signatures from previous literature (32, 42). As shown in Figure 5D , patients’ enriched-ness in expression of genes within each gene set were defined by a signature score calculated through R VISION package. We observed that patients highly enriched for each gene set (whose signature score is ranked at 95th percentile or above) all belonged to cluster 1, further statistical comparison also consolidated that those patients of cluster 1 have the highest EMT signature score ( Figure 5E ).
3.5.3 GILncSig is associated with EMT
To explore the relationship between GILncSig and the identified clusters, we performed survival analysis for these three clusters and compared their GILncSig risk score levels. As shown in Figure 6A , patients from cluster 1 have the worst prognosis, whereas patients from cluster 0 have the best prognosis. Interestingly, the survival outcome of the patients from these three clusters corresponds tightly to their GILncSig risk score levels, with patients belonging to cluster 1 having the highest GILncSig risk score and patients belonging to cluster 0 harboring the lowest GILncSig risk score ( Figure 6B ). Furthermore, Pearson correlation analysis showed that the GILncSig risk score is significantly positively associated with EMT signature scores ( Figure 6C ). Together, these data indicate that pancreatic cancer patients with high GILncSig risk score are more likely to undergo EMT within the tumor, which in return may confer them a worse survival outcome.
3.5.4 TME estimation revealed inadequacy of adaptive immunity participation within GILncSig high-risk group
Finally, we conducted TME estimation using the CIBERSORT algorithm to understand the GILncSig-related immune landscape. Statistical comparison of the concentration of 22 immune cell types within the TME revealed a strikingly diminished adaptive immunity participation in the GILncSig high-risk group. As can be seen from Figures 7A, B , the concentration of naive B cells, activated CD4+ memory T cells and CD8+ T cells were significantly reduced in the GILncSig high-risk group. This observation is further affirmed by the Pearson correlation analysis, which showed that the ImmuneScore is significantly inversely correlated with GILncSig risk score, while T cell exclusion score is significantly positively correlated with GILncSig risk score ( Figures 7C, D ). Together, these data suggest that the worse survival outcome of pancreatic cancer patients from the GILncSig high-risk group might be in part attributed to the inadequacy of robust adaptive immune cells, i.e., B cell and T cell infiltration within the TME.
In summary, our data provide evidence that the aberrant expression pattern of GILncSig in pancreatic cancer patients can lead to a more invasive cancer subtype, thereafter rendering them a worse survival prognosis. This may be achieved through the regulation of genes promoting EMT and the hindrance of adaptive immune cell infiltration within the TME.
3.6 Validation of lncRNA signature by real-time quantitative PCR
The expression levels of all lncRNAs and their target genes in five pancreatic ductal adenocarcinoma tissues and matched normal tissues were detected using qRT-PCR. Compared with adjacent normal pancreas, the mRNA expression of TM4SF1-AS1 and CASC8 was higher, while the expression of PRDM16-DT(P<0.05), AP000892.3(P<0.05), and LINC00996 was lower in PDAC tissues ( Figure 8A ). The expression trend of target genes was consistent with that of lncRNAs ( Figure 8B ). The results of our validation were consistent with the model, but the statistics of most of the lncRNAs did not show significance due to the large difference in cancer tissues among different individuals.
4 Discussion
Here, we established a computational framework for the identification of genomic instability-related lncRNAs, through which 206 lncRNAs associated with genomic instability were identified, and five lncRNAs (TM4SF1-AS1, CASC8, PRDM16-DT, LINC00996, AP000892.3) were selected from them as independent prognostic factors for constructing GILncSig. It has been found that mutations in key genes or aberrant signaling pathways drive the pathogenesis of PDAC, such as the mutation of oncogene KRAS and the frequent inactivation of tumor suppressors including TP53, SMAD4, and CDKN2A. Moreover, these gene mutations converge in KRAS, TGF-β, Wnt, Notch, and ROBO/SLIT signaling pathways as well as chromatin remodeling, DNA repair and other pathways and processes (43). This suggests that the high heterogeneity of PDAC is achieved by the overactivation of many signaling pathways related to growth and proliferation and the alteration of the expression levels of tumor suppressor genes, thereby affecting cell proliferation, survival and invasion. KRAS mutations, on the other hand, are considered the earliest event in PDAC initiation (44). Therefore, we compared the predictive ability of GILncSig with KRAS for survival outcome and found that GILncSig was able to identify pancreatic cancer patients with KRAS mutations who may have a higher mortality rate than the rest of the patients. Studies have pointed out that the different subtypes defined by gene expression patterns and clinical features of patients with pancreatic cancer may be of great value in predicting the prognosis of patients and guiding precision medicine (45).
In addition, three of the five selected lncRNAs associated with genomic instability (PRDM16-DT, LINC00996, AP000892.3) were protective factors, while TM4SF1-AS1 and CASC8 often served as risk factors associated with poor prognosis (46–50). These lncRNAs have been demonstrated to play an important role in the occurrence, development, and prognosis of a variety of malignant tumors (51–53). However, most of them were found to be associated with the prognosis of PDAC for the first time. Notably, CASC8 is not only strongly associated with poor survival in pancreatic ductal adenocarcinoma, but may also be involved in the process of EMT by competitively binding miR-671 (54).
EMT is a complex biological trans-differentiation process that allows epithelial cells to transiently acquire mesenchymal features, including motility and metastatic potential (55, 56). Activation of EMT is thought to be a major driver of tumor progression from initiation to metastasis. For instance, the EMT transcription factor Zeb1 is not only a key factor in lesion formation, invasion and significant metastasis, but also can affect the stemness and colonization ability of tumor cells, especially phenotypic/metabolic plasticity (55). Further studies revealed that lncRNA Linc-ROR can promote tumor invasion and metastasis by regulating Zeb1 (57). Besides, Zhu, W. et al. found that overexpression of lncRNA-CASC8 resulted in up-regulation of TOB1 and low expression of miR-129-5p, which were associated with an increased frequency of lymph node metastasis and a higher trend of pathological stage, respectively, thus validating the CASC8-miR-129-5p-TOB1 regulatory axis (58). These studies all suggested the association of GILncSig with EMT, which we also confirmed in the present study.
The tumor microenvironment of PDAC contains immune components such as interstitial cells, inflammatory cells, and cytokines, which comprise a complex network to promote tumor growth and invasion. It has also been shown that the TME of PDAC is closely related to EMT as well as KRAS (24, 59). So, we used CIBERSORT and estimation algorithms to analyze the details of GILncSig-associated TME profiles, including the estimated proportion of tumor-infiltrating immune cells (TICs) and the quantification of adaptive immune cell exclusion level. The results showed that naïve B cells, activated CD4 + memory T cells, and CD8 + T cell concentrations were significantly lower in the high-risk group. The immune score and T cell exclusion score were significantly negatively and positively correlated with the GILncSigrisk score, respectively, suggesting that this model can predict the degree of infiltration of immune cells. It has been found that reduced infiltration of these adaptive immune cells is associated with the development of pancreatic cancer armed with immune evasion mechanisms and may also impair the effect of immunotherapy in patients (60–63). Therefore, we believe that the treatment of patients can be better guided with the help of this model, especially in the responsiveness of immunotherapy. To our knowledge, this is the first and most comprehensive study to date describing the prognostic and immunotherapeutic response predictive value of TME in patients with PDAC.
Although our study provides important insights for evaluating genomic instability and the prognosis of patients with pancreatic ductal adenocarcinoma, and reveals the association of GILncSig with PDAC immune profiles, there are still some limitations. First, the TCGA cohort containing 171 patients was relatively smaller than the cohort of patients with other cancer types such as breast or lung cancer. In addition, GILncSig was validated only in the TCGA dataset and in five patient specimens due to the lack of a reliable large independent dataset. These shortcomings can only be remedied by further development of these public databases in the future. Second, we need further functional studies to understand the exact regulatory mechanism of GILncSig in maintaining genomic instability. Similarly, the GILncSig-associated tumor microenvironment is achieved by bioinformatics methods, so the results may require more in-depth studies to confirm. Third, there is no data on immunotherapy in the TCGA dataset, so the predictive power of GILncSig for the responsiveness to immunotherapy is indirectly assessed.
In conclusion, we identified lncRNAs associated with genomic instability through a computational framework based on the mutant hypothesis in the present study. By combining lncRNA expression profiles, somatic mutation profiles, and clinical information of pancreatic cancers as case studies, we identified a genomic instability-derived lncRNA signature as an independent prognostic marker to stratify pancreatic cancer patients at risk and validated it in the TCGA cohort. In addition, we used CIBERSORT and estimation algorithms to comprehensively understand the tumor microenvironment in pancreatic cancer patients. Lastly, through the functional enrichment analysis of a distinct cluster of PDAC patients with high GILncSig scores and worse survival prognosis, we found that the distraught expression of genes that promote EMT and prevent adaptive immune cell infiltration in the TME may be the key down-stream regulatory network for GILncSig.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/ Supplementary Material .
Ethics statement
This study was approved by the local Ethics Committee (Second Xiangya Hospital Ethics Committee) (approved no. 2020-465). Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
Author contributions
HY and WZ conceived the study and wrote the manuscript. LX and JH collected the information of patients with pancreatic ductal adenocarcinoma from public databases. JD and WZ performed statistical analysis as well as functional enrichment analysis of the data. YS and HY collected tissue specimens from PDAC patients and completed the validation of GILncSig. ZM, ZS, WP, and YC modifified the methods, expressions, and contents in the study, respectively. YX and ZS are responsible for all aspects of the work to ensure that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the National Key Research and Development Program (2018YFE0114500 to YX), the National Science Foundation of Hunan Province for Excellent Young Scholars (2020JJ3056 to YX), the National Natural Science. Foundation of China (NSFC) grant (81870577 to YX), the Natural Science Foundation of Hunan Province for Youths (2022JJ40718 to LX), the Natural Science Foundation of Changsha (kq2202404 to JH), and the Natural Science Foundation of Hunan Province for Youths (2022JJ40689 to JH).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2022.970588/full#supplementary-material
References
- 1. Rahib L, Smith BD, Aizenberg R, Rosenzweig AB, Fleshman JM, Matrisian LM. Projecting cancer incidence and deaths to 2030: the unexpected burden of thyroid, liver, and pancreas cancers in the united states. Cancer Res (2014) 74(11):2913–21. doi: 10.1158/0008-5472.CAN-14-0155 [DOI] [PubMed] [Google Scholar]
- 2. Arnold M, Abnet CC, Neale RE, Vignat J, Giovannucci EL, McGlynn KA, et al. Global burden of 5 major types of gastrointestinal cancer. Gastroenterology (2020) 159(1):335–49.e315. doi: 10.1053/j.gastro.2020.02.068 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Strobel O, Neoptolemos J, Jäger D, Büchler MW. Optimizing the outcomes of pancreatic cancer surgery. Nat Rev Clin Oncol (2019) 16(1):11–26. doi: 10.1038/s41571-018-0112-1 [DOI] [PubMed] [Google Scholar]
- 4. Garrido-Laguna I, Hidalgo M. Pancreatic cancer: from state-of-the-art treatments to promising novel therapies. Nat Rev Clin Oncol (2015) 12(6):319–34. doi: 10.1038/nrclinonc.2015.53 [DOI] [PubMed] [Google Scholar]
- 5. Hidalgo M. Pancreatic cancer. N Engl J Med (2010) 362(17):1605–17. doi: 10.1056/NEJMra0901557 [DOI] [PubMed] [Google Scholar]
- 6. Hause RJ, Pritchard CC, Shendure J, Salipante SJ. Classification and characterization of microsatellite instability across 18 cancer types. Nat Med (2016) 22(11):1342–50. doi: 10.1038/nm.4191 [DOI] [PubMed] [Google Scholar]
- 7. Pilié PG, Tang C, Mills GB, Yap TA. State-of-the-art strategies for targeting the DNA damage response in cancer. Nat Rev Clin Oncol (2019) 16(2):81–104. doi: 10.1038/s41571-018-0114-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Liu H. Linking lncRNA to genomic stability. Sci China Life Sci (2016) 59(3):328–9. doi: 10.1007/s11427-016-5009-6 [DOI] [PubMed] [Google Scholar]
- 9. Munschauer M, Nguyen CT, Sirokman K, Hartigan CR, Hogstrom L, Engreitz JM, et al. The NORAD lncRNA assembles a topoisomerase complex critical for genome stability. Nature (2018) 561(7721):132–6. doi: 10.1038/s41586-018-0453-z [DOI] [PubMed] [Google Scholar]
- 10. Habermann JK, Doering J, Hautaniemi S, Roblick UJ, Bündgen NK, Nicorici D, et al. The gene expression signature of genomic instability in breast cancer is an independent predictor of clinical outcome. Int J Cancer (2009) 124(7):1552–64. doi: 10.1002/ijc.24017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Zhang S, Yuan Y, Hao D. A genomic instability score in discriminating nonequivalent outcomes of BRCA1/2 mutations and in predicting outcomes of ovarian cancer treated with platinum-based chemotherapy. PLoS One (2014) 9(12):e113169. doi: 10.1371/journal.pone.0113169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. D'Alessandro G, Whelan DR, Howard SM, Vitelli V, Renaudin X, Adamowicz M, et al. BRCA2 controls DNA:RNA hybrid level at DSBs by mediating RNase H2 recruitment. Nat Commun (2018) 9(1):5376. doi: 10.1038/s41467-018-07799-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Hu WL, Jin L, Xu A, Wang YF, Thorne RF, Zhang XD, et al. GUARDIN is a p53-responsive long non-coding RNA that is essential for genomic stability. Nat Cell Biol (2018) 20(4):492–502. doi: 10.1038/s41556-018-0066-7 [DOI] [PubMed] [Google Scholar]
- 14. Petti E, Buemi V, Zappone A, Schillaci O, Broccia PV, Dinami R, et al. SFPQ and NONO suppress RNA:DNA-hybrid-related telomere instability. Nat Commun (2019) 10(1):1001. doi: 10.1038/s41467-019-08863-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Chen B, Dragomir MP, Fabris L, Bayraktar R, Knutsen E, Liu X, et al. The long noncoding RNA CCAT2 induces chromosomal instability through BOP1-AURKB signaling. Gastroenterology (2020) 159(6):2146–62.e2133. doi: 10.1053/j.gastro.2020.08.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Hu Z, Mi S, Zhao T, Peng C, Peng Y, Chen L, et al. BGL3 lncRNA mediates retention of the BRCA1/BARD1 complex at DNA damage sites. EMBO J (2020) 39(12):e104133. doi: 10.15252/embj.2019104133 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Huang F, Chen W, Peng J, Li Y, Zhuang Y, Zhu Z, et al. LncRNA PVT1 triggers cyto-protective autophagy and promotes pancreatic ductal adenocarcinoma development via the miR-20a-5p/ULK1 axis. Mol Cancer (2018) 17(1):98. doi: 10.1186/s12943-018-0845-6 [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 18. Ottaviani S, Stebbing J, Frampton AE, Zagorac S, Krell J, de Giorgio A, et al. TGF-β induces miR-100 and miR-125b but blocks let-7a through LIN28B controlling PDAC progression. Nat Commun (2018) 9(1):1845. doi: 10.1038/s41467-018-03962-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Li N, Yang G, Luo L, Ling L, Wang X, Shi L, et al. lncRNA THAP9-AS1 promotes pancreatic ductal adenocarcinoma growth and leads to a poor clinical outcome via sponging miR-484 and interacting with YAP. Clin Cancer Res (2020) 26(7):1736–48. doi: 10.1158/1078-0432.CCR-19-0674 [DOI] [PubMed] [Google Scholar]
- 20. Zhou M, Zhang Z, Zhao H, Bao S, Cheng L, Sun J. An immune-related six-lncRNA signature to improve prognosis prediction of glioblastoma multiforme. Mol Neurobiol (2018) 55(5):3684–97. doi: 10.1007/s12035-017-0572-9 [DOI] [PubMed] [Google Scholar]
- 21. Ma B, Li Y, Ren Y. Identification of a 6-lncRNA prognostic signature based on microarray re-annotation in gastric cancer. Cancer Med (2020) 9(1):335–49. doi: 10.1002/cam4.2621 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Shen Y, Peng X, Shen C. Identification and validation of immune-related lncRNA prognostic signature for breast cancer. Genomics (2020) 112(3):2640–6. doi: 10.1016/j.ygeno.2020.02.015 [DOI] [PubMed] [Google Scholar]
- 23. Bagchi S, Yuan R, Engleman EG. Immune checkpoint inhibitors for the treatment of cancer: Clinical impact and mechanisms of response and resistance. Annu Rev Pathol (2021) 16:223–49. doi: 10.1146/annurev-pathol-042020-042741 [DOI] [PubMed] [Google Scholar]
- 24. Ren B, Cui M, Yang G, Wang H, Feng M, You L, et al. Tumor microenvironment participates in metastasis of pancreatic cancer. Mol Cancer (2018) 17(1):108. doi: 10.1186/s12943-018-0858-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Ligorio M, Sil S, Malagon-Lopez J, Nieman LT, Misale S, Di Pilato M, et al. Stromal microenvironment shapes the intratumoral architecture of pancreatic cancer. Cell (2019) 178(1):160–75.e127. doi: 10.1016/j.cell.2019.05.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Ho WJ, Jaffee EM, Zheng L. The tumour microenvironment in pancreatic cancer - clinical challenges and opportunities. Nat Rev Clin Oncol (2020) 17(9):527–40. doi: 10.1038/s41571-020-0363-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Stromnes IM, Hulbert A, Pierce RH, Greenberg PD, Hingorani SR. T-Cell localization, activation, and clonal expansion in human pancreatic ductal adenocarcinoma. Cancer Immunol Res (2017) 5(11):978–91. doi: 10.1158/2326-6066.CIR-16-0322 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Peng J, Sun BF, Chen CY, Zhou JY, Chen YS, Chen H, et al. Single-cell RNA-seq highlights intra-tumoral heterogeneity and malignant progression in pancreatic ductal adenocarcinoma. Cell Res (2019) 29(9):725–38. doi: 10.1038/s41422-019-0195-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Bao S, Zhao H, Yuan J, Fan D, Zhang Z, Su J, et al. Computational identification of mutator-derived lncRNA signatures of genome instability for improving the clinical outcome of cancers: a case study in breast cancer. Brief Bioinform (2020) 21(5):1742–55. doi: 10.1093/bib/bbz118 [DOI] [PubMed] [Google Scholar]
- 30. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun (2019) 10(1):1523. doi: 10.1038/s41467-019-09234-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Creighton CJ, Gibbons DL, Kurie JM. The role of epithelial-mesenchymal transition programming in invasion and metastasis: a clinical perspective. Cancer Manag Res (2013) 5:187–95. doi: 10.2147/cmar.S35171 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Jung AR, Jung CH, Noh JK, Lee YC, Eun YG. Epithelial-mesenchymal transition gene signature is associated with prognosis and tumor microenvironment in head and neck squamous cell carcinoma. Sci Rep (2020) 10(1):3652. doi: 10.1038/s41598-020-60707-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods (2015) 12(5):453–7. doi: 10.1038/nmeth.3337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun (2013) 4:2612. doi: 10.1038/ncomms3612 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Waters AM, Der CJ. KRAS: The critical driver and therapeutic target for pancreatic cancer. Cold Spring Harb Perspect Med (2018) 8(9):656–70. doi: 10.1101/cshperspect.a031435 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. di Magliano MP, Logsdon CD. Roles for KRAS in pancreatic tumor development and progression. Gastroenterology (2013) 144(6):1220–9. doi: 10.1053/j.gastro.2013.01.071 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Buscail L, Bournet B, Cordelier P. Role of oncogenic KRAS in the diagnosis, prognosis and treatment of pancreatic cancer. Nat Rev Gastroenterol Hepatol (2020) 17(3):153–68. doi: 10.1038/s41575-019-0245-4 [DOI] [PubMed] [Google Scholar]
- 38. Wu B, Wang K, Fei J, Bao Y, Wang X, Song Z, et al. Novel three−lncRNA signature predicts survival in patients with pancreatic cancer. Oncol Rep (2018) 40(6):3427–37. doi: 10.3892/or.2018.6761 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Shi X, Zhao Y, He R, Zhou M, Pan S, Yu S, et al. Three-lncRNA signature is a potential prognostic biomarker for pancreatic adenocarcinoma. Oncotarget (2018) 9(36):24248–59. doi: 10.18632/oncotarget.24443 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Ganesan R, Mallets E, Gomez-Cambronero J. The transcription factors slug (SNAI2) and snail (SNAI1) regulate phospholipase d (PLD) promoter in opposite ways towards cancer cell invasion. Mol Oncol (2016) 10(5):663–76. doi: 10.1016/j.molonc.2015.12.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Somerville TDD, Xu Y, Wu XS, Maia-Silva D, Hur SK, de Almeida LMN, et al. ZBED2 is an antagonist of interferon regulatory factor 1 and modifies cell identity in pancreatic cancer. Proc Natl Acad Sci U.S.A. (2020) 117(21):11471–82. doi: 10.1073/pnas.1921484117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Chae YK, Chang S, Ko T, Anker J, Agte S, Iams W, et al. Epithelial-mesenchymal transition (EMT) signature is inversely associated with T-cell infiltration in non-small cell lung cancer (NSCLC). Sci Rep (2018) 8(1):2918. doi: 10.1038/s41598-018-21061-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Ying H, Dey P, Yao W, Kimmelman AC, Draetta GF, Maitra A, et al. Genetics and biology of pancreatic ductal adenocarcinoma. Genes Dev (2016) 30(4):355–85. doi: 10.1101/gad.275776.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Eser S, Schnieke A, Schneider G, Saur D. Oncogenic KRAS signalling in pancreatic cancer. Br J Cancer (2014) 111(5):817–22. doi: 10.1038/bjc.2014.215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Bailey P, Chang DK, Nones K, Johns AL, Patch AM, Gingras MC, et al. Genomic analyses identify molecular subtypes of pancreatic cancer. Nature (2016) 531(7592):47–52. doi: 10.1038/nature16965 [DOI] [PubMed] [Google Scholar]
- 46. Zimta AA, Tomuleasa C, Sahnoune I, Calin GA, Berindan-Neagoe I. Long non-coding RNAs in myeloid malignancies. Front Oncol (2019) 9:1048. doi: 10.3389/fonc.2019.01048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Zhang Y, Ye Q, He J, Chen P, Wan J, Li J, et al. Recurrence-associated multi-RNA signature to predict disease-free survival for ovarian cancer patients. BioMed Res Int (2020) 2020:1618527. doi: 10.1155/2020/1618527 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Zhou F, Wang J, Chi X, Zhou X, Wang Z. lncRNA TM4SF1-AS1 activates the PI3K/AKT signaling pathway and promotes the migration and invasion of lung cancer cells. Cancer Manag Res (2020) 12:5527–36. doi: 10.2147/cmar.S254072 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. López de Maturana E, Rodríguez JA, Alonso L, Lao O, Molina-Montes E, Martín-Antoniano IA, et al. A multilayered post-GWAS assessment on genetic susceptibility to pancreatic cancer. Genome Med (2021) 13:15. doi: 10.1186/s13073-020-00816-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Shao L. Identification of hub lncRNAs in head and neck cancer based on weighted gene co-expression network analysis and experiments. FEBS Open Bio (2021) 11(7):2060–73. doi: 10.1002/2211-5463.13134 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Ge H, Yan Y, Wu D, Huang Y, Tian F. Potential role of LINC00996 in colorectal cancer: a study based on data mining and bioinformatics. Onco Targets Ther (2018) 11:4845–55. doi: 10.2147/OTT.S173225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Yang B, Gu B, Zhang J, Xu L, Sun Y. CASC8 lncRNA promotes the proliferation of retinoblastoma cells through downregulating miR34a methylation. Cancer Manag Res (2020) 12:13461–7. doi: 10.2147/CMAR.S268380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Zhou S, Fang J, Sun Y, Li H. Integrated analysis of a risk score system predicting prognosis and a ceRNA network for differentially expressed lncRNAs in multiple myeloma. Front Genet (2020) 11:934. doi: 10.3389/fgene.2020.00934 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Wang Y, Yang Y, Wang Y, Li X, Xiao Y, Wang W. High cancer susceptibility candidate 8 expression is associated with poor prognosis of pancreatic adenocarcinoma: Validated analysis based on four cancer databases. Front Cell Dev Biol (2020) 8:392. doi: 10.3389/fcell.2020.00392 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Krebs AM, Mitschke J, Lasierra Losada M, Schmalhofer O, Boerries M, Busch H, et al. The EMT-activator Zeb1 is a key factor for cell plasticity and promotes metastasis in pancreatic cancer. Nat Cell Biol (2017) 19(3):518–29. doi: 10.1038/ncb3513 [DOI] [PubMed] [Google Scholar]
- 56. Saitoh M. Involvement of partial EMT in cancer progression. J Biochem (2018) 164(4):257–64. doi: 10.1093/jb/mvy047 [DOI] [PubMed] [Google Scholar]
- 57. Zhan HX, Wang Y, Li C, Xu JW, Zhou B, Zhu JK, et al. LincRNA-ROR promotes invasion, metastasis and tumor growth in pancreatic cancer through activating ZEB1 pathway. Cancer Lett (2016) 374(2):261–71. doi: 10.1016/j.canlet.2016.02.018 [DOI] [PubMed] [Google Scholar]
- 58. Zhu W, Gao W, Deng Y, Yu X, Zhu H. Identification and development of long non-coding RNA associated regulatory network in pancreatic adenocarcinoma. Onco Targets Ther (2020) 13:12083–96. doi: 10.2147/OTT.S265036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Hou P, Kapoor A, Zhang Q, Li J, Wu CJ, Li J, et al. Tumor microenvironment remodeling enables bypass of oncogenic KRAS dependency in pancreatic cancer. Cancer Discov (2020) 10(7):1058–77. doi: 10.1158/2159-8290.CD-19-0597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Li J, Byrne KT, Yan F, Yamazoe T, Chen Z, Baslan T, et al. Tumor cell-intrinsic factors underlie heterogeneity of immune cell infiltration and response to immunotherapy. Immunity (2018) 49(1):178–93.e177. doi: 10.1016/j.immuni.2018.06.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Das S, Shapiro B, Vucic EA, Vogt S, Bar-Sagi D. Tumor cell-derived IL1β promotes desmoplasia and immune suppression in pancreatic cancer. Cancer Res (2020) 80(5):1088–101. doi: 10.1158/0008-5472.CAN-19-2080 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Mirlekar B, Michaud D, Lee SJ, Kren NP, Harris C, Greene K, et al. B cell-derived IL35 drives STAT3-dependent CD8(+) T-cell exclusion in pancreatic cancer. Cancer Immunol Res (2020) 8(3):292–308. doi: 10.1158/2326-6066.CIR-19-0349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Takahashi R, Macchini M, Sunagawa M, Jiang Z, Tanaka T, Valenti G, et al. Interleukin-1β-induced pancreatitis promotes pancreatic ductal adenocarcinoma via b lymphocyte-mediated immune suppression. Gut (2021) 70(2):330–41. doi: 10.1136/gutjnl-2019-319912 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/ Supplementary Material .