Abstract
Non-small cell lung cancer (NSCLC) is the most common cancer and cause of cancer-related mortality globally. Increasing evidence suggested that the long non-coding RNAs (lncRNAs) were involved in cancer-related death. To explore the possible prognostic lncRNA biomarkers for NSCLC patients, in the present study, we conducted a comprehensive lncRNA profiling analysis based on 1902 patients from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) datasets. In the discovery phase, we employed 682 patients from the combination of four GEO datasets (GSE30219, GSE31546, GSE33745 and GSE50081) and conducted a seven-lncRNA formula to predict overall survival (OS). Next, we validated our risk-score formula in two independent datasets, TCGA (n=994) and GSE31210 (n=226). Stratified analysis revealed that the seven-lncRNA signature was significantly associated with OS in stage I patients from both discovery and validation groups (all P<0.001). Additionally, the prognostic value of the seven-lncRNA signature was also found to be favorable in patients carrying wild-type KRAS or EGFR. Bioinformatical analysis suggested that the seven-lncRNA signature affected patients’ prognosis by influencing cell cycle-related pathways. In summary, our findings revealed a seven-lncRNA signature that predicted OS of NSCLC patients, especially in those with early tumor stage and carrying wild-type KRAS or EGFR.
Keywords: long non-coding RNA, NSCLC, overall survival, risk score, stage I
Introduction
Non-small cell lung cancer (NSCLC) is one of the most common and lethal malignant diseases worldwide during the past decades due to lacking of early diagnostic and predictive biomarkers [1,2]. With the development of the molecular targeted agents [3] and the therapeutic strategy [4], the 5-year overall survival (OS) rate of NSCLC has been prolonged. However, only one third of NSCLC patients are diagnosed at an early stage [5]. To date, there are limited methods for us to provide a prognostic prediction of NSCLC patients, which is important to choose appropriate therapies for those patients [6]. Therefore, our study focused on finding a prognostic risk-score model derived from the whole-genome data.
Long non-coding RNAs (lncRNAs) are defined as non-protein coding transcripts longer than 200 nucleotides [7,8]. During the past decade, genome-wide sequencing helped researchers discover a large amount of lncRNAs [8]. Increasing studies suggested that the lncRNAs were involved in multiple cellular biological process and were associated with different human diseases including cancer [9,10]. Furthermore, several lncRNAs have been identified as oncogenes, such as MALAT1 in lung cancer [11], HOTAIR in breast cancer [12], and PANDA in hepatocellular carcinoma [13]. Above lncRNAs were also found to be associated with the prognosis of cancer patients [13–15].
Recently, the prognostic values of lncRNAs have attracted much attention in the field of cancer research. Researchers put great effort into investigating the lncRNA biomarkers in multiple malignancies, including gastric cancer [16], breast cancer [17], hepatocellular carcinoma [18] and ovarian cancer [19]. In lung cancer, Tu et al explored the lncRNA biomarkers in based on 739 patients derived from GEO datasets [20] and identified an eight-lncRNA signature which was associated with clinical outcomes. In the present study, we conducted a large sample size (n=1902) and cross-platform (including RNA-seq and microarray data) analysis, and performed multiple validation. Considering the above clear advantages, our findings will provide robust evidence for the exploration of lncRNA biomarkers in NSCLC.
RESULTS
Identification of seven lncRNAs as prognostic biomarkers in discovery group
By subjecting the lncRNA profiling data of discovery group to univariable Cox proportional hazards regression analysis, we obtained 457 lncRNAs whose P-value were less than 0.05. Among them, seven lncRNAs were selected for further analysis, including APTR, DHRS4-AS1, ITGA9-AS1, LINC01137, LOC101927972, RPARP-AS1 and SH3BP5-AS1. Next, we fitted the 7 lncRNAs expression, the corresponding survival status and survival time into the multivariable Cox regression. Using the coefficients obtained from the multivariable Cox regression, a risk-score formula was constructed as following: risk score = - 0.06 x APTR - 0.20 x DHRS4-AS1 - 0.35 x ITGA9-AS1 - 0.23 x LINC01137 - 0.25 x LOC101927972 - 0.06 x RPARP-AS1 - 0.20 x SH3BP5-AS1. Notably, in the formula, every lncRNA expression value is weight by a negative coefficient (Table 1). As shown in Figure 1A and 1B, the heatmap revealed that the expression level of seven lncRNAs was decreased accompanying with the higher risk scores. Furthermore, we calculated the association between risk score and cancer-related death (Figure 1C). Our data showed that mortality rate in high risk group was significant higher than low risk group (Figure 1D), indicating the seven lncRNAs might play a protective role in NSCLC.
Table 1. Seven lncRNAs significantly associated with the OS of NSCLC patients in the training group.
Gene name | Ensemble ID | Chr. | Coordinate | Z score | P value a | Coefficient b |
APTR | ENSG00000214293 | 7 | 77,657,660-77,697,345 | -3.25 | 1.16E-03 | -0.06 |
DHRS4-AS1 | ENSG00000215256 | 14 | 23,938,731-23,988,839 | -3.14 | 1.69E-03 | -0.20 |
ITGA9-AS1 | ENSG00000235257 | 3 | 37,745,432-37,861,780 | -2.83 | 4.62E-03 | -0.35 |
LINC01137 | ENSG00000233621 | 1 | 37,454,879-37,474,443 | 3.26 | 1.12E-03 | -0.23 |
LOC101927972 | ENSG00000269985 | 6 | 5,451,683-5,458,075 | -2.98 | 2.89E-03 | -0.25 |
RPARP-AS1 | ENSG00000269609 | 10 | 102,449,817-102,461,106 | -3.35 | 8.05E-04 | -0.06 |
SH3BP5-AS1 | ENSG00000224660 | 3 | 15,254,184-15,264,498 | -2.92 | 3.45E-03 | -0.20 |
a Derived from the univariable Cox proportional hazards regression analysis in the discovery group.
b Derived from the multivariable Cox proportional hazards regression analysis in the discovery group.
The prognostic values of seven-lncRNA signature in discovery and validation groups
Considering the relationship between the seven-lncRNA signature and patients’ deaths, we next explored whether the seven-lncRNA signature could affect patients’ OS time. Our data in discovery group (Meta-GEO dataset, n=682) showed that the patients who had low risk-scores were supposed to have a longer OS time than higher risk group (Figure 2A). To validate above finding, we employed two validation groups, TCGA and GSE31210 dataset, containing 994 and 226 patients, respectively. As expected, patients in high risk group had a significant increased mortality risk than low risk group either in TCGA (HR=1.34, 95% CI= 1.10-1.63, P =0.004) or GSE31210 datasets (HR=3.06, 95% CI= 1.58-5.93, P =0.002) (Figure 2B and 2C). Moreover, prognostic meta-analysis based on above three groups (n=1902) confirmed that the seven-lncRNA signature was a risk factor for NSCLC patients (combined HR=1.76, 95% CI= 1.26-2.47, P=0.001) (Figure 2D).
The seven-lncRNA signature was associated with OS in stage I patients
To explore the impacts of clinical characteristics on the prognostic values of the seven-lncRNA signature, we performed a set of predefined stratified analysis. As shown in Table 2, we stratified the NSCLC patients by five clinical characteristics (including age, gender, smoking status, pathological subtypes and AJCC stage). According to the results of stratified analysis, the most significant result was found in stage I patients in both discovery and two validation groups (all P<0.001) (Table 2). To validate above findings, we further divided stage I patients into Ia and Ib according AJCC system, and calculated the prognostic values of the seven-lncRNA signature in each subgroups, respectively. Kaplan-Meier method was used to visualize the OS probabilities between high- and low-risk groups. As expected, the overall survival time of high risk group was all significantly shorter than low risk group in either stage Ia or Ib patients (Figure 3). The prognostic meta-analysis showed that the stage Ia patients with high risk scores had a two-fold increased mortality risk than those with low risk scores (combined HR=2.17, 95% CI= 1.15-4.09, P=0.017) (Figure 3D). Similarly, the association between the seven-lncRNA signature and OS also reached a statistical significance in stage Ib patients (combined HR=1.88, 95% CI= 1.30-2.72, P=0.001) (Figure 3H).
Table 2. The association between seven-lncRNA signature and OS of NSCLC patients in discovery and validating groups.
Variable | Discovery Group | Validation Group-1 | Validation Group-2 | ||||||
Number (High/Low) |
HR (95%CI) | P value | Number (High/Low) |
HR (95%CI) | P value | Number (High/Low) |
HR (95%CI) | P value | |
Total | 341/341 | 1.85 (1.53-2.25) | <0.001 | 497/497 | 1.34 (1.10-1.63) | 0.004 | 113/113 | 3.06 (1.45-5.44) | 0.002 |
Age | |||||||||
≤70 | 214/225 | 1.73 (1.36-2.16) | <0.001 | 291/284 | 1.30 (0.99-1.70) | 0.056 | 111/110 | 3.18 (1.45-5.79) | 0.003 |
>70 | 111/85 | 2.16 (1.61-3.28) | <0.001 | 189/206 | 1.39 (1.04-1.87) | 0.029 | 2/3 | 1.24 (0.12-12.79) | 0.855 |
Gender | |||||||||
Male | 203/252 | 1.72 (1.36-2.12) | <0.001 | 281/315 | 1.37 (1.06-1.75) | 0.015 | 47/58 | 5.04 (1.52-9.20) | 0.004 |
Female | 138/89 | 1.95 (1.38-3.02) | <0.001 | 216/182 | 1.25 (0.91-1.75) | 0.170 | 66/55 | 1.69 (0.64-4.52) | 0.298 |
Smoking history | |||||||||
Never smoker | 23/5 | 2.31 (0.41-26.27) | 0.275 | 57/32 | 2.07(1.10-5.46) | 0.032 | 62/53 | 2.14(0.77-5.80) | 0.152 |
Ever smoker | 66/25 | 1.66 (0.80-3.83) | 0.163 | 105/147 | 1.18(0.81-1.71) | 0.398 | 51/60 | 3.84(1.33-7.68) | 0.009 |
Current smoker | 44/13 | 1.68 (0.75-4.54) | 0.193 | 319/299 | 1.35(1.04-1.74) | 0.023 | 0/0 | NA | NA |
Histology | |||||||||
Squamous Carcinoma | 55/136 | 1.32 (0.90-1.88) | 0.170 | 185/309 | 1.41 (1.05-1.83) | 0.020 | NA | NA | NA |
Adenocarcinoma | 199/119 | 1.71 (1.30-2.44) | <0.001 | 312/188 | 1.26 (0.94-1.73) | 0.120 | NA | NA | NA |
AJCC stage | |||||||||
Stage I | 254/203 | 1.70 (1.35-2.23) | < 0.001 | 275/235 | 1.68 (1.25-2.31) | < 0.001 | 91/77 | 9.32 (2.27-15.35) | < 0.001 |
Stage II | 56/70 | 1.97 (1.24-3.02) | 0.004 | 119/158 | 1.09 (0.75-1.57) | 0.649 | 22/36 | 1.03 (0.40-2.66) | 0.944 |
Stage III | 24/59 | 1.13 (0.71-1.82) | 0.608 | 74/89 | 1.05 (0.69-1.59) | 0.818 | 0/0 | NA | NA |
Stage IV | 6/7 | 3.09 (1.25-14.41) | 0.030 | 21/11 | 1.25 (0.51-3.17) | 0.607 | 0/0 | NA | NA |
Abbreviations: HR, Hazard ratio; 95%CI, 95% confidence interval; AJCC, the American Joint Committee on Cancer.
The seven-lncRNA signature was associated with OS in patients carrying wild-type KRAS or EGFR genes
EGFR and KRAS were the most frequent mutant genes and associated with poor prognosis in NSCLC. Previous studies suggested that the mutation frequency of above two genes was significantly decreased in patients with early stage. Bearing this in mind, we also performed stratified analysis based on EGFR or KRAS mutation status. Our data showed that the higher risk score was associated with higher mortality risk in wild-type KRAS or EGFR subgroups (Figure 4). Above results were consistent in the two validation groups, suggesting the seven-lncRNA signature acted as a risk factor for patients carrying wild-type KRAS or EGFR genes.
Cell cycle-related pathways were associated with the seven-lncRNA signature
Finally, to explore the potential mechanisms of the seven-lncRNA signature, we used the WGCNA method to cluster genes that highly correlated with the risk scores based on the profiling data of TCGA dataset [21]. We identified a total of 11 modules and found that pink module was the only positively correlated with the risk-score (P =2x10-105) (Figure 5A and 5B). Pathway enrichment analysis was then performed on an online-based web tool “Metascape” (http://metascape.org/) using the genes in pink module. As shown in Figure 5C, genes were significantly enriched in cell cycle-related pathways, including “cell cycle”, “cell cycle phase transition” and “regulation of cell cycle process” pathways, suggesting the activation of cell cycle-related pathways might contribute to higher mortality risk in patients with high risk scores.
DISCUSSION
In the past decade, lncRNAs have been demonstrated by accumulating evidence to contribute in carcinogenesis and tumor progression [22]. In the present study, we conducted a seven-lncRNA signature, and validated their association with OS in two independent datasets. Moreover, we found the prognostic values of the seven-lncRNA signature reached a high statistical significance in patients with stage I and carrying wild-type KRAS and EGFR genes. To the best of our knowledge, this is the first study, which was based on the high-throughput lncRNA profiling data from nearly two thousand patients, to explore the lncRNAs as prognostic biomarkers for early stage NSCLC patients. Our findings might provide convincing evidence for clinical application.
In the present study, we identified a total seven lncRNAs which were associated with OS in NSCLC, including APTR, DHRS4-AS1, ITGA9-AS1, LINC01137, LOC101927972, RPARP-AS1 and SH3BP5-AS1. Most of these identified lncRNAs were reported in cancer for the first time, except APTR (Alu-mediated p21 transcriptional regulator). Previous studies reported that APTR could promote the cell proliferation and contribute to carcinogenesis by repressing p21 promoter activities in human glioblastomas [21]. By contrast, in this study, we found that the coefficient of APTR was minus, indicating that APTR might play a protective role in NSCLC. The detailed molecular mechanism warrant further investigations.
The most important finding in this study was that the seven-lncRNA signature was significantly associated with OS in stage I patients. Our data showed their association reached a high statistical significance (P<0.001) either in discovery group or in validation groups, indicating the seven-lncRNA signature can provide a powerful prognostic tool for NSCLC with early stage. Recently, the oncogenic roles of EGFR and KRAS mutations in NSCLC have attracted increasing attentions. Patients carrying mutant EGFR or KRAS always had a poorer prognosis than those carrying wild-type genes [23,24]. Thus, the frequencies of wild-type EGFR and KRAS were always high in patients with early stage. We reviewed the EGFR and KRAS mutation status in TCGA-NSCLC dataset, and found that the mutation frequencies of EGFR and KRAS in stage 1 patients were only 10.2% and 5.7%, respectively, which were lower than advanced stage according to literature reports [25,26]. Interestingly, our data also showed that the seven-lncRNA signature was significantly associated with OS in patients carrying wild-type EGFR or KRAS genes, which might partially explain the favorable prognostic values of the seven-lncRNA signature in early stage patients.
In order to investigate the potential mechanisms affected by the seven lncRNAs, we also performed bioinformatic analysis. By using the WGCNA method, we clustered 11 gene modules from more than 5000 differentially expressed genes, and found that only one of them (pink) was positively correlated with seven-lncRNA signature. Pathway enrichment analysis further suggested that the genes in pink module were mostly enriched in cell cycle-related signaling pathways, indicating the seven-lncRNA signature might affect cell cycle-related pathways and consequently contributed to tumor progression.
In summary, we constructed a risk-score model derived from seven lncRNAs to predict the OS time of NSCLC patients. The risk association of the seven-lncRNA signature with survival was more evident in patients with early stage and carrying wild-type EGFR or KRAS genes. We warrant further studies, especially the large, well-performed cohorts to confirm or refuse our findings.
MATERIALS AND METHODS
Sample sources and study design
The raw data of gene expression and corresponding clinical information of NSCLC patients were downloaded from the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) websites. Four GEO datasets (GSE30219, GSE31546, GSE37745 and GSE50081) were combined and used as the discovery group. After removal the five samples without survival data or enough clinical information, 682 patients from four GEO datasets were finally used for further analysis. Meanwhile, 994 patients from TCGA dataset were employed as validation group. Moreover, we also employed 226 patients from another independent GEO dataset (GSE31210) as the second validation group. All patients in GSE31210 belonged to the stage I and II of the American Joint Committee on Cancer (AJCC) system. The RNA expression profiling data of five GEO datasets were all performed on Affymetrix U133 Plus 2.0 microarray platform and the TCGA data was performed on Illumina sequencing platform. The clinical characteristics of discovery and validation groups were shown in Table 3.
Table 3. Clinical features of NSCLC patients in the training and validating groups.
Features | Discovery group (n=682) |
Validation group-1 (n=994) |
Validation group-2 (n=226) |
Age (years), no (%) | |||
≤70 | 469 (71.7) | 575 (58.7) | 221 (97.8) |
>70 | 196 (28.3) | 404 (41.3) | 5 (2.2) |
Gender, no (%) | |||
Male | 227 (33.3) | 596 (60.0) | 105 (46.5) |
Female | 455 (66.7) | 398 (40.0) | 121 (53.5) |
Smoking status, no (%) | |||
Never smoker | 28 (15.9) | 89 (9.3) | 115 (50.9) |
Ever smoker Current smoker |
91 (51.7) 57 (32.4) |
252 (26.3) 618 (64.4) |
111 (49.1) 0 (0) |
Pathological grade, no (%) | |||
Squamous Carcinoma | 191 (37.5) | 494 (49.7) | NA |
Adenocarcinoma | 318 (62.5) | 500 (50.3) | NA |
AJCC stage, no (%) | |||
I | 457 (67.3) | 510 (51.9) | 168 (74.3) |
II | 126 (18.6) | 277 (28.2) | 58 (25.7) |
III | 83 (12.2) | 163 (16.6) | 0 (0) |
IV | 13 (1.9) | 32 (3.3) | 0 (0) |
KRAS Mutant type Wild type EGFR Mutant type Wild type |
NA NA NA NA |
24 (31.2) 53 (68.8) 101 (17.9) 463 (82.1) |
20 (8.8) 206 (91.2) 127 (56.2) 99 (43.8) |
Death status (%) | |||
Yes | 418 (61.3) | 394 (39.6) | 9 (8.0) |
No | 264 (38.7) | 600 (60.4) | 104 (92.0) |
Recurrence status (%) | |||
Yes | 222 (39.4) | 221 (22.2) | 64 (28.3) |
No | 341 (60.6) | 773 (77.8) | 162 (71.7) |
Abbreviations: AJCC, the American Joint Committee on Cancer; KRAS, Kirsten rat sarcoma viral oncogene homolog; EGFR, epidermal growth factor receptor.
Normalization and lncRNA annotation of GEO data and TCGA data
The raw data of five GEO datasets (GSE30219, GSE31210, GSE31546, GSE37745 and GSE50081) were downloaded as probe-level CEL files. Then, all raw data were quantile normalized using Robust Multi-array Average (RMA) method. With the lncRNA-specific probes presented on Affymetrix U133 Plus 2.0 downloaded from Affymetrix website (http://www.affymetrix.com) [15], we finally identified 2986 lncRNA transcripts with RefSeq transcript IDs. As for TCGA data, we downloaded the sequencing data with the type of fragments per kilobase of exon per million fragments (FPKM) from the TCGA website (https://tcga-data.nci.nih.gov/).
Construction of risk-score formula
After normalization and annotation of GEO raw data, we subjected 2986 lncRNAs to univariable Cox regression proportional hazards regression analysis to select lncRNAs which were associated with OS of NSCLC patients. Those lncRNAs with a statistical significance in univariable Cox regression were then selected into multivariable Cox regression to obtain the coefficients. By linearly combining the expression value of selected lncRNAs weighted by their coefficients, a risk-score formula was constructed as following [20]:
In our formula, the is the number of 7, the are the expression value of every 7 lncRNAs and the are their corresponding coefficients from the multivariable Cox regression analysis [27].
Survival analysis
According to above formula, the risk scores of NSCLC patients were calculated, by which patients were divided into high- and low-risk group with the cutoff of the median. The Kaplan-Meier method was used to assess the difference in survival time of high- and low- risk NSCLC patients. Difference of survival times between the two groups was considered as significant according to the P value of two-sided log-rank test less than 0.05.
Prognostic meta-analysis
To investigate the combined prognostic values of the seven-lncRNA signature of discovery and two validation groups, we performed a prognostic meta-analysis. All meta-analysis was performed on STATA software, version 12.0 (StataCorp LP, College Station, TX, United States). Pooled HR value was calculated using the random-effects model.
Weighted Correlation Network analysis (WGCNA)
To find the gene modules associated with our risk scores, we construct a co-expression network using the R package “WGCNA” according to our previous reports [28]. The soft thresholding power was selected to 9 to produce a weighted network. The enrolled genes were hierarchically clustered into 11 modules except the gray module.
Pathway enrichment analysis
The pink modules, the most significant modules being associated with risk score, were picked out to perform the pathway enrichment analysis. Pathway enrichment analysis of genes in pink module was performed on an online-based web tool “Metascape” (http://metascape.org/).
Footnotes
CONFLICTS OF INTEREST: The authors have no conflict of interests to declare.
FUNDING: This work was supported by the National Natural Science Foundation of China (No. 81871998) and the Natural Science Foundation of Shaanxi Province (Nos. 2018JM7013).
REFERENCES
- 1.Fintelmann FJ, Gottumukkala RV, McDermott S, Gilman MD, Lennes IT, Shepard JO. Lung Cancer Screening: Why, When, and How? Radiol Clin North Am. 2017; 55:1163–81. 10.1016/j.rcl.2017.06.003 [DOI] [PubMed] [Google Scholar]
- 2.New M, Keith R. Early Detection and Chemoprevention of Lung Cancer. F1000 Res. 2018; 7:61. 10.12688/f1000research.12433.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Reungwetwattana T, Dy GK. Targeted therapies in development for non-small cell lung cancer. J Carcinog. 2013; 12:22. 10.4103/1477-3163.123972 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Petrella F, Spaggiari L. Therapeutic options following pneumonectomy in non-small cell lung cancer. Expert Rev Respir Med. 2016; 10:919–25. 10.1080/17476348.2016.1188694 [DOI] [PubMed] [Google Scholar]
- 5.Tang Y, Qiao G, Xu E, Xuan Y, Liao M, Yin G. Biomarkers for early diagnosis, prognosis, prediction, and recurrence monitoring of non-small cell lung cancer. OncoTargets Ther. 2017; 10:4527–34. 10.2147/OTT.S142149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.i H, Cho JY. Lung Cancer Biomarkers. Adv Clin Chem. 2015; 72:107–70. 10.1016/bs.acc.2015.07.003 [DOI] [PubMed] [Google Scholar]
- 7.Mercer TR, Dinger ME, Mattick JS. Long non-coding RNAs: insights into functions. Nat Rev Genet. 2009; 10:155–59. 10.1038/nrg2521 [DOI] [PubMed] [Google Scholar]
- 8.Rinn JL, Chang HY. Genome regulation by long noncoding RNAs. Annu Rev Biochem. 2012; 81:145–66. 10.1146/annurev-biochem-051410-092902 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kino T, Hurt DE, Ichijo T, Nader N, Chrousos GP. Noncoding RNA gas5 is a growth arrest- and starvation-associated repressor of the glucocorticoid receptor. Sci Signal. 2010; 3:ra8. 10.1126/scisignal.2000568 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wang KC, Yang YW, Liu B, Sanyal A, Corces-Zimmerman R, Chen Y, Lajoie BR, Protacio A, Flynn RA, Gupta RA, Wysocka J, Lei M, Dekker J, et al. A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature. 2011; 472:120–24. 10.1038/nature09819 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wang JZ, Xiang JJ, Wu LG, Bai YS, Chen ZW, Yin XQ, Wang Q, Guo WH, Peng Y, Guo H, Xu P. A genetic variant in long non-coding RNA MALAT1 associated with survival outcome among patients with advanced lung adenocarcinoma: a survival cohort analysis. BMC Cancer. 2017; 17:167. 10.1186/s12885-017-3151-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Soudyab M, Iranpour M, Ghafouri-Fard S. The Role of Long Non-Coding RNAs in Breast Cancer. Arch Iran Med. 2016; 19:508–17. [PubMed] [Google Scholar]
- 13.Mehra M, Chauhan R. Long Noncoding RNAs as a Key Player in Hepatocellular Carcinoma. Biomark Cancer. 2017; 9:X17737301. 10.1177/1179299X17737301 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tian X, Xu G. Clinical value of lncRNA MALAT1 as a prognostic marker in human cancer: systematic review and meta-analysis. BMJ Open. 2015; 5:e008653. 10.1136/bmjopen-2015-008653 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhang KJ, Zhang Y, Luo ZL, Liu L, Yang J, Wu LC, Yu SS, Liu JB. [Long non-coding RNA HOTAIR in plasma as a potential biomarker for breast cancer diagnosis]. Nan Fang Yi Ke Da Xue Xue Bao. 2016; 36:488–92. [PubMed] [Google Scholar]
- 16.Zhu X, Tian X, Yu C, Shen C, Yan T, Hong J, Wang Z, Fang JY, Chen H. A long non-coding RNA signature to improve prognosis prediction of gastric cancer. Mol Cancer. 2016; 15:60. 10.1186/s12943-016-0544-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Meng J, Li P, Zhang Q, Yang Z, Fu S. A four-long non-coding RNA signature in predicting breast cancer survival. J Exp Clin Cancer Res. 2014; 33:84. 10.1186/s13046-014-0084-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wang Z, Wu Q, Feng S, Zhao Y, Tao C. Identification of four prognostic LncRNAs for survival prediction of patients with hepatocellular carcinoma. PeerJ. 2017; 5:e3575. 10.7717/peerj.3575 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yang K, Hou Y, Li A, Li Z, Wang W, Xie H, Rong Z, Lou G, Li K. Identification of a six-lncRNA signature associated with recurrence of ovarian cancer. Sci Rep. 2017; 7:752. 10.1038/s41598-017-00763-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tu Z, He D, Deng X, Xiong M, Huang X, Li X, Hao L, Ding Q, Zhang Q. An eight-long non-coding RNA signature as a candidate prognostic biomarker for lung cancer. Oncol Rep. 2016; 36:215–22. 10.3892/or.2016.4817 [DOI] [PubMed] [Google Scholar]
- 21.Negishi M, Wongpalee SP, Sarkar S, Park J, Lee KY, Shibata Y, Reon BJ, Abounader R, Suzuki Y, Sugano S, Dutta A. A new lncRNA, APTR, associates with and represses the CDKN1A/p21 promoter by recruiting polycomb proteins. PLoS One. 2014; 9:e95216. 10.1371/journal.pone.0095216 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Luo X, Qiu Y, Jiang Y, Chen F, Jiang L, Zhou Y, Dan H, Zeng X, Lei YL, Chen Q. Long non-coding RNA implicated in the invasion and metastasis of head and neck cancer: possible function and mechanisms. Mol Cancer. 2018; 17:14. 10.1186/s12943-018-0763-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tu HY, Ke EE, Yang JJ, Sun YL, Yan HH, Zheng MY, Bai XY, Wang Z, Su J, Chen ZH, Zhang XC, Dong ZY, Wu SP, et al. A comprehensive review of uncommon EGFR mutations in patients with non-small cell lung cancer. Lung Cancer. 2017; 114:96–102. 10.1016/j.lungcan.2017.11.005 [DOI] [PubMed] [Google Scholar]
- 24.Lee CK, Davies L, Wu YL, Mitsudomi T, Inoue A, Rosell R, Zhou C, Nakagawa K, Thongprasert S, Fukuoka M, Lord S, Marschner I, Tu YK, et al. Gefitinib or Erlotinib vs Chemotherapy for EGFR Mutation-Positive Lung Cancer: Individual Patient Data Meta-Analysis of Overall Survival. J Natl Cancer Inst. 2017; 109. 10.1093/jnci/djw279 [DOI] [PubMed] [Google Scholar]
- 25.Dogan S, Shen R, Ang DC, Johnson ML, D’Angelo SP, Paik PK, Brzostowski EB, Riely GJ, Kris MG, Zakowski MF, Ladanyi M. Molecular epidemiology of EGFR and KRAS mutations in 3,026 lung adenocarcinomas: higher susceptibility of women to smoking-related KRAS-mutant cancers. Clin Cancer Res. 2012; 18:6169–77. 10.1158/1078-0432.CCR-11-3265 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Marchetti A, Milella M, Felicioni L, Cappuzzo F, Irtelli L, Del Grammastro M, Sciarrotta M, Malatesta S, Nuzzo C, Finocchiaro G, Perrucci B, Carlone D, Gelibter AJ, et al. Clinical implications of KRAS mutations in lung cancer patients treated with tyrosine kinase inhibitors: an important role for mutations in minor clones. Neoplasia. 2009; 11:1084–92. 10.1593/neo.09814 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhou M, Guo M, He D, Wang X, Cui Y, Yang H, Hao D, Sun J. A potential signature of eight long non-coding RNAs predicts survival in patients with non-small cell lung cancer. J Transl Med. 2015; 13:231. 10.1186/s12967-015-0556-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lv B, Zhang L, Miao R, Xiang X, Dong S, Lin T, Li K, Qu K. Comprehensive analysis and experimental verification of LINC01314 as a tumor suppressor in hepatoblastoma. Biomed Pharmacother. 2018; 98:783–92. 10.1016/j.biopha.2018.01.013 [DOI] [PubMed] [Google Scholar]