Skip to main content
OncoTargets and Therapy logoLink to OncoTargets and Therapy
. 2018 May 25;11:3129–3140. doi: 10.2147/OTT.S151121

Construction of a specific SVM classifier and identification of molecular markers for lung adenocarcinoma based on lncRNA-miRNA-mRNA network

Jingming Zhao 1, Wei Cheng 1, Xigang He 2, Yanli Liu 1, Ji Li 3, Jiaxing Sun 1, Jinfeng Li 1, Fangfang Wang 1, Yufang Gao 4,
PMCID: PMC5975616  PMID: 29872324

Abstract

Background

Novel diagnostic predictors and drug targets are needed for LUAD (lung adenocarcinoma). We aimed to build a specific SVM (support vector machine) classifier for diagnosis of LUAD and identify molecular markers with prognostic value for LUAD.

Methods

The expression differences of miRNAs, lncRNAs and mRNAs between LUAD and normal samples were compared using data from TCGA (The Cancer Genome Atlas) database. A LUAD related miRNA-lncRNA-mRNA network was constructed, based on which feature genes were selected for the construction of LUAD specific SVM classifier. The robustness and transferability of SVM classifier were validated using gene expression profile datasets GSE43458 and GSE10072. Prognostic markers were identified from the network. A set of LUAD-related differentially expressed miRNAs, lncRNAs and miRNAs were identified and a LUAD related miRNA-lncRNA-mRNA network was obtained. The LUAD specific SVM classifier constructed on the basis of the network was robust and efficient for classification of samples from TCGA dataset and two independent validation datasets.

Results

Eight RNAs with prognostic value were identified, including hsa-miR-96, hsa-miR-204, PGM5P2 (phosphoglucomutase 5 pseudogene 2), SFTA1P (surfactant associated 1), RGS20 (regulator of G protein signaling 20), RGS9BP (RGS9-binding protein), FGB (fibrinogen beta chain) and INA (alpha-internexin). Among them, RGS20 and INA were regulated by hsa-miR-96. RGS20 was also regulated by hsa-miR-204, which was a potential target of SFTA1P.

Conclusion

The LUAD specific SVM classifier may serve as a novel diagnostic predictor. hsa-miR-96, hsa-miR-204, PGM5P2, SFTA1P, RGS20, RGS9BP, FGB and INA may serve as prognostic markers in clinical practice.

Keywords: lung adenocarcinoma, lncRNA-miRNA-mRNA network, SVM classifier, molecular marker, prognosis

Introduction

LUAD (lung adenocarcinoma) is the most common subtype of non-small cell lung cancer, accounting for about 40% of lung cancer worldwide.1,2 Molecularly targeted therapies using TKIs (tyrosine kinase inhibitors) are standard treatments for LUAD patients with mutations in EGFR (epidermal growth factor receptor) and fusions of ALK (anaplastic lymphoma kinase), ROS1 (ROS proto-oncogene 1), and RET (rearranged during transfection).3,4 Acquired resistance, however, often occurs approximately 1–2 years after TKI treatment.4 Moreover, few effective therapies have been developed to target alterations in other genes, such as TP53 (tumor protein p53),5 KEAP1 (kelch-like ECH associated protein 1)6 and STK11 (serine/threonine kinase 11).7 Therefore, it is still urgent for developing new drug targets for the diagnosis and treatment of LUAD.

Increasing evidence has highlighted the involvement of ncRNAs (non-coding RNAs) in tumorigenesis.8 Two typical subtypes of ncRNAs are miRNAs (microRNAs) and lncRNAs (long non-coding RNAs).911 miRNAs are small ncRNAs with about 22 nucleotides, which can interact with target mRNAs to degrade mRNAs or inhibit the translation of mRNA.9,10 In comparison to miRNAs, lncRNAs are much longer ncRNAs with more than 200 nucleotides and function through more diverse mechanisms.9,11 In addition to directly targeting mRNAs, it has also been shown to function as ceRNAs (competing endogenous RNAs), interacting with miRNAs to indirectly regulate mRNAs.11,12 It is thus believed that interplays between lncRNAs and miRNAs may play an important role in tumorigenesis.12 Recently, investigations about the lncRNA-miRNA-mRNA ceRNA networks provide a better understanding of the roles lncRNA-miRNA interactions in mRNAs regulation and LUAD development.13,14 Important regulatory pathways, as well as therapeutic targets, could be revealed based on lncRNA-miRNA-mRNA networks. For example, MEG3 (maternally expressed 3), MIAT (myocardial infarction associated transcript) and LINC00115 may serve as prognostic lncRNAs and may be involved in regulatory pathways in LUAD.14 According to the lncRNA-miRNA-mRNA network, MEG3 and MIAT regulate MAPK9 (mitogen-activated protein kinase 9) by interacting with miR-106, whereas LINC00115 regulate FGF2 (fibroblast growth factor 2) by interacting with miR-7.14

Two gene expression profile datasets GSE4345815 and GSE1007216 have been used to reveal genes related to LUAD. It has been shown that ETS2 (V-ets erythroblastosis virus E26 oncogene homolog 2) is downregulated in LUAD, using GSE43458 dataset.15 ETS2 may inhibit cancer cell invasion, migration and growth by suppressing MET activation.15 Cigarette smoking related signature genes in LUAD patients have been identified using GSE10072 dataset.16 It is remarkable that most of the signatures are involved in cell cycle, such as NEK2, TTK, and PRC1.16 Though advances have been made to identify LUAD related signatures, efficient diagnostic predictors and potential drug targets of LUAD are still in need.

In order to identify novel diagnostic predictors and molecular markers, we first constructed a LUAD specific lncRNA-miRNA-mRNA ceRNA network in our study, using data from TCGA (The Cancer Genome Atlas). A LUAD specific SVM (support vector machine) classifier was built and prognosis related nodes were identified based on the ceRNA network. GSE43458 and GSE10072 datasets were further used to validate the efficiency and robustness of the SVM classifier in predicting LUAD. The SVM classifier and the prognosis related nodes may contribute to the diagnosis and treatment of LUAD in clinical practice.

Materials and methods

Data source and data preprocessing

The mRNA and miRNA expression data of LUAD-related samples was downloaded from TCGA (https://gdc-portal.nci.nih.gov/). After checking the barcode information of samples, a total of 464 LUAD samples with both mRNA and miRNA data were obtained for subsequent analysis, including 445 LUAD and 19 normal samples. All the clinical information related to these samples was also obtained.

Two independent validation datasets GSE10072 (contributed by Landi et al)16 and GSE43458 (contributed by Kabbout et al)15 were downloaded from GEO (Gene Expression Omnibus) database (https://www.ncbi.nlm.nih.gov/geo/). In total, 107 lung samples (58 LUAD versus 49 normal samples, GPL96 [HG-U133A] platform) were included in the GES10072 dataset, and 110 lung samples (80 LUAD versus 30 normal samples, GPL6244 [HuGene-1_0-st] platform) were included in the GES43458 dataset. The package oligo17 under R was used for background adjustment of expression values and normalization preprocessing of expression profile data, including conversion of the original data format, imputation of missing values and data standardization.

Identification of LUAD related lncRNAs, miRNAs and mRNAs

According to annotation information from HGNC (HUGO Gene Nomenclature Committee, http://www.genenames.org/), the lncRNA data of LUAD-related samples downloaded from TCGA were obtained based on the gene ID. Expression differences of mRNAs and miRNA-seq data between LUAD and normal samples were analyzed using edgeR package18 under R3.0.1 and FDR (false discovery rate) was calculated using multtest package.19 LncRNAs, miRNAs and mRNAs with FDR <0.05 and FC (fold change) >1.5 or <0.67 (|logFC|>0.585) were considered to be significantly differentially expressed between LUAD and normal samples.

Identification of lncRNAs, miRNAs and mRNAs related to clinical features

LUAD samples downloaded from TCGA were binary classified according to clinical information. Classifications included age (≥60 versus <60), gender (female versus male), pathologic M (M1 versus M0), pathologic N (N3 + N2 versus N0 + N1), pathologic T (T3 + T4 versus T1 + T2), pathologic stage (I + II versus III + IV), cancer status (with versus without), smoking history (yes versus no) and vital status (living versus deceased). The mRNAs, miRNAs and lncRNAs related to clinical features were then screened from differentially expressed RNAs between LUAD and normal samples, using edgeR package and multtest package. lncRNAs, miRNAs and mRNAs with FDR <0.05 and |logFC|>0.585 were considered to be related to clinical features.

Construction of LUAD-related lncRNA-miRNA-mRNA ceRNA network

The miRNAs targeted by differentially expressed lncRNAs were predicted using miRcode (version 11, http://www.mircode.org/)20 and starBase (version 2.0)21 databases. Results from these two databases were combined and intersected with differentially expressed miRNAs. The intersection contained differentially expressed miRNAs targeted by differentially expressed lncRNAs. A LUAD-related lncRNA-miRNA regulation network was thus obtained.

Similarly, differentially expressed mRNAs targeted by differentially expressed miRNAs were obtained based on the information of miRTarBase (version 6.0, http://mirtarbase.mbc.nctu.edu.tw).22,23 Then the common PPIs (protein–protein interactions) existed in three databases, including BioGRID (http://thebiogrid.org/),24 HPRD (Human Protein Reference Database, http://www.hprd.org/)25 and DIP (Database of Interacting Proteins, http://dip.doe-mbi.ucla.edu/),26 were identified. PPIs corresponding to differentially expressed mRNAs targeted by differentially expressed miRNAs were extracted and then integrated with differentially expressed miRNA-mRNA regulatory relationships, generating a LUAD-related miRNA-mRNA regulation network.

The lncRNA-miRNA and miRNA-mRNA regulatory networks were combined to obtain a comprehensive lncRNA-miRNA-mRNA ceRNA regulatory network.

Functional and pathway annotation of mRNAs in the ceRNA network

In order to reveal LUAD-related biological functions and pathways, GO (gene ontology) biological process27 analysis and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analysis28 were performed for mRNAs in the ceRNA regulatory network. Fisher’s exact test was used during the enrichment process. Fisher’s score could be calculated according to Table 1 and the following equation:

p=1i=0x1(Mi)(NMKi)(NK),

where N indicates the total number of genes, M indicates the number of pathway genes, K indicates the number of differentially expressed genes, and the Fisher’s score p indicates the possibility that at least x in K differentially expressed genes were pathway genes.

Table 1.

The parameters for calculating Fisher’s score

DEGs Non-DEGs Total
Pathway genes n11 n12 M
Non-pathway genes n21 n22 NM
Total K NK N

Abbreviations: DEG, differentially expressed genes; N, total number of genes; M, number of pathway genes; K, number of differently expressed genes.

Construction of SVM classification model

The optimal subset of feature genes used for SVM classification model was selected from differentially expressed mRNAs in the LUAD specific ceRNA network, using recursive feature elimination (RFE),29 an algorithm of machine learning. Specifically, the optimal subset was selected through a leave-one-out cross-validation approach. Expression values of selected feature genes in the combination were used as eigenvalues to estimate the possibility that a sample belonged to certain classification. Based on the possibility, a sample was classified to be LUAD or a normal sample. The optimal subset was the combination giving the best SVM classification accuracy for TCGA samples. The LUAD specific SVM classifier was built based on the optimal subset of feature genes.

GSE10072 and GSE43458 datasets were used to validate the robustness and transferability of the SVM classifier. This SVM classifier was trained with a fivefold cross validation strategy and its performance was assessed by establishing receiver operation characteristic (ROC) curve, followed by detection of prediction accuracy, sensitivity, specificity, positive prediction value, negative prediction value and AUC (area under ROC curve).

Identification of prognosis related mRNAs, miRNAs and lncRNAs

The expression value of each differentially expressed mRNA, miRNA and lncRNA and the survival information of each sample were extracted from TCGA dataset. Prognosis-associated lncRNAs, miRNAs and mRNAs were identified by univariate Cox regression using the survfit function of the survival package (version 2.40-1, https://cran.r-project. org/package=survival)30 under R. Cancerous samples were divided into two groups based on the cutoff (median expression value), followed Kaplan–Meier curve analysis.

Results

Differentially expressed lncRNAs, miRNAs and mRNAs in LUAD samples

A total of 811 lncRNAs, 1,047 miRNAs and 18,013 mRNAs were obtained from mRNA-seq data. RNAs with low expression level (expression value less than 1.0) were removed, with 396 lncRNAs, 517 miRNAs and 14,012 mRNAs remained. Significant differentially expressed lncRNAs, miRNAs and mRNAs were obtained by comparing LUAD and normal samples. In total, 21, 53 and 925 differentially expressed lncRNAs, miRNAs and mRNAs were obtained in LUAD samples. Sample hierarchical cluster analysis was then performed based on the expression value of these differentially expressed RNAs. The results of heatmap (Figure 1A–C) showed that LUAD samples were clustered together and discriminated from normal samples.

Figure 1.

Figure 1

Hierarchical clustering analysis of TCGA samples using differentially expressed lncRNA (A), miRNA (B) and mRNA (C).

Abbreviation: TCGA, The Cancer Genome Atlas.

Key lncRNAs, miRNAs and mRNAs related to clinical features

In order to screen lncRNAs, miRNAs and mRNAs related to clinical features, LUAD samples were binary classified according to age (≥60 versus <60), gender (female versus male), pathologic M (M1 versus M0), pathologic N (N3 + N2 versus N0 + N1), pathologic T (T3 + T4 versus T1 + T2), pathologic stage (I + II versus III + IV), cancer status (with versus without), smoking history (yes versus no) and vital status (living versus deceased). The differentially expressed lncRNAs, miRNAs and mRNAs were further compared and identified between each two groups according to different clinical features, which were summarized in Table 2.

Table 2.

Clinical features related differentially expressed lncRNAs, miRNAs and mRNAs

Comparisons Upregulated
Downregulated
lncRNA miRNA mRNA lncRNA miRNA mRNA
Age (≥60 versus <60) hsa-mir-133a-1 ADH1B, CLEC9A, TMEM132D, C6, CHST9, DNAH9, MT1A, RGS6, C2orf40, PZP, RSPO2, SOSTDC1, CPB2, TMEM132C, ADH1A FLJ41941, DGCR9, TERC DLL3, HAVCR1, FEZF1, GAL, KCNK12, CDK5R2, ZNF695, CRYGN, VGF, B4GALNT4, MUC13, INHA, NKAIN4, PTPRN

Gender (M/F) DKFZp779M0652, KIAA0087 hsa-mir-1247, hsa-mir-133a-1 TEPP, SLC5A8, PCSK2, LHX9, MEGF11 C15orf54, C2orf48, CECR7, DGCR9, EGOT hsa-mir-503 TFF1, VIL1, RAB3B, FGL1, ANKRD34B, DUSP9, AKR1B10, FGB, AKR7A3, INHA, GLB1L3, ABCC2, SERPINB5, PCSK1, SLC6A3, IVL, HGD

New tumor (yes/no) FAM138F, KIAA0087, SFTA1P TMEM213, TMEM132D, C6, KIAA0408, CST5 EGOT, DGCR9 hsa-mir-196a-1, hsa-mir-1269 HOXC13, PRAME, HOXC10, GAL, IGF2BP1, HHIPL2, KRT6C, HIST1H2BG, INHA, GLB1L3, KRT6B, KRT6A, CLDN6, PPP2R2C, SERPINB5

Pathologic M (M1/M0) SFTA1P IRX6, CAPSL, KIAA0408, C8B C1orf220, C2orf48, KIAA0087, C15orf54 hsa-mir-323b, hsa-mir-31, hsa-mir-1269, hsa-mir-539 DUSP13, HAVCR1, BARX1, FEZF1, C1orf61, GAL, CASKIN1, RAB3B, NKAIN1, CRYGN, AKR1B10, VGF, C1orf220, INHA, GLB1L3, PTPRN, INA, BCAN, ABCC2, C1QL1, GPX2

Pathologic N (N2 + N3/N0 + N1) DKFZp779M0652, KIAA0087 hsa-mir-184 PENK, C20orf85, MT1A, TEKT1, S100A12, C6orf118, DNAI2, HAS1, PPBP, ANKRD1 C20orf197, CECR7 hsa-mir-1269, hsa-mir-31, hsa-mir-577 HOXB9, GAL, KLK6, IGF2BP1, FCRL4, SLC7A10, FGB, PAEP, CCDC154, PCP4, FOXE1, TNFRSF13C, STRC, SPINK2

Pathologic T (T3 + T4/T1 + T2) C22orf34, DIO3OS, KIAA0087 hsa-mir-133a-1 FBN3, KLK7 C15orf54, C10orf91, C20orf197, CECR7 hsa-mir-1269, hsa-mir-31, hsa-mir-323b, hsa-mir-450b C11orf86, TFF1, HAVCR1, DMRTA2, KLK6, CRABP1, FCRL4, ARL14, MUC13, PRSS3, TRIM31, CD19, CR2, IL20RB

Pathologic stage (III + IV/I + II) DIO3OS, DKFZp779M0652, KIAA0087 hsa-mir-1247 DNAH12, RSPH10B2, VWA3A, IL5RA, C20orf85, MS4A15, MT1A, DNAI2 C20orf197, CECR7, EGOT, HAR1B hsa-mir-1269, hsa-mir-577, hsa-mir-9-2 BARX1, GAL, CDK5R2, CRYGN, KLK6, IGF2BP1, FCRL4, FGB, CCDC154, PCP4, IL17REL, TNFRSF13C, MIA, RUFY4, CD19, PNOC

Cancer status (with/without) KIAA0087, SFTA1P TMEM132D, SCN2B, C6, SCGB1A1, CLDN18, KIAA0408, CST5, GRIA1, C8B, TMEM132C, F11 EGOT, HAR1B, C2orf48 hsa-mir-539, hsa-mir-1269 PRAME, GAL, IGF2BP1, KRT6C, CNGA3, HMGA2, KRT6B, UGT2B15, KRT6A, ABCC2, SERPINB5

Smoking history (yes/no) DIO3OS, DKFZp779M0652, KIAA0087 hsa-mir-184 TEPP, REN, PCSK2, PLA2G3, CST5, RETN, C8B C15orf54, C20orf197, C2orf48, EGOT, FLJ12825 hsa-mir-1269, hsa-mir-31 DLL3, PADI1, BARX1, C6orf222, YBX2, CA9, FGB, CCDC129, GLB1L3, UPK3A, UGT2B15, MSMB

The miRNA-lncRNA and miRNA-mRNA regulatory relationships

Elucidation of the physiological roles of lncRNAs is challenging as complex and diverse mechanisms are involved.11 We used bioinformatics methods to predict the roles of lncRNAs in regulating miRNAs in LUAD. The regulatory relationships between significant differentially expressed miRNAs and differentially expressed lncRNAs were predicted using miRecode20 and starBase21 database. We first acquired 264 lncRNA-miRNA regulation pairs from miRecode and 217 regulation pairs from starBase, of which lncRNAs were differentially expressed between LUAD and normal samples. Combining these two sets, a total of 291 lncRNA-miRNA pairs were obtained, 41 of which were LUAD related differentially expressed miRNAs. The 41 lncRNA-miRNA pairs were integrated to build a miRNA-lncRNA regulatory network consisting of 31 nodes, including 6 lncRNAs (3 upregulated versus 3 downregulated) and 25 miRNAs (6 upregulated versus 19 downregulated) (Figure 2A).

Figure 2.

Figure 2

LUAD specific lncRNA-miRNA-mRNA ceRNA network. LUAD specific lncRNA-miRNA regulatory network (A), miRNA-mRNA regulatory network (B) and ceRNA network (C). The ceRNA network is acquired by integrating lncRNA-miRNA and miRNA-mRNA regulatory network. Squares, triangles and circles indicate lncRNAs, miRNAs and mRNAs, respectively. Upregulated lncRNAs, miRNAs and mRNAs in LUAD are shown as red and downregulated ones shown as green. Red lines and blue lines indicate lncRNA-miRNA and miRNA-mRNA regulatory relationships, whereas gray lines indicate protein–protein interactions of corresponding mRNAs.

Abbreviation: LUAD, lung adenocarcinoma.

The regulatory relationships between significant differentially expressed miRNAs and significant differentially expressed mRNAs were obtained using miRTarBase database, a database providing the latest and broadest experimental validated miRNA-mRNA interactions.22,23 Most miRNAs in Figure 2A were predicted to have targeted differentially expressed mRNAs, except hsa-miR-139 and hsa-miR-590. A total of 126 differentially expressed mRNAs were found to be targets of these miRNAs. Based on the information of BioGRID, HPRD and DIP databases, PPIs corresponding to these target mRNAs were predicted. A miRNA-mRNA network was constructed by integrating miRNA-mRNA regulatory relationships and PPIs of target mRNAs. As shown in Figure 2B, the miRNA-mRNA regulatory network contained 25 miRNAs (including hsa-miR-139 and hsa-miR-590) and 126 mRNAs, which formed a total of 549 edges, 115 of which were mRNA-mRNA interactions and 434 were miRNA-mRNA regulation relationships.

Construction of lncRNA-miRNA-mRNA ceRNA network

To provide an insight about how lncRNAs and miRNAs cooperate to regulate mRNAs in LUAD, a ceRNA network (Figure 2C) was constructed, through the integration of lncRNA-miRNA network and miRNA-mRNA network. All nodes in the ceRNA network were LUAD related differentially expressed lncRNAs, miRNAs or mRNAs. A total of 157 nodes were included in the ceRNA network, including 6 lncRNAs, 25 miRNAs (including hsa-miR-139 and hsa-miR-590) and 126 mRNAs. In total, 588 edges were formed, including 39 lncRNA-miRNA regulation relationships, 434 miRNA-mRNA regulation relationships and 115 PPIs of corresponding mRNAs.

In order to reveal the functional processes involved in LUAD development and progression, mRNAs in the ceRNA network (Figure 2C) were subjected to Fisher’s exact test-based GO biological process analysis. We acquired 18 significantly related GO biological processes, most of which were associated with cell cycle (Table 3). We also performed KEGG pathway analysis for mRNAs in the ceRNA network, and 5 significant KEGG pathways were identified, including ErbB signaling pathway, cell cycle, homologous recombination, neuroactive ligand-receptor interaction and pathways in cancer (Table 3).

Table 3.

Functional annotation of mRNAs in the ceRNA network

Term Count P-value
GO-BPs
GO:0000278~mitotic cell cycle 31 5.16E–19
GO:0022403ĉell cycle phase 32 9.85E–19
GO:0007067~mitosis 25 1.25E–17
GO:0000280~nuclear division 25 1.25E–17
GO:0000087~M-phase of mitotic cell cycle 25 1.92E–17
GO:0048285~organelle fission 25 3.24E–17
GO:0000279~M-phase 28 6.32E–17
GO:0022402~cell cycle process 33 8.46E–16
GO:0007049~cell cycle 37 2.12E–15
GO:0051301~cell division 22 2.78E–11
GO:0007059~chromosome segregation 11 1.43E–06
GO:0007017~microtubule-based process 13 0.001431
GO:0007346~regulation of mitotic cell cycle 10 0.005949
GO:0010564~regulation of cell cycle process 9 0.005999
GO:0000070~mitotic sister chromatid 6 0.01543
segregation
GO:0000819~sister chromatid segregation 6 0.017727
GO:0006259~DNA metabolic process 16 0.02048
GO:0006260~DNA replication 10 0.036122
KEGG pathways
hsa04012:ErbB signaling pathway 6 0.001138
hsa04110:Cell cycle 6 0.005563
hsa03440:Homologous recombination 3 0.027134
hsa04080:Neuroactive ligand-receptor 7 0.029121
interaction
hsa05200:Pathways in cancer 7 0.079398

Abbreviations: GO-BPs, gene ontology-biological processes; KEGG, Kyoto Encyclopedia of Genes and Genomes.

SVM classification model of cancerous samples

In order to provide an efficient and reliable molecular tool for LUAD diagnosis, we build a LUAD specific SVM classifier based on the feature genes associated with LUAD. Optimal subset of feature genes was selected from differentially expressed mRNAs in the ceRNA network (Figure 2C) using RFE.29 The accuracy reached the best (95.3%) when the number of selected feature genes in the optimal subset was 44 (Figure 3A). The 44 selected feature genes were summarized in Table 4 and used for the construction of LUAD specific SVM classifier. Scatter plot of TCGA samples based on the SVM classifier was shown as Figure 3B.

Figure 3.

Figure 3

Construction and validation of the LUAD specific SVM classifier. (A) Feature gene selection based on recursive feature elimination. The prediction accuracy versus the number of selected feature genes is plotted as blue line. The red dashed line labels the best prediction accuracy (95.3%, 442 out of 464 TCGA samples), with the corresponding number of selected feature genes being 44. (B) Scatter plot of TCGA samples based on the LUAD specific SVM classifier. (C) ROC curves of TCGA (black), GSE10072 (blue) and GSE43458 (orange) datasets generated using the LUAD specific SVM classifier. AUCs are calculated to be 0.996, 0.963 and 0.985 for each data.

Abbreviations: LUAD, lung adenocarcinoma; SVM, support vector machine; TCGA, The Cancer Genome Atlas; ROC, receiver operating characteristic; AUC, area under ROC curve.

Table 4.

Selected feature genes from the ceRNA network

Gene LogFC P-value FDR Gene LogFC P-value FDR
TERT −3.10475 2.24E–15 3.57E–13 ALS2CR11 −0.65072 0.000921 0.009757
HOXB9 −2.72135 3.37E–19 1.07E–16 SLC6A3 −0.61702 0.001212 0.012241
FGB −1.67587 6.18E–16 1.11E–13 PRMT8 0.604752 0.000166 0.00223
KRT6C −1.64033 3.87E–09 1.72E–07 SH2D1B 0.606032 0.000124 0.001713
MMP3 −1.62809 3.02E–10 1.83E–08 CAMK2A 0.651515 6.89E–05 0.001019
HIST1H2BG −1.4877 7.22E–09 2.97E–07 NRG1 0.660387 1.92E–06 4.34E–05
TRIM54 −1.45014 1.73E–08 6.45E–07 NRG2 0.66095 0.000318 0.003954
HMGA2 −1.35673 4.59E–11 3.42E–09 KCNA5 0.679193 1.20E–05 0.000223
GFAP −1.33536 5.52E–07 1.44E–05 SCN2B 0.702108 3.72E–07 1.01E–05
KRT6B −1.14566 1.92E–07 5.59E–06 SOX5 0.706481 1.46E–06 3.38E–05
RGS20 −1.14019 5.99E–07 1.55E–05 CCDC33 0.720253 1.62E–05 0.000289
CKM −1.13563 2.26E–06 4.99E–05 RGS9BP 0.726607 2.48E–05 0.000421
INA −1.0867 5.42E–06 0.000109 RGS9 0.729584 2.47E–07 7.05E–06
GRIK2 −0.95974 1.39E–05 0.000252 KCNE1 0.75071 3.69E–08 1.29E–06
MMP10 −0.9316 1.33E–06 3.11E–05 CXCR1 0.839385 1.81E–08 6.69E–07
BCAN −0.92189 6.43E–05 0.00096 KHDRBS2 0.946016 4.26E–10 2.42E–08
CNGB1 −0.76123 0.000196 0.002587 RGS6 0.950831 1.54E–08 5.79E–07
TWIST1 −0.7507 9.53E–05 0.001363 ASPA 1.08277 3.86E–11 2.92E–09
TMEM171 −0.72205 0.000455 0.005339 RXRG 1.158801 1.99E–15 3.21E–13
GNB3 −0.71549 0.000379 0.004581 ACTN2 1.189844 1.04E–14 1.43E–12
TSHR −0.71264 0.006238 0.048666 GRIA1 1.275478 8.00E–22 4.87E–19
CAMK2B −0.68691 0.001514 0.014765 SLC6A4 1.801564 9.18E–46 1.29E–41

Abbreviations: FC, fold change; FDR, false discovery rate.

To validate the robustness and transferability of the SVM classifier, two independent datasets under accession number of GSE1007216 and GSE4345815 were downloaded from GEO. After normalization, samples in the validation datasets were classified using the SVM classifier. As a result, samples in the GSE10072 dataset could be correctly classified with an accuracy of 90.7% (97 out of 107 samples), and samples in the GSE43458 dataset could be classified with a precision of 97.3% (107 out of 110 samples) (Table 5). Besides prediction accuracy, the performance of our SVM classification model were also assessed using sensitivity, specificity, positive prediction value, negative prediction value and AUC (area under ROC curve) (Figure 3C, Table 5).

Table 5.

Performance of support vector machine classifier in training and validation datasets

Datasets No of samples Correct rate Se Sp PPV NPV AUC
TCGA 464 95.3% 0.889 0.957 0.886 0.995 0.996
GSE10072 107 90.7% 0.918 0.897 0.882 0.929 0.963
GSE43458 110 97.3% 0.967 0.975 0.935 0.987 0.985

Abbreviations: TCGA, The Cancer Genome Atlas; Se, sensitivity; Sp, specificity; PPV, positive prediction value; NPV, negative prediction value; AUC, area under ROC curve.

The lncRNAs, miRNAs and mRNAs related to prognosis

Prognosis-related RNAs for LUAD were identified from differentially expressed lncRNAs, miRNAs and mRNAs using univariate cox analysis. In total, 5 lncRNAs, 6 miRNAs and 44 mRNAs were identified to be prognosis related (Table 6). Among them, PGM5P2 (phosphoglucomutase 5 pseudogene 2) and SFTA1P (surfactant associated 1) were lncRNAs and hsa-miR-96 and hsa-miR-204 were miRNAs in the ceRNA network. RGS20 (regulator of G protein signaling 20), RGS9BP (RGS9-binding protein), FGB (fibrinogen beta chain) and INA (alpha-internexin) were mRNAs in the feature subset of the SVM classifier. According to the ceRNA network (Figure 2C), two miRNA-mRNA pairs and an lncRNA-miRNA-mRNA triplet were formed among these prognosis related RNAs, specifically hsa-miR-96-INA, hsa-miR-96-RGS20 and SFTA1P-hsa-miR-204-RGS20.

Table 6.

Prognosis related lncRNAs, miRNAs and mRNAs

RNA Upregulated Downregulated
lncRNA KIAA0087, PGM5P2, SFTA1P C15orf54, C20orf197
miRNA hsa-miR-184, hsa-miR-204 hsa-miR-651, hsa-miR-188, hsa-miR-96, hsa-miR-708
mRNA CCDC81, LDLRAD1, ACSM5, WDR63, CAMK2A, DTHD1, PNMT, C1orf194, RGS9BP, SLC8A3, TTLL10, PCSK2, ENPP6, TSLP, KCNMB2, LHX9, PRIMA1, C11orf88, S100A12, FAM189A1, GPC5, IHH KISS1R, DMBX1, INA, CST4, RGS20, DUSP13, ECEL1, IGF2BP1, HPCA, KRT6C, NPW, FGB, NOX1, SOHLH2, INHA, NKAIN4, PFN4, CLDN6, KIF4B, NAT8L, FLRT1, GDA

We further performed Kaplan–Meier curve analyses for these prognosis-related RNAs (Figure 4). Our results showed that LUAD patients with higher expression level of PGM5P2, SFTA1P, RGS9BP and INA had a better prognosis, and patients with higher expression level of hsa-miR-96, hsa-miR-204, RGS20 and FGB had a worse prognosis (Figure 4). Meanwhile, the expression level of PGM5P2, SFTA1P, hsa-miR-204 and RGS9BP were downregulated in LUAD samples whereas hsa-miR-96, RGS20, FGB and INA were upregulated.

Figure 4.

Figure 4

Kaplan–Meier analysis of prognosis related lncRNAs, miRNAs and mRNAs. (A, B) Kaplan–Meier curves of two lncRNAs PGM5P2 and SFTA1P. (C, D) Kaplan–Meier curves of two miRNAs hsa-miR-96 and hsa-miR-204. (EH) Kaplan–Meier curves of four mRNAs RGS20, RGS9BP, FGB and INA. Red and blue lines indicate patient groups with expression level above and below median value, respectively. P-value indicates the significance of difference.

Abbreviations: PGM5P2, phosphoglucomutase 5 pseudogene 2; SFTA1P, surfactant associated 1; RGS20, regulator of G protein signaling 20; RGS9BP, RGS9-binding protein; FGB, fibrinogen beta chain; INA, alpha-internexin.

Discussion

In the present study, we constructed a ceRNA network delineating interplays among differentially expressed lncRNAs, miRNAs and mRNAs between LUAD and normal samples. An optimal subset of 44 selected feature genes was identified in the network and the SVM classifier SVM constructed with these 44 feature genes could accurately classify samples in both TCGA training data and GSE10072 and GSE43458 validation data. Remarkably, we also identified key prognosis-related RNAs in the ceRNA network, including 2 miRNAs (hsa-miR-96, hsa-miR-204), 2 lncRNAs (PGM5P2, SFTA1P) and 4 selected feature mRNAs (RGS20, RGS9BP, FGB, INA). Among the 8 prognostic RNAs, higher expression level of PGM5P2, SFTA1P, RGS9BP and INA were shown to correlate with better prognosis, indicating tumor-suppressive roles of these RNAs. Meanwhile, higher expression levels of hsa-miR-96, hsa-miR-204, RGS20 and FGB were found to correlate with worse prognosis, indicating tumor-promoting roles of these RNAs.

Most of these RNAs have been previously shown to be involved in certain types of cancers. INA is a neuronal intermediate filament protein,31 correlated with better prognosis of glioblastoma.32,33 RGS20 is a negative regulator of heterotrimeric G proteins and may promote cancer cell metastasis by upregulating vimentin and downregulating E-cadherin.34,35 FGB is one component of fibrinogen, which is a critical for tumor cell proliferation, angiogenesis and cancer metastasis.36,37 Elevated plasma level of fibrinogen is a strong indicator of poor prognosis of various tumors, such as breast tumor,38 prostate cancer,39 and lung cancer.40 SFTA1P is a lncRNA tumor suppressor functioning through inhibiting LUAD cell migration, invasion and metastasis.4143 RGS9BP is an anchor protein of RGS9, was also identified as being involved in bladder cancer,44 though the role it played remained elusive. The function of PGM5P2 is also unclear, however, it is implicated that PGM5P2 may be involved in pro-apoptosis and antiangiogenesis process,45 which is essential for the development and progression of cancer. Considering their roles in different cancer types, it is reasonable that these genes may play a role in the development and progression of LUAD. However, further studies are still needed to gain an insight into the roles of these molecules in LUAD.

The remaining two RNAs, however, was found to play controversial roles in different cancer types. hsa-miR-96 is involved in various cancers, however, divergent roles are reported with respect to different cancer types.46,47 It is shown that hsa-miR-96 can suppress tumor invasion in renal cell carcinoma47 and colorectal cancer,48 but it can promote cancer cell proliferation and invasion in breast cancer,49,50 bladder cancer46 and lung cancer.51 hsa-miR-204 has been reported to be a tumor suppressor in clear cell renal cell carcinoma, induced by VHL and functioning through inhibiting macroautophage by targeting LC3B.52 Besides, its variant hsa-miR-204-5p is also involved in endometrial carcinoma, and is shown to suppress the clonogenic growth, migration and invasion of endometrial carcinoma cells.53 However, we found that it played a tumor-promoting role in LUAD. Therefore, we speculate that hsa-miR-96 and hsa-miR-204 may also play divergent roles in different cancer types, which should be addressed in future experimental research.

Further, two miRNA-mRNA regulation pairs and an lncRNA-miRNA-mRNA regulation triplet were formed among these prognosis related RNAs according to the ceRNA network. Specifically, hsa-miR-96 formed two miRNA-mRNA regulation pairs with INA and RGS20, whereas hsa-miR-204 formed an lncRNA-miRNA-mRNA regulation triplet with SFTA1P and RGS20. We speculate that hsa-miR-96 may target INA and RGS20 in LUAD, whereas hsa-miR-204 may target RGS20 and regulated by SFTA1P. However, further experimental and functional studies are needed to disclose and confirm the pathways these RNAs involved.

However, the limitation of SVM classification model on evaluating the selected feature genes is lack of experiment validation. Further experiments, such as quantitative reverse-transcription PCR and/or western blot methods are still required to confirm our results. Moreover, the Kaplan–Meier curve analysis for these 8 prognosis-related RNAs was performed individually. If the prognostic value of these RNAs is validated by various combination analyses, more valuable results will be obtained for predicting the prognosis of LUAD.

In summary, we constructed a LUAD-specific SVM classification model based on the LUAD-related ceRNA network. The SVM classifier may serve as a novel diagnostic predictor of LUAD. Moreover, we also identified 8 key molecular markers with prognostic value from the ceRNA network, including PGM5P2, SFTA1P, hsa-miR-204, hsa-miR-96, RGS20, RGS9BP, FGB and INA. These molecular markers may be promising prognostic markers and drug targets in future clinical practice.

Acknowledgments

This work was supported by Chinese Medicine Science and Technology Development Project Fund of Shandong Province (project no 2017-200), Postdoctoral Applications Research Project Fund of Qingdao (project no 2016055) and The Affiliated Hospital of Qingdao University Youth Research Fund (2016).

Footnotes

Disclosure

All authors declared that they have no conflicts of interest in this work.

References

  • 1.Imielinski M, Berger AH, Hammerman PS, et al. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell. 2012;150(6):1107–1120. doi: 10.1016/j.cell.2012.08.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Stewart BW, Wild C. World cancer report 2014. Lyon France: International Agency for Research on Cancer, World Health Organization; [PubMed] [Google Scholar]
  • 3.Saito M, Suzuki H, Kono K, Takenoshita S, Kohno T. Treatment of lung adenocarcinoma by molecular-targeted therapy and immunotherapy. Surg Today. 2018;48(1):1–8. doi: 10.1007/s00595-017-1497-7. [DOI] [PubMed] [Google Scholar]
  • 4.Camidge DR, Pao W, Sequist LV. Acquired resistance to TKIs in solid tumours: Learning from lung cancer. Nat Rev Clin Oncol. 2014;11(8):473–481. doi: 10.1038/nrclinonc.2014.104. [DOI] [PubMed] [Google Scholar]
  • 5.Takahashi T, Nau MM, Chiba I, et al. p53: A frequent target for genetic abnormalities in lung cancer. Science. 1989;246(4929):491–494. doi: 10.1126/science.2554494. [DOI] [PubMed] [Google Scholar]
  • 6.Singh A, Misra V, Thimmulappa RK, et al. Dysfunctional KEAP1-NRF2 interaction in non-small-cell lung cancer. PLoS Medicine. 2006;3(10):e420. doi: 10.1371/journal.pmed.0030420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sanchez-Cespedes M, Parrella P, Esteller M, et al. Inactivation of LKB1/STK11 is a common event in adenocarcinomas of the lung. Cancer Res. 2002;62(13):3659–3662. [PubMed] [Google Scholar]
  • 8.Morris KV, Mattick JS. The rise of regulatory RNA. Nat Rev Genet. 2014;15(6):423–437. doi: 10.1038/nrg3722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Liz J, Esteller M. lncRNAs and microRNAs with a role in cancer development. Biochim Biophys Acta. 2016;1859(1):169–176. doi: 10.1016/j.bbagrm.2015.06.015. [DOI] [PubMed] [Google Scholar]
  • 10.Garzon R, Calin GA, Croce CM. MicroRNAs in Cancer. Annu Rev Med. 2009;60:167–179. doi: 10.1146/annurev.med.59.053006.104707. [DOI] [PubMed] [Google Scholar]
  • 11.Huarte M. The emerging role of lncRNAs in cancer. Nat Med. 2015;21(11):1253–1261. doi: 10.1038/nm.3981. [DOI] [PubMed] [Google Scholar]
  • 12.Tay Y, Rinn J, Pandolfi PP. The multilayered complexity of ceRNA crosstalk and competition. Nature. 2014;505(7483):344–352. doi: 10.1038/nature12986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sui J, Li YH, Zhang YQ, et al. Integrated analysis of long non-coding RNA-associated ceRNA network reveals potential lncRNA biomarkers in human lung adenocarcinoma. Int J Oncol. 2016;49(5):2023–2036. doi: 10.3892/ijo.2016.3716. [DOI] [PubMed] [Google Scholar]
  • 14.Li DS, Ainiwaer JL, Sheyhiding I, Zhang Z, Zhang LW. Identification of key long non-coding RNAs as competing endogenous RNAs for miRNA-mRNA in lung adenocarcinoma. Eur Review Med Pharmacol Sci. 2016;20(11):2285–2295. [PubMed] [Google Scholar]
  • 15.Kabbout M, Garcia MM, Fujimoto J, et al. ETS2 mediated tumor suppressive function and MET oncogene inhibition in human non-small cell lung cancer. Clin Cancer Res. 2013;19(13):3383–3395. doi: 10.1158/1078-0432.CCR-13-0341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Landi MT, Dracheva T, Rotunno M, et al. Gene expression signature of cigarette smoking and its role in lung adenocarcinoma development and survival. PLoS One. 2008;3(2):e1651. doi: 10.1371/journal.pone.0001651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Carvalho B, Bengtsson H, Speed TP, Irizarry RA. Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. Biostatistics. 2007;8(2):485–499. doi: 10.1093/biostatistics/kxl042. [DOI] [PubMed] [Google Scholar]
  • 18.Robinson MD, McCarthy DJ, Smyth GK. edgeR: A bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ge Y, Dudoit S, Speed TP. Resampling-based multiple testing for microarray data analysis. Test. 2003;12(1):1–77. [Google Scholar]
  • 20.Jeggari A, Marks DS, Larsson E. miRcode: A map of putative microRNA target sites in the long non-coding transcriptome. Bioinformatics. 2012;28(15):2062–2063. doi: 10.1093/bioinformatics/bts344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Li J-H, Liu S, Zhou H, Qu L-H, Yang J-H. starBase v2. 0: decoding miRNA-ceRNA, miRNA-ncRNA and protein–RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42(D1):D92–D97. doi: 10.1093/nar/gkt1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Hsu S-D, Lin F-M, Wu W-Y, et al. miRTarBase: A database curates experimentally validated microRNA–target interactions. Nucleic Acids Res. 2011;39(Suppl 1):D163–D169. doi: 10.1093/nar/gkq1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Chou C-H, Chang N-W, Shrestha S, et al. miRTarBase 2016: Updates to the experimentally validated miRNA-target interactions database. Nucleic Acids Res. 2016;44(D1):D239–D247. doi: 10.1093/nar/gkv1258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Chatr-Aryamontri A, Breitkreutz BJ, Oughtred R, et al. The BioGRID interaction database: 2015 update. Nucleic Acids Res. 2015;43(Database issue):D470–D478. doi: 10.1093/nar/gku1204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Keshava Prasad TS, Goel R, Kandasamy K, et al. Human protein reference database – 2009 update. Nucleic Acids Res. 2009;37(Database issue):D767–D772. doi: 10.1093/nar/gkn892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D. The Database of interacting proteins: 2004 update. Nucleic Acids Res. 2004;32(Database issue):D449–D451. doi: 10.1093/nar/gkh086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ashburner M, Ball CA, Blake JA, et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Baur B, Bozdag S. A feature selection algorithm to compute gene centric methylation from probe level methylation data. PLoS One. 2016;11(2):e0148977. doi: 10.1371/journal.pone.0148977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Singh R, Mukhopadhyay K. Survival analysis in clinical trials: Basics and must know areas. Perspect Clin Res. 2011;2(4):145–148. doi: 10.4103/2229-3485.86872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Lariviere RC, Julien JP. Functions of intermediate filaments in neuronal development and disease. J Neurobiol. 2004;58(1):131–148. doi: 10.1002/neu.10270. [DOI] [PubMed] [Google Scholar]
  • 32.Suh JH, Park CK, Park SH. Alpha internexin expression related with molecular characteristics in adult glioblastoma and oligodendroglioma. J Korean Med Sci. 2013;28(4):593–601. doi: 10.3346/jkms.2013.28.4.593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Mokhtari K, Ducray F, Kros JM, et al. Alpha-internexin expression predicts outcome in anaplastic oligodendroglial tumors and may positively impact the efficacy of chemotherapy: European Organization for Research and Treatment of Cancer trial 26951. Cancer. 2011;117(13):3014–3026. doi: 10.1002/cncr.25827. [DOI] [PubMed] [Google Scholar]
  • 34.Li Q, Jin W, Cai Y, et al. Regulator of G protein signaling 20 correlates with clinicopathological features and prognosis in triple-negative breast cancer. Biochemi Biophys Res Commun. 2017;485(3):693–697. doi: 10.1016/j.bbrc.2017.02.106. [DOI] [PubMed] [Google Scholar]
  • 35.Yang L, Lee MM, Leung MM, Wong YH. Regulator of G protein signaling 20 enhances cancer cell aggregation, migration, invasion and adhesion. Cell Signal. 2016;28(11):1663–1672. doi: 10.1016/j.cellsig.2016.07.017. [DOI] [PubMed] [Google Scholar]
  • 36.Perisanidis C, Psyrri A, Cohen EE, et al. Prognostic role of pretreatment plasma fibrinogen in patients with solid tumors: A systematic review and meta-analysis. Cancer Treat Rev. 2015;41(10):960–970. doi: 10.1016/j.ctrv.2015.10.002. [DOI] [PubMed] [Google Scholar]
  • 37.Palumbo JS, Degen JL. Mechanisms coupling the hemostatic system to colitis-associated cancer. Thromb Res. 2010;125(Suppl. 2):S39–S43. doi: 10.1016/S0049-3848(10)70011-6. [DOI] [PubMed] [Google Scholar]
  • 38.Krenn-Pilko S, Langsenlehner U, Stojakovic T, Pichler M, Gerger A, Kapp KS, Langsenlehner T. An elevated preoperative plasma fibrinogen level is associated with poor disease-specific and overall survival in breast cancer patients. Breast. 2015;24(5):667–672. doi: 10.1016/j.breast.2015.08.003. [DOI] [PubMed] [Google Scholar]
  • 39.Thurner EM, Krenn-Pilko S, Langsenlehner U, et al. The association of an elevated plasma fibrinogen level with cancer-specific and overall survival in prostate cancer patients. World J Urol. 2015;33(10):1467–1473. doi: 10.1007/s00345-014-1459-2. [DOI] [PubMed] [Google Scholar]
  • 40.Palumbo JS, Kombrinck KW, Drew AF, Grimes TS, Kiser JH, Degen JL, Bugge TH. Fibrinogen is an important determinant of the metastatic potential of circulating tumor cells. Blood. 2000;96(10):3302–3309. [PubMed] [Google Scholar]
  • 41.Zhao W, Luo J, Jiao S. Comprehensive characterization of cancer subtype associated long non-coding RNAs and their clinical implications. Sci Rep. 2014;4:6591. doi: 10.1038/srep06591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Zhang H, Xiong Y, Xia R, Wei C, Shi X, Nie F. The pseudogene-derived long noncoding RNA SFTA1P is down-regulated and suppresses cell migration and invasion in lung adenocarcinoma. Tumour Biol. 2017;39(2):1010428317691418. doi: 10.1177/1010428317691418. [DOI] [PubMed] [Google Scholar]
  • 43.Huang GQ, Ke ZP, Hu HB, Gu B. Co-expression network analysis of long noncoding RNAs (IncRNAs) and cancer genes reveals SFTA1P and CASC2 abnormalities in lung squamous cell carcinoma. Cancer Biol Ther. 2017;18(2):115–122. doi: 10.1080/15384047.2017.1281494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Yang Z, Li C, Fan Z, et al. Single-cell sequencing reveals variants in ARID1A, GPRC5A and MLL2 driving self-renewal of human bladder cancer stem cells. Eur Urol. 2017;71(1):8–12. doi: 10.1016/j.eururo.2016.06.025. [DOI] [PubMed] [Google Scholar]
  • 45.Miao Y, Cui L, Chen Z, Zhang L. Gene expression profiling of DMU-212-induced apoptosis and anti-angiogenesis in vascular endothelial cells. Pharm Biol. 2016;54(4):660–666. doi: 10.3109/13880209.2015.1071414. [DOI] [PubMed] [Google Scholar]
  • 46.Wu Z, Liu K, Wang Y, Xu Z, Meng J, Gu S. Upregulation of microRNA-96 and its oncogenic functions by targeting CDKN1A in bladder cancer. Cancer Cell Int. 2015;15:107. doi: 10.1186/s12935-015-0235-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Yu N, Fu S, Liu Y, et al. miR-96 suppresses renal cell carcinoma invasion via downregulation of Ezrin expression. J Exp Clin Cancer Res. 2015;34:107. doi: 10.1186/s13046-015-0224-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Ress AL, Stiegelbauer V, Winter E, et al. MiR-96-5p influences cellular growth and is associated with poor survival in colorectal cancer patients. Mol Carcinog. 2015;54(11):1442–1450. doi: 10.1002/mc.22218. [DOI] [PubMed] [Google Scholar]
  • 49.Lin H, Dai T, Xiong H, et al. Unregulated miR-96 induces cell proliferation in human breast cancer by downregulating transcriptional factor FOXO3a. PLoS One. 2010;5(12):e15797. doi: 10.1371/journal.pone.0015797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Guttilla IK, White BA. Coordinate regulation of FOXO1 by miR-27a, miR-96, and miR-182 in breast cancer cells. J Biol Chem. 2009;284(35):23204–23216. doi: 10.1074/jbc.M109.031427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Guo H, Li Q, Li W, Zheng T, Zhao S, Liu Z. MiR-96 downregulates RECK to promote growth and motility of non-small cell lung cancer cells. Mol Cell Biochem. 2014;390(1–2):155–160. doi: 10.1007/s11010-014-1966-x. [DOI] [PubMed] [Google Scholar]
  • 52.Mikhaylova O, Stratton Y, Hall D, et al. VHL-regulated MiR-204 suppresses tumor growth through inhibition of LC3B-mediated autophagy in renal clear cell carcinoma. Cancer Cell. 2012;21(4):532–546. doi: 10.1016/j.ccr.2012.02.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Bao W, Wang HH, Tian FJ, et al. A TrkB-STAT3-miR-204-5p regulatory circuitry controls proliferation and invasion of endometrial carcinoma cells. Mol Cancer. 2013;12:155. doi: 10.1186/1476-4598-12-155. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from OncoTargets and therapy are provided here courtesy of Dove Press

RESOURCES