Skip to main content
PeerJ logoLink to PeerJ
. 2021 Aug 5;9:e11923. doi: 10.7717/peerj.11923

Signature identification of relapse-related overall survival of early lung adenocarcinoma after radical surgery

Peng Han 1, Jiaqi Yue 1, Kangle Kong 1, Shan Hu 1, Peng Cao 1, Yu Deng 1, Fan Li 1,, Bo Zhao 1,
Editor: Zunnan Huang
PMCID: PMC8349519  PMID: 34430085

Abstract

Background

The widespread use of low-dose chest CT screening has improved the detection of early lung adenocarcinoma. Radical surgery is the best treatment strategy for patients with early lung adenocarcinoma; however, some patients present with postoperative recurrence and poor prognosis. Through this study, we hope to establish a model that can identify patients that are prone to recurrence and have poor prognosis after surgery for early lung adenocarcinoma.

Materials and Methods

We screened prognostic and relapse-related genes using The Cancer Genome Atlas (TCGA) database and the GSE50081 dataset from the Gene Expression Omnibus (GEO) database. The GSE30219 dataset was used to further screen target genes and construct a risk prognosis signature. Time-dependent ROC analysis, calibration degree analysis, and DCA were used to evaluate the reliability of the model. We validated the TCGA dataset, GSE50081, and GSE30219 internally. External validation was conducted in the GSE31210 dataset.

Results

A novel four-gene signature (INPP5B, FOSL2, CDCA3, RASAL2) was established to predict relapse-related survival outcomes in patients with early lung adenocarcinoma after surgery. The discovery of these genes may reveal the molecular mechanism of recurrence and poor prognosis of early lung adenocarcinoma. In addition, ROC analysis, calibration analysis and DCA were used to verify the genetic signature internally and externally. Our results showed that our gene signature had a good predictive ability for recurrence and prognosis.

Conclusions

We established a four-gene signature and predictive model to predict the recurrence and corresponding survival rates in patients with early lung adenocarcinoma after surgery. These may be helpful for reforumulating post-operative consolidation treatment strategies.

Keywords: Early lung adenocarcinoma, Prognosis, Gene signature, Recurrence, Postoperation

Introduction

According to the latest epidemiological studies, lung cancer remains the most common malignancy in the world and one of the leading causes of cancer-related death (Siegel et al., 2021). It can be divided into small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). NSCLC accounts for approximately 85% of all lung cancer cases. NSCLC can be categorized as lung adenocarcinoma (LUAD) and squamous cell lung carcinoma. according to its pathogenesis and histological morphology (Molina et al., 2008; Malhotra et al., 2016). As tobacco control increases, lung adenocarcinoma incidence is now higher than that of lung squamous cell carcinoma and has become the main cause of lung cancer-related death (Siegel et al., 2021; Cayuela et al., 2021). However, lung cancer prognosis is still poor. There are no characteristic clinical symptoms in the early stages of lung cancer, delaying its diagnosis and treatment (Jacobsen et al., 2017; Hong et al., 2015). We rely on low-dose chest CT scan more than clinical symptoms, chest X-rays, or tumor markers for the early diagnosis of lung cancer (Cainap et al., 2020; Zhang et al., 2019a). Therefore, the application of low-dose chest CT scans in the screening of early lung disease is becoming more common. It is worth noting that, with the outbreak of COVID-19, low-dose chest CT scanning was widely popularized in the early screening of the disease (Herpe et al., 2021), which indirectly promoted the early diagnosis of lung cancer (Zhang et al., 2020). The diagnosis and treatment of lung cancer, including surgical resection, chemotherapy, radiotherapy, targeted therapy, and immunotherapy, have improved (Yuan et al., 2019; Wang et al., 2020; Zappa & Mousa, 2016). Among all the treatment methods, radical surgical resection is the first choice for treating patients clinically diagnosed with early lung cancer and can significantly improve their prognosis (Abbas, 2018). Over the past decade, the widespread popularization of thoracoscopy in thoracic surgery has reduced the surgical trauma for patients and has further established the therapeutic status of radical surgery (Abbas, 2018; Leshnower et al., 2010; Shaw et al., 2008). Ideally, we hope to detect lung cancer early through regular physical examination and then carry out radical minimally invasive surgical treatment to completely cure the cancer. However, some patients are prone to relapse after radical surgery, which seriously affects the prognosis and patients’ quality of life (Yang et al., 2020). Targeted therapy and immunotherapy have dramatically improved with remarkable effects in recent years; however, there is still no conclusion as to whether consolidation therapy such as targeted therapy or postoperative immunotherapy is necessary to improve disease outcomes. We do not have a comprehensive and systematic method to identify this disease, which limits postoperative survival for patients with early-stage lung cancer.

Gene chip and high-throughput sequencing technology are becoming more popular. We analyzed RNASeq datasets from The Cancer Genome Atlas (TCGA) database and gene-chip datasets from three GEO datasets. Univariate Cox regression analysis and Lasso regression analysis were used to identify common genes associated with prognosis and recurrence. A model predicting the relapse-related overall survival rate of early lung adenocarcinoma was established and validated. In contrast to previous reports on lung adenocarcinoma prognosis and recurrence models (Merritt et al., 2020; Jones et al., 2021; Li et al., 2017; Yin et al., 2020), our model was dedicated to predicting both overall survival and relapse-free survival in patients with early postoperative lung adenocarcinoma. Our model contains two prognostic factors which may be more clinically applicable. Evaluating the survival benefit based only on postoperative recurrence does not provide a comprehensive view. A subset of exceptional recurrent patients may present with slow-growing recurrent pulmonary nodules (Kobayashi et al., 2015; Detterbeck, 2019; Kobayashi & Mitsudomi, 2019) and survival with nodules rather than overtreatment may be a better clinical strategy (Hammer & Hatabu, 2020). We defined early lung cancer as T1 and T2 patients regardless of N stage. Ours conclusion is prospective and applicable to clinical scenarios. In clinical practice it is easy to confirm the T stage of patients by chest CT scan, but it is difficult to confirm the N stage of patients (McPherson et al., 2020; Ghaly et al., 2017). Most patients will choose to undergo radical surgical treatment as early as possible in the T1/T2 stage of tumors.

Materials and methods

Data processing

We downloaded RNA-sequencing datasets for 551 LUAD tissue samples with corresponding clinical data from TCGA (https://portal.gdc.cancer.gov/) on November 22, 2020. Exclusion criteria included: (1) a history of prior malignancy or other existing malignancy; (2) patients who received neoadjuvant therapy or other unacceptable treatment; (3) unclear pathological diagnosis or adenocarcinoma with mixed subtypes; and (4) patients with T stage III/IV, distant metastases, or uncertain TNM staging. We included 239 LUAD samples in a preliminary univariate survival analysis. Batch effect removal and differential expression analysis were performed using the DEseq2 package (http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html). The prognostic related genes were identified based on an adjusted P value < 0.05.

Three Affymetrix datasets (GSE50081, GSE30219 and GSE31210) from the GEO database were screened out and included in our next analysis to ensure the reliability of our results. The screening conditions included: (1) samples in the dataset had clear pathological diagnosis, TNM staging, and prognosis information; (2) early LUAD samples of >50 cases; (3) postoperative sample availability; and (4) overall survival time and recurrence time was greater than 30 days. We used R (version 4.0.2) for our analysis. The Robust multi-chip Averaging (RMA) algorithm of the “Affy (Gautier et al., 2004)” package directly extracted data for processing.

GO and KEGG pathway enrichment analysis

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was performed to comprehensively evaluate the biological processes of prognostic differentially genes to explore the potential molecular mechanisms behind prognostic genes. GO enrichment analysis was conducted to analyze the biological functions of prognostic-related differentially genes of early LUAD from cellular component (CC), molecular function (MF), and biological process (BP). KEGG enrichment analysis was dedicated to predicting the major biological pathways involved in DEGs. An adjusted P value < 0.05 was considered statistically significant. We used the org.hs.eg.db and clusterProfiler (Yu et al., 2012) R packages for enrichment analysis.

Establishing and validating a prognostic predictive signature

We performed prognostic analysis on the TCGA datasets, the GSE50081 survival dataset, and the GSE50081 relapse dataset. The intersection was taken to obtain relapse-related prognostic genes. Subsequently, the above genes were included into the GSE30219 dataset for univariate regression analysis and Lasso regression analysis. The selected genes were used to construct a multivariate Cox proportional risk regression signature, and the corresponding risk score was calculated to evaluate the prognosis and recurrence of the patients. Risk score = β1*Exp1+ β2*Exp2+ β3*Exp3..., where Exp is the normalized gene expression level and β is the coefficient value. According to the normality test, the appropriate cut-off value was calculated to divide LUAD patients into low- and high-groups. The Kaplan–Meier method was used to evaluate the difference in overall survival and recurrence between low- and high-groups. The K-M analysis was conducted to validate our risk signature. The R packages used in this analysis include “Survival”, “plyr”, “ggplot2”, “SurvMiner”, “dplyr”, “Formula”, “tidyverse”, and “Survivalroc”.

Constructing and evaluating a predictive nomogram

We calculated the survival and relapse related clinical parameters (gender, age, smoking, T stage, N stage, and risk score) through univariate and multivariate regression analyses to improve the clinical transformation efficiency of our model. We used filtered-independent prognostic clinical parameters and risk scores to construct a univariate nomogram and a combined nomogram. Three recognized algorithms were applied for internal and external validation, respectively, to confirm the predictive potential of those models. ROC analysis was applied to evaluate the predictive performance of the nomogram. Calibration analysis was selected to evaluate clinical consistency of the histogram predictions. We used DCA to evaluate the clinical net benefit. For internal validation, the above three validation methods were used to evaluate one-year, three-year, and five-year OS and RFS of early LUAD patients in the TCGA, GSE50081, and GSE30219 datasets, respectively. The same method was repeated in the GSE31210 dataset for external evaluation. The R packages applied in this study included “ROCR30”, “SurvMisc”, “RMS”, “pROC”, “hash”, “timeROC”, “forestplot” and “VennDiagram”.

Results

Preliminary identification of relapse-related prognostic genes

We thoroughly screened the LUAD RNAseq data downloaded from the TCGA database and included 239 tumor samples in our prognostic analysis. The flow chart is shown in Fig. 1. We removed 12 patients with an OS <30 days or with unknown survival information and the remaining 227 patients were included in our univariate Cox regression analysis. The results indicated that 1,455 protein coding genes (P < 0.05) were statistically significant, including 567 differentially up-regulated genes and 410 down-regulated genes (239 Tumor vs. 36 Normal, P < 0.05) (Table S1). We then screened the GSE50081 dataset, which had complete TNM staging, prognosis, and relapse-related information, to further identify statistically significant relapse-associated prognostic genes. After normalizing using the RMA algorithm (Fig. 2A), univariate regression analysis was performed. Genes related to survival prognosis (Table S2) and disease-free survival (Table S3) were calculated, respectively. We recognized 948 genes associated with LUAD survival prognosis (P < 0.01) and 2,698 genes with LUAD recurrence (P < 0.05). We intersected GSE50081 survival related genes, GSE50081 disease-free survival related genes, and TCGA prognostic related genes. We then obtained an additional 90 reliable relapse-related prognostic genes (Fig. 2B).

Figure 1. The flow chart showing the scheme of our study on relapse-related prognostic signatures of early LUAD.

Figure 1

Figure 2. (A) RMA normalization of the GSE50081 dataset. (B) Intersection of TCGA datasets, GSE50081 survival and relapse datasets. GO (C) and KEGG (D) pathway enrichment analysis of 90 prognostic genes from the TCGA dataset and the GSE50081 dataset.

Figure 2

GO and KEGG pathway enrichment analysis

GO enrichment analysis was conducted on 90 relapse-associated prognostic genes to explore the major biological functional modules and pathways involved in prognostic genes. Our results showed that those genes were mainly enriched in biological processes related to “platelet-derived growth factor binding”, “extracellular matrix structural constituent conferring tensile strength”, “growth factor binding”, “proteoglycan binding”, “aminopeptidase activity”, “cyclin-dependent protein serine/threonine kinase regulator activity”, “NADP binding”, “cell adhesion molecule binding”, and “coenzyme binding” (Fig. 2C). Further KEGG pathway enrichment analysis revealed that prognostic DEGs were significantly enriched in “cell cycle”, “ protein digestion and absorption”, “oocyte meiosis”, and “progesterone-mediated oocyte maturation” pathways (Fig. 2D).

Identification and verification of relapse-related DEGs

The most popular and effective treatment for early LUAD is radical surgical resection. Postoperative recurrence is one of the major risks of cancer-related death in early lung adenocarcinoma and also a significant cause of poor prognosis (Martini et al., 1995; Mahvi et al., 2018). In order to screen out more important target genes, we included the 90 prognostic genes we obtained into the GSE30219 prognosis and relapse dataset. Thirty-six genes showed significant differences (Table S4) and were subsequently included in the GSE30219 prognosis (Table S5) and recurrence dataset (Table S6) for LASSO regression analysis. To ensure result repeatability, 1,000 LASSO regressions were performed in each dataset. Genes that showed up more than 950 times in 1,000 analyses were considered significant. Ultimately, four genes were identified and subsequently used to construct a relapse-associated prognostic gene signature. The four genes identified were RAS protein activator like 2 (RASAL2, ENSG00000075391.15), cell division cycle associated 3 (CDCA3, ENSG00000111665.10), Fos-related antigen 1 (FOSL1, ENSG00000175592.7), and inositol polyphosphate-5-phosphatase B (INPP5B, ENSG00000204084.11). The risk score = (0.8138293*Exp RASAL2) + (0.3749033*Exp CDCA3) + (0.3233154*Exp FOSL1) + (−1.3673670*Exp INPP5B). The Shapiro–Wilk test was used to calculate the distribution of risk scores in each dataset (Table S7). If data was normally distributed (P > 0.05) we selected their mean value as the cut-off. If the dataset had skewed distribution (P < 0.05), the median was selected as the cut-off. Samples were divided into high-and low-score subgroups according to their cut-off value. In order to verify the reliability of the risk score, we first conducted K-M analysis on three datasets, GSE30219, GSE50081, and TCGA respectively (Figs. 3A3E). K-M curve analysis suggested that the high-score group predicted a poor prognosis compared with the low-score group, with statistically significant differences.

Figure 3. KM analysis results of internal validation sets.

Figure 3

(A) K-M analysis of overall survival for risk scores in the GSE30219 dataset. (B) K-M analysis of relapse-free survival for risk scores in the GSE30219 dataset. (C) K-M analysis of overall survival for risk scores in the GSE50081 dataset. (B) K-M analysis of relapse-free survival for risk scores in the GSE50081 dataset. (E) K-M analysis of overall survival for risk scores in TCGA dataset. (F) Univariate regression analysis of clinical parameters related to overall survival in the GSE50081 dataset.

Evaluating the independent role of the prognostic signature and construction of comprehensive prediction model

We conducted univariate regression analysis (Fig. 3F) for the clinical parameters of gender, age, smoking, T staging, and N staging (Table S8) of the GSE50081 dataset. We then performed multivariate cox regression analysis (Fig. 4A) for the results with a P value of <0.05 (T staging, N staging, and risk score). The results suggested that both univariate (P = 0.006, HR =2.27) and multivariate (P = 0.014, HR = 2.10) Cox regression analyses had significant prognostic significance in lymph node metastasis. Our analysis of clinical data from the TCGA dataset also found that only lymph node metastasis had significant prognostic value in both univariate and multivariate regression (Table S9). Therefore, we constructed a comprehensive survival prediction model (Fig. 4B) that included the two factors of lymph node metastasis and risk score.

Figure 4. Construction of risk prediction model.

Figure 4

(A) Multivariate regression analysis of clinical parameters related to overall survival. For each patient, three lines are drawn upward to determine the points received from the three predictors in the nomogram. The sum of these points is located on the ‘Total Points’ axis. Then a line is drawn downward to determine the possibility of 1-, 3-, and 5-year OS (B)/RFS. (E) of LUAD. RFS-related univariate (C) and multivariate (D) regression analyses were performed to screen for independent relapse clinical parameters.

We also calculated the correlation between clinical parameters and recurrence. The results indicated that there were significant statistical differences in the N stage for tumor recurrence regardless of the univariate (P = 0.009, HR = 2.45) (Fig. 4C) or multivariate regression analysis (P = 0.037, HR = 2.08) (Fig. 4D). T staging was statistically significant in univariate analysis (P = 0.003, HR = 4.13), but not in multivariate analysis (P = 0.081, HR = 2.45). Therefore, we constructed a comprehensive recurrence prediction model (Fig. 4E) that included the lymph node metastasis and risk score.

Internal validation of the prognostic and recurrent signature

To further evaluate the predictive performance and clinical suitability of our model, we evaluated the prognostic model and the recurrence model using time-dependent ROC curve analyses, calibration analysis and decision curve analysis (DCA), respectively. First, we verified the reliability of the survival prognosis model in the GSE50081 dataset, and ROC analysis found that the prediction ability of our model was generally good. Compared with the risk score univariate model (Fig. 5A), the combined model fitted with N stage and risk score (Fig. 5C) showed more advantages in the prediction of three-year survival. There was no significant difference in the prediction of five-year survival, while the combined model showed a relatively low ability in the prediction of one-year survival. This may be related to the poor prediction for one-year survival by the N staging univariate model (Fig. 5B). We performed a C-index analysis on the model to further investigate the clinical predictive power of the combined model. The C-index of the combined model was 0.694 (0.653–0.735), which indicates that the combined model has a good predictive ability. The c-index of the risk score univariate model was 0.683 (0.642–0.724). Further calibration analysis showed that our combined model had good overall clinical applicability. The combined model predicted three-year and five-year OS with higher accuracy than one-year OS (Figs. 5D5F). We conducted a clinical DCA to evaluate the model’s clinical efficacy (Figs. 5G5I). In general, our model had high predictive power and clinical consistency for three-year and five-year OS, while the one-year prediction model was relatively poor. The risk score model was similar to the combined model, while the lymph node prediction model had the worst predictive ability. In summary, through multiple algorithm validation, we found that our combined model has good long-term survival prediction ability, while the univariate risk score prediction model has good near-term prediction ability. This is reasonable because early lung adenocarcinoma has an extremely high one-year survival rate after surgery. Thus, we are more concerned about the long-term survival rate after surgery.

Figure 5. Verification of the risk signature.

Figure 5

(A–C) Time-dependent ROC curves of the nomogram of 1-, 3- and 5-year overall survival for univariate model and combined model to evaluate the predictive performance of the nomogram. The horizontal axis represents the sensitivity of nomogram prediction, and the vertical axis represents the specificity of nomogram prediction. The higher the sensitivity and specificity of nomogram curve, the stronger the prediction power. (D–F) The calibration plot for internal validation of the nomogram. The solid line represents the prediction ability of the model. The closer the solid line is to the dotted line, the closer the prediction of the model is to the real situation. (G-I) DCA curves of the univariate model and combined model for 1-,3- and 5-year overall survival in LUAD to evaluate the clinical decision-making benefts. The horizontal axis represents the threshold probability and the vertical axis represents the net gain. The higher the net gain under the same threshold probability, the stronger the Nomogram prediction power will be.

The reliability of the relapse-free survival prognostic model was verified in the GSE50081 dataset. ROC analysis showed that the combined model fitted with N stage and risk score had the best predictive power (Fig. 6A), with one-year, three-year, and five-year AUC of 0.78, 0.69, and 0.75, respectively. The risk score prediction model’s predictive ability (Fig. 6B) was relatively high, while the predictive ability of the N-stage prediction model (Fig. 6C) was relatively poor. Lymph node metastasis is undoubtedly an independent prognostic factor for prognosis and recurrence, and our risk score model showed significantly better predictive power than the N Stage prognostic model, which will be more clinically applicable. We performed a C-index analysis on the model to further investigate the clinical predictive power of the combined model. The C-index of the combined model was 0.697 (0.647–0.747), which indicates that the combined model has good predictive ability. Further calibration analysis showed that our combined model had good clinical consistency in one-year, three-year, and five-year RFS (Figs. 6D6F). Among them, our model has the highest prediction accuracy for one-year RFS, which is consistent with the above ROC analysis results. In addition, in order to evaluate the accuracy of the model and compare the advantages and disadvantages of each model, we conducted a clinical DCA (Figs. 6G6I). In general, our model has high predictive power for one-year and five-year RFS. However, the combined model and the risk score model have their own advantages and disadvantages, and both are better than the N stage model. We found that our combined model has good RFS prediction ability through multiple algorithm validation. Similarly, we also conducted similar verification in the GSE30219 data set (Fig. S1). Due to the small sample size (n = 2/81) of lymph node metastasis in this dataset, we only validated the risk score model. The C-index of the survival and relapse groups was 0.758 (0.72–0.796) and 0.809 (0.765–0.853), respectively. In addition, the areas under the ROC curve in the survival group (Fig. S1A) and the relapse group (Fig. S1E) were 0.81, 0.83, 0.85, 0.88, 0.9, and 0.86 at one, three, and five years, respectively. These results show that the risk score prediction model has nearly perfect predictive ability.

Figure 6. Internal validation of risk prediction models.

Figure 6

(A–C) Time-dependent ROC curves of the nomogram of 1-, 3- and 5-year RFS for univariate model and combined model to evaluate the predictive performance. (D–F) The calibration plot for internal validation of the nomogram. (G–I) DCA curves of the univariate model and combined model for 1-,3- and 5-year RFS in LUAD to evaluatethe clinical decision-making benefts.

External validation of OS and RFS prognostic signature

The GSE31210 datasets were selected for external validation due to its complete prognosis and relapse sample information. The normalized results from the raw data are shown in Fig. S2. For the validation of survival related risk signatures, we adopted the same verification method described above. In the GSE31210 survival dataset (n = 226), we classified the risk scores into high-score and low-score groups. K-M analysis suggested that the high-score group had worse prognoses than the low-score groups (P = 0.00039) (Fig. 7A). Similarly, we found that the high-score group was more likely to relapse than the low-score group (P < 0.0001) (Fig. 7B). Time-dependent ROC analysis in the survival dataset showed that AUC for one-year, three-year and five-year OS of this validation set was 0.67, 0.66, 0.7, respectively (Fig. 8A). The C-index was 0.669 (0.628–0.71). Calibration analysis showed that the risk signature had good clinical consistency and survival prediction ability (Figs. 8B8D). In the GSE31210 relapse dataset (n = 226), time-dependent ROC analysis showed that AUC for one-year, three-year, and five-year OS of this validation set was 0.65, 0.63, and 0.7, respectively (Fig. 8E). The C-index was 0.642 (0.609–0.675). Calibration analysis also showed good survival prediction (Figs. 8F8H), especially for the five-year survival rate. In general, our risk signature has good predictive ability. Multiple algorithms show that our risk signature has a relatively good predictive ability in the GSE30219 dataset, which may be because the proportion of positive events in the GSE30219 dataset (42/81) is higher than that in the GSE31210 dataset (34/226).

Figure 7. The KM analysis result of Risk Signature in the external verification set of GSE31210.

Figure 7

(A) The KM analysis result of Risk Signature on the overall lifetime. (B) The KM analysis results of Risk Signature for relapse-free survival.

Figure 8. External validation of genetic signatures.

Figure 8

(A)/(D)Time-dependent ROC curves of the risk signature for 1-, 3- and 5-year OS/RFS in LUAD to evaluate the predictive performance in the GSE31210 dataset. The calibration plot for 1- , 3- and 5-year OS/RFS of the risk signature in the GSE31210 dataset (B–D)/(F–H).

Discussion

The clinical diagnosis of lung cancer has improved with the wide application of low-dose chest CT scan in routine screening of lung disease. Although minimally invasive radical surgery can greatly improve the survival rate of patients with cancer and has been widely used, there are still some patients who are prone to recurrence, which significantly affects their prognosis. Therefore, we constructed a 4-gene relapse-related survival risk model, which can identify individuals with a high recurrence rate and significant poor prognosis. Internal validation and external validation also confirm that our prediction model has high predictive ability. We can further develop individualized postoperative consolidation therapy, which is expected to improve overall survival after early lung adenocarcinoma for identified high-risk individuals.

A total of four genes were identified and validated (INPP5B, FOSL2, CDCA3, RASAL2). INPP5B seems to play a major role in our risk signature. The gene encodes a member of the inositol polyphospho-5 phosphatase family. These enzymes are involved in key activities such as cell signal transduction, membrane transport and cytoskeleton activity by regulating inositol phosphate (Bohdanowicz et al., 2012). Current studies on INPP5b have focused on its role in Lowe Syndrome and Type 2 DENT disease, while its role in lung cancer has not been reported. However, it has been shown that the same family of INPP5J appears to be a tumor suppressor gene (Ben-Chetrit et al., 2015), and many inositol polyphospho-5 phosphatase family molecules can affect cell migration, adhesion and polarity (Ramos, Elong Edimo & Erneux, 2018). For example, INPP5D (Wain, Westwick & Ward, 2005) and INPPL1 (Venkatareddy et al., 2011) have been reported to significantly affect cell migration and T cell chemotaxis. In addition, INPP5B is also significantly down-regulated in lung adenocarcinoma. Its high expression indicates a good prognosis and it may be a potential tumor suppressor gene in lung adenocarcinoma (http://gepia.cancer-pku.cn/detail.php?gene=INPP5B).

FOSL1 is a member of the Fos protein family. The protein encoded by FOSL1 is an essential component of the transcription factor complex AP-1 and is considered to be an important part of cell proliferation and differentiation (Talotta, Casalino & Verde, 2020). In lung cancer, FOSL1 acts as a bridge between KRAS mutations and mitosis and its inhibition will reduce the viability of KRAS mutant cells (Vallejo et al., 2017; Elangovan et al., 2018). FOSL1 can regulate gene expression induced by KRAS mutation, and thus control cell proliferation and survival, and predicts a poor prognosis in lung cancer. In addition, FOSL1, as an important hub gene, can also be regulated by non-coding RNAs such as miR-130a and LINC00460 to further regulate the progression of lung cancer (Xu, Wang & Liu, 2020; Cisneros-Villanueva et al., 2021). Moreover, FOSL1 may play an important role in the development and metastasis of tumors (Maurus et al., 2017). It has been clearly reported that FOSL1 can also promote the progression of pancreatic cancer (Vallejo et al., 2017; Luo, He & Qiu, 2018), bile duct cancer (Vallejo et al., 2021), breast cancer (Kim et al., 2020; Chen et al., 2018), bladder cancer (Cui et al., 2020; Gatta et al., 2019), stomach cancer (He et al., 2015) and esophageal cancer (Shen et al., 2020) through direct or indirect mechanisms. Therefore, high expression of FOSL1 may contribute to poor prognosis of LUAD from multiple aspects of tumor genesis and progression.

CDCA3 is a member of the cell division cycle-associated protein family, which is involved in the cell cycle and has been found to promote cell proliferation and cell cycle processes in a variety of tumors (Qian et al., 2018; Zhang et al., 2019b; Wu et al., 2020). For example, in colorectal cancer, CDCA3 can mediate p21-dependent proliferation by regulating E2F1 expression or promote cell proliferation by activating the NF-KappaB /cyclin D1 signaling pathway (Qian et al., 2018; Zhang et al., 2018). In addition to promoting cell proliferation, CDCA3 has been reported to promote the migration, invasion and chemotherapy resistance of tumors (Liu et al., 2020; Yu et al., 2020). As an important hinge molecule, CDCA3 can also be directly regulated by non-coding RNAs (Gao & Ji, 2021; Dou et al., 2020; Chen et al., 2020a). These evidences suggest an important role of CDCA3 in promoting cancer in a variety of tumors. What’s more, direct evidence for CDCA3 in lung cancer has also been provided. It has been reported that CDCA3 is significantly upregulated in lung cancer, the deletion of CDCA3 gene inhibits the proliferation of lung adenocarcinoma cell lines and promotes cell senescence (Adams et al., 2017), which is consistent with our results.

RASAL2 is a member of the RAS GTPASe-activated proteins (GAP) family, which negatively regulate the RAS signaling pathway by catalyzing the hydrolysis of Ras-GTP to Ras-GDP (Zhou et al., 2019). Several studies have shown that its effect on tumors is bidirectional (Zhou et al., 2019; Wang, Yin & Yang, 2019; Wang et al., 2019). For instance, RASAL2 inhibits tumor cell migration and invasion by inactivating the RAS pathway (Hui et al., 2017; McLaughlin et al., 2013). RASAL2 can also promote the progression of colorectal cancer through the LATS2 / YAP axis (Pan et al., 2018). In lung cancer, previous studies seem to indicate that RASAL2 is a tumor suppressor (Xiong et al., 2021). Low expression of RASAL2 can promote EMT in lung cancer (Li & Li, 2014) and promote metastasis through the RAS /ERK pathway (Fan et al., 2021), which is inconsistent with our results. Our evidence suggests that RASAL2 is a contributing factor to poor prognosis in lung adenocarcinoma. This is an interesting conclusion, and paradoxical conclusions must be accompanied by important intermediate mechanisms. A recent study suggests that the phosphorylation status of RASAl2 S351 acts as a molecular switch to inhibit or promote AMPK-mediated autophagy (Bao et al., 2021), suggesting that there is a regulatory switch for the bidirectional effect of RASAL2.

In conclusion, INPP5B is a potential tumor suppressor gene that has not yet been experimented, and RASAL2 may have a bidirectional effect on tumors, however, both require further study. CDCA3 and FOSL1 have relatively clear carcinogenic effects and are indicators of poor prognosis (Fig. 9) (Chen et al., 2020b). Our risk signature had a good ability to predict both postoperative RFS and OS for early lung adenocarcinoma. However, there are still some shortcomings in our analysis. First, the combined model with clinical parameters was not validated in the external dataset, mainly because some clinical data in the external dataset were incomplete or severely unbalanced. Second, we require additional external data sets containing complete information to further verify our model’s predictive ability. Third, regarding the cut-off value of the risk signature, due to the reagent, sequencing time/method, operational differences, and other factors, the batch effect between different datasets was too large. There is still an irreconcilable batch effect between different datasets after the SVA package was used. A uniform cut-off value, therefore, is not suitable for this study. Fourth, as a predictive model, the combined model may underestimate or overestimate patient outcomes.

Figure 9. The effect of key genes reported from previous studies compared with our analysis.

Figure 9

Conclusions

We constructed a four-gene signature risk model to predict RFS and OS after early lung adenocarcinoma surgery. We found that both CDCA3 and FOL1 have clear tumor-promoting effects in a variety of tumors and both play key roles in the regulatory mechanisms, which is consistent with previous studies. We also found that INPP5B may be a potential tumor suppressor gene that has never been reported in tumors, and there is a variety of evidence to demonstrate the important role of inositol polyphospho-5 phosphatase family in signal transduction, cell adhesion, and migration. Interestingly, RASAL2 shows significant bidirectional effects in other tumors, and it has been reported that the epigenetic regulation of RasAL2 acts as a molecular switch for promoting and inhibiting effects. Therefore, the complex mechanism of action of RASAL2 requires further clarification. In conclusion, our study may improve our understanding of the postoperative progress of early lung adenocarcinoma and provide new diagnostic indicators and therapeutic targets for clinical treatment in the future.

Supplemental Information

Supplemental Information 1. Supplemental Tables.
DOI: 10.7717/peerj.11923/supp-1
Supplemental Information 2. Validation results of GSE30219 dataset.
DOI: 10.7717/peerj.11923/supp-2
Supplemental Information 3. Normalization of GSE31210 .
DOI: 10.7717/peerj.11923/supp-3
Supplemental Information 4. Sample R code used in the analysis and validation process.

The core code in the process of data analysis and verification (the partially redundant code related to basic data collation is not provided).

DOI: 10.7717/peerj.11923/supp-4

Funding Statement

The authors received no funding for this work.

Contributor Information

Fan Li, Email: tjhtsdrli@163.com.

Bo Zhao, Email: 13006369600@163.com.

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Peng Han conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Jiaqi Yue analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Kangle Kong, Shan Hu and Peng Cao performed the experiments, prepared figures and/or tables, and approved the final draft.

Yu Deng conceived and designed the experiments, performed the experiments, prepared figures and/or tables, and approved the final draft.

Fan Li conceived and designed the experiments, analyzed the data, prepared figures and/or tables, and approved the final draft.

Bo Zhao conceived and designed the experiments, prepared figures and/or tables, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The R code and related files are available in the Supplemental Files.

References

  • Abbas (2018).Abbas AE. Surgical management of lung cancer: history, evolution, and modern advances. Current Oncology Reports. 2018;20:98. doi: 10.1007/s11912-018-0741-7. [DOI] [PubMed] [Google Scholar]
  • Adams et al. (2017).Adams MN, Burgess JT, He Y, Gately K, Snell C, Zhang S-D, Hooper JD, Richard DJ, O’Byrne KJ. Expression of CDCA3 is a prognostic biomarker and potential therapeutic target in non-small cell lung cancer. Journal of Thoracic Oncology. 2017;12:1071–1084. doi: 10.1016/j.jtho.2017.04.018. [DOI] [PubMed] [Google Scholar]
  • Bao et al. (2021).Bao Y, Qian C, Liu M-Y, Jiang F, Jiang X, Liu H, Zhang Z, Sun F, Fu N, Hou Z, Ke Y, Li Y, Qian Z-M. PRKAA/AMPK α phosphorylation switches the role of RASAL2 from a suppressor to an activator of autophagy. Autophagy. 2021:1–15. doi: 10.1080/15548627.2021.1886767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ben-Chetrit et al. (2015).Ben-Chetrit N, Chetrit D, Russell R, Körner C, Mancini M, Abdul-Hai A, Itkin T, Carvalho S, Cohen-Dvashi H, Koestler WJ, Shukla K, Lindzen M, Kedmi M, Lauriola M, Shulman Z, Barr H, Seger D, Ferraro DA, Pareja F, Gil-Henn H, Lapidot T, Alon R, Milanezi F, Symons M, Ben-Hamo R, Efroni S, Schmitt F, Wiemann S, Caldas C, Ehrlich M, Yarden Y. Synaptojanin 2 is a druggable mediator of metastasis and the gene is overexpressed and amplified in breast cancer. Science Signaling. 2015;8:ra7. doi: 10.1126/scisignal.2005537. [DOI] [PubMed] [Google Scholar]
  • Bohdanowicz et al. (2012).Bohdanowicz M, Balkin DM, De Camilli P, Grinstein S. Recruitment of OCRL and Inpp5B to phagosomes by Rab5 and APPL1 depletes phosphoinositides and attenuates Akt signaling. Molecular Biology of the Cell. 2012;23:176–187. doi: 10.1091/mbc.E11-06-0489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cainap et al. (2020).Cainap C, Pop LA, Balacescu O, Cainap SS. Early diagnosis and screening in lung cancer. American Journal of Cancer Research. 2020;10:1993–2009. [PMC free article] [PubMed] [Google Scholar]
  • Cayuela et al. (2021).Cayuela L, López-Campos JL, Otero R, Rodriguez Portal JA, Rodríguez-Domínguez S, Cayuela A. The beginning of the trend change in lung cancer mortality trends in Spain, 1980-2018. Archivos de Bronconeumologia. 2021;57:115–121. doi: 10.1016/j.arbres.2020.04.025. [DOI] [PubMed] [Google Scholar]
  • Chen et al. (2018).Chen X, Zhao M, Huang J, Li Y, Wang S, Harrington CA, Qian DZ, Sun X-X, Dai M-S. microRNA-130a suppresses breast cancer cell migration and invasion by targeting FOSL1 and upregulating ZO-1. Journal of Cellular Biochemistry. 2018;119:4945–4956. doi: 10.1002/jcb.26739. [DOI] [PubMed] [Google Scholar]
  • Chen et al. (2020b).Chen S, Gao C, Wu Y, Huang Z. Identification of prognostic miRNA signature and lymph node metastasis-related key genes in cervical cancer. Frontiers in Pharmacology. 2020b;11:544. doi: 10.3389/fphar.2020.00544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Chen et al. (2020a).Chen Q, Zhou L, Ye X, Tao M, Wu J. miR-145-5p suppresses proliferation, metastasis and EMT of colorectal cancer by targeting CDCA3. Pathology, Research and Practice. 2020a;216:152872. doi: 10.1016/j.prp.2020.152872. [DOI] [PubMed] [Google Scholar]
  • Cisneros-Villanueva et al. (2021).Cisneros-Villanueva M, Hidalgo-Pérez L, Cedro-Tanda A, Peña-Luna M, Mancera-Rodríguez MA, Hurtado-Cordova E, Rivera-Salgado I, Martínez-Aguirre A, Jiménez-Morales S, Alfaro-Ruiz LA, Arellano-Llamas R, Tenorio-Torres A, Domínguez-Reyes C, Villegas-Carlos F, Ríos-Romero M, Hidalgo-Miranda A. LINC00460 is a dual biomarker that acts as a predictor for increased prognosis in basal-like breast cancer and potentially regulates immunogenic and differentiation-related genes. Frontiers in Oncology. 2021;11:628027. doi: 10.3389/fonc.2021.628027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cui et al. (2020).Cui YP, Xie M, Pan WX, Zhang ZY, Li WF. HOXA10 promotes the development of bladder cancer through regulating FOSL1. European Review for Medical and Pharmacological Sciences. 2020;24:2945–2954. doi: 10.26355/eurrev_202003_20659. [DOI] [PubMed] [Google Scholar]
  • Detterbeck (2019).Detterbeck FC. Surveillance of ground glass nodules-when is enough, enough. Translational Lung Cancer Research. 2019;8:S428–S429. doi: 10.21037/tlcr.2019.10.07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Dou et al. (2020).Dou D, Ren X, Han M, Xu X, Ge X, Gu Y, Wang X, Zhao S. CircUBE2D2 (hsa_circ_0005728) promotes cell proliferation, metastasis and chemoresistance in triple-negative breast cancer by regulating miR-512-3p/CDCA3 axis. Cancer Cell International. 2020;20:454. doi: 10.1186/s12935-020-01547-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Elangovan et al. (2018).Elangovan IM, Vaz M, Tamatam CR, Potteti HR, Reddy NM, Reddy SP. FOSL1 promotes kras-induced lung cancer through amphiregulin and cell survival gene regulation. American Journal of Respiratory Cell and Molecular Biology. 2018;58:625–635. doi: 10.1165/rcmb.2017-0164OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Fan et al. (2021).Fan D, Yu S, Yang Y, Qu S. Low expression of Rasal2 promotes non-small cell lung cancer metastasis through Ras/ERK pathway. Biological & Pharmaceutical Bulletin. 2021;44:992–998. doi: 10.1248/bpb.b21-00231. [DOI] [PubMed] [Google Scholar]
  • Gao & Ji (2021).Gao T, Ji Y. Long noncoding RNA LINC00707 accelerates tumorigenesis and progression of bladder cancer via targeting miR-145/CDCA3 regulatory loop. Urologia Internationalis. 2021:1–15. doi: 10.1159/000514388. [DOI] [PubMed] [Google Scholar]
  • Gatta et al. (2019).Gatta LB, Melocchi Laura, Bugatti M, Missale F, Lonardi S, Zanetti B, Cristinelli L, Belotti S, Simeone C, Ronca R, Grillo E, Licini S, Bresciani D, Tardanico R, Ruby Chan S, Giurisato E, Calza S, Verm W. Hyper-Activation of STAT3 sustains progression of non-papillary basal-type bladder cancer via FOSL1 regulome. Cancer. 2019;11:1219. doi: 10.3390/cancers11091219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gautier et al. (2004).Gautier L, Cope L, Bolstad BM, Irizarry RA. affy–analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004;20:307–315. doi: 10.1093/bioinformatics/btg405. [DOI] [PubMed] [Google Scholar]
  • Ghaly et al. (2017).Ghaly G, Rahouma M, Kamel MK, Nasar A, Harrison S, Nguyen AB, Port J, Stiles BM, Altorki NK, Lee PC. Clinical predictors of nodal metastases in peripherally clinical T1a N0 non-small cell lung cancer. The Annals of Thoracic Surgery. 2017;104:1153–1158. doi: 10.1016/j.athoracsur.2017.02.074. [DOI] [PubMed] [Google Scholar]
  • Hammer & Hatabu (2020).Hammer MM, Hatabu H. Subsolid pulmonary nodules: controversy and perspective. European Journal of Radiology Open. 2020;7:100267. doi: 10.1016/j.ejro.2020.100267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • He et al. (2015).He J, Zhu G, Gao L, Chen P, Long Y, Liao S, Yi H, Yi W, Pei Z, Wu Minghua, Li X, Xiang J, Peng S, Ma J, Zhou M, Xiong W, Zeng Z, Xiang B, Tang KE, CAO L, Li G, Zhou Y. Fra-1 is upregulated in gastric cancer tissues and affects the PI3K/Akt and p53 signaling pathway in gastric cancer. International Journal of Oncology. 2015;47:1725–1734. doi: 10.3892/ijo.2015.3146. [DOI] [PubMed] [Google Scholar]
  • Herpe et al. (2021).Herpe G, Lederlin M, Naudin M, Ohana M, Chaumoitre K, Gregory J, Vilgrain V, Freitag CA, De Margerie-Mellon C, Flory V, Ludwig M, Mondot L, Fitton I, Jacquier ARR, Ardilouze P, Petit I, Gervaise A, Bayle O, Crombe A, Mekuko Sokeng M, Thomas C, Henry G, Bliah V, Le Tat T, Guillot M-S, Gendrin P, Garetier M, Bertolle E, Montagne C, Langlet B, Kalaaji A, Kayayan H, Desmots F, Dhaene B, Saulnier P-J, Guillevin R, Bartoli J-M, Beregi J-P, Tasu JP. Efficacy of chest CT for COVID-19 pneumonia diagnosis in France. Radiology. 2021;298:E81–E87. doi: 10.1148/radiol.2020202568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Hong et al. (2015).Hong QY, Wu G-M, Qian G-S, Hu C-P, Zhou J-Y, Chen L-A, Li W-M, Li S-Y, Wang K, Wang Q, Zhang X-J, Li J, Gong X, Bai C-X. Prevention and management of lung cancer in China. Cancer. 2015;121(Suppl 17):3080–3088. doi: 10.1002/cncr.29584. [DOI] [PubMed] [Google Scholar]
  • Hui et al. (2017).Hui K, Gao Y, Huang J, Xu S, Wang B, Zeng J, Fan J, Wang X, Yue Y, Wu S, Hsieh J-T, He D, Wu K. RASAL2, a RAS GTPase-activating protein, inhibits stemness and epithelial-mesenchymal transition via MAPK/SOX2 pathway in bladder cancer. Cell Death & Disease. 2017;8:e2600. doi: 10.1038/cddis.2017.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Jacobsen et al. (2017).Jacobsen MM, Silverstein SC, Quinn M, Waterston LB, Thomas CA, Benneyan JC, Han PKJ. Timeliness of access to lung cancer diagnosis and treatment: a scoping literature review. Lung Cancer. 2017;112:156–164. doi: 10.1016/j.lungcan.2017.08.011. [DOI] [PubMed] [Google Scholar]
  • Jones et al. (2021).Jones GD, Brandt WS, Shen R, Sanchez-Vega F, Tan KS, Martin A, Zhou J, Berger M, Solit DB, Schultz N, Rizvi H, Liu Y, Adamski A, Chaft JE, Riely GJ, Rocco G, Bott MJ, Molena D, Ladanyi M, Travis WD, Rekhtman N, Park BJ, Adusumilli PS, Lyden D, Imielinski M, Mayo MW, Li BT, Jones DR. A genomic-pathologic annotated risk model to predict recurrence in early-stage lung adenocarcinoma. JAMA Surgery. 2021;156:e205601. doi: 10.1001/jamasurg.2020.5601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kim et al. (2020).Kim JE, Kim BG, Jang Y, Kang S, Lee JH, Cho NH. The stromal loss of miR-4516 promotes the FOSL1-dependent proliferation and malignancy of triple negative breast cancer. Cancer Letters. 2020;469:256–265. doi: 10.1016/j.canlet.2019.10.039. [DOI] [PubMed] [Google Scholar]
  • Kobayashi & Mitsudomi (2019).Kobayashi Y, Mitsudomi T. Ground glass nodules with 5 years’ stability can grow after 10-year follow-up: do genetic features determine the fate? Translational Lung Cancer Research. 2019;8:S425–S427. doi: 10.21037/tlcr.2019.10.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kobayashi et al. (2015).Kobayashi Y, Mitsudomi T, Sakao Y, Yatabe Y. Genetic features of pulmonary adenocarcinoma presenting with ground-glass nodules: the differences between nodules with and without growth. Annals of Oncology. 2015;26:156–161. doi: 10.1093/annonc/mdu505. [DOI] [PubMed] [Google Scholar]
  • Leshnower et al. (2010).Leshnower BG, Miller DL, Fernandez FG, Pickens A, Force SD. Video-assisted thoracoscopic surgery segmentectomy: a safe and effective procedure. The Annals of Thoracic Surgery. 2010;89:1571–1576. doi: 10.1016/j.athoracsur.2010.01.061. [DOI] [PubMed] [Google Scholar]
  • Li et al. (2017).Li B, Cui Y, Diehn M, Li R. Development and validation of an individualized immune prognostic signature in early-stage nonsquamous non-small cell lung cancer. JAMA Oncology. 2017;3:1529–1537. doi: 10.1001/jamaoncol.2017.1609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Li & Li (2014).Li N, Li S. RASAL2 promotes lung cancer metastasis through epithelial-mesenchymal transition. Biochemical and Biophysical Research Communications. 2014;455:358–362. doi: 10.1016/j.bbrc.2014.11.020. [DOI] [PubMed] [Google Scholar]
  • Liu et al. (2020).Liu Y, Cheng G, Huang Z, Bao L, Liu J, Wang C, Xiong Z, Zhou L, Xu T, Liu D, Yang H, Chen K, Zhang X. Long noncoding RNA SNHG12 promotes tumour progression and sunitinib resistance by upregulating CDCA3 in renal cell carcinoma. Cell Death & Disease. 2020;11:515. doi: 10.1038/s41419-020-2713-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Luo, He & Qiu (2018).Luo YZ, He P, Qiu MX. FOSL1 enhances growth and metastasis of human prostate cancer cells through epithelial mesenchymal transition pathway. European Review for Medical and Pharmacological Sciences. 2018;22:8609–8615. doi: 10.26355/eurrev_201812_16624. [DOI] [PubMed] [Google Scholar]
  • Mahvi et al. (2018).Mahvi DA, Liu R, Grinstaff MW, Colson YL, Raut CP. Local cancer recurrence: the realities, challenges, and opportunities for new therapies. CA: A Cancer Journal for Clinicians. 2018;68:488–505. doi: 10.3322/caac.21498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Malhotra et al. (2016).Malhotra J, Malvezzi M, Negri E, La Vecchia C, Boffetta P. Risk factors for lung cancer worldwide. The European Respiratory Journal. 2016;48:889–902. doi: 10.1183/13993003.00359-2016. [DOI] [PubMed] [Google Scholar]
  • Martini et al. (1995).Martini N, Bains MS, Burt ME, Zakowski MF, McCormack P, Rusch VW, Ginsberg RJ. Incidence of local recurrence and second primary tumors in resected stage I lung cancer. The Journal of Thoracic and Cardiovascular Surgery. 1995;109:120–129. doi: 10.1016/s0022-5223(95)70427-2. [DOI] [PubMed] [Google Scholar]
  • Maurus et al. (2017).Maurus K, Hufnagel A, Geiger F, Graf S, Berking C, Heinemann A, Paschen A, Kneitz S, Stigloher C, Geissinger E, Otto C, Bosserhoff A, Schartl M, Meierjohann S. The AP-1 transcription factor FOSL1 causes melanocyte reprogramming and transformation. Oncogene. 2017;36:5110–5121. doi: 10.1038/onc.2017.135. [DOI] [PubMed] [Google Scholar]
  • McLaughlin et al. (2013).McLaughlin SK, et al. The RasGAP gene, RASAL2, is a tumor and metastasis suppressor. Cancer Cell. 2013;24:365–378. doi: 10.1016/j.ccr.2013.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • McPherson et al. (2020).McPherson I, Bradley NA, Govindraj R, Kennedy ED, Kirk AJB, Asif M. The progression of non-small cell lung cancer from diagnosis to surgery. European Journal of Surgical Oncology. 2020;46:1882–1887. doi: 10.1016/j.ejso.2020.08.013. [DOI] [PubMed] [Google Scholar]
  • Merritt et al. (2020).Merritt RE, Abdel-Rasoul M, Fitzgerald M, D’Souza DM, Kneuertz PJ. Nomograms for predicting overall and recurrence-free survival from pathologic stage IA and IB lung cancer after lobectomy. Clinical Lung Cancer. 2020 doi: 10.1016/j.cllc.2020.10.009. Epub ahead of print 2020 22 October. [DOI] [PubMed] [Google Scholar]
  • Molina et al. (2008).Molina JR, Yang P, Cassivi SD, Schild SE, Adjei AA. Non-small cell lung cancer: epidemiology, risk factors, treatment, and survivorship. Mayo Clinic Proceedings. 2008;83:584–594. doi: 10.4065/83.5.584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Pan et al. (2018).Pan Y, Tong JHM, Lung RWM, Kang W, Kwan JSH, Chak WP, Tin KY, Chung LY, Wu F, Ng SSM, Mak TWC, Yu J, Lo KW, Chan AWH, To KF. RASAL2 promotes tumor progression through LATS2/YAP1 axis of hippo signaling pathway in colorectal cancer. Molecular Cancer. 2018;17:102. doi: 10.1186/s12943-018-0853-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Qian et al. (2018).Qian W, Zhang Z, Peng W, Li J, Gu Q, Ji D, Wang Q, Zhang Y, Ji B, Wang S, Zhang D. CDCA3 mediates p21-dependent proliferation by regulating E2F1 expression in colorectal cancer. International Journal of Oncology. 2018;53:2021–2033. doi: 10.3892/ijo.2018.4538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ramos, Elong Edimo & Erneux (2018).Ramos AR, Elong Edimo W, Erneux C. Phosphoinositide 5-phosphatase activities control cell motility in glioblastoma: two phosphoinositides PI(4, 5)P2 and PI(3, 4)P2 are involved. Advances in Biological Regulation. 2018;67:40–48. doi: 10.1016/j.jbior.2017.09.001. [DOI] [PubMed] [Google Scholar]
  • Shaw et al. (2008).Shaw JP, Dembitzer FR, Wisnivesky JP, Litle VR, Weiser TS, Yun J, Chin C, Swanson SJ. Video-assisted thoracoscopic lobectomy: state of the art and future directions. The Annals of Thoracic Surgery. 2008;85:S705–S709. doi: 10.1016/j.athoracsur.2007.11.048. [DOI] [PubMed] [Google Scholar]
  • Shen et al. (2020).Shen S, Li K, Liu Y, Liu X, Liu B, Ba Y, Xing W. Silencing lncRNA AGAP2-AS1 upregulates miR-195-5p to repress migration and invasion of EC cells via the decrease of FOSL1 expression. Molecular Therapy. Nucleic Acids. 2020;20:331–344. doi: 10.1016/j.omtn.2019.12.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Siegel et al. (2021).Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer Statistics, 2021. CA: A Cancer Journal for Clinicians. 2021;71:7–33. doi: 10.3322/caac.21654. [DOI] [PubMed] [Google Scholar]
  • Talotta, Casalino & Verde (2020).Talotta F, Casalino L, Verde P. The nuclear oncoprotein Fra-1: a transcription factor knocking on therapeutic applications’ door. Oncogene. 2020;39:4491–4506. doi: 10.1038/s41388-020-1306-4. [DOI] [PubMed] [Google Scholar]
  • Vallejo et al. (2017).Vallejo A, Perurena N, Guruceaga E, Mazur PK, Martinez-Canarias S, Zandueta C, Valencia K, Arricibita A, Gwinn D, Sayles LC, Chuang C-H, Guembe L, Bailey P, Chang DK, Biankin A, Ponz-Sarvise M, Andersen JB, Khatri P, Bozec A, Sweet-Cordero EA, Sage J, Lecanda F, Vicent S. An integrative approach unveils FOSL1 as an oncogene vulnerability in KRAS-driven lung and pancreatic cancer. Nature Communications. 2017;8:14294. doi: 10.1038/ncomms14294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Vallejo et al. (2021).Vallejo A, Erice O, Entrialgo-Cadierno R, Feliu I, Guruceaga e, Perugorria MJ, Olaizola P, Muggli A, Macaya I, O’Dell M, Ruiz-Fernandez de Cordoba B, Ortiz-Espinosa S, Hezel AF, Arozarena I, Lecanda F, Avila MA, Fernandez-Barrena MG, Evert M, Ponz-Sarvise M, Calvisi DF, Banales JM, Vicent S. FOSL1 promotes cholangiocarcinoma via transcriptional effectors that could be therapeutically targeted. Journal of Hepatology. 2021;75:363–376. doi: 10.1016/j.jhep.2021.03.028. [DOI] [PubMed] [Google Scholar]
  • Venkatareddy et al. (2011).Venkatareddy M, Cook L, Abuarquob K, Verma R, Garg P. Nephrin regulates lamellipodia formation by assembling a protein complex that includes Ship2, filamin and lamellipodin. PLOS ONE. 2011;6:e28710. doi: 10.1371/journal.pone.0028710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wain, Westwick & Ward (2005).Wain CM, Westwick J, Ward SG. Heterologous regulation of chemokine receptor signaling by the lipid phosphatase SHIP in lymphocytes. Cellular Signalling. 2005;17:1194–1202. doi: 10.1016/j.cellsig.2004.12.009. [DOI] [PubMed] [Google Scholar]
  • Wang et al. (2019).Wang X, Qian C, Yang Y, Liu M-Y, Ke Y, Qian Z-M. Phosphorylated Rasal2 facilitates breast cancer progression. EBioMedicine. 2019;50:144–155. doi: 10.1016/j.ebiom.2019.11.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wang, Yin & Yang (2019).Wang X, Yin X, Yang Y. Rasal2 suppresses breast cancer cell proliferation modulated by secretory autophagy. Molecular and Cellular Biochemistry. 2019;462:115–122. doi: 10.1007/s11010-019-03615-7. [DOI] [PubMed] [Google Scholar]
  • Wang et al. (2020).Wang Y, Zou S, Zhao Z, Liu P, Ke C, Xu S. New insights into small-cell lung cancer development and therapy. Cell Biology International. 2020;44:1564–1576. doi: 10.1002/cbin.11359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wu et al. (2020).Wu B, Huang Y, Luo Y, Ma A, Wu Z, Gan Y, Xu Y, Xu R. The diagnostic and prognostic value of cell division cycle associated gene family in Hepatocellular Carcinoma. Journal of Cancer. 2020;11:5727–5737. doi: 10.7150/jca.46554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Xiong et al. (2021).Xiong J, Xing S, Dong Z, Niu L, Xu Q, Li Y, Liu P, Yang P. miR-654-3p suppresses cell viability and promotes apoptosis by targeting RASAL2 in non-small-cell lung cancer. Molecular Medicine Reports. 2021;23:124. doi: 10.3892/mmr.2020.11763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Xu, Wang & Liu (2020).Xu Y, Wang L, Liu J. miR-130a inhibits cell viability and migration through FOSL1 in lung cancer. Journal of Biological Regulators and Homeostatic Agents. 2020 doi: 10.23812/20-13L. Epub ahead of print 2020 30 April. [DOI] [PubMed] [Google Scholar]
  • Yang et al. (2020).Yang B, Rao W, Luo H, Zhang L, Wang D. Relapse-related molecular signature in early-stage lung adenocarcinomas based on base excision repair, stimulator of interferon genes pathway and tumor-infiltrating lymphocytes. Cancer Science. 2020;111:3493–3502. doi: 10.1111/cas.14570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yin et al. (2020).Yin XH, Yu L-P, Zhao X-H, Li Q-M, Liu X-P, He Li. Development and validation of a 4-gene combination for the prognostication in lung adenocarcinoma patients. Journal of Cancer. 2020;11:1940–1948. doi: 10.7150/jca.37003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yu et al. (2012).Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics: A Journal of Integrative Biology. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Yu et al. (2020).Yu J, Hua R, Zhang Y, Tao R, Wang Q, Ni Q. DNA hypomethylation promotes invasion and metastasis of gastric cancer cells by regulating the binding of SP1 to the CDCA3 promoter. Journal of Cellular Biochemistry. 2020;121:142–151. doi: 10.1002/jcb.28993. [DOI] [PubMed] [Google Scholar]
  • Yuan et al. (2019).Yuan M, Huang LL, Chen JH, Wu J, Xu Q. The emerging treatment landscape of targeted therapy in non-small-cell lung cancer. Signal Transduction and Targeted Therapy. 2019;4:61. doi: 10.1038/s41392-019-0099-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zappa & Mousa (2016).Zappa C, Mousa SA. Non-small cell lung cancer: current treatment and future advances. Translational Lung Cancer Research. 2016;5:288–300. doi: 10.21037/tlcr.2016.06.07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhang et al. (2019a).Zhang G, et al. An appraisal of nodule diagnosis for lung cancer in CT images. Journal of Medical Systems. 2019a;43:181. doi: 10.1007/s10916-019-1327-0. [DOI] [PubMed] [Google Scholar]
  • Zhang et al. (2018).Zhang W, et al. CDCA3 promotes cell proliferation by activating the NF- κB/cyclin D1 signaling pathway in colorectal cancer. Biochemical and Biophysical Research Communications. 2018;500:196–203. doi: 10.1016/j.bbrc.2018.04.034. [DOI] [PubMed] [Google Scholar]
  • Zhang et al. (2019b).Zhang Y, Yin W, Cao W, Chen P, Bian L, Ni Q. CDCA3 is a potential prognostic marker that promotes cell proliferation in gastric cancer. Oncology Reports. 2019b;41:2471–2481. doi: 10.3892/or.2019.7008. [DOI] [PubMed] [Google Scholar]
  • Zhang et al. (2020).Zhang YJ, Yang W-J, Liu D, Cao Y-Q, Zheng Y-Y, Han Y-C, Jin R-S, Han Y, Wang X-Y, Pan A-S, Dai J-Y, Sun Q-F, Zhao F-Q, Yang Q-Y, Zhang J-H, Liu S-J, Da Q, Guo W, Li C-Q, Zhang W-T, Wu H, Chen X-S, Ji A-Q, Xiang J, Chen K, Feng X-J, Zhang X-F, Cao Q-Q, Qin L, Li J, Zhou M, Lu Y, Wang C-F, Yan F-H, Li H-C, Qu J-M. COVID-19 and early-stage lung cancer both featuring ground-glass opacities: a propensity score-matched study. Translational Lung Cancer Research. 2020;9:1516–1527. doi: 10.21037/tlcr-20-892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhou et al. (2019).Zhou B, Zhu W, Jiang X, Ren C. RASAL2 plays inconsistent roles in different cancers. Frontiers in Oncology. 2019;9:1235. doi: 10.3389/fonc.2019.01235. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Information 1. Supplemental Tables.
DOI: 10.7717/peerj.11923/supp-1
Supplemental Information 2. Validation results of GSE30219 dataset.
DOI: 10.7717/peerj.11923/supp-2
Supplemental Information 3. Normalization of GSE31210 .
DOI: 10.7717/peerj.11923/supp-3
Supplemental Information 4. Sample R code used in the analysis and validation process.

The core code in the process of data analysis and verification (the partially redundant code related to basic data collation is not provided).

DOI: 10.7717/peerj.11923/supp-4

Data Availability Statement

The following information was supplied regarding data availability:

The R code and related files are available in the Supplemental Files.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES