Abstract
Background
Gastric cancer is one of the most lethal tumors and is characterized by poor prognosis and lack of effective diagnostic or therapeutic biomarkers. The aim of this study was to find hub genes serving as biomarkers in gastric cancer diagnosis and therapy.
Methods
GSE66229 from Gene Expression Omnibus (GEO) was used as training set. Genes bearing the top 25% standard deviations among all the samples in training set were performed to systematic weighted gene co-expression network analysis (WGCNA) to find candidate genes. Then, hub genes were further screened by using the “least absolute shrinkage and selection operator” (LASSO) logistic regression. Finally, hub genes were validated in the GSE54129 dataset from GEO by supervised learning method artificial neural network (ANN) algorithm.
Results
Twelve modules with strong preservation were identified by using WGCNA methods in training set. Of which, five modules significantly related to gastric cancer were selected as clinically significant modules, and 713 candidate genes were identified from these five modules. Then, ADIPOQ, ARHGAP39, ATAD3A, C1orf95, CWH43, GRIK3, INHBA, RDH12, SCNN1G, SIGLEC11 and LYVE1 were screened as the hub genes. These hub genes successfully differentiated the tumor samples from the healthy tissues in an independent testing set through artificial neural network algorithm with the area under the receiver operating characteristic curve at 0.946.
Conclusions
These hub genes bearing diagnostic and therapeutic values, and our results may provide a novel prospect for the diagnosis and treatment of gastric cancer in the future.
Keywords: Gastric cancer, Weighted gene co-expression network analysis, WGCNA, LASSO regression, Supervised machine learning
Background
Gastric carcinoma remains the fifth most frequently diagnosed cancer and the third leading cause of cancer-related deaths, with an estimated 1,033,701 new cases and 782,685 deaths worldwide in 2018 (Bray et al., 2018; Pormohammad et al., 2018). Gastric cancer is also one of the most common malignancies and the third leading cause of death in China, where 427,100 cases with 301,200 deaths were observed in 2013 (Chen et al., 2017b). Despite the several existing treatments including chemo-, radio-, or targeted therapy, the overall 5-year survival rate of stomach cancer patients is still <20% (Raimondi et al., 2018).
There are two types of gastric cancer, diffuse and intestinal types, which differ in their histological manifestations, epidemiological features and etiologic pathogenesis (Huang et al., 2019). Histopathology is the gold standard approach for diagnosing gastric cancer; however, this approach is not suitable for everyone due to the invasive nature of the biopsy (Yoon & Kim, 2015). Although there are several commonly used serum biomarkers such as alpha-fetoprotein (AFP), carcinoembryonic antigen (CEA), cancer antigen 125 (CA125), and cancer antigen 19–9 (CA19-9) (He et al., 2013) for gastric cancer diagnosis, none of them are sensitive or gastric-cancer-specific (Smyth et al., 2016). Moreover, effective and specific targeted therapies for gastric cancer remain to be identified. Presently, the major treatment strategies for gastric cancer are anti-human epidermal growth factor receptor 2 (HER2) and anti-vascular therapies (Raimondi et al., 2018). However, resistance to the targeted agents is common in some gastric tumor types. Therefore, novel practical approaches are needed for specific diagnosis and effective treatment of gastric cancer. Accordingly, identification of the key genes and biomarkers that are involved in the pathogenesis of gastric cancer is of paramount significance.
With recent advancements in bioinformatics methods, comprehensive identification of potential biomarkers through large-scale screening of expression profiles has been proposed (Li et al., 2018a; Takeno et al., 2008; Wang et al., 2014; Zeng et al., 2019). A weighted gene co-expression network analysis (WGCNA) approach provides a systematic analysis to investigate the functional clustering of expression profiles, based on the theory that genes with similar expression profiles may have closely functional linkages and/or pathways (Carlson et al., 2006; Carter et al., 2004; Zhou et al., 2018). This approach groups highly co-expressed genes into the same module. Modules bearing high correlation with certain clinical traits are identified as clinically significant modules (Zhou et al., 2018).
By using this systematic bioinformatic method, followed by the “least absolute shrinkage and selection operator” (LASSO) logistic regression, a suitable method for high-dimensional gene data analysis (Friedman, Hastie & Tibshirani, 2010; Zeng et al., 2019), candidate variables were selected from clinically significant modules. Finally, supervised artificial neural network (ANN) method was performed to test the reliability of the results in an independent dataset. ANN approach has been widely used in the prediction of cancer diagnosis, staging and recurrence since the mid-1990s (Hu et al., 2013), which is an useful method to incorporate and analyze large amounts of omics and health-care data (Ngiam & Khor, 2019).
Consequently, we attempt to construct a co-expression network by using systematic WGCNA method followed by LASSO regression to identify hub genes, which could effectively discriminate cancer samples from normal tissue. These findings may provide potential diagnostic and therapeutic targets in future research and clinical intervention of gastric cancer.
Material and Methods
Data collection and preprocessing
The workflow of this study is shown in Fig. 1. Raw expression datasets were downloaded from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/) by using the keywords “stomach/gastric cancer/tumor/carcinoma”, “normal”, “GPL570”, and “Homo sapiens”. Our inclusion criteria for the training set were that: (1) datasets based on the Affymetrix Human Genome U133 Plus 2.0 Array Platform (Affymetrix, Santa Clara, CA, USA) and (2) datasets derived from human case-control studies, with gastric tumor patients as the case group, regardless of the histopathological types and stages, and non-tumor individuals as the control group.
Therefore, only two datasets (GSE66229 (Cristescu et al., 2015; Oh et al., 2018) and GSE54129) as of October 10, 2020 met the screening criteria. GSE66229 contained 300 tumor and 100 normal samples, and was used as the training set to screen for the hub genes. GSE54129 (contained 111 tumor and 21 normal samples) served as an independent testing set to validate the hub genes.
All the analyses in this study were conducted using R software (version 3.5.1). FitPLM weight, Relative Log Expression (RLE), Normalized Unscaled Standard Errors (NUSE), and RNA degradation images were analyzed to evaluate the quality of each dataset. Then, the “rma” function with the default parameters of the “affy” package was used to perform background correction and normalization (Gautier et al., 2004). Missing values in each dataset were imputed by using the function “impute.knn” with the default parameters of the “impute” package (Hastie & Narasimhan, 2001). Platform annotations were downloaded from the GEO database, and finally, the gene symbol expression matrices were acquired from each dataset for further analyses.
Weighted gene co-expression network construction
Weighted gene co-expression network in the training set was constructed using the “WGCNA” package (Langfelder & Horvath, 2008; Zhang & Horvath, 2005). The genes with the top 25% SD among all the 400 samples in the expression matrix of the training set were selected as the input genes (5,115 genes in total).
In brief, first, the appropriate soft-thresholding power (β) was selected by using the “pickSoftThreshold” function with the default parameters (herein, β = 4). Subsequently, the Pearson’s correlation matrix was calculated to evaluate the similarity among all the pair-wise genes by using the “cor” function with the default parameters. Then, the adjacency was calculated based on β and the Pearson’s correlation matrix by using the “TOMsimilarity” function with the default parameters, and the corresponding dissimilarity (dissTOM) was also calculated. Finally, average linkage hierarchical clustering was conducted according to the dissTOM value with a minimum size of 30 for each gene dendrogram.
Module eigengenes (MEs), considered the first principal component (PC) of gene expression patterns of a corresponding module, were obtained for each module. To further strengthen the reliability of the modules, a cut line was set at 0.25 so that modules bearing <0.25 would be merged (Chen et al., 2017a).
Module preservation analysis
To evaluate the stability of the modules in the training set, GSE13911 was used to validate the module preservation of the training set (Chen et al., 2018; Neidlin, Dimitrakopoulou & Alexopoulos, 2019; Obeidat et al., 2017). Preservation analysis for GSE13911 was performed using the “modulePreservation” function by setting referenceNetworks = 2, nPermutations = 200, randomSeed = 1, and verbose = 3, maxModuleSize = 3000 and maxGoldModuleSize = 3000 in “WGCNA” package. The Z summary scores of each module were calculated to indicate the module preservations. Z summary scores <2, [2–10], and >10 indicated that the modules had no, moderate, and strong preservation, respectively (Liu et al., 2019b; Lou et al., 2017). The grey module contained the genes that did not belong to any of the other modules, and a gold module was generated for statistical purposes. Therefore, these two modules were not shown in the preservation analysis results (Langfelder et al., 2011b).
Identification of clinically significant modules
Herein, the most interesting clinical trait was the tissue type, which was designated as tumor or normal samples. We calculated log10 transformation of the P-value in the logistic regression between the MEs and clinical trait. Modules with log10 transformation of the P-value greater than 10 were considered to be closely correlated with tissue types.
Functional enrichment and pathway analyses of significant modules
To determine whether the clinically significant modules were closely correlated with gastric cancer, GO functional annotation and KEGG pathway analyses were performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID) (version 6.8) (https://david.ncifcrf.gov/home.jsp) (Ashburner et al., 2000; Dennis et al., 2003). The visualization of the functional enrichment and pathway analyses was performed by the “GOplot” (Robin et al., 2011) and “ggplot2” (Wickham, 2016) packages of R, respectively.
Candidate gene selection
Candidate genes among clinically significant modules were selected according to the following criteria: (1) differentially expressed genes (DEGs) between gastric cancer samples and normal samples with —log2FC (Fold Change)—>1 and adjusted P-value <0.05 based on the “limma” package (Ritchie et al., 2015), (2) high module membership (defined as the correlation between the expression of each genes and MEs) —MM(Module Membership)—>0.8.
Selection of hub genes by LASSO logistic regression analysis
Candidate genes were subjected to LASSO regression, which was performed using the “glmnet” package by setting alpha = 1, and ten-fold cross-validation for tuning parameter selection. Lambda was defined as the minimum partial likelihood deviance (Friedman, 2010).
Validation of hub genes
The machine learning method of ANN (by using the “neuralnet” function with hidden = 2) (Fritsch et al., 2019) was performed to determine whether the hub genes could correctly distinguish the gastric cancer samples from the normal samples in a testing set. Moreover, in order to demonstrate whether these hub genes could specifically distinguish between gastric tumor and normal samples, we also evaluated the predictive effects of 11-gene model in pancreatic cancer (GSE15471) (Badea et al., 2008) and colorectal cancer (GSE37364 excluding adenoma samples) (Galamb et al., 2012) by using ANN algorithm.
Areas under the receiver operating characteristic (ROC) curve was calculated to show the predictive effect of supervised machine learning model, and then the ROC curves were plotted using the “pROC” package (Robin et al., 2011). An area under the curve (AUC) value between 0.8 and 0.9 is considered an excellent classification, while greater than 0.9 is considered as outstanding discrimination (Lemeshow, 2000).
Results
Construction of co-expression networks
After the quality check of the input data, no sample was removed (Fig. S1); herein, two clinical traits (tissue type and stage) are presented. According to different tissue types (tumor or normal), the 400 samples could be mainly divided into two clusters.
As shown in Fig. 2, the soft thresholding was set at 4, while the scale-free topology fit index reached 0.89, indicating approximate scale-free topology. Co-expressed gene modules were identified with the dynamic tree cut method (Fig. 3A) (Chen et al., 2017a). In total, 12 modules were found, and each color represented one module (Fig. 3B). The biggest module was the turquoise module, which contained 2,120 genes, followed by the blue module, bearing 1,503 genes. The grey module comprised 2 genes, which did not have a similar expression pattern and did not belong to any other module.
Module preservation analysis
5,115 genes in GSE13911 clustered into 11 colored modules (grey module only contained 2 genes, and did not show in Fig. 4), as determined in the training set. All gene modules were found to bear strong conservation, as the Z summary scores were all >10 (Fig. 4B).
Selection of clinically significant modules
After the assessment of the relationship by using regression analysis between the MEs and clinical traits, the log10 transformation of the P-value was shown in Fig. 5. Accoring to the screening criteria, there were 5 modules closely related to tissue types, which were black, turquoise, greenyellow, salmon and blue modules. These moduels were selected for further analysis.
Functional enrichment of clinically significant modules
Gene Ontology (GO) enrichment results showed that 1503 genes in the blue module, 137 genes in the black module, 2,120 genes in the turquoise module, 61 in the salmon module and 194 genes in the greenyellow module mainly participated in 141, 15, 221, 54 and 146 different significant biological processes, respectively (Table S1 to Table S5). The top three most significantly enriched biological processes were cell division (P = 2.29e−29), G1/S transition of the mitotic cell cycle (P = 6.32e−21), mitotic nuclear division (P = 3.49e−19) in the blue module (Fig. 6A), and potassium ion import (P = 3.39e−05), digestion (P = 8.25e−04), multicellular organismal water homeostasis (P = 0.0012) in the black module (Fig. 6B), and extracellular matrix organization (P = 6.30e−06), positive regulation of cell migration (P = 7.57e−06), axon guidance in the turquoise module (P = 9.18e−06) (Fig. 6C), and inflammatory response (P = 2.80e−21), immune response (P = 5.70e−17), neutrophil chemotaxis (P = 5.15e−15) in the greenyellow module (Fig. 6D), and defense response to virus (P = 3.33e−25), type I interferon signaling pathway (P = 8.52e−23), response to virus (P = 9.02e−16) in salmon module (Fig. 6E).
KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis showed that genes in the blue, black, turquoise, salmon and greenyellow modules were mainly significantly enriched in 15, 7, 35,14 and 23 pathways, respectively (from Table S6 to Table S10). The top three most significantly enriched pathways were cell cycle (P = 1.21e−15), DNA replication (P = 6.21e−15), the p53 signaling pathway (P = 2.03e−06) in blue module (Fig. 7A), and gastric acid secretion (P = 4.46e−05), protein digestion and absorption (P = 1.28e−04), drug metabolism-cytochrome P450 (P = 0.0032) in black moule (Fig. 7B), and focal adhesion (P = 1.49e−07), arrhythmogenic right ventricular cardiomyopathy (P = 2.27e−05), complement and coagulation cascades (P = 3.58e−05) in turquoise module (Fig. 7C), and cytokine-cytokine receptor interaction (P = 2.35e−08), amoebiasis (P = 3.30e−05), and rheumatoid arthritis (P = 5.65e−05) in greenyellow module (Fig. 7D). While in the salmon module, influenza A (P = 1.09e−09), measles (P = 9.44e−07), and herpes simplex infection (P = 7.99e−06) (Fig. 7E) were the most significantly enriched pathways.
Identification of hub genes
Using the screening criteria of —MM —>0.8, 926 genes were identified from 5 significant modules, of which 713 genes were differentially expressed between the normal and tumor samples with —logFC—>1 and adjusted P-value <0.05 (all the 713 candidate genes were listed in Table S11). Finally, 11 genes [Adiponectin (ADIPOQ); Rho GTPase activating protein 39 (ARHGAP39); ATPase family AAA-domain containing protein 3A (ATAD3A); C1orf95 (also known as STUM gene); Cell wall biogenesis 43 C-terminal homolog (CWH43); Glutamate receptor, ionotropic kainate 3 (GRIK3); Inhibin subunit beta A (INHBA); sodium channel epithelial 1 subunit gamma (SCNN1G); Sialic acid-binding immunoglobulin-like lectin-11 (SIGLEC11); Retinol dehydrogenase 12 (RDH12) and lymphatic vessel endothelial hyaluronan receptor 1 (LYVE1)] were identified as the hub genes by using LASSO logistic regression (Table 1). The heatmap of these 11 hub genes were shown in Fig. 8, indicating that these 11 hub genes differentially expressed between tumor and normal samples.
Table 1. LASSO regression results.
Gene | Coefficient | Odds ratio |
---|---|---|
ADIPOQ | −0.16241431554 | 0.8500889265 |
ARHGAP39 | 1.14238882470 | 3.134246595 |
ATAD3A | 0.85913784838 | 2.361124169 |
C1orf95 | −1.99447221881 | 0.1360854586 |
CWH43 | −0.05325279770 | 0.9481402945 |
GRIK3 | −4.02900881511 | 0.01779195633 |
INHBA | 0.19110414846 | 1.210585526 |
LYVE1 | −0.01065353474 | 0.9894030132 |
RDH12 | −0.09270793497 | 0.9114596669 |
SCNN1G | −0.02709362847 | 0.9732701115 |
SIGLEC11 | −0.17337880264 | 0.84081905 |
Validation of the hub genes
GSE54129 was utilized as the testing set to validate the 11-gene model. The AUC value of this classifier upon using artificial neural network was 0.946 indicating the excellent classification effects of the model (Fig. 9A). Furthermore, both the AUC values of this 11-gene model were around 0.5 in colorectal cancer (Fig. 9B) and pancreatic cancer (Fig. 9C), indicating specifically predictive effect in gastric cancer.
Discussion
In this study, five modules were identified as clinically significant and preserved modules by using WGCNA. The GO and KEGG analyses revealed that the genes in these four modules were significantly enriched in the biological processes of the cell cycle, cell division, and stomach-related functions. All these biological functions are closely related to gastric cancer (Cao et al., 2018; Waldum, Sagatun & Mjones, 2017). Eventually, 11 hub genes including ADIPOQ, ARHGAP39, ATAD3A, C1orf95, CWH43, GRIK3, INHBA, RDH12, SCNN1G, SIGLEC11 and LYVE1 were screened by using WGCNA method followed by LASSO regression. Then, artificial neural network algorithms were performed, and demonstrated that this 11-gene model could effectively discriminate between gastric cancer and normal tissues.
In preservation analysis, Z summary is used to assess the significance of observed statistics and is defined as the mean of Z scores computed for density and connectivity measures (Lou et al., 2017). When density and connectivity based preservation statistics are important factors for judging the preservation of a network, Z summary score was preferentially selected to evaluate the preservation (Langfelder et al., 2011b). Although turquoise, blue and greenyellow had low preservation according to the median rank results, they are still considered to be conserved since the Z summary scores of these modules were 39, 52 and 22, respectively. Black module has relatively high Z summary score and low median rank, indicating high preservation. In the current study, all the four gene modules were considered to be conserved and selected for further analysis.
INHBA, Inhibin- βA (INHBA), a ligand belonging to the transforming growth factor- β superfamily (Oshima et al., 2014), is associated with cell proliferation in various tumor types including colon adenocarcinoma (Lin et al., 2020a; Miao et al., 2020; Miyamoto et al., 2020), pancreatic cancer (Liu et al., 2020), gastric cancer (Chen et al., 2019) as well as oral squamous cell carcinoma (Lin et al., 2020b). Many studies have demonstrated the prognostic role of INHBA in colon adenocarcinoma (Chen et al., 2020; Li et al., 2020; Miao et al., 2020; Sun et al., 2020), and the role of INHBA in gastric cancer has been widely reported also. INHBA was highly expressed (Kaneda et al., 2011; Seeruttun et al., 2019; Zhang et al., 2010) and aberrantly methylated (Zhang et al., 2019) in gastric tumor samples, and high INHBA expression was associated with significantly poorer 5-year survival than low expression group (Katayama et al., 2017; Wang et al., 2012). One study has demonstrated that INHBA gene silencing could inhibit gastric cancer cell migration and invasion by impeding TGF- β signaling pathway (Chen et al., 2019).
ADIPOQ is one of the most important adipocytokines secreted by adipocytes (Parida, Siddharth & Sharma, 2019), and the polymorphisms of ADIPOQ have been reported to correlate with several types of cancer, including colorectal (Nimptsch et al., 2017; Tan et al., 2017) and breast (Mendez-Hernandez et al., 2017) cancers. A study focusing on the molecular mechanisms ADIPOQ participated in has revealed that ADIPOQ induces cytotoxic autophagy in breast cancer cells through STK11/LKB1-mediated activation of the AMPK-ULK1 axis (Chung et al., 2017). Another study has reported that miR-370 inhibits the proliferation, invasion, and epithelial-mesenchymal transition of gastric cells by directly downregulating receptor 4 of ADIPOQ (Feng et al., 2018). Overexpression of another microRNA, miR-15b-5p, promotes the metastasis of gastric cancer by regulating ADIPOQ receptor 3 (Zhao et al., 2017).
ATAD3A is a nuclear-encoded mitochondrial enzyme, involving in mitochondrial dynamics, cell death, and cholesterol metabolism (Teng, Lang & Shay, 2019). It has been reported to correlate with hepatocellular carcinoma (Liu et al., 2019a) and breast cancer (Daniel et al., 2019), and it might be an effective therapeutic target in cancer treatment (Teng et al., 2016). ATAD3A is differentially expressed between paclitaxel-resistant and -sensitive MCF7 breast cancer cells (Daniel et al., 2019). A study has revealed that ATAD3A is upregulated in hepatocellular carcinoma and ATAD3A upregulation is correlated with poor prognosis (Liu et al., 2019a).
LYVE1 acts as a receptor and binds to both soluble and immobilized hyaluronan (Banerji et al., 2016), may function in lymphatic hyaluronan transport and tumor metastasis (Wu et al., 2019). The dysregulation of LYVE1 closely correlated with many types of tumor, like gastric cancer (Ozmen et al., 2011), colorectal cancer (Gao et al., 2006), breast cancer (Kato et al., 2005), lung cancer (Koukourakis et al., 2005) and liver cancer (Mouta Carreira et al., 2001). LYVE1 has been studied extensively for its possible role in cancer diagnosis and prognosis in cancer. One study has demonstrated that LYVE1 was upregulated in gastric cancer, and overexpression of LYVE1 positively correlated with perineural invasion and lymph node in gastric cancers (Ozmen et al., 2011). And the expression of LYVE1 might be a biomarker to predict the existence of regional lymph node metastasis in early gastric cancer (Fujimoto et al., 2007).
RDH12, an NADPH-dependent retinal reductase, catalyzes the reduction of all-trans retinal to all-trans retinol (Belyaeva et al., 2005). It was significantly decreased in gastric tumor samples (Kropotova et al., 2013) and cervical squamous cell carcinoma samples (Peng et al., 2015). RDH12 was also one of the differentially expressed metabolism-related genes, and correlated with the prognosis of gastric cancer patients (Wen et al., 2020).
GRIK3 mainly participates in the neuroactive ligand–receptor interaction pathway, and GRIK3 upregulation is associated with poor survival in gastric cancer (Gong et al., 2017). GRIK3 promotes epithelial-mesenchymal transition by regulating the SPDEF/CDH1 signaling in breast cancer cells (Xiao et al., 2019).
There have been few studies focusing on the relationship between gastric cancer and SCNN1G, ARHGAP39, Clorf95, CWH43 or SIGLEC11. SCNN1G is one of the genes significantly upregulated in Ewing’s sarcoma and fibromatosis samples (Sarver et al., 2015). ARHGAP39 mutations or variations in copy number or expression level were found in several types of tumor-like tissues from the central nervous system, skin, prostate, and gastrointestinal tract (Nowak, 2018). ARHGAP39 interacts with p53 and BAX, and decreased expression of ARHGAP39 increases cell proliferation, leading to tumorigenesis (Jones, 2017). Clorf95 is one of the uncharacterized proteins correlated with diverse human cancers (Delgado et al., 2014). Another study focusing on scleroderma patients demonstrated the involvement of Clorf95 in cancer incidence (Xu et al., 2016). Sialic acid-binding immunoglobulin-like lectin-11 (SIGLEC11) is a primate-lineage–specific receptor of human tissue macrophages, and it is also expressed in brain microglia (Angata et al., 2002; Shahraz et al., 2015). A missense mutation of SIGLEC11 has been detected in pancreatic cancer patients (Jones et al., 2008), and SIGLEC11 was significantly upregulated in the poor prognostic group of pancreatic cancer patients (Stratford et al., 2010). CWH43 was correlated with tumorigenesis in thyrotropin-secreting pituitary adenomas (Sapkota et al., 2017). One meta-analysis also showed that CWH43 was differentially expressed between colorectal cancer and normal (Chu et al., 2014) samples.
All these results from the previous studies demonstrate that the hub genes identified in our study are closely correlated with gastric cancer and play important roles in cancer development, progression, or proliferation.
The significant module and hub genes identified in this study are biologically rational. First, the clinically significant module identified in our study bears strong preservation, implying that this clinically significant module is conservative and could also be reproduced in other datasets. Further, it suggests that that modules constructed by WGCNA are reliable. Second, most of the genes in the significant module were enriched for specific GO terms and KEGG pathways closely relating to stomach or cancer physiology. For instance, GO analysis demonstrated that most of the genes in the clinically significant modules were closely related to digestion, carbohydrate metabolic process, and gastric acid secretion, as well as cell division and cell cycle. KEGG enrichment analysis also indicated that most of the genes in the clinically significant module were implicated in gastric acid secretion, protein digestion and absorption, as well as glycerolipid metabolism and the p53 signaling pathway. Third, all the hub genes identified in our study had previously been reported to relate to cancer. Moreover, several hub genes are implicated in metabolic processes, influencing the development and progression of gastric cancer. A previous study has demonstrated the association between metabolic syndrome and gastric cancer (Li et al., 2018b). A study has detected increased fatty acid oxidation in gastric cancer (Lee et al., 2019), and adipocytes fuel gastric cancer by mediating fatty acid metabolism (Tan et al., 2018). It may thus be inferred that these genes are genuinely the hub genes in charge of the key processes in gastric cancer, and they deserve a deeper analysis and validation. Finally, by using machine learning methods, the hub genes were demonstrated to effectively discriminate the gastric tumor samples from normal samples. In our study, the predictive effects of ANN method was evaluated by AUC values (Huang & Ling, 2005). Herein, the AUC value was >0.8, indicating the excellent predictive results. Furthermore, these 11-gene model might be the specific predictors for gastric cancer, since the AUC values of this predictive model were less than 0.8 in other tumor types including, colorectal cancer and pancreatic cancer. All the results indicated that the expression profiles of these 11 hub genes have excellent predictive effects when discriminating gastric cancer samples from normal samples.
However, our study has limitations. First, all the hub genes were identified and validated only through bioinformatics, and further exploration of the biological functions and molecular mechanisms of these hub genes both in vitro and in vivo is required. Second, due to the limited availability of the data, we did not differentiate between intestinal-type and diffuse-type gastric cancers. More data are needed to analyze and identify the hub genes between these two types of gastric cancer and normal samples.
In summary, through WGCNA, we identified 11 hub genes, which might serve as potential diagnostic and/or therapeutic biomarkers for gastric cancer. Profile data mining by bioinformatics analysis is an available method to find potential diagnostic or therapeutic biomarkers systematically. Nevertheless, further investigations about the molecular mechanisms in which these hub genes are involved are still needed to verify the involvement of these genes in gastric cancer. Our findings provide a better understanding of the molecular mechanisms and putative diagnostic or therapeutic biomarkers for gastric cancer.
Supplemental Information
Funding Statement
This work was supported by the Science & Technology Department of Sichuan Province funding project (No. 2016FZ0108, 2017FZ0104), the 1.3.5 project for disciplines of excellence, West China Hospital, Sichuan University (ZYJC18010). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare there are no competing interests.
Author Contributions
Chunyang Li conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.
Haopeng Yu performed the experiments, analyzed the data, prepared figures and/or tables, and approved the final draft.
Yajing Sun performed the experiments, analyzed the data, prepared figures and/or tables, performed the double check of the results, and approved the final draft.
Xiaoxi Zeng conceived and designed the experiments, performed the experiments, analyzed the data, authored or reviewed drafts of the paper, and approved the final draft.
Wei Zhang conceived and designed the experiments, authored or reviewed drafts of the paper, and approved the final draft.
Data Availability
The following information was supplied regarding data availability:
R code and training set, pre-derivation analysis, and testing set raw data are available in the Supplemental Files.
References
- Angata et al. (2002).Angata T, Kerr SC, Greaves DR, Varki NM, Crocker PR, Varki A. Cloning and characterization of human Siglec-11. A recently evolved signaling molecule that can interact with SHP-1 and SHP-2 and is expressed by tissue macrophages, including brain microglia. Journal of Biological Chemistry. 2002;277:24466–24474. doi: 10.1074/jbc.M202833200. [DOI] [PubMed] [Google Scholar]
- Ashburner et al. (2000).Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nature Genetics. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Badea et al. (2008).Badea L, Herlea V, Dima SO, Dumitrascu T, Popescu I. Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia. Hepatogastroenterology. 2008;55:2016–2027. [PubMed] [Google Scholar]
- Banerji et al. (2016).Banerji S, Lawrance W, Metcalfe C, Briggs DC, Yamauchi A, Dushek O, Van der Merwe PA, Day AJ, Jackson DG. Homodimerization of the lymph vessel endothelial receptor LYVE-1 through a redox-labile disulfide is critical for hyaluronan binding in lymphatic endothelium. Journal of Biological Chemistry. 2016;291:25004–25018. doi: 10.1074/jbc.M116.736926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Belyaeva et al. (2005).Belyaeva OV, Korkina OV, Stetsenko AV, Kim T, Nelson PS, Kedishvili NY. Biochemical properties of purified human retinol dehydrogenase 12 (RDH12): catalytic efficiency toward retinoids and C9 aldehydes and effects of cellular retinol-binding protein type I (CRBPI) and cellular retinaldehyde-binding protein (CRALBP) on the oxidation and reduction of retinoids. Biochemistry. 2005;44:7035–7047. doi: 10.1021/bi050226k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bray et al. (2018).Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global Cancer Statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. A Cancer Journal for Clinicians. 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
- Cao et al. (2018).Cao J, Wu N, Han Y, Hou Q, Zhao Y, Pan Y, Xie X, Chen F. DDX21 promotes gastric cancer proliferation by regulating cell cycle. Biochemical and Biophysical Research Communications. 2018;505:1189–1194. doi: 10.1016/j.bbrc.2018.10.060. [DOI] [PubMed] [Google Scholar]
- Carlson et al. (2006).Carlson MR, Zhang B, Fang Z, Mischel PS, Horvath S, Nelson SF. Gene connectivity, function, and sequence conservation: predictions from modular yeast co-expression networks. BMC Genomics. 2006;7:40. doi: 10.1186/1471-2164-7-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carter et al. (2004).Carter SL, Brechbuhler CM, Griffin M, Bond AT. Gene co-expression network topology provides a framework for molecular characterization of cellular state. Bioinformatics. 2004;20:2242–2250. doi: 10.1093/bioinformatics/bth234. [DOI] [PubMed] [Google Scholar]
- Chen et al. (2018).Chen L, Yuan L, Qian K, Qian G, Zhu Y, Wu CL, Dan HC, Xiao Y, Wang X. Identification of biomarkers associated with pathological stage and prognosis of clear cell renal cell carcinoma by co-expression network analysis. Frontiers in Physiology. 2018;9:399. doi: 10.3389/fphys.2018.00399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen et al. (2017a).Chen L, Yuan L, Wang Y, Wang G, Zhu Y, Cao R, Qian G, Xie C, Liu X, Xiao Y, Wang X. Co-expression network analysis identified FCER1G in association with progression and prognosis in human clear cell renal cell carcinoma. International Journal of Biological Sciences. 2017a;13:1361–1372. doi: 10.7150/ijbs.21657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen et al. (2020).Chen S, Cao GD, Wei W, Yida L, Xiaobo H, Lei Y, Ke C, Chen B, Xiong MM. Prediction and identification of immune genes related to the prognosis of patients with colon adenocarcinoma and its mechanisms. World Journal of Surgical Oncology. 2020;18:146. doi: 10.1186/s12957-020-01921-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen et al. (2017b).Chen W, Zheng R, Zhang S, Zeng H, Xia C, Zuo T, Yang Z, Zou X, He J. Cancer incidence and mortality in China, 2013. Cancer Letters. 2017b;401:63–71. doi: 10.1016/j.canlet.2017.04.024. [DOI] [PubMed] [Google Scholar]
- Chen et al. (2019).Chen ZL, Qin L, Peng XB, Hu Y, Liu B. INHBA gene silencing inhibits gastric cancer cell migration and invasion by impeding activation of the TGF-beta signaling pathway. Journal of Cellular Physiology. 2019;234:18065–18074. doi: 10.1002/jcp.28439. [DOI] [PubMed] [Google Scholar]
- Chu et al. (2014).Chu CM, Yao CT, Chang YT, Chou HL, Chou YC, Chen KH, Terng HJ, Huang CS, Lee CC, Su SL, Liu YC, Lin FG, Wetter T, Chang CW. Gene expression profiling of colorectal tumors and normal mucosa by microarrays meta-analysis using prediction analysis of microarray, artificial neural network, classification, and regression trees. Disease Markers. 2014;2014:634123. doi: 10.1155/2014/634123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung et al. (2017).Chung SJ, Nagaraju GP, Nagalingam A, Muniraj N, Kuppusamy P, Walker A, Woo J, Gyorffy B, Gabrielson E, Saxena NK, Sharma D. ADIPOQ/adiponectin induces cytotoxic autophagy in breast cancer cells through STK11/LKB1-mediated activation of the AMPK-ULK1 axis. Autophagy. 2017;13:1386–1403. doi: 10.1080/15548627.2017.1332565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cristescu et al. (2015).Cristescu R, Lee J, Nebozhyn M, Kim KM, Ting JC, Wong SS, Liu J, Yue YG, Wang J, Yu K, Ye XS, Do IG, Liu S, Gong L, Fu J, Jin JG, Choi MG, Sohn TS, Lee JH, Bae JM, Kim ST, Park SH, Sohn I, Jung SH, Tan P, Chen R, Hardwick J, Kang WK, Ayers M, Hongyue D, Reinhard C, Loboda A, Kim S, Aggarwal A. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nature Medicine. 2015;21:449–456. doi: 10.1038/nm.3850. [DOI] [PubMed] [Google Scholar]
- Daniel et al. (2019).Daniel P, Halada P, Jelinek M, Balusikova K, Kovar J. Differentially expressed mitochondrial proteins in human MCF7 breast cancer cells resistant to paclitaxel. International Journal of Molecular Sciences. 2019;20(12):2986. doi: 10.3390/ijms20122986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delgado et al. (2014).Delgado AP, Brandao P, Chaoado MJ, Hamid S, Narayanan R. Open reading frames associated with cancer in the dark matter of the human genome. Cancer Genomics and Proteomics. 2014;11(4):201–213. [PubMed] [Google Scholar]
- Dennis et al. (2003).Dennis G, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA. DAVID: database for annotation, visualization, and integrated discovery. Genome Biology. 2003;4:R60. doi: 10.1186/gb-2003-4-9-r60. [DOI] [PubMed] [Google Scholar]
- Feng et al. (2018).Feng Y, Sun T, Yu Y, Gao Y, Wang X, Chen Z. MicroRNA-370 inhibits the proliferation, invasion and EMT of gastric cancer cells by directly targeting PAQR4. Journal of Pharmacological Sciences. 2018;138:96–106. doi: 10.1016/j.jphs.2018.08.004. [DOI] [PubMed] [Google Scholar]
- Friedman (2010).Friedman J. Package ‘glmnet’. https://cran.r-project.org/web/packages/glmnet/index.html 2010
- Friedman, Hastie & Tibshirani (2010).Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software. 2010;33:1–22. [PMC free article] [PubMed] [Google Scholar]
- Fritsch et al. (2019).Fritsch S, Guenther F, Wright MN, Suling M, Mueller SM, Wright MN. neuralnet: training of neural networks. 2019. https://journal.r-project.org/archive/2010/RJ-2010-006/index.html https://journal.r-project.org/archive/2010/RJ-2010-006/index.html
- Fujimoto et al. (2007).Fujimoto A, Ishikawa Y, Akishima-Fukasawa Y, Ito K, Akasaka Y, Tamai S, Maehara T, Kiguchi H, Ogata K, Nishimura C, Miki K, Ishii T. Significance of lymphatic invasion on regional lymph node metastasis in early gastric cancer using LYVE-1 immunohistochemical analysis. American Journal of Clinical Pathology. 2007;127:82–88. doi: 10.1309/LJQ9G0X8KP17QXP3. [DOI] [PubMed] [Google Scholar]
- Galamb et al. (2012).Galamb O, Wichmann B, Sipos F, Spisak S, Krenacs T, Toth K, Leiszter K, Kalmar A, Tulassay Z, Molnar B. Dysplasia-carcinoma transition specific transcripts in colonic biopsy samples. PLOS ONE. 2012;7:e48547. doi: 10.1371/journal.pone.0048547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao et al. (2006).Gao F, Lu YM, Cao ML, Liu YW, He YQ, Wang Y. Expression and quantification of LYVE-1 in human colorectal cancer. Clinical and Experimental Medicine. 2006;6:65–71. doi: 10.1007/s10238-006-0097-4. [DOI] [PubMed] [Google Scholar]
- Gautier et al. (2004).Gautier L, Cope L, Bolstad BM, Irizarry RA. affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004;20:307–315. doi: 10.1093/bioinformatics/btg405. [DOI] [PubMed] [Google Scholar]
- Gong et al. (2017).Gong B, Li Y, Cheng Z, Wang P, Luo L, Huang H, Duan S, Liu F. GRIK3: a novel oncogenic protein related to tumor TNM stage, lymph node metastasis, and poor prognosis of GC. Tumour Biology. 2017;39:1010428317704364. doi: 10.1177/1010428317704364. [DOI] [PubMed] [Google Scholar]
- Hastie & Narasimhan (2001).Hastie TTR, Narasimhan B. impute: imputation for microarray data. Bioinformatics. 2001;17:6. [Google Scholar]
- He et al. (2013).He CZ, Zhang KH, Li Q, Liu XH, Hong Y, Lv NH. Combined use of AFP, CEA, CA125 and CAl9-9 improves the sensitivity for the diagnosis of gastric cancer. BMC Gastroenterology. 2013;13:87. doi: 10.1186/1471-230X-13-87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu et al. (2013).Hu X, Cammann H, Meyer HA, Miller K, Jung K, Stephan C. Artificial neural networks and prostate cancer—tools for diagnosis and management. Nature Reviews Urology. 2013;10:174–182. doi: 10.1038/nrurol.2013.9. [DOI] [PubMed] [Google Scholar]
- Huang & Ling (2005).Huang J, Ling CX. Using AUC and accuracy in evaluating learning algorithms. Ieee Transactions on Knowledge and Data Engineering. 2005;17:299–310. doi: 10.1109/Tkde.2005.50. [DOI] [Google Scholar]
- Huang et al. (2019).Huang SC, Ng KF, Yeh TS, Cheng CT, Lin JS, Liu YJ, Chuang HC, Chen TC. Subtraction of Epstein-Barr virus and microsatellite instability genotypes from the Lauren histotypes: combined molecular and histologic subtyping with clinicopathological and prognostic significance validated in a cohort of 1,248 cases. International Journal of Cancer. 2019;145:3218–3230. doi: 10.1002/ijc.32215. [DOI] [PubMed] [Google Scholar]
- Jones (2017).Jones R. PhD Thesis. 2017. Identification of novel risk variants for sarcoma and other cancers by whole exome sequencing in cancer cluster families. [Google Scholar]
- Jones et al. (2008).Jones S, Zhang X, Parsons DW, Lin JC, Leary RJ, Angenendt P, Mankoo P, Carter H, Kamiyama H, Jimeno A, Hong SM, Fu B, Lin MT, Calhoun ES, Kamiyama M, Walter K, Nikolskaya T, Nikolsky Y, Hartigan J, Smith DR, Hidalgo M, Leach SD, Klein AP, Jaffee EM, Goggins M, Maitra A, Iacobuzio-Donahue C, Eshleman JR, Kern SE, Hruban RH, Karchin R, Papadopoulos N, Parmigiani G, Vogelstein B, Velculescu VE, Kinzler KW. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–1806. doi: 10.1126/science.1164368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaneda et al. (2011).Kaneda H, Arao T, Matsumoto K, De Velasco MA, Tamura D, Aomatsu K, Kudo K, Sakai K, Nagai T, Fujita Y, Tanaka K, Yanagihara K, Yamada Y, Okamoto I, Nakagawa K, Nishio K. Activin A inhibits vascular endothelial cell growth and suppresses tumour angiogenesis in gastric cancer. British Journal of Cancer. 2011;105:1210–1217. doi: 10.1038/bjc.2011.348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katayama et al. (2017).Katayama Y, Oshima T, Sakamaki K, Aoyama T, Sato T, Masudo K, Shiozawa M, Yoshikawa T, Rino Y, Imada T, Masuda M. Clinical significance of INHBA gene expression in patients with gastric cancer who receive curative resection followed by adjuvant S-1 chemotherapy. In Vivo. 2017;31:565–571. doi: 10.21873/invivo.11095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kato et al. (2005).Kato T, Prevo R, Steers G, Roberts H, Leek RD, Kimura T, Kameoka S, Nishikawa T, Kobayashi M, Jackson DG, Harris AL, Gatter KC, Pezzella F. A quantitative analysis of lymphatic vessels in human breast cancer, based on LYVE-1 immunoreactivity. British Journal of Cancer. 2005;93:1168–1174. doi: 10.1038/sj.bjc.6602844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koukourakis et al. (2005).Koukourakis MI, Giatromanolaki A, Sivridis E, Simopoulos C, Gatter KC, Harris AL, Jackson DG. LYVE-1 immunohistochemical assessment of lymphangiogenesis in endometrial and lung cancer. Journal of Clinical Pathology. 2005;58:202–206. doi: 10.1136/jcp.2004.019174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kropotova et al. (2013).Kropotova ES, Zinov’eva OL, Zyrianova AF, Choinzonov EL, Afanas’ev SG, Cherdyntseva NV, Beresten SF, Oparina N, Mashkova TD. Expression of genes involved in retinoic acid biosynthesis in human gastric cancer. Molekuliarnaia Biologiia. 2013;47:317–330. doi: 10.7868/s0026898413020079. [DOI] [PubMed] [Google Scholar]
- Langfelder & Horvath (2008).Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langfelder et al. (2011b).Langfelder P, Luo R, Oldham MC, Horvath S. Is my network module preserved and reproducible? PLOS Computational Biology. 2011;7:e1001057. doi: 10.1371/journal.pcbi.1001057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee et al. (2019).Lee JS, Kim SH, Lee S, Kang JH, Lee SH, Cheong JH, Kim SY. Gastric cancer depends on aldehyde dehydrogenase 3A1 for fatty acid oxidation. Scientific Reports. 2019;9:16313. doi: 10.1038/s41598-019-52814-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lemeshow (2000).Lemeshow DWHS. Applied logistic regression. 2nd Ed. Hoboken: Wiley; 2000. [Google Scholar]
- Li et al. (2018a).Li C, Zeng X, Yu H, Gu Y, Zhang W. Identification of hub genes with diagnostic values in pancreatic cancer by bioinformatics analyses and supervised learning methods. World Journal of Surgical Oncology. 2018a;16:223. doi: 10.1186/s12957-018-1519-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li et al. (2018b).Li F, Du H, Li S, Liu J. The association between metabolic syndrome and gastric cancer in Chinese. Frontiers in Oncology. 2018b;8:326. doi: 10.3389/fonc.2018.00326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li et al. (2020).Li X, Yu W, Liang C, Xu Y, Zhang M, Ding X, Cai X. INHBA is a prognostic predictor for patients with colon adenocarcinoma. BMC Cancer. 2020;20:305. doi: 10.1186/s12885-020-06743-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin et al. (2020a).Lin H, Hong YG, Zhou JD, Gao XH, Yuan PH, Xin C, Huang ZP, Zhang W, Hao LQ, Hou KZ. LncRNA INHBA-AS1 promotes colorectal cancer cell proliferation by sponging miR-422a to increase AKT1 axis. European Review for Medical and Pharmacological Sciences. 2020a;24:9940–9948. doi: 10.26355/eurrev_202010_23206. [DOI] [PubMed] [Google Scholar]
- Lin et al. (2020b).Lin TA, Wu TS, Li YJ, Yang CN, Illescas Ralda MM, Chang HH. Role and mechanism of LIF in oral squamous cell carcinoma progression. Journal of Clinical Medicine. 2020b;9:295. doi: 10.3390/jcm9020295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu et al. (2020).Liu D, Zhou D, Sun Y, Zhu J, Ghoneim D, Wu C, Yao Q, Gamazon ER, Cox NJ, Wu L. A transcriptome-wide association study identifies candidate susceptibility genes for pancreatic cancer risk. Cancer Research. 2020;80:4346–4354. doi: 10.1158/0008-5472.CAN-20-1353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu et al. (2019a).Liu X, Li G, Ai L, Ye Q, Yu T, Yang B. Prognostic value of ATAD3 gene cluster expression in hepatocellular carcinoma. Oncology Letters. 2019a;18:1304–1310. doi: 10.3892/ol.2019.10454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu et al. (2019b).Liu Z, Li M, Hua Q, Li Y, Wang G. Identification of an eight-lncRNA prognostic model for breast cancer using WGCNA network analysis and a Cox-proportional hazards model based on L1-penalized estimation. International Journal of Molecular Medicine. 2019b;44:1333–1343. doi: 10.3892/ijmm.2019.4303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lou et al. (2017).Lou Y, Tian GY, Song Y, Liu YL, Chen YD, Shi JP, Yang J. Characterization of transcriptional modules related to fibrosing-NAFLD progression. Scientific Reports. 2017;7:4748. doi: 10.1038/s41598-017-05044-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mendez-Hernandez et al. (2017).Mendez-Hernandez A, Gallegos-Arreola MP, Moreno-Macias H, Espinosa Fematt J, Perez-Morales R. LEP rs7799039, LEPR rs1137101, and ADIPOQ rs2241766 and 1501299 Polymorphisms are associated with obesity and chemotherapy response in mexican women with breast cancer. Clinical Breast Cancer. 2017;17:453–462. doi: 10.1016/j.clbc.2017.03.010. [DOI] [PubMed] [Google Scholar]
- Miao et al. (2020).Miao YD, Wang JT, Ma XP, Yang Y, Mi D. Identification prognosis-associated immune genes in colon adenocarcinoma. Bioscience Reports. 2020;40(11):BSR20201734. doi: 10.1042/BSR20201734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyamoto et al. (2020).Miyamoto Y, Schirripa M, Suenaga M, Cao S, Zhang W, Okazaki S, Berger MD, Matsusaka S, Yang D, Ning Y, Baba H, Loupakis F, Lonardi S, Pietrantonio F, Borelli B, Cremolini C, Yamaguchi T, Lenz HJ. A polymorphism in the cachexia-associated gene INHBA predicts efficacy of regorafenib in patients with refractory metastatic colorectal cancer. PLOS ONE. 2020;15:e0239439. doi: 10.1371/journal.pone.0239439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mouta Carreira et al. (2001).Mouta Carreira C, Nasser SM, Di Tomaso E, Padera TP, Boucher Y, Tomarev SI, Jain RK. LYVE-1 is not restricted to the lymph vessels: expression in normal liver blood sinusoids and down-regulation in human liver cancer and cirrhosis. Cancer Research. 2001;61:8079–8084. [PubMed] [Google Scholar]
- Neidlin, Dimitrakopoulou & Alexopoulos (2019).Neidlin M, Dimitrakopoulou S, Alexopoulos LG. Multi-tissue network analysis for drug prioritization in knee osteoarthritis. Scientific Reports. 2019;9:15176. doi: 10.1038/s41598-019-51627-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ngiam & Khor (2019).Ngiam KY, Khor IW. Big data and machine learning algorithms for health-care delivery. The Lancet Oncology. 2019;20:e262–e273. doi: 10.1016/S1470-2045(19)30149-4. [DOI] [PubMed] [Google Scholar]
- Nimptsch et al. (2017).Nimptsch K, Song M, Aleksandrova K, Katsoulis M, Freisling H, Jenab M, Gunter MJ, Tsilidis KK, Weiderpass E, Bueno-De-Mesquita HB, Chong DQ, Jensen MK, Wu C, Overvad K, Kuhn T, Barrdahl M, Melander O, Jirstrom K, Peeters PH, Sieri S, Panico S, Cross AJ, Riboli E, Van Guelpen B, Myte R, Huerta JM, Rodriguez-Barranco M, Quiros JR, Dorronsoro M, Tjonneland A, Olsen A, Travis R, Boutron-Ruault MC, Carbonnel F, Severi G, Bonet C, Palli D, Janke J, Lee YA, Boeing H, Giovannucci EL, Ogino S, Fuchs CS, Rimm E, Wu K, Chan AT, Pischon T. Genetic variation in the ADIPOQ gene, adiponectin concentrations and risk of colorectal cancer: a mendelian randomization analysis using data from three large cohort studies. European Journal of Epidemiology. 2017;32:419–430. doi: 10.1007/s10654-017-0262-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nowak (2018).Nowak FV. Porf-2 = Arhgap39 = Vilse: a pivotal role in neurodevelopment, learning and memory. eNeuro. 2018;5(5):ENEURO.0082-18.2018. doi: 10.1523/ENEURO.0082-18.2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Obeidat et al. (2017).Obeidat M, Nie Y, Chen V, Shannon CP, Andiappan AK, Lee B, Rotzschke O, Castaldi PJ, Hersh CP, Fishbane N, Ng RT, McManus B, Miller BE, Rennard S, Pare PD, Sin DD. Network-based analysis reveals novel gene signatures in peripheral blood of patients with chronic obstructive pulmonary disease. Respiratory Research. 2017;18:72. doi: 10.1186/s12931-017-0558-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oh et al. (2018).Oh SC, Sohn BH, Cheong JH, Kim SB, Lee JE, Park KC, Lee SH, Park JL, Park YY, Lee HS, Jang HJ, Park ES, Kim SC, Heo J, Chu IS, Jang YJ, Mok YJ, Jung W, Kim BH, Kim A, Cho JY, Lim JY, Hayashi Y, Song S, Elimova E, Estralla JS, Lee JH, Bhutani MS, Lu Y, Liu W, Lee J, Kang WK, Kim S, Noh SH, Mills GB, Kim SY, Ajani JA, Lee JS. Clinical and genomic landscape of gastric cancer with a mesenchymal phenotype. Nature Communications. 2018;9:1777. doi: 10.1038/s41467-018-04179-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oshima et al. (2014).Oshima T, Yoshihara K, Aoyama T, Hasegawa S, Sato T, Yamamoto N, Akito N, Shiozawa M, Yoshikawa T, Numata K, Rino Y, Kunisaki C, Tanaka K, Akaike M, Imada T, Masuda M. Relation of INHBA gene expression to outcomes in gastric cancer after curative surgery. Anticancer Research. 2014;34:2303–2309. [PubMed] [Google Scholar]
- Ozmen et al. (2011).Ozmen F, Ozmen MM, Ozdemir E, Moran M, Seckin S, Guc D, Karaagaoglu E, Kansu E. Relationship between LYVE-1, VEGFR-3 and CD44 gene expressions and lymphatic metastasis in gastric cancer. World Journal of Gastroenterology. 2011;17:3220–3228. doi: 10.3748/wjg.v17.i27.3220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parida, Siddharth & Sharma (2019).Parida S, Siddharth S, Sharma D. Adiponectin, obesity, and cancer: clash of the bigwigs in health and disease. International Journal of Molecular Sciences. 2019;20:2519. doi: 10.3390/ijms20102519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peng et al. (2015).Peng G, Dan W, Jun W, Junjun Y, Tong R, Baoli Z, Yang X. Transcriptome profiling of the cancer and adjacent nontumor tissues from cervical squamous cell carcinoma patients by RNA sequencing. Tumour Biology. 2015;36:3309–3317. doi: 10.1007/s13277-014-2963-0. [DOI] [PubMed] [Google Scholar]
- Pormohammad et al. (2018).Pormohammad A, Mohtavinejad N, Gholizadeh P, Dabiri H, Salimi Chirani A, Hashemi A, Nasiri MJ. Global estimate of gastric cancer in Helicobacter pylori-infected population: a systematic review and meta-analysis. Journal of Cellular Physiology. 2018;234(2):1208–1218. doi: 10.1002/jcp.27114. [DOI] [PubMed] [Google Scholar]
- Raimondi et al. (2018).Raimondi A, Nichetti F, Peverelli G, Di Bartolomeo M, Braud FDe, Pietrantonio F. Genomic markers of resistance to targeted treatments in gastric cancer: potential new treatment strategies. Pharmacogenomics. 2018;19:1047–1068. doi: 10.2217/pgs-2018-0077. [DOI] [PubMed] [Google Scholar]
- Ritchie et al. (2015).Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robin et al. (2011).Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, Muller M. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77. doi: 10.1186/1471-2105-12-77. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sapkota et al. (2017).Sapkota S, Horiguchi K, Tosaka M, Yamada S, Yamada M. Whole-exome sequencing study of thyrotropin-secreting pituitary adenomas. Journal of Clinical Endocrinology and Metabolism. 2017;102:566–575. doi: 10.1210/jc.2016-2261. [DOI] [PubMed] [Google Scholar]
- Sarver et al. (2015).Sarver AE, Sarver AL, Thayanithy V, Subramanian S. Identification, by systematic RNA sequencing, of novel candidate biomarkers and therapeutic targets in human soft tissue tumors. Laboratory Investigation. 2015;95:1077–1088. doi: 10.1038/labinvest.2015.80. [DOI] [PubMed] [Google Scholar]
- Seeruttun et al. (2019).Seeruttun SR, Cheung WY, Wang W, Fang C, Liu ZM, Li JQ, Wu T, Wang J, Liang C, Zhou ZW. Identification of molecular biomarkers for the diagnosis of gastric cancer and lymph-node metastasis. Gastroenterology Report. 2019;7:57–66. doi: 10.1093/gastro/goy023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shahraz et al. (2015).Shahraz A, Kopatz J, Mathy R, Kappler J, Winter D, Kapoor S, Schutza V, Scheper T, Gieselmann V, Neumann H. Anti-inflammatory activity of low molecular weight polysialic acid on human macrophages. Scientific Reports. 2015;5:16800. doi: 10.1038/srep16800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smyth et al. (2016).Smyth EC, Verheij M, Allum W, Cunningham D, Cervantes A, Arnold D, Committee EG. Gastric cancer: ESMO clinical practice guidelines for diagnosis, treatment and follow-up. Annals of Oncology. 2016;27:v38–v49. doi: 10.1093/annonc/mdw350. [DOI] [PubMed] [Google Scholar]
- Stratford et al. (2010).Stratford JK, Bentrem DJ, Anderson JM, Fan C, Volmar KA, Marron JS, Routh ED, Caskey LS, Samuel JC, Der CJ, Thorne LB, Calvo BF, Kim HJ, Talamonti MS, Iacobuzio-Donahue CA, Hollingsworth MA, Perou CM, Yeh JJ. A six-gene signature predicts survival of patients with localized pancreatic ductal adenocarcinoma. PLOS Medicine. 2010;7:e1000307. doi: 10.1371/journal.pmed.1000307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun et al. (2020).Sun YL, Zhang Y, Guo YC, Yang ZH, Xu YC. A prognostic model based on the immune-related genes in colon adenocarcinoma. International Journal of Medical Sciences. 2020;17:1879–1896. doi: 10.7150/ijms.45813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takeno et al. (2008).Takeno A, Takemasa I, Doki Y, Yamasaki M, Miyata H, Takiguchi S, Fujiwara Y, Matsubara K, Monden M. Integrative approach for differentially overexpressed genes in gastric cancer by combining large-scale gene expression profiling and network analysis. British Journal of Cancer. 2008;99:1307–1315. doi: 10.1038/sj.bjc.6604682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan et al. (2017).Tan X, Wang GB, Tang Y, Bai J, Ye L. Association of ADIPOQ and ADIPOR variants with risk of colorectal cancer: a meta-analysis. Journal of Huazhong University of Science and Technology. 2017;37:161–171. doi: 10.1007/s11596-017-1710-3. [DOI] [PubMed] [Google Scholar]
- Tan et al. (2018).Tan Y, Lin K, Zhao Y, Wu Q, Chen D, Wang J, Liang Y, Li J, Hu J, Wang H, Liu Y, Zhang S, He W, Huang Q, Hu X, Yao Z, Liang B, Liao W, Shi M. Adipocytes fuel gastric cancer omental metastasis via PITPNC1-mediated fatty acid metabolic reprogramming. Theranostics. 2018;8:5452–5468. doi: 10.7150/thno.28219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Teng, Lang & Shay (2019).Teng Y, Lang L, Shay C. ATAD3A on the path to cancer. Advances in Experimental Medicine and Biology. 2019;1134:259–269. doi: 10.1007/978-3-030-12668-1_14. [DOI] [PubMed] [Google Scholar]
- Teng et al. (2016).Teng Y, Ren X, Li H, Shull A, Kim J, Cowell JK. Mitochondrial ATAD3A combines with GRP78 to regulate the WASF3 metastasis-promoting protein. Oncogene. 2016;35:333–343. doi: 10.1038/onc.2015.86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waldum, Sagatun & Mjones (2017).Waldum HL, Sagatun L, Mjones P. Gastrin and gastric cancer. Frontiers in Endocrinology. 2017;8:1. doi: 10.3389/fendo.2017.00001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang et al. (2014).Wang J, Ni Z, Duan Z, Wang G, Li F. Altered expression of hypoxia-inducible factor-1alpha (HIF-1alpha) and its regulatory genes in gastric cancer tissues. PLOS ONE. 2014;9:e99835. doi: 10.1371/journal.pone.0099835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang et al. (2012).Wang Q, Wen YG, Li DP, Xia J, Zhou CZ, Yan DW, Tang HM, Peng ZH. Upregulated INHBA expression is associated with poor survival in gastric cancer. Medical Oncology. 2012;29:77–83. doi: 10.1007/s12032-010-9766-y. [DOI] [PubMed] [Google Scholar]
- Wen et al. (2020).Wen F, Huang J, Lu X, Huang W, Wang Y, Bai Y, Ruan S, Gu S, Chen X, Shu P. Identification and prognostic value of metabolism-related genes in gastric cancer. Aging. 2020;12:17647–17661. doi: 10.18632/aging.103838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wickham (2016).Wickham H. ggplot2: elegant graphics for data analysis. Springer-Verlag; New York: 2016. [Google Scholar]
- Wu et al. (2019).Wu S, Liu X, He J, Wang H, Luo Y, Gong W, Li Y, Huang Y, Zhong L, Zhao Y. A dual targeting magnetic nanoparticle for human cancer detection. Nanoscale Research Letters. 2019;14:228. doi: 10.1186/s11671-019-3049-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiao et al. (2019).Xiao B, Kuang Z, Zhang W, Hang J, Chen L, Lei T, He Y, Deng C, Li W, Lu J, Qu J, Zhou Q, Hao W, Sun Z, Li L. Glutamate Ionotropic Receptor Kainate Type Subunit 3 (GRIK3) promotes epithelial-mesenchymal transition in breast cancer cells by regulating SPDEF/CDH1 signaling. Molecular Carcinogenesis. 2019;58:1314–1323. doi: 10.1002/mc.23014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu et al. (2016).Xu GJ, Shah AA, Li MZ, Xu Q, Rosen A, Casciola-Rosen L, Elledge SJ. Systematic autoantigen analysis identifies a distinct subtype of scleroderma with coincident cancer. Proceedings of the National Academy of Sciences of the United States of America. 2016;113:E7526–E7534. doi: 10.1073/pnas.1615990113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yoon & Kim (2015).Yoon H, Kim N. Diagnosis and management of high risk group for gastric cancer. Gut Liver. 2015;9:5–17. doi: 10.5009/gnl14118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeng et al. (2019).Zeng X, Li C, Li Y, Yu H, Fu P, Hong HG, Zhang W. A network-based variable selection approach for identification of modules and biomarker genes associated with end-stage kidney disease. Nephrology. 2019;25:775–784. doi: 10.1111/nep.13655. [DOI] [PubMed] [Google Scholar]
- Zhang & Horvath (2005).Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology. 2005;4:Article17. doi: 10.2202/1544-6115.1128. [DOI] [PubMed] [Google Scholar]
- Zhang et al. (2019).Zhang C, Liang Y, Ma MH, Wu KZ, Dai DQ. KRT15, INHBA, MATN3, and AGT are aberrantly methylated and differentially expressed in gastric cancer and associated with prognosis. Pathology, Research and Practice. 2019;215:893–899. doi: 10.1016/j.prp.2019.01.034. [DOI] [PubMed] [Google Scholar]
- Zhang et al. (2010).Zhang X, Yang JJ, Kim YS, Kim KY, Ahn WS, Yang S. An 8-gene signature, including methylated and down-regulated glutathione peroxidase 3, of gastric cancer. International Journal of Oncology. 2010;36:405–414. [PubMed] [Google Scholar]
- Zhao et al. (2017).Zhao C, Li Y, Chen G, Wang F, Shen Z, Zhou R. Overexpression of miR-15b-5p promotes gastric cancer metastasis by regulating PAQR3. Oncology Reports. 2017;38:352–358. doi: 10.3892/or.2017.5673. [DOI] [PubMed] [Google Scholar]
- Zhou et al. (2018).Zhou XG, Huang XL, Liang SY, Tang SM, Wu SK, Huang TT, Mo ZN, Wang QY. Identifying miRNA and gene modules of colon cancer associated with pathological stage by weighted gene co-expression network analysis. OncoTargets and Therapy. 2018;11:2815–2830. doi: 10.2147/OTT.S163891. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
R code and training set, pre-derivation analysis, and testing set raw data are available in the Supplemental Files.