Skip to main content
Saudi Journal of Biological Sciences logoLink to Saudi Journal of Biological Sciences
. 2022 Feb 23;29(5):3519–3527. doi: 10.1016/j.sjbs.2022.02.016

Bioinformatics role of the WGCNA analysis and co-expression network identifies of prognostic marker in lung cancer

Liang Chengcheng a,1, Sayed Haidar Abbas Raza a,1, Yu Shengchen a, Zuhair M Mohammedsaleh c, Abdullah F Shater c, Fayez M Saleh d, Muna O Alamoudi e, Bandar H Aloufi e, Ahmed Mohajja Alshammari e, Nicola M Schreurs f, Linsen Zan a,b,
PMCID: PMC9280221  PMID: 35844396

Abstract

Lung cancer is the most talked about cancer in the world. It is also one of the cancers that currently has a high mortality rate. The aim of our research is to find more effective therapeutic targets and prognostic markers for human lung cancer. First, we download gene expression data from the GEO database. We performed weighted co-expression network analysis on the selected genes, we then constructed scale-free networks and topological overlap matrices, and performed correlation modular analysis with the cancer group. We screened the 200 genes with the highest correlation in the cyan module for functional enrichment analysis and protein interaction network construction, found that most of them focused on cell division, tumor necrosis factor-mediated signaling pathways, cellular redox homeostasis, reactive oxygen species biosynthesis, and other processes, and were related to the cell cycle, apoptosis, HIF-1 signaling pathway, p53 signaling pathway, NF-κB signaling pathway, and several cancer disease pathways are involved. Finally, we used the GEPIA website data to perform survival analysis on some of the genes with GS > 0.6 in the cyan module. CBX3, AHCY, MRPL12, TPGB, TUBG1, KIF11, LRRC59, MRPL17, TMEM106B, ZWINT, TRIP13, and HMMR was identified as an important prognostic factor for lung cancer patients. In summary, we identified 12 mRNAs associated with lung cancer prognosis. Our study contributes to a deeper understanding of the molecular mechanisms of lung cancer and provides new insights into drug use and prognosis.

Keywords: Bioinformatics, Lung Cancer, Gene Expression Omnibus, Weighted Correlation Network Analysis (WGCNA), Gene Expression Profiling Interactive Analysis (GEPIA)

1. Introduction

Cancer is defined as an abnormal growth of cells. Cancer originates in any organ or body structure and consists of tiny cells that have lost the ability to stop growing. The incidence of cancer has been rising steadily over the past decade, probably due to our regular exposure to various carcinogens, or cancer-causing agents, and changes in our lifestyles. Cancer is one of the most dreaded diseases of the 20th century and is spreading further in the 21th century as the incidence continues and increases. This is a very worrying situation, with one in four people at risk of developing cancer (Roy and Saikia, 2016, Torre et al., 2016). Cancer profiles are inseparable from their internal metabolism, whether the cancer profile leads to altered internal metabolism or the reverse. Alterations in signal transduction pathways may explain the metabolic reprogramming of cells (Kroemer and Pouyssegur, 2008). Cancers with lethal potential, or their precursors, should be detected early for treatment that reduces mortality and morbidity (Srivastava et al., 2019).

With increasing urbanization and environmental pollution, the causative factors of lung cancer have become increasingly complex. Research directions have focused on the epidemiological characteristics of lung cancer and its associated risk factors (Mao et al., 2016). Proteomics can detect relatively different protein abundances in normal and cancer patients, and analysis can identify predictive and prognostic markers of drug resistance in lung cancer. In turn, new molecular therapeutic targets can be identified (Cheung and Juan, 2017, Oberndorfer and Müllauer, 2018). Research on molecular markers of lung cancer is underway in Europe, which will potentially be an important screening tool (Veronesi, 2015).

WGCNA, or Weighted Correlation Network Analysis, is a measure of the co-expression relationship between genes using their expression correlation coefficients. Expression patterns between genes clustered into a module are similar. They can be involved in the same biological processes or signaling pathways (Liu et al., 2017, Kakati et al., 2019). In addition, these modules can also be associated with clinical features. When screening for differences, most studies focused on the differential genes and ignored the high correlation between genes. Currently, WGCNA has been applied in several research fields and is very effective for screening new therapeutic targets (Chen et al., 2017, Yuan et al., 2018, Yao et al., 2019).

In this study data were collected from the GEO dataset on the expression profile of lung cancer. Fifty-four of these samples (27 normal paraneoplastic normal tissues, 27 tumor cancerous tissues) were selected for data compilation. And the average FPKM < 0.5 expression was filtered. We then performed WGCNA analysis of the resulting genes and then constructed a protein interaction network on the highly correlated genes of lung cancer samples and survival analysis by combining the clinical data from GEPIA database, and screened a total of 12 genes related to survival rate. The screened genes could be used as prognostic markers of lung cancer for further study.

2. Methods

2.1. Data acquisition and pre‐processing

To screen for prognostic markers in lung cancer patients, this study used datasets from the GEO database, and GEO data contain expression profiling data and DNA methylation data.

2.2. Construction of Weighted Correlation Network Analysis

The value of mean FPKM was filtered for values greater than 0.5. We also perform cluster analysis on all samples and fit index analysis to convert the similarity matrix to adjacency matrix, and then we calculate the optimal power (soft threshold) value to ensure that the correlation between connectivity and power is above 0.9. The best power value was determined to be 14. Based on this, we constructed a scale-free network and a topology overlap matrix (TOM). Corresponding TOM dissimilarity (diss TOM) was also performed, which resulted in the generation of a gene (tree graph) hierarchical clustering tree based on function hclust by hierarchical clustering for module detection. To avoid generating too many modules, the relevant parameters were a minimum Module Size = 30 and deep Split = 2. MEDissThres = 0.25, the similarity is 0.75. When the similarity is > 0.75, the modules are merged to generate new merge module after that.

2.3. Relationship between grouping information and modules

The correlations between the constructed modules and the tumor and normal groups were analyzed, and a heat map plotted including 35 modules. Finally, we performed correlation analysis based on the gene significance of the module membership with the tumor group and plotted the scatter plot. The cyan module was found to have (Cor = 0.71, p-value = 1.4e-133). We further perform subsequent functional enrichment and analysis of this module.

2.4. Gene ontology and pathway enrichment analysis

The cyan module (Cor = 0.67, p-value = 3e-08) was screened and 200 hub genes were obtained by GS > 0.5 screening. Gene Ontology (GO) enrichment analysis of the hub genes was performed using the David version 6.8 (https://david.ncifcrf.gov/) database (Huang da et al., 2009). The kobas version 3.0 (http://kobas.cbi.pku.edu.cn/kobas3/?t=1) was selected for transformation of hub genes and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis (Wu et al., 2006, Xie et al., 2011). The GO consists of three components: biological process (BP), molecular function (MF), and cellular component (CC). All displayed GO or KEGG terms or genes have a p < 0.05. Finally, we visualized some of the terms using the GO plot package (https://cran.r-project.org/web/packages/GOplot/) in the R language software (Walter et al., 2015).

2.5. Construction of PPI network

A protein interaction network map was constructed for the hub genes in the cyan module with GS > 0.5 (the higher the GS, the greater the correlation between the gene and the trait), which were displayed using the online website string (https://string-db.org/cgi/) (Szklarczyk et al., 2015, Szklarczyk et al., 2017). The screening condition was that with the highest confidence = 0.900. Further visualization was obtained using Cytoscape version 3.6.0 (https://cytoscape.org/) software (Shannon et al., 2003, Doncheva et al., 2019).

2.6. Survival analysis

Survival analysis was performed on some of the genes with GS > 0.6 in the cyan module, using the Gene Expression Profiling Interactive Analysis (GEPIA) website (Tang et al., 2017) (http://gepia2.cancer-pku.cn/# (index) of the Lung cancer disease clinical data for survival analysis of GS correlation>0.6 in the cyan module. The clinical sample consisted of 239 low expression groups and 239 high expression groups (S.Table 1).

3. Results

3.1. Pre‐processing of the data sets

We downloaded dataset GSE7670 and selected 54 of the samples (27 normal paracancerous normal tissues and 27 tumor cancerous tissues). The platform file used the GPL96 [HG-U133A] Affymetrix Human Genome U133A Array, and we performed id conversion of the matrix files according to the platform file. The matrix files were sorted and processed for missing values, and filtered for expression with an average FPKM < 0.5. We then performed WGCNA analysis of the resulting genes.

Fig. 1 shows the scale-free fit indices at various soft-threshold (power) values. There are plots showing the average connectivity at various soft-thresholds (power). The best power value was determined as β = 14. We then performed sample cluster analysis (Fig. 2). After grouping the adjacency matrices were transformed to generate TOM matrices, and hierarchical clustering was used to generate a function hclust hierarchical clustering-based tree for module detection. Dynamic tree clipping method was used for module identification. And modules with similarity greater than 0.75 are merged. The final module clustering tree diagram was obtained (Fig. 3). We perform correlation analysis between the various modules and trait groupings generated by clustering. A heat map of the module and trait data is plotted (Fig. 4). It is total of 35 modules were generated for all samples. The correlation coefficients and p-values between each module and each trait are listed in the heat map. Finally, we selected the Cyan module for subsequent analysis. Because this module had the largest correlation coefficient with the tumor group (Cor = 0.67, p-value = 3e-08). This study also looked at the correlation between MM (module membership) in the cyan module and GS (gene significance) in the trait tumor (Fig. 5). The correlation was found to be 0.71, p-value = 1.4e-133. It means that the genes in the cyan module had a high positive correlation with the tumor cancer group. We then performed functional studies on the genes in the cyan module.

Fig. 1.

Fig. 1

Scatter plot of fit index, average connectivity and power values. Left: Soft Threshold (power) is the weight, and the vertical axis indicates the correlation between connectivity and power. Right: Soft Threshold (power) is the weight, and the vertical axis is the average connectivity. The power when the correlation is required to reach 0.9 is used as the β value, β = 14 in this analysis.

Fig. 2.

Fig. 2

Sample clustering plot and sample trait heat map. The image is a clustering plot after clustering the samples, and below is a heat map of the samples corresponding to the traits Tumor and Normal, where red indicates the Tumor group. White indicates the Normal group.

Fig. 3.

Fig. 3

Module clustering tree diagram. Scale-free networks and topological overlap matrices (TOMs) were constructed. The corresponding TOM phase dissimilarity (diss TOM) was also performed, resulting in the generation of a gene (tree diagram) hierarchical clustering tree based on the function hclust by hierarchical clustering for module detection. In order to avoid generating too many modules, the relevant parameters min Module Size is 30 and deep Split = 2. MEDissThres is set to 0.25, the similarity is 0.75. When the similarity is > 0.75, the modules are merged together to generate new merge the module after that.

Fig. 4.

Fig. 4

Module and trait-related heat map. The horizontal coordinates are traits, divided into tumor and normal groups. The vertical coordinates are the clustering obtained for each module. We chose the most correlated (Cor = 0.67, p-value = 3e-08) module cyan for the follow-up study.

Fig. 5.

Fig. 5

Scatter plot of correlation analysis of Cyan modules and tumor group. We can see that the correlation coefficient between Cyan module and tumor group reaches 0.71. To find the hub genes associated with the tumor trait, we first calculate the internal connectivity of the gene and the module, which measures the position of the gene within the module, while the module indicates to which module the gene belongs. The higher the correlation between the module MM and the current GS, the more positively the module is correlated with the trait tumor.

3.2. Enrichment analysis of GO, KEGG and PPI network

A total of 120 GO terms were found. Of these, 62 were biological processes (BP), 38 were cellular components (CC), and 20 were molecular functions (MF). Most of them were focused on cell division, tumor necrosis factor-mediated signaling pathway, negative regulation of cell redox homeostasis, negative regulation of reactive oxygen species biosynthesis, positive regulation of Wnt signaling pathway and other processes related to cancer cell division, maintenance of cell redox homeostasis and reactive oxygen species biosynthesis. KEGG pathway enrichment analysis enriched a total of 110 pathways, mainly including cell cycle, cellular apoptosis, HIF-1 signaling pathway, peroxisome, insulin resistance, PI3K-Akt signaling pathway, p53 signaling pathway, NF-κB signaling pathway, and multiple cancer disease pathways. We selected some of the results of GO and KEGG analyses and plotted them visually using the R package GO plot (Fig. 6). Finally, we performed protein interaction network analysis based on the genes with GS > 0.5 in the cyan module (Fig. 7). A total of 96 nodes and 311 edges were obtained. The size of the circles in the graph indicates the size of the correlation with GS tumor. The larger the circles, the stronger the correlation.

Fig. 6.

Fig. 6

Chord diagram of GO and KEGG enrichment analysis of the Hub gene. GO enrichment analysis includes biological process (BP), molecular function (MF) and cellular component (CC) terms) and KEGG pathway enrichment analysis. Selected sections are presented.

Fig. 7.

Fig. 7

PPI protein interaction network diagram for the Hub gene. The protein interaction network map was constructed for the hub gene with GS > 0.5 in the cyan module, and the screening condition was highest confidence = 0.900. The size of nodes indicates the correlation between GS and tumor trait, and the larger the nodes, the stronger the correlation.

3.3. Survival analysis

A total of 12 lung cancer prognostic markers were screened: Chromobox 3 (CBX3), adenosylhomocysteinase (AHCY), Kinesin Family Member 11 (KIF11) by survival analysis based on the GEPIA public database, Mitochondrial Ribosomal Protein L12 (MRPL12), Leucine Rich Repeat Containing 59 (LRRC59), Hyaluronan Mediated Motility Receptor (HMMR), Mitochondrial Ribosomal Protein L17 (MRPL17), Transmembrane Protein 106B (TMEM106B), Trophoblast Glycoprotein (TPBG), Tubulin Gamma 1 (TUBG1), ZW10 Interacting Kinetochore Protein (ZWINT), Thyroid Hormone Receptor Interactor 13 (TRIP13). The survival analysis curve is shown in Fig. 8. When the expression of 12 genes is high, the survival rate of lung cancer patients is small; therefore, these genes may be associated with a poor prognosis.

Fig. 8.

Fig. 8

Survival analysis curves for selected Hub genes. GS correlation in the cyan module was greater than 0.6 for survival analysis, and the clinical sample consisted of 239 low expression groups and 239 high expression groups, method selection overall survival, using median's group cutoff, log rank p < 0.05.

4. Discussion

WGCNA analysis is more widely used in cancer research. Examples include studies related to colon cancer (Zhai et al., 2017), liver cancer(Yin et al., 2018) and osteosarcoma metastasis(Tian et al., 2018) This analysis allows for the classification of co-expressed genes by weighted gene co-expression. Listed as a module, each module and related traits are finally correlated, thus selecting module genes that are highly correlated with the trait for further analysis.

The incidence of lung cancer now leads the world and has gone from being a rare disease to a global public health problem. The causes of the disease have also become more complex than before. Most of these are due to smoking, environmental pollution problems, and industrialization. (Mao et al., 2016, Nasim et al., 2019). But with advances in the life sciences. New molecular targets for lung disease continue to be discovered, prompting the development of new therapies. For NSCLC, future use of targeted therapies or immunotherapy may be more effective (Hirsch et al., 2017).

Gene CBX3 mainly plays a role in neural differentiation and the growth of hepatocellular carcinoma cells, and has minimal studies for its association with lung cancer (Huang et al., 2017, Zhong et al., 2019). AHCY is associated with stem cell proliferation (Aranda et al., 2019), and studies on lung cancer are currently unreported. MRPL12 is mainly associated with mitochondrial energy supply (Ma et al., 2020)and the gene LRRC59 is reported to be associated with poor prognosis in lung cancer and is involved in the cell proliferation process (Li et al., 2020). The gene TPGBG is associated with Wnt signaling and can promote pancreatic ductal adenocarcinoma cell metastasis (He et al., 2015). The TUBG1 gene has been associated with the risk of breast cancer (Blanco et al., 2015). The gene we screened is not common in lung cancer studies. However, it has a high positive correlation with lung cancer traits. Therefore, it is of great research significance.

This is consistent with the characteristics of the sample and bodes well for the desirability of our research method. The results are reliable.

In this study, we used bioinformatics WGCNA analysis for modular analysis of lung cancer samples, and we identified 12 mRNAs that can be used as prognostic markers for lung cancer analysis. They included 5 unknown (CBX3, AHCY, MRPL12, TPBG, TUBG1) and 7 known (KIF11, LRRC59, MRPL17, TMEM106B, ZWINT, TRIP13, HMMR) markers.

5. Conclusion

Overall, we screened 12 mRNAs associated with prognosis in lung cancer that could serve as prognostic biomarkers for lung cancer research. The screening study performed contributes to further understanding of the molecular mechanisms of lung carcinogenesis and provides new insights into drug use and prognosis.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

Acknowledgement

All the authors acknowledge and thank their respective institutes and universities.

Funding

This study was supported by the National Key Research and Development Program of China (2018YFD0501700), National Natural Science Foundation of China (31972994), Key Research and Development Program of Ningxia Province (2019BEF02004), National Beef and Yak Industrial Technology System (CARS-37), Agricultural Science and Technology Innovation and Transformation Project of Shaanxi Province (NYKJ-2018-LY09).

Footnotes

Peer review under responsibility of King Saud University.

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.sjbs.2022.02.016.

Appendix A. Supplementary data

The following are the Supplementary data to this article:

Supplementary data 1
mmc1.zip (99.4KB, zip)

References

  1. Aranda S., Alcaine-Colet A., Blanco E., Borràs E., Caillot C., Sabidó E., Di Croce L. Chromatin capture links the metabolic enzyme ahcy to stem cell proliferation. Sci. Adv. 2019;5(3):eaav2448. doi: 10.1126/sciadv.aav2448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Blanco I., Kuchenbaecker K., Cuadras D., Wang X., Barrowdale D., de Garibay G.R., Librado P., Sánchez-Gracia A., Rozas J., Bonifaci N., McGuffog L., Pankratz V.S., Islam A., Mateo F., Berenguer A., Petit A., Català I., Brunet J., Feliubadaló L., Tornero E., Benítez J., Osorio A., Cajal T.R.y., Nevanlinna H., Aittomäki K., Arun B.K., Toland A.E., Karlan B.Y., Walsh C., Lester J., Greene M.H., Mai P.L., Nussbaum R.L., Andrulis I.L., Domchek S.M., Nathanson K.L., Rebbeck T.R., Barkardottir R.B., Jakubowska A., Lubinski J., Durda K., Jaworska-Bieniek K., Claes K., Van Maerken T., Díez O., Hansen T.V., Jønson L., Gerdes A.-M., Ejlertsen B., de la Hoya M., Caldés T., Dunning A.M., Oliver C., Fineberg E., Cook M., Peock S., McCann E., Murray A., Jacobs C., Pichert G., Lalloo F., Chu C., Dorkins H., Paterson J., Ong K.-R., Teixeira M.R., Hogervorst F.B.L., van der Hout A.H., Seynaeve C., van der Luijt R.B., Ligtenberg M.J.L., Devilee P., Wijnen J.T., Rookus M.A., Meijers-Heijboer H.E.J., Blok M.J., van den Ouweland A.M.W., Aalfs C.M., Rodriguez G.C., Phillips K.-A., Piedmonte M., Nerenstone S.R., Bae-Jump V.L., O'Malley D.M., Ratner E.S., Schmutzler R.K., Wappenschmidt B., Rhiem K., Engel C., Meindl A., Ditsch N., Arnold N., Plendl H.J., Niederacher D., Sutter C., Wang-Gohrke S., Steinemann D., Preisler-Adams S., Kast K., Varon-Mateeva R., Gehrig A., Bojesen A., Pedersen I.S., Sunde L., Jensen U.B., Thomassen M., Kruse T.A., Foretova L., Peterlongo P., Bernard L., Peissel B., Scuvera G., Manoukian S., Radice P., Ottini L., Montagna M., Agata S., Maugard C., Simard J., Soucy P., Berger A., Fink-Retter A., Singer C.F., Rappaport C., Geschwantler-Kaulich D., Tea M.-K., Pfeiler G., John E.M., Miron A., Neuhausen S.L., Terry M.B., Chung W.K., Daly M.B., Goldgar D.E., Janavicius R., Dorfling C.M., van Rensburg E.J., Fostira F., Konstantopoulou I., Garber J., Godwin A.K., Olah E., Narod S.A., Rennert G., Paluch S.S., Laitman Y., Friedman E., Liljegren A., Rantala J., Stenmark-Askmalm M., Loman N., Imyanitov E.N., Hamann U., Spurdle A.B., Healey S., Weitzel J.N., Herzog J., Margileth D., Gorrini C., Esteller M., Gómez A., Sayols S., Vidal E., Heyn H., Stoppa-Lyonnet D., Léoné M., Barjhoux L., Fassy-Colcombet M., de Pauw A., Lasset C., Ferrer S.F., Castera L., Berthet P., Cornelis F., Bignon Y.-J., Damiola F., Mazoyer S., Sinilnikova O.M., Maxwell C.A., Vijai J., Robson M., Kauff N., Corines M.J., Villano D., Cunningham J., Lee A., Lindor N., Lázaro C., Easton D.F., Offit K., Chenevix-Trench G., Couch F.J., Antoniou A.C., Pujana M.A., Suzuki H. Assessing associations between the aurka-hmmr-tpx2-tubg1 functional module and breast cancer risk in brca1/2 mutation carriers. PLoS ONE. 2015;10(4):e0120020. doi: 10.1371/journal.pone.0120020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chen L., Yuan L., Wang Y., Wang G., Zhu Y., Cao R., Qian G., Xie C., Liu X., Xiao Y.u., Wang X. Co-expression network analysis identified fcer1g in association with progression and prognosis in human clear cell renal cell carcinoma. Int. J. Biol. Sci. 2017;13(11):1361–1372. doi: 10.7150/ijbs.21657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Cheung C.H.Y., Juan H.F. Quantitative proteomics in lung cancer. J. Biomed. Sci. 2017;24(1):37. doi: 10.1186/s12929-017-0343-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Doncheva N.T., Morris J.H., Gorodkin J., Jensen L.J. Cytoscape stringapp: Network analysis and visualization of proteomics data. J. Proteome Res. 2019;18(2):623–632. doi: 10.1021/acs.jproteome.8b0070210.1021/acs.jproteome.8b00702.s00110.1021/acs.jproteome.8b00702.s002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. He P., Jiang S., Ma M., Wang Y., Li R., Fang F., Tian G., Zhang Z. Trophoblast glycoprotein promotes pancreatic ductal adenocarcinoma cell metastasis through wnt/planar cell polarity signaling. Mol. Med. Rep. 2015;12(1):503–509. doi: 10.3892/mmr.2015.3412. [DOI] [PubMed] [Google Scholar]
  7. Hirsch F.R., Scagliotti G.V., Mulshine J.L., Kwon R., Curran W.J., Jr., Wu Y.L., Paz-Ares L. Lung cancer: Current therapies and new targeted treatments. Lancet. 2017;389(10066):299–311. doi: 10.1016/s0140-6736(16)30958-8. [DOI] [PubMed] [Google Scholar]
  8. Huang C., Su T., Xue Y., Cheng C., Lay F.D., McKee R.A., Li M., Vashisht A., Wohlschlegel J., Novitch B.G., Plath K., Kurdistani S.K., Carey M. Cbx3 maintains lineage specificity during neural differentiation. Genes Dev. 2017;31(3):241–246. doi: 10.1101/gad.292169.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Huang da W., Sherman B.T., Lempicki R.A. Systematic and integrative analysis of large gene lists using david bioinformatics resources. Nat. Protoc. 2009;4(1):44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
  10. Kakati T., Bhattacharyya D.K., Barah P., Kalita J.K. Comparison of methods for differential co-expression analysis for disease biomarker prediction. Comput. Biol. Med. 2019;113:103380. doi: 10.1016/j.compbiomed.2019.103380. [DOI] [PubMed] [Google Scholar]
  11. Kroemer G., Pouyssegur J. Tumor cell metabolism: Cancer's achilles' heel. Cancer Cell. 2008;13(6):472–482. doi: 10.1016/j.ccr.2008.05.005. [DOI] [PubMed] [Google Scholar]
  12. Li D., Xing Y., Tian T., Guo Y., Qian J. Overexpression of lrrc59 is associated with poor prognosis and promotes cell proliferation and invasion in lung adenocarcinoma. Oncol. Targets Ther. 2020;136453–6463 doi: 10.2147/ott.s245336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Liu W., Li L., Ye H., Tu W. weighted gene co-expression network analysis in biomedicine research. Sheng Wu Gong Cheng Xue Bao. 2017;33(11):1791–1801. doi: 10.13345/j.cjb.170006. [DOI] [PubMed] [Google Scholar]
  14. Ma Y., Zhu S., Lv T., Gu X., Feng H., Zhen J., Xin W., Wan Q. SQSTM1/p62 Controls mtDNA Expression and Participates in Mitochondrial Energetic Adaption via MRPL12. iScience. 2020;23(8):101428. doi: 10.1016/j.isci.2020.101428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Mao Y., Yang D., He J., Krasna M.J. Epidemiology of lung cancer. Surg. Oncol. Clin. N. Am. 2016;25(3):439–445. doi: 10.1016/j.soc.2016.02.001. [DOI] [PubMed] [Google Scholar]
  16. Nasim F., Sabath B.F., Eapen G.A. Lung cancer. Med. Clin. North Am. 2019;103(3):463–473. doi: 10.1016/j.mcna.2018.12.006. [DOI] [PubMed] [Google Scholar]
  17. Oberndorfer F., Müllauer L. Molecular pathology of lung cancer: Current status and perspectives. Curr. Opin. Oncol. 2018;30(2):69–76. doi: 10.1097/cco.0000000000000429. [DOI] [PubMed] [Google Scholar]
  18. Roy P.S., Saikia B.J. Cancer and cure: A critical analysis. Indian J. Cancer. 2016;53(3):441–442. doi: 10.4103/0019-509x.200658. [DOI] [PubMed] [Google Scholar]
  19. Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Srivastava S., Koay E.J., Borowsky A.D., De Marzo A.M., Ghosh S., Wagner P.D., Kramer B.S. Cancer overdiagnosis: A biological challenge and clinical dilemma. Nat. Rev. Cancer. 2019;19(6):349–358. doi: 10.1038/s41568-019-0142-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Szklarczyk D., Franceschini A., Wyder S., Forslund K., Heller D., Huerta-Cepas J., Simonovic M., Roth A., Santos A., Tsafou K.P., et al. String v10: Protein-protein interaction networks, integrated over the tree of life. Nucl. Acids Res. 2015;43(Database issue):D447–452. doi: 10.1093/nar/gku1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Szklarczyk D., Morris J.H., Cook H., Kuhn M., Wyder S., Simonovic M., Santos A., Doncheva N.T., Roth A., Bork P., Jensen L.J., von Mering C. The string database in 2017: Quality-controlled protein-protein association networks, made broadly accessible. Nucl. Acids Res. 2017;45(D1):D362–D368. doi: 10.1093/nar/gkw937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Tang Z., Li C., Kang B., Gao G., Li C., Zhang Z. Gepia: A web server for cancer and normal gene expression profiling and interactive analyses. Nucl. Acids Res. 2017;45(W1):W98–w102. doi: 10.1093/nar/gkx247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Tian H., Guan D., Li J. Identifying osteosarcoma metastasis associated genes by weighted gene co-expression network analysis (wgcna) Medicine (Baltimore). 2018;97(24) doi: 10.1097/md.0000000000010781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Torre L.A., Siegel R.L., Ward E.M., Jemal A. Global cancer incidence and mortality rates and trends–an update. Cancer Epidemiol. Biomarkers Prev. 2016;25(1):16–27. doi: 10.1158/1055-9965.epi-15-0578. [DOI] [PubMed] [Google Scholar]
  26. Veronesi G. Lung cancer screening: The european perspective. Thorac. Surg. Clin. 2015;25(2):161–174. doi: 10.1016/j.thorsurg.2014.12.002. [DOI] [PubMed] [Google Scholar]
  27. Walter W., Sánchez-Cabo F., Ricote M. Goplot: An r package for visually combining expression data with functional analysis. Bioinformatics. 2015;31(17):2912–2914. doi: 10.1093/bioinformatics/btv300. [DOI] [PubMed] [Google Scholar]
  28. Wu J., Mao X., Cai T., Luo J., Wei L. KOBAS server: a web-based platform for automated annotation and pathway identification. Nucl. Acids Res. 2006;34(Web Server):W720–W724. doi: 10.1093/nar/gkl167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Xie C., Mao X., Huang J., Ding Y., Wu J., Dong S., Kong L., Gao G., Li C.Y., Wei L. Kobas 2.0: A web server for annotation and identification of enriched pathways and diseases. Nucl. Acids Res. 2011;39(Web Server issue):W316–322. doi: 10.1093/nar/gkr483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Yao Y., Zhang T., Qi L., Zhou C., Wei J., Feng F., Liu R., Sun C. Integrated analysis of co-expression and cerna network identifies five lncrnas as prognostic markers for breast cancer. J. Cell Mol. Med. 2019;23(12):8410–8419. doi: 10.1111/jcmm.14721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Yin L.i., Cai Z., Zhu B., Xu C. Identification of key pathways and genes in the dynamic progression of hcc based on wgcna. Genes (Basel) 2018;9(2):92. doi: 10.3390/genes9020092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Yuan L., Zeng G., Chen L., Wang G., Wang X., Cao X., Lu M., Liu X., Qian G., Xiao Y.u., Wang X. Identification of key genes and pathways in human clear cell renal cell carcinoma (ccrcc) by co-expression analysis. Int. J. Biol. Sci. 2018;14(3):266–279. doi: 10.7150/ijbs.23574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Zhai X., Xue Q., Liu Q., Guo Y., Chen Z. Colon cancer recurrence-associated genes revealed by wgcna co-expression network analysis. Mol. Med. Rep. 2017;16(5):6499–6505. doi: 10.3892/mmr.2017.7412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Zhong X., Kan A., Zhang W., Zhou J., Zhang H., Chen J., Tang S. Cbx3/hp1γ promotes tumor proliferation and predicts poor survival in hepatocellular carcinoma. Aging (Albany NY). 2019;11(15):5483–5497. doi: 10.18632/aging.102132. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data 1
mmc1.zip (99.4KB, zip)

Articles from Saudi Journal of Biological Sciences are provided here courtesy of Elsevier

RESOURCES