Abstract
Colorectal cancer (CRC) is one of the most common carcinomas and the fourth leading cause of cancer-related death worldwide. One of the obstacles in the successful treatment of CRC is a high rate of recurrence. We aimed to construct weighted gene co-expression network analysis (WGCNA) to identify key modules and hub genes in association with recurrence in CRC patients. We firstly used the microarray data, GSE41258, to construct a co-expression network and identify gene modules. Furthermore, protein and protein interaction (PPI) network was also performed to screen hub genes. To validate the hub genes, an independent dataset GSE17536 was used for survival analyses. Additionally, another two databases were also performed to investigate the survival rates and expression levels of hub genes. Gene set enrichment analyses (GSEA) combined with gene ontology (GO) were performed to further explore function and mechanisms. In our study, the midnightblue module was identified to be significant, 15 hub genes were screened, four of which were identified as hub nodes in the PPI network. In the test dataset, we found higher expression of MYL9 and CNN1 were significantly associated with shorter survival time of CRC patients. GO analyses showed that MYL9 and CNN1 were enriched in “muscle system process” and “cytoskeletal protein binding”. GSEA found the two hub genes were enriched in “pathways in cancer” and “calcium signaling pathway”. In conclusion, our study demonstrated that MYL9 and CNN1 were hub genes associated with the recurrence of CRC, which may contribute to the improvement of recurrence-free survival time of CRC patients.
Keywords: colorectal cancer (CRC), weighted gene co-expression network analysis (WGCNA), recurrence, hub gene
Introduction
Human colorectal cancer (CRC) is a global cause of concern in terms of both morbidity and mortality. It is recognized as the third most common carcinoma and the fourth leading cause of cancer-related death in the world 1. Although tremendous advances have been made for diagnosis and treatment of CRC, the burden of disease is still high 2. One of the deficiencies in the completely cure of CRC is the poor prognosis, such as a high rate of tumor recurrence. It is estimated that 30% of patients with stage I-III and 65% of patients with stage IV CRC suffer recurrence after curative treatment 3. No matter whether patients have received a series of progressive adjuvant therapies, however, early detection of disease recurrence during follow-up period results in improved long-term outcomes in CRC patients 4-6. Therefore, knowledge of prognostic and predictive biomarkers for recurrence may effectively contribute to better guidance to selection of treatment strategy as well as improved prognosis.
With the discovery and progression of microarray technologies with high throughput, it is the fact that gene expression profiles have been widely applied in the research area of cancer. Most microarray analyses have attached value to the comparison between tumor and normal samples 7. With the growing interest in bioinformatics analysis, weighted gene co-expression network analysis (WGCNA) has emerged as a novel approach to perform holistic microarray analyses, which is able to identify not only differentially expressed genes (DEGs), but also high degree of interconnection between genes 8. In WGCNA, the basic concept is construction of co-expression modules, which are clusters of genes maintaining consistent expression patterns and even playing similar biological roles, and these modules are derived from the data of mRNA expression profiles by performing unsupervised hierarchical clustering 9-11. In recent years, WGCNA has been successfully applied in the investigation of tumors, such as CRC. It has been reported that four key lncRNAs (RP11‐33O4.1, PCGF5, RP11‐164P12.5 and CTD‐2396E7.11) were found to participate in the development of colon adenocarcinoma (COAD) 12. In addition, COL8A1 was demonstrated to be related to prognosis and progression of COAD by constructing WGCNA as well 13. With the exploration of a growing number of mRNA expression files of CRC in different databases, we get opportunities to analyze the mRNA expression data of genes derived from a variety of platforms and institutes 14. Exploring more different potential biomarkers for different clinicopathological variables of CRC is of great significance.
In our study, for purpose of improving the knowledge of biological mechanisms underlying recurrence of CRC, WGCNA was applied to identify hub genes associated with the recurrence of patients with CRC.
Materials and Methods
Data Collection
The mRNA expression profiles of human CRC with contained clinical information of patients were downloaded from GEO online database. GSE41258 and GSE17536 were two datasets performed on Affymetrix Human Genome U133A Array (HG-U133A). GSE41258 was used as a training set by constructing a co-expression network then identifying hub genes, which included 182 CRC samples and 54 normal ones 15. In addition, GSE17536 was used as an independent test dataset, including 177 CRC patients 16.
Data Preprocessing
The “affy” package was used to preprocess and normalize the raw microarray data 17. Microarray quality was evaluated by the methods of intuitive observation, averaging and data fitting 18. First, we could obtain a grey-scale map by using the function of “image” in order to get a view of the signal intensity of microarray on the whole. Then, the “simpleaffy” package was conducted to get an overview diagram of quality assessment 19, 20. Furthermore, data fitting was conducted by the “affyPLM” package. Finally, sample clustering was conducted to detect microarray outliers on the basis of distances between samples in Pearson's correlation matrices and average linkage. The top 5000 varying genes were chosen for WGCNA.
Construction of WGCNA
R package “WGCNA” was conducted. First, an appropriate soft thresholding power was determined by the approximate scale-free topology criterion 21, 22. After that, the adjacency was transformed into topological overlap matrix (TOM), followed by calculation of corresponding dissimilarity (1-TOM) 23. Then, identification of modules was accomplished through the method of dynamic tree cut. It was usually determined by hierarchically clustering genes with similar expression profile and using 1-TOM dissimilarity with a minimum size of 30 cut-off 24. Finally, we selected a cutline for module dendrogram to merge modules for further analysis by calculating the dissimilarity of module eigengenes (MEs).
Identification of Clinically Significant Modules
ME and module significance (MS), two important parameters, were used to identify modules related to clinical traits of CRC patients. First, ME was the major component of a particular gene module, correlation between clinical features and MEs were calculated. Second, MS was regarded as the average value of gene significance (GS) in a particular module. Hence, the highest absolute MS was usually thought to be a persuasive indicator when selecting a clinically significant module.
Identification of Hub Genes
Hub genes were measured by the clinical feature relationship (cor.geneTraitSignificance > 0.2) and the absolute value of the Pearson's correlation (cor.geneModuleMembership > 0.8), which showed a high network connectivity in a particular module 25, 26. Additionally, by uploading all genes in the clinically significant module of interest to the Search Tool for the Retrieval of Interacting Genes (STRING) database (https://string-db.org/) 27, protein and protein interaction (PPI) network was constructed with confidence > 0.4. In the PPI network, hub nodes were genes with connectivity degree ≥ 4 (node/edge). Collectively, genes in both co-expression network and PPI network were selected as candidate hub genes for further analysis. The DEGs with |log2fold change (FC)| > 1 and false discovery rate (FDR) < 0.05 were statistically significant in GSE41258 28.
Validation of Hub Genes
First, a test dataset, GSE17536, was used for survival analyses of above candidate hub genes. In the dataset, 177 samples were divided into low and high groups according to the medium value of the expression of genes. Then “survival” package in R was performed. The log-rank test was adopted to compare two groups 29. Furthermore, Gene Expression Profiling Interactive Analysis (GEPIA) (http://gepia.cancer-pku.cn/) database was used to validate outcomes of survival analyses and gene expression levels 30. Moreover, the Human Protein Atlas (https://www.proteinatlas.org/) database was used for immunohistochemistry (IHC) analysis.
Function Enrichment Analysis
To get further insight into the function of hub genes in the module of interest, we performed Gene Ontology (GO) enrichment analysis by loading "anRichment" package in R. P < 0.05 was set as the cut-off criterion.
Gene Set Enrichment Analysis (GSEA)
In the test dataset GSE17536, 177 samples with CRC were grouped into low and high groups, according to the median value of the expression levels of genes. To explore mechanisms of hub genes in CRC, GSEA 31 was performed and mapped into Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment database. c2.cp.kegg.v5.2.symbols.gmt was selected in this study, which was used as reference gene sets. FDR < 0.05 was chosen as the cut-off criteria.
Collection of clinical tissue samples
30 paired CRC tissues and adjacent tissues were collected from the Department of General Surgery, Zhongnan Hospital of Wuhan University (Wuhan, China). The study was approved by the ethics committee of Zhongnan Hospital of Wuhan University, in accordance with the Declaration of Helsinki. Written informed consent was obtained from the enrolled patients. The samples were directly dissected and subjected to RNA extraction.
Quantitative Real-Time PCR (qRT-PCR)
Total RNA was extracted from 30 paired tissues with TRIzol reagent (Invitrogen, USA). Total RNA (1 μg) was used to synthesize first-strand cDNA using a synthesis kit (Thermo Fisher Scientific, USA). The experiments were performed by using a QuantStudioTM 6 Flex Real-Time PCR instrument (ABI, USA) with SYBR® Premix Ex TaqTM II Mix (Takara, Japan). GAPDH was taken as the internal control. The relative mRNA expression levels were calculated using the 2-ΔΔCt method 32. The gene specific primers were as follows: MYL9 (Forward 5'-GGATGTGATTCGCAACGCCTTTG-3' and Reverse 5'-CGGTACATCTCGTCCACTTCCT-3'); CNN1 (Forward 5'-CCAACGACCTGTTTGAGAACACC-3' and Reverse 5'-ATTTCCGCTCCTGCTTCTCTGC-3'); GAPDH (Forward 5'-AGAAGGCTGGGGCTCATTTG-3' and Reverse 5'-GCAGGAGGCATTGCTGATGAT-3').
Results
Data Preprocessing
First, we performed an overview diagram of quality assessment of total 182 CRC samples (Fig. S1). It is recognized that the absolute value of GAPDH 3'/5' should not be greater than 1 20. Fig. S1c showed that GSM1012445 was unqualified. Furthermore, GSM1012445 was chosen for data fitting analysis. Weights plot, residuals plot, relative log expression (RLE) boxplot and normalized unscaled standard errors (NUSE) boxplot were shown (Fig. S2, S3 and Fig. 1). NUSE is more sensitive than RLE, it is suggested that the samples were unqualified when the NUSE deviated from 1. Therefore, our results showed that GSM1012286, GSM1012445, GSM1012529, GSM1012531, GSM1012533 and GSM1012651 (Fig. 1a, c, d) were unqualified. Moreover, another three samples (GSM1012525, GSM1012326 and GSM1012604) were also removed by performing sample clustering (Fig. S4). To sum up, a total of nine samples should be removed from subsequent analyses in GES41258.
Weighted Co-expression Network Construction and Key Modules Identification
A total of 173 qualified CRC samples with clinical data were included. Using the R package, “WGCNA”, genes which showed similar expression patterns were divided into different modules (Fig. 2). In our work, β = 6 (scale free R2 = 0.89) was screened as the soft-thresholding (Fig. 3). Then, we identified 16 modules and the network heatmap was shown (Fig. 4a, b). Relevance between key module and CRC recurrence was tested using two methods. Our results showed that the ME of the midnightblue module possessed the highest correlation with tumor recurrence ((P = 5×10-4, R2 = 0.26), Fig. 4c). Moreover, we also indicated that the MS of the midnightblue module was the highest among all modules (Fig. 4d), which was considered to have more connection with tumor recurrence. Therefore, we identified the midnightblue module to be a clinically significant module of interest in association with CRC recurrence in the training set.
Hub gene identification
In our study, 15 genes were identified to be hub genes, which had high connectivity in the midnightblue module (Table 1). In addition, all genes in the midnightblue module were uploaded to the STRING database (Fig. S5). According to the PPI network, four hub genes were considered as hub nodes (MYL9, MYLK, CNN1 and DES) as well, which were also DEGs in GSE41258 (Table 1, Fig. 5a, b). Therefore, MYL9, MYLK, CNN1 and DES were candidate hub genes. Furthermore, all genes in the midnightblue module were enriched for GO analysis. Our results showed top 20 GO terms and indicated that hub genes were significantly enriched in “muscle contraction”, “muscle system process”, “contractile fiber part”, “cytoskeleton” and “cytoskeletal protein binding” (P < 0.05, Fig. 5c).
Table 1.
Gene symbol | Probe | cor.geneModuleMembership | Hub gene in PPI network | DEG analysis | |
---|---|---|---|---|---|
|logFC| | FDR | ||||
MYL9 | 201058_s_at | 0.943041082 | Yes | 1.617677 | 1.03E-14 |
MYLK | 202555_s_at | 0.936476159 | Yes | 1.794758 | 7.82E-19 |
CNN1 | 203951_at | 0.906218239 | Yes | 2.713818 | 1.34E-19 |
DES | 202222_s_at | 0.841739696 | Yes | 2.55989 | 7.76E-18 |
HSPB8 | 221667_s_at | 0.913958361 | No | 2.087421 | 3.09E-27 |
SPARCL1 | 200795_at | 0.900690649 | No | 1.531813 | 3.86E-21 |
KCNMB1 | 209948_at | 0.890620367 | No | 1.567299 | 7.39E-26 |
KANK2 | 218418_s_at | 0.880372051 | No | 1.084421 | 2E-20 |
AOC3 | 204894_s_at | 0.880033125 | No | 1.91461 | 3.38E-27 |
CSRP1 | 200621_at | 0.877492479 | No | 1.134281 | 1.7E-20 |
PPP1R12B | 201957_at | 0.872867041 | No | 1.795588 | 1.33E-33 |
PLN | 204940_at | 0.871769799 | No | 2.168066 | 4.12E-24 |
SYNM | 212730_at | 0.858816617 | No | 2.828553 | 1.48E-26 |
PDLIM3 | 209621_s_at | 0.839725447 | No | 1.494531 | 3.86E-14 |
FERMT2 | 214212_x_at | 0.808484526 | No | 0.804553 | 9.87E-12 |
Abbreviations: CRC, colorectal cancer; PPI, protein-protein interaction; DEG, differentially expressed gene; FC, fold change; FDR, false discovery rate.
Validation of Hub Genes
The four candidate hub genes (MYL9, MYLK, CNN1 and DES) were chosen for validation. GSE17536 was used as a test dataset for overall survival analyses. Our results discovered that patients who had higher expression of MYL9 and CNN1 showed a significantly shorter overall survival time (MYL9 P=0.014; CNN1 P=0.02, Fig. 6a, c). More convincingly, GEPIA also indicated a decreased overall and disease-free survival rates in CRC patients with highly expressed (n=135) MYL9 and CNN1 (MYL9 overall: P=0.0071, disease-free: P=0.021; CNN1 overall: P=0.0089, disease-free: P=0.018, Fig. 6e-l). Moreover, GEPIA showed that mRNA expression levels of MYL9 and CNN1 in CRC tissues were both significantly lower than that in normal colon samples (P<0.05, Fig. 7a, b), which were consistent with results of DEGs analysis in GSE41258 dataset. In addition, IHC demonstrated that the protein expression levels of MYL9 and CNN1 were also downregulated in CRC tissues (Fig. 7c, d).
Gene Set Enrichment Analysis
To obtain further insight into the mechanisms of MYL9 and CNN1 in CRC, GSEA was performed to search KEGG pathways enriched in MYL9 and CNN1 highly expressed samples, respectively. Under the cut-off criteria FDR < 0.01, six common and representative functional gene sets were shown (Fig. 8), such as “pathways in cancer”, “calcium signaling pathway” and “focal adhesion”.
qRT-PCR analysis
We further performed qRT-PCR to validate the mRNA expression of MYL9 and CNN1 in 30 paired fresh CRC tissues and adjacent tissues. Our experimental results showed that MYL9 and CNN1 were highly expressed in adjacent tissues compared with CRC tissues (Fig. 9), which were consistent with results by microarray data.
Discussion
In recent years, a systems biology approach, namely, WGCNA, has been widely used to identify potential and novel biomarkers in different kinds of tumors, such as adrenocortical cancer, clear cell renal cell cancer, oral squamous cell cancer and CRC 33-35, 13, 12. By performing WGCNA, Gao et al. reported distinct gene modules existing only in CRC liver metastatic tissues, which were not found in non-metastatic CRC samples 36. In addition, Liu et al. identified a novel prognostic maker, CENPA, which was associated with favorable survival outcome in CRC 14. As far as we know, numerous publications using WGCNA paid more attention to biomarkers for tumor stage of CRC. Although the TNM staging system was used to predict the recurrence of tumors 37, which was usually thought to be positively correlated with recurrence rate, however, it was contentious in a proportion of patients with CRC 38. Therefore, it is of great necessary to identify potential biomarkers which could not only predict the recurrence rate of CRC patients and make up for the deficiency of TNM staging system, but also have opportunities to identify high-risk patients at an earlier stage.
Until now, few studies have investigated potential biomarkers for CRC recurrence by performing WGCNA. In this work, WGCNA was applied in analysis of mRNA expression dataset GSE41258 in order to identify hub modules and genes associated with clinical features, such as tumor recurrence, which might be expected to be used as recurrence-free indicators in the future. Then, an independent validation dataset, GSE17536, was used to confirm our findings. Based on the clinical features, such as tumor stage, tumor metastasis and tumor recurrence, we finally chosen the tumor recurrence of CRC patients which was of interest to screen hub genes. The midnightblue module was identified, in which 15 genes were screened as hub genes; furthermore, four hub genes were also identified as hub nodes that showed a significant correlation with CRC recurrence. Among the four hub genes, MYL9 and CNN1 were found to be significantly correlated to overall survival as well as disease-free survival time of CRC patients. To investigate potential function and mechanisms related to hub genes, GO and GSEA analyses were performed. As a result, we found that MYL9 and CNN1 may participate in “pathways in cancer”, “calcium signaling pathway” and “focal adhesion”.
Myosin light chain 9 (MYL9), a protein encoding gene, is able to module the ATPase activity of myosin heads and regulate muscle contraction 39. It is well known that the phosphorylation of MYL9 plays a crucial role during the process of cell migration on solid substrates 40. Studies have detected the expression of MYL9 in various tumors, revealing that it was upregulated in breast cancer as well as liver cancer, while downregulated in prostate cancer and bladder cancer 41-44, however, it is controversial in CRC. Yan et al. reported that MYL9 was downregulated in CRC and lower expression level of MYL9 resulted in a decreased survival rate in CRC patients 45. Zhao et al. found that MYL9 was upregulated in the patients with early-onset CRC 46. In our study, we confirmed that MYL9 was downregulated in CRC samples and patients with low expression level of MYL9 had an increased survival time. Calponin 1 (CNN1) is one of the modulators of actomyosin contraction, which is also known to be involved in cancer development 47, 48. For instance, it has been reported that CNN1 expression could suppress ovarian cancer development 49. As for CRC, CNN1 was demonstrated to be expressed at a higher expression level in normal colon tissue compared to CRC samples 50, which was consistent with our results.
Generally speaking, almost 50% patients may suffer from tumor recurrence within the first year after initial resection, which may be closely correlated to the prognosis. To our knowledge, this is the first study to identify two potential biomarkers, MYL9 and CNN1, which were associated with CRC recurrence by using the WGCNA algorithm. Interestingly, we found that MYL9 and CNN1 were dramatically decreased in CRC samples compared with normal samples in both GSE41258 and GEPIA database, however, our survival analyses showed that higher expression of MYL9 and CNN1 correlated with poorer prognosis of CRC. We speculated that MYL9 and CNN1 may act as tumor suppressor genes in human bodies, however, with the development and progression of CRC, the two hub genes may be captured by tumor cells and turned to be harmful genes, therefore protecting tumor cells becomes the major role of MYL9 and CNN1. From another perspective, it is known that the stromal microenvironment in tumor tissue is different from the stroma of the corresponding normal tissue in many human cancers. Functionally, myosins are implicated in cell migration and adhesion, cells may exert force propelling the cell forward by contraction of actin cytoskeleton by activating of myosin II which is regulated by the phosphorylation of MYL9. As such, CNN1 is thought to play an essential role in organizing stable actin stress fibers. Additionally, GSEA found that the two hub genes were enriched in pathways in cancer. We therefore hypothesized that the phosphorylation of MYL9 or CNN1 is the key of cell migration process on solid substrates in tumor microenvironment but not in the normal stroma microenvironment, leading to the aggressive progression of CRC. Additionally, we also consider that the expressions of MYL9 and CNN1 are downregulated in CRC because of the general dedifferentiation of cancer cells, but those cancer cells with higher expression of MYL9 and CNN1 may be better suited for migration and metastasis compared with lower expressed ones. In this context, it is not surprising to observe that the two hub genes were decreased in CRC but negatively correlated to survival.
In conclusion, we identified two hub genes, MYL9 and CNN1, which were significantly related to the recurrence of CRC and may contribute to the improvement of recurrence-free survival time of CRC patients.
Supplementary Material
Acknowledgments
The work was supported by the Sanming Project of Medicine in Shenzhen (No. SZSM201512006).
References
- 1.Siegel RL, Miller KD, Jemal A. Cancer statistics, 2015. CA Cancer J Clin. 2015;65:5–29. doi: 10.3322/caac.21254. [DOI] [PubMed] [Google Scholar]
- 2.Schmoll HJ, Van Cutsem E, Stein A, Valentini V, Glimelius B, Haustermans K. et al. ESMO Consensus Guidelines for management of patients with colon and rectal cancer. a personalized approach to clinical decision making. Ann Oncol. 2012;23:2479–516. doi: 10.1093/annonc/mds236. [DOI] [PubMed] [Google Scholar]
- 3.van der Stok EP, Spaander MCW, Grünhagen DJ, Verhoef C, Kuipers EJ. Surveillance after curative treatment for colorectal cancer. Nat Rev Clin Oncol. 2017;14:297–315. doi: 10.1038/nrclinonc.2016.199. [DOI] [PubMed] [Google Scholar]
- 4.Jeffery GM, Hickey BE, Hider P. Follow-up strategies for patients treated for non-metastatic colorectal cancer. Cochrane Database Syst Rev. 2016;11:CD002200. doi: 10.1002/14651858.CD002200.pub3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pita-Fernandez S, Alhayek-Ai M, Gonzalez-Martin C, Lopez-Calvino B, Seoane-Pillado T, Pertega-Diaz S. Intensive follow-up strategies improve outcomes in nonmetastatic colorectal cancer patients after curative surgery: a systematic review and meta-analysis. Ann Oncol. 2015;26:644–56. doi: 10.1093/annonc/mdu543. [DOI] [PubMed] [Google Scholar]
- 6.Renehan AG, Egger M, Saunders MP, O'Dwyer ST. Impact on survival of intensive follow up after curative resection for colorectal cancer: systematic review and meta-analysis of randomised trials. BMJ. 2002;324:813. doi: 10.1136/bmj.324.7341.813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Choi JK, Yu U, Yoo OJ, Kim S. Differential coexpression analysis using microarray data and its application to human cancer. Bioinformatics. 2005;21:4348–55. doi: 10.1093/bioinformatics/bti722. [DOI] [PubMed] [Google Scholar]
- 8.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bergmann S, Ihmels J, Barkai N. Similarities and differences in genome-wide expression data of six organisms. PLoS Biol. 2004;2:E9. doi: 10.1371/journal.pbio.0020009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lee HK, Hsu AK, Sajdak J, Qin J, Pavlidis P. Coexpression analysis of human genes across many microarray data sets. Genome Res. 2004;14:1085–94. doi: 10.1101/gr.1910904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Stuart JM, Segal E, Koller D, Kim SK. A gene-coexpression network for global discovery of conserved genetic modules. Science (New York, NY) 2003;302:249–55. doi: 10.1126/science.1087447. [DOI] [PubMed] [Google Scholar]
- 12.Jiang S, Tan B, Zhang X. Identification of key lncRNAs in the carcinogenesis and progression of colon adenocarcinoma by co-expression network analysis. J Cell Biochem. 2019;120:6490–6501. doi: 10.1002/jcb.27940. [DOI] [PubMed] [Google Scholar]
- 13.Shang J, Wang F, Chen P, Wang X, Ding F, Liu S. et al. Co-expression Network Analysis Identified COL8A1 Is Associated with the Progression and Prognosis in Human Colon Adenocarcinoma. Dig Dis Sci. 2018;63:1219–28. doi: 10.1007/s10620-018-4996-5. [DOI] [PubMed] [Google Scholar]
- 14.Liu R, Zhang W, Liu ZQ, Zhou HH. Associating transcriptional modules with colon cancer survival through weighted gene co-expression network analysis. BMC Genomics. 2017;18:361. doi: 10.1186/s12864-017-3761-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sheffer M, Bacolod MD, Zuk O, Giardina SF, Pincas H, Barany F. et al. Association of survival and disease progression with chromosomal instability: a genomic exploration of colorectal cancer. Proc Natl Acad Sci U S A. 2009;106:7131–6. doi: 10.1073/pnas.0902232106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Smith JJ, Deane NG, Wu F, Merchant NB, Zhang B, Jiang A. et al. Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer. Gastroenterology. 2010;138:958–68. doi: 10.1053/j.gastro.2009.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Gautier L, Cope L, Bolstad BM, Irizarry RA. affy-analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004;20:307–15. doi: 10.1093/bioinformatics/btg405. [DOI] [PubMed] [Google Scholar]
- 18. Expression ConsoleTM Software Software 1.3 User Manual.
- 19. GeneChipc Expression Analysis Data Analysis Fundamentals.
- 20. Quality Assessment of Exon and Gene Arrays.
- 21.Botia JA, Vandrovcova J, Forabosco P, Guelfi S, D'Sa K, Hardy J. et al. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks. BMC Syst Biol. 2017;11:47. doi: 10.1186/s12918-017-0420-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4:Article17. doi: 10.2202/1544-6115.1128. [DOI] [PubMed] [Google Scholar]
- 23.Yip AM, Horvath S. Gene network interconnectedness and the generalized topological overlap measure. BMC Bioinformatics. 2007;8:22. doi: 10.1186/1471-2105-8-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabasi AL. Hierarchical organization of modularity in metabolic networks. Science (New York, NY) 2002;297:1551–5. doi: 10.1126/science.1073374. [DOI] [PubMed] [Google Scholar]
- 25.Horvath S, Dong J. Geometric interpretation of gene coexpression network analysis. PLoS Comput Biol. 2008;4:e1000117. doi: 10.1371/journal.pcbi.1000117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Presson AP, Sobel EM, Papp JC, Suarez CJ, Whistler T, Rajeevan MS. et al. Integrated weighted gene co-expression network analysis with an application to chronic fatigue syndrome. BMC Syst Biol. 2008;2:95. doi: 10.1186/1752-0509-2-95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J. et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43:D447–52. doi: 10.1093/nar/gku1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Goel MK, Khanna P, Kishore J. Understanding survival analysis: Kaplan-Meier estimate. Int J Ayurveda Res. 2010;1:274–8. doi: 10.4103/0974-7788.76794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 2017;45:W98–W102. doi: 10.1093/nar/gkx247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545–50. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001;25:402–8. doi: 10.1006/meth.2001.1262. [DOI] [PubMed] [Google Scholar]
- 33.Yuan L, Qian G, Chen L, Wu CL, Dan HC, Xiao Y. et al. Co-expression Network Analysis of Biomarkers for Adrenocortical Carcinoma. Front Genet. 2018;9:328. doi: 10.3389/fgene.2018.00328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chen L, Yuan L, Wang Y, Wang G, Zhu Y, Cao R. et al. Co-expression network analysis identified FCER1G in association with progression and prognosis in human clear cell renal cell carcinoma. Int J Biol Sci. 2017;13:1361–72. doi: 10.7150/ijbs.21657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zhang X, Feng H, Li Z, Li D, Liu S, Huang H. et al. Application of weighted gene co-expression network analysis to identify key modules and hub genes in oral squamous cell carcinoma tumorigenesis. Onco Targets Ther. 2018;11:6001–21. doi: 10.2147/OTT.S171791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gao B, Shao Q, Choudhry H, Marcus V, Dong K, Ragoussis J. et al. Weighted gene co-expression network analysis of colorectal cancer liver metastasis genome sequencing data and screening of anti-metastasis drugs. Int J Oncol. 2016;49:1108–18. doi: 10.3892/ijo.2016.3591. [DOI] [PubMed] [Google Scholar]
- 37.Kawaguchi T, Komatsu S, Ichikawa D, Kubota T, Okamoto K, Shiozaki A. et al. Comparison of prognostic compatibility between seventh AJCC/TNM of the esophagus and 14th JCGC staging systems in Siewert type II adenocarcinoma. Anticancer Res. 2013;33:3461–5. [PubMed] [Google Scholar]
- 38.O'Connell JB, Maggard MA, Ko CY. Colon cancer survival rates with the new American Joint Committee on Cancer sixth edition staging. J Natl Cancer Inst. 2004;96:1420–5. doi: 10.1093/jnci/djh275. [DOI] [PubMed] [Google Scholar]
- 39.Licht AH, Nubel T, Feldner A, Jurisch-Yaksi N, Marcello M, Demicheva E. et al. Junb regulates arterial contraction capacity, cellular contractility, and motility via its target Myl9 in mice. J Clin Invest. 2010;120:2307–18. doi: 10.1172/JCI41749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Sturge J, Wienke D, Isacke CM. Endosomes generate localized Rho-ROCK-MLC2-based contractile signals via Endo180 to promote adhesion disassembly. J Cell Biol. 2006;175:337–47. doi: 10.1083/jcb.200602125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Jurmeister S, Baumann M, Balwierz A, Keklikoglou I, Ward A, Uhlmann S. et al. MicroRNA-200c represses migration and invasion of breast cancer cells by targeting actin-regulatory proteins FHOD1 and PPM1F. Mol Cell Biol. 2012;32:633–51. doi: 10.1128/MCB.06212-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wong CC, Wong CM, Ko FC, Chan LK, Ching YP, Yam JW. et al. Deleted in liver cancer 1 (DLC1) negatively regulates Rho/ROCK/MLC pathway in hepatocellular carcinoma. PloS one. 2008;3:e2779. doi: 10.1371/journal.pone.0002779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Huang YQ, Han ZD, Liang YX, Lin ZY, Ling XH, Fu X. et al. Decreased expression of myosin light chain MYL9 in stroma predicts malignant progression and poor biochemical recurrence-free survival in prostate cancer. Med Oncol. 2014;31:820. doi: 10.1007/s12032-013-0820-4. [DOI] [PubMed] [Google Scholar]
- 44.Lu Y, Liu P, Wen W, Grubbs CJ, Townsend RR, Malone JP. et al. Cross-species comparison of orthologous gene expression in human bladder cancer and carcinogen-induced rodent models. Am J Transl Res. 2010;3:8–27. [PMC free article] [PubMed] [Google Scholar]
- 45.Yan Z, Li J, Xiong Y, Xu W, Zheng G. Identification of candidate colon cancer biomarkers by applying a random forest approach on microarray data. Oncol Rep. 2012;28:1036–42. doi: 10.3892/or.2012.1891. [DOI] [PubMed] [Google Scholar]
- 46.Zhao B, Baloch Z, Ma Y, Wan Z, Huo Y, Li F. et al. Identification of Potential Key Genes and Pathways in Early-Onset Colorectal Cancer Through Bioinformatics Analysis. Cancer Control. 2019;26:1073274819831260. doi: 10.1177/1073274819831260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ogura T, Kobayashi H, Ueoka Y, Okugawa K, Kato K, Hirakawa T. et al. Adenovirus-mediated calponin h1 gene therapy directed against peritoneal dissemination of ovarian cancer: bifunctional therapeutic effects on peritoneal cell layer and cancer cells. Clin Cancer Res. 2006;12:5216–23. doi: 10.1158/1078-0432.CCR-06-0674. [DOI] [PubMed] [Google Scholar]
- 48.Taniguchi S, Takeoka M, Ehara T, Hashimoto S, Shibuki H, Yoshimura N. et al. Structural fragility of blood vessels and peritoneum in calponin h1-deficient mice, resulting in an increase in hematogenous metastasis and peritoneal dissemination of malignant tumor cells. Cancer Res. 2001;61:7627–34. [PubMed] [Google Scholar]
- 49.Yamane T, Asanoma K, Kobayashi H, Liu G, Yagi H, Ohgami T. et al. Identification of the Critical Site of Calponin 1 for Suppression of Ovarian Cancer Properties. Anticancer Res. 2015;35:5993–9. [PubMed] [Google Scholar]
- 50.Drew JE, Farquharson AJ, Mayer CD, Vase HF, Coates PJ, Steele RJ. et al. Predictive gene signatures: molecular markers distinguishing colon adenomatous polyp and carcinoma. PloS one. 2014;9:e113071. doi: 10.1371/journal.pone.0113071. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.