Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2017 Apr 6;12(4):e0175185. doi: 10.1371/journal.pone.0175185

An integrated method for the identification of novel genes related to oral cancer

Lei Chen 1,2,#, Jing Yang 1,#, Zhihao Xing 3, Fei Yuan 4, Yang Shu 3, YunHua Zhang 5, XiangYin Kong 3, Tao Huang 3,*, HaiPeng Li 6,*, Yu-Dong Cai 1,*
Editor: Quan Zou7
PMCID: PMC5383255  PMID: 28384236

Abstract

Cancer is a significant public health problem worldwide. Complete identification of genes related to one type of cancer facilitates earlier diagnosis and effective treatments. In this study, two widely used algorithms, the random walk with restart algorithm and the shortest path algorithm, were adopted to construct two parameterized computational methods, namely, an RWR-based method and an SP-based method; based on these methods, an integrated method was constructed for identifying novel disease genes. To validate the utility of the integrated method, data for oral cancer were used, on which the RWR-based and SP-based methods were trained, thereby building two optimal methods. The integrated method combining these optimal methods was further adopted to identify the novel genes of oral cancer. As a result, 85 novel genes were inferred, among which eleven genes (e.g., MYD88, FGFR2, NF-κBIA) were identified by both the RWR-based and SP-based methods, 70 genes (e.g., BMP4, IFNG, KITLG) were discovered only by the RWR-based method and four genes (L1R1, MCM6, NOG and CXCR3) were predicted only by the SP-based method. Extensive analyses indicate that several novel genes have strong associations with cancers, indicating the effectiveness of the integrated method for identifying disease genes.

Introduction

Cancer is a significant public health problem worldwide. Oral cancer (OC) is a subgroup of head and neck cancer; it develops on the lips, tongue, salivary glands, gingiva, oropharynx, and on buccal surfaces [1]. Oral squamous cell carcinoma (OSCC) accounts for more than 90% of all OC [2]. OC is estimated by the WHO (World Health Organization) to be the eleventh most common cancer in the world, accounting for 300,000 new cases and 145,000 deaths in 2012 [3, 4]. The incidence of OC exhibits significant local variation and continues to be high in India, East Asia, Eastern Europe, and parts of South America [5]. Tobacco and alcohol are the most important risk factors for OC [6]. Poor nutrition, genetic factors and viral infection may be potential risk factors for OC [7, 8].

Carcinogenesis is a multi-step process, and a variety of alterations accumulate, driving and gradually increasing tumorigenesis [9]. During the past decade, a large variety of genomic variations has been implicated in OC. EGFR (epidermal growth factor receptor) amplification and over-expression were found in a large proportion of oral tumors [1012]. ErbB2 amplification and overexpression appear to occur frequently in OC specimens, and high levels of ErbB2 may be related to a worse prognosis of patients [12]. A strong correlation has been detected between c-erbB-2 overexpression and overall survival of patients with oral squamous cell carcinoma [13, 14]. The cyclin family plays a critical role in cell cycle progression. Aberrant up-expression of cyclin D accounts for 36–66% of OC. Amplification and an SNP of cyclin D may be associated with a worse prognosis and susceptibility to OC, respectively [15, 16]. Several lines of evidence suggest that the p53 tumor suppression network is altered in OC [17, 18]. In addition, many other target genes have been reported, such as ras, VEGF (vascular endothelial growth factor), and the MMP (matrix metalloproteinases) family [1921]. To the best of our knowledge, the mechanism that underlies OC is still unclear. A search for new genes related to OC may facilitate earlier diagnosis and effective treatment.

In general, many standard methods have been used for detection of virulence genes. In the hospital and laboratory, cancer samples are sliced into pathological sections and stained to determine the disease pathology type. Quantitative reverse transcription-PCR is typically used to detect the mRNA level of genes in cancer. IHC (immunohistochemical) and western blotting measure the expression of related proteins in tissues and cells, respectively. However, it is difficult to analyze genes synthetically and comprehensively. In addition, some large-scale experiments have been exploited to screen virulence genes, such as microarray, GWAS (genome-wide association study) and NGS (next-generation sequencing). The most common techniques are time-consuming and costly, and thus new methods must be explored to identify tumor genes. In recent years, with the development of computer techniques [2228], some of them can be applied to tackle this problem. Up to now, several computational methods have been proposed to identify disease genes. Many of them are based on guilt-by-association [29], i.e., the assumption that genes are similar to their neighbors in a gene network. Thus, the neighbors of the disease genes are more likely to be disease genes. However, these types of methods are local methods that use only part of the network. Thus, they do not always yield good performance. Many other methods employ the Random Walk with Restart (RWR) algorithm to identify disease genes [3032]. This algorithm simulates a walker starting from a seed node or a set of seed nodes that represent disease genes and random walking on the network. The probability of a node being a disease gene is updated until the probabilities of all nodes become stable. The genes corresponding to nodes with high probabilities are selected as novel candidate disease genes. Recently, computational methods have adopted the shortest path (SP) algorithm to address the problem [3339]. This algorithm assumes that genes lying in the shortest paths connecting any two disease genes may also be disease genes. Clearly, methods based on RWR or SP algorithms take full advantage of the network compared with those based on guilt-by-association. Thus, they can yield clues for the discovery of novel disease genes.

In this study, we used the RWR and SP algorithms to construct two novel computational methods, the RWR-based method and the SP-based method, respectively, for the identification of novel disease genes. Additionally, an integrated method was constructed by combining these two methods. To indicate the effectiveness of the integrated method, disease genes of OC were employed. It has been reported in some studies that using only the RWR algorithm and SP algorithm consistently produces several false discoveries [33, 40], which may be caused by the structure of the network or lack of consideration of the essential properties of genes. Thus, further rules, produced by the permutation test, associations between candidates and the validation of genes according to their properties, were integrated into the RWR-based and SP-based methods. These methods were executed using the corresponding large network that was built using protein-protein interaction (PPI) information and that was trained on validated OC genes to determine the optimal parameters. The obtained optimal methods were used to infer novel genes related to OC, and the integrated method combined the predicted genes using these two optimal methods to yield 85 novel genes. Among them, eleven genes were obtained by both methods, 70 genes were obtained only by the RWR-based method and four genes were obtained only by the SP-based method. According to the analyses, several genes show stimulative or suppressive effects on cancers by experiments or have a certain relationship with cancers reported in published papers, indicating the utility of the integrated method. It is also clear that the integrated method can provide more comprehensive analysis of various diseases because it is capable of producing more possible disease genes than the RWR-based method or the SP-based method.

Materials and methods

Genes related to oral cancer

The OC-related genes were collected from the following three sources: (1) 44 genes were retrieved from UniProt (http://www.uniprot.org/, accessed in July, 2015) [41] after ‘human oral cancer reviewed’ was input as a keyword; (2) seven genes were chosen from the catalogue of oral cancer from the TSGene (https://bioinfo.uth.edu/TSGene/, accessed in July, 2015) database; and (3) 156 genes were retrieved from the NCI (National Cancer Institute, https://gforge.nci.nih.gov, accessed in July, 2015) database using ‘Homo sapiens’ as a keyword. After combining the OC-related genes mentioned above, 202 OC-related genes were finally obtained; these genes are provided in S1 Table. Because our methods are based on the network constructed from the PPI information retrieved from Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) [42], these 202 OC-related genes were mapped into their Ensembl IDs, and those not occurring in the network were discarded. One hundred eighty-four Ensembl IDs of OC-related genes were accessed and composed the set Soc.

Protein-protein interactions

PPIs play important roles in several intracellular and intercellular biochemical processes. The known PPI information has been widely used to investigate several protein-related problems, such as protein function prediction [43, 44] and disease gene identification [33, 34, 36, 39]. Based on the results reported in these previous studies, it can be concluded that two proteins that can interact with each other always have a functional relationship. In this study, we attempted to discover novel candidate OC-related genes using validated OC-related genes. Thus, we can utilize PPIs to search for proteins that have a functional relationship with proteins encoded by validated OC-related genes, thereby mining novel OC-related genes.

The PPI information was retrieved from STRING (http://string-db.org/, version 9.1) [42], a well-known online public database that contains known and predicted protein interactions. The interactions reported in STRING are derived from the following four types of sources: (I) Genomic Context; (II) High-throughput Experiments; (III) (Conserved) Coexpression; (IV) Previous Knowledge, which include direct (physical) and indirect (functional) associations between proteins and thus offer more chances to mine hidden protein information. From the database, a file called ‘protein.links.v9.1.txt.gz‘ was retrieved, which contained 2,425,314 human PPIs. Each PPI was represented by two Ensembl IDs and a score ranging between 150 and 999, which indicates the strength of the interaction; i.e., proteins in interactions with higher scores are regarded as more likely to interact with each other. Let us denote the score of a PPI between proteins p1 and p2 by S(p1, p2).

RWR-based method

RWR is a ranking algorithm [30]. In a network, this algorithm simulates a walker starting from a seed node or a set of seed nodes and random walking on the network. In each step, the probability of the walker walking to each node is updated, and it stabilizes after several steps. Nodes in the network are ranked by the final probabilities assigned to them. Based on the validated disease genes (used as seed nodes), the RWR algorithm can be used to discover novel disease genes by investigating novel genes with high ranks. This algorithm has therefore been used to mine novel disease genes in recent years [3032]. However, the RWR algorithm consistently provides false discoveries [40], supporting the need for additional screening rules. In this study, we used the RWR algorithm as the basic algorithm and added some rules to build the present RWR-based learning method for the identification of novel OC-related genes. Furthermore, to obtain a better RWR-based method, some parameters were employed that are determined by training the method.

Network construction for RWR algorithm

An accurate network is important for the identification of novel OC-related genes. Here, we adopted the PPI information mentioned in the Section “Protein-protein interactions” to build the network. The constructed network defined the 20,770 proteins occurring in the 2,425,314 human PPIs as nodes, and two nodes were adjacent if and only if the corresponding proteins could interact with each other. It can be observed that each edge represented a PPI. To employ the interaction score in the network, each edge was assigned a weight that was defined as the score of the corresponding PPI. For convenience, let us denote the constructed network by NRWR.

Searching for new candidate genes using the RWR algorithm

As mentioned in the Section “Genes related to oral cancer”, a set Soc, consisting of 184 Ensembl IDs of OC-related genes was used in this study. The RWR algorithm simulated a random walker starting from Soc. Before executing the RWR algorithm on NRWR, each node in NRWR was assigned a probability, with 1/184 set for the Ensembl IDs in Soc and 0 set for the remaining nodes. The initial probability of each node constituted the probability vector P0. The RWR algorithm updated the probability vector in each step. Let Pt denote the probability vector after performing the t-th step, which can be updated by

Pt+1=(1r)ATPt+rP0 (1)

where r is set to 0.8 and A is the column-wise normalized adjacency matrix of NRWR. The update procedure was repeated until the change between Pt and Pt+1, measured by L1 norm, was less than 1e-6. Ensembl IDs with high probabilities were considered to be encoded by OC-related genes. Because we did not know which probability was a suitable threshold for selecting candidate genes, we established a parameter, pRWRp, for the threshold, which can be determined by training the RWR-based method.

SP-based method

The SP algorithm is a classic graph algorithm. In recent years, some investigators have applied the SP algorithm for the identification of disease genes [3339]. The new candidate genes were extracted from the shortest paths connecting any two validated disease genes. Here, we built an SP-based machine learning method to identify novel OC-related genes. Like the RWR-based method, further screening rules were established to discard non-essential candidate genes and select important ones.

Network construction for SP algorithm

Similar to NRWR for the RWR-based method, we also constructed the network NSP. As mentioned in the Section “Network construction for RWR-based method”, each edge in NRWR represented a PPI. In addition, each PPI had an interaction score as described in the Section “Protein-protein interactions”. This score was used to define the weight of the edge. The range of the interaction score was between 150 and 999, and an edge with a low weight indicated strong correlations between its endpoints in the SP-based model. Thus, for an edge e with endpoints n1 and n2, weight was defined in the following manner:

w(e)=1000S(p1,p2) (2)

where p1 and p2 were two corresponding proteins of nodes n1 and n2. The constructed network NSP consisted of 20,770 nodes and 2,425,314 edges.

Searching for new candidate genes using the SP algorithm

The SP algorithm, Dijkstra’s algorithm [45], was applied to the network NSP to search for all of the shortest paths connecting any two Ensembl IDs in Soc. From the obtained shortest paths, we extracted the inner nodes that did not represent validated OC-related genes. Genes corresponding to the extracted nodes were considered to be related to OC and denoted candidate genes. In addition, each candidate gene was assigned a measurement, called betweenness, which was defined as the number of shortest paths within which it was contained.

Screening rules

The RWR and SP algorithms can produce a number of candidate genes for OC when the probability threshold is provided for the RWR algorithm. However, several false discoveries are inevitable, as mentioned in previous studies [33, 40]. To screen out these false discoveries, a series of screening rules were added to this section and integrated into the RWR-based and SP-based methods. Some parameters were employed to build the rules. Their optimal values in RWR-based and SP-based methods are determined by training these two methods, respectively.

The probability that a candidate gene produced by the RWR algorithm was influenced by the structure of the network NRWR and the betweenness for a candidate gene produced by the SP algorithm was also clearly influenced by the structure of the NSP network. Some candidate genes receiving high probabilities (betweenness) were not specific to OC. Thus, a permutation test was designed that first randomly constructed 1,000 sets of Ensembl IDs such that each set had the same size of Soc, denoted by S1, S2,⋯, S1000. For each set, the RWR algorithm and SP algorithm were executed on NRWR and NSP, respectively, by setting the Ensembl IDs in the set as the input, yielding a probability for each candidate gene produced by the RWR algorithm and a betweenness for each candidate gene produced by the SP algorithm. Finally, for each candidate gene, there was one probability (betweenness) for Soc and 1,000 probabilities (betweenness) for S1, S2,⋯,S1000. If the candidate gene was specific to OC, its probability (betweenness) for Soc should clearly be larger than most probabilities (betweenness) for S1, S2,⋯,S1000. Thus, we calculated the p-value for each candidate gene g, which was defined by

pvalue(g)=i=11000δi1000 (3)

where δi = 1 if the probability (betweenness) for Si was larger than that of Soc; δi = 0 otherwise. Because 0.05 was always used as an important cut-off for the significance level of the test, it was set to be the threshold of the p-value; i.e., candidate genes with p-values less than 0.05 were selected.

The network-based method is useful for complicated problems. However, this type of method seldom considers the essential properties of nodes in the network, leading to several false discoveries. Rules utilizing the essential properties of candidate and validated genes are needed to exclude false discoveries and select important ones.

As mentioned in the Section “Protein-protein interactions”, two proteins that can interact with each other always have a functional relationship. Furthermore, considering the interaction scores, proteins in an interaction with a high score are more likely to have a strong functional relationship than those in an interaction with a low score. Thus, the interaction score can be used to measure the associations between candidate genes and OC-related genes. For each candidate gene g, we calculated the maximum interaction score (MIS), which was defined by

MIS(g)=max{S(g,g):gSoc} (4)

Clearly, a candidate gene with a high MIS is more likely to be a novel OC-related gene and thus should be selected. However, the threshold of MIS was not easy to determine. Thus, it was set as a parameter, pMIS. In the RWR-based method, the optimal value of pMIS is determined by training this method. In addition, the optimal value of pMIS in the SP-based method is also accessed by training the SP-based method.

The MIS measures the associations between candidate genes and OC-related genes. The following measurement evaluates the associations between them in another way. It is known that OC-related genes must be highly related to some gene ontology (GO) terms or biological pathways. Thus, we can use the annotated GO term and KEGG pathway information for candidate genes and OC-related genes to evaluate their correlations. To achieve this goal, each candidate gene or OC-related gene was encoded by its GO enrichment scores and KEGG enrichment scores. For a candidate gene g and an OC-related gene g′, their associations can be measured by

M(g,g)=ES(g)ES(g)ES(g)ES(g) (5)

where ES(g) (ES(g′), respectively) is a vector consisting of the GO enrichment scores and KEGG enrichment scores of g (g′, respectively). A high value for Eq 5 indicated a strong association. Similar to the definition of MIS, we calculated the maximum function score (MFS) of each candidate gene g by

MFS(g)=max{M(g,g):gSoc} (6)

Similarly, a candidate gene with a high MFS might be a novel OC-related gene with a high probability and should be selected. Additionally, it was difficult to determine the threshold of MFS. The parameter pMFS was also established for this threshold. Similar to parameter pMIS mentioned above, the optimal value of pMFS in the RWR-based and SP-based methods is determined by training these two methods, respectively.

Integrated method

The RWR-based and SP-based methods, together with the screen rules mentioned in the Section “Screening rules”, were constructed. The pseudo-codes of these two methods are listed in Tables 1 and 2, respectively. The integrated method encompassed these two methods by combining their results.

Table 1. The pseudo-code of the RWR-based method.

RWR-based method
Input: An OC-related gene set, Soc; a network, NRWR
Output: A number of putative OC-related genes
1. Execute the RWR algorithm on NRWR using Soc as the input, producing a probability for each gene in NRWR; select candidate genes with a probability higher than pRWRp;
2. Execute the permutation test, producing the p-value for each gene; select candidate genes with a p-value less than 0.05;
3. For each candidate gene, calculate its MIS and select candidate genes with an MIS no less than pMIS;
4. For each candidate gene, calculate its MFS and select candidate genes with an MFS larger than pMFS;
5. Output the remaining candidate genes as the putative OC-related genes.

Table 2. The pseudo-code of the SP-based method.

SP-based method
Input: An OC-related gene set, Soc; a network, NSP
Output: A number of putative OC-related genes
1. Execute the SP algorithm on NSP using Soc as the input, extracting candidate genes lying on the obtained shortest paths;
2. Execute the permutation test, producing the p-value for each candidate gene; select candidate genes with p-values less than 0.05;
3. For each candidate gene, calculate its MIS and select candidate genes with an MIS no less than pMIS;
4. For each candidate gene, calculate its MFS and select candidate genes with an MFS larger than pMFS;
5. Output the remaining candidate genes as the putative OC-related genes.

Evaluation methods

The RWR-based and SP-based methods were applied to identify novel OC-related genes. However, there are some parameters in these two methods that must be determined before they are used to identify novel OC-related genes. Thus, these two methods were trained on the validated OC-related gene set Soc, through which the optimal parameters can be determined. We used the jackknife test [46, 47], which is one of the classic cross-validation methods [48, 49], to evaluate the performance of these two methods, i.e., each OC-related gene in Soc was singled out sequentially, and the remaining genes in Soc were used to generate predictions under various combinations of parameters. When training the methods, we supposed that all genes in the network other than the validated OC-related genes were negative; i.e., they were not OC-related genes. The performance of the method can be measured according to the following two features: (1) whether the selected OC-related gene can be recovered by executing the method on the remaining OC-related genes; (2) the predicted genes other than the selected OC-related gene should be as low as possible. Thus, we considered the following two measurements: precision and recall, which are always used to evaluate the performance of the methods on a binary classification problem in the fields of pattern recognition and information retrieval. Recall is defined as the proportion of retrieved OC-related genes among all OC-related genes, and precision represents the proportion of retrieved OC-related genes among all retrieved genes. Furthermore, another measurement, the F1-measure, is often used to evaluate overall performance, and it can be calculated by

F1-measure=2recallprecisionrecall+precision (7)

Eq 7 shows that recall and precision have the same role. However, in this study, we considered recall to be more important than precision because the method with low recall and high precision could not reliably produce the predicted results. Thus, we simply revised Eq 7 and defined a new measurement, namely, F1-measure-R, which can be computed by

F1-measure-R=recallF1-measure=2recall2precisionrecall+precision (8)

Because the retrieved genes differ when the selected OC-related gene was not the same, the recall, precision and F1-measure-R must be considered each time for the predicted results. Thus, under a combination of parameters, the RWR-based and SP-based methods can produce a series of recall, precision and F1-measure-R values. We calculated the average values to indicate the performance of the method under this combination of parameters. For convenience, the recall, precision and F1-measure-R presented in the rest of this paper represent the average values.

Results

Optimized parameters for the RWR-based and SP-based methods

As mentioned in the Section “RWR-based method”, “SP-based method” and “Screening rule”, some parameters should be optimized for the RWR-based method and the SP-base method. To extract an optimal combination of parameters for each method, the RWR-based method and the SP-base method were trained on Soc, and their performance was evaluated by the Jackknife test.

For the RWR-based method, three parameters, the threshold of probability pRWRp, the threshold of MIS pMIS, and the threshold of MFS pMFS, should be optimized. For pRWRp, we tried various values ranging from 1E-05 to 1E-04; for pMIS, we tried three values, 400, 700, 900, which are reported in STRING for thresholds of medium confidence, high confidence and highest confidence, respectively; for pMFS, we tried various values ranging from 0 to 0.9. The measurements mentioned in the Section “Evaluation methods” for the RWR-based method with different combinations of parameters are listed in S2 Table. For ease of observation, the values of F1-measure-R obtained using the RWR-based method with different parameters are illustrated in Fig 1; we can see the same level of performance when the parameters pRWRp and pMFS are equivalent. The maximum F1-measure-R was 1.677E-03 when the parameters were set to be pRWRp=0.00006, pMFS = 0.8 and pMIS = 400 or 700. Because this combination of parameters yielded the best performance, they were used to build the optimal RWR-based method, which is adopted to identify novel OC-related genes.

Fig 1. The performance of the RWR-based method under different combinations of parameters.

Fig 1

(a) The performance of the RWR-based method setting pMIS = 400. (b) The performance of the RWR-based method setting pMIS = 700. (c) The performance of the RWR-based method setting pMIS = 900.

The SP-based method was also trained to extract the optimal parameters for pMIS and pMFS. We tested three values for pMIS, as mentioned in the above paragraph, and various values ranging from 0 to 0.9 for pMFS. For the results obtained using the SP-based method with different combinations of parameters, the measurements mentioned in the Section “Evaluation methods” were counted and are provided in S3 Table. Additionally, three curves are plotted in Fig 2 to show the values of F1-measure-R obtained using the SP-based method with different values of pMFS and a fixed value of pMIS. When the values of pMFS were small, the values of F1-measure-R were proportional to the value of pMIS. However, the values of F1-measure-R were almost the same when pMFS were large. The maximum F1-measure-R was 2.693E-04 when the parameters were set to be pMFS = 0.8 and pMIS = 400 or 700 or 900. Similarly, we used these values to build the optimal SP-based method for the identification of novel OC-related genes.

Fig 2. The performance of the SP-based method under different combinations of parameters.

Fig 2

There are three lines in this figure, which represent the performance of the SP-based method with different thresholds of maximum interaction score. In detail, the full line represents the performance of the SP-based method with the threshold of maximum interaction score 900, the dot line represents the performance of the SP-based method with the threshold of maximum interaction score 700, the dash line represents the performance of the SP-based method with the threshold of maximum interaction score 400.

Inferred results of the RWR-based method

As mentioned in the Section “Optimized parameters for the RWR-based and SP-based methods”, the optimal RWR-based method was built using pRWRp=0.00006, pMFS = 0.8 and pMIS = 400 or 700. This method was further used to identify novel OC-related genes based on all validated OC-related genes mentioned in the Section “Genes related to oral cancer”. Because the optimal RWR-based method contains two options for the parameter pMIS, this method was executed twice. First, pMIS was set to 400, while in the second iteration, it was set to 700. The genes identified using this method with a different parameter, pMIS, are listed in S4 Table. A careful review of the results showed that the identified genes produced using these two optimal RWR-based methods were equivalent. They all identified 81 novel genes. Because all of these genes were viewed as high probability by the RWR method and had strong associations with validated OC-related genes, we believe that they are highly related to OC, and they are termed putative OC-related genes.

Inferred results of the SP-based method

The optimal SP-based method was built in the Section “Optimized parameters for the RWR-based and SP-based methods”, in which pMFS was set to 0.8 and pMIS was set to 400, 700 or 900. This method was also adopted to identify novel OC-related genes. By setting pMIS to 400, 700 or 900, we can build three optimal SP-based methods. They were executed sequentially to identify novel OC-related genes. Similarly, they all yielded fifteen identical genes, which are listed in Table 3. It is believed that these genes are closely related to OC, and they are termed putative OC-related genes.

Table 3. Genes identified by the optimal SP-based method.

Ensembl ID Gene symbol Betweenness P-value MIS MFS Function
ENSP00000354394 STAT1 b 1443 <0.001 999 0.852 functions as a key factor in cell viability in response to different cell stimuli and pathogens [125]
ENSP00000263341 IL1B b 543 <0.001 994 0.873 a member of the interleukin 1 cytokine family
ENSP00000379625 MYD88 a 528 0.006 999 0.880 an essential signal transducer in the IL1 and Toll-like receptor signaling pathways [126, 127]
ENSP00000233946 IL1R1 b 528 0.001 920 0.843 interleukin 1 receptor type 1 [128]
ENSP00000216797 NFKBIA b 201 0.018 999 0.825 a member of the NF-kappa-B inhibitor family, which is involved in inflammatory responses [61]
ENSP00000222382 CYP3A43 b 183 <0.001 958 0.988 a member of the cytochrome P450 superfamily of enzymes [129]
ENSP00000328181 NOG b 183 0.005 999 0.862 binds and inactivates members of the TGF-beta superfamily signaling proteins [130]
ENSP00000410294 FGFR2 a 183 0.01 999 0.846 a tyrosine protein kinase that functions as a receptor for fibroblast growth factors and plays key roles in cell proliferation, differentiation, migration and apoptosis [131]
ENSP00000362795 CXCR3 a 179 0.021 999 0.808 a G protein-coupled receptor with selectivity for chemokines [132, 133]
ENSP00000260356 THBS1 a 10 0.033 984 0.807 an adhesive glycoprotein that mediates cell-cell and cell-matrix interactions [134, 135]
ENSP00000264156 MCM6 b 8 0.014 999 0.822 be involved in the formation of replication forks [136]
ENSP00000301141 CYP2A6 b 3 0.016 950 0.948 a member of the cytochrome P450 superfamily of enzymes [129]
ENSP00000331736 SELE b 1 0.006 978 0.852 responsible for the accumulation of blood leukocytes at sites of inflammation [67]
ENSP00000168712 FGF4 b 1 0.016 999 0.847 fibroblast growth factor 4 which are involved in various biological processes such as cell growth and morphogenesis
ENSP00000286758 CXCL13 b 1 0.005 992 0.838 C-X-C motif chemokine ligand 13

a: Genes that have shown stimulative or suppressive effects on cancer as validated by experiments.

b: Genes that have been reported to have a certain relationship with cancer but that have not been validated by experiments.

Inferred results of the integrated method

As mentioned in the Sections “Inferred results of the RWR-based method” and “Inferred results of the SP-based method”, the RWR-based method yielded 81 putative genes, and the SP-based method yielded fifteen putative genes. The union of these two putative gene sets provided the results of the integrated method, in which 85 putative genes were obtained. Their distribution is illustrated in Fig 3; we can see that eleven putative genes were identified by both the RWR-based and the SP-based methods. Among the remaining putative genes, 70 were identified using the RWR-based method, and four putative genes were identified using the SP-based method. Because the principles of the RWR-based and SP-based methods are very different, the identified novel OC-related genes were not the same. By considering the identified genes produced by either of them, we can obtain more putative genes and have an opportunity to extensively study disease genes in OC. To indicate the obtained 85 putative genes are highly related to OC, we extracted a sub-network containing these putative genes and OC-related genes from NRWR and NSP as shown in Fig 4.

Fig 3. The distribution of the 85 putative OC-related genes obtained in this study.

Fig 3

The blue part represents the set consisting of 70 putative genes obtained using only the RWR-based method. The red part represents the set consisting of 11 putative genes obtained using both the RWR-based and SP-based methods. The green part represents the set consisting of 4 putative genes obtained using only the SP-based method.

Fig 4. The sub-network containing the putative genes and OC-related genes that was extracted from the network for RWR-based and SP-based methods.

Fig 4

The blue nodes represent putative genes and red nodes represent OC-related genes.

Discussion

Using the RWR-based method, 81 genes were obtained. Using the SP-based method, fifteen genes were accessed. All of these genes were deemed to be significantly associated with OC. Furthermore, eleven genes were identified by both methods, which may be more important than the others. In this section, some important putative genes (listed in Tables 46) were extensively analyzed to confirm their associations with OC.

Table 4. Eleven putative genes identified using both RWR-based and SP-based methods.

Ensembl ID Gene symbol RWR-based method SP-based method Function
Probability P-value MIS MFS Betweenness P-value MIS MFS
ENSP00000379625 MYD88 a 6.67E-05 0.032 999 0.880 528 0.006 999 0.880 an essential signal transducer in the IL1 and Toll-like receptor signaling pathways [126, 127]
ENSP00000410294 FGFR2 a 9.04E-05 0.021 999 0.846 183 0.01 999 0.846 a tyrosine protein kinase that functions as a receptor for fibroblast growth factors and plays key roles in cell proliferation, differentiation, migration and apoptosis [131]
ENSP00000216797 NFKBIA b 7.56E-05 0.026 999 0.825 201 0.018 999 0.825 a member of the NF-kappa-B inhibitor family, which is involved in inflammatory responses [61]
ENSP00000331736 SELE b 9.73E-05 <0.001 978 0.852 1 0.006 978 0.852 responsible for the accumulation of blood leukocytes at sites of inflammation [67]
ENSP00000260356 THBS1 a 6.94E-05 <0.001 984 0.807 10 0.033 984 0.807 an adhesive glycoprotein that mediates cell-cell and cell-matrix interactions [134, 135]
ENSP00000354394 STAT1 b 4.36E-04 <0.001 999 0.852 1443 <0.001 999 0.852 functions as a key factor in cell viability in response to different cell stimuli and pathogens [125]
ENSP00000301141 CYP2A6 b 8.85E-05 <0.001 950 0.948 3 0.016 950 0.948 a member of the cytochrome P450 superfamily of enzymes[129]
ENSP00000222382 CYP3A43 b 3.35E-04 <0.001 958 0.988 183 <0.001 958 0.988 a member of the cytochrome P450 superfamily of enzymes [129]
ENSP00000286758 CXCL13 b 6.63E-05 0.006 992 0.837 1 0.005 992 0.838 C-X-C motif chemokine ligand 13
ENSP00000168712 FGF4 b 8.21E-05 0.001 999 0.847 1 0.016 999 0.847 fibroblast growth factor 4 which are involved in various biological processes such as cell growth and morphogenesis
ENSP00000263341 IL1B b 1.92E-04 <0.001 994 0.873 543 <0.001 994 0.873 a member of the interleukin 1 cytokine family

a: Genes that have shown stimulative or suppressive effects on cancer as validated by experiments.

b: Genes that have been reported to have a certain relationship with cancer but that have not been validated by experiments.

Table 6. Four putative genes identified using the SP-based method.

Ensembl ID Gene symbol Betweenness P-value MIS MFS Function
ENSP00000264156 MCM6 b 8 0.014 999 0.822 be involved in the formation of replication forks [136]
ENSP00000328181 NOG b 183 0.005 999 0.862 binds and inactivates members of the TGF-beta superfamily signaling proteins [130]
ENSP00000362795 CXCR3 a 179 0.021 999 0.808 a G protein-coupled receptor with selectivity for chemokines [132, 133]
ENSP00000233946 IL1R1 b 528 0.001 920 0.843 interleukin 1 receptor type 1 [128]

a: Genes that have shown stimulative or suppressive effects on cancer as validated by experiments.

b: Genes that have been reported to have a certain relationship with cancer but that have not been validated by experiments.

Putative genes identified using both RWR-based and SP-based methods

The following eleven proteins were identified using both RWR-based and SP-based methods: CYP3A43, FGF4, NFKBIA, THBS1, IL1B, CXCL13, CYP2A6, SELE, STAT1, MYD88 and FGFR2, as listed in Table 4.

MYD88 (myeloid differentiation primary response 88) encodes a cytosolic adapter protein that functions as a key signal transducer in the interleukin-1 and Toll-like receptor (TLRs) signaling pathways. TLRs and their ligands play a crucial role in inflammation and host defense [50]. In a broad variety of tumor tissues and cell lines, aberrant expression of TLRs plays a role in tumor immune escape or resistance to apoptosis [51, 52]. The expression of TLR4 and MyD88 is also aberrant and affects the down-stream signaling pathway [53]. In this study, MyD88 was predicted using both RWR-based and SP-based methods, and thus it may be a potential target for OC.

FGFR2 (fibroblast growth factor receptor 2) belongs to the FGFR tyrosine kinase family, which is one of the most frequently altered kinase families in some types of cancer [54]. Point mutations have been observed in 12% of endometrial carcinomas [55]. These studies suggest that FGFR2 may act together with the regulation of the PI3K/AKT/mTOR pathway to drive endometrial cancer growth in a subset of patients [56, 57]. FGFR2 mutations also occur in oral squamous cell carcinoma, and a patient with OSCC was found to respond to pazopanib, a multiple tyrosine kinase inhibitor [58]. These findings support the potential diagnostic and therapeutic value of FGFR2 as an effective strategy for the treatment of patients with OC with defined molecular characteristics. In addition, our results showed that FGF4 had a significant probability in both methods. Some large-scale experiments have suggested that aberrant amplification of FGF4 occurs in several types of cancer, including lymph node metastasis and urinary bladder cancer [59, 60].

NF-κBIA (nuclear factor-kappa B inhibitor alpha) inhibits NF-κB, which is involved in inflammatory responses and is a hallmark linking inflammation to tumor development and progression [61, 62]. The polymorphic variations in NFKBIA were associated with the risk of various cancers, including gastric cancer, prostate cancer and melanoma [6365]. NFKBIA polymorphisms have significant associations with OSCC [66]. In our study, NFKBIA had a close relationship with OC using both methods.

SELE (selectin E), which was found in cytokine-stimulated endothelial cells, functions in the accumulation of blood leukocytes at sites of inflammation [67]. The level of SELE-mediated adhesion of colon cancer and head and neck squamous cell cancer cells to the endothelium has been implied to be involved in metastasis [68, 69]. Our analysis revealed that SELE might be a putative marker for tumorigenesis in OC. A study in Taiwan supports the idea that SELE-related inflammation plays a crucial role in the pathogenesis process of OSCC [70]. Future investigations of the function of SELE may clarify the mechanism of tumorigenesis and metastasis.

THBS1 (thrombospondin 1) is a subunit of a disulfide-linked homotrimeric protein, which mediates cell-cell and cell-matrix interactions [71]. The significant relationship between THBS1 and OC was detected using both RWR-based and SP-based methods. Several studies have indicated that THBS1 has crucial functions in oral tumorigenesis [72, 73]. THBS1 might be a potential diagnostic and therapeutic target for OC.

STAT1 (signal transducer and activator of transcription 1), which belongs to the STAT protein family, mediates the expression of a variety of genes and cell viability in response to stimuli and pathogens [74]. Aberrant activation of STAT1 has frequently been found in various cancers, such as head and neck cancer [75]. A similar STAT1 activation status was detected in patients with OSCC [76]. In this study, STAT1 displayed a significant association with OC using both the RWR-based and SP-based methods, and this molecular marker could help in guiding diagnostic and therapeutic decisions in patients with OC.

Among these eleven putative genes, including genes such as MYD88, FGFR2 and THBS1, stimulative or suppressive effects on cancers have been shown by the experiments. We speculate that these three genes have certain functions in OC. In addition, it has been reported that other genes, including NFKBIA, SELE, STAT1, IL1B, CYP2A6, CYP3A43, FGF4 and CXCL13 [7780], have a certain relationship with various forms cancer. However, the mechanism has not been thoroughly studied. In our analysis, they have a significant relationship with OC, and the mechanism should be explored.

Putative genes identified only by the RWR-based method

Seventy proteins were identified only by the RWR-based method. The important genes are BMP4, CCL2, CCL20, CCL5, CCR7, IFNG, KITLG, TNFRSF1A, PIK3CB, PIK3CD, PIK3CG, CYP2B6, CYP2C19, CYP2J2, CYP3A4, CYP4X1 and RAC3; these genes are listed in Table 5. Some of them participate in tumorigenesis by regulating cell growth or apoptosis, such as BMP4 and RAC3. Some genes are important factors in the immune system of cancer patients, such as CCL2, CCL20, CCL5, CCR7, IFNG and P450 family. Detailed function analyses of candidate genes are shown in S1 File.

Table 5. Important genes among the seventy putative genes identified using the RWR-based method.

Ensembl ID Gene symbol Probability P-value MIS MFS Function
ENSP00000245451 BMP4 a 9.57E-05 0.026 981 0.905 bind TGF-beta receptor leading to recruitment and activation of transcription factor [137]
ENSP00000225831 CCL2 b 1.21E-04 0.012 984 0.869 C-C motif chemokine ligand 2
ENSP00000351671 CCL20 b 6.69E-05 0.003 965 0.804 C-C motif chemokine ligand 20
ENSP00000293272 CCL5 a 7.70E-05 0.002 994 0.891 C-C motif chemokine ligand 5
ENSP00000292303 CCR5 a 1.01E-04 0.003 996 0.839 C-C motif chemokine receptor 5
ENSP00000246657 CCR7 a 9.82E-05 <0.001 998 0.823 C-C motif chemokine receptor 7
ENSP00000229135 IFNG a 1.70E-04 0.03 994 0.839 binds to the interferon gamma receptor to response to infection [138]
ENSP00000228280 KITLG a 1.05E-04 0.026 948 0.816 the ligand of the tyrosine-kinase receptor
ENSP00000162749 TNFRSF1A b 9.69E-05 0.013 999 0.825 a member of the TNF receptor superfamily which plays a role in various biological processes
ENSP00000289153 PIK3CB a 1.23E-04 <0.001 997 0.926 an isoform of the catalytic subunit of PI3K
ENSP00000366563 PIK3CD b 1.15E-04 <0.001 997 0.919 PI3Ks phosphorylate inositol lipids and it is involved in the immune response [139]
ENSP00000352121 PIK3CG a 1.21E-04 <0.001 996 0.921 phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit gamma
ENSP00000324648 CYP2B6 b 9.73E-05 <0.001 962 0.943 a member of the cytochrome P450 superfamily
ENSP00000360372 CYP2C19 b 9.32E-05 <0.001 962 0.966 cytochrome P450 family 2 subfamily C member 19
ENSP00000360247 CYP2J2 a 7.48E-05 0.004 912 0.969 cytochrome P450 family 2 subfamily J member 2
ENSP00000337915 CYP3A4 b 3.32E-04 <0.001 963 0.941 cytochrome P450 family 3 subfamily A member 4
ENSP00000360968 CYP4X1 b 7.43E-05 0.018 939 0.959 cytochrome P450 family 4 subfamily X member 1
ENSP00000304283 RAC3 a 1.08E-04 0.006 990 0.981 a GTPase regulates cell growth, cytoskeletal reorganization, and the activation of kinases [140142]

a: Genes that have shown stimulative or suppressive effects on cancer as validated by experiments.

b: Genes that have been reported to have a certain relationship with cancer but that have not been validated by experiments.

We identified several novel putative genes using only the RWR-based method, which have strong associations with the tumorigenesis of OC. The functions of some genes have been explored in experiments, such as BMP4, IFNG, KITLG, CCL5, CCR5, CCR7, CYP2J2, PIK3CB, PIK3CG and RAC3; others revealed mutations or aberrant expression in cancers, but the mechanism is not clear. Future research is required to replicate and validate the effects of these genes.

Putative genes identified only by the SP-based method

Four proteins were predicted to be closely associated with OC using the SP-based method but not the RWR-based method. These genes were IL1R1, MCM6, NOG and CXCR3; they are listed in Table 6. It has been reported that CXCR3 can promote metastasis in some types of cancer [8186]. Other genes showed aberrant expression or mutations in cancers, which merits attention. Detailed function analyses of candidate genes are shown in S1 File.

Based on the analyses of the novel genes obtained by the integrated method, we found that several genes have shown stimulative or suppressive effects on cancers by experiments or aberrant expression or mutations in cancers reported in published papers, implying the effectiveness of the integrated method. In addition, the integrated method can provide more potential disease genes for investigating OC because it combines the results yielded by the RWR-based and SP-based methods; i.e., simultaneous usage of the RWR-based and SP-based method can help us mine more information about disease genes in oral cancer. We believe that it is helpful to investigate various diseases using both the RWR-based and the SP-based methods.

GO-term and pathway function enrichment analysis of putative genes using DAVID

By the integrated method, 85 putative genes were obtained. To uncover the biological meaning behind these genes, the functional annotation tool, the Database for Annotation, Visualization and Integrated Discovery (DAVID) [87], was adopted to analyze them. The obtained results are provided in S5 Table.

For the GO enrichment results yielded by DAVID (see S5 Table), we can see that seven-two biological process (BP) GO terms, fifteen molecular function (MF) GO terms and nine cellular component (CC) GO terms are with statistically significance (FDR<0.05). The biological process mostly focused on cell proliferation, apoptosis, transcription and signal transduction. Malignant proliferation and anti-apoptosis are the major characteristics of many cancers. In this study, it has been found many proliferation related genes such as EGF, IGF1 and CDKN1B. EGF (epidermal growth factor) and its receptor EGFR affect cellular processes like proliferation, motility and adhesion. EGF has a higher expression and is correlated with the progression of cancer such as breast cancer [88]. The circulating IGF1 (insulin-like growth factor-1) level is associated with the risk to develop breast cancer [89, 90]. CDKN1B functions as a key cell cycle gatekeeper to prevent or slow down cell division [91, 92]. Aberrant downregulating CDKN1B may promote the proliferation of multiple myeloma cells [93]. Un-controlled expanded number of cancers is determined not only by cell proliferation but also by the cell evading apoptosis. In our study, many genes are enriched in apoptosis process such as TP53, CASP3 and cmyc. The mutations of TP53 are the most common alteration in cancer and are related to cell apoptosis, cell cycle and malignancy [9497]. TP53 could arrest cell cycle and induced cell apoptosis in response to DNA damage [98]. It has been reported that over-expression of c-Myc drives the level of BAX and other apoptosis-related genes and is involved in cell apoptosis [99103]. CASPs is a kind of cysteine-dependent aspartate-specific proteases, and is associated with the initiation and execution of apoptosis. As an activate effector, CASP3 receives the apoptotic signals to perform the cell death process [104]. In our result, a series of genes were enriched in the GO term related apoptosis, which proved process of apoptosis is a key process in oral cancer. In addition, several other biological process GO terms were shown such as transcription and signal transduction. In proliferation, apoptosis and other processes, functional genes were selectively transcribed and expressed. For example, c-Myc is a transcription factor which could mediate a series of down-stream genes express to drive cellular proliferation and apoptosis [105]. It can be seen from S5 Table that putative genes were enriched in several CC GO terms and MF GO terms such as growth factor activity (MF), cytokine activity (MF), enzyme or protein binding (MF), extracellular region (CC), extracellular space (CC) and plasma membrane (CC). These results suggest that the functional activity and localization of the protein are directly or indirectly related to oral cancer.

The DAVID also produced the KEGG pathway enrichment on putative genes (see S5 Table), several pathways were highlighted such as TNF signaling pathway, PI3K-Akt pathway, Pap1 pathway, MAPK pathway, TLR pathway, NF-kB pathway, JaK-STAT, Ras pathway and cytokine-cytokine receptor interaction. TNF pathway could be activated by various signals to affect the immunity, cell growth, apoptosis and other biological behaviors of tumor cells [106110]. Protein-protein interaction and pathways is a cross-talk network. TNF can also activates the NF-kB pathway. The NF-kB pathway is activated in various cancers [111, 112]. Activation of NF-kB can be inhibited by the blockade of PI3K/Akt and ERK pathway to suppress tumor metastasis [113]. It was reported that PI3K/AKT-NF-kB is an axis which promotes bone metastasis in prostate cancer [114]. The Jak/STAT pathway is critical in normal tissues and tumors and the Jak kinase family includes JAK1, JAK2, JAK3 and TYK2 [115]. Jak mediated STAT phosphorylation leads to their nuclear translocation. STAT molecules bind specific promoter of genes and result in the transcription in nucleus, which regulate the cell proliferation, differentiation and apoptosis [115, 116]. It was reported that over-activation of the JAK/STAT pathway is related to subsets of patients with certain solid tumors and chronic myeloid leukemia [117, 118]. MAPK pathway is induced by activation of TLR (toll like receptors) and NOD (nucleotide-binding oligomerization domain receptors), which initiates of inflammation and are involved in cancer proliferation and control [119]. Ras pathway is a highly conserved pathway in cell function, including cell proliferation, differentiation and signaling transduction. This pathway is commonly deregulated in cancers, making the components in the pathway as targets for therapeutic interventions [120]. In these pathways, several putative genes were enriched, which indicates these pathway and gene need more attention in tumorigenesis of oral cancer.

Conclusions

In this study, we investigated genes related to oral cancer. Two popular algorithms, the random walk with a restart algorithm and the shortest path algorithm, which are often used to identify novel disease genes, were integrated with some further rules to build two computational methods. An integrated method was further built by combining these two methods. To access an optimal prediction method for the identification of genes related to oral cancer using the integrated method, these two methods were trained on validated genes. The optimal prediction method was further adopted to identify novel genes related to oral cancer. The results indicated the following facts: (1) The integrated method is effective for identification of disease genes of oral cancer; (2) Candidate genes produced by the integrated method provide an opportunity to achieve a more extensive investigation of oral cancer. We hope that the integrated method can be useful for identifying novel disease genes. In view of the utility of the RWR-based and SP-based methods for identification of disease genes, it is hopeful that these methods can be applied to investigate other problems, such as DNA-binding protein prediction [121], protein fold recognition [122, 123], detection of tubule boundary [124], etc.

Supporting information

S1 File. Detailed analysis of the putative genes obtained by RWR-based method or SP-based method.

(DOCX)

S1 Table. 202 genes related to oral cancer and their sources.

(DOCX)

S2 Table. The performance of the RWR-based method with different combinations of parameters.

(DOCX)

S3 Table. The performance of the SP-based method with different combinations of parameters.

(DOCX)

S4 Table. Genes identified by the optimal RWR-based method.

(DOCX)

S5 Table. The enrichment analysis of putative genes yielded by DAVID.

(XLSX)

Acknowledgments

This study was supported by the National Natural Science Foundation of China (31371335), Natural Science Foundation of Shanghai (17ZR1412500), Shanghai Sailing Program and The Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) (2016245).

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

This study was supported by the National Natural Science Foundation of China (31371335), Natural Science Foundation of Shanghai (17ZR1412500), Shanghai Sailing Program and The Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) (2016245).

References

  • 1.Tsantoulis PK, Kastrinakis NG, Tourvas AD, Laskaris G, Gorgoulis VG. Advances in the biology of oral cancer. Oral Oncol. 2007;43(6):523–34. 10.1016/j.oraloncology.2006.11.010 [DOI] [PubMed] [Google Scholar]
  • 2.Shridhar K, Walia GK, Aggarwal A, Gulati S, Geetha AV, Prabhakaran D, et al. DNA methylation markers for oral pre-cancer progression: A critical review. Oral Oncol. 2016;53:1–9. 10.1016/j.oraloncology.2015.11.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Bray F, Ren JS, Masuyer E, Ferlay J. Global estimates of cancer prevalence for 27 sites in the adult population in 2008. International journal of cancer Journal international du cancer. 2013;132(5):1133–45. 10.1002/ijc.27711 [DOI] [PubMed] [Google Scholar]
  • 4.Noda M. International Agency for Research on Cancer (http://www.iarc.fr/). Japanese journal of clinical oncology. 1999;29(11):592 [DOI] [PubMed] [Google Scholar]
  • 5.Cancer incidence in five continents. Volume IX. IARC Sci Publ. 2008;(160):1–837. [PubMed] [Google Scholar]
  • 6.Warnakulasuriya S, Sutherland G, Scully C. Tobacco, oral cancer, and treatment of dependence. Oral Oncol. 2005;41(3):244–60. 10.1016/j.oraloncology.2004.08.010 [DOI] [PubMed] [Google Scholar]
  • 7.Lucenteforte E, Garavello W, Bosetti C, La Vecchia C. Dietary factors and oral and pharyngeal cancer risk. Oral Oncol. 2009;45(6):461–7. 10.1016/j.oraloncology.2008.09.002 [DOI] [PubMed] [Google Scholar]
  • 8.Pavia M, Pileggi C, Nobile CG, Angelillo IF. Association between fruit and vegetable consumption and oral cancer: a meta-analysis of observational studies. The American journal of clinical nutrition. 2006;83(5):1126–34. [DOI] [PubMed] [Google Scholar]
  • 9.Fearon ER, Vogelstein B. A genetic model for colorectal tumorigenesis. Cell. 1990;61(5):759–67. [DOI] [PubMed] [Google Scholar]
  • 10.Nagatsuka H, Ishiwari Y, Tsujigiwa H, Nakano K, Nagai N. Quantitation of epidermal growth factor receptor gene amplification by competitive polymerase chain reaction in pre-malignant and malignant oral epithelial lesions. Oral Oncol. 2001;37(7):599–604. [DOI] [PubMed] [Google Scholar]
  • 11.Ishitoya J, Toriyama M, Oguchi N, Kitamura K, Ohshima M, Asano K, et al. Gene amplification and overexpression of EGF receptor in squamous cell carcinomas of the head and neck. British journal of cancer. 1989;59(4):559–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Shiraki M, Odajima T, Ikeda T, Sasaki A, Satoh M, Yamaguchi A, et al. Combined expression of p53, cyclin D1 and epidermal growth factor receptor improves estimation of prognosis in curatively resected oral cancer. Modern pathology: an official journal of the United States and Canadian Academy of Pathology, Inc. 2005;18(11):1482–9. [DOI] [PubMed] [Google Scholar]
  • 13.Xia W, Lau YK, Zhang HZ, Liu AR, Li L, Kiyokawa N, et al. Strong correlation between c-erbB-2 overexpression and overall survival of patients with oral squamous cell carcinoma. Clin Cancer Res. 1997;3(1):3–9. [PubMed] [Google Scholar]
  • 14.Werkmeister R, Brandt B, Joos U. Clinical relevance of erbB-1 and -2 oncogenes in oral carcinomas. Oral Oncol. 2000;36(1):100–5. [DOI] [PubMed] [Google Scholar]
  • 15.Miyamoto R, Uzawa N, Nagaoka S, Hirata Y, Amagasa T. Prognostic significance of cyclin D1 amplification and overexpression in oral squamous cell carcinomas. Oral Oncol. 2003;39(6):610–8. [DOI] [PubMed] [Google Scholar]
  • 16.Koontongkaew S, Chareonkitkajorn L, Chanvitan A, Leelakriangsak M, Amornphimoltham P. Alterations of p53, pRb, cyclin D(1) and cdk4 in human oral and pharyngeal squamous cell carcinomas. Oral Oncol. 2000;36(4):334–9. [DOI] [PubMed] [Google Scholar]
  • 17.Huang MF, Chang YC, Liao PS, Huang TH, Tsay CH, Chou MY. Loss of heterozygosity of p53 gene of oral cancer detected by exfoliative cytology. Oral Oncol. 1999;35(3):296–301. [DOI] [PubMed] [Google Scholar]
  • 18.Largey JS, Meltzer SJ, Yin J, Norris K, Sauk JJ, Archibald DW. Loss of heterozygosity of p53 in oral cancers demonstrated by the polymerase chain reaction. Cancer. 1993;71(6):1933–7. [DOI] [PubMed] [Google Scholar]
  • 19.Sakata K. Alterations of tumor suppressor genes and the H-ras oncogene in oral squamous cell carcinoma. Journal of oral pathology & medicine: official publication of the International Association of Oral Pathologists and the American Academy of Oral Pathology. 1996;25(6):302–7. [DOI] [PubMed] [Google Scholar]
  • 20.Shang ZJ, Li ZB, Li JR. VEGF is up-regulated by hypoxic stimulation and related to tumour angiogenesis and severity of disease in oral squamous cell carcinoma: in vitro and in vivo studies. Int J Oral Maxillofac Surg. 2006;35(6):533–8. 10.1016/j.ijom.2005.09.006 [DOI] [PubMed] [Google Scholar]
  • 21.Patel BP, Shah PM, Rawal UM, Desai AA, Shah SV, Rawal RM, et al. Activation of MMP-2 and MMP-9 in patients with oral squamous cell carcinoma. J Surg Oncol. 2005;90(2):81–8. 10.1002/jso.20240 [DOI] [PubMed] [Google Scholar]
  • 22.Zeng X, Liao Y, Liu Y, Zou Q. Prediction and validation of disease genes using HeteSim Scores. IEEE/ACM Trans Comput Biol Bioinform. 2016. [DOI] [PubMed] [Google Scholar]
  • 23.Chen X, Perez-Jimenez MJ, Valencia-Cabrera L, Wang BZ, Zeng XX. Computing with viruses. Theoretical Computer Science. 2016;623:146–59. [Google Scholar]
  • 24.Zou Q, Wan S, Ju Y, Tang J, Zeng X. Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy. BMC Syst Biol. 2016;10(Suppl 4):114 10.1186/s12918-016-0353-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Gu B, Sheng VS, Tay KY, Romano W, Li S. Incremental Support Vector Learning for Ordinal Regression. Ieee T Neur Net Lear. 2015;26(7):1403–16. [DOI] [PubMed] [Google Scholar]
  • 26.Gu B, Sheng VS, Li S. Bi-parameter space partition for cost-sensitive SVM. Proceedings of the 24th International Conference on Artificial Intelligence; Buenos Aires, Argentina. 2832741: AAAI Press; 2015. p. 3532–9.
  • 27.Tang W, Liao Z, Zou Q. Which statistical significance test best detects oncomiRNAs in cancer tissues? An exploratory analysis. Oncotarget. 2016;7(51):85613–23. 10.18632/oncotarget.12828 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zhang JR, Ju Y, Lu HJ, Xuan P, Zou Q. Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology. Int J Genomics. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Oliver S. Guilt-by-association goes global. Nature. 2000;403(6770):601–3. 10.1038/35001165 [DOI] [PubMed] [Google Scholar]
  • 30.Kohler S, Bauer S, Horn D, Robinson PN. Walking the interactome for prioritization of candidate disease genes. The Amerian Journal of Human Genetics. 2008;82(4):949–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Navlakha S, Kingsford C. The power of protein interaction networks for associating genes with diseases. Bioinformatics. 2010;26(8):1057–63. 10.1093/bioinformatics/btq076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Care MA, Bradford JR, Needham CJ, Bulpitt AJ, Westhead DR. Combining the interactome and deleterious SNP predictions to improve disease gene identification. Hum Mutat. 2009;30(3):485–92. 10.1002/humu.20917 [DOI] [PubMed] [Google Scholar]
  • 33.Chen L, Xing Z, Huang T, Shu Y, Huang G, Li H-P. Application of the shortest path algorithm for the discovery of breast cancer related genes. Current Bioinformatics. 2016;11(1):51–8. [Google Scholar]
  • 34.Gao Y-F, Shu Y, Yang L, He Y-C, Li L-P, Huang G, et al. A Graphic Method for Identification of Novel Glioma Related Genes. BioMed Research International. 2014;2014:891945 10.1155/2014/891945 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhang J, Yang J, Huang T, Shu Y, Chen L. Identification of novel proliferative diabetic retinopathy related genes on protein-protein interaction network. Neurocomputing. 2016;217:63–72. [Google Scholar]
  • 36.Chen L, Wang B, Wang S, Yang J, Hu J, Xie Z, et al. OPMSP: A computational method integrating protein interaction and sequence information for the identification of novel putative oncogenes. Protein Pept Lett. 2016;23(12):1081–94. [DOI] [PubMed] [Google Scholar]
  • 37.Chen L, Huang T, Zhang YH, Jiang Y, Zheng M, Cai YD. Identification of novel candidate drivers connecting different dysfunctional levels for lung adenocarcinoma using protein-protein interactions and a shortest path approach. Sci Rep. 2016;6:29849 Epub 2016/07/15. 10.1038/srep29849 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Chen L, Yang J, Huang T, Kong X, Lu L, Cai Y-D. Mining for novel tumor suppressor genes using a shortest path approach. Journal of Biomolecular Structure and Dynamics. 2016;34(3):664–75. 10.1080/07391102.2015.1042915 [DOI] [PubMed] [Google Scholar]
  • 39.Cai Y-D, Zhang Q, Zhang Y-H, Chen L, Huang T. Identification of genes associated with breast cancer metastasis to bone on a protein-protein interaction network with a shortest path algorithm. J Proteome Res. 2017;16(2):1027–38. 10.1021/acs.jproteome.6b00950 [DOI] [PubMed] [Google Scholar]
  • 40.Chen L, Zhang YH, Huang T, Cai YD. Identifying novel protein phenotype annotations by hybridizing protein-protein interactions and protein sequence similarities. Molecular Genetics and Genomics. 2016;291(2):913–34. 10.1007/s00438-015-1157-9 [DOI] [PubMed] [Google Scholar]
  • 41.The UniProt Consortium. Update on activities at the Universal Protein Resource (UniProt) in 2013. Nucleic Acids Res. 2013;41(Database issue):D43–7. Epub 2012/11/20. 10.1093/nar/gks1068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Franceschini A, Szklarczyk D, Frankild S, Kuhn M, Simonovic M, Roth A, et al. STRING v9. 1: protein-protein interaction networks, with increased coverage and integration. Nucleic acids research. 2013;41(D1):D808–D15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Ng KL, Ciou JS, Huang CH. Prediction of protein functions based on function-function correlation relations. Comput Biol Med. 2010;40(3):300–5. 10.1016/j.compbiomed.2010.01.001 [DOI] [PubMed] [Google Scholar]
  • 44.Hu LL, Huang T, Shi X, Lu WC, Cai YD, Chou KC. Predicting functions of proteins in mouse based on weighted protein-protein interaction network and protein hybrid properties. PLoS ONE. 2011;6(1):e14556 10.1371/journal.pone.0014556 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gormen TH, Leiserson CE, Rivest RL, Stein C, editors. Introduction to algorithms: MIT press; Cambridge, MA; 1990. [Google Scholar]
  • 46.Chen L, Zeng W-M, Cai Y-D, Feng K-Y, Chou K-C. Predicting Anatomical Therapeutic Chemical (ATC) Classification of Drugs by Integrating Chemical-Chemical Interactions and Similarities. PLoS ONE. 2012;7(4):e35254 10.1371/journal.pone.0035254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wei L, Xing P, Shi G, Ji ZL, Zou Q. Fast prediction of protein methylation sites using a sequence-based feature selection technique. IEEE/ACM Trans Comput Biol Bioinform. 2017. [DOI] [PubMed] [Google Scholar]
  • 48.Wei L, Xing P, Tang J, Zou Q. PhosPred-RF: a novel sequence-based predictor for phosphorylation sites using sequential information only. IEEE Trans Nanobioscience. 2017. [DOI] [PubMed] [Google Scholar]
  • 49.Kohavi R, editor A study of cross-validation and bootstrap for accuracy estimation and model selection. International joint Conference on artificial intelligence; 1995: Lawrence Erlbaum Associates Ltd.
  • 50.Akira S, Uematsu S, Takeuchi O. Pathogen recognition and innate immunity. Cell. 2006;124(4):783–801. 10.1016/j.cell.2006.02.015 [DOI] [PubMed] [Google Scholar]
  • 51.Szajnik M, Szczepanski MJ, Czystowska M, Elishaev E, Mandapathil M, Nowak-Markwitz E, et al. TLR4 signaling induced by lipopolysaccharide or paclitaxel regulates tumor survival and chemoresistance in ovarian cancer. Oncogene. 2009;28(49):4353–63. 10.1038/onc.2009.289 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Szczepanski MJ, Czystowska M, Szajnik M, Harasymczuk M, Boyiadzis M, Kruk-Zagajewska A, et al. Triggering of Toll-like receptor 4 expressed on human head and neck squamous cell carcinoma promotes tumor development and protects the tumor from immune attack. Cancer Res. 2009;69(7):3105–13. 10.1158/0008-5472.CAN-08-3838 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Tucci M, Stucci S, Passarelli A, Giudice G, Dammacco F, Silvestris F. The immune escape in melanoma: role of the impaired dendritic cell function. Expert review of clinical immunology. 2014;10(10):1395–404. 10.1586/1744666X.2014.955851 [DOI] [PubMed] [Google Scholar]
  • 54.Cancer Genome Atlas Research N. Comprehensive genomic characterization of squamous cell lung cancers. Nature. 2012;489(7417):519–25. 10.1038/nature11404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Dutt A, Salvesen HB, Chen TH, Ramos AH, Onofrio RC, Hatton C, et al. Drug-sensitive FGFR2 mutations in endometrial carcinoma. Proc Natl Acad Sci U S A. 2008;105(25):8713–7. 10.1073/pnas.0803379105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Byron SA, Gartside MG, Wellens CL, Mallon MA, Keenan JB, Powell MA, et al. Inhibition of activated fibroblast growth factor receptor 2 in endometrial cancer cells induces cell death despite PTEN abrogation. Cancer Res. 2008;68(17):6902–7. 10.1158/0008-5472.CAN-08-0770 [DOI] [PubMed] [Google Scholar]
  • 57.Gozgit JM, Squillace RM, Wongchenko MJ, Miller D, Wardwell S, Mohemmad Q, et al. Combined targeting of FGFR2 and mTOR by ponatinib and ridaforolimus results in synergistic antitumor activity in FGFR2 mutant endometrial cancer models. Cancer Chemother Pharmacol. 2013;71(5):1315–23. 10.1007/s00280-013-2131-z [DOI] [PubMed] [Google Scholar]
  • 58.Liao RG, Jung J, Tchaicha J, Wilkerson MD, Sivachenko A, Beauchamp EM, et al. Inhibitor-sensitive FGFR2 and FGFR3 mutations in lung squamous cell carcinoma. Cancer Res. 2013;73(16):5195–205. 10.1158/0008-5472.CAN-12-3950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Zaharieva BM, Simon R, Diener PA, Ackermann D, Maurer R, Alund G, et al. High-throughput tissue microarray analysis of 11q13 gene amplification (CCND1, FGF3, FGF4, EMS1) in urinary bladder cancer. The Journal of pathology. 2003;201(4):603–8. 10.1002/path.1481 [DOI] [PubMed] [Google Scholar]
  • 60.Sugahara K, Michikawa Y, Ishikawa K, Shoji Y, Iwakawa M, Shibahara T, et al. Combination effects of distinct cores in 11q13 amplification region on cervical lymph node metastasis of oral squamous cell carcinoma. International journal of oncology. 2011;39(4):761–9. 10.3892/ijo.2011.1094 [DOI] [PubMed] [Google Scholar]
  • 61.Karin M, Greten FR. NF-kappaB: linking inflammation and immunity to cancer development and progression. Nat Rev Immunol. 2005;5(10):749–59. Epub 2005/09/22. 10.1038/nri1703 [DOI] [PubMed] [Google Scholar]
  • 62.Pikarsky E, Porat RM, Stein I, Abramovitch R, Amit S, Kasem S, et al. NF-kappaB functions as a tumour promoter in inflammation-associated cancer. Nature. 2004;431(7007):461–6. 10.1038/nature02924 [DOI] [PubMed] [Google Scholar]
  • 63.Bu H, Rosdahl I, Sun XF, Zhang H. Importance of polymorphisms in NF-kappaB1 and NF-kappaBIalpha genes for melanoma risk, clinicopathological features and tumor progression in Swedish melanoma patients. J Cancer Res Clin Oncol. 2007;133(11):859–66. 10.1007/s00432-007-0228-7 [DOI] [PubMed] [Google Scholar]
  • 64.Zhang P, Wei Q, Li X, Wang K, Zeng H, Bu H, et al. A functional insertion/deletion polymorphism in the promoter region of the NFKB1 gene increases susceptibility for prostate cancer. Cancer Genet Cytogenet. 2009;191(2):73–7. 10.1016/j.cancergencyto.2009.01.017 [DOI] [PubMed] [Google Scholar]
  • 65.Lo SS, Chen JH, Wu CW, Lui WY. Functional polymorphism of NFKB1 promoter may correlate to the susceptibility of gastric cancer in aged patients. Surgery. 2009;145(3):280–5. 10.1016/j.surg.2008.11.001 [DOI] [PubMed] [Google Scholar]
  • 66.Lin CW, Hsieh YS, Hsin CH, Su CW, Lin CH, Wei LH, et al. Effects of NFKB1 and NFKBIA gene polymorphisms on susceptibility to environmental factors and the clinicopathologic development of oral cancer. PLoS One. 2012;7(4):e35078 10.1371/journal.pone.0035078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Bevilacqua MP, Nelson RM. Selectins. J Clin Invest. 1993;91(2):379–87. 10.1172/JCI116210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lauri D, Needham L, Martin-Padura I, Dejana E. Tumor cell adhesion to endothelial cells: endothelial leukocyte adhesion molecule-1 as an inducible adhesive receptor specific for colon carcinoma cells. Journal of the National Cancer Institute. 1991;83(18):1321–4. [DOI] [PubMed] [Google Scholar]
  • 69.Wenzel CT, Scher RL, Richtsmeier WJ. Adhesion of head and neck squamous cell carcinoma to endothelial cells. The missing links. Archives of otolaryngology—head & neck surgery. 1995;121(11):1279–86. [DOI] [PubMed] [Google Scholar]
  • 70.Chang PY, Kuo YB, Wu TL, Liao CT, Sun YC, Yen TC, et al. Association and prognostic value of serum inflammation markers in patients with leukoplakia and oral cavity cancer. Clinical chemistry and laboratory medicine: CCLM / FESCC. 2013;51(6):1291–300. [DOI] [PubMed] [Google Scholar]
  • 71.Lahav J. The functions of thrombospondin and its involvement in physiology and pathophysiology. Biochim Biophys Acta. 1993;1182(1):1–14. [DOI] [PubMed] [Google Scholar]
  • 72.Yao L, Zhao YL, Itoh S, Wada S, Yue L, Furuta I. Thrombospondin-1 expression in oral squamous cell carcinomas: correlations with tumor vascularity, clinicopathological features and survival. Oral Oncol. 2000;36(6):539–44. [DOI] [PubMed] [Google Scholar]
  • 73.Pal SK, Nguyen CT, Morita KI, Miki Y, Kayamori K, Yamaguchi A, et al. THBS1 is induced by TGFB1 in the cancer stroma and promotes invasion of oral squamous cell carcinoma. Journal of oral pathology & medicine: official publication of the International Association of Oral Pathologists and the American Academy of Oral Pathology. 2016. [DOI] [PubMed] [Google Scholar]
  • 74.Durbin JE, Hackenmiller R, Simon MC, Levy DE. Targeted disruption of the mouse Stat1 gene results in compromised innate immunity to viral disease. Cell. 1996;84(3):443–50. [DOI] [PubMed] [Google Scholar]
  • 75.Xi S, Dyer KF, Kimak M, Zhang Q, Gooding WE, Chaillet JR, et al. Decreased STAT1 expression by promoter methylation in squamous cell carcinogenesis. Journal of the National Cancer Institute. 2006;98(3):181–9. 10.1093/jnci/djj020 [DOI] [PubMed] [Google Scholar]
  • 76.Laimer K, Spizzo G, Obrist P, Gastl G, Brunhuber T, Schafer G, et al. STAT1 activation in squamous cell cancer of the oral cavity: a potential predictive marker of response to adjuvant chemotherapy. Cancer. 2007;110(2):326–33. 10.1002/cncr.22813 [DOI] [PubMed] [Google Scholar]
  • 77.Haroun F, Al-Shaar L, Habib RH, El-Saghir N, Tfayli A, Bazarbachi A, et al. Effects of CYP2B6 genetic polymorphisms in patients receiving cyclophosphamide combination chemotherapy for breast cancer. Cancer Chemother Pharmacol. 2015;75(1):207–14. 10.1007/s00280-014-2632-4 [DOI] [PubMed] [Google Scholar]
  • 78.Li Y, Zhao W, Zhao Z, Wu J, Chen L, Ma Y, et al. IL1B gene polymorphisms, age and the risk of non-small cell lung cancer in a Chinese population. Lung Cancer. 2015;89(3):232–7. 10.1016/j.lungcan.2015.06.009 [DOI] [PubMed] [Google Scholar]
  • 79.Zeigler-Johnson C, Friebel T, Walker AH, Wang Y, Spangler E, Panossian S, et al. CYP3A4, CYP3A5, and CYP3A43 genotypes and haplotypes in the etiology and severity of prostate cancer. Cancer Res. 2004;64(22):8461–7. 10.1158/0008-5472.CAN-04-1651 [DOI] [PubMed] [Google Scholar]
  • 80.Chen L, Huang Z, Yao G, Lyu X, Li J, Hu X, et al. The expression of CXCL13 and its relation to unfavorable clinical characteristics in young breast cancer. J Transl Med. 2015;13:168 10.1186/s12967-015-0521-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Walser TC, Ma X, Kundu N, Dorsey R, Goloubeva O, Fulton AM. Immune-mediated modulation of breast cancer growth and metastasis by the chemokine Mig (CXCL9) in a murine model. Journal of immunotherapy. 2007;30(5):490–8. 10.1097/CJI.0b013e318031b551 [DOI] [PubMed] [Google Scholar]
  • 82.Walser TC, Rifat S, Ma X, Kundu N, Ward C, Goloubeva O, et al. Antagonism of CXCR3 inhibits lung metastasis in a murine model of metastatic breast cancer. Cancer Res. 2006;66(15):7701–7. 10.1158/0008-5472.CAN-06-0709 [DOI] [PubMed] [Google Scholar]
  • 83.Kawada K, Hosogi H, Sonoshita M, Sakashita H, Manabe T, Shimahara Y, et al. Chemokine receptor CXCR3 promotes colon cancer metastasis to lymph nodes. Oncogene. 2007;26(32):4679–88. 10.1038/sj.onc.1210267 [DOI] [PubMed] [Google Scholar]
  • 84.Ma X, Norsworthy K, Kundu N, Rodgers WH, Gimotty PA, Goloubeva O, et al. CXCR3 expression is associated with poor survival in breast cancer and promotes metastasis in a murine model. Mol Cancer Ther. 2009;8(3):490–8. 10.1158/1535-7163.MCT-08-0485 [DOI] [PubMed] [Google Scholar]
  • 85.Cambien B, Karimdjee BF, Richard-Fiardo P, Bziouech H, Barthel R, Millet MA, et al. Organ-specific inhibition of metastatic colon carcinoma by CXCR3 antagonism. British journal of cancer. 2009;100(11):1755–64. 10.1038/sj.bjc.6605078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Pradelli E, Karimdjee-Soilihi B, Michiels JF, Ricci JE, Millet MA, Vandenbos F, et al. Antagonism of chemokine receptor CXCR3 inhibits osteosarcoma metastasis to lungs. International journal of cancer Journal international du cancer. 2009;125(11):2586–94. 10.1002/ijc.24665 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. Epub 2009/01/10. 10.1038/nprot.2008.211 [DOI] [PubMed] [Google Scholar]
  • 88.Kim BK, Lee JW, Park PJ, Shin YS, Lee WY, Lee KA, et al. The multiplex bead array approach to identifying serum biomarkers associated with breast cancer. Breast cancer research: BCR. 2009;11(2):R22 10.1186/bcr2247 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Key TJ, Appleby PN, Reeves GK, Roddam AW, Breast TEH. Insulin-like growth factor 1 (IGF1), IGF binding protein 3 (IGFBP3), and breast cancer risk: pooled individual data analysis of 17 prospective studies. Lancet Oncology. 2010;11(6):530–42. 10.1016/S1470-2045(10)70095-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Karamouzis MV, Papavassiliou AG. Targeting insulin-like growth factor in breast cancer therapeutics. Crit Rev Oncol Hematol. 2012;84(1):8–17. 10.1016/j.critrevonc.2012.02.010 [DOI] [PubMed] [Google Scholar]
  • 91.Polyak K, Lee MH, Erdjumentbromage H, Koff A, Roberts JM, Tempst P, et al. Cloning of P27(Kip1), a Cyclin-Dependent Kinase Inhibitor and a Potential Mediator of Extracellular Antimitogenic Signals. Cell. 1994;78(1):59–66. [DOI] [PubMed] [Google Scholar]
  • 92.Chang BL, Zheng SQL, Isaacs SD, Wiley KE, Turner A, Li G, et al. A polymorphism in the CDKN1B gene is associated with increased risk of hereditary prostate cancer. Cancer Research. 2004;64(6):1997–9. [DOI] [PubMed] [Google Scholar]
  • 93.Lang T, Nie YL. MiR-148a participates in the growth of RPMI8226 multiple myeloma cells by regulating CDKN1B. Biomedicine & Pharmacotherapy. 2016;84:1967–71. [DOI] [PubMed] [Google Scholar]
  • 94.Nanda MS, Shah ZA, Sameer AS, Syeed N, Murtaza I, Siddiqi MA, et al. TP53-Molecular Soldier's Mutations in Bladder Cancer in the Kashmiri Population. Asian Pacific Journal of Cancer Prevention. 2011;12(1):67–72. [PubMed] [Google Scholar]
  • 95.Cheng LA, Zhang SB, MacLennan GT, Williamson SR, Lopez-Beltran A, Montironi R. Bladder cancer: translating molecular genetic insights into clinical practice. Human Pathology. 2011;42(4):455–81. 10.1016/j.humpath.2010.07.007 [DOI] [PubMed] [Google Scholar]
  • 96.Levine AJ, Oren M. The first 30 years of p53: growing ever more complex. Nat Rev Cancer. 2009;9(10):749–58. 10.1038/nrc2723 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Nishiyama H, Watanabe J, Ogawa O. p53 and chemosensitivity in bladder cancer. Int J Clin Oncol. 2008;13(4):282–6. 10.1007/s10147-008-0815-x [DOI] [PubMed] [Google Scholar]
  • 98.Wang SJ, Gu W. To be, or not to be: functional dilemma of p53 metabolic regulation. Curr Opin Oncol. 2014;26(1):78–85. 10.1097/CCO.0000000000000024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Evan GI, Wyllie AH, Gilbert CS, Littlewood TD, Land H, Brooks M, et al. Induction of Apoptosis in Fibroblasts by C-Myc Protein. Cell. 1992;69(1):119–28. [DOI] [PubMed] [Google Scholar]
  • 100.Bissonnette RP, Echeverri F, Mahboubi A, Green DR. Apoptotic cell death induced by c-myc is inhibited by bcl-2. Nature. 1992;359(6395):552–4. 10.1038/359552a0 [DOI] [PubMed] [Google Scholar]
  • 101.Egle A, Harris AW, Bouillet P, Cory S. Bim is a suppressor of Myc-induced mouse B cell leukemia. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(16):6164–9. 10.1073/pnas.0401471101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Murphy DJ, Junttila MR, Pouyet L, Karnezis A, Shchors K, Bui DA, et al. Distinct thresholds govern Myc's biological output in vivo. Cancer Cell. 2008;14(6):447–57. 10.1016/j.ccr.2008.10.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Mitchell KO, Ricci MS, Miyashita T, Dicker DT, Jin ZY, Reed JC, et al. Bax is a transcriptional target and mediator of c-Myc-induced apoptosis. Cancer Research. 2000;60(22):6318–25. [PubMed] [Google Scholar]
  • 104.Nicholson DW, Thornberry NA. Caspases: killer proteases. Trends Biochem Sci. 1997;22(8):299–306. [DOI] [PubMed] [Google Scholar]
  • 105.Dang CV, O'Donnell KA, Zeller KI, Nguyen T, Osthus RC, Li F. The c-Myc target gene network. Seminars in cancer biology. 2006;16(4):253–64. 10.1016/j.semcancer.2006.07.014 [DOI] [PubMed] [Google Scholar]
  • 106.Zhao X, Mohaupt M, Jiang J, Liu S, Li B, Qin Z. Tumor necrosis factor receptor 2-mediated tumor suppression is nitric oxide dependent and involves angiostasis. Cancer Res. 2007;67(9):4443–50. 10.1158/0008-5472.CAN-07-0185 [DOI] [PubMed] [Google Scholar]
  • 107.Qin Z, Kruger-Krasagakes S, Kunzendorf U, Hock H, Diamantstein T, Blankenstein T. Expression of tumor necrosis factor by different tumor cell lines results either in tumor suppression or augmented metastasis. J Exp Med. 1993;178(1):355–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Blankenstein T, Qin ZH, Uberla K, Muller W, Rosen H, Volk HD, et al. Tumor Suppression after Tumor-Cell Targeted Tumor-Necrosis-Factor-Alpha Gene-Transfer. Journal of Experimental Medicine. 1991;173(5):1047–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Oliff A, Defeo-Jones D, Boyer M, Martinez D, Kiefer D, Vuocolo G, et al. Tumors secreting human TNF/cachectin induce cachexia in mice. Cell. 1987;50(4):555–63. [DOI] [PubMed] [Google Scholar]
  • 110.Kulbe H, Thompson R, Wilson JL, Robinson S, Hagemann T, Fatah R, et al. The inflammatory cytokine tumor necrosis factor-alpha generates an autocrine tumor-promoting network in epithelial ovarian cancer cells. Cancer Res. 2007;67(2):585–92. 10.1158/0008-5472.CAN-06-2941 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Ben-Neriah Y, Karin M. Inflammation meets cancer, with NF-kappaB as the matchmaker. Nat Immunol. 2011;12(8):715–23. 10.1038/ni.2060 [DOI] [PubMed] [Google Scholar]
  • 112.DiDonato JA, Mercurio F, Karin M. NF-kappa B and the link between inflammation and cancer. Immunol Rev. 2012;246:379–400. 10.1111/j.1600-065X.2012.01099.x [DOI] [PubMed] [Google Scholar]
  • 113.Kang MH, Oh SC, Lee HJ, Kang HN, Kim JL, Kim JS, et al. Metastatic function of BMP-2 in gastric cancer cells: the role of PI3K/AKT, MAPK, the NF-kappaB pathway, and MMP-9 expression. Exp Cell Res. 2011;317(12):1746–62. 10.1016/j.yexcr.2011.04.006 [DOI] [PubMed] [Google Scholar]
  • 114.Graham TR, Odero-Marah VA, Chung LW, Agrawal KC, Davis R, Abdel-Mageed AB. PI3K/Akt-dependent transcriptional regulation and activation of BMP-2-Smad signaling by NF-kappaB in metastatic prostate cancer cells. The Prostate. 2009;69(2):168–80. 10.1002/pros.20870 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Levine RL, Pardanani A, Tefferi A, Gilliland DG. Role of JAK2 in the pathogenesis and therapy of myeloproliferative disorders. Nat Rev Cancer. 2007;7(9):673–83. 10.1038/nrc2210 [DOI] [PubMed] [Google Scholar]
  • 116.Silva M, Richard C, Benito A, Sanz C, Olalla I, Fernandez-Luna JL. Expression of Bcl-x in erythroid precursors from patients with polycythemia vera. N Engl J Med. 1998;338(9):564–71. 10.1056/NEJM199802263380902 [DOI] [PubMed] [Google Scholar]
  • 117.Sansone P, Bromberg J. Targeting the Interleukin-6/Jak/Stat Pathway in Human Malignancies. Journal of Clinical Oncology. 2012;30(9):1005–14. 10.1200/JCO.2010.31.8907 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Traer E, MacKenzie R, Snead J, Agarwal A, Eiring AM, O'Hare T, et al. Blockade of JAK2-mediated extrinsic survival signals restores sensitivity of CML cells to ABL inhibitors. Leukemia. 2012;26(5):1140–3. 10.1038/leu.2011.325 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Groeger S, Jarzina F, Domann E, Meyle J. Porphyromonas gingivalis activates NFkappaB and MAPK pathways in human oral epithelial cells. BMC Immunol. 2017;18(1):1 10.1186/s12865-016-0185-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.McCubrey JA, Steelman LS, Chappell WH, Abrams SL, Franklin RA, Montalto G, et al. Ras/Raf/MEK/ERK and PI3K/PTEN/Akt/mTOR Cascade Inhibitors: How Mutations Can Result in Therapy Resistance and How to Overcome Resistance. Oncotarget. 2012;3(10):1068–111. 10.18632/oncotarget.659 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Wei L, Tang J, Zou Q. Local-DPP: An improved DNA-binding protein prediction method by exploring local evolutionary information. Inform Sciences. 2017;384:135–44. [Google Scholar]
  • 122.Wei L, Zou Q. Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition. International journal of molecular sciences. 2016;17(12). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Wei L, Liao M, Gao X, Zou Q. Enhanced Protein Fold Prediction Method Through a Novel Feature Extraction Technique. IEEE Trans Nanobioscience. 2015;14(6):649–59. 10.1109/TNB.2015.2450233 [DOI] [PubMed] [Google Scholar]
  • 124.Su R, Zhang C, Pham TD, Davey R, Bischof L, Vallotton P, et al. Detection of tubule boundaries based on circular shortest path and polar-transformation of arbitrary shapes. Journal of microscopy. 2016;264(2):127–42. 10.1111/jmi.12421 [DOI] [PubMed] [Google Scholar]
  • 125.Bromberg J. Stat proteins and oncogenesis. J Clin Invest. 2002;109(9):1139–42. 10.1172/JCI15617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Wu YW, Tang W, Zuo JP. Toll-like receptors: potential targets for lupus treatment. Acta pharmacologica Sinica. 2015;36(12):1395–407. 10.1038/aps.2015.91 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Xiang W, Chao ZY, Feng DY. Role of Toll-like receptor/MYD88 signaling in neurodegenerative diseases. Reviews in the neurosciences. 2015;26(4):407–14. 10.1515/revneuro-2014-0067 [DOI] [PubMed] [Google Scholar]
  • 128.Muller M, Herrath J, Malmstrom V. IL-1R1 is expressed on both Helios(+) and Helios(-)FoxP3(+)CD4(+) T cells in the rheumatic joint. Clinical and Experimental Immunology. 2015;182(1):90–100. 10.1111/cei.12668 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Danielson PB. The cytochrome P450 superfamily: biochemistry, evolution and drug metabolism in humans. Curr Drug Metab. 2002;3(6):561–97. [DOI] [PubMed] [Google Scholar]
  • 130.Haudenschild DR, Palmer SM, Moseley TA, You Z, Reddi AH. Bone morphogenetic protein (BMP)-6 signaling and BMP antagonist noggin in prostate cancer. Cancer Res. 2004;64(22):8276–84. 10.1158/0008-5472.CAN-04-2251 [DOI] [PubMed] [Google Scholar]
  • 131.Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694–700. 10.1074/jbc.M601252200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Zhou H, Wu J, Wang T, Zhang X, Liu D. CXCL10/CXCR3 axis promotes the invasion of gastric cancer via PI3K/AKT pathway-dependent MMPs production. Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie. 2016;82:479–88. [DOI] [PubMed] [Google Scholar]
  • 133.Pranzatelli MR, Tate ED, McGee NR, Travelstead AL, Verhulst SJ, Ransohoff RM. Expression of CXCR3 and its ligands CXCL9, -10 and -11 in paediatric opsoclonus-myoclonus syndrome. Clin Exp Immunol. 2013;172(3):427–36. 10.1111/cei.12065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Baenziger NL, Brodie GN, Majerus PW. Isolation and properties of a thrombin-sensitive protein of human platelets. J Biol Chem. 1972;247(9):2723–31. [PubMed] [Google Scholar]
  • 135.Bornstein P. Diversity of function is inherent in matricellular proteins: an appraisal of thrombospondin 1. The Journal of cell biology. 1995;130(3):503–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Labib K, Tercero JA, Diffley JF. Uninterrupted MCM2-7 function required for DNA replication fork progression. Science. 2000;288(5471):1643–7. [DOI] [PubMed] [Google Scholar]
  • 137.Elcin AE, Parmaksiz M, Dogan A, Seker S, Durkut S, Dalva K, et al. Differential gene expression profiling of human adipose stem cells differentiating into smooth muscle-like cells by TGFbeta1/BMP4. Exp Cell Res. 2017. [DOI] [PubMed] [Google Scholar]
  • 138.Vallinoto AC, Graca ES, Araujo MS, Azevedo VN, Cayres-Vallinoto I, Machado LF, et al. IFNG +874T/A polymorphism and cytokine plasma levels are associated with susceptibility to Mycobacterium tuberculosis infection and clinical manifestation of tuberculosis. Human immunology. 2010;71(7):692–6. 10.1016/j.humimm.2010.03.008 [DOI] [PubMed] [Google Scholar]
  • 139.Elgizouli M, Lowe DM, Speckmann C, Schubert D, Hulsdunker J, Eskandarian Z, et al. Activating PI3Kdelta mutations in a cohort of 669 patients with primary immunodeficiency. Clin Exp Immunol. 2016;183(2):221–9. 10.1111/cei.12706 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Wang G, Wang H, Zhang C, Liu T, Li Q, Lin X, et al. Rac3 regulates cell proliferation through cell cycle pathway and predicts prognosis in lung adenocarcinoma. Tumour Biol. 2016;37(9):12597–607. 10.1007/s13277-016-5126-7 [DOI] [PubMed] [Google Scholar]
  • 141.Haataja L, Kaartinen V, Groffen J, Heisterkamp N. The small GTPase Rac3 interacts with the integrin-binding protein CIB and promotes integrin alpha(IIb)beta(3)-mediated adhesion and spreading. J Biol Chem. 2002;277(10):8321–8. 10.1074/jbc.M105363200 [DOI] [PubMed] [Google Scholar]
  • 142.Mira JP, Benard V, Groffen J, Sanders LC, Knaus UG. Endogenous, hyperactive Rac3 controls proliferation of breast cancer cells by a p21-activated kinase-dependent pathway. Proc Natl Acad Sci U S A. 2000;97(1):185–9. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Detailed analysis of the putative genes obtained by RWR-based method or SP-based method.

(DOCX)

S1 Table. 202 genes related to oral cancer and their sources.

(DOCX)

S2 Table. The performance of the RWR-based method with different combinations of parameters.

(DOCX)

S3 Table. The performance of the SP-based method with different combinations of parameters.

(DOCX)

S4 Table. Genes identified by the optimal RWR-based method.

(DOCX)

S5 Table. The enrichment analysis of putative genes yielded by DAVID.

(XLSX)

Data Availability Statement

All relevant data are within the paper and its Supporting Information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES