Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2018 Oct 25;8:15813. doi: 10.1038/s41598-018-34180-6

Prediction of potential disease-associated microRNAs by composite network based inference

Bin-Sheng He 1, Jia Qu 2,, Min Chen 3,
PMCID: PMC6202421  PMID: 30361693

Abstract

MicroRNAs (miRNAs) act a significant role in multiple biological processes and their associations with the development of all kinds of complex diseases are much close. In the research area of biology, medicine, and bioinformatics, prediction of potential miRNA-disease associations (MDAs) on the base of a variety of heterogeneous biological datasets in a short time is an important subject. Therefore, we proposed the model of Composite Network based inference for MiRNA-Disease Association prediction (CNMDA) through applying random walk to a multi-level composite network constructed by heterogeneous dataset of disease, long noncoding RNA (lncRNA) and miRNA. The results showed that CNMDA achieved an AUC of 0.8547 in leave-one-out cross validation and an AUC of 0.8533+/−0.0009 in 5-fold cross validation. In addition, we employed CNMDA to infer novel miRNAs for kidney neoplasms, breast neoplasms and lung neoplasms on the base of HMDD v2.0. Also, we employed the approach for lung neoplasms on the base of HMDD v1.0 and for breast neoplasms that have no known related miRNAs. It was found that CNMDA could be seen as an applicable tool for potential MDAs prediction.

Introduction

MicroRNAs (miRNAs) is a kind of short noncoding RNA (ncRNA) molecules with about 22 nucleotides in length which can regulate complementary messenger RNAs1. Unlike the miRNAs, long noncoding RNAs (lncRNAs) are a sort of heterogeneous ncRNAs with about 200 nucleotides and usually show less sequence conservation. Accumulating evidence indicates that miRNAs are participated in a wide variety of life process of cells, such as proliferation2, development3, aging4, viral infection5, metabolism4,6 and so on5,7. It is no surprise that miRNAs are closely related to a number of clinically important diseases8,9. For example, miR-335 and miR-126 were proved to be metastasis suppressor miRNAs in human breast cancer10. In addition, previous study also confirmed that the differential expression of miR-21, -31, -143 and -145 is closely participate in clinic pathologic features of colorectal cancer11. Therefore, identification of disease-related miRNAs would be beneficial for disease diagnosis, treatment, and prevention12. Currently, unlike traditional time-consuming biological experiments, adopting validation to the predicted miRNA-disease associations (MDAs) obtained from calculation models could reduce a lot of time and cost. Therefore, it is very significant to propose effective calculation models to infer potential MDAs1317.

According to the idea that miRNAs with similar functions are usually relevant to similar diseases and the reverse is also true. some researchers built elaborate computational models for the identification of potential MDAs on the basis of known MDAs in databases only. For example, Li et al.18 developed a computational approach based on matrix completion, in which the adjacency matrix constructed from known MDAs was updated to gain final association scores of each miRNA-disease pair. Considering various types of known MDAs, Chen et al.19 constructed an restricted boltzmann machine (RBM) model to further predict four kinds of MDAs.

Based on the information of known MDAs and the corresponding similarity information of diseases and miRNAs, Chen et al.20 developed an effective method via combining all those information to construct a heterogeneous graph and then further inferred MDAs with the consideration of paths between miRNA nodes and disease nodes. Besides, this method could also be implemented to predict for new diseases (miRNAs). Through integrating the distribution information of k most similar neighbors per miRNA and the corresponding functional similarity between the miRNA and its neighbors, Xuan et al.21 proposed a reliable computational approach to infer novel MDAs. However, HDMP cannot predict disease-related miRNAs for new diseases. After computing miRNAs functional similarity (MFS), Xuan et al.22 proposed a prediction model via implementing random walk on constructed miRNA functional similarity network in which they assigned larger transition weights to marked nodes. At last, probability association scores of each disease-miRNA pair would be obtained and ranked. A calculation model was further built by Chen et al.17 in which miRNA’s k-nearest-neighbors (KNNs) and disease’s KNNs were respectively searched and then these KNNs would be ranked according to support vector machine. After that, they finally got all potential MDAs with weighted voting. Under the framework of semi-supervised learning, a novel model23 was presented for MDAs prediction via combining the optimal solutions in the miRNA space and disease space. Recently, Chen et al.24 proposed another prediction model through calculating within-score and between-score for both miRNAs and diseases which were then combined to obtain the final MDA scores.

Also, researchers put forward some other calculation approaches via considering relevant genes or proteins as a bridge to predict novel MDAs. For example, using a discrete probability distribution of hypergeometric, Jiang et al.25 presented a prediction model on the basis of the constructed integrated network. By connecting miRNAs to diseases with the proteins as a bridge between them, a calculation model was employed by Mork et al.26 through using a scoring scheme, which can greatly increase the model’s efficiency. Furthermore, Shi et al.27 implemented random walk on a built protein similarity network to identify MDAs.

By combining the known MDAs network and MFS network, a new calculating method was studied by Chen et al.28 by the analyzed of random walk with restart (RWR). It is worth noting that RWR is a very effective model for MDAs prediction. By adopting RWR, a novel model named Composite Network based inference for MiRNA-Disease Association prediction (CNMDA) was presented in the light of a multi-level network which was built by combination of Gaussian interaction profile kernel similarity (GIPKS) for lncRNAs, integrated similarity for miRNAs (ISMs) and diseases (ISDs), known MDAs, lncRNA-disease associations (LDAs) and miRNA-lncRNA interactions (MLIs). In addition, leave-one-out cross validation (LOOCV) and 5-fold cross validation were adopted in this paper to assess CNMDA’s effectiveness. It could be seen that the AUCs of LOOCV and 5-fold cross validation were respectively 0.8547 and 0.8533+/−0.0009. As for case studies, CNMDA was carried out on kidney neoplasms (KN), breast neoplasms (BN) and lung neoplasms (LN) to infer its associated miRNAs based on HMDD v2.029. Also according to HMDD v2.0, we further infer novel miRNAs for BN after hiding its known associated miRNAs. At last, we carried out the case studies based on HMDD v1.030 to infer LN-related miRNAs. Based on the above results, the effectiveness of CNMDA for MDAs prediction was validated.

Results

Cross validation

In this paper, we carried out LOOCV and 5-fold cross validation to assess CNMDA’s prediction accuracy according to HMDD v2.029 and then made comparison between CNMDA and four other classical computational models: RLSMDA23, HDMP21, WBSMDA24 and RKNNMDA17 (See Fig. 1). In LOOCV, test sample is one of the 5430 MDAs; training samples are the rest of 5429 known MDAs; candidate samples are those unlabeled 184155 miRNA-disease pairs. When each known MDA was taken to be the test sample, we would get association scores for all miRNA-disease pairs after implementing MCMDA and then the ranking of test sample among the candidate samples would be gained based on their association scores. We would say that the model makes a correctly prediction if the test sample ranked higher than the set threshold. Finally, we drew Receiver-Operating Characteristics (ROC) curve through computing the ratio of true positive rate to false positive rate. To evaluate CNMDA’s performance, we computed area under the ROC curve (AUC). If AUC = 1, CNMDA would possess perfect performance; If AUC = 0.5, CNMDA could only predict randomly. As a result, CNMDA, RLSMDA, HDMP, WBSMDA, RKNNMDA obtained AUCs of 0.8547 (0.8533+/−0.0009), 0.8426 (0.8569+/−0.0020), 0.8366 (0.8342+/−0.0010), 0.8030 (0.8185+/−0.0009) and 0.7159 (0.6723+/−0.0027) in the LOOCV (5-fold cross validation), respectively. Through comparative analysis with other method, the reliability and effectiveness of CNMDA for identification of potential MDAs were proved.

Figure 1.

Figure 1

CNMDA got better AUCs of 0.8547 in the LOOCV in comparison of other four calculation approaches (RLSMDA, HDMP, WBSMDA, RKNNMDA).

Case studies

Three different case studies were also implemented to assess CNMDA’ performance. In the first case study, CNMDA was employed to predict KN-related miRNAs based on HMDD v2.0. Further, another two reliable MDA databases (dbDEMC and miR2Disease) would be utilized to validate the top 50 identified outcomes. In the second case study, we respectively inferred BN-associated miRNAs and BN-associated miRNAs after removing all known BN-associated miRNAs in HMDD v2.0. In the third kind of case studies, CNMDA was adopted to predict for LN according to associations in HMDD v1.0 and v2.0, respectively.

KN is a disease caused by cellular metabolic disorders31. If kidney tumors are detected and treated early and localized in the kidney, Patients would have a good disease-specific survival rate. Otherwise, patients have only an 18% two-year survival rate when they present with terminal disease32. With recent researches and studies, about two hundred and fifty thousand renal tumor patients are newly diagnosed annually, and KN’ morbidity and mortality continue to increase33. Many miRNAs related to KN have been found based on a large number of biological experiments. For example, in renal cell carcinoma (RCC), up regulation of miR-21 is related to kidney cancer that with lower survival rate34. Through targeting MMP-9 in RCC, miRNA-133b can suppress cell proliferation, migration and invasion35. Finally, we implemented CNMDA for potential KN-related miRNA prediction. It was found that 8 of the first 10 and 37 of the first 50 miRNAs were verified (See Supplementary Table 1). we also provided the whole scores of potential MDAs on the base of HMDD v2.0 (See Supplementary Table 2).

BN is a major chronic disease affecting adult women and detected breast neoplasms can be removed surgically36. However, if people with BN have not been detected, BN may develop into a life-threatening clinical recurrence in the next 5, 10, 15, or more years37. Recent experimental studies have provide evidences that miRNA-195 may work as latent biomarker for early BN detection38. To find the novel biomarkers for BN for the treatment of the disease is significant. In the second, we employed CNMDA for potential BN-related miRNA prediction. It was found that 5 of the first 10 and 31 of the first 50 miRNAs were verified (See Supplementary Table 3). Also, we implemented CNMDA for the prediction of BN by hiding all its confirmed associations in HMDD v2.0. This means that we would remove all known BN-associated miRNAs and predict potential BN-associated miRNAs based on other known associations and corresponding similarity information. Supplementary Table 4 presents the top 50 predicted outcomes and their verification evidences. As a result, 9 of the first 10 and 41 of the first 50 miRNAs were confirmed (See Supplementary Table 4).

LN is the primary reason of cancer deaths on a global scale39. The genetic and epigenetic damage caused by tobacco smoke is the main cause of the disease40. Obviously, it is urgent to find a more therapy systemic39. In squamous cell carcinoma, miR-126 have been verified to be down regulated and two miRNAs of miR-185∗, miR-125a-5p were up regulated39. MiR-205 were expressed differently in the non-small cell lung carcinoma (NSCLC)40. In order to test the stability of CNMDA, we employed CNMDA based on the associations in HMDD v2.0 and HMDD v1.030, respectively. It was found that 20 and 28 of first 50 associated miRNAs for LN have been verified, respectively (See Supplementary Tables 5 and 6).

As seen in the results above, we can arrival at a conclusion that CNMDA possesses excellent predictive performance for the novel MDAs prediction.

Discussions

As overwhelming evidences expounded that miRNAs are participated in all sorts of diseases. The development of new calculation approaches for predicting MDAs in a short time is important to further experimental validation. Accordingly, it is now possible to confirmed novel MDAs using biological experiments with low time and cost. Existing models are usually proposed based on four different calculation mechanisms41. Some scoring functions were constructed to prioritize disease-related miRNAs through carrying out probability distribution. Complex network algorithm-based prediction models were introduced through establishing complex network based on various data that are collected or calculated from different perspectives. Machine learning-based prediction models were introduced by using powerful machine learning algorithms. Moreover, multiple biological information-based models were put forward through constructing intermediate medium associations based on various biological datasets. We put forward the computing method of CNMDA to infer novel MDAs. In the model, we implemented RWR on a multi-level composite network that was built through combining collected and calculated data (ISD, ISM, GIPKS for lncRNAs, experimentally validated MDAs, MLIs and LDAs). From the evaluation results, it can be seen that the accuracy of our prediction model was superior in the comparison with other four models.

The main merits for the effective performance of CNMDA are as follows: Through taking advantage of multi-source information based on reliable database, it is no surprise that the integration strategy of CNMDA could predict potential MDAs effectively. Secondly, in comparison of local network information, RWR is an iterative process based on global network for the MDAs prediction. The attractive properties of global network information have been proved in the identification for potential disease-gene associations, MDAs41,42, LDAs43 and drug-target interaction44. Furthermore, CNMDA could identify novel diseases that have no known associated miRNAs. At last, the implementation of CNMDA only needs positive samples as training data. Since there is no known negative sample information, the forecasting precision of CNMDA is more convincing. However, some limitations also exist in the computation model of CNMDA. For example, the number of experimentally determined MDAs, LDAs and MLI is insufficient. For the number of known MDAs, only 5430 known MDAs were collected. The more the known MDAs, the higher forecasting precision the model. Importantly, the current forecasting precision still needs to be improved according to the evaluation of LOOCV.

Methods

MiRNA-disease associations

Experimentally confirmed MDAs used in this paper were come from high-quality database29. Through constructing a adjacency matrix Wdm to indicate the 5430 known MDAs, we made use of variables nm and nd to express the total amount of miRNAs and diseases in the known MDAs dataset, respectively.

Wdm(i,j)={1,ifmiRNAm(j)isrelatedtodiseased(i)0,otherwise 1

LncRNA-disease associations

The known LDAs was from the LncRNADisease45. After removing excess LDAs whose diseases don’t arise in the 5430 known MDAs mentioned above, we would acquire 250 known LDAs. Likewise, we built an adjacency matrix Wdl(i,j) to indicate the 250 known LDAs. Variable nl refer to the number of lncRNAs in the 250 known LDAs.

Wdl(i,j)={1,iflncRNAl(j)isrelatedtodiseased(i)0,otherwise 2

MiRNA-lncRNA interactions

The known MLIs was from starBase v2.046. In the same way, we need to delete excess MLIs whose miRNAs and lncRNAs do not exist in the 5430 known MDAs and 250 known LDAs. At last, 9088 known MLIs were gotten and an adjacency matrix Wml was used to refer to the 9088 MLIs.

Wml(i,j)={1,iflncRNAl(j)isrelatedtomiRNAm(i)0,otherwise 3

MiRNA functional similarity

The scores of MFS were obtained from http://www.cuilab.cn/files/images/cuilab/misim.zip47. We used FS(i,j) to indicate the score of MFS between miRNA m(i) and miRNA m(j).

Disease semantic similarity model 1 (DSS1)

We put forward DSS148 on the basis of Directed Acyclic Graph (DAG)49, which can be picked up according to MeSH descriptor of Category C. In the DAG = (D, T(D), E(D)) for disease D, all nodes are linked together from father to son using a straight line. The nodes of D and its elder can be collected into T(D) and E(D) referring to all the straight lines from father to son. Therefore, contribution of disease d in DAG(D) to the semantic value of disease D can be put forward.

{DD1(d)=1ifd=DDD1(d)=max{ΔDD1(d')|d'childrenofd}ifdD 4

where Δ is the semantic contribution decay factor. It is worthy of being mentioned that the value of contribution for disease D to its own semantic value is 1. The semantic value of disease D could be put forward.

DV1(D)=dT(D)DD1(d) 5

At last, DSS1 between d(i) and d(j) can be described.

SS1(d(i),d(j))=tT(d(i))T(d(j))(Dd(i)1(t)+Dd(j)1(t))DV1(d(i))+DV1(d(j)) 6

Disease semantic similarity model 2 (DSS2)

In the DSS248, due to the fact that a more specific disease d appearing in less DAGs would contribute more to the semantic value of disease D. Accordingly, the contribution made by d for the semantic value of D can be described by

DD2(d)=log[thenumberofDAGsincludingdthenumberofdisease] 7

DSS2 between disease d(i) and d(j) can be defined as follows:

DV2(D)=dT(D)DD2(d) 8
SS2(d(i),d(j))=tT(d(i))T(d(j))(Dd(i)2(t)+Dd(j)2(t))DV2(d(i))+DV2(d(j)) 9

Gaussian interaction profile kernel similarity

For disease d(u), we used IP(d(u)) to refer to row vectors of line u in Wdm on the basis of known MDA. Through watching whether d(u) is related to each miRNA, we computed GIPKS for diseases d(u) and d(v)50.

KD(d(u),d(v))=exp(γdIP(d(u))IP(d(v))2) 10

where

γd=γd/(1ndu=1ndIP(d(u))2) 11

Similarly, GIPKS for miRNA m(i) and m(j) can be constructed.

KM(m(i),m(j))=exp(γmIP(m(i))IP(m(j))2) 12

where

γm=γm/(1nmi=1nmIP(m(i))2) 13

For lncRNA l(p) and l(q), GIPKS between them can be constructed.

KL(l(p),l(q))=exp(γlIP(l(p))IP(l(q))2) 14
γl=γl/(1nlp=1nlIP(l(p))2) 15

Integrated similarity for diseases (ISD) and miRNAs

We have taken into account combining GIPKS for diseases, DSS1 and DSS2 to compute ISD between diseases d(u) and d(v)24.

SD(d(u),d(v))={SS1(d(u),d(v))+SS2(d(u),d(v))2d(u)andd(v)hassemanticsimilarityKD(d(u),d(v))otherwise 16

Similarly, the ISM between miRNAs m(i) and m(j) can be put forward by the integration of GIPK for miRNA and MFS24.

Sm(m(i),m(j))={FS(m(i),m(j))m(i)andm(j)hasfunctionalsimilarityKM(m(i),m(j))otherwise 17

CNMDA

Aiming at the prediction of potential MDAs, a computing method of CNMDA was stated. Carrying out RWR on a multi-level composite network that built by integration of ISM, ISD, GIPKS for lncRNA, known MDAs, LDAs and MLIs, final association scores of novel MDAs would be obtained (See Fig. 2, motivated by the studies of Yao et al.51). In our introduced model, we used Wl,Wd,Wm,Wld,Wdm,Wlm to indicate the initial matrix of GIPKS for lncRNAs, ISD, ISM, known LDAs, known MDAs and known MLIs, respectively. Then, the initial matrix of the multi-level composite network can be defined as W=[WlWldWlmWldTWdWdmWlmTWdmTWm], here, T refer to the transposition of matrix.

Figure 2.

Figure 2

Flowchart of CNMDA for potential MDAs prediction in the light of HMDD v2.0. Each node in the constructed multi-level composite network possesses original probability p0. Final scores p for MDAs would be gotten after employing RWR.

Global information based on the multi-level network would be captured through RWR algorithm. At each steps, seed nodes move to their immediate neighbors with a probability (1δ) or go back to the seed nodes with a restart probability δ. P0 was put forward to denote the original probability vector, and Pt+1 was introduced to represent a probability vector of node at step t + 1, which could be described by:

Pt+1=(1δ)MPt+δP0 18

where δ(0,1) is a restart probability. In the multi-level network, the initial seed node probability P0=[αu0βv0(1αβ)w0], where α, β and (1 − α − β) denote the weight of ISD network, ISM network and the network of GIPKS for lncRNAs, respectively. The corresponding u0, v0, w0 are the original probabilities of these three-similarity networks respectively. Here, u0 is calculated through assigning equal probability to all nodes in LDAs with a total to 1. Similarly, v0, w0 can be calculated.

Meanwhile, the transition matrix M=[MlMldMlmMdlTMdMdmMmlTMmdTMm] can be computed in the light of adjacency matrix W. M(i,j) represents the transition probability from i to j. In the network of GIPKS for lncRNAs, the transition probability from lncRNA i(li) to lncRNA j(lj) was put forward.

Ml(i,j)=Pr(lj|li)={(1xy)Wl(i,j)/jWl(i,j),ifjWld(i,j)0andjWlm(i,j)0(1x)Wl(i,j)/jWl(i,j),ifjWld(i,j)0andjWlm(i,j)=0(1y)Wl(i,j)/jWl(i,j),ifjWld(i,j)=0andjWlm(i,j)0Wl(i,j)/jWl(i,j),ifjWld(i,j)=0andjWlm(i,j)0 19

Similarly, in the ISD network, the transition probability from disease i(di) to disease j(dj) was put forward.

Md(i,j)=Pr(dj|di)={(1xz)Wd(i,j)/jWd(i,j),ifjWdm(i,j)0andjWld(j,i)0(1z)Wd(i,j)/jWd(i,j),ifjWdm(i,j)0andjWld(j,i)=0(1x)Wd(i,j)/jWd(i,j),ifjWdm(i,j)=0andjWld(j,i)0Wd(i,j)/jWd(i,j),ifjWdm(i,j)=0andjWld(j,i)=0 20

In the ISM network, the transition probability from miRNA i(mi) to miRNA j(mj) was put forward.

Mm(i,j)=Pr(mj|mi)={(1yz)Wm(i,j)/jWm(i,j),ifjWdm(j,i)0andjWlm(j,i)0(1y)Wm(i,j)/jWm(i,j),ifjWdm(j,i)0andjWlm(j,i)=0(1z)Wm(i,j)/jWm(i,j),ifjWdm(j,i)=0andjWlm(j,i)0Wm(i,j)/jWm(i,j),ifjWdm(j,i)=0andjWlm(j,i)=0 21

In the LDAs network, the transition probability from lncRNA i(li) to disease j(dj) was put forward.

Mld(i,j)=Pr(dj|li)={xWld(i,j)/jWld(i,j),ifjWld(i,j)00,otherwise 22

In the MLIs network, transition probability from lncRNA i(li) to miRNA j(mj) was put forward.

Mlm(i,j)=Pr(mj|li)={yWlm(i,j)/jWlm(i,j),ifjWlm(i,j)00,otherwise 23

In the LDAs network, the transition probability from disease i(di) to lncRNA j(lj) was put forward.

Mdl(i,j)=Pr(lj|di)={xWld(j,i)/jWld(j,i),ifjWld(j,i)00,otherwise 24

In the MDAs network, the transition probability from disease i(di) to miRNA j(mj) was put forward.

Mdm(i,j)=Pr(mj|di)={zWdm(i,j)/jWdm(i,j),ifjWdm(j,i)00,otherwise 25

In the MLIs network, the transition probability from miRNA i(mi) to lncRNA j(lj) was put forward.

Mml(i,j)=Pr(lj|mi)={yWlm(j,i)/jWlm(j,i),ifjWlm(j,i)00,otherwise 26

In the MDAs network, the transition probability from miRNA i(mi) to disease j(dj) was put forward.

Mmd(i,j)=Pr(dj|mi)={zWmd(j,i)/jWmd(j,i),ifjWmd(j,i)00,otherwise 27

where x,y,z are the jumping probability between the network of GIPKS for lncRNAs and ISD network, between the network of GIPKS for lncRNAs and ISM network, and between ISD network and ISM network, respectively. CNMDA is performed until the probabilities tend to a steady state, P=[αuβv(1αβ)w] (the range between Pt and P0 computed by L1 norm is smaller than 10−6). Then, the candidate miRNAs can be ranked according to w.

By incorporating MLIs and LDA into MDAs prediction, RWR was put forward on a constructed multi-level network to infer novel MDAs. In the network, because initial MLIs, LDAs and MDAs have more credibility, they all as weights in the RWR equations. Obviously, the one interaction and two associations play an equally important part in the network to disseminate information of miRNAs, diseases and lncRNAs for the novel MDAs prediction. In this study, we chose the same parameter as the one in previous literature51, which used RWR on the same multi-level composite network in their study. Therefore, we set the parameter δ to 0.7 and x, y, z, α, β to 13.

Electronic supplementary material

Supplementary material (25.4KB, docx)
Supplementary Table 1 (9.8KB, xlsx)
Supplementary Table 2 (4.7MB, xlsx)
Supplementary Table 3 (9.7KB, xlsx)
Supplementary Table 4 (9.8KB, xlsx)
Supplementary Table 5 (9.7KB, xlsx)
Supplementary Table 6 (9.7KB, xlsx)

Acknowledgements

BSH was supported by Key Program of Hunan Provincial Education Department (Grant No. 15A026), General Program of Hunan Provincial Philosophy and Social Science Planning Fund office (Grant No. 15YBA035). MC was supported by National Nature Science Foundation of China (Grant No. 61772192), Nature Science Foundation of Hunan Province under Grant (Grant No. 2018JJ2085), Key Cultivation Project in Hunan Institute of Technology (Grant No. 2017HGPY001).

Author Contributions

B.S.H. put forward the subject and its prediction approach, also, wrote the manuscript. J.Q. carried out the experiments and amended this manuscript. M.C. analyzed the outcome and amended manuscript.

Competing Interests

The authors declare no competing interests.

Footnotes

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Jia Qu, Email: TB17060015B4@cumt.edu.cn.

Min Chen, Email: chenmin@hnit.edu.cn.

Electronic supplementary material

Supplementary information accompanies this paper at 10.1038/s41598-018-34180-6.

References

  • 1.Wienholds E, Plasterk RHA. MicroRNA expression in zebrafish embryonic development. Science. 2005;309:310–311. doi: 10.1126/science.1114519. [DOI] [PubMed] [Google Scholar]
  • 2.Cheng AM, Byrom MW, Jeffrey S, Ford LP. Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis. Nucleic Acids Research. 2005;33:1290–1297. doi: 10.1093/nar/gki200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Karp X, Ambros V. Encountering microRNAs in cell fate signaling. Science. 2005;310:1288–1289. doi: 10.1126/science.1121566. [DOI] [PubMed] [Google Scholar]
  • 4.Alshalalfa M, Alhajj R. Using context-specific effect of miRNAs to identify functional associations between miRNAs and gene signatures. Bmc Bioinformatics. 2013;14:S1. doi: 10.1186/1471-2105-14-S12-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Miska EA. How microRNAs control cell division, differentiation and death. Curr Opin Genet Dev. 2005;15:563–568. doi: 10.1016/j.gde.2005.08.005. [DOI] [PubMed] [Google Scholar]
  • 6.Bartel DP. MicroRNA Target Recognition and Regulatory Functions. Cell. 2009;136:215–233. doi: 10.1016/j.cell.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Xu P, Guo M, Hay BA. MicroRNAs and the regulation of cell death. Trends in genetics: TIG. 2004;20:617–624. doi: 10.1016/j.tig.2004.09.010. [DOI] [PubMed] [Google Scholar]
  • 8.Esquela-Kerscher A, Slack FJ. Oncomirs - microRNAs with a role in cancer. Nature reviews. Cancer. 2006;6:259–269. doi: 10.1038/nrc1840. [DOI] [PubMed] [Google Scholar]
  • 9.Meola N, Gennarino VA, Banfi S. microRNAs and genetic diseases. PathoGenetics. 2009;2:7. doi: 10.1186/1755-8417-2-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tavazoie SF, et al. Endogenous human microRNAs that suppress breast cancer metastasis. Nature. 2008;451:147–152. doi: 10.1038/nature06487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Slaby O, et al. Altered expression of miR-21, miR-31, miR-143 and miR-145 is related to clinicopathologic features of colorectal cancer. Oncology. 2007;72:397–402. doi: 10.1159/000113489. [DOI] [PubMed] [Google Scholar]
  • 12.Calin GA, Croce CM. MicroRNA signatures in human cancers. Nature Reviews Cancer. 2006;6:857–866. doi: 10.1038/nrc1997. [DOI] [PubMed] [Google Scholar]
  • 13.Zeng X, Liu L, Lu L, Zou Q. Prediction of potential disease-associated microRNAs using structural perturbation method. Bioinformatics. 2018;34:2425–2432. doi: 10.1093/bioinformatics/bty112. [DOI] [PubMed] [Google Scholar]
  • 14.Zeng X, Zhang X, Zou Q. Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks. Briefings in bioinformatics. 2016;17:193–203. doi: 10.1093/bib/bbv033. [DOI] [PubMed] [Google Scholar]
  • 15.Chen, X., Wang, L., Qu, J., Guan, N. N. & Li, J. Q. Predicting miRNA-disease association based on inductive matrix completion. Bioinformatics (Oxford, England), 10.1093/bioinformatics/bty503 (2018). [DOI] [PubMed]
  • 16.Chen, X., Zhou, Z. & Zhao, Y. ELLPMDA: Ensemble learning and link prediction for miRNA-disease association prediction. RNA Biol, 1–12, 10.1080/15476286.2018.1460016 (2018). [DOI] [PMC free article] [PubMed]
  • 17.Chen X, Wu QF, Yan GY. RKNNMDA: Ranking-based KNN for MiRNA-Disease Association prediction. RNA Biol. 2017;14:952–962. doi: 10.1080/15476286.2017.1312226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Li JQ, Rong ZH, Chen X, Yan GY, You ZH. MCMDA: Matrix completion for MiRNA-disease association prediction. Oncotarget. 2017;8:21187–21199. doi: 10.18632/oncotarget.15061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chen X, et al. RBMMMDA: predicting multiple types of disease-microRNA associations. Scientific Reports. 2015;5:13877. doi: 10.1038/srep13877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Chen X, et al. HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction. Oncotarget. 2016;7:65257–65269. doi: 10.18632/oncotarget.11251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Xuan P, et al. Correction: Prediction of microRNAs Associated with Human Diseases Based on Weighted k Most Similar Neighbors. Plos One. 2013;8:e70204. doi: 10.1371/journal.pone.0070204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Xuan P, et al. Prediction of potential disease-associated microRNAs based on random walk. Bioinformatics. 2015;31:1805–1815. doi: 10.1093/bioinformatics/btv039. [DOI] [PubMed] [Google Scholar]
  • 23.Chen X, Yan GY. Semi-supervised learning for potential human microRNA-disease associations inference. Scientific Reports. 2014;4:5501. doi: 10.1038/srep05501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Chen X, et al. WBSMDA: Within and Between Score for MiRNA-Disease Association prediction. Scientific Reports. 2016;6:21106. doi: 10.1038/srep21106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jiang Q, et al. Prioritization of disease microRNAs through a human phenome-microRNAome network. Bmc Systems Biology. 2010;4(Suppl 1):S2. doi: 10.1186/1752-0509-4-S1-S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Mork S, Pletscher-Frankild S, Palleja Caro A, Gorodkin J, Jensen LJ. Protein-driven inference of miRNA-disease associations. Bioinformatics. 2014;30:392–397. doi: 10.1093/bioinformatics/btt677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Shi H, et al. Walking the interactome to identify human miRNA-disease associations through the functional link between miRNA targets and disease genes. Bmc Systems Biology. 2013;7:1–12. doi: 10.1186/1752-0509-7-101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chen X, Liu MX, Yan GY. RWRMDA: predicting novel human microRNA-disease associations. Mol Biosyst. 2012;8:2792–2798. doi: 10.1039/c2mb25180a. [DOI] [PubMed] [Google Scholar]
  • 29.Li Y, et al. HMDDv2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Res. 2014;42:D1070–1074. doi: 10.1093/nar/gkt1023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lu M, et al. An analysis of human microRNA and disease associations. Plos One. 2008;3:e3420. doi: 10.1371/journal.pone.0003420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Linehan WM, Grubb RL, Coleman JA, Zbar B, Walther MM. The genetic basis of cancer of kidney cancer: implications for gene-specific clinical management. Bju International. 2005;95:2–7. doi: 10.1111/j.1464-410X.2005.05189.x. [DOI] [PubMed] [Google Scholar]
  • 32.Sudarshan S, Linehan WM. Genetic basis of cancer of the kidney. Seminars in oncology. 2006;33:544–551. doi: 10.1053/j.seminoncol.2006.06.008. [DOI] [PubMed] [Google Scholar]
  • 33.Lamm DL. Cancer statistics. CA: a cancer journal for clinicians. 1990;40:318–319. doi: 10.3322/canjclin.40.5.318. [DOI] [PubMed] [Google Scholar]
  • 34.Zaman MS, et al. Up-Regulation of MicroRNA-21 Correlates with Lower Kidney Cancer Survival. Plos One. 2012;7:e31060–e31060. doi: 10.1371/journal.pone.0031060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wu D, et al. microRNA-133b downregulation and inhibition of cell proliferation, migration and invasion by targeting matrix metallopeptidase-9 in renal cell carcinoma. Molecular Medicine Reports. 2014;10:2491–2498. doi: 10.3892/mmr.2014.2116. [DOI] [PubMed] [Google Scholar]
  • 36.Smigal C, et al. Trends in breast cancer by race and ethnicity: update 2006. CA: a cancer journal for clinicians. 2006;56:168–183. doi: 10.3322/canjclin.56.3.168. [DOI] [PubMed] [Google Scholar]
  • 37.Group EBCTC. Effects of chemotherapy and hormonal therapy for early breast cancer on recurrence and 15-year survival: an overview of the randomised trials. Lancet. 2005;365:1687–1717. doi: 10.1016/S0140-6736(05)66544-0. [DOI] [PubMed] [Google Scholar]
  • 38.Heneghan HM, Miller N, Kelly R, Newell J, Kerin MJ. Systemic miRNA-195 Differentiates Breast Cancer from Other Malignancies and Is a Potential Biomarker for Detecting Noninvasive and Early Stage Disease. Oncologist. 2010;15:673–682. doi: 10.1634/theoncologist.2010-0103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Yang Y, et al. The role of microRNA in human lung squamous cell carcinoma. Cancer Genetics & Cytogenetics. 2010;200:127–133. doi: 10.1016/j.cancergencyto.2010.03.014. [DOI] [PubMed] [Google Scholar]
  • 40.Yanaihara N, et al. Unique microRNA molecular profiles in lung cancer diagnosis and prognosis. Cancer Cell. 2006;9:189–198. doi: 10.1016/j.ccr.2006.01.025. [DOI] [PubMed] [Google Scholar]
  • 41.Chen, X., Xie, D., Zhao, Q. & You, Z. H. MicroRNAs and complex diseases: from experimental results to computational models. Briefings in bioinformatics, 10.1093/bib/bbx130 (2017). [DOI] [PubMed]
  • 42.You ZH, et al. PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction. Plos Computational Biology. 2017;13:e1005455. doi: 10.1371/journal.pcbi.1005455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chen X, Yan CC, Zhang X, You ZH. Long non-coding RNAs and complex diseases: from experimental results to computational models. Briefings in bioinformatics. 2017;18:558–576. doi: 10.1093/bib/bbw060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chen X, et al. Drug-target interaction prediction: databases, web servers and computational models. Briefings in bioinformatics. 2016;17:696–712. doi: 10.1093/bib/bbv066. [DOI] [PubMed] [Google Scholar]
  • 45.Chen G, et al. LncRNADisease: a database for long-non-coding RNA-associated diseases. Nucleic Acids Res. 2013;41:D983–986. doi: 10.1093/nar/gks1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Li JH, Liu S, Zhou H, Qu LH, Yang JH. StarBasev2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42:D92–97. doi: 10.1093/nar/gkt1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wang D, Wang J, Lu M, Song F, Cui Q. Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics. 2010;26:1644–1650. doi: 10.1093/bioinformatics/btq241. [DOI] [PubMed] [Google Scholar]
  • 48.Xuan P, et al. Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PLoS One. 2013;8:e70204. doi: 10.1371/journal.pone.0070204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Chen X, Huang YA, Wang XS, You ZH, Chan KC. FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model. Oncotarget. 2016;7:45948–45958. doi: 10.18632/oncotarget.10008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.van Laarhoven T, Nabuurs SB, Marchiori E. Gaussian interaction profile kernels for predicting drug-target interaction. Bioinformatics. 2011;27:3036–3043. doi: 10.1093/bioinformatics/btr500. [DOI] [PubMed] [Google Scholar]
  • 51.Yao Q, et al. Global Prioritizing Disease Candidate lncRNAs via a Multi-level Composite Network. Sci Rep. 2017;7:39516. doi: 10.1038/srep39516. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material (25.4KB, docx)
Supplementary Table 1 (9.8KB, xlsx)
Supplementary Table 2 (4.7MB, xlsx)
Supplementary Table 3 (9.7KB, xlsx)
Supplementary Table 4 (9.8KB, xlsx)
Supplementary Table 5 (9.7KB, xlsx)
Supplementary Table 6 (9.7KB, xlsx)

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES