ABSTRACT
Since lots of miRNA-disease associations have been verified, it is meaningful to discover more miRNA-disease associations for serving disease diagnosis and prevention of human complex diseases. However, it is not practical to identify potential associations using traditional biological experimental methods since the process is expensive and time consuming. Therefore, it is necessary to develop efficient computational methods to accomplish this task. In this work, we introduced a matrix completion model with dual Laplacian regularization (DLRMC) to infer unknown miRNA-disease associations in heterogeneous omics data. Specifically, DLRMC transformed the task of miRNA-disease association prediction into a matrix completion problem, in which the potential missing entries of the miRNA-disease association matrix were calculated, the missing association can be obtained based on the prediction scores after the completion procedure. Meanwhile, the miRNA functional similarity and the disease semantic similarity were fully exploited to serve the miRNA-disease association matrix completion by using a dual Laplacian regularization term. In the experiments, we conducted global and local Leave-One-Out Cross Validation (LOOCV) and case studies to evaluate the efficacy of DLRMC on the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database. As a result, the AUCs of DLRMC is 0.9174 and 0.8289 in global LOOCV and local LOOCV, respectively, which significantly outperform a variety of previous methods. In addition, in the case studies on four significant diseases related to human health including Colon Neoplasms, Kidney neoplasms, Lymphoma and Prostate neoplasms, 90%, 92%, 92% and 94% out of the top 50 predicted miRNAs has been confirmed, respectively.
KEYWORDS: miRNA-disease association prediction, matrix completion, Laplacian regularization, microRNA
1. Introduction
As a kind of short non-coding single-stranded RNA, MicroRNAs (miRNAs) (22nt) are found in plants, animals, and some viruses. By binding to the 3ʹ untranslated regions (UTRs) of the target messenger RNAs (mRNAs) through base pairing, miRNAs can suppress the protein production and gene expression [1–5]. On the other hand, some scientific researchers have also shown that miRNAs could also act as positive regulators [6,7]. Since the first two miRNAs, i.e., Caenorhabditis elegans lin-4 and let-7 were discovered more than twenty years ago, more and more miRNAs have been detected in recent years. Meanwhile, large amounts of evidences confirm that miRNAs play vital roles in a variety of crucial cell biological processes, such as proliferation, development, differentiation, apoptosis, metabolism, viral infection, aging, signal transduction and so on [8–14]. Therefore, it is no doubt that miRNAs are closely related to many complex human diseases and some miRNA-disease associations have already been verified. For example, the miRNA deregulation is closely related to the development of various cancers [15–19]. As the first evidence for the fact that miRNAs are involved in cancer formation [20], firstly clarified that miR-15 and miR-16 are deleted in more than half cases of B-cell chronic lymphocytic leukemia. Also, miR-34a and miR-199a/b were found to be down-regulated in solid cancer cells [21]. Besides, researchers revealed that miR-126 were expressed with significantly higher levels in the blood from patients with Crohn Disease [22]. Furthermore, for patients with end-stage renal disease, the circulating levels of miR-15b were significantly suppressed [23]. Apart from the aforementioned miRNA-disease associations, there may exist other ones. Therefore, it is meaningful to discover more miRNA-disease associations for benefiting disease diagnosis and prevention of human complex diseases [24–27]. However, it is expensive and time consuming to identify the associations between miRNAs and diseases using existing biological experimental methods. Hence, it is necessary for us to develop efficient computational methods to predict the potential miRNA-disease associations by using the recent superior computing resources [28,29].
During last decades, many computational methods for predicting potential miRNA-disease associations have been proposed. Some of these methods depend on the assumption that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes [30–36]. For example, Wang et al. [37] predicted potential miRNA-disease associations by scoring each miRNA for a disease of interest through the cumulative hypergeometric distribution. Shi et al. [38] exploited the functional associations between miRNA and disease by implementing the algorithm of random walk on protein-protein interaction network. Based on the assumption that miRNAs whose target genes are related to certain diseases are more likely to be associated with these diseases, they discovered the miRNA-target interactions, disease-gene associations, and protein-protein interactions to predict potential associations between the miRNAs and diseases. Xuan et al. [39] developed the HDMP prediction method by calculating the functional similarities of weighted neighbor miRNAs. Instead of relying on the known miRNA-disease associations, Xu et al. [40] introduced a miRNA prioritization method by evaluating the similarity between the targets of miRNAs and disease genes. By combining protein-disease associations and miRNA-protein interactions together to predict novel miRNA-disease associations, Mork et al. [41] proposed a computational model named miRPD. Luo et al. [42] proposed a transduction learning-based method to prioritize disease-related miRNAs, especially for those diseases that are associated with sparse known miRNAs. In [43], the restricted Boltzmann machine (RBM) which is a two-layer undirected graphical model consisting of layers of visible and hidden units was deployed to construct a computational model called RBMMMDA. Li et al. [44] developed a Matrix Completion for MiRNA-Disease Association prediction model based on the known miRNA-disease associations, during the matrix completion, the known associations are well retained, potential associations are predicted by the completed values. Mugunga et al. [45] presented a prediction model that is based on the theory of path features and random walk to obtain a relevancy score of miRNA-related disease. In their model, highly ranked scores are potential miRNA-disease associations. Li et al. [46] explored the way to predict the miRNA-disease association by using the extensive collaborative filtering in order to diagnose the diseases better, they considered the miRNA functional similarity and disease similarity while used minimal amount of related information. In [47], three inference methods were introduced to predict potential miRNA-disease associations based on the global network similarity measure and the assumption that functionally related miRNAs tend to be associated with phenotypically similar diseases. Chen et al. [28] proposed to prioritize disease miRNAs by using the the between-scores and within-scores of each miRNA-disease pair. By integrating heterogeneous omics data for identifying disease miRNAs, Luo et al. [48] developed a Kronecker regularized least squares-based method for miRNA-disease association prediction. By considering that most of previous approaches strongly rely on the known association information, and only a handful of them could be applied to uncover the potential associations involving novel diseases or miRNAs. Chen et al. [29] presented a model named RLSMDA based on semi-supervised learning which calculates the semantic similarity between different diseases. RLSMDA could identify related miRNAs for diseases without any known associated miRNAs, meanwhile avoiding the problem of using negative associations between miRNAs and diseases. Xiao et al. [49] also introduced a novel framework called GRNMF to infer the unknown miRNA-disease associations in heterogeneous omics data, which could work for both new diseases and miRNAs. You et al. [50] constructed a heterogeneous graph consisting of three interlinked sub-graphs and further adopted depth-first search algorithm to infer potential miRNA-disease associations. By constructing a similarity network and utilizing ensemble learning to combine rank results given by three classic similarity-based algorithms, Chen et al. [35] obtained superior prediction results. Based on the known miRNA-disease associations, disease semantic similarity, miRNA functional similarity, and Gaussian interaction profile kernel similarity for miRNAs and diseases, many computational models have been put forward, e.g., Sparse Subspace Learning [51], matrix completion-based model [52], Extreme Gradient Boosting Machine [53], Bipartite Network Projection [54], Triple Layer Heterogeneous Network [55] and Random Forest [56]. In Table 1, we give a brief summary of the differences between some typical previous computational methods. As can be seen, the three kinds of similarity priors have been widely used in previous methods. A detailed review of previous miRNA-disease associations prediction methods can be found in [33,57].
Table 1.
Differences between some typical previous computational methods for miRNA-disease association prediction. ‘A’ denotes depending on disease semantic similarity, ‘B’ denotes depending on miRNA functional similarity, and ‘C’ denotes depending on Gaussian interaction profile kernel similarity for miRNAs and diseases.
| Year | Methods | A | B | C | Characteristic |
|---|---|---|---|---|---|
| 2010 | Phenome-microRNAome network [37] | ✓ | Cumulative hypergeometric distribution, phenotypic similarity score | ||
| 2013 | bipartite network [38] | ✓ | ✓ | PPI network, random walk, global perspective measures | |
| 2013 | Weighted kNN [39] | ✓ | ✓ | Weighted kNN, miRNA family and the cluster information | |
| 2013 | MBSI [47] | ✓ | microRNA-based similarity inference, global network similarity measure) | ||
| 2013 | PBSI [47] | ✓ | phenotype-based similarity inference, global network similarity measure | ||
| 2013 | NetCBI [47] | ✓ | network-consistency-based inference, global network similarity measure | ||
| 2014 | miRNAs Prioritization [40] | ✓ | molecular mechanisms, context-dependent miRNA-target interactions | ||
| 2014 | RLSMDA [29] | ✓ | ✓ | Semi-supervised prediction method, global approach | |
| 2014 | miRPD [41] | miRNAProteinDisease associations, text mining | |||
| 2015 | RBMMMDA [43] | Restricted Boltzmann machine, inferring multiple types of miRNA-disease pairs | |||
| 2016 | WBSMDA [28] | ✓ | ✓ | ✓ | Within and between score, integrating plenty of heterogeneous biological datasets |
| 2016 | HGIMDA [58] | ✓ | ✓ | ✓ | Heterogeneous Graph Inference |
| 2017 | BRWH [48] | ✓ | ✓ | bi-random walk, heterogeneous network, microbe similarity network, disease similarity network | |
| 2017 | GRNMF [49] | ✓ | ✓ | graph regularized NMF | |
| 2017 | PBMDA [50] | ✓ | ✓ | ✓ | special depth-first search algorithm, heterogeneous graph |
| 2017 | LRSSLMDA [51] | ✓ | ✓ | ✓ | Laplacian Regularized Sparse Subspace Learning, local structures of the training data |
| 2017 | CPTL [42] | ✓ | ✓ | Collective Prediction, Transduction Learning | |
| 2017 | RKNNMDA [34] | ✓ | ✓ | ✓ | Ranking-based KNN |
| 2017 | MCMDA [44] | Matrix completion, adjacency matrix of known miRNA-disease associations | |||
| 2018 | IMCMDA [52] | ✓ | ✓ | ✓ | Inductive Matrix Completion |
| 2018 | EGBMMDA [53] | ✓ | ✓ | Extreme Gradient Boosting Machine, statistical measures, graph theoretical measures, matrix factorization results | |
| 2018 | BNPMDA [54] | ✓ | ✓ | ✓ | Bipartite Network Projection, bias ratings, agglomerative hierarchical clustering |
| 2018 | TLHNMDA [55] | ✓ | ✓ | Triple Layer Heterogeneous Network based inference | |
| 2018 | ELLPMDA [35] | ✓ | ✓ | ✓ | Ensemble Learning, Link Prediction, similarity network |
| 2018 | MDHGI [59] | ✓ | ✓ | ✓ | Matrix decomposition and Heterogeneous Graph Inference |
| 2018 | BLHARMDA [60] | ✓ | ✓ | ✓ | Bipartite local models, Jaccard similarity, hubness-aware regression, KNN classifier |
| 2018 | SNMFMDA [61] | ✓ | ✓ | ✓ | Symmetric NMF, Kronecker regularized least square |
| 2018 | RFMDA [56] | ✓ | ✓ | ✓ | Random Forest, robust feature selection |
In this work, we introduce a novel dual laplacian regularized matrix completion framework, referred to as DLRMC briefly, to infer the unknown miRNA-disease associations in heterogeneous omics data, which can work well for both new diseases and miRNAs. DLRMC transforms the task of miRNA-disease association prediction into a matrix completion problem, in which the potential missing entries of the miRNA-disease association matrix were calculated, the missing association can be obtained based on the prediction scores after the completion procedure. In DLRMC, the miRNA functional similarity and the disease semantic similarity are fully exploited to serve the miRNA-disease association matrix completion by using a dual Laplacian regularization term, i.e., miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. In addition, during the matrix competition process, an indicator matrix with binary values which indicates the indices of the observed miRNA-disease associations are deployed to preserve the experimental confirmed associations. The experimental results indicate that DLRMC achieves superior performance compared with other methods and can be effectively applied in the discovery of missing or potential associations for novel diseases and miRNAs.
2. Results
2.1. Leave-one-out cross validation
To investigate the performance of DLRMC, we use global and local leave-one-out cross validation (LOOCV) based on the known miRNA-disease associations obtained from HMDDv2.0 database. As a validation method, LOOCV uses one known association as the test sample and the remaining known associations as the training samples. As to HMDDv2.0 database, we have already obtained 5430 known miRNA-disease associations, these associations can be regarded as the gold standard data set. The miRNA-diseases without known association evidences were considered as candidate samples. The scores of all miRNA-disease pairs can be obtained after DLRMC was implemented. As to global LOOCV, the score of the test sample was compared with the scores of all the candidate samples while in local LOOCV, the test sample was merely compared with the scores of the candidate samples which included the particular disease in the test sample. In this work, we perform 5-fold cross validation experiments for evaluation. The known miRNA-disease associations were randomly divided into five disjoint parts. For each time, one part was picked out as test samples and the rest four parts were used as training samples, and the miRNA-disease pairs without known association evidences were still regarded as candidate samples. Finally, the score of each test sample was compared with the scores of all the candidate samples, respectively. This procedure was repeated five times until each known association was used as test sample and its score was compared with the scores of the candidate samples. Those test samples whose scores rank exceeded the given threshold were considered to predict the miRNA-disease associations correctly.
2.2. Comparison with other methods
In order to demonstrate the efficacy of the proposed DLRMC, we compare its performance with the following methods: HDMP [39], RLSMDA [29], WBSMDA [28], MCMDA [44] and EGBMMDA [53]. We draw a receiver operating characteristics curve (ROC) to compare DLRMC with other previous state-of-the-art methods. In the ROC, true positive rate (TPR, sensitivity) against false positive rate (FPR, 1-specificity) at different thresholds are plotted [62]. TPR/sensitivity denotes the percentage of the test miRNA-disease associations which are ranked higher than the given threshold, while specificity represents the percentage of negative miRNA-disease pairs ranked below the threshold [63]. The area under the ROC curve (AUC) is calculated to evaluate the accuracy of miRNA-disease associations prediction methods. For a specific method, if AUC = 1, it means that the method owns a prefect performance. AUC of 0.5 means that the method merely has a random prediction performance. As to global LOOCV, the AUCs of HDMP, RLSMDA, WBSMDA, MCMDA, EGBMMDA and DLRMC are 0.8366, 0.8030, 0.8030, 0.8749, 0.9123 and 0.9174, respectively. For local LOOCV, HDMP, RLSMDA, WBSMDA, MCMDA, EGBMMDA and DLRMC reach AUCs of 0.7702, 0.0.6953, 0.0.8031, 0.7718, 0.8221 and 0.8289, respectively. Figs. 1a, b) show the ROCs of different methods in terms of global LOOCV and local LOOCV, respectively. As can be seen, DLRMC works more effective in predicting potential miRNA-disease associations when compared with the other previous methods.
Figure 1.

Performance evaluation comparison between DLRMC and other methods in terms of ROC curve and AUC based on global LOOCV and local LOOCV tested by known miRNA-disease associations in the HMDD database. As can be seen, DLRMC achieves AUC of 0.9174 in global LOOCV and 0.8289 in local LOOCV. Thus, the performance of DLRMC is almost better than other methods in some degree and it proves to be effective in predicting the potential miRNA-disease associations.
2.3. Case studies
Similar to previous methods, we also implement case studies to further validate the efficacy of the proposed DLRMC. In detail, four significant diseases related to human health including Colon Neoplasms, Kidney neoplasms, Lymphoma and Prostate neoplasms are used to practically evaluate the prediction accuracy of DLRMC. The top 20 and top 50 predicted miRNAs related with these diseases are examined by another two independent miRNA-disease databases: dbDEMC [64] and miR2Disease [65].
Colon Neoplasms, is known as a malignant cancer which is the third most common type of cancer constituting about 10% of all cancer cases [66]. However, early patients of colon neoplasms only suffer from subtle symptoms, which makes it a fairly hard-to-detect cancer at an early stage [67,68]. Studies have confirmed that some miRNAs are related to Colon Neoplasms, for instance, miR-145 is constantly downregulated in colorectal tumors [69]. MiR-127 has been reported to play a role as a possible tumor suppressor gene [70]. In addition, there exists report indicating that the occurrence rate of colon neoplasms has an increasing trend these years [71]. As a result, it is important to predict the potential miRNAs related to colon neoplasms. In this work, we implement DLRMC to predict the top 50 miRNAs associated with colon neoplasms. Finally, 19 of the top 20 and 45 of the top 50 predicted miRNAs associated with colon neoplasms are validated by dbDEMC and miR2Disease database,the prediction results are shown in Table 2.
Table 2.
The top 50 miRNAs associated with colon neoplasms predicted by our DLRMC. In summary, 19 out of the top 20, and 45 out of the top 50 predicted Colon Neoplasms related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-146a | dbdemc | hsa-mir-196a | dbdemc;miR2Disease |
| hsa-mir-155 | dbdemc;miR2Disease | hsa-mir-29c | dbdemc |
| hsa-mir-132 | miR2Disease | hsa-mir-223 | dbdemc;miR2Disease |
| hsa-mir-21 | dbdemc;miR2Disease | hsa-mir-143 | dbdemc;miR2Disease |
| hsa-mir-34a | dbdemc;miR2Disease | hsa-let-7a | unconfirmed |
| hsa-mir-221 | dbdemc;miR2Disease | hsa-mir-195 | dbdemc;miR2Disease |
| hsa-mir-20a | dbdemc;miR2Disease | hsa-mir-200b | dbdemc |
| hsa-mir-16 | dbdemc | hsa-mir-214 | dbdemc |
| hsa-let-7a | dbdemc;miR2Disease | hsa-mir-148a | dbdemc |
| hsa-mir-125b | dbdemc | hsa-mir-106b | dbdemc;miR2Disease |
| hsa-mir-29a | dbdemc;miR2Disease | hsa-mir-23a | miR2Disease |
| hsa-mir-29b | dbdemc;miR2Disease | hsa-mir-142 | unconfirmed |
| hsa-mir-15a | dbdemc | hsa-mir-31 | dbdemc;miR2Disease |
| hsa-mir-133a | dbdemc;miR2Disease | hsa-mir-34c | miR2Disease |
| hsa-mir-222 | dbdemc | hsa-mir-141 | dbdemc;miR2Disease |
| hsa-mir-199a | unconfirmed | hsa-mir-182 | dbdemc;miR2Disease |
| hsa-mir-26a | dbdemc;miR2Disease | hsa-mir-200a | unconfirmed |
| hsa-mir-1 | dbdemc;miR2Disease | hsa-let-7c | dbdemc |
| hsa-mir-19b | dbdemc;miR2Disease | hsa-mir-101 | unconfirmed |
| hsa-mir-19a | dbdemc;miR2Disease | hsa-mir-192 | dbdemc;miR2Disease |
| hsa-mir-15b | miR2Disease | hsa-mir-181a | dbdemc;miR2Disease |
| hsa-mir-18a | miR2Disease | hsa-mir-9 | dbdemc;miR2Disease |
| hsa-mir-92a | dbdemc | hsa-mir-133b | dbdemc;miR2Disease |
| hsa-mir-30b | dbdemc;miR2Disease | hsa-mir-34b | dbdemc;miR2Disease |
| hsa-mir-150 | dbdemc;miR2Disease | hsa-mir-183 | dbdemc;miR2Disease |
Kidney neoplasms is known as renal cancer, which starts in the cells of kidney, and it includes various types [72]. For kidney neoplasms patients, the most common symptoms are pains in the lumbar and hematuria [73]. Based on recent biological experiments, many existing kidney neoplasm-related miRNAs have been verified. For instance, the common target ACVR2B of five miRNAs (miRNA-192, miRNA-194, miRNA-215, miRNA-200c and miRNA-141) is strongly expressed in renal childhood neoplasms [74]. In addition, researches had reported that the decreasing miRNA-23b expression might inhibit kidney tumor growth [75]. As a result of the case study for Kidney neoplasms, 17 out of the top-20 predicted miRNAs and 46 out of the top-50 predicted miRNAs of kidney neoplasms were verified by miR2Disease database and dbDEMC database. The prediction results are shown in Table 3.
Table 3.
The top 50 miRNAs associated with kidney neoplasms predicted by our DLRMC. In summary, 17 out of the top 20, and 46 out of the top 50 predicted kidney neoplasms related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-155 | dbdemc | hsa-mir-92a | dbdemc |
| hsa-mir-146a | dbdemc | hsa-mir-195 | dbdemc |
| hsa-mir-122 | dbdemc;miR2Disease | hsa-mir-126 | dbdemc;miR2Disease |
| hsa-mir-34a | dbdemc | hsa-mir-29c | dbdemc;miR2Disease |
| hsa-mir-221 | unconfirmed | hsa-mir-23a | dbdemc |
| hsa-mir-16 | dbdemc | hsa-mir-143 | dbdemc |
| hsa-mir-125b | unconfirmed | hsa-mir-223 | dbdemc |
| hsa-mir-29a | dbdemc;miR2Disease | hsa-mir-214 | dbdemc;miR2Disease |
| hsa-mir-133a | unconfirmed | hsa-let-7a | dbdemc |
| hsa-mir-29b | dbdemc;miR2Disease | hsa-mir-148a | dbdemc |
| hsa-mir-145 | dbdemc | hsa-mir-200b | dbdemc;miR2Disease |
| hsa-mir-26a | dbdemc;miR2Disease | hsa-mir-31 | dbdemc |
| hsa-mir-199a | dbdemc;miR2Disease | hsa-mir-210 | dbdemc;miR2Disease |
| hsa-mir-222 | dbdemc | hsa-mir-106b | dbdemc;miR2Disease |
| hsa-mir-1 | dbdemc | hsa-mir-34c | dbdemc |
| hsa-mir-15b | dbdemc | hsa-mir-182 | dbdemc;miR2Disease |
| hsa-mir-20a | dbdemc;miR2Disease | hsa-mir-200a | dbdemc |
| hsa-mir-17 | dbdemc;miR2Disease | hsa-mir-101 | dbdemc;miR2Disease |
| hsa-mir-30b | dbdemc | hsa-let-7c | dbdemc |
| hsa-mir-206 | dbdemc | hsa-mir-181a | dbdemc |
| hsa-mir-19a | dbdemc | hsa-mir-9 | dbdemc |
| hsa-mir-196a | dbdemc | hsa-mir-34b | dbdemc |
| hsa-mir-19b | dbdemc;miR2Disease | hsa-mir-183 | dbdemc |
| hsa-mir-18a | dbdemc | hsa-mir-218 | dbdemc |
| hsa-mir-150 | dbdemc;miR2Disease | hsa-let-7b | unconfirmed |
Lymphoma often refers to a group of cancerous blood cell tumors, it originates in the lymphatic hematopoietic system [76] which consists of two categories: non-Hodgkin lymphoma (NHL) and Hodgkin’s lymphoma (HL). Worldwide, lymphoma is the seventh most common cancer and makes up 3–4% of all cancers. As to children, lymphomas is also the third-most common cancer. Thanks to the development of high-throughput sequencing technologies, several miRNAs had been already reported to be associated with lymphomas. For example, re-expression of miRNA-150 induces EBV-positive Burkitt lymphoma differentiation by modulating c-Myb in vitro [77]. The expressions proportion of miRNA-21 and miRNA-210 in plasma of previously untreated lymphoma patient group were higher than those of the patients treated for 6 or more courses [78]. In addition, it was found that the plasma miRNA-92a level could be a useful biomarker for NHL diagnosis and patients monitoring after chemotherapy [79]. We implemented DLRMC to predict the related miRNAs for lymphoma based on known associations in the HMDDv2.0 database. As a result, among the top 20 and 50 potential lymphoma-related miRNAs, 19 and 46 were confirmed by miR2Disease and dbDEMC databases, respectively. The prediction results are shown in Table 4.
Table 4.
The top 50 miRNAs associated with Lymphoma predicted by our DLRMC. In summary, 19 out of the top 20, and 46 out of the top 50 predicted Lymphoma related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-30b | dbdemc | hsa-mir-199b | dbdemc |
| hsa-mir-148a | dbdemc | hsa-mir-26b | dbdemc |
| hsa-mir-373 | dbdemc | hsa-mir-7i | dbdemc |
| hsa-mir-196a | dbdemc | hsa-mir-9 | dbdemc |
| hsa-mir-23a | dbdemc | hsa-let-7b | dbdemc |
| hsa-mir-206 | dbdemc | hsa-mir-96 | dbdemc |
| hsa-mir-195 | dbdemc | hsa-let-7d | dbdemc |
| hsa-mir-372 | unconfirmed | hsa-mir-93 | dbdemc |
| hsa-mir-199a | dbdemc | hsa-mir-25 | dbdemc |
| hsa-mir-15b | dbdemc | hsa-mir-371a | unconfirmed |
| hsa-mir-137b | dbdemc;miR2Disease | hsa-let-7e | dbdemc;miR2Disease |
| hsa-mir-34b | dbdemc | hsa-mir-7 | dbdemc |
| hsa-mir-183 | dbdemc | hsa-mir-223 | dbdemc |
| hsa-mir-132 | dbdemc | hsa-mir-106a | dbdemc;miR2Disease |
| hsa-mir-214 | dbdemc | hsa-mir-205 | dbdemc |
| hsa-mir-182 | dbdemc | hsa-mir-222 | dbdemc |
| hsa-mir-31 | dbdemc | hsa-mir-335 | dbdemc |
| hsa-mir-133a | dbdemc | hsa-mir-27a | dbdemc |
| hsa-mir-212 | dbdemc | hsa-mir-181c | dbdemc |
| hsa-mir-141 | dbdemc | hsa-mir-224 | dbdemc |
| hsa-mir-127 | dbdemc;miR2Disease | hsa-mir-27b | dbdemc |
| hsa-mir-192 | dbdemc | hsa-mir-30a | dbdemc |
| hsa-mir-100 | dbdemc | hsa-mir-370 | unconfirmed |
| hsa-mir-451a | unconfirmed | hsa-mir-1 | dbdemc |
| hsa-mir-106b | dbdemc | hsa-let-7g | dbdemc |
Prostate neoplasms is a malignant tumor which originates in the epithelial cells of prostate [80], it is the second most common type of cancer and the fifth leading cause of cancer-related death in men according to the statistics of the World Health Organization [81]. Up to now, plenty of evidences had confirmed that lots of miRNAs were associated with prostate neoplasms. For instance, the downregulation of miRNA-145 was observed in prostate cancer [82]. It was found that miRNA-183 might be a novel prostate cancer biomarker and inhibiting the expression of miRNA-183 could be beneficial for prostate cancer treatment ([83]). DLRMC predicted the top 20 and top 50 potential miRNAs which might be associated with prostate neoplasms. As a result, 19 of the top 20 and 47 of the top 50 predicted miRNAs were confirmed in the dbDEMC and miR2Disease database, the prediction results are shown in Table 5.
Table 5.
The top 50 miRNAs associated with Prostate neoplasms predicted by our DLRMC. In summary, 19 out of the top 20, and 47 out of the top 50 predicted Prostate neoplasms related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-146a | miR2Disease | hsa-mir-150 | dbdemc |
| hsa-mir-122 | unconfirmed | hsa-mir-126 | dbdemc;miR2Disease |
| hsa-mir-155 | dbdemc | hsa-mir-195 | dbdemc;miR2Disease |
| hsa-mir-21 | dbdemc;miR2Disease | hsa-mir-29c | dbdemc |
| hsa-mir-34a | dbdemc;miR2Disease | hsa-mir-223 | dbdemc;miR2Disease |
| hsa-mir-16 | dbdemc;miR2Disease | hsa-mir-143 | dbdemc;miR2Disease |
| hsa-mir-221 | dbdemc;miR2Disease | hsa-mir-23a | dbdemc;miR2Disease |
| hsa-mir-29a | dbdemc | hsa-let-7a | dbdemc;miR2Disease |
| hsa-mir-133a | dbdemc | hsa-mir-200b | unconfirmed |
| hsa-mir-29b | dbdemc;miR2Disease | hsa-mir-214 | dbdemc;miR2Disease |
| hsa-mir-15a | dbdemc;miR2Disease | hsa-mir-148a | miR2Disease |
| hsa-mir-26a | dbdemc;miR2Disease | hsa-mir-106b | dbdemc |
| hsa-mir-222 | dbdemc;miR2Disease | hsa-mir-34c | dbdemc |
| hsa-mir-199a | dbdemc;miR2Disease | hsa-mir-30c | dbdemc;miR2Disease |
| hsa-mir-1 | dbdemc | hsa-mir-31 | dbdemc;miR2Disease |
| hsa-mir-20a | miR2Disease | hsa-mir-141 | miR2Disease |
| hsa-mir-17 | miR2Disease | hsa-mir-182 | dbdemc;miR2Disease |
| hsa-mir-15b | dbdemc | hsa-mir-200a | dbdemc |
| hsa-mir-19a | dbdemc | hsa-mir-101 | dbdemc;miR2Disease |
| hsa-mir-19b | dbdemc;miR2Disease | hsa-let-7c | dbdemc;miR2Disease |
| hsa-mir-206 | dbdemc | hsa-mir-192 | dbdemc |
| hsa-mir-30b | dbdemc;miR2Disease | hsa-mir-181a | dbdemc;miR2Disease |
| hsa-mir-181b | dbdemc;miR2Disease | hsa-mir-9 | dbdemc |
| hsa-mir-196a | dbdemc | hsa-mir-34b | dbdemc |
| hsa-mir-92a | unconfirmed | hsa-mir-133b | dbdemc |
2.4. Advantages of DLRMC
The experimental results of above sections demonstrate that DLRMC can obtain higher AUC value than other previous methods, which demonstrates that DLRMC works more effective in predicting potential miRNA-disease associations. The advantages of DLRMC lie in following aspects: Firstly, the low-rank property of miRNA-disease association matrix is exploited to regularize the matrix completing process. Secondly, DLRMC fills the candidate samples without known associations with 0 and then iteratively updates them with the predictive scores. Finally, the known miRNA-disease associations, miRNA functional similarity and the disease semantic similarity are integrated into a unified model to serve the completion of miRNA-disease association matrix.
3. Materials and methods
3.1. Methods overview
In order to predict potential undiscovered miRNA-disease associations, we propose a novel method named DLRMC, which consists of three main steps. First, the miRNA functional similarity and the disease semantic similarity are calculated based on the collected data sources. Considering that the two kinds of similarity priors benefit a lot of previous miRNA-disease associations prediction methods, we integrate the two kinds of similarities into a matrix competition model for miRNA-disease associations prediction. The entry values of the completed association matrix are used to infer the potential miRNA-disease associations. Fig. 2 gives a brief flowchart of our proposed method.
Figure 2.

Flowchart of our DLRMC model for predicting the potential miRNA-disease associations.
3.2. Human miRNA-disease associations
In this work, we use the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database [84], which consists of 5430 experimentally confirmed human miRNA-diseases associations. There are 495 miRNAs and 383 human diseases. An adjacency matrix with binary values is defined to clearly describe the known miRNAs-disease associations, i.e., if a specific miRNA is confirmed to be associated with a disease , the entity is assigned 1, otherwise 0. Thus, the adjacency matrix is with size .
3.3. Similarity measures
3.3.1. miRNA functional similarity
MiRNA functional similarity has been previously calculated by [32], based on the well-known assumption that miRNAs with similar functions are often related to similar diseases. In this work, based on the mentioned miRNA functional similarity, we construct a corresponding miRNA functional similarity matrix , in which denotes the functional similarity score between miRNA and .
3.3.2. Disease semantic similarity
It is well known that the relationship among different diseases can be represented by the hierarchical directed acyclic graph (DAG). In detail, a disease can be described as , where consists of node itself and all its ancestor nodes, and is the edge set including the direct edges linking parent nodes to child nodes [32,85–88]. Then, the semantic value of disease can be defined as follows:
| (1) |
where
In Equation (2), represents the semantic contribution factor of disease to disease . For a given disease , the contribution of itself is 1, while the contribution of another disease decreases as the distance between and increases. Thus, we denote as the disease semantic similarity matrix, and the semantic similarity between disease and can be calculated as follows:
| (3) |
4. Dual Laplacian regularized matrix completion for miRNA-disease associations prediction
4.1. Standard matrix completion
Matrix completion is the task of filling in the missing entries of a partially observed matrix . One of the mostly used model of the matrix completion problem is to find the lowest rank matrix which matches the matrix , which we wish to recover, for all entries in the set of observed entries. The basic mathematical formulation of this problem is as follows:
Due to the optimization problem (4) is non-convex and no efficient solution can be obtained, Equation (4) is usually transformed to the following convex problem by relaxing the rank function into the nuclear norm:
where is the nuclear norm, which is equal to the sum of singular values of . Equation (5) can be solved by using the singular value thresholding (SVT) algorithm.
4.2. DLRMC
Equation (5) was directly used for miRNAs-disease associations prediction [44] and obtained good results when compared to other previous methods. However, the miRNA functional similarity and disease semantic similarity which have been demonstrated useful in other methods (as shown in Table 1) have not been fully exploited to serve the matrix completion model. Thus, we believe that the miRNA functional similarity and disease semantic similarity can advantage the matrix competition model, of course, better prediction results can be expected. In this work, we propose a novel dual Laplacian regularized matrix completion model (DLRMC) for microRNA-disease associations prediction. In DLRMC, the known microRNA-disease associations, miRNA functional similarity and disease semantic similarity are integrated into a unified model to constrain that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. The optimization problem of DLRMC can be formulated as follows:
where and represent the -th row and -th column of , respectively. , , and are the regularization coefficients, and ‘’ denotes the Hadamard product of two matrices. The Tikhonov regularization on is used to ensure the smoothness of . The third term aims to ensure that the scores of known associations in are close to those in , i.e., the experimental confirmed miRNAs-disease associations can be preserved after the matrix competition. Since A is with values, we use itself as the indicator matrix to indicate the indices of the observed miRNA-disease associations. The forth and fifth terms constitute the dual Laplacian regularizer which constrain that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. In Equation (6), since both and regularize the Frobenius Norm of matrices, we set for simplicity. Then we have three parameters in DLRMC need to be tuned, i.e., , and . In our experiments, all parameter combinations are considered based on grid search. The optimal and are determined from , while is chosen from . Finally, the best results corresponding to the optimal parameters combination are reported. In the discussion section, we will analyze the impact of each parameter on the final results.
Note that a similar dual Laplacian regularization term has been used in [49] for identifying microRNA-disease associations (GRNMF). However, our proposed DLRMC differs from GRNMF in two aspects. Firstly, GRNMF is based on the Non-negative matrix factorization model, which decomposes the miRNA-disease association matrix into a basis matrix and a coefficient matrix, then the two learned matrices are used to recovery the potential miRNA-disease associations, while our DLRMC predicts miRNA-disease associations in a more intuitive way by using the matrix completion model which directly fills the missing values which represent potential miRNA-disease associations. Secondly, the purpose of the dual Laplacian regularization term of the two models is different. In GRNMF, the disease similarity matrix and miRNA functional similarity matrix are used to regularize the geometrical structure of the basis matrix and coefficient matrix, respectively. In our DLRMC, the dual Laplacian regularization term are used to regularize the miRNA-disease association matrix, i.e., miRNAs with similar functions should be more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity should be related with similar miRNAs, the physical meaning of our regularization term is more intuitive.
4.3. Optimization
To solve the optimization problem in Equation (6), we first transform it into the following form:
| (7) |
where is the Laplacian matrix with , and is the diagonal matrix with . Similarly, is the Laplacian matrix with , and is the diagonal matrix with .
Since problem (7) contains Hadamard product of two matrices, it is hard to tackle it directly. Thus, we propose an alternative iterative algorithm to solve this problem based on augmented Lagrange multiplier (ALM) algorithm [89]. We first introduce two auxiliary variables and to make the objective function separable:
The corresponding augmented Lagrange function of Equation (8) is:
| (9) |
where and are the Lagrange multipliers, and control the penalties for violating the linear constraints, and is the standard inner product of two matrices. Then the variables can be solved alternatively.
Solving with other variables fixed:
The variable can be solved by following equation with other variables fixed:
| (10) |
where can be solved by singular value thresholding (SVT) operator [90].
4.4. Solving with other variables fixed
When other variables are fixed, can be solved by minimizing following function:
| (11) |
Setting the derivative of Equation (11) w.r.t. to zero, and using properties of the Hadamard and Kronecker products, it is easy to get that can be obtained as:
| (12) |
where , and . This is a simple linear system and can be solved easily.
4.5. Solving with other variables fixed
We can solve by dropping other unrelated variables as follows:
| (13) |
By setting the derivative of Equation (13) w.r.t. to zero, we have
| (14) |
Equation (14) is a Sylvester equation [91]. Since is strictly positive definite, Equation (14) has stable solution for .
Updating Multipliers:
We update the multipliers by
| (15) |
The variables , and are iteratively updated until convergence. Finally, we obtain the predicted miRNA-disease associations based on the completed entities in matrix .
5. Discussion
5.1. Stability analysis
In our experiments, we use the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database [84] for testing, which consists of 5430 experimentally confirmed human miRNA-diseases associations. In order to evaluate the stability of DLRMC to the miRNA-disease database, we change the ratio of training samples and compare the final results. The ratio of training samples varies from . For each ratio, the selected associations are used as training samples and the rest are used for testing. For each case, we perform the training samples selecting and associations predicting five times with randomly selected training samples. Finally, the average AUC of the five times are reported for each case. In Fig. 3, we plot the average AUC values with varying ratio of selected training samples. As can be seen, when more than half of the experimentally confirmed human miRNA-diseases associations are selected as training samples, DLRMC can obtain stable results. At present, three kinds of information including the known miRNA-disease associations, miRNA functional similarity and the disease semantic similarity are used for potential associations pridiction, we believe that if more information can be obtained, DLRMC can infer more potential miRNA-disease associations.
Figure 3.

Average AUC values with varying ratio of selected training samples.
5.2. Parameter sensitivity analysis
There are four parameters in our model (i.e., , , and ). Since we set , we need to tune three parameters. In order to further demonstrate the performance of the proposed DLRMC, we study its sensitivity w.r.t. the parameters , and . Firstly, we fix and vary and . Then we fix and vary and . Finally, we fix and vary and . Figs. 4 and 5 plot the AUC values with varying parameters under global LOOCV and local LOOCV, respectively. As can be seen, the final results are not very sensitive to the parameters. In a certain range, DLRMC can obtain stable results. Through a closer observation, we can find that when is fixed, the best result can be obtained with and . When or is fixed, optimal results can be obtained with .
Figure 4.

AUC values with varying parameters under global LOOCV.
Figure 5.

AUC values with varying parameters under local LOOCV.
6. Conclusion
It is necessary to develop efficient computational models for predicting the potential miRNA-disease associations since it is expensive and time-consuming to accomplish this task using experimental methods. In this study, we have proposed a computational method, called DLRMC, for miRNA-disease associations prediction. The main contribution of our work is the development a matrix completion model with a dual Laplacian regularizer which constrains that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. Meanwhile, the experimental confirmed miRNAs-disease associations can be preserved by using a binary indicator matrix after the completion.
The efficacy of our method is validated through cross validations and case studies on the HMDDv2.0 database. The experimental results verify that DLRMC can effectively improve performance compared with other methods, which indicates that DLRMC can be used for novel miRNAs-disease associations prediction.
Our proposed DLRMC can be regarded as an effective complement to experimental methods for discovering miRNA-disease associations. Biomedical researchers can use our model to computationally infer the miRNAs which are potentially associated with the disease of interest, then the most promising associations can be finally chosen for biological confirmation. By this way, experiments could be more effective and productive. In addition, since the identification of disease-related miRNAs is critical to elucidating the development of tumorigenesis, predicted potential associations can further facilitate the discovery of potential drug targets and treatment options.
Funding Statement
This work was partly supported by the National Natural Science Foundation of China under Grant No. 61701451, and partly supported by the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) under Grant No. CUG170654, and partly supported by Jiangsu Provincial Commission of Health and Family Planning under Grant No. H201556, and partly supported by the Six Top Talent Peak Projects in Jiangsu under Grant No. WSN-019, and partly supported by the 333 Talent Project of Jiangsu Province under Grant No. BRA2017246.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
References
- [1].Llave C, Carrington JC.. Cleavage of scarecrow-like mrna targets directed by a class of arabidopsis mirna. Science. 2002;297(5589):2053. [DOI] [PubMed] [Google Scholar]
- [2].Eulalio A, Huntzinger E, Izaurralde E. Getting to the root of mirna-mediated gene silencing. Cell. 2008;132(1):9. [DOI] [PubMed] [Google Scholar]
- [3].Ambros V. The functions of animal micrornas. Nature. 2004;431(7006):350. [DOI] [PubMed] [Google Scholar]
- [4].Tang C, Cao L, Zheng X, et al. Gene selection for microarray data classification via subspace learning and manifold regularization. Med Biol Eng Comput. 2018;56(7):1271–1284. [DOI] [PubMed] [Google Scholar]
- [5].Tang C, Liu X, Li M, et al. Robust unsupervised feature selection via dual self-representation and manifold regularization. Knowledge-Based Syst. 2018;145:109–120. [Google Scholar]
- [6].Jopling CL, Yi M, Lancaster AM, et al. Modulation of hepatitis c virus rna abundance by a liver-specific microrna. Science. 2005;309(5740):1577–1581. [DOI] [PubMed] [Google Scholar]
- [7].Vasudevan S, Tong Y, Steitz JA. Switching from repression to activation: micrornas can up-regulate translation. Science. 2007;318(5858):1931–1934. [DOI] [PubMed] [Google Scholar]
- [8].Cui Q, Yu Z, Purisima EO, et al. Principles of microrna regulation of a human cellular signaling network. Mol Syst Biol. 2006;2(1):46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Karp X, Ambros V. Encountering micrornas in cell fate signaling. Science. 2005;310(5752):1288–1289. [DOI] [PubMed] [Google Scholar]
- [10].Alshalalfa M, Alhajj R. Using context-specific effect of mirnas to identify functional associations between mirnas and gene signatures. Bmc Bioinformatics. 2013;14(S12):S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Xu P, Guo M, Hay BA. Micrornas and the regulation of cell death. Trends Genet. 2004;20(12):617. [DOI] [PubMed] [Google Scholar]
- [12].Miska EA. How micrornas control cell division, differentiation and death. Curr Opin Genet Dev. 2005;15(5):563–568. [DOI] [PubMed] [Google Scholar]
- [13].Witwer KW. Xenomirs and mirna homeostasis in health and disease: evidence that diet and dietary mirnas directly and indirectly influence circulating mirna profiles. RNA Biol. 2012;9(9):1147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Kumarswamy R, Volkmann I, Thum T. Regulation and function of mirna-21 in health and disease. RNA Biol. 2011;8(5):706–713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Ii VB, Li Y, Philip PA, et al. Microrna and cancer: tiny molecules with major implications. Curr Genomics. 2008;9(2):97–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Winter J, Diederichs S. Microrna biogenesis and cancer. Methods Mol Biol. 2005;65(9):3509. [DOI] [PubMed] [Google Scholar]
- [17].Desano JT, Liang X. Microrna regulation of cancer stem cells and therapeutic implications. AAPS J. 2009;11(4):682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Liu L, Jian W, Khanabdali R, et al. Circular rnas: isolation, characterization and their potential role in diseases. RNA Biol. 2017;14(12):1715–1721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Tarang S, Weston MD. Macros in microrna target identification: A comparative analysis of in silico, in vitro, and in vivo approaches to microrna target identification. RNA Biol. 2014;11(4):324–333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Calin GA, Dumitru CD, Shimizu M, et al. Frequent deletions and down-regulation of micro- rna genes mir15 and mir16 at 13q14 in chronic lymphocytic leukemia. Proc Natl Acad Sci U S A. 2002;99(24):15524–15529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Mudduluru G, Ceppi P, Kumarswamy R, et al. Regulation of axl receptor tyrosine kinase expression by mir-34a and mir-199a/b in solid cancer. Oncogene. 2011;30(25):2888. [DOI] [PubMed] [Google Scholar]
- [22].Paraskevi A, Theodoropoulos G, Papaconstantinou I, et al. Circulating microrna in inflammatory bowel disease. J Crohns Colitis. 2012;6(9):900–904. [DOI] [PubMed] [Google Scholar]
- [23].Wang H, Peng W, Xin O, et al. Reduced circulating mir-15b is correlated with phosphate metabolism in patients with end-stage renal disease on maintenance hemodialysis. Ren Fail. 2012;34(6):685–690. [DOI] [PubMed] [Google Scholar]
- [24].Calin GA, Croce CM. Microrna signatures in human cancers. Nat Rev Cancer. 2006;6(11):857. [DOI] [PubMed] [Google Scholar]
- [25].Keller A, Leidinger P, Gislefoss R, et al. Stable serum mirna profiles as potential tool for non-invasive lung cancer diagnosis. RNA Biol. 1900;8(3):506–516. [DOI] [PubMed] [Google Scholar]
- [26].Siebert M, Westbroek W, Chen YC, et al. Identification of mirnas that modulate glucocerebrosidase activity in gaucher disease cells. RNA Biol. 2014;11(10):1291–1300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Weiland M, Gao XH, Zhou L, et al. Small rnas have a large impact: circulating micrornas as biomarkers for human diseases. RNA Biol. 2012;9(6):850–859. [DOI] [PubMed] [Google Scholar]
- [28].Chen X, Yan CC, Zhang X, et al. Wbsmda: within and between score for mirna-disease association prediction. Sci Rep. 2016;6:21106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Chen X, Yan GY. Semi-supervised learning for potential human microrna-disease associations inference. Sci Rep. 2014;4:5501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Esquelakerscher A, Slack FJ. Oncomirs - micrornas with a role in cancer. Nat Rev Cancer. 2006;6(4):259. [DOI] [PubMed] [Google Scholar]
- [31].Bandyopadhyay S, Mitra R, Maulik U, et al. Development of the human cancer microrna network. Silence. 2010;1(1):6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Wang D, Wang J, Lu M, et al. Inferring the human microrna functional similarity and functional network based on microrna-associated diseases. Bioinformatics. 2010;26(13):1644–1650. [DOI] [PubMed] [Google Scholar]
- [33].Le TD, Zhang J, Liu L, et al. Computational methods for identifying mirna sponge interactions. Brief Bioinform. 2016;18(4):577. [DOI] [PubMed] [Google Scholar]
- [34].Chen X, Wu QF, Yan GY. Rknnmda: ranking-based knn for mirna-disease association prediction. RNA Biol. 2017;14(7):952–962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Chen X, Zhou Z, Ao YZ. Ellpmda: ensemble learning and link prediction for mirna-disease association prediction. RNA Biol. 2018;15(6):807–818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Tang C, Chen J, Liu X, et al. Consensus learning guided multi-view unsupervised feature selection. Knowledge-Based Syst. 2018;160:49–60. [Google Scholar]
- [37].Wang G, Juan L, Teng M, et al. Prioritization of disease micrornas through a human phenome-micrornaome network. BMC Syst Biol. 2010;4(S1):S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Shi H, Xu J, Zhang G, et al. Walking the interactome to identify human mirna-disease associations through the functional link between mirna targets and disease genes. BMC Syst Biol. 2013;7(1):101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Xuan P, Han K, Guo M, et al. Prediction of micrornas associated with human diseases based on weighted k most similar neighbors. PloS one. 2013;8(8):e70204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Xu C, Ping Y, Li X, et al. Prioritizing candidate disease mirnas by integrating phenotype associations of multiple diseases with matched mirna and mrna expression profiles. Mol Biosyst. 2014;10(11):2800–2809. [DOI] [PubMed] [Google Scholar]
- [41].Mork S, Pletscher-Frankild S, Palleja CA, et al. Protein-driven inference of mirna-disease associations. Bioinformatics. 2014;30(3):392–397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Luo J, Ding P, Liang C, et al. Collective prediction of disease-associated mirnas based on transduction learning. IEEE/ACM Trans Comput Biol Bioinform. 2017;14(6):1468–1475. [DOI] [PubMed] [Google Scholar]
- [43].Chen X, Yan CC, Zhang X, et al. Rbmmmda: predicting multiple types of disease-microrna associations. Sci Rep. 2015;5:13877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Li JQ, Rong ZH, Chen X, et al. Mcmda: matrix completion for mirna-disease association prediction. Oncotarget. 2017;8(13):21187–21199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [45].Mugunga I, Ying J, Liu X, et al. Computational prediction of human disease-related micrornas by path-based random walk. Oncotarget. 2017;8(35):58526–58535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Li ZS, Liu B, Yan C. Cfmda: collaborative filtering-based mirna-disease association prediction. Multimedia Tools Appl. 2017;78(1):1–14. [Google Scholar]
- [47].Chen H, Zhang Z. Similarity-based methods for potential human microrna-disease association prediction. BMC Med Genomics. 2013;6(1):12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Luo J, Xiao Q. A novel approach for predicting microrna-disease associations by unbalanced bi-random walk on heterogeneous network. J Biomed Inform. 2017;66:194–203. [DOI] [PubMed] [Google Scholar]
- [49].Xiao Q, Luo J, Liang C, et al. A graph regularized non-negative matrix factorization method for identifying microrna-disease associations. Bioinformatics. 2017;34(2):239–248. [DOI] [PubMed] [Google Scholar]
- [50].You ZH, Huang ZA, Zhu Z, et al. Pbmda: A novel and effective path-based computational model for mirna-disease association prediction. PLoS Comput Biol. 2017;13(3):e1005455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [51].Chen X, Huang L. Lrsslmda: laplacian regularized sparse subspace learning for mirna-disease association prediction. PLoS Comput Biol. 2017;13(12):e1005912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [52].Chen X, Wang L, Qu J, et al. Predicting mirnadisease association based on inductive matrix completion. Bioinformatics. 2018. [DOI] [PubMed] [Google Scholar]
- [53].Chen X, Huang L, Xie D, et al. Egbmmda: extreme gradient boosting machine for mirna-disease association prediction. Cell Death Dis. 2018;9(1):3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Chen X, Xie D, Wang L, et al. Bnpmda: bipartite network projection for mirna-disease association prediction. Bioinformatics. 2018;34(18):3178–3186. [DOI] [PubMed] [Google Scholar]
- [55].Chen X, Qu J, Yin J. Tlhnmda: triple layer heterogeneous network based inference for mirna-disease association prediction. Front Genet. 2018;9:234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [56].Chen X, Wang CC, Yin J, et al. Novel human mirna-disease association inference based on random forest. Mol Ther Nucleic Acids. 2018;13:568–579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [57].Chen X, Xie D, Zhao Q, et al. Micrornas and complex diseases: from experimental results to computational models. Brief Bioinform. 2017. [DOI] [PubMed] [Google Scholar]
- [58].Chen X, Yan CC, Zhang X, et al. Hgimda: heterogeneous graph inference for mirna-disease association prediction. Oncotarget. 2016;7(40):65257–65269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Chen X, Yin J, Qu J, et al. Mdhgi: matrix decomposition and heterogeneous graph inference for mirna-disease association prediction. PLoS Comput Biol. 2018;14(8):e1006418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Chen X, Cheng JY, Yin J. Predicting microrna-disease associations using bipartite local models and hubness-aware regression. RNA Biol. 2018;15(9):1192–1205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [61].Zhao Y, Chen X, Yin J. A novel computational method for the identification of potential mirna-disease association based on symmetric non-negative matrix factorization and kronecker regularized least square. Front Genet. 2018;9:324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [62].Jiang Q, Hao Y, Wang G, et al. Prioritization of disease micrornas through a human phenome-micrornaome network. BMC Syst Biol. 2010;4(1):S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [63].Oved K, Morag A, Pasmanik-Chor M, et al. Genome-wide mirna expression profiling of human lymphoblastoid cell lines identifies tentative ssri antidepressant response biomarkers. Pharmacogenomics. 2012;13(10):1129–1139. [DOI] [PubMed] [Google Scholar]
- [64].Yang Z, Ren F, Liu C, et al. dbdemc: a database of differentially expressed mirnas in human cancers. BMC Econ. 2010;11(Suppl 4):1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [65].Jiang Q, Wang Y, Hao Y, et al. mir2disease: a manually curated database for microrna deregulation in human disease. Nucleic Acids Res. 2008;37:D98–D104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [66].Phipps AI, Lindor NM, Jenkins MA, et al. Colon and rectal cancer survival by tumor location and microsatellite instability: the colon cancer family registry. Dis Colon Rectum. 2013;56(8):937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [67].Jemal A, Bray F, Center MM, et al. Global cancer statistics. CA Cancer J Clin. 2011;61(2):69–90. [DOI] [PubMed] [Google Scholar]
- [68].Ogata-Kawata H, Izumiya M, Kurioka D, et al. Circulating exosomal micrornas as biomarkers of colon cancer. PloS one. 2014;9(4):e92921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [69].Espinosa CES, Slack FJ. The role of micrornas in cancer. Yale J Biol Med. 2006;79(3–4):131–140. [PMC free article] [PubMed] [Google Scholar]
- [70].Cahill S, Smyth P, Denning K, et al. Effect of braf v600e mutation on transcription and post-transcriptional regulation in a papillary thyroid carcinoma model. Mol Cancer. 2007;6(1):21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [71].Chong V, Abdullah M, Telisinghe P, et al. Colorectal cancer: incidence and trend in brunei darussalam. Singapore Med J. 2009;50(11):1085. [PubMed] [Google Scholar]
- [72].Manojlović D, Elaković D, Lj M. Therapeutic value of transcatheter embolization in malignant tumors of the renal parenchyma. Srp Arh Celok Lek. 1986;114(7):631–637. [PubMed] [Google Scholar]
- [73].Duque JL, Loughlin KR, O’Leary MP, et al. Partial nephrectomy: alternative treatment for selected patients with renal cell carcinoma. Urology. 1998;52(4):584–590. [DOI] [PubMed] [Google Scholar]
- [74].Senanayake U, Das S, Vesely P, et al. mir-192, mir-194, mir-215, mir-200c and mir-141 are downregulated and their common target acvr2b is strongly expressed in renal childhood neoplasms. Carcinogenesis. 2012;33(5):1014–1021. [DOI] [PubMed] [Google Scholar]
- [75].Liu W, Zabirnyk OH, Shiao YH, et al. mir-23b targets proline oxidase, a novel tumor suppressor protein in renal cancer. Oncogene. 2010;29(35):4914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [76].Maryam D, Sara S, Malin W, et al. In vitroandin vivoactivity of melflufen (j1)in lymphoma. BMC Cancer. 2016;16(1):1–9. [Google Scholar]
- [77].Chen S, Wang Z, Dai X, et al. Re-expression of microrna-150 induces ebv-positive burkitt lymphoma differentiation by modulating c-myb in vitro. Cancer Sci. 2013;104(7):826–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [78].Ge TT, Liang Y, Fu R, et al. Expressions of mir-21,mir-155 and mir-210 in plasma of patients withlymphoma and its clinical significance. J Exp Hematol. 2012;20(20):305–309. [PubMed] [Google Scholar]
- [79].Ohyashiki K, Umezu T, Yoshizawa S, et al. Clinical impact of down-regulated plasma mir-92a levels in non-hodgkin’s lymphoma. Plos One. 2011;6(2):e16408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [80].Gmyrek GA, Walburg M, Webb CP, et al. Normal and malignant prostate epithelial cells differ in their response to hepatocyte growth factor/scatter factor. Am J Pathol. 2001;159(2):579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [81].Mcguire S. World cancer report 2014. geneva, switzerland: world health organization, international agency for research on cancer, who press, 2015. Adv Nutr. 2016;7(2):418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [82].Hart M, Wach S, Nolte E, et al. The proto-oncogene erg is a target of microrna mir-145 in prostate cancer. FEBS J. 2013;280(9):2105–2116. [DOI] [PubMed] [Google Scholar]
- [83].Ueno K, Hirata H, Shahryari V, et al. microrna-183 is an oncogene targeting dkk-3 and smad4 in prostate cancer. Br J Cancer. 2013;108(8):1659–1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [84].Li Y, Qiu C, Tu J, et al. Hmdd v2. 0: a database for experimentally supported human microrna and disease associations. Nucleic Acids Res. 2013;42(D1):D1070–D1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [85].Chen X, Yan CC, Luo C, et al. Constructing lncrna functional similarity network based on lncrna-disease associations and disease semantic similarity. Sci Rep. 2015;5:11338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [86].Chen X, Yan CC, Zhang X, et al. Long non-coding rnas and complex diseases: from experimental results to computational models. Brief Bioinform. 2016;18(4):558–576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [87].Chen X, Huang YA, Wang XS, et al. Fmlncsim: fuzzy measure-based lncrna functional similarity calculation model. Oncotarget. 2016;7(29):45948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [88].Huang YA, Chen X, You ZH, et al. Ilncsim: improved lncrna functional similarity calculation model. Oncotarget. 2016;7(18):25902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [89].Lin Z, Liu R, Su Z. Linearized alternating direction method with adaptive penalty for low-rank representation. In: Advances in Neural Information Processing Systems. Curran Associates, Inc.; 2011. p. 612–620.
- [90].Cai JF, Candès EJ, Shen Z. A singular value thresholding algorithm for matrix completion. SIAM J Optim. 2010;20(4):1956–1982. [Google Scholar]
- [91].Bartels RH, Stewart GW. Solution of the matrix equation ax + xb = c. Commun ACM. 1972;15(9):820–826.http://dl.acm.org/citation.cfm?doid=361573.361582 [Google Scholar]
