Skip to main content
RNA Biology logoLink to RNA Biology
. 2019 Feb 20;16(5):601–611. doi: 10.1080/15476286.2019.1570811

Dual Laplacian regularized matrix completion for microRNA-disease associations prediction

Chang Tang a,*, Hua Zhou b,*, Xiao Zheng c, Yanming Zhang b, Xiaofeng Sha d,
PMCID: PMC6546388  PMID: 30676207

ABSTRACT

Since lots of miRNA-disease associations have been verified, it is meaningful to discover more miRNA-disease associations for serving disease diagnosis and prevention of human complex diseases. However, it is not practical to identify potential associations using traditional biological experimental methods since the process is expensive and time consuming. Therefore, it is necessary to develop efficient computational methods to accomplish this task. In this work, we introduced a matrix completion model with dual Laplacian regularization (DLRMC) to infer unknown miRNA-disease associations in heterogeneous omics data. Specifically, DLRMC transformed the task of miRNA-disease association prediction into a matrix completion problem, in which the potential missing entries of the miRNA-disease association matrix were calculated, the missing association can be obtained based on the prediction scores after the completion procedure. Meanwhile, the miRNA functional similarity and the disease semantic similarity were fully exploited to serve the miRNA-disease association matrix completion by using a dual Laplacian regularization term. In the experiments, we conducted global and local Leave-One-Out Cross Validation (LOOCV) and case studies to evaluate the efficacy of DLRMC on the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database. As a result, the AUCs of DLRMC is 0.9174 and 0.8289 in global LOOCV and local LOOCV, respectively, which significantly outperform a variety of previous methods. In addition, in the case studies on four significant diseases related to human health including Colon Neoplasms, Kidney neoplasms, Lymphoma and Prostate neoplasms, 90%, 92%, 92% and 94% out of the top 50 predicted miRNAs has been confirmed, respectively.

KEYWORDS: miRNA-disease association prediction, matrix completion, Laplacian regularization, microRNA

1. Introduction

As a kind of short non-coding single-stranded RNA, MicroRNAs (miRNAs) (22nt) are found in plants, animals, and some viruses. By binding to the 3ʹ untranslated regions (UTRs) of the target messenger RNAs (mRNAs) through base pairing, miRNAs can suppress the protein production and gene expression [15]. On the other hand, some scientific researchers have also shown that miRNAs could also act as positive regulators [6,7]. Since the first two miRNAs, i.e., Caenorhabditis elegans lin-4 and let-7 were discovered more than twenty years ago, more and more miRNAs have been detected in recent years. Meanwhile, large amounts of evidences confirm that miRNAs play vital roles in a variety of crucial cell biological processes, such as proliferation, development, differentiation, apoptosis, metabolism, viral infection, aging, signal transduction and so on [814]. Therefore, it is no doubt that miRNAs are closely related to many complex human diseases and some miRNA-disease associations have already been verified. For example, the miRNA deregulation is closely related to the development of various cancers [1519]. As the first evidence for the fact that miRNAs are involved in cancer formation [20], firstly clarified that miR-15 and miR-16 are deleted in more than half cases of B-cell chronic lymphocytic leukemia. Also, miR-34a and miR-199a/b were found to be down-regulated in solid cancer cells [21]. Besides, researchers revealed that miR-126 were expressed with significantly higher levels in the blood from patients with Crohn Disease [22]. Furthermore, for patients with end-stage renal disease, the circulating levels of miR-15b were significantly suppressed [23]. Apart from the aforementioned miRNA-disease associations, there may exist other ones. Therefore, it is meaningful to discover more miRNA-disease associations for benefiting disease diagnosis and prevention of human complex diseases [2427]. However, it is expensive and time consuming to identify the associations between miRNAs and diseases using existing biological experimental methods. Hence, it is necessary for us to develop efficient computational methods to predict the potential miRNA-disease associations by using the recent superior computing resources [28,29].

During last decades, many computational methods for predicting potential miRNA-disease associations have been proposed. Some of these methods depend on the assumption that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes [3036]. For example, Wang et al. [37] predicted potential miRNA-disease associations by scoring each miRNA for a disease of interest through the cumulative hypergeometric distribution. Shi et al. [38] exploited the functional associations between miRNA and disease by implementing the algorithm of random walk on protein-protein interaction network. Based on the assumption that miRNAs whose target genes are related to certain diseases are more likely to be associated with these diseases, they discovered the miRNA-target interactions, disease-gene associations, and protein-protein interactions to predict potential associations between the miRNAs and diseases. Xuan et al. [39] developed the HDMP prediction method by calculating the functional similarities of weighted k neighbor miRNAs. Instead of relying on the known miRNA-disease associations, Xu et al. [40] introduced a miRNA prioritization method by evaluating the similarity between the targets of miRNAs and disease genes. By combining protein-disease associations and miRNA-protein interactions together to predict novel miRNA-disease associations, Mork et al. [41] proposed a computational model named miRPD. Luo et al. [42] proposed a transduction learning-based method to prioritize disease-related miRNAs, especially for those diseases that are associated with sparse known miRNAs. In [43], the restricted Boltzmann machine (RBM) which is a two-layer undirected graphical model consisting of layers of visible and hidden units was deployed to construct a computational model called RBMMMDA. Li et al. [44] developed a Matrix Completion for MiRNA-Disease Association prediction model based on the known miRNA-disease associations, during the matrix completion, the known associations are well retained, potential associations are predicted by the completed values. Mugunga et al. [45] presented a prediction model that is based on the theory of path features and random walk to obtain a relevancy score of miRNA-related disease. In their model, highly ranked scores are potential miRNA-disease associations. Li et al. [46] explored the way to predict the miRNA-disease association by using the extensive collaborative filtering in order to diagnose the diseases better, they considered the miRNA functional similarity and disease similarity while used minimal amount of related information. In [47], three inference methods were introduced to predict potential miRNA-disease associations based on the global network similarity measure and the assumption that functionally related miRNAs tend to be associated with phenotypically similar diseases. Chen et al. [28] proposed to prioritize disease miRNAs by using the the between-scores and within-scores of each miRNA-disease pair. By integrating heterogeneous omics data for identifying disease miRNAs, Luo et al. [48] developed a Kronecker regularized least squares-based method for miRNA-disease association prediction. By considering that most of previous approaches strongly rely on the known association information, and only a handful of them could be applied to uncover the potential associations involving novel diseases or miRNAs. Chen et al. [29] presented a model named RLSMDA based on semi-supervised learning which calculates the semantic similarity between different diseases. RLSMDA could identify related miRNAs for diseases without any known associated miRNAs, meanwhile avoiding the problem of using negative associations between miRNAs and diseases. Xiao et al. [49] also introduced a novel framework called GRNMF to infer the unknown miRNA-disease associations in heterogeneous omics data, which could work for both new diseases and miRNAs. You et al. [50] constructed a heterogeneous graph consisting of three interlinked sub-graphs and further adopted depth-first search algorithm to infer potential miRNA-disease associations. By constructing a similarity network and utilizing ensemble learning to combine rank results given by three classic similarity-based algorithms, Chen et al. [35] obtained superior prediction results. Based on the known miRNA-disease associations, disease semantic similarity, miRNA functional similarity, and Gaussian interaction profile kernel similarity for miRNAs and diseases, many computational models have been put forward, e.g., Sparse Subspace Learning [51], matrix completion-based model [52], Extreme Gradient Boosting Machine [53], Bipartite Network Projection [54], Triple Layer Heterogeneous Network [55] and Random Forest [56]. In Table 1, we give a brief summary of the differences between some typical previous computational methods. As can be seen, the three kinds of similarity priors have been widely used in previous methods. A detailed review of previous miRNA-disease associations prediction methods can be found in [33,57].

Table 1.

Differences between some typical previous computational methods for miRNA-disease association prediction. ‘A’ denotes depending on disease semantic similarity, ‘B’ denotes depending on miRNA functional similarity, and ‘C’ denotes depending on Gaussian interaction profile kernel similarity for miRNAs and diseases.

Year Methods A B C Characteristic
2010 Phenome-microRNAome network [37] × × Cumulative hypergeometric distribution, phenotypic similarity score
2013 bipartite network [38] × PPI network, random walk, global perspective measures
2013 Weighted kNN [39] × Weighted kNN, miRNA family and the cluster information
2013 MBSI [47] × × microRNA-based similarity inference, global network similarity measure)
2013 PBSI [47] × × phenotype-based similarity inference, global network similarity measure
2013 NetCBI [47] × × network-consistency-based inference, global network similarity measure
2014 miRNAs Prioritization [40] × × molecular mechanisms, context-dependent miRNA-target interactions
2014 RLSMDA [29] × Semi-supervised prediction method, global approach
2014 miRPD [41] × × × miRNAProteinDisease associations, text mining
2015 RBMMMDA [43] × × × Restricted Boltzmann machine, inferring multiple types of miRNA-disease pairs
2016 WBSMDA [28] Within and between score, integrating plenty of heterogeneous biological datasets
2016 HGIMDA [58] Heterogeneous Graph Inference
2017 BRWH [48] × bi-random walk, heterogeneous network, microbe similarity network, disease similarity network
2017 GRNMF [49] × graph regularized NMF
2017 PBMDA [50] special depth-first search algorithm, heterogeneous graph
2017 LRSSLMDA [51] Laplacian Regularized Sparse Subspace Learning, local structures of the training data
2017 CPTL [42] × Collective Prediction, Transduction Learning
2017 RKNNMDA [34] Ranking-based KNN
2017 MCMDA [44] × × × Matrix completion, adjacency matrix of known miRNA-disease associations
2018 IMCMDA [52] Inductive Matrix Completion
2018 EGBMMDA [53] × Extreme Gradient Boosting Machine, statistical measures, graph theoretical measures, matrix factorization results
2018 BNPMDA [54] Bipartite Network Projection, bias ratings, agglomerative hierarchical clustering
2018 TLHNMDA [55] × Triple Layer Heterogeneous Network based inference
2018 ELLPMDA [35] Ensemble Learning, Link Prediction, similarity network
2018 MDHGI [59] Matrix decomposition and Heterogeneous Graph Inference
2018 BLHARMDA [60] Bipartite local models, Jaccard similarity, hubness-aware regression, KNN classifier
2018 SNMFMDA [61] Symmetric NMF, Kronecker regularized least square
2018 RFMDA [56] Random Forest, robust feature selection

In this work, we introduce a novel dual laplacian regularized matrix completion framework, referred to as DLRMC briefly, to infer the unknown miRNA-disease associations in heterogeneous omics data, which can work well for both new diseases and miRNAs. DLRMC transforms the task of miRNA-disease association prediction into a matrix completion problem, in which the potential missing entries of the miRNA-disease association matrix were calculated, the missing association can be obtained based on the prediction scores after the completion procedure. In DLRMC, the miRNA functional similarity and the disease semantic similarity are fully exploited to serve the miRNA-disease association matrix completion by using a dual Laplacian regularization term, i.e., miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. In addition, during the matrix competition process, an indicator matrix with binary values which indicates the indices of the observed miRNA-disease associations are deployed to preserve the experimental confirmed associations. The experimental results indicate that DLRMC achieves superior performance compared with other methods and can be effectively applied in the discovery of missing or potential associations for novel diseases and miRNAs.

2. Results

2.1. Leave-one-out cross validation

To investigate the performance of DLRMC, we use global and local leave-one-out cross validation (LOOCV) based on the known miRNA-disease associations obtained from HMDDv2.0 database. As a validation method, LOOCV uses one known association as the test sample and the remaining known associations as the training samples. As to HMDDv2.0 database, we have already obtained 5430 known miRNA-disease associations, these associations can be regarded as the gold standard data set. The miRNA-diseases without known association evidences were considered as candidate samples. The scores of all miRNA-disease pairs can be obtained after DLRMC was implemented. As to global LOOCV, the score of the test sample was compared with the scores of all the candidate samples while in local LOOCV, the test sample was merely compared with the scores of the candidate samples which included the particular disease in the test sample. In this work, we perform 5-fold cross validation experiments for evaluation. The known miRNA-disease associations were randomly divided into five disjoint parts. For each time, one part was picked out as test samples and the rest four parts were used as training samples, and the miRNA-disease pairs without known association evidences were still regarded as candidate samples. Finally, the score of each test sample was compared with the scores of all the candidate samples, respectively. This procedure was repeated five times until each known association was used as test sample and its score was compared with the scores of the candidate samples. Those test samples whose scores rank exceeded the given threshold were considered to predict the miRNA-disease associations correctly.

2.2. Comparison with other methods

In order to demonstrate the efficacy of the proposed DLRMC, we compare its performance with the following methods: HDMP [39], RLSMDA [29], WBSMDA [28], MCMDA [44] and EGBMMDA [53]. We draw a receiver operating characteristics curve (ROC) to compare DLRMC with other previous state-of-the-art methods. In the ROC, true positive rate (TPR, sensitivity) against false positive rate (FPR, 1-specificity) at different thresholds are plotted [62]. TPR/sensitivity denotes the percentage of the test miRNA-disease associations which are ranked higher than the given threshold, while specificity represents the percentage of negative miRNA-disease pairs ranked below the threshold [63]. The area under the ROC curve (AUC) is calculated to evaluate the accuracy of miRNA-disease associations prediction methods. For a specific method, if AUC = 1, it means that the method owns a prefect performance. AUC of 0.5 means that the method merely has a random prediction performance. As to global LOOCV, the AUCs of HDMP, RLSMDA, WBSMDA, MCMDA, EGBMMDA and DLRMC are 0.8366, 0.8030, 0.8030, 0.8749, 0.9123 and 0.9174, respectively. For local LOOCV, HDMP, RLSMDA, WBSMDA, MCMDA, EGBMMDA and DLRMC reach AUCs of 0.7702, 0.0.6953, 0.0.8031, 0.7718, 0.8221 and 0.8289, respectively. Figs. 1a, b) show the ROCs of different methods in terms of global LOOCV and local LOOCV, respectively. As can be seen, DLRMC works more effective in predicting potential miRNA-disease associations when compared with the other previous methods.

Figure 1.

Figure 1.

Performance evaluation comparison between DLRMC and other methods in terms of ROC curve and AUC based on global LOOCV and local LOOCV tested by known miRNA-disease associations in the HMDD database. As can be seen, DLRMC achieves AUC of 0.9174 in global LOOCV and 0.8289 in local LOOCV. Thus, the performance of DLRMC is almost better than other methods in some degree and it proves to be effective in predicting the potential miRNA-disease associations.

2.3. Case studies

Similar to previous methods, we also implement case studies to further validate the efficacy of the proposed DLRMC. In detail, four significant diseases related to human health including Colon Neoplasms, Kidney neoplasms, Lymphoma and Prostate neoplasms are used to practically evaluate the prediction accuracy of DLRMC. The top 20 and top 50 predicted miRNAs related with these diseases are examined by another two independent miRNA-disease databases: dbDEMC [64] and miR2Disease [65].

Colon Neoplasms, is known as a malignant cancer which is the third most common type of cancer constituting about 10% of all cancer cases [66]. However, early patients of colon neoplasms only suffer from subtle symptoms, which makes it a fairly hard-to-detect cancer at an early stage [67,68]. Studies have confirmed that some miRNAs are related to Colon Neoplasms, for instance, miR-145 is constantly downregulated in colorectal tumors [69]. MiR-127 has been reported to play a role as a possible tumor suppressor gene [70]. In addition, there exists report indicating that the occurrence rate of colon neoplasms has an increasing trend these years [71]. As a result, it is important to predict the potential miRNAs related to colon neoplasms. In this work, we implement DLRMC to predict the top 50 miRNAs associated with colon neoplasms. Finally, 19 of the top 20 and 45 of the top 50 predicted miRNAs associated with colon neoplasms are validated by dbDEMC and miR2Disease database,the prediction results are shown in Table 2.

Table 2.

The top 50 miRNAs associated with colon neoplasms predicted by our DLRMC. In summary, 19 out of the top 20, and 45 out of the top 50 predicted Colon Neoplasms related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).

miRNA Evidence miRNA Evidence
hsa-mir-146a dbdemc hsa-mir-196a dbdemc;miR2Disease
hsa-mir-155 dbdemc;miR2Disease hsa-mir-29c dbdemc
hsa-mir-132 miR2Disease hsa-mir-223 dbdemc;miR2Disease
hsa-mir-21 dbdemc;miR2Disease hsa-mir-143 dbdemc;miR2Disease
hsa-mir-34a dbdemc;miR2Disease hsa-let-7a unconfirmed
hsa-mir-221 dbdemc;miR2Disease hsa-mir-195 dbdemc;miR2Disease
hsa-mir-20a dbdemc;miR2Disease hsa-mir-200b dbdemc
hsa-mir-16 dbdemc hsa-mir-214 dbdemc
hsa-let-7a dbdemc;miR2Disease hsa-mir-148a dbdemc
hsa-mir-125b dbdemc hsa-mir-106b dbdemc;miR2Disease
hsa-mir-29a dbdemc;miR2Disease hsa-mir-23a miR2Disease
hsa-mir-29b dbdemc;miR2Disease hsa-mir-142 unconfirmed
hsa-mir-15a dbdemc hsa-mir-31 dbdemc;miR2Disease
hsa-mir-133a dbdemc;miR2Disease hsa-mir-34c miR2Disease
hsa-mir-222 dbdemc hsa-mir-141 dbdemc;miR2Disease
hsa-mir-199a unconfirmed hsa-mir-182 dbdemc;miR2Disease
hsa-mir-26a dbdemc;miR2Disease hsa-mir-200a unconfirmed
hsa-mir-1 dbdemc;miR2Disease hsa-let-7c dbdemc
hsa-mir-19b dbdemc;miR2Disease hsa-mir-101 unconfirmed
hsa-mir-19a dbdemc;miR2Disease hsa-mir-192 dbdemc;miR2Disease
hsa-mir-15b miR2Disease hsa-mir-181a dbdemc;miR2Disease
hsa-mir-18a miR2Disease hsa-mir-9 dbdemc;miR2Disease
hsa-mir-92a dbdemc hsa-mir-133b dbdemc;miR2Disease
hsa-mir-30b dbdemc;miR2Disease hsa-mir-34b dbdemc;miR2Disease
hsa-mir-150 dbdemc;miR2Disease hsa-mir-183 dbdemc;miR2Disease

Kidney neoplasms is known as renal cancer, which starts in the cells of kidney, and it includes various types [72]. For kidney neoplasms patients, the most common symptoms are pains in the lumbar and hematuria [73]. Based on recent biological experiments, many existing kidney neoplasm-related miRNAs have been verified. For instance, the common target ACVR2B of five miRNAs (miRNA-192, miRNA-194, miRNA-215, miRNA-200c and miRNA-141) is strongly expressed in renal childhood neoplasms [74]. In addition, researches had reported that the decreasing miRNA-23b expression might inhibit kidney tumor growth [75]. As a result of the case study for Kidney neoplasms, 17 out of the top-20 predicted miRNAs and 46 out of the top-50 predicted miRNAs of kidney neoplasms were verified by miR2Disease database and dbDEMC database. The prediction results are shown in Table 3.

Table 3.

The top 50 miRNAs associated with kidney neoplasms predicted by our DLRMC. In summary, 17 out of the top 20, and 46 out of the top 50 predicted kidney neoplasms related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).

miRNA Evidence miRNA Evidence
hsa-mir-155 dbdemc hsa-mir-92a dbdemc
hsa-mir-146a dbdemc hsa-mir-195 dbdemc
hsa-mir-122 dbdemc;miR2Disease hsa-mir-126 dbdemc;miR2Disease
hsa-mir-34a dbdemc hsa-mir-29c dbdemc;miR2Disease
hsa-mir-221 unconfirmed hsa-mir-23a dbdemc
hsa-mir-16 dbdemc hsa-mir-143 dbdemc
hsa-mir-125b unconfirmed hsa-mir-223 dbdemc
hsa-mir-29a dbdemc;miR2Disease hsa-mir-214 dbdemc;miR2Disease
hsa-mir-133a unconfirmed hsa-let-7a dbdemc
hsa-mir-29b dbdemc;miR2Disease hsa-mir-148a dbdemc
hsa-mir-145 dbdemc hsa-mir-200b dbdemc;miR2Disease
hsa-mir-26a dbdemc;miR2Disease hsa-mir-31 dbdemc
hsa-mir-199a dbdemc;miR2Disease hsa-mir-210 dbdemc;miR2Disease
hsa-mir-222 dbdemc hsa-mir-106b dbdemc;miR2Disease
hsa-mir-1 dbdemc hsa-mir-34c dbdemc
hsa-mir-15b dbdemc hsa-mir-182 dbdemc;miR2Disease
hsa-mir-20a dbdemc;miR2Disease hsa-mir-200a dbdemc
hsa-mir-17 dbdemc;miR2Disease hsa-mir-101 dbdemc;miR2Disease
hsa-mir-30b dbdemc hsa-let-7c dbdemc
hsa-mir-206 dbdemc hsa-mir-181a dbdemc
hsa-mir-19a dbdemc hsa-mir-9 dbdemc
hsa-mir-196a dbdemc hsa-mir-34b dbdemc
hsa-mir-19b dbdemc;miR2Disease hsa-mir-183 dbdemc
hsa-mir-18a dbdemc hsa-mir-218 dbdemc
hsa-mir-150 dbdemc;miR2Disease hsa-let-7b unconfirmed

Lymphoma often refers to a group of cancerous blood cell tumors, it originates in the lymphatic hematopoietic system [76] which consists of two categories: non-Hodgkin lymphoma (NHL) and Hodgkin’s lymphoma (HL). Worldwide, lymphoma is the seventh most common cancer and makes up 3–4% of all cancers. As to children, lymphomas is also the third-most common cancer. Thanks to the development of high-throughput sequencing technologies, several miRNAs had been already reported to be associated with lymphomas. For example, re-expression of miRNA-150 induces EBV-positive Burkitt lymphoma differentiation by modulating c-Myb in vitro [77]. The expressions proportion of miRNA-21 and miRNA-210 in plasma of previously untreated lymphoma patient group were higher than those of the patients treated for 6 or more courses [78]. In addition, it was found that the plasma miRNA-92a level could be a useful biomarker for NHL diagnosis and patients monitoring after chemotherapy [79]. We implemented DLRMC to predict the related miRNAs for lymphoma based on known associations in the HMDDv2.0 database. As a result, among the top 20 and 50 potential lymphoma-related miRNAs, 19 and 46 were confirmed by miR2Disease and dbDEMC databases, respectively. The prediction results are shown in Table 4.

Table 4.

The top 50 miRNAs associated with Lymphoma predicted by our DLRMC. In summary, 19 out of the top 20, and 46 out of the top 50 predicted Lymphoma related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).

miRNA Evidence miRNA Evidence
hsa-mir-30b dbdemc hsa-mir-199b dbdemc
hsa-mir-148a dbdemc hsa-mir-26b dbdemc
hsa-mir-373 dbdemc hsa-mir-7i dbdemc
hsa-mir-196a dbdemc hsa-mir-9 dbdemc
hsa-mir-23a dbdemc hsa-let-7b dbdemc
hsa-mir-206 dbdemc hsa-mir-96 dbdemc
hsa-mir-195 dbdemc hsa-let-7d dbdemc
hsa-mir-372 unconfirmed hsa-mir-93 dbdemc
hsa-mir-199a dbdemc hsa-mir-25 dbdemc
hsa-mir-15b dbdemc hsa-mir-371a unconfirmed
hsa-mir-137b dbdemc;miR2Disease hsa-let-7e dbdemc;miR2Disease
hsa-mir-34b dbdemc hsa-mir-7 dbdemc
hsa-mir-183 dbdemc hsa-mir-223 dbdemc
hsa-mir-132 dbdemc hsa-mir-106a dbdemc;miR2Disease
hsa-mir-214 dbdemc hsa-mir-205 dbdemc
hsa-mir-182 dbdemc hsa-mir-222 dbdemc
hsa-mir-31 dbdemc hsa-mir-335 dbdemc
hsa-mir-133a dbdemc hsa-mir-27a dbdemc
hsa-mir-212 dbdemc hsa-mir-181c dbdemc
hsa-mir-141 dbdemc hsa-mir-224 dbdemc
hsa-mir-127 dbdemc;miR2Disease hsa-mir-27b dbdemc
hsa-mir-192 dbdemc hsa-mir-30a dbdemc
hsa-mir-100 dbdemc hsa-mir-370 unconfirmed
hsa-mir-451a unconfirmed hsa-mir-1 dbdemc
hsa-mir-106b dbdemc hsa-let-7g dbdemc

Prostate neoplasms is a malignant tumor which originates in the epithelial cells of prostate [80], it is the second most common type of cancer and the fifth leading cause of cancer-related death in men according to the statistics of the World Health Organization [81]. Up to now, plenty of evidences had confirmed that lots of miRNAs were associated with prostate neoplasms. For instance, the downregulation of miRNA-145 was observed in prostate cancer [82]. It was found that miRNA-183 might be a novel prostate cancer biomarker and inhibiting the expression of miRNA-183 could be beneficial for prostate cancer treatment ([83]). DLRMC predicted the top 20 and top 50 potential miRNAs which might be associated with prostate neoplasms. As a result, 19 of the top 20 and 47 of the top 50 predicted miRNAs were confirmed in the dbDEMC and miR2Disease database, the prediction results are shown in Table 5.

Table 5.

The top 50 miRNAs associated with Prostate neoplasms predicted by our DLRMC. In summary, 19 out of the top 20, and 47 out of the top 50 predicted Prostate neoplasms related miRNAs are confirmed based on dbDEMC and miR2Disease (1st column: top 1–25; 2nd column: top 26–50).

miRNA Evidence miRNA Evidence
hsa-mir-146a miR2Disease hsa-mir-150 dbdemc
hsa-mir-122 unconfirmed hsa-mir-126 dbdemc;miR2Disease
hsa-mir-155 dbdemc hsa-mir-195 dbdemc;miR2Disease
hsa-mir-21 dbdemc;miR2Disease hsa-mir-29c dbdemc
hsa-mir-34a dbdemc;miR2Disease hsa-mir-223 dbdemc;miR2Disease
hsa-mir-16 dbdemc;miR2Disease hsa-mir-143 dbdemc;miR2Disease
hsa-mir-221 dbdemc;miR2Disease hsa-mir-23a dbdemc;miR2Disease
hsa-mir-29a dbdemc hsa-let-7a dbdemc;miR2Disease
hsa-mir-133a dbdemc hsa-mir-200b unconfirmed
hsa-mir-29b dbdemc;miR2Disease hsa-mir-214 dbdemc;miR2Disease
hsa-mir-15a dbdemc;miR2Disease hsa-mir-148a miR2Disease
hsa-mir-26a dbdemc;miR2Disease hsa-mir-106b dbdemc
hsa-mir-222 dbdemc;miR2Disease hsa-mir-34c dbdemc
hsa-mir-199a dbdemc;miR2Disease hsa-mir-30c dbdemc;miR2Disease
hsa-mir-1 dbdemc hsa-mir-31 dbdemc;miR2Disease
hsa-mir-20a miR2Disease hsa-mir-141 miR2Disease
hsa-mir-17 miR2Disease hsa-mir-182 dbdemc;miR2Disease
hsa-mir-15b dbdemc hsa-mir-200a dbdemc
hsa-mir-19a dbdemc hsa-mir-101 dbdemc;miR2Disease
hsa-mir-19b dbdemc;miR2Disease hsa-let-7c dbdemc;miR2Disease
hsa-mir-206 dbdemc hsa-mir-192 dbdemc
hsa-mir-30b dbdemc;miR2Disease hsa-mir-181a dbdemc;miR2Disease
hsa-mir-181b dbdemc;miR2Disease hsa-mir-9 dbdemc
hsa-mir-196a dbdemc hsa-mir-34b dbdemc
hsa-mir-92a unconfirmed hsa-mir-133b dbdemc

2.4. Advantages of DLRMC

The experimental results of above sections demonstrate that DLRMC can obtain higher AUC value than other previous methods, which demonstrates that DLRMC works more effective in predicting potential miRNA-disease associations. The advantages of DLRMC lie in following aspects: Firstly, the low-rank property of miRNA-disease association matrix is exploited to regularize the matrix completing process. Secondly, DLRMC fills the candidate samples without known associations with 0 and then iteratively updates them with the predictive scores. Finally, the known miRNA-disease associations, miRNA functional similarity and the disease semantic similarity are integrated into a unified model to serve the completion of miRNA-disease association matrix.

3. Materials and methods

3.1. Methods overview

In order to predict potential undiscovered miRNA-disease associations, we propose a novel method named DLRMC, which consists of three main steps. First, the miRNA functional similarity and the disease semantic similarity are calculated based on the collected data sources. Considering that the two kinds of similarity priors benefit a lot of previous miRNA-disease associations prediction methods, we integrate the two kinds of similarities into a matrix competition model for miRNA-disease associations prediction. The entry values of the completed association matrix are used to infer the potential miRNA-disease associations. Fig. 2 gives a brief flowchart of our proposed method.

Figure 2.

Figure 2.

Flowchart of our DLRMC model for predicting the potential miRNA-disease associations.

3.2. Human miRNA-disease associations

In this work, we use the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database [84], which consists of 5430 experimentally confirmed human miRNA-diseases associations. There are 495 miRNAs and 383 human diseases. An adjacency matrix A with binary values is defined to clearly describe the known miRNAs-disease associations, i.e., if a specific miRNA m(i) is confirmed to be associated with a disease d(j), the entity A(i,j) is assigned 1, otherwise 0. Thus, the adjacency matrix A is with size 495×383.

3.3. Similarity measures

3.3.1. miRNA functional similarity

MiRNA functional similarity has been previously calculated by [32], based on the well-known assumption that miRNAs with similar functions are often related to similar diseases. In this work, based on the mentioned miRNA functional similarity, we construct a corresponding miRNA functional similarity matrix FS, in which FS(i,j) denotes the functional similarity score between miRNA m(i) and m(j).

3.3.2. Disease semantic similarity

It is well known that the relationship among different diseases can be represented by the hierarchical directed acyclic graph (DAG). In detail, a disease d can be described as DAG(d)=(d,T(d),E(d)), where T(d) consists of node d itself and all its ancestor nodes, and E(d) is the edge set including the direct edges linking parent nodes to child nodes [32,8588]. Then, the semantic value of disease d can be defined as follows:

SV(d)=θT(d)Sd(θ), (1)

where

Sd(θ)=1  ifθ=dmax{ΔSd(θ)|θchildrenofθ}ifθd (2)

In Equation (2), Δ represents the semantic contribution factor of disease θ to disease θ. For a given disease d, the contribution of itself is 1, while the contribution of another disease t decreases as the distance between d and t increases. Thus, we denote SS as the disease semantic similarity matrix, and the semantic similarity between disease d(i) and d(j) can be calculated as follows:

SS(d(i),d(j))=tT(d(i))T(d(j))(Sd(i)(t)+Sd(j)(t))SV(d(i))+SV(d(j)). (3)

4. Dual Laplacian regularized matrix completion for miRNA-disease associations prediction

4.1. Standard matrix completion

Matrix completion is the task of filling in the missing entries of a partially observed matrix M. One of the mostly used model of the matrix completion problem is to find the lowest rank matrix X which matches the matrix M, which we wish to recover, for all entries in the set E of observed entries. The basic mathematical formulation of this problem is as follows:

minXrank(X)s.t.Xij=Mij i,jE. (4)

Due to the optimization problem (4) is non-convex and no efficient solution can be obtained, Equation (4) is usually transformed to the following convex problem by relaxing the rank function into the nuclear norm:

minX|X| s.t. Xij=Miji,jE. (5)

where || is the nuclear norm, which is equal to the sum of singular values of X. Equation (5) can be solved by using the singular value thresholding (SVT) algorithm.

4.2. DLRMC

Equation (5) was directly used for miRNAs-disease associations prediction [44] and obtained good results when compared to other previous methods. However, the miRNA functional similarity and disease semantic similarity which have been demonstrated useful in other methods (as shown in Table 1) have not been fully exploited to serve the matrix completion model. Thus, we believe that the miRNA functional similarity and disease semantic similarity can advantage the matrix competition model, of course, better prediction results can be expected. In this work, we propose a novel dual Laplacian regularized matrix completion model (DLRMC) for microRNA-disease associations prediction. In DLRMC, the known microRNA-disease associations, miRNA functional similarity and disease semantic similarity are integrated into a unified model to constrain that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. The optimization problem of DLRMC can be formulated as follows:

minX||X||+α||X||F2+β||A(XA)||F2Known associations preservation+λni,j=1n||xixj||2FS(i,j)+λmp,q=1m||xpxq||2SS(i,j),Dual Laplacian regularization (6)

where xi and xp represent the i-th row and p-th column of X, respectively. α, β, λn and λm are the regularization coefficients, and ‘’ denotes the Hadamard product of two matrices. The Tikhonov regularization on X is used to ensure the smoothness of X. The third term aims to ensure that the scores of known associations in X are close to those in A, i.e., the experimental confirmed miRNAs-disease associations can be preserved after the matrix competition. Since A is with 01 values, we use itself as the indicator matrix to indicate the indices of the observed miRNA-disease associations. The forth and fifth terms constitute the dual Laplacian regularizer which constrain that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. In Equation (6), since both α and β regularize the Frobenius Norm of matrices, we set α=β for simplicity. Then we have three parameters in DLRMC need to be tuned, i.e., β, λn and λm. In our experiments, all parameter combinations are considered based on grid search. The optimal λn and λm are determined from 0.0001,0.001,0.01,0.1,1,10,100,1000,10000, while β is chosen from 0.25,0.5,1,2,4. Finally, the best results corresponding to the optimal parameters combination are reported. In the discussion section, we will analyze the impact of each parameter on the final results.

Note that a similar dual Laplacian regularization term has been used in [49] for identifying microRNA-disease associations (GRNMF). However, our proposed DLRMC differs from GRNMF in two aspects. Firstly, GRNMF is based on the Non-negative matrix factorization model, which decomposes the miRNA-disease association matrix into a basis matrix and a coefficient matrix, then the two learned matrices are used to recovery the potential miRNA-disease associations, while our DLRMC predicts miRNA-disease associations in a more intuitive way by using the matrix completion model which directly fills the missing values which represent potential miRNA-disease associations. Secondly, the purpose of the dual Laplacian regularization term of the two models is different. In GRNMF, the disease similarity matrix and miRNA functional similarity matrix are used to regularize the geometrical structure of the basis matrix and coefficient matrix, respectively. In our DLRMC, the dual Laplacian regularization term are used to regularize the miRNA-disease association matrix, i.e., miRNAs with similar functions should be more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity should be related with similar miRNAs, the physical meaning of our regularization term is more intuitive.

4.3. Optimization

To solve the optimization problem in Equation (6), we first transform it into the following form:

minX||X||+α||X||F2+β||A(XA)||F2+λnTr(XTLnX)+λmTr(XLmXT), (7)

where LnRn×n is the Laplacian matrix with Ln=DnFS, and Dn is the diagonal matrix with Dn(i,i)=jFS(i,j). Similarly, LmRm×m is the Laplacian matrix with Lm=DmSS, and Dm is the diagonal matrix with Dm(i,i)=jSS(i,j).

Since problem (7) contains Hadamard product of two matrices, it is hard to tackle it directly. Thus, we propose an alternative iterative algorithm to solve this problem based on augmented Lagrange multiplier (ALM) algorithm [89]. We first introduce two auxiliary variables J and Z to make the objective function separable:

minX,J,Z||J||+α||X||F2+β||A(ZA)||F2+λnTr(XTLnX)+λmTr(XLmXT)s.t.X=J,X=Z. (8)

The corresponding augmented Lagrange function of Equation (8) is:

L(X,J,Z,Y1,Y2,μ1,μ2)=||J||+α||X||F2+β||A(ZA)||F2+λnTr(XTLnX)+λmTr(XLmXT)+Y1,XJ+μ12||XJ||F2+Y2,XZ+μ22||XZ||F2, (9)

where Y1 and Y2 are the Lagrange multipliers, μ1>0 and μ2>0 control the penalties for violating the linear constraints, and , is the standard inner product of two matrices. Then the variables can be solved alternatively.

Solving J with other variables fixed:

The variable J can be solved by following equation with other variables fixed:

minJ||J||+Y1,XJ+μ12||XJ||F2=minJ||J||+μ12||XJ+Y1μ1||F2, (10)

where J can be solved by singular value thresholding (SVT) operator [90].

4.4. Solving with other variables fixed

When other variables are fixed, Z can be solved by minimizing following function:

minZβ||A(ZA)||F2+<Y2,XZ>+μ22||XZ||F2=minZβ||A(ZA)||F2+μ22||XZ+Y2μ2||F2. (11)

Setting the derivative of Equation (11) w.r.t. Z to zero, and using properties of the Hadamard and Kronecker products, it is easy to get that Z can be obtained as:

Rvec(Z)=vec(C), (12)

where R=2βDiag(vec(A))+μ2I, and C=2β(AA)+μ2X+Y2. This is a simple linear system and can be solved easily.

4.5. Solving with other variables fixed

We can solve X by dropping other unrelated variables as follows:

minXα||X||F2+λnTr(XTLnX)+λmTr(XLmXT)+μ12||XJ+Y1μ1||F2+μ22||XZ+Y2μ2||F2, (13)

By setting the derivative of Equation (13) w.r.t. X to zero, we have

2αX+2λnLnX+2λmXLm+μ1(XJ)+μ2(XZ)+Y1+Y2=0. (14)

Equation (14) is a Sylvester equation [91]. Since 2α+2λnLn is strictly positive definite, Equation (14) has stable solution for X.

Updating Multipliers:

We update the multipliers by

Y1=Y1+μ1(XJ)Y2=Y2+μ2(XZ). (15)

The variables J,Z and X are iteratively updated until convergence. Finally, we obtain the predicted miRNA-disease associations based on the completed entities in matrix X.

5. Discussion

5.1. Stability analysis

In our experiments, we use the Human miRNA-disease associations dataset obtained from the HMDDv2.0 database [84] for testing, which consists of 5430 experimentally confirmed human miRNA-diseases associations. In order to evaluate the stability of DLRMC to the miRNA-disease database, we change the ratio of training samples and compare the final results. The ratio of training samples varies from [10%,15%,20%,,90%]. For each ratio, the selected associations are used as training samples and the rest are used for testing. For each case, we perform the training samples selecting and associations predicting five times with randomly selected training samples. Finally, the average AUC of the five times are reported for each case. In Fig. 3, we plot the average AUC values with varying ratio of selected training samples. As can be seen, when more than half of the experimentally confirmed human miRNA-diseases associations are selected as training samples, DLRMC can obtain stable results. At present, three kinds of information including the known miRNA-disease associations, miRNA functional similarity and the disease semantic similarity are used for potential associations pridiction, we believe that if more information can be obtained, DLRMC can infer more potential miRNA-disease associations.

Figure 3.

Figure 3.

Average AUC values with varying ratio of selected training samples.

5.2. Parameter sensitivity analysis

There are four parameters in our model (i.e., α, β, λm and λn). Since we set α=β, we need to tune three parameters. In order to further demonstrate the performance of the proposed DLRMC, we study its sensitivity w.r.t. the parameters β, λm and λn. Firstly, we fix β=1 and vary λm and λn. Then we fix λm=1 and vary β and λn. Finally, we fix λn=1 and vary β and λm. Figs. 4 and 5 plot the AUC values with varying parameters under global LOOCV and local LOOCV, respectively. As can be seen, the final results are not very sensitive to the parameters. In a certain range, DLRMC can obtain stable results. Through a closer observation, we can find that when β is fixed, the best result can be obtained with λn=100 and λm=10000. When λm or λn is fixed, optimal results can be obtained with β=2.

Figure 4.

Figure 4.

AUC values with varying parameters under global LOOCV.

Figure 5.

Figure 5.

AUC values with varying parameters under local LOOCV.

6. Conclusion

It is necessary to develop efficient computational models for predicting the potential miRNA-disease associations since it is expensive and time-consuming to accomplish this task using experimental methods. In this study, we have proposed a computational method, called DLRMC, for miRNA-disease associations prediction. The main contribution of our work is the development a matrix completion model with a dual Laplacian regularizer which constrains that miRNAs with similar functions are more likely to have connections with diseases which share similar phenotypes and diseases with semantic similarity are tend to be related with similar miRNAs. Meanwhile, the experimental confirmed miRNAs-disease associations can be preserved by using a binary indicator matrix after the completion.

The efficacy of our method is validated through cross validations and case studies on the HMDDv2.0 database. The experimental results verify that DLRMC can effectively improve performance compared with other methods, which indicates that DLRMC can be used for novel miRNAs-disease associations prediction.

Our proposed DLRMC can be regarded as an effective complement to experimental methods for discovering miRNA-disease associations. Biomedical researchers can use our model to computationally infer the miRNAs which are potentially associated with the disease of interest, then the most promising associations can be finally chosen for biological confirmation. By this way, experiments could be more effective and productive. In addition, since the identification of disease-related miRNAs is critical to elucidating the development of tumorigenesis, predicted potential associations can further facilitate the discovery of potential drug targets and treatment options.

Funding Statement

This work was partly supported by the National Natural Science Foundation of China under Grant No. 61701451, and partly supported by the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) under Grant No. CUG170654, and partly supported by Jiangsu Provincial Commission of Health and Family Planning under Grant No. H201556, and partly supported by the Six Top Talent Peak Projects in Jiangsu under Grant No. WSN-019, and partly supported by the 333 Talent Project of Jiangsu Province under Grant No. BRA2017246.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

References

  • [1].Llave C, Carrington JC.. Cleavage of scarecrow-like mrna targets directed by a class of arabidopsis mirna. Science. 2002;297(5589):2053. [DOI] [PubMed] [Google Scholar]
  • [2].Eulalio A, Huntzinger E, Izaurralde E. Getting to the root of mirna-mediated gene silencing. Cell. 2008;132(1):9. [DOI] [PubMed] [Google Scholar]
  • [3].Ambros V. The functions of animal micrornas. Nature. 2004;431(7006):350. [DOI] [PubMed] [Google Scholar]
  • [4].Tang C, Cao L, Zheng X, et al. Gene selection for microarray data classification via subspace learning and manifold regularization. Med Biol Eng Comput. 2018;56(7):1271–1284. [DOI] [PubMed] [Google Scholar]
  • [5].Tang C, Liu X, Li M, et al. Robust unsupervised feature selection via dual self-representation and manifold regularization. Knowledge-Based Syst. 2018;145:109–120. [Google Scholar]
  • [6].Jopling CL, Yi M, Lancaster AM, et al. Modulation of hepatitis c virus rna abundance by a liver-specific microrna. Science. 2005;309(5740):1577–1581. [DOI] [PubMed] [Google Scholar]
  • [7].Vasudevan S, Tong Y, Steitz JA. Switching from repression to activation: micrornas can up-regulate translation. Science. 2007;318(5858):1931–1934. [DOI] [PubMed] [Google Scholar]
  • [8].Cui Q, Yu Z, Purisima EO, et al. Principles of microrna regulation of a human cellular signaling network. Mol Syst Biol. 2006;2(1):46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Karp X, Ambros V. Encountering micrornas in cell fate signaling. Science. 2005;310(5752):1288–1289. [DOI] [PubMed] [Google Scholar]
  • [10].Alshalalfa M, Alhajj R. Using context-specific effect of mirnas to identify functional associations between mirnas and gene signatures. Bmc Bioinformatics. 2013;14(S12):S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Xu P, Guo M, Hay BA. Micrornas and the regulation of cell death. Trends Genet. 2004;20(12):617. [DOI] [PubMed] [Google Scholar]
  • [12].Miska EA. How micrornas control cell division, differentiation and death. Curr Opin Genet Dev. 2005;15(5):563–568. [DOI] [PubMed] [Google Scholar]
  • [13].Witwer KW. Xenomirs and mirna homeostasis in health and disease: evidence that diet and dietary mirnas directly and indirectly influence circulating mirna profiles. RNA Biol. 2012;9(9):1147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Kumarswamy R, Volkmann I, Thum T. Regulation and function of mirna-21 in health and disease. RNA Biol. 2011;8(5):706–713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Ii VB, Li Y, Philip PA, et al. Microrna and cancer: tiny molecules with major implications. Curr Genomics. 2008;9(2):97–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Winter J, Diederichs S. Microrna biogenesis and cancer. Methods Mol Biol. 2005;65(9):3509. [DOI] [PubMed] [Google Scholar]
  • [17].Desano JT, Liang X. Microrna regulation of cancer stem cells and therapeutic implications. AAPS J. 2009;11(4):682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Liu L, Jian W, Khanabdali R, et al. Circular rnas: isolation, characterization and their potential role in diseases. RNA Biol. 2017;14(12):1715–1721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Tarang S, Weston MD. Macros in microrna target identification: A comparative analysis of in silico, in vitro, and in vivo approaches to microrna target identification. RNA Biol. 2014;11(4):324–333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].Calin GA, Dumitru CD, Shimizu M, et al. Frequent deletions and down-regulation of micro- rna genes mir15 and mir16 at 13q14 in chronic lymphocytic leukemia. Proc Natl Acad Sci U S A. 2002;99(24):15524–15529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Mudduluru G, Ceppi P, Kumarswamy R, et al. Regulation of axl receptor tyrosine kinase expression by mir-34a and mir-199a/b in solid cancer. Oncogene. 2011;30(25):2888. [DOI] [PubMed] [Google Scholar]
  • [22].Paraskevi A, Theodoropoulos G, Papaconstantinou I, et al. Circulating microrna in inflammatory bowel disease. J Crohns Colitis. 2012;6(9):900–904. [DOI] [PubMed] [Google Scholar]
  • [23].Wang H, Peng W, Xin O, et al. Reduced circulating mir-15b is correlated with phosphate metabolism in patients with end-stage renal disease on maintenance hemodialysis. Ren Fail. 2012;34(6):685–690. [DOI] [PubMed] [Google Scholar]
  • [24].Calin GA, Croce CM. Microrna signatures in human cancers. Nat Rev Cancer. 2006;6(11):857. [DOI] [PubMed] [Google Scholar]
  • [25].Keller A, Leidinger P, Gislefoss R, et al. Stable serum mirna profiles as potential tool for non-invasive lung cancer diagnosis. RNA Biol. 1900;8(3):506–516. [DOI] [PubMed] [Google Scholar]
  • [26].Siebert M, Westbroek W, Chen YC, et al. Identification of mirnas that modulate glucocerebrosidase activity in gaucher disease cells. RNA Biol. 2014;11(10):1291–1300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Weiland M, Gao XH, Zhou L, et al. Small rnas have a large impact: circulating micrornas as biomarkers for human diseases. RNA Biol. 2012;9(6):850–859. [DOI] [PubMed] [Google Scholar]
  • [28].Chen X, Yan CC, Zhang X, et al. Wbsmda: within and between score for mirna-disease association prediction. Sci Rep. 2016;6:21106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Chen X, Yan GY. Semi-supervised learning for potential human microrna-disease associations inference. Sci Rep. 2014;4:5501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Esquelakerscher A, Slack FJ. Oncomirs - micrornas with a role in cancer. Nat Rev Cancer. 2006;6(4):259. [DOI] [PubMed] [Google Scholar]
  • [31].Bandyopadhyay S, Mitra R, Maulik U, et al. Development of the human cancer microrna network. Silence. 2010;1(1):6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Wang D, Wang J, Lu M, et al. Inferring the human microrna functional similarity and functional network based on microrna-associated diseases. Bioinformatics. 2010;26(13):1644–1650. [DOI] [PubMed] [Google Scholar]
  • [33].Le TD, Zhang J, Liu L, et al. Computational methods for identifying mirna sponge interactions. Brief Bioinform. 2016;18(4):577. [DOI] [PubMed] [Google Scholar]
  • [34].Chen X, Wu QF, Yan GY. Rknnmda: ranking-based knn for mirna-disease association prediction. RNA Biol. 2017;14(7):952–962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Chen X, Zhou Z, Ao YZ. Ellpmda: ensemble learning and link prediction for mirna-disease association prediction. RNA Biol. 2018;15(6):807–818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Tang C, Chen J, Liu X, et al. Consensus learning guided multi-view unsupervised feature selection. Knowledge-Based Syst. 2018;160:49–60. [Google Scholar]
  • [37].Wang G, Juan L, Teng M, et al. Prioritization of disease micrornas through a human phenome-micrornaome network. BMC Syst Biol. 2010;4(S1):S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Shi H, Xu J, Zhang G, et al. Walking the interactome to identify human mirna-disease associations through the functional link between mirna targets and disease genes. BMC Syst Biol. 2013;7(1):101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Xuan P, Han K, Guo M, et al. Prediction of micrornas associated with human diseases based on weighted k most similar neighbors. PloS one. 2013;8(8):e70204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [40].Xu C, Ping Y, Li X, et al. Prioritizing candidate disease mirnas by integrating phenotype associations of multiple diseases with matched mirna and mrna expression profiles. Mol Biosyst. 2014;10(11):2800–2809. [DOI] [PubMed] [Google Scholar]
  • [41].Mork S, Pletscher-Frankild S, Palleja CA, et al. Protein-driven inference of mirna-disease associations. Bioinformatics. 2014;30(3):392–397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Luo J, Ding P, Liang C, et al. Collective prediction of disease-associated mirnas based on transduction learning. IEEE/ACM Trans Comput Biol Bioinform. 2017;14(6):1468–1475. [DOI] [PubMed] [Google Scholar]
  • [43].Chen X, Yan CC, Zhang X, et al. Rbmmmda: predicting multiple types of disease-microrna associations. Sci Rep. 2015;5:13877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Li JQ, Rong ZH, Chen X, et al. Mcmda: matrix completion for mirna-disease association prediction. Oncotarget. 2017;8(13):21187–21199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Mugunga I, Ying J, Liu X, et al. Computational prediction of human disease-related micrornas by path-based random walk. Oncotarget. 2017;8(35):58526–58535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Li ZS, Liu B, Yan C. Cfmda: collaborative filtering-based mirna-disease association prediction. Multimedia Tools Appl. 2017;78(1):1–14. [Google Scholar]
  • [47].Chen H, Zhang Z. Similarity-based methods for potential human microrna-disease association prediction. BMC Med Genomics. 2013;6(1):12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Luo J, Xiao Q. A novel approach for predicting microrna-disease associations by unbalanced bi-random walk on heterogeneous network. J Biomed Inform. 2017;66:194–203. [DOI] [PubMed] [Google Scholar]
  • [49].Xiao Q, Luo J, Liang C, et al. A graph regularized non-negative matrix factorization method for identifying microrna-disease associations. Bioinformatics. 2017;34(2):239–248. [DOI] [PubMed] [Google Scholar]
  • [50].You ZH, Huang ZA, Zhu Z, et al. Pbmda: A novel and effective path-based computational model for mirna-disease association prediction. PLoS Comput Biol. 2017;13(3):e1005455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [51].Chen X, Huang L. Lrsslmda: laplacian regularized sparse subspace learning for mirna-disease association prediction. PLoS Comput Biol. 2017;13(12):e1005912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [52].Chen X, Wang L, Qu J, et al. Predicting mirnadisease association based on inductive matrix completion. Bioinformatics. 2018. [DOI] [PubMed] [Google Scholar]
  • [53].Chen X, Huang L, Xie D, et al. Egbmmda: extreme gradient boosting machine for mirna-disease association prediction. Cell Death Dis. 2018;9(1):3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Chen X, Xie D, Wang L, et al. Bnpmda: bipartite network projection for mirna-disease association prediction. Bioinformatics. 2018;34(18):3178–3186. [DOI] [PubMed] [Google Scholar]
  • [55].Chen X, Qu J, Yin J. Tlhnmda: triple layer heterogeneous network based inference for mirna-disease association prediction. Front Genet. 2018;9:234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [56].Chen X, Wang CC, Yin J, et al. Novel human mirna-disease association inference based on random forest. Mol Ther Nucleic Acids. 2018;13:568–579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [57].Chen X, Xie D, Zhao Q, et al. Micrornas and complex diseases: from experimental results to computational models. Brief Bioinform. 2017. [DOI] [PubMed] [Google Scholar]
  • [58].Chen X, Yan CC, Zhang X, et al. Hgimda: heterogeneous graph inference for mirna-disease association prediction. Oncotarget. 2016;7(40):65257–65269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].Chen X, Yin J, Qu J, et al. Mdhgi: matrix decomposition and heterogeneous graph inference for mirna-disease association prediction. PLoS Comput Biol. 2018;14(8):e1006418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [60].Chen X, Cheng JY, Yin J. Predicting microrna-disease associations using bipartite local models and hubness-aware regression. RNA Biol. 2018;15(9):1192–1205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [61].Zhao Y, Chen X, Yin J. A novel computational method for the identification of potential mirna-disease association based on symmetric non-negative matrix factorization and kronecker regularized least square. Front Genet. 2018;9:324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Jiang Q, Hao Y, Wang G, et al. Prioritization of disease micrornas through a human phenome-micrornaome network. BMC Syst Biol. 2010;4(1):S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].Oved K, Morag A, Pasmanik-Chor M, et al. Genome-wide mirna expression profiling of human lymphoblastoid cell lines identifies tentative ssri antidepressant response biomarkers. Pharmacogenomics. 2012;13(10):1129–1139. [DOI] [PubMed] [Google Scholar]
  • [64].Yang Z, Ren F, Liu C, et al. dbdemc: a database of differentially expressed mirnas in human cancers. BMC Econ. 2010;11(Suppl 4):1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Jiang Q, Wang Y, Hao Y, et al. mir2disease: a manually curated database for microrna deregulation in human disease. Nucleic Acids Res. 2008;37:D98–D104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [66].Phipps AI, Lindor NM, Jenkins MA, et al. Colon and rectal cancer survival by tumor location and microsatellite instability: the colon cancer family registry. Dis Colon Rectum. 2013;56(8):937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [67].Jemal A, Bray F, Center MM, et al. Global cancer statistics. CA Cancer J Clin. 2011;61(2):69–90. [DOI] [PubMed] [Google Scholar]
  • [68].Ogata-Kawata H, Izumiya M, Kurioka D, et al. Circulating exosomal micrornas as biomarkers of colon cancer. PloS one. 2014;9(4):e92921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [69].Espinosa CES, Slack FJ. The role of micrornas in cancer. Yale J Biol Med. 2006;79(3–4):131–140. [PMC free article] [PubMed] [Google Scholar]
  • [70].Cahill S, Smyth P, Denning K, et al. Effect of braf v600e mutation on transcription and post-transcriptional regulation in a papillary thyroid carcinoma model. Mol Cancer. 2007;6(1):21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [71].Chong V, Abdullah M, Telisinghe P, et al. Colorectal cancer: incidence and trend in brunei darussalam. Singapore Med J. 2009;50(11):1085. [PubMed] [Google Scholar]
  • [72].Manojlović D, Elaković D, Lj M. Therapeutic value of transcatheter embolization in malignant tumors of the renal parenchyma. Srp Arh Celok Lek. 1986;114(7):631–637. [PubMed] [Google Scholar]
  • [73].Duque JL, Loughlin KR, O’Leary MP, et al. Partial nephrectomy: alternative treatment for selected patients with renal cell carcinoma. Urology. 1998;52(4):584–590. [DOI] [PubMed] [Google Scholar]
  • [74].Senanayake U, Das S, Vesely P, et al. mir-192, mir-194, mir-215, mir-200c and mir-141 are downregulated and their common target acvr2b is strongly expressed in renal childhood neoplasms. Carcinogenesis. 2012;33(5):1014–1021. [DOI] [PubMed] [Google Scholar]
  • [75].Liu W, Zabirnyk OH, Shiao YH, et al. mir-23b targets proline oxidase, a novel tumor suppressor protein in renal cancer. Oncogene. 2010;29(35):4914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [76].Maryam D, Sara S, Malin W, et al. In vitroandin vivoactivity of melflufen (j1)in lymphoma. BMC Cancer. 2016;16(1):1–9. [Google Scholar]
  • [77].Chen S, Wang Z, Dai X, et al. Re-expression of microrna-150 induces ebv-positive burkitt lymphoma differentiation by modulating c-myb in vitro. Cancer Sci. 2013;104(7):826–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Ge TT, Liang Y, Fu R, et al. Expressions of mir-21,mir-155 and mir-210 in plasma of patients withlymphoma and its clinical significance. J Exp Hematol. 2012;20(20):305–309. [PubMed] [Google Scholar]
  • [79].Ohyashiki K, Umezu T, Yoshizawa S, et al. Clinical impact of down-regulated plasma mir-92a levels in non-hodgkin’s lymphoma. Plos One. 2011;6(2):e16408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [80].Gmyrek GA, Walburg M, Webb CP, et al. Normal and malignant prostate epithelial cells differ in their response to hepatocyte growth factor/scatter factor. Am J Pathol. 2001;159(2):579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [81].Mcguire S. World cancer report 2014. geneva, switzerland: world health organization, international agency for research on cancer, who press, 2015. Adv Nutr. 2016;7(2):418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [82].Hart M, Wach S, Nolte E, et al. The proto-oncogene erg is a target of microrna mir-145 in prostate cancer. FEBS J. 2013;280(9):2105–2116. [DOI] [PubMed] [Google Scholar]
  • [83].Ueno K, Hirata H, Shahryari V, et al. microrna-183 is an oncogene targeting dkk-3 and smad4 in prostate cancer. Br J Cancer. 2013;108(8):1659–1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [84].Li Y, Qiu C, Tu J, et al. Hmdd v2. 0: a database for experimentally supported human microrna and disease associations. Nucleic Acids Res. 2013;42(D1):D1070–D1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [85].Chen X, Yan CC, Luo C, et al. Constructing lncrna functional similarity network based on lncrna-disease associations and disease semantic similarity. Sci Rep. 2015;5:11338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [86].Chen X, Yan CC, Zhang X, et al. Long non-coding rnas and complex diseases: from experimental results to computational models. Brief Bioinform. 2016;18(4):558–576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [87].Chen X, Huang YA, Wang XS, et al. Fmlncsim: fuzzy measure-based lncrna functional similarity calculation model. Oncotarget. 2016;7(29):45948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [88].Huang YA, Chen X, You ZH, et al. Ilncsim: improved lncrna functional similarity calculation model. Oncotarget. 2016;7(18):25902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [89].Lin Z, Liu R, Su Z. Linearized alternating direction method with adaptive penalty for low-rank representation. In: Advances in Neural Information Processing Systems. Curran Associates, Inc.; 2011. p. 612–620.
  • [90].Cai JF, Candès EJ, Shen Z. A singular value thresholding algorithm for matrix completion. SIAM J Optim. 2010;20(4):1956–1982. [Google Scholar]
  • [91].Bartels RH, Stewart GW. Solution of the matrix equation ax + xb = c. Commun ACM. 1972;15(9):820–826.http://dl.acm.org/citation.cfm?doid=361573.361582 [Google Scholar]

Articles from RNA Biology are provided here courtesy of Taylor & Francis

RESOURCES