Skip to main content
BioMed Research International logoLink to BioMed Research International
. 2021 Feb 24;2021:6652948. doi: 10.1155/2021/6652948

Prediction of miRNA-Disease Association Using Deep Collaborative Filtering

Li Wang 1, Cheng Zhong 2,
PMCID: PMC7929672  PMID: 33681362

Abstract

The existing studies have shown that miRNAs are related to human diseases by regulating gene expression. Identifying miRNA association with diseases will contribute to diagnosis, treatment, and prognosis of diseases. The experimental identification of miRNA-disease associations is time-consuming, tremendously expensive, and of high-failure rate. In recent years, many researchers predicted potential associations between miRNAs and diseases by computational approaches. In this paper, we proposed a novel method using deep collaborative filtering called DCFMDA to predict miRNA-disease potential associations. To improve prediction performance, we integrated neural network matrix factorization (NNMF) and multilayer perceptron (MLP) in a deep collaborative filtering framework. We utilized known miRNA-disease associations to capture miRNA-disease interaction features by NNMF and utilized miRNA similarity and disease similarity to extract miRNA feature vector and disease feature vector, respectively, by MLP. At last, we merged outputs of the NNMF and MLP to obtain the prediction matrix. The experimental results indicate that compared with other existing computational methods, our method can achieve the AUC of 0.9466 based on 10-fold cross-validation. In addition, case studies show that the DCFMDA can effectively predict candidate miRNAs for breast neoplasms, colon neoplasms, kidney neoplasms, leukemia, and lymphoma.

1. Introduction

miRNAMicroRNAs (miRNAs)areshort endogenous noncoding RNAs with about 22 nucleotides. A number of studies have shown that miRNAs play important roles in many biological processes including cell proliferation, development, differentiation, death, apoptosis, metabolism, aging, signal transduction, and viral infection [16]. Biological studies have revealed that dysregulation of miRNAs is closely related to the occurrence and development of complex diseases [79]. Dysregulation of miR-15 and miR-16 was discovered to be related with B-cell chronic lymphocytic leukemia firstly [10]. So far, it has been verified that many miRNAs are related to cancers. Five members of the miRNA-200 family (miR-200a, miR-200b, miR-200c, miR-141, and miR-429) are downregulated in the development of breast cancer [11]. Epigenetic modulation of the miR-200 family relates to transition to a breast cancer stem cell-like state [12]. Some studies demonstrate that in human colorectal cancer cells, miR-186, miR-216b, miR-337-3p, and miR-760 could work in synergy to induce cellular senescence by targeting the alpha subunit of protein kinase CKII [13]. By accurately measuring expression levels of miRNAs in the serum of 220 patients with early-stage non-small cell lung cancer and 220 matched controls, researchers found that the expressions of miR-27a, miR-106a, miR-221, miR-146b, miR-155, miR-17-5p, and let-7 were lower than those in controls, while the expression of miR-29c was increased [14].

Identifying the miRNAs associated with diseases will contribute to exploring the pathogenesis, diagnosis, treatment, and prognosis of diseases and help to develop new drugs. Some studies showed that miRNA-23, miRNA-24, and miRNA-27 contained underlying therapeutic factors in ischemic heart and vascular disease [15]. By targeting the BCL6 corepressor such as BCORL1, the migration and invasion of hepatocellular carcinoma (HCC) cells are restrained by miR-876-5p, which provides a new idea for the treatment of HCC [16]. However, the experimental methods for finding associations between miRNAs and diseases are expensive and time-consuming. The computational methods for predicting potential miRNA-disease associations can provide verifiable hypotheses for further experimental verification, which can reduce biological experiment time and improve the experimental efficiency.

Recently, plenty of computational methods have been proposed to predict potential miRNA-disease associations [17]. Most of the computational methods are based on the assumption that miRNAs with similar functions are more likely to be associated with phenotypically similar diseases and vice versa. These methods are based on different principles to predict miRNA-disease associations, such as similarity-based methods, machine learning-based methods, and matrix factorization-based methods.

The previous similarity-based computational methods were based on miRNA-target interaction network and protein-protein interaction (PPI). For example, Jiang et al. [18] proposed a method to predict potential miRNA-disease associations by applying a scoring system to human phenome-microRNAome network and functionally related miRNA network. Shi et al. [19] developed a computational framework to identify miRNA-disease associations by preforming random walk with restart. The method utilized the function connections between miRNAtargets and disease genes in protein-protein interaction (PPI) networks. Mrk et al. [20] presented a miRNA-protein-disease association prediction model (miRPD) in which miRNAs are linked to diseases via the underlying proteins. However, these methods largely relied on miRNA-target interactions which have high false-positive rate and false-negative rate, so they cannot achieve satisfying prediction performance.

To solve the above-mentioned problem, some similarity-based methods without relying on miRNA-target interactions were proposed. Similarity computation strategy is the key issue for miRNA-disease prediction [21]. Xuan et al. [22] presented a prediction algorithm called HDMP. HDMP predicts potential disease-associated miRNAs based on weighted k-nearest similar neighbors. However, HDMP could not predict miRNAs (diseases) associated with new diseases (miRNAs) due to local similarity networks. So, some global similarity-based methods were proposed, which construct a heterogeneous global network by integrating miRNA similarity, disease similarity, and known human miRNA-disease associations. For example, Chen et al. [23] proposed a global network-based prediction model, RWRMDA, to infer potential miRNA-disease association by implementing the random walk algorithm on a global network. However, it was not applicable for new diseases without any known associated miRNAs. Xuan et al. [24] proposed another prediction model called MIDP based on random walk. Compared with RWRMDA, it could predict related miRNA for new diseases. Liu et al. [25] proposed a miRNA-disease association prediction method by random walk on heterogeneous network constructed by integrating multiple data sources. In addition, some improved algorithms based on random walk were proposed [26, 27]. In addition to the random walk algorithm, other global network-based methods were proposed. For example, Chen et al. [28] developed the model for miRNA-disease association prediction (WBSMDA) by utilizing within score and between score. The within-score can capture miRNA similarity and disease similarity in known miRNA disease pairs, and the between-score can capture miRNA similarity and disease similarity in unknown miRNA-disease pairs. Next year, Chen et al. [29] proposed a computational model based on super-disease and miRNA for potential miRNA-disease association (SDMMDA) prediction. You et al. [30] proposed a path-based miRNA-disease association (PBMDA) prediction model. PBMDA adopted depth-first search algorithm on a heterogeneous graph. Zeng et al. [31] applied link prediction algorithm named structural perturbation method (SPM) on the miRNA-disease bilayer network to predict potential miRNA-disease associations. Chen et al. [32] proposed a computational model of bipartite network projection for miRNA-disease association (BNPMDA) prediction. The model took advantage of the agglomerative hierarchical clustering and improved the baseline algorithm of bipartite network recommendation based on the constructed bias ratings. In addition, some researcher utilized lncRNA-related other information to predict potential miRNA-disease associations. Chen et al. [33] developed a triple layer heterogeneous network miRNA-disease association (TLHNMDA) prediction model. In the model, the triple layer network was constructed by integrating the known miRNA-disease associations, miRNA-lncRNA interactions, miRNA function similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. Zhao et al. [34] developed a computational method based on a distance correlation set to predict miRNA-disease associations (DCSMDA), which integrated known lncRNA-disease associations, known miRNA-lncRNA associations, disease semantic similarity, and various lncRNA and disease similarity measures to construct a miRNA-lncRNA-disease network.

Furthermore, many computational models using machine learning to identify potential associations between miRNAs and diseases have also begun to appear. Chen and Yan [35] proposed regularized least squares for miRNA-disease association (RLSMDA) to uncover the relationship between diseases and miRNAs. Next, Chen and Huang [36] presented a prediction model based on Laplacian regularized sparse subspace learning called LRSSLMDA, which extracted two informative feature profiles by performing feature extraction from the integrated similarity. Chen et al. [37] developed a model of random forest for miRNA-disease association (RFMDA) prediction based on machine learning. Liang et al. [38] developed a method to discover disease-related candidate miRNAs based on adaptive multiview multilabel learning. Zhao et al. [39] developed adaptive boosting for miRNA-disease association (ABMDA) prediction to predict potential associations between diseases and miRNAs, which can balance the positive and negative samples by performing random sampling based on k-mean clustering on negative samples, and integrated weak classifiers to form a strong classifier based on corresponding weights. Wang et al. [40] proposed miRNA-disease association prediction model- (LMTRDA) based logistic model tree, which fused multisource information, especially the introduced miRNA sequence information. Chen et al. [41] proposed an ensemble of decision tree-based miRNA-disease association (EDTMDA) prediction model, which is a computational framework of integrating ensemble learning and dimensionality reduction. Deep learning can capture hidden, complex, and nonlinear relationships from the original data. Deep learning has been applied to various fields of bioinformatics. With the rapid development of deep learning, some deep learning-based methods have been proposed to solve the problem about miRNA-disease association prediction. For example, Chen et al. [42] proposed a model of restricted Boltzmann machine to predict multiple types of miRNA-disease association (RBMMMDA). Xuan et al. [43] presented the convolutional network-based methods for predicting candidate disease. Zeng et al. [44] developed a neural network model to predict miRNA-disease associations (NNMDA). NNMDA not only aggregated the neighbor information during the process but also preserved the topology of the original network at the same time. Gong et al. [45] proposed a network embedding-based multiple information integration method (NEMII) for miRNA-disease association prediction. Peng [46] proposed a learning-based framework, MDA-CNN, for miRNA-disease association identification. The model captures interaction features based on disease similarity network, miRNA similarity network, and protein-protein interaction network and employed an autoencoder to identify the essential feature combination for each miRNA-disease pair, and it used a convolutional neural network to predict the final label. Chen et al. [47] developed a model of deep-belief network for miRNA-disease association (DBNMDA) prediction. DBNMDA utilizes the information of all miRNA-disease pairs by introducing the unsupervised pretraining process. Then, according to the parameters obtained by pretraining, positive samples and the same number of randomly selected negative samples were applied to fine-tune deep-belief network.

Recently, some researchers have introduced the recommendation system to predict miRNA-disease association. Matrix factorizations are widely used in the recommendation systems. Some computational models based on matrix completion have been proposed. For example, Li et al. [48] proposed a miRNA-disease association prediction method based on matrix completion (MCMDA). MCMDA could not predict miRNAs for new diseases with no associations. In order to solve this problem, Chen et al. [49] proposed a computational model-based inductive matrix completion for miRNA-disease association prediction (IMCMDA). The model integrated miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. In addition, Chen et al. [50] integrated neighborhood constraint with matrix completion and proposed a computational model based neighborhood constraint matrix completion for miRNA-disease association (NCMCMDA) prediction. On the other hand, matrix decomposition is also used for identifying potential miRNA-disease associations. For example, Xiao et al. [51] proposed a prediction framework called graph regularized nonnegative matrix factorization (GRNMF) to infer the unknown miRNA-disease associations in heterogeneous omics data. Chen et al. [52] took advantage of the matrix factorization and network algorithm to develop a matrix decomposition and heterogeneous graph inference (MDHGI) for miRNA-disease association prediction. Cui et al. [53] proposed a robust collaborative matrix factorization method to predict novel miRNA-disease associations. The method improved the prediction accuracy by introducing the weighted K nearest known neighbors and the L2,1‐norm. Gao et al. [54] presented a computational framework based on graph Laplacian regularized L2,1-nonnegative matrix factorization (GRL2,1‐NMF) for inferring possible disease-connected miRNAs.

To further improve the prediction performance, we study to predict potential miRNA-disease associations based on matrix factorization and deep learning. We propose a new miRNA-disease association prediction method called DCFMDA, which combines the multilayer perceptron (MLP) and the neural nonnegative matrix factorization (NNMF) in a deep collaborative filtering framework. Firstly, we obtain miRNA and disease similarity matrices by integrating multiple heterogeneous data. Then, we utilize MLP to extract high-level features from miRNA and disease similarity matrices and decompose the known miRNA-disease association into two low rank matrices by NNMF. Finally, we merge the output of the MLP submodel and the NNMF submodel to obtain prediction results for miRNA-disease potential associations.

The rest of this paper is organized as follows. Section 2 describes the data and method. Section 3 presents experimental results. Section 4 summarizes the paper.

2. Data and Methods

2.1. Data

2.1.1. Human miRNA-Disease Association

HMDD is a database that curated experiment-supported evidence for human miRNA-disease associations. We downloaded known miRNA-disease association data from HMDD V2.0 [55], which includes 5430 experimentally verified miRNA-disease associations between 383 miRNAs and 495 diseases. We used adjacency matrix A ∈ ℝM×N to formalize the miRNA-disease associations, where M and N are the number of miRNAs and diseases, respectively. If miRNA m is experimentally verified to be related with disease d, the value of A(m, d) is 1, otherwise 0.

2.1.2. Disease Semantic Similarity

In the National Library of Medicine MeSH, each disease is described as a hierarchical Directed Acyclic Graph (DAG). As described in [56], the disease semantic similarity can be calculated based on these DAGs. For example, disease d can be represented as a graph DAG(d) = (d, Td, Ed), where Td is the disease set of all ancestor nodes of disease d including disease d itself and Ed is the edge set of corresponding links. The contribution of disease t in DAG to the semantic value of disease d is defined as follows:

Ddt=1,if t=d,Ddt=max12DdttC,if td, (1)

where C is children set of t.

The semantic value DD(d) of disease d is calculated by

DDd=tTdDdt. (2)

The larger the part the two diseases share in their DAGs, the higher the similarity between the two diseases is. The semantic similarity SSD(di, dj) of disease di and dj is defined as follows:

SSDdi,dj=tTdiTdjDdit+DdjtDDdi+DDdj,i,j=1,2,,N. (3)

2.1.3. Disease Functional Similarity by Functional Gene Network

The score of functional similarity between two diseases can be measured by disease-gene and gene-gene association data [57]. The functional similarity FSD(gi, gj) between gene gi and gj is defined as follows:

FSDgi,gj=1,i=j,LLSNgi,gj,ij,ei,jEH,0,ij,ei,jEH, (4)

where e(i, j) denotes edge between gene gi and gj, E(H) denotes the set of all edges in the HumanNet V2 database [58], and LLSN(gi, gj) is an associated log likelihood score (LLS) that measures the probability of a functional linkage between gene gi and gj after normalization.

The functional association FG(g) between gene g and gene set G = {g1, g2, ⋯, gk} is defined as follows:

FGg=max1ikFSDg,gi, (5)

where k indicates the number of genes in G, gi is the ith gene of G, i = 1, 2, ⋯, k.

The functional similarity FSD(di, dj) between disease di and dj is defined as follows:

FSDdi,dj=FSDG1,G2=1imFG2g1i+1jnFG1g2jm+n, (6)

where G1 = {g11, g12, ⋯, g1m} and G2 = {g21, g22, ⋯, g2n} are gene set related to diseases di and dj, respectively, m is the number of genes in G1, and n is the number of genes in G2.

2.1.4. miRNA Functional Similarity

We obtained the miRNA functional similarity data by the method provided in [56]. In the previous subsection, we have described calculating the semantic similarity between diseases. The functional similarity for each miRNA pair was calculated based on the semantic similarity of diseases. Firstly, the similarity between disease d and disease group D = {d1, d2, ⋯, dk} is calculated by

Sd,D=max1ikSSDd,di, (7)

where k denotes the number of diseases in D and di is the ith disease of D, i = 1, 2, ⋯, k.

Then, calculation of the functional similarity between miRNA mi and mj is equal to calculating the similarity between D1 and D2, where D1 and D2 represent the related disease sets of miRNA mi and mj, respectively. Finally, the matrix FSMM×M is used to denote the miRNA functional similarity. FSM(mi, mj) represents the functional similarity between miRNAs mi and mj, which is calculated as follows:

FSMmi,mj=1sD2Sds,D1+1tD1Sdt,D2D1+D2. (8)

2.1.5. Gaussian Interaction Profile Kernel Similarity

Gaussian kernel is a commonly used kernel function, which has been proven effective for measuring both miRNA similarity and disease similarity [59]. The interaction profile IP(di) of disease di is the ith column vector of the miRNA-disease association matrix. It is a binary vector representing the presence or absence of its associations with each miRNA. Gaussian interaction profile kernel similarity GD(di, dj) between disease di and dj is defined as follows [59]:

GDdi,dj=expβdIPdiIPdj2,βd=βd1/Ni=1NIPdi2, (9)

where i, j = 1, 2, ⋯, N and βd is used to control kernel bandwidth, which is obtained by normalizing the average number  βd′ of associated miRNAs per disease.

Similarly, the Gaussian interaction profile kernel similarity GM(mi, mj) between miRNA mi and mj is defined as follows:

GMmi,mj=expβmIPmiIPmj2,βm=βm1/Mi=1MIPmi2, (10)

where i, j = 1, 2, ⋯, M.

2.1.6. Integrated Similarity for miRNAs and Diseases

In order to overcome the shortcomings of single similarity measure to accurately reflect the characteristics of miRNA similarity and disease similarity from different perspectives, some method integrated multiple different similarity measure data to construct the miRNA similarity matrix and disease similarity matrix to improve the prediction performance.

We integrate miRNA functional similarity FSM and miRNA Gaussian interaction profile kernel similarity GM to construct the miRNA similarity matrix ISM(mi, mj) between miRNA mi and mj.

ISMmi,mj=FSMmi,mj,if FSMmi,mj0,GMmi,mj,otherwise. (11)

We also integrate disease semantic similarity SSD, the disease functional similarity FSD, and the disease Gaussian interaction profile kernel similarity GD to construct the disease similarity matrix ISD(di, dj) between disease di and dj. The formula is as follows:

ISDdi,dj=FSDdi,dj+SSDdi,dj2,if FSDdi,dj0 or SSDdi,dj0,GDdi,dj, otherwise. (12)

2.2. Methods

As a universal computational algorithm, the recommendation algorithm has been applied in many fields including bioinformatics. The miRNA-disease association prediction can be regarded as a recommendation problem. This kind of prediction method regards miRNAs as users and diseases as commodities and recommends miRNAs to a disease according to its known preference on miRNAs and vice versa. Traditional recommendation models are mainly divided into collaborative filtering, content-based recommendation system, and hybrid recommendation system. Recently, researchers have proposed some recommendation algorithms using deep learning to overcome the shortcomings of traditional collaborative filtering models [60].

In this paper, we proposed a new deep collaborative filtering framework for miRNA-disease association prediction called DCFMDA. This method combines the multilayer perceptron (MLP) submodel and neural nonnegative matrix factorization (NNMF) submodel in deep collaborative filtering framework. Firstly, in the MLP submodel, the mth row of the miRNA similarity matrix (i.e., the similarity data between miRNA m and all the other miRNAs) was fed into a multilayer perception, and the dth row of the disease similarity matrix (i.e., the similarity data between disease d and all the other diseases) was fed into another multilayer perception. The two MLPs would be trained to learn high-level biological patterns from miRNA similarity and disease similarity, respectively. Secondly, all known miRNA-disease association pairs were fed into the neural NNMF submodel to train. Finally, the output of the two submodels was merged to get prediction scores of miRNA-disease pairs. The proposed method is shown in Figure 1.

Figure 1.

Figure 1

Workflow of deep collaborative filtering for miRNA-disease association prediction.

2.2.1. Neural Nonnegative Matrix Factorization (NNMF)

Matrix factorization (MF) based approaches are proven to be highly accurate and scalable in addressing collaborative filtering (CF) problems [61]. The purpose of nonnegative matrix factorization (NMF) is to find two nonnegative matrices whose product is optimal approximation to the original matrix. Given miRNA-disease association matrix A, it can be decomposed into the product of two low rank nonnegative matrices W and H, namely, AWHT. Solving the problem of prediction miRNA-disease potential associations using NMF can be described as the following objective function:

minAWHTF2. (13)

However, the matrix factorization method only uses the fixed inner product to predict miRNA-disease associations, which leads to some limitation of prediction algorithm. So we use a nonnegative matrix factorization submodel (NNMF) based on a two-layer fully connected neural network to predict the potential association between miRNA and disease. In the NNMF submodel, one-hot encodings of miRNA i and disease jare used as the input vectors and two embedding vectors mi and dj are obtained by the embedding layer. We have two two-layer fully connected neural networks to transform the representations of mi and dj. Through the neural network, mi and dj are mapped to low-dimensional vectosr pi and qj in a latent space, respectively. The miRNA-disease association prediction submodel based on NNMF is as follows:

y^ij=fNNMFi,jpi,qj=piTqj=piqj, (14)

where i = 1, 2, ⋯, M, j = 1, 2, ⋯, N

2.2.2. Multilayer Perceptron (MLP)

Multilayer perceptron (MLP) is a deep learning structure, which is a feedforward neural network with multiple hidden layers between input and output layers. A single layer perceptron cannot classify linear inseparable problems, but the multilayer perceptron can overcome this weakness by the nonlinear mapping of input space based on activation function. MLP has high ability of nonlinear modeling. The structure of the multilayer perceptron model is shown in Figure 2.

Figure 2.

Figure 2

The multilayer perceptron.

We regard the miRNA-disease association prediction as a binary classification problem. That is, if there is a correlation between miRNA i and disease j, the corresponding label will be 1, otherwise 0. We use a submodel based on MLP to solve the problem for miRNA-disease association prediction. The number of neurons in each hidden layer of MLP is less than that in the previous layer. The number of neurons in the first hidden layer is equal to the dimension of the input vector, and the number of neurons in the last hidden layer is equal to the dimension of the output vector. Formally, we denote the input vector by X, the output vector by Y, the number of hidden layers by L, the connection weight matrix from hidden layer l − 1 to hidden layer l by Wl, and the bias vector of the lth layer by bl, where 1 ≤ lL. The MLP model is formulated as follows [62]:

a0=X,a1=θ1W1Ta0+b1,a2=θ2W2Ta1+b2,aL=θLWLTaL1+bL,Y=ϕMLPX=aL, (15)

where al denotes the output of the lth layer and θl denotes the activation function of the lth layer. We select ReLU as the activation function of each hidden layer, which can be computed by f(x) = max(0, x). ReLU is employed to alleviate the problem of the gradient disappearance and solve the overfitting problem of machine learning [63].

In our proposed MLP submodel, we used two MLPs to transform the representations of miRNA and disease. The miRNA similarity matrix ISM and disease similarity matrix ISD are the input of these two multilayer perceptrons. ISMi is the ith row of matrix ISM, which represents the similarity feature of miRNA mi. ISDj is the jth row of matrix ISD, which represents similarity feature of disease dj. We used ISMi to train the left MLP and used ISDj to train the right MLP. Through the neural network, ISMi and ISDj are finally mapped to a low-dimensional vector in a latent space. So the similarity feature vectors of the miRNA mi and disease dj can be formulated as follows:

miMLP=ϕMLPISMi,djMLP=ϕMLPISDj. (16)

The miRNA-disease association prediction submodel based on MLP is as follows:

y^ij=fMLPISMi,ISDjmiMLP,djMLP=σmiMLP,djMLP, (17)

where i = 1, 2, ⋯, M, j = 1, 2, ⋯, N, σ(x) = 1/(1 + ex).

2.2.3. Method DCFMDA

We construct a prediction model based on NNMF and MLP. We capture the linear relationship between miRNAs and diseases by NNMF and learn the nonlinear relationship between miRNAs and diseases by MLP. The NNMF submodel and the MLP submodel share the embedding layer.

The NNMF submodel learns from known miRNA-disease association to obtain the original prediction score. The MLP submodel learns the low-dimensional feature of miRNA and disease from the miRNA similarity matrix and disease similarity matrix, respectively. Finally, we merge outputs of the NNMF submodel and MLP submodel to obtain the final prediction score for disease-related miRNAs. The presented model is formulated as follows:

yijNNMF=fNNMFi,jmiNNMF,djNNMF=miNNMFdjNNMF, (18)
miMLP=ϕMLPISMi, (19)
djMLP=ϕMLPISDj, (20)
miDCFMDA=miMLPmi, (21)
djDCFMDA=djMLPdj, (22)
y^ij=σyijNNMFmiDCFMDAdjDCFMDA, (23)

where mi and dj are two embedding vectors of miRNA i and disease j, respectively. ISMi is the similarity feature of miRNA i, ISDj is the similarity feature of disease j, miMLP denotes the output of the left MLP, djMLP denotes the output of the right MLP, yijNNMF denotes original prediction score, miDCFMDA is obtained by concatenating miMLP with the miRNA embedding vector mi, and djDCFMDA is obtained by concatenating djMLP with the disease embedding vector dj.

The final layer of the DCFMDA is used for the classification task, its activation function σ(x)is the sigmoid function σ(x) = 1/(1 + ex).

In our proposed model, we use the binary crossentropy loss function :

L=i,jAyijlogy^ij+1yijlog1y^ij, (24)

where (i, j) ∈ A is the index of the training examples, yij represents the true label for the given input sample (i, j), and y^ij represents predicted result. The purpose of deep learning is to minimize loss function through continuous training iterations to get the best prediction.

Algorithm 1 describes our proposed miRNA-disease association prediction algorithm using deep collaborative filtering called DCFMDA.

Algorithm 1.

Algorithm 1

DCFMDA.

3. Result

3.1. Performance Evaluation

To evaluate the prediction performance of algorithm DCFMDA, we perform a 10-fold cross-validation on known experimentally verified miRNA-disease associations. The 5430 experiment-supported miRNA-disease associations are considered as positive samples. We are not sure which miRNAs are not associated with diseases. So, for each known miRNA-disease pair, we will randomly sample four unobserved miRNA-disease pairs as negative samples. For the 10-fold cross-validation, all positive and negative samples are randomly divided into ten parts. In each fold, nine of the ten parts are used for the training model in turn, and the remaining one is used as test samples.

We use the receiver operating characteristic (ROC) curve, area under ROC (AUC), precision-recall (PR) curve, and F1 score to evaluate the performance of the predictive algorithm. The ROC curve plots the true-positive rate (TPR) versus the false-positive rate (FPR) at different thresholds. The value of AUC is usually between 0.5 and 1. When AUC is 1, it means that the prediction result will achieve the best effect. Our data is seriously unbalanced, because the number of negative samples (unconfirmed miRNA-disease associations) is much larger than the number of positive samples (experiment-supported miRNA-disease associations). Therefore, we also draw a PR curve to evaluate the prediction ability of different miRNA-disease association prediction algorithms. We compare DCFMDA with four existing miRNA-disease prediction algorithms HDMP, RLSMDA, IMCMDA, and SPM. Figure 3 shows ROC curves and PR curves of the five prediction algorithms and reports their corresponding AUCs in a 10-fold cross-validation on experimentally verified miRNA-disease associations. Figure 3(a) shows that DCFMDA almost always has the highest TPRs under the same false-negative rates, and obtains the highest AUC (0.94662) among these algorithms, whereas the AUCs of HDMP, RLSMDA, IMCMDA, and SPM are 0.87156, 0.87453, 0.84081, and 0.86274, respectively. Figure 3(b) shows that DCFMDA achieves a higher precision than all the other algorithms for any given recall value. For ROC curve or PR curve, DCFMDA performs significantly better than the other four algorithms. F1 score is the harmonic mean of both metrics of recall and precision. Since there is a trade-off between precision and recall, F1 score is also used to evaluate the performance of algorithms. Table 1 shows the F1 score of the top-K candidates. The F1 score of DCFMDA is more stable, while F1 scores of the other four algorithms were decreasing from the top 50 to top 1000. From Table 1, we can see that DCFMDA achieved better performance than other algorithms in terms of the F1 score.

Figure 3.

Figure 3

Performance comparison for HDMP, RLSMDA, IMCMDA, SPM, and DCFMDA in terms of ROC curve, AUC, and PR curve.

Table 1.

Performance comparison for HDMP, RLSMDA, IMCMDA, SPM, and DCFMDA in terms of F1 score.

Top 100 Top 200 Top 300 Top 400 Top 500 Top 600 Top 700 Top 800 Top 900 Top 1000
DCFMDA 0.8827 0.8919 0.9091 0.8996 0.8938 0.8806 0.8773 0.8803 0.8854 0.8777
HDMP 0.9130 0.9417 0.8047 0.7635 0.7760 0.8201 0.8215 0.8405 0.8424 0.8317
RLSMDA 0.9473 0.9247 0.9209 0.9262 0.9142 0.9041 0.8906 0.8788 0.8701 0.8623
IMCMDA 0.9847 0.9446 0.8785 0.8506 0.8372 0.8048 0.7784 0.7635 0.7439 0.7318
SPM 0.9583 0.9333 0.9150 0.8732 0.8345 0.8142 0.8003 0.7943 0.7879 0.7812

The experiment results indicate that our method can achieve higher prediction performance. One reason is that we introduce deep learning into miRNA-disease prediction and effectively integrated neural nonnegative matrix factorization and multilayer perceptron technology to predict potential miRNA-disease associations. Another reason is that different miRNA similarity and disease similarity data are used to construct the miRNA similarity matrix and the disease similarity matrix.

3.2. Case Study

In order to further verify the prediction performance of DCFMDA, we carried out case studies on five diseases including breast neoplasms, colon neoplasms, kidney neoplasms, leukemia, and lymphoma. From HMDD V2.0, we obtained 5430 known associations and 184155 unknown associations between 495 miRNAs and 383 diseases. For our case study, all the known miRNA-disease associations were used as training samples, and other unknown associations were regarded as candidate associations for validation. For each investigated disease, we ranked candidate miRNAs according to their predicted scores and selected the top-50 candidate miRNAs to verify whether the candidate miRNAs were associated with the current disease by two other databases, namely, dbDEMC [64] and miRCancer [65], as well as published literatures. The validation results are shown in Table 2. The database dbDEMC 2.0 is an integrated database that documents 209 expression profiling data sets with 36 cancer types and 73 subtypes, and a total of 2224 differentially expressed miRNAs were identified. It allows users to make a quick search of the differentially expressed miRNAs in certain cancer types. The database miRCancer is a miRNA-cancer association database that provides comprehensive collection of miRNA expression profiles in various human cancers. A user can search the database by miRNA or cancer name.

Table 2.

Proportion of verified associations in the top-50 candidate miRNAs for five different diseases.

Breast neoplasms Colon neoplasms Kidney neoplasms Leukemia Lymphoma
Percentage 98% 100% 98% 92% 98%

There are intersections between known miRNA-disease associations obtained from databases HMDD V2.0, dbDEMC, and miRCancer. For example, 546 of the 5430 known miRNA-disease associations in HMDD V2.0 also exist in dbDEMC 2.0. Because we only predict and verify the candidate miRNAs unrelated to the investigated disease in HMDD V2.0, none of these candidate miRNAs exist in HMDD V2.0. We can be sure that the validation of candidate miRNAs is completely independent of HMDD V2.0.

Breast neoplasms are one of the most common cancers for women. We have inferred associations between all the candidate miRNAs for breast neoplasm and confirmed 49 of the top-50 candidate miRNAs to be association with breast cancer by dbDEMC, miRCancer, and published literatures (see Table 3). Because the same miRNA gene may have different identifiers, we can use the alias to verify whether the miRNA is associated with breast cancer. We obtain alias of miRNAs by retrieving the miRBase and GeneCards databases. For example, hsa-mir-371 (the alias of hsa-mir-371a) and has-mir-642 (the alias of hsa-mir-642) can be confirmed to be related to breast cancer by dbDEMC2.

Table 3.

The top-50 predicted miRNAs associated with breast neoplasms.

Rank miRNA Evidence Rank miRNA Evidence
1 has-mir-106a dbDEMC 26 has-mir-372 dbDEMC
2 has-mir-192 dbDEMC 27 has-mir-181d dbDEMC
3 has-mir-449a dbDEMC 28 has-mir-196b dbDEMC
4 has-mir-449b dbDEMC 29 has-mir-532 dbDEMC
5 has-mir-99b dbDEMC 30 has-mir-198 dbDEMC
6 has-mir-483 miRCancer 31 has-mir-370 dbDEMC
7 has-mir-15b dbDEMC 32 has-mir-513b dbDEMC
8 has-mir-376a dbDEMC 33 has-mir-433 dbDEMC
9 has-mir-424 dbDEMC 34 has-mir-513c dbDEMC
10 has-mir-491 PMID:25725194 35 has-mir-362 dbDEMC
11 has-mir-144 dbDEMC 36 has-mir-615 dbDEMC
12 has-mir-181c dbDEMC 37 has-mir-98 dbDEMC
13 has-mir-30e miRCancer 38 has-mir-363 dbDEMC
14 has-mir-498 dbDEMC 39 has-mir-325 dbDEMC
15 has-mir-138 dbDEMC 40 has-mir-509 PMID:25659578
16 has-mir-142 PMID:25406066 41 has-mir-130b dbDEMC
17 has-mir-371a dbDEMC 42 has-mir-154 dbDEMC
18 has-mir-92b dbDEMC 43 has-mir-675 dbDEMC
19 has-mir-184 dbDEMC 44 has-mir-642a dbDEMC
20 has-mir-542 PMID:24846313 45 has-mir-500a dbDEMC
21 has-mir-134 dbDEMC 46 has-mir-548c PMID:25802200
22 has-mir-571 dbDEMC 47 has-mir-331 PMID:25883093
23 has-mir-185 dbDEMC 48 has-mir-381 dbDEMC
24 has-mir-32 dbDEMC 49 has-mir-519b Unconfirmed
25 has-mir-130a dbDEMC 50 has-mir-502 PMID:27080302

Colon neoplasms are a common malignant tumor of the digestive tract occurring in the colon. Various evidences indicate that miRNAs potentially play an important role in predicting markers of early diagnosis, prognosis, and chemosensitivity of colon cancer [66]. DCFMDA has inferred associations between all the candidate miRNAs for colon cancer, and all top-50 candidate miRNAs are confirmed to be associated with colon neoplasms by dbDEMC, miRCancer, and published literatures (see Table 4).

Table 4.

The top-50 predicted miRNAs associated with colon neoplasms.

Rank miRNA Evidence Rank miRNA Evidence
1 has-mir-30e dbDEMC 26 has-mir-340 PMID:24448820
2 has-mir-15b dbDEMC 27 has-mir-20a dbDEMC
3 has-mir-193b dbDEMC 28 has-mir-625 dbDEMC
4 has-mir-373 miRCancer 29 has-mir-486 dbDEMC
5 has-mir-16 PMID:25623762 30 has-mir-370 dbDEMC
6 has-mir-203 dbDEMC 31 has-mir-194 dbDEMC
7 has-mir-192 dbDEMC 32 has-mir-383 dbDEMC
8 has-mir-148a dbDEMC 33 has-mir-146a dbDEMC
9 has-mir-204 dbDEMC 34 has-mir-30b dbDEMC
10 has-mir-106b dbDEMC 35 has-mir-92a dbDEMC
11 has-mir-376b dbDEMC 36 has-mir-223 dbDEMC
12 has-mir-124 dbDEMC 37 has-mir-23b dbDEMC
13 has-mir-122 dbDEMC 38 has-mir-32 dbDEMC
14 has-mir-132 dbDEMC 39 has-mir-497 dbDEMC
15 has-mir-143 dbDEMC 40 has-mir-93 dbDEMC
16 has-mir-10b dbDEMC 41 has-mir-19a dbDEMC
17 has-mir-186 dbDEMC 42 has-mir-34a dbDEMC
18 has-mir-182 dbDEMC 43 has-mir-214 dbDEMC
19 has-mir-429 dbDEMC 44 has-mir-190a dbDEMC
20 has-mir-125b dbDEMC 45 has-mir-107 dbDEMC
21 has-mir-18a dbDEMC 46 has-mir-15a dbDEMC
22 has-mir-372 dbDEMC 47 has-mir-27a dbDEMC
23 has-mir-96 dbDEMC 48 has-mir-31 dbDEMC
24 has-mir-212 dbDEMC 49 has-mir-424 dbDEMC
25 has-mir-19b dbDEMC 50 has-mir-125a dbDEMC

Kidney neoplasms are one of the most rapidly growing malignant tumors. Abnormal expression of miRNAs has been detected in several kinds of kidney cancers. For example, compared with the normal samples, the expression of hsa-mir-194 (the third in Table 5) was reported to be downregulated in kidney neoplasm patients. Literature [67] confirmed that the expression level of hsa-mir-378 (the twelfth in Table 5) is up-regulated in the blood of patients with renal cell carcinoma compared to healthy controls. Predicting miRNAs related with kidney neoplasm by DCFMDA, 49 of the top-50 candidate miRNAs have been validated (see Table 5). Considering the different names of the same miRNA, we can find previous IDs of these miRNAs in the miRBase database. For example, hsa-mir-378a cannot be retrieved to be related with kidney cancer from database dbDEMC and miRCancer, but it can be retrieved by its previous ID has-mir-378.

Table 5.

The top-50 predicted miRNAs associated with kidney neoplasms.

Rank miRNA Evidence Rank miRNA Evidence
1 has-mir-494 dbDEMC 26 has-mir-184 dbDEMC
2 has-mir-130b dbDEMC 27 has-mir-122 dbDEMC
3 has-mir-194 dbDEMC 28 has-mir-15b dbDEMC
4 has-mir-384 dbDEMC 29 has-mir-188 dbDEMC
5 has-mir-24 dbDEMC 30 has-mir-136 dbDEMC
6 has-mir-16 dbDEMC 31 has-mir-145 dbDEMC
7 has-mir-342 dbDEMC 32 has-mir-487a PMID:25938468
8 has-mir-203 dbDEMC 33 has-mir-133b dbDEMC
9 has-mir-150 dbDEMC 34 has-mir-561 dbDEMC
10 has-mir-186 dbDEMC 35 has-mir-125b dbDEMC
11 has-mir-126 dbDEMC 36 has-mir-17 dbDEMC
12 has-mir-378a dbDEMC 37 has-mir-429 dbDEMC
13 has-mir-92b dbDEMC 38 has-mir-214 dbDEMC
14 has-mir-424 dbDEMC 39 has-mir-106a dbDEMC
15 has-mir-20a dbDEMC 40 has-mir-106b dbDEMC
16 has-mir-200a dbDEMC 41 has-mir-23a dbDEMC
17 has-mir-31 dbDEMC 42 has-mir-127 dbDEMC
18 has-mir-372 dbDEMC 43 has-mir-451a dbDEMC
19 has-mir-219 PMID:22440013 44 has-mir-423 dbDEMC
20 has-mir-200b dbDEMC 45 has-mir-223 dbDEMC
21 has-mir-199a dbDEMC 46 has-mir-189 Unconfirmed
22 has-mir-206 dbDEMC 47 has-mir-20b dbDEMC
23 has-mir-138 dbDEMC 48 has-mir-143 dbDEMC
24 has-mir-373 dbDEMC 49 has-mir-132 dbDEMC
25 has-mir-205 miRCancer 50 has-mir-19a dbDEMC

Leukemia is a cancer caused by an overproduction of damaged white blood cells. It is the most common cancer among people under 15 years old. MiRNAs play an important role in the development of leukemia. One of the most typical examples is the association of miR-15a and miR-16a with chronic lymphocytic leukemia. Researchers found that 65% of B cell chronic lymphoblastic leukemia patients have deletions of chromosome 13q14, a locus that includes miR-15a and miR-16a, which consequently present downregulated expression [10]. In our case study, 46 of the 50 candidate miRNAs related to leukemia have been verified by relevant databases (see Table 6). We have verified hsa-mir-323 to be associated with leukemia by database dbDEMC, where hsa-mir-323 is the previous ID of hsa-mir-323a. We are not sure whether the remaining four of the top-50 miRNAs, namely, hsa-mir-302b, hsa-mir-216b, hsa-mir-668, and has-mir-489, are related to leukemia.

Table 6.

The top-50 predicted miRNAs associated with leukemia.

Rank miRNA Evidence Rank miRNA Evidence
1 has-mir-218 PMID:23022987 26 has-mir-494 dbDEMC
2 has-mir-221 dbDEMC 27 has-mir-219 dbDEMC
3 has-mir-222 dbDEMC 28 has-mir-214 PMID:25361012
4 has-mir-302b Unconfirmed 29 has-mir-99a dbDEMC
5 has-mir-452 PMID:29326345 30 has-mir-145 dbDEMC
6 has-mir-128 PMID:22209839 31 has-mir-181b dbDEMC
7 has-let-7e dbDEMC 32 has-mir-489 Unconfirmed
8 has-mir-142 dbDEMC 33 has-mir-146b dbDEMC
9 has-mir-155 dbDEMC 34 has-mir-146a dbDEMC
10 has-mir-197 dbDEMC 35 has-mir-127 dbDEMC
11 has-mir-22 dbDEMC 36 has-let-7i dbDEMC
12 has-mir-148a dbDEMC 37 has-mir-23a dbDEMC
13 has-mir-20b dbDEMC 38 has-mir-203 PMID:21323860
14 has-mir-182 dbDEMC 39 has-mir-181d dbDEMC
15 has-mir-216b Unconfirmed 40 has-mir-655 PMID:26340914
16 has-let-7a dbDEMC 41 has-mir-106b dbDEMC
17 has-mir-504 dbDEMC 42 has-mir-708 dbDEMC
18 has-mir-223 dbDEMC 43 has-mir-423 dbDEMC
19 has-mir-144 dbDEMC 44 has-mir-668 Unconfirmed
20 has-mir-34b dbDEMC 45 has-mir-15b dbDEMC
21 has-let-7d dbDEMC 46 has-let-7c dbDEMC
22 has-mir-224 dbDEMC 47 has-mir-129 dbDEMC
23 has-mir-520h PMID:29768346 48 has-mir-323a dbDEMC
24 has-mir-425 dbDEMC 49 has-mir-23b dbDEMC
25 has-mir-126 dbDEMC 50 has-mir-542 dbDEMC

Lymphomas are the most common ones of hematologic tumors. For the top-50 lymphoma-associated miRNAs predicted by DCFMDA, 49 of them have experimental literature evidence (see Table 7). For example, Literature [68] found that the expression of hsa-mir-223 was downregulated more than twice in diffuse large B-cell lymphoma (DLBCL) .

Table 7.

The top-50 predicted miRNAs associated with lymphoma.

Rank miRNA Evidence Rank miRNA Evidence
1 has-let-7a dbDEMC 26 has-let-7g dbDEMC
2 has-mir-494 dbDEMC 27 has-mir-208b dbDEMC
3 has-mir-338 dbDEMC 28 has-mir-206 dbDEMC
4 has-let-7b dbDEMC 29 has-mir-27b dbDEMC
5 has-mir-93 dbDEMC 30 has-mir-132 dbDEMC
6 has-mir-141 dbDEMC 31 has-mir-223 dbDEMC
7 has-mir-518c Unconfirmed 32 has-mir-208a dbDEMC
8 has-mir-302c dbDEMC 33 has-mir-9 dbDEMC
9 has-mir-302a dbDEMC 34 has-let-7d dbDEMC
10 has-mir-23b dbDEMC 35 has-mir-378a dbDEMC
11 has-mir-145 dbDEMC 36 has-mir-296 dbDEMC
12 has-mir-106b dbDEMC 37 has-mir-96 dbDEMC
13 has-mir-99a dbDEMC 38 has-mir-106a dbDEMC
14 has-let-7e dbDEMC 39 has-mir-483 dbDEMC
15 has-mir-31 dbDEMC 40 has-mir-422a dbDEMC
16 has-mir-103a dbDEMC 41 has-mir-125b dbDEMC
17 has-mir-130b dbDEMC 42 has-mir-152 dbDEMC
18 has-mir-205 dbDEMC 43 has-mir-183 dbDEMC
19 has-mir-192 dbDEMC 44 has-mir-34a dbDEMC
20 has-mir-29a dbDEMC 45 has-mir-33b dbDEMC
21 has-mir-584 dbDEMC 46 has-mir-182 dbDEMC
22 has-let-7f dbDEMC 47 has-mir-302d dbDEMC
23 has-lef-7c dbDEMC 48 has-mir-216b dbDEMC
24 has-mir-424 dbDEMC 49 has-mir-137 dbDEMC
25 has-mir-219 dbDEMC 50 has-mir-375 dbDEMC

4. Conclusion

Predicting disease-related miRNAs will help people understand the underlying pathogenesis of diseases. To overcome the time-consuming and expensive shortcomings of experimental methods, researchers have focused on identifying miRNA-disease potential association by computational methods. Compared with existing methods, the main contribution of our work is to propose a method of predicting potential miRNA-disease association by deep collaborative filtering. In addition to the experimental confirmed miRNA-disease association, our method constructs the miRNA similarity matrix by integrating the miRNA functional similarity and miRNA Gaussian interaction profile kernel similarity and constructs the disease similarity matrix by integrating the disease semantic similarity, disease functional similarity, and disease Gaussian interaction profile kernel similarity. The performance of our method is validated by 10-fold cross-validation and case studies. The experiment results indicate that our method can achieve effective and reliable prediction results. In the future, we will further improve the prediction performance of DCFMDA by the following three aspects. Firstly, considering that the random selection of negative samples may lead to a false negative, we will use an unsupervised deep learning model for prediction. Secondly, our model simply combined the lncRNA similarity and disease similarity as the feature vector of miRNA-disease association, which cannot accurately describe the association features. Lastly, to better integrate various miRNA similarity and disease similarity by weighted average, we will study an optimal weighting strategy so that object similarity matrices can be appropriately constructed.

Acknowledgments

This work is supported by the National Natural Science Foundation of China under Grant Nos. 61962004 and 61462005.

Data Availability

Known miRNA-disease association data were taken from database HMDD 2.0 (http://www.cuilab.cn/hmdd), human microRNA functional similarity (http://www.lirmed.com/misim/), and disease semantic similarity (https://github.com/IMCMDAsourcecode/IMCMDA).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  • 1.Xu P., Guo M., Hay B. A. MicroRNAs and the regulation of cell death. Trends in Genetics. 2004;20(12):617–624. doi: 10.1016/j.tig.2004.09.010. [DOI] [PubMed] [Google Scholar]
  • 2.Cheng A. M., Byrom M. W., Shelton J., Ford L. P. Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis. Nucleic Acids Research. 2005;33(4):1290–1297. doi: 10.1093/nar/gki200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Karp X., Ambros V. Developmental biology. Encountering microRNAs in cell fate signaling. Science. 2005;310(5752):1288–1289. doi: 10.1126/science.1121566. [DOI] [PubMed] [Google Scholar]
  • 4.Miska E. A. How microRNAs control cell division, differentiation and death. Current Opinion in Genetics & Development. 2005;15(5):563–568. doi: 10.1016/j.gde.2005.08.005. [DOI] [PubMed] [Google Scholar]
  • 5.Bartel D. P. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136(2):215–233. doi: 10.1016/j.cell.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tang W., Wan S. X., Yang Z., Teschendorff A. E., Zou Q. Tumor origin detection with tissue-specific miRNA and DNA methylation markers. Bioinformatics. 2018;34(3):398–406. doi: 10.1093/bioinformatics/btx622. [DOI] [PubMed] [Google Scholar]
  • 7.Chen X., Xie D., Zhao Q., You Z. H. MicroRNAs and complex diseases: from experimental results to computational models. Briefings in Bioinformatics. 2019;20(2):515–539. doi: 10.1093/bib/bbx130. [DOI] [PubMed] [Google Scholar]
  • 8.He L., Thomson J. M., Hemann M. T., et al. A microRNA polycistron as a potential human oncogene. Nature. 2005;435(7043):828–833. doi: 10.1038/nature03552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Croce C. M. Causes and consequences of microRNA dysregulation in cancer. Nature Reviews Genetics. 2009;10(10):704–714. doi: 10.1038/nrg2634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Calin G. A., Dumitru C. D., Shimizu M., et al. Nonlinear partial differential equations and applications: frequent deletions and down-regulation of micro-RNA genes mir15 and mir16 at 13q14 in chronic lymphocytic leukemia. Proceedings of the National Academy of Sciences of the United States of America. 2002;99(24):15524–15529. doi: 10.1073/pnas.242606799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gregory P. A., Bert A. G., Paterson E. L., et al. The miR-200 family and miR-205 regulate epithelial to mesenchymal transition by targeting ZEB1 and SIP1. Nature Cell Biology. 2008;10(5):593–601. doi: 10.1038/ncb1722. [DOI] [PubMed] [Google Scholar]
  • 12.Lim Y.-Y., Wright J. A., Attema J. L., et al. Epigenetic modulation of the miR-200 family is associated with transition to a breast cancer stem-cell-like state. Journal of Cell Science. 2013;126(10):2256–2266. doi: 10.1242/jcs.122275. [DOI] [PubMed] [Google Scholar]
  • 13.Kim S. Y., Lee Y. H., Bae Y. S. miR-186, miR-216b, miR-337-3p, and miR-760 cooperatively induce cellular senescence by targeting α subunit of protein kinase CKII in human colorectal cancer cells. Biochemical and Biophysical Research Communications. 2012;429(3-4):173–179. doi: 10.1016/j.bbrc.2012.10.117. [DOI] [PubMed] [Google Scholar]
  • 14.Heegaard N. H. H., Schetter A. J., Welsh J. A., Yoneda M., Bowman E. D., Harris C. C. Circulating micro-RNA expression profiles in early stage nonsmall cell lung cancer. International Journal of Cancer. 2012;130(6):1378–1386. doi: 10.1002/ijc.26153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bang C., Fiedler J., Thum T. Cardiovascular importance of the microRNA-23/27/24 family. Microcirculation. 2012;19(3):208–214. doi: 10.1111/j.1549-8719.2011.00153.x. [DOI] [PubMed] [Google Scholar]
  • 16.Xu Q., Zhu Q., Zhou Z., et al. MicroRNA-876-5p inhibits epithelial-mesenchymal transition and metastasis of hepatocellular carcinoma by targeting BCL6 corepressor like 1. Biomedicine & Pharmacotherapy. 2018;103:645–652. doi: 10.1016/j.biopha.2018.04.037. [DOI] [PubMed] [Google Scholar]
  • 17.Zeng X. X., Zhang X., Zou Q. Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks. Briefings in Bioinformatics. 2016;17(2):193–203. doi: 10.1093/bib/bbv033. [DOI] [PubMed] [Google Scholar]
  • 18.Jiang Q., Hao Y., Wang G., et al. Prioritization of disease microRNAs through a human phenome-microRNAome network. BMC Systems Biology. 2010;4(S1) S1:p. S2. doi: 10.1186/1752-0509-4-s1-s2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Shi H., Xu J., Zhang G., et al. Walking the interactome to identify human miRNA-disease associations through the functional link between miRNA targets and disease genes. BMC Systems Biology. 2013;7(1):p. 101. doi: 10.1186/1752-0509-7-101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Mork S., Pletscher-Frankild S., Palleja Caro A., Gorodkin J., Jensen L. J., et al. Protein-driven inference of miRNA-disease associations. Bioinformatics. 2014;30(3):392–397. doi: 10.1093/bioinformatics/btt677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zou Q., Li J. J., Song S., Zeng X. X., Wang G. H. Similarity computation strategies in the microRNA-disease network: a survey. Briefings in Functional Genomics. 2016;15(1):55–64. doi: 10.1093/bfgp/elv024. [DOI] [PubMed] [Google Scholar]
  • 22.Xuan P., Han K., Guo M., et al. Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PLoS One. 2013;8(8, article e70204) doi: 10.1371/journal.pone.0070204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Chen X., Liu M. X., Yan G. Y. RWRMDA: predicting novel human microRNA-disease associations. Molecular BioSystems. 2012;8(10):2792–2798. doi: 10.1039/c2mb25180a. [DOI] [PubMed] [Google Scholar]
  • 24.Xuan P., Han K., Guo Y., et al. Prediction of potential disease-associated microRNAs based on random walk. Bioinformatics. 2015;31(11):1805–1815. doi: 10.1093/bioinformatics/btv039. [DOI] [PubMed] [Google Scholar]
  • 25.Liu Y., Zeng X., He Z., Zou Q. Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2017;14(4):905–915. doi: 10.1109/TCBB.2016.2550432. [DOI] [PubMed] [Google Scholar]
  • 26.Luo J., Xiao Q. A novel approach for predicting microRNA-disease associations by unbalanced bi-random walk on heterogeneous network. Journal of Biomedical Informatics. 2017;66:194–203. doi: 10.1016/j.jbi.2017.01.008. [DOI] [PubMed] [Google Scholar]
  • 27.Chen M., Liao B., Li Z. Global similarity method based on a two-tier random walk for the prediction of microRNA-disease association. Scientific Reports. 2018;8(1):p. 6481. doi: 10.1038/s41598-018-24532-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chen X., Yan C. C., Zhang X., et al. WBSMDA: within and between score for miRNA-disease association prediction. Scientific Reports. 2016;6(1):p. ???. doi: 10.1038/srep21106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Chen X., Jiang Z. C., Xie D., et al. A novel computational model based on super-disease and miRNA for potential miRNA-disease association prediction. Molecular BioSystems. 2017;13(6):1202–1212. doi: 10.1039/C6MB00853D. [DOI] [PubMed] [Google Scholar]
  • 30.You Z.-H., Huang Z.-A., Zhu Z., et al. PBMDA: a novel and effective path-based computational model for miRNA-disease association prediction. PLoS Computational Biology. 2017;13(3, article e1005455) doi: 10.1371/journal.pcbi.1005455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zeng X., Liu L., Lü L., Zou Q. Prediction of potential disease-associated microRNAs using structural perturbation method. Bioinformatics. 2018;34(14):2425–2432. doi: 10.1093/bioinformatics/bty112. [DOI] [PubMed] [Google Scholar]
  • 32.Chen X., Xie D., Wang L., Zhao Q., You Z. H., Liu H. BNPMDA: bipartite network projection for miRNA-disease association prediction. Bioinformatics. 2018;34(18):3178–3186. doi: 10.1093/bioinformatics/bty333. [DOI] [PubMed] [Google Scholar]
  • 33.Chen X., Qu J., Yin J. TLHNMDA: triple layer heterogeneous network based inference for miRNA-disease association prediction. Frontiers in Genetics. 2018;9:p. 234. doi: 10.3389/fgene.2018.00234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Zhao H., Kuang L., Wang L., et al. Prediction of microRNA-disease associations based on distance correlation set. BMC Bioinformatics. 2018;19(1):p. 141. doi: 10.1186/s12859-018-2146-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chen X., Yan G. Y. Semi-supervised learning for potential human microRNA-disease associations inference. Scientific Reports. 2015;4, article 5501 doi: 10.1038/srep05501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chen X., Huang L. LRSSLMDA: Laplacian regularized sparse subspace learning for miRNA-disease association prediction. PLoS Computational Biology. 2017;13(12, article e1005912) doi: 10.1371/journal.pcbi.1005912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chen X., Wang C. C., Yin J., You Z. H. Novel human miRNA-disease association inference based on random forest. Molecular Therapy - Nucleic Acids. 2018;13:568–579. doi: 10.1016/j.omtn.2018.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Liang C., Yu S., Luo J. Adaptive multi-view multi-label learning for identifying disease-associated candidate miRNAs. PLoS Computational Biology. 2019;15(4, article e1006931) doi: 10.1371/journal.pcbi.1006931. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zhao Y., Chen X., Yin J. Adaptive boosting-based computational model for predicting potential miRNA-disease associations. Bioinformatics. 2019;35(22):4730–4738. doi: 10.1093/bioinformatics/btz297. [DOI] [PubMed] [Google Scholar]
  • 40.Wang L., You Z. H., Chen X., et al. LMTRDA: using logistic model tree to predict miRNA disease associations by fusing multi-source information of sequences and similarities. PLoS Computational Biology. 2019;15(3, article e1006865) doi: 10.1371/journal.pcbi.1006865. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Chen X., Zhu C. C., Yin J. Ensemble of decision tree reveals potential miRNA-disease associations. PLoS Computational Biology. 2019;15(7, article e1007209) doi: 10.1371/journal.pcbi.1007209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Chen X., Clarence Yan C., Zhang X., et al. RBMMMDA: predicting multiple types of disease-microRNA associations. Scientific Reports. 2015;5(1):p. ???. doi: 10.1038/srep13877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Xuan P., Sun H., Wang X., Zhang T. G., Pan S. X. Inferring the disease-associated miRNAs based on network representation learning and convolutional neural networks. International Journal of Molecular Sciences. 2019;20(15):p. 3648. doi: 10.3390/ijms20153648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Zeng X., Wang W., Deng G., Bing J., Zou Q. Prediction of potential disease-associated microRNAs by using neural networks. Molecular Therapy - Nucleic Acids. 2019;7(16):566–575. doi: 10.1016/j.omtn.2019.04.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gong Y., Niu Y., Zhang W., Li X. A network embedding-based multiple information integration method for the miRNA-disease association prediction. BMC Bioinformatics. 2019;20(1):p. 468. doi: 10.1186/s12859-019-3063-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Peng J. A learning-based framework for miRNA-disease association identification using neural networks. Bioinformatics. 2019;35(21):4364–4371. doi: 10.1093/bioinformatics/btz254. [DOI] [PubMed] [Google Scholar]
  • 47.Chen X., Li T. H., Zhao Y., Wang C. C., Zhu C. C. Deep-Belief Network for Predicting Potential miRNA-Disease Associations. Briefings in Bioinformatics. 2020 doi: 10.1093/bib/bbaa186. [DOI] [PubMed] [Google Scholar]
  • 48.Li J. Q., Rong Z. H., Chen X., Yan G. Y., You Z. H. MCMDA: matrix completion for miRNA-disease association prediction. Oncotarget. 2017;8(13):21187–21199. doi: 10.18632/oncotarget.15061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Chen X., Wang L., Qu J., Guan N. N., Li J. Q. Predicting miRNA–disease association based on inductive matrix completion. Bioinformatics. 2018;34(24):4256–4265. doi: 10.1093/bioinformatics/bty503. [DOI] [PubMed] [Google Scholar]
  • 50.Chen X., Sun L. G., Zhao Y. NCMCMDA: miRNA–disease association prediction through neighborhood constraint matrix completion. Briefings in Bioinformatics. 2021;22(1):485–496. doi: 10.1093/bib/bbz159. [DOI] [PubMed] [Google Scholar]
  • 51.Xiao Q., Luo J., Liang C., Cai J., Ding P. A graph regularized nonnegative matrix factorization method for identifying microRNA-disease associations. Bioinformatics. 2018;34(2):239–248. doi: 10.1093/bioinformatics/btx545. [DOI] [PubMed] [Google Scholar]
  • 52.Chen X., Yin J., Qu J., Huang L. MDHGI: matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction. PLoS Computational Biology. 2018;14(8, article e1006418) doi: 10.1371/journal.pcbi.1006418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Cui Z., Liu J. X., Gao Y. L., Zheng C. H., Wang J. RCMF: a robust collaborative matrix factorization method to predict miRNA-disease associations. BMC Bioinformatics. 2019;20(S25):p. 686. doi: 10.1186/s12859-019-3260-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Gao Z., Wang Y. T., Wu Q. W., Ni J. C., Zheng C. H. Graph regularized L2,1-nonnegative matrix factorization for miRNA-disease association prediction. BMC Bioinformatics. 2020;21(1):p. 61. doi: 10.1186/s12859-020-3409-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Li Y., Qiu C., Tu J., et al. HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Research. 2013;42:1070–1074. doi: 10.1093/nar/gkt1023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Wang D., Wang J., Lu M., Song F., Cui Q. H. Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics. 2010;26(13):1644–1650. doi: 10.1093/bioinformatics/btq241. [DOI] [PubMed] [Google Scholar]
  • 57.Cheng L., Li J., Ju P., Peng J., Wang Y. SemFunSim: a new method for measuring disease similarity by integrating semantic and gene functional association. PLoS One. 2014;9(6, article e99415) doi: 10.1371/journal.pone.0099415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lee I., Blom U. M., Wang P. I., Shim J. E., Marcotte E. M. Prioritizing candidate disease genes by network-based boosting of genome-wide association data. Genome Research. 2011;21(7):1109–1121. doi: 10.1101/gr.118992.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Chen X., Yan C. C., Zhang X., You Z. H., Huang Y. A., Yan G. Y. HGIMDA: heterogeneous graph inference for miRNA-disease association prediction. Oncotarget. 2016;7(40):65257–65269. doi: 10.18632/oncotarget.11251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.He X., Liao L., Zhang H., Nie L., Hu X., Chua T. S. Neural collaborative filtering. WWW’17: Proceedings of the 26th International Conference on World Wide Web; 2017; Perth, Australia. pp. 173–182. [DOI] [Google Scholar]
  • 61.Luo X., Zhou M., Xia Y., Zhu Q. An efficient non-negative matrix factorization-based approach to collaborative filtering for recommender systems. IEEE Transactions on Industrial Informatics. 2014;10(2):1273–1284. doi: 10.1109/TII.2014.2308433. [DOI] [Google Scholar]
  • 62.Lu W. Z., Fan H. Y., Lo S. M. Application of evolutionary neural network method in predicting pollutant levels in downtown area of Hong Kong. Neurocomputing. 2003;51:387–400. doi: 10.1016/S0925-2312(02)00623-9. [DOI] [Google Scholar]
  • 63.Krizhevsky A., Sutskever I., Hinton G. ImageNet classification with deep convolutional neural networks. Proceedings of Neural Information Processing Systems; 2012; Lake Tahoe, NV. [Google Scholar]
  • 64.Yang Z., Wu L., Wang A., et al. dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers. Nucleic Acids Research. 2017;45(D1):812–818. doi: 10.1093/nar/gkw1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Xie B., Ding Q., Han H., Wu D. miRCancer: a microRNA–cancer association database constructed by text mining on literature. Bioinformatics. 2013;29(5):638–644. doi: 10.1093/bioinformatics/btt014. [DOI] [PubMed] [Google Scholar]
  • 66.Hollis M., Nair K., Vyas A., Chaturvedi L. S., Gambhir S., Vyas D. MicroRNAs potential utility in colon cancer: early detection, prognosis, and chemosensitivity. World Journal of Gastroenterology. 2015;21(27):8284–8292. doi: 10.3748/wjg.v21.i27.8284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Martina R., Poprach A., Nekvindova J., et al. Circulating miR-378 and miR-451 in serum are potential biomarkers for renal cell carcinoma. Journal of Translational Medicine. 2012;10, article 55 doi: 10.1186/1479-5876-10-55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Imig J., Motsch N., Zhu J. Y., et al. MicroRNA profiling in Epstein-Barr virus-associated B-cell lymphoma. Nucleic Acids Research. 2011;39(5):1880–1893. doi: 10.1093/nar/gkq1043. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Known miRNA-disease association data were taken from database HMDD 2.0 (http://www.cuilab.cn/hmdd), human microRNA functional similarity (http://www.lirmed.com/misim/), and disease semantic similarity (https://github.com/IMCMDAsourcecode/IMCMDA).


Articles from BioMed Research International are provided here courtesy of Wiley

RESOURCES