Abstract
MicroRNAs (miRNAs) play an important role in various biological processes and their abnormal expression could lead to the occurrence of diseases. Exploring the potential relationships between miRNAs and diseases can contribute to the diagnosis and treatment of complex diseases. The increasing databases storing miRNA and disease information provide opportunities to develop computational methods for discovering unobserved disease-related miRNAs, but there are still some challenges in how to effectively learn and fuse information from multi-source data. In this study, we propose a multi-view information fusion based method for miRNA-disease association (MDA)prediction, named MVIFMDA. Firstly, multiple heterogeneous networks are constructed by combining the known MDAs and different similarities of miRNAs and diseases based on multi-source information. Secondly, the topology features of miRNAs and diseases are obtained by using the graph convolutional network to each heterogeneous network view, respectively. Moreover, we design the attention strategy at the topology representation level to adaptively fuse representations including different structural information. Meanwhile, we learn the attribute representations of miRNAs and diseases from their similarity attribute views with convolutional neural networks, respectively. Finally, the complicated associations between miRNAs and diseases are reconstructed by applying a bilinear decoder to the combined features, which combine topology and attribute representations. Experimental results on the public dataset demonstrate that our proposed model consistently outperforms baseline methods. The case studies further show the ability of the MVIFMDA model for inferring underlying associations between miRNAs and diseases.
Keywords: miRNA-disease associations, multi-view, deep learning, graph convolutional networks, convolutional neural networks
1 Introduction
MicroRNAs (miRNAs) are endogenous non-coding RNAs of approximately 21–23 nucleotides that play an important role in the regulation of gene expression (Bartel, 2004). A large number of studies have shown that miRNAs are involved in various biological processes, including metabolism, cell proliferation, cell cycle regulation, and differentiation (Cheng et al., 2005; Miska, 2005; Carleton et al., 2007; Bartel, 2009), and abnormal expression of miRNAs is related to the pathogenesis of various diseases such as cancer (Calin and Croce, 2006; Small and Olson, 2011; Bracken et al., 2016; Metzinger-Le Meuth and Metzinger, 2019; Sereshgi et al., 2019). Considering the important roles of miRNAs in different diseases, the identification of potential associations between miRNAs and diseases is helpful for the understanding of disease pathogenesis and the diagnosis, treatment and prognosis of diseases. Traditional biological assay methods for discovering disease-related miRNAs are time-consuming and expensive. Therefore, with the accumulation of biological data and the improvement of computational power, more and more researchers propose to predict potential miRNA-disease associations (MDAs) by using computational methods.
Based on the hypothesis that miRNAs with similar functions are more likely to be associated with diseases with similar phenotypes and vice versa (Bandyopadhyay et al., 2010), many computational models have been developed to predict MDAs. For example, Jiang et al. (2010) used a hypergeometric distribution model to evaluate the probability scores of unknown MDAs based on a phenome-microRNAome network. But this model only considers the direct neighbor information of each node and ignores the indirect neighbors. Subsequently, Xuan et al. (2013) developed a computational model HDMP based on k most similar neighbors to infer disease-related miRNAs. To improve the prediction result, HDMP puts forward to estimate miRNA functional similarity by integrating the information content of disease terms and phenotypic similarity between diseases. However, HDMP only considers the local information of the network and is not suitable for predicting potential miRNAs for novel diseases without known related miRNAs. Therefore, some methods that consider global network information have been proposed by some researchers. To make good use of structural information, Chen et al. (2012) used the random walk with restart on the miRNA functional similarity network to infer the potential associations between miRNAs and diseases. The algorithm still has the limitation that it is not applicable to new diseases and new miRNAs. Although researchers have proposed many new models (Chen et al., 2016; You et al., 2017; Chen et al., 2018b) to solve this problem, the above similarity-based methods still cannot effectively capture the complex relationships of miRNA-disease pairs.
In addition, matrix completion-based methods are also often used for biomedical link prediction due to their ability to explore the intrinsic and shared structures of heterogeneous data sources (Ou-Yang et al., 2022). Specifically, we predict potential connections by filling in the missing entries of part of the observed matrix when using this method for MDA inference. Li et al. (2017) put forward an efficient matrix completion model to infer novel MDAs, called MCMDA. Subsequently, to solve the problem that MCMDA cannot be used for new diseases, Chen et al. (2018a) designed a new computational model based on inductive matrix completion to predict potential miRNAs associated with diseases, which uses integrated miRNA similarity, disease similarity and validated MDA pairs to complement missing MDAs. Meanwhile, Xiao et al. (2018) presented a graph regularized non-negative matrix factorization method to take full advantage of the intrinsic geometric structure of the data, which enables it to effectively discover potential relationships between miRNAs and diseases, including new diseases and new miRNAs. Some matrix completion-based methods have been developed to infer underlying associations between miRNAs and diseases (Gao et al., 2020; Zhang et al., 2020; Chen et al., 2021).
In recent years, as machine learning methods have been widely used in various fields, some machine learning-based models have been presented to further improve the prediction performance of miRNA-disease potential associations. For example, Xu et al. (2011) calculated four topological features of miRNAs and constructed a support vector machine classifier to reveal the relationships between diseases and miRNAs. Since samples are randomly selected from unknown miRNA-disease relationship pairs as negative samples, these negative samples are unreliable, and they may be positive samples that have not been experimentally verified. Given the limitations of existing methods, Chen and Yan (2014) used a semi-supervised learning-based computational model of regularized least squares to identify miRNAs that may be associated with diseases. This method can be used for diseases without validated relevant miRNAs and avoid the selection of negative samples by using semi-supervision. However, with the rapid growth of biomedical data, traditional machine learning methods are not suitable for complex and changeable data, while deep learning has shown good performance in utilizing unstructured data (Sheng et al., 2022; Zhang et al., 2022). Peng et al. (2019) used auto-encoder to reduce the dimensionality of features and calculated miRNA-disease relationship scores by the convolutional neural network (CNN). Li et al. (2020) proposed to use latent feature representations of miRNAs and diseases, respectively learned by graph convolutional networks (GCNs) (Kipf and Welling, 2016), as input for neural inductive matrix completion to obtain scores for unknown miRNA-disease pairs. Tang et al. (2021) presented a multi-view multichannel attention graph convolutional network (MMGCN) to identify new disease-related miRNAs, which uses GCNs to learn the embeddings of miRNAs and diseases, furthermore adopts multi-channel attention to enhance the learned latent representations. The GNN based on link representation proposed by Kang et al. (2022) employed the GCN to obtain node embeddings and then obtained the improved intermolecular relationship scores according to the designed propagation rule and layer-wise fusing rule. Although there are some methods using deep learning for MDA prediction, many of them ignore the effective learning and fusion of information from different data sources, such as some methods simply utilize one type information, some simply fill in the missing values of one type of information with other types of information, and some ignore information in known associations. Thus, the methods that better utilize multi-source data information to identify underlying disease-related miRNAs should be further explored.
In this study, we present a novel MDA prediction method based on multi-view information fusion (MVIFMDA), which attempts to effectively preserve the topological and attribute information from multi-source data. The basic idea of MVIFMDA is as follows. We firstly use multi-source data to construct the known association network of miRNA-disease and the similarity networks of miRNA and disease, including miRNA sequence similarity network, miRNA functional similarity network, disease semantic similarity network, and disease functional similarity network. And then multiple heterogeneous networks between miRNAs and diseases are constructed based on the association network and similarity networks of miRNA and disease. Secondly, GCNs are employed to learn various topological representations of miRNAs and diseases according to different heterogeneous network views, respectively. Furthermore, the attention strategy at the topology representation level is established to obtain more informative topology embeddings by effectively learning the importance of different topology features. Meanwhile, CNNs are adopted to respectively get the attribute representations of the miRNAs and diseases based on the various miRNA similarity and disease similarity views. Finally, the combined miRNA and disease embeddings are fed into a bilinear decoder to calculate the MDA scores. 5-Fold cross-validation (5-CV) and case studies demonstrate that the MVIFMDA model extracts more information from multiple biological data sources and is suitable for MDA prediction.
2 Materials and methods
In this study, we propose a new multi-view information fusion model named MVIFMDA for MDA prediction. The framework of MVIFMDA is shown in Figure 1. We firstly construct miRNA-disease heterogeneous networks based on known associations and various similarities of miRNA and disease (Figure 1A). Additionally, GCNs are adopted to encode heterogeneous network views including different information, and an attention mechanism is designed to adaptively integrate different topology representations for miRNAs and diseases obtained from GCNs (Figure 1B). Meanwhile, the attribute representations of the miRNAs and diseases are learned by utilizing the CNN encoder (Figures 1C,D). Finally, the bilinear decoder combines the topology and attribute representations of miRNAs and diseases to predict the association scores between miRNAs and diseases.
2.1 Dataset
We downloaded experimentally validated human miRNA-disease relationships from HMDD v3.2 (Huang et al., 2019), which is a curated database. To take full advantage of multiple biomedical data, we obtained the medical subject headings (MeSH) descriptors from the National Library of Medicine (https://www.nlm.nih.gov/), which provides semantic information of diseases through directed acyclic graphs (DAGs). And disease-gene associations and weighted gene-gene relationships were obtained from DisGeNET (Piñero et al., 2020) and HumanNet (Hwang et al., 2019), respectively. In addition, we downloaded the sequence information of miRNAs from miRbase (Kozomara et al., 2019), and the miRNA-gene relationships were got from miRTarBase (Huang et al., 2020). By taking the intersection of data from multiple data sources and merging duplicate data, we finally acquired 12,446 associations among 853 miRNAs and 591 diseases for the next association prediction, furthermore used these processed data to calculate the similarities of miRNAs and diseases and build heterogeneous networks between miRNAs and diseases.
2.2 Construction of heterogeneous networks
2.2.1 Human miRNA-disease associations
We employ the obtained known human relationship pairs between miRNAs and diseases to construct the association matrix , where and represent the number of miRNAs and diseases, respectively. When there is an observed association between miRNA and disease , , that is, there is an edge with the weight of 1 between the miRNA node and the disease node. When there is an unknown or unobserved association between miRNA and disease j, , which means that there is no edge between the two nodes.
2.2.2 Disease semantic similarity
As shown in (Wang et al., 2010a), DAGs can be used to compute the semantic similarity of diseases. For example, a disease can be represented as , where is the disease set composed of all ancestor nodes of disease (including itself), and denotes all direct connections between nodes. In , the semantic contribution of disease to disease is defined as follows:
(1) |
where the semantic contribution factor is set to 0.5 (Wang et al., 2010b), which means that the farther the disease is from the disease , the smaller the semantic contribution to the disease . Then, the semantic similarity performance between disease and disease is calculated as follows:
(2) |
According to Eq. 2, we can finally get the weight matrix of the disease semantic similarity network , where if , it means that there is an edge with a weight of between the node and . On the contrary, when , it indicates that there is no semantically similar edge between the node and .
2.2.3 Disease functional similarity
Driven by the hypothesis that similar disease tendencies interact with similar genes (Xu and Li, 2006; Wei and Liu, 2020), we calculate the functional similarity of diseases utilizing the relationship between disease and gene. The gene functional interaction network can be obtained from HumanNet, where it provides a log-likelihood score (LLS) for each gene interaction to assess the probability of functional connectivity between genes (Lee et al., 2011; Hwang et al., 2019). We obtain the similarity between genes through min-max normalization based on LLS, and then the similarity score between a gene and a set of genes is defined as follows:
(3) |
where represents the functional similarity score between gene and gene . It is defined as follows:
(4) |
where when there is no linkage between the genes and . Finally, we get the functional similarity between disease and disease as follows:
(5) |
where and denote gene sets related to diseases and respectively, and and are the cardinality of the gene sets. Analogously, according to Eq. 5, we finally get the weight matrix of the disease functional similarity network .
2.2.4 miRNA sequence similarity
To measure the similarity of miRNA sequences, we employ the Needleman-Wunsch Algorithm (Needleman and Wunsch, 1970) to quantify the similarity between two miRNAs by sequence alignment. In addition, we normalize the sequence similarity score between miRNA and miRNA to the range of [0,1] by min-max normalization, which can be written as follows:
(6) |
where and represent the minimum and maximum similarity scores among all miRNA sequence pairs, respectively. Similar to the disease similarity network, we obtain a miRNA sequence similarity network with the edge weight matrix .
2.2.5 miRNA functional similarity
Similar to the calculation of disease functional similarity, we utilize the relationships between miRNAs and genes to calculate miRNA functional similarity, which avoids the dependence on known associations between miRNAs and diseases and enables the similarity calculation of new miRNAs (Xiao et al., 2018; Xiao et al., 2021). Analogously, we can define the functional similarity between miRNA and miRNA as follows:
(7) |
where and are gene sets associated with miRNA and , respectively. Similar to the miRNA sequence similarity network construction, we use the calculated miRNA functional similarity score as the edge weight of the network.
In addition, for miRNAs and diseases, different kinds of similarity matrix views obtained from different data sources are considered as their initial attribute feature, which can be used to further learn their attribute representations.
2.2.6 Heterogeneous networks
By integrating the MDA network, two miRNA similarity networks and two disease similarity networks, multiple miRNA-disease heterogeneous networks are constructed, as shown in Figure 1A. As mentioned above, in each heterogeneous network, the weights of the edges between two miRNA nodes and between two disease nodes are equal to the similarity scores between them, respectively, while the edge weight between a miRNA node and a disease node is determined by whether there is a known association between two nodes. Given the MDA matrix and similarity matrices and , we define the adjacency matrices of these heterogeneous networks as follows:
where denotes the transpose of , and and respectively represent the matrix representations of four different heterogeneous network views, which reflect the relationship between miRNAs and diseases and the degree of similarity between nodes of the same type from the perspective of different information sources.
2.3 Multi-view topology representation learning
2.3.1 Topology representations learning by graph convolutional network encoder
Graph convolutional network (GCN) is a powerful tool for learning node embeddings of graph-structured data, which has been proven both theoretically and practically (Zhou et al., 2020). GCN generates a low-dimensional and efficient representation of a node by aggregating the information of the neighbour nodes of the node in the graph and capturing the dependencies between the data. For an undirected graph, the layer-wise propagation rule of a multi-layer GCN can be expressed as follows:
(8) |
Where is the representations of the nodes in the -th layer, denotes the nonlinear activation function, and is the learnable weight matrix that maps the features to the latent space. is the adjacency matrix of the graph, and is the diagonal degree matrix of , , where and represents the number of nodes in the graph.
For the miRNA-disease heterogeneous network views constructed in the previous chapter, we use them as the input of the GCN encoder respectively, and then obtain different embeddings of miRNAs and diseases. Taking the use of GCN to encode the heterogeneous network as an example, we set , and then according to Eq. 8, the first layer of the GCN encoder can be defined as follows:
(9) |
where is the weight matrix input to the hidden layer, represents the dimension of the embedding feature, and the initial embedding . According to Eq. 8, we can get the embedding of the heterogeneous network as follows:
(10) |
where the GCN has layers to learn topology information of the heterogeneous network view.
Similarly, we can obtain the embedding of miRNA and disease according to the miRNA-disease heterogeneous network view in turn, where and is the number of the heterogeneous network views. The obtained embeddings can be represented as:
(11) |
(12) |
(13) |
Furthermore, as shown in Figure 1B, we get topological representations of miRNAs and diseases from different perspectives according to the multi-layer GCN encoder and the next section will describe how to integrate these representations, which contain different structural information.
2.3.2 Topology representations fusing by attention mechanism
The structural information of the input network captured by different GCN layers is different. For instance, the first layer captures the direct connection information between nodes, and by updating the embeddings layer by layer, multi-hop neighbour information can be captured by higher layer embeddings (He et al., 2020; Yu et al., 2021). In addition, the embeddings from different heterogeneous network views are not equally important to explore MDAs. Therefore, we design the attention strategy at the topology representation level to adaptively fuse multiple topology embeddings of miRNAs and diseases learned by GCN encoder. The multiple feature matrices of miRNAs and diseases from the heterogeneous network views are stacked to form a feature tensor . Given the feature tensor of , the attention weight is calculated as follows:
(14) |
(15) |
where , and denote the weight matrix, the bias vector and the topology representation attention vector, respectively. is the information score of the -th topology representation. After obtaining the attention score at topology representation level, we could focus on more important features by combining the topology representations with attention, and the topological features after attention enhancement are expressed as follows:
(16) |
2.4 Multi-view attribute representations learning by convolutional neural network encoder
Convolutional neural network (CNN) can obtain the local message contained in the feature map through multiple convolution kernels, which helps us to use CNN to extract the deep attribute features of miRNAs and diseases from different information sources respectively. We take different kinds of miRNA similarity matrix views as the initial attribute feature of miRNAs and regard different disease similarity matrix views as the initial attribute feature of disease . Given the initial attribute feature of miRNAs nodes, the embedding of -th output channel is expressed as:
(17) |
where represents the -th convolution filter, denotes the bias vector, and is the convolution operator. The final miRNA attribute representation can be got by stacking the output embeddings of multiple channels, where is the number of the output channels. Similarly, as shown in Figure 1D, we can extract the disease attribute representation by leveraging the CNN encoder to the initial attribute feature .
In order to make full use of the information from different data sources, we combine the topological features from multiple miRNA-disease heterogeneous networks learned by GCN encoder and the attribute features from multiple similarity matrices learned by CNN encoder as the final embeddings, which is expressed as follows:
(18) |
(19) |
where represents the concatenation operation, and and are respectively the final embeddings of miRNA nodes and disease nodes.
2.4.1 The reconstruction of miRNA-disease associations and optimization
Although the inner product of node embeddings is often used to predict relationship probabilities between nodes, it is limited in capturing complex associations between nodes. Here, we reconstruct the associations between miRNAs and diseases by introducing a bilinear decoder. Based on the obtained embedding matrices for miRNAs and for diseases, the prediction scores between miRNAs and diseases are calculated as follows:
(20) |
where denotes a learnable matrix, and is the prediction probability that miRNA and disease are associated. The higher the predicted score between miRNA and disease , the more likely that miRNA is correlated with disease .
In this study, we train MVIFMDA to learn the parameters of the model by utilizing the following cross-entropy loss :
(21) |
where and represents the set of positive samples and the set of negative samples, respectively. The known MDA pairs are regarded as the positive samples and the other unobserved pairs are considered as negative samples. In addition, we leverage the Adam optimizer (Kingma and Ba, 2014) to minimize the loss function and train the model end-to-end by a back propagation algorithm.
3 Results
3.1 Experiment settings
In this study, 5-CV is adopted to evaluate the performance of MVIFMDA for identifying candidate disease-associated miRNAs. For 5-CV, all known MDAs (also named positive samples) are randomly divided into five equal parts, and each part in turn is utilized for testing while the remaining is adopted for training. In each fold, the miRNA-disease heterogeneous networks are updated based on new known associations, where the known associations for testing are treated as unobserved associations. MDA prediction can be viewed as a classification task, therefore, several common classification metrics are used to evaluate the prediction performance of MVIFMDA and baseline models, including area under receiver operating characteristic (ROC) curve (AUC), area under the precision-recall (PR) curve (AUPR), accuracy, precision, recall, specificity, and precision rate and recall rate within the top . In addition, we set in the study and repeat the experiments 10 times to get the average of these metrics.
In MVIFMDA, there are several hyperparameters to adjust, such as the number of GCN layer , topology embedding dimension for miRNAs and diseases , attribute embedding dimension and learning rate . By adjusting the parameters empirically, we set the parameter , , , for the MVIFMDA model. In addition, we take the optimal values given by the authors as the parameters of the baseline models.
3.2 Ablation experiments
In the study, we combine GCN encoder and CNN encoder to enhance the feature embeddings of miRNAs and diseases. In order to validate the effectiveness of the main components in MVIFMDA, we designed two variants of MVIFMDA (MVIFMDA-noTR, MVIFMDA-noAR) for the ablation study. MVIFMDA-noTR removes the topology representations of miRNAs and diseases based on the GCN encoder. MVIFMDA-noAR only adopts the topology features without using the attribute representations based on the CNN encoder. The experimental results of MVIFMDA and two variants are shown in Table 1. The results demonstrate that MVIFMDA outperforms the other two variant models on all evaluation metrics. It means that the topological representations obtained by the GCN encoder and the attribute representations learned by the CNN encoder can play a complementary role and the combination of the two can more effectively learn the multi-view information of miRNA and disease nodes from different information sources. For MVIFMDA-noTR and MVIFMDA-noAR, the performance of MVIFMDA-noAR is better, i.e., the topological information extracted from the heterogeneous network views is very useful and the topological level attention mechanism effectively integrates the different structural information. In conclusion, the combination of topological representations from multiple heterogeneous network views learned by the GCN encoder and attribute representations from multiple similarity views learned by the CNN encoder makes our proposed model perform better.
TABLE 1.
Metrics ( ) | MVIFMDA-noTR | MVIFMDA-noAR | MVIFMDA |
---|---|---|---|
AUC | |||
AUPR | |||
accuracy | |||
precision | |||
recall | |||
F1 | |||
precision@5% | |||
precision@10% | |||
recall @5% | |||
recall @10% |
The bold values indicate the best values in rows.
3.3 Comparison with other methods
To demonstrate the performance of MVIFMDA in identifying potential disease-related miRNAs, we compared it with six state-of-the-art approaches that were developed for MDA prediction, including MDHGI (Chen et al., 2018b), ABMDA (Zhao et al., 2019), NIMGSA (Jin et al., 2022), NIMCGCN (Li et al., 2020), DANE-MDA (Ji et al., 2021), MMGCN (Tang et al., 2021).
For a fair comparison, all models are evaluated using 5-CV. Figure 2 and Table 2 show that except that the recall of MVIFMDA is slightly lower than that of MMGCN, all other metrics are significantly higher than the comparison methods, whereas compared to MMGCN, the AUC and AUPR are improved by 1.8 and 4.7%, respectively. One of the possible reasons is that MVIFMDA is able to enhance the representation of nodes by combining topological representations from different heterogeneous network views and attribute representations from different similarity views, which further shows that the design of our model is sound. Compared with MDHGI, ABMDA, NIMGSA, and NIMCGCN, MVIFMDA builds multiple heterogeneous networks with different similarities and learns topological representations from these heterogeneous networks respectively, and uses CNNs to learn high-level features from multiple similarity matrices, which replaces simply combining the multiple similarities into one like the compared methods. Although DANE-MDA considers both the attribute information and topology information, its performance is not as good as MVIFMDA. This may be because of that DANE-MDA simply uses miRNA sequence similarity and disease semantic similarity to obtain node embeddings of miRNAs and diseases, while MVIFMDA learns topological and attribute information from two different similarities more efficiently. Furthermore, although MMGCN uses a multichannel attention mechanism to capture more important topological features from different similarity network views, its performance is also not as good as that of MVIFMDA in addition to the recall, probably because MVIFMDA is capable of combining topological representations learned by GCN and attribute representations learned by CNN to more effectively capture information in multi-view networks. In addition, we further evaluate the performance of MVIFMDA and the comparison methods using the paired t-test based on 10 runs of 5-CV. Table 3 shows that MVIFMDA is significantly preferred to other computational methods in terms of AUC and AUPR ( ).
TABLE 2.
Metrics ( ) | MDHGI | ABMDA | NIMGSA | NIMCGCN | DANE-MDA | MMGCN | MVIFMDA |
---|---|---|---|---|---|---|---|
AUC | |||||||
AUPR | |||||||
accuracy | |||||||
precision | |||||||
recall | |||||||
F1 | |||||||
precision@5% | |||||||
precision@10% | |||||||
recall @5% | |||||||
recall @10% |
The bold values indicate the best values in rows.
TABLE 3.
MVIFMDA versus | MDHGI | ABMDA | NIMGSA | NIMCGCN | DANE-MDA | MMGCN |
---|---|---|---|---|---|---|
-vaule of AUC | 6.7944e-18 | 1.1456e-07 | 6.5493e-14 | 3.5915e-08 | 2.0171e-13 | 9.7205e-09 |
-vaule of AUPR | 2.0333e-19 | 1.3929e-10 | 2.5665e-12 | 4.7847e-08 | 1.7074e-13 | 2.6278e-10 |
3.4 Case studies: Colonic neoplasms, esophageal neoplasms and lymphoma
To further demonstrate the reliability of the MVIFMDA model in real cases, we construct case studies for colonic neoplasms, esophageal neoplasms and lymphoma. All known MDAs are used as positive samples to train the model, then this trained model is used to predict the probability scores of all unknown relationship pairs. For each disease, the predicted scores are sorted in descending order.
The top-rank 20 miRNAs associated with each disease are shown in Tables 4–6, where we use dbDEMC 3.0 (Yang et al., 2017) to validate the candidate MDAs. Colon tumours are the third leading cause of cancer-related deaths in the United States (Siegel et al., 2016). As shown in Table 4, among the top-rank 20 disease-related candidate miRNAs, 18 are identified by dbDEMC, which suggests that these miRNAs are associated with colonic neoplasms. Overexpression of hsa-miR-122 increases the sensitivity of fluorouracil (5-FU)-resistant colon cancer cells to 5-FU through PKM2 downregulation (He et al., 2014). High levels of hsa-miR-122-5p in plasma could suggest liver metastases from colorectal cancer and correlate with poorer recurrence-free survival and overall survival times (Maierthaler et al., 2017; Sun et al., 2020). Colorectal cancer is a collective term for colon cancer and rectal cancer, implying that hsa-miR-122-5p may be associated with the survival prognosis of colon cancer patients. High levels of miR-196a in colorectal cancer can actuate the Akt signaling pathway and accelerate cancer cell metastasis and infiltration (Schimanski et al., 2009; Wang et al., 2010b). Furthermore, it was mentioned in (Ge et al., 2014) that miR-196a in colorectal cancer displays an association with aggressive disease and a detrimental effect on therapeutic outcomes. Esophageal tumours are the major malignant tumours of the digestive system, with the sixth and fourth highest incidence and mortality rates, respectively, among all malignancies. Lymphoma, meanwhile, is a malignant tumour of the lymphatic hematopoietic system, the incidence of which is increasing annually. From Tables 5, 6, it can be seen that the top-rank 20 candidate miRNAs predicted by the MVIFMDA model with regard to esophageal neoplasms and lymphomas can all be confirmed by the dbDEMC dataset. In summary, the case studies further show that our model is effective in inferring new disease-related miRNAs.
TABLE 4.
Rank | miRNA | Evidence | Rank | miRNA | Evidence |
---|---|---|---|---|---|
1 | hsa-mir-122 | Unconfirmed | 11 | hsa-mir-100 | dbDEMC |
2 | hsa-mir-146b | dbDEMC | 12 | hsa-mir-34b | dbDEMC |
3 | hsa-mir-182 | dbDEMC | 13 | hsa-mir-149 | dbDEMC |
4 | hsa-mir-214 | dbDEMC | 14 | hsa-mir-342 | dbDEMC |
5 | hsa-mir-29c | dbDEMC | 15 | hsa-mir-26b | dbDEMC |
6 | hsa-mir-27b | dbDEMC | 16 | hsa-mir-196a-2 | Unconfirmed |
7 | hsa-mir-206 | dbDEMC | 17 | hsa-mir-193b | dbDEMC |
8 | hsa-mir-183 | dbDEMC | 18 | hsa-mir-99a | dbDEMC |
9 | hsa-mir-34c | dbDEMC | 19 | hsa-mir-29b-2 | dbDEMC |
10 | hsa-mir-144 | dbDEMC | 20 | hsa-mir-494 | dbDEMC |
TABLE 5.
Rank | miRNA | Evidence | Rank | miRNA | Evidence |
---|---|---|---|---|---|
1 | hsa-mir-17 | dbDEMC | 11 | hsa-mir-23a | dbDEMC |
2 | hsa-mir-29a | dbDEMC | 12 | hsa-mir-125a | dbDEMC |
3 | hsa-mir-222 | dbDEMC | 13 | hsa-mir-15b | dbDEMC |
4 | hsa-mir-142 | dbDEMC | 14 | hsa-mir-206 | dbDEMC |
5 | hsa-mir-30a | dbDEMC | 15 | hsa-mir-125b-1 | dbDEMC |
6 | hsa-mir-132 | dbDEMC | 16 | hsa-mir-23b | dbDEMC |
7 | hsa-mir-18a | dbDEMC | 17 | hsa-mir-16-1 | dbDEMC |
8 | hsa-mir-200b | dbDEMC | 18 | hsa-let-7d | dbDEMC |
9 | hsa-mir-182 | dbDEMC | 19 | hsa-mir-125b-2 | dbDEMC |
10 | hsa-mir-19b-1 | dbDEMC | 20 | hsa-mir-107 | dbDEMC |
TABLE 6.
Rank | miRNA | Evidence | Rank | miRNA | Evidence |
---|---|---|---|---|---|
1 | hsa-mir-34a | dbDEMC | 11 | hsa-mir-132 | dbDEMC |
2 | hsa-mir-223 | dbDEMC | 12 | hsa-mir-23a | dbDEMC |
3 | hsa-mir-145 | dbDEMC | 13 | hsa-mir-182 | dbDEMC |
4 | hsa-mir-29a | dbDEMC | 14 | hsa-mir-192 | dbDEMC |
5 | hsa-mir-30a | dbDEMC | 15 | hsa-mir-214 | dbDEMC |
6 | hsa-let-7b | dbDEMC | 16 | hsa-mir-15b | dbDEMC |
7 | hsa-mir-195 | dbDEMC | 17 | hsa-mir-183 | dbDEMC |
8 | hsa-mir-106b | dbDEMC | 18 | hsa-let-7c | dbDEMC |
9 | hsa-mir-146b | dbDEMC | 19 | hsa-mir-130a | dbDEMC |
10 | hsa-mir-27a | dbDEMC | 20 | hsa-mir-205 | dbDEMC |
3.5 Prediction of novel diseases
To show the predictive performance of MVIFMDA for new diseases without known relevant miRNAs, we construct another case study for novel diseases in this experiment. When predicting miRNAs relevant to a new disease, we use known relationship pairs other than those associated with the specific disease as positive samples to train the MVIFMDA model and then explore the relationship probabilities between the specific disease and all miRNAs. Based on the descending ranked prediction scores, we use HMDD v3.2 to verify these top-rank 20 candidate MDA pairs.
Breast cancer is the most common cancer worldwide, and miRNAs are considered as new diagnostic and prognostic markers for it. Therefore, here we predict miRNAs associated with breast neoplasms by employing the MVIFMDA model, and as shown in Table 7, 20 of the top 20 miRNAs are validated by the HMDD dataset, which show that our model is good in identifying miRNAs associated with novel diseases.
TABLE 7.
Rank | miRNA | Evidence | Rank | miRNA | Evidence |
---|---|---|---|---|---|
1 | hsa-mir-21 | HMDD | 11 | hsa-mir-210 | HMDD |
2 | hsa-mir-155 | HMDD | 12 | hsa-mir-221 | HMDD |
3 | hsa-mir-146a | HMDD | 13 | hsa-mir-20a | HMDD |
4 | hsa-mir-126 | HMDD | 14 | hsa-mir-19a | HMDD |
5 | hsa-mir-150 | HMDD | 15 | hsa-mir-146b | HMDD |
6 | hsa-mir-223 | HMDD | 16 | hsa-mir-142 | HMDD |
7 | hsa-mir-34a | HMDD | 17 | hsa-mir-143 | HMDD |
8 | hsa-mir-17 | HMDD | 18 | hsa-mir-122 | HMDD |
9 | hsa-mir-145 | HMDD | 19 | hsa-mir-222 | HMDD |
10 | hsa-mir-29a | HMDD | 20 | hsa-mir-22 | HMDD |
4 Conclusion
In this study, we propose a new end-to-end model called MVIFMDA to predict potential MDAs. This model captures topological features in multiple heterogeneous network views by GCN encoder, then adaptively fuses different topological features using an attention mechanism, furthermore employs CNN encoder to extract attribute features from different similarity views of miRNAs and diseases, respectively, finally its prediction performance is improved by combining topological and attribute features. The comparison with six advanced methods for identifying new MDAs and the case studies indicate that MVIFMDA has excellent predictive performance and can perform well in practical applications.
Although MVIFMDA has shown good predictive performance, it still has some issues that need further investigation. First, we use CNN to learn the attribute representations of miRNA and disease node levels, whether the attribute embeddings of miRNA and disease node pairs levels can improve the prediction performance of MDAs needs to be further studied. Second, we use only two similarities of both miRNAs and diseases, and more relevant evidence of miRNA and disease should be used to construct the similarity networks, such as the interaction relationships between miRNAs and lncRNAs and the association relationships between lncRNAs and diseases. In addition, though we consider using gene-related information to calculate the similarity of diseases and miRNAs, a multi-layer network among genes, miRNAs and diseases is not directly constructed to explore the miRNAs correlated with diseases. Therefore, it is still worthwhile to continue investigating how to effectively utilize the information from multiple data sources.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author contributions
YW and XX conceived the prediction method, and they wrote the paper. XX and SZ developed the computer programs. NS analyzed the results. NS, YC, and YF revised the paper. All authors read and approved the published version of the manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (No. 62072212), the Development Project of Jilin Province of China (Nos. 20200401083GX, 2020C003, and 20200403172).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- Bandyopadhyay S., Mitra R., Maulik U., Zhang M. Q. (2010). Development of the human cancer microRNA network. Silence 1 (1), 6–14. 10.1186/1758-907X-1-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartel D. P. (2004). MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell. 116 (2), 281–297. 10.1016/s0092-8674(04)00045-5 [DOI] [PubMed] [Google Scholar]
- Bartel D. P. (2009). MicroRNAs: Target recognition and regulatory functions. Cell. 136 (2), 215–233. 10.1016/j.cell.2009.01.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bracken C. P., Scott H. S., Goodall G. J. (2016). A network-biology perspective of microRNA function and dysfunction in cancer. Nat. Rev. Genet. 17 (12), 719–732. 10.1038/nrg.2016.134 [DOI] [PubMed] [Google Scholar]
- Calin G. A., Croce C. M. (2006). MicroRNA signatures in human cancers. Nat. Rev. Cancer 6 (11), 857–866. 10.1038/nrc1997 [DOI] [PubMed] [Google Scholar]
- Carleton M., Cleary M. A., Linsley P. S. (2007). MicroRNAs and cell cycle regulation. Cell. cycle 6 (17), 2127–2132. 10.4161/cc.6.17.4641 [DOI] [PubMed] [Google Scholar]
- Chen X., Liu M.-X., Yan G.-Y. (2012). Rwrmda: Predicting novel human microRNA–disease associations. Mol. Biosyst. 8 (10), 2792–2798. 10.1039/c2mb25180a [DOI] [PubMed] [Google Scholar]
- Chen X., Sun L.-G., Zhao Y. (2021). Ncmcmda: miRNA–disease association prediction through neighborhood constraint matrix completion. Brief. Bioinform. 22 (1), 485–496. 10.1093/bib/bbz159 [DOI] [PubMed] [Google Scholar]
- Chen X., Wang L., Qu J., Guan N.-N., Li J.-Q. (2018a). Predicting miRNA–disease association based on inductive matrix completion. Bioinformatics 34 (24), 4256–4265. 10.1093/bioinformatics/bty503 [DOI] [PubMed] [Google Scholar]
- Chen X., Yan C. C., Zhang X., You Z.-H., Deng L., Liu Y., et al. (2016). Wbsmda: Within and between score for MiRNA-disease association prediction. Sci. Rep. 6 (1), 21106–21109. 10.1038/srep21106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen X., Yan G.-Y. (2014). Semi-supervised learning for potential human microRNA-disease associations inference. Sci. Rep. 4 (1), 5501–5510. 10.1038/srep05501 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen X., Yin J., Qu J., Huang L. (2018b). Mdhgi: Matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction. PLoS Comput. Biol. 14 (8), e1006418. 10.1371/journal.pcbi.1006418 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng A. M., Byrom M. W., Shelton J., Ford L. P. (2005). Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis. Nucleic Acids Res. 33 (4), 1290–1297. 10.1093/nar/gki200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao Z., Wang Y.-T., Wu Q.-W., Ni J.-C., Zheng C.-H. (2020). Graph regularized L 2, 1-nonnegative matrix factorization for miRNA-disease association prediction. BMC Bioinforma. 21 (1), 61–13. 10.1186/s12859-020-3409-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ge J., Chen Z., Li R., Lu T., Xiao G. (2014). Upregulation of microRNA-196a and microRNA-196b cooperatively correlate with aggressive progression and unfavorable prognosis in patients with colorectal cancer. Cancer Cell. Int. 14 (1), 128–8. 10.1186/s12935-014-0128-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- He J., Xie G., Tong J., Peng Y., Huang H., Li J., et al. (2014). Overexpression of microRNA-122 re-sensitizes 5-FU-resistant colon cancer cells to 5-FU through the inhibition of PKM2 in vitro and in vivo . Cell. biochem. Biophys. 70 (2), 1343–1350. 10.1007/s12013-014-0062-x [DOI] [PubMed] [Google Scholar]
- He X., Deng K., Wang X., Li Y., Zhang Y., Wang M. (2020). “Lightgcn: Simplifying and powering graph convolution network for recommendation,” in Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval), 639–648. [Google Scholar]
- Huang H.-Y., Lin Y.-C.-D., Li J., Huang K.-Y., Shrestha S., Hong H.-C., et al. (2020). miRTarBase 2020: updates to the experimentally validated microRNA–target interaction database. Nucleic Acids Res. 48 (D1), D148–D154. 10.1093/nar/gkz896 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang Z., Shi J., Gao Y., Cui C., Zhang S., Li J., et al. (2019). HMDD v3. 0: A database for experimentally supported human microRNA–disease associations. Nucleic Acids Res. 47 (D1), D1013–D1017. 10.1093/nar/gky1010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hwang S., Kim C. Y., Yang S., Kim E., Hart T., Marcotte E. M., et al. (2019). HumanNet v2: Human gene networks for disease research. Nucleic Acids Res. 47 (D1), D573–D580. 10.1093/nar/gky1126 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ji B.-Y., You Z.-H., Wang Y., Li Z.-W., Wong L. (2021). DANE-MDA: Predicting microRNA-disease associations via deep attributed network embedding. Iscience 24 (6), 102455. 10.1016/j.isci.2021.102455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang Q., Hao Y., Wang G., Juan L., Zhang T., Teng M., et al. (2010). Prioritization of disease microRNAs through a human phenome-microRNAome network. BMC Syst. Biol. 4 (1), S2–S9. 10.1186/1752-0509-4-S1-S2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin C., Shi Z., Lin K., Zhang H. (2022). Predicting miRNA-disease association based on neural inductive matrix completion with graph autoencoders and self-attention mechanism. Biomolecules 12 (1), 64. 10.3390/biom12010064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kipf T. N., Welling M. (2016). Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 . [Google Scholar]
- Kang C., Zhang H., Liu Z., Huang S., Yin Y. (2022). LR-GNN: A graph neural network based on link representation for predicting molecular associations. Brief. Bioinform. 23 (1), bbab513. 10.1093/bib/bbab513 [DOI] [PubMed] [Google Scholar]
- Kingma D. P., Ba J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . [Google Scholar]
- Kozomara A., Birgaoanu M., Griffiths-Jones S. (2019). miRBase: from microRNA sequences to function. Nucleic Acids Res. 47 (D1), D155–D162. 10.1093/nar/gky1141 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee I., Blom U. M., Wang P. I., Shim J. E., Marcotte E. M. (2011). Prioritizing candidate disease genes by network-based boosting of genome-wide association data. Genome Res. 21 (7), 1109–1121. 10.1101/gr.118992.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J.-Q., Rong Z.-H., Chen X., Yan G.-Y., You Z.-H. (2017). Mcmda: Matrix completion for MiRNA-disease association prediction. Oncotarget 8 (13), 21187–21199. 10.18632/oncotarget.15061 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J., Zhang S., Liu T., Ning C., Zhang Z., Zhou W. (2020). Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction. Bioinformatics 36 (8), 2538–2546. 10.1093/bioinformatics/btz965 [DOI] [PubMed] [Google Scholar]
- Maierthaler M., Benner A., Hoffmeister M., Surowy H., Jansen L., Knebel P., et al. (2017). Plasma miR‐122 and miR‐200 family are prognostic markers in colorectal cancer. Int. J. Cancer 140 (1), 176–187. 10.1002/ijc.30433 [DOI] [PubMed] [Google Scholar]
- Metzinger-Le Meuth V., Metzinger L. (2019). miR-223 and other miRNA's evaluation in chronic kidney disease: innovative biomarkers and therapeutic tools. Noncoding. RNA Res. 4 (1), 30–35. 10.1016/j.ncrna.2019.01.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miska E. A. (2005). How microRNAs control cell division, differentiation and death. Curr. Opin. Genet. Dev. 15 (5), 563–568. 10.1016/j.gde.2005.08.005 [DOI] [PubMed] [Google Scholar]
- Needleman S. B., Wunsch C. D. (1970). A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48 (3), 443–453. 10.1016/0022-2836(70)90057-4 [DOI] [PubMed] [Google Scholar]
- Ou-Yang L., Lu F., Zhang Z.-C., Wu M. (2022). Matrix factorization for biomedical link prediction and scRNA-seq data imputation: An empirical survey. Brief. Bioinform. 23 (1), bbab479. 10.1093/bib/bbab479 [DOI] [PubMed] [Google Scholar]
- Peng J., Hui W., Li Q., Chen B., Hao J., Jiang Q., et al. (2019). A learning-based framework for miRNA-disease association identification using neural networks. Bioinformatics 35 (21), 4364–4371. 10.1093/bioinformatics/btz254 [DOI] [PubMed] [Google Scholar]
- Piñero J., Ramírez-Anguita J. M., Saüch-Pitarch J., Ronzano F., Centeno E., Sanz F., et al. (2020). The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 48 (D1), D845–D855. 10.1093/nar/gkz1021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schimanski C. C., Frerichs K., Rahman F., Berger M., Lang H., Galle P. R., et al. (2009). High miR-196a levels promote the oncogenic phenotype of colorectal cancer cells. World J. Gastroenterol. 15 (17), 2089–2096. 10.3748/wjg.15.2089 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sereshgi M. M. A., Abdollahpour-Alitappeh M., Mahdavi M., Ranjbar R., Ahmadi K., Taheri R. A., et al. (2019). Immunologic balance of regulatory T cell/T helper 17 responses in gastrointestinal infectious diseases: Role of miRNAs. Microb. Pathog. 131, 135–143. 10.1016/j.micpath.2019.03.029 [DOI] [PubMed] [Google Scholar]
- Sheng N., Huang L., Wang Y., Zhao J., Xuan P., Gao L., et al. (2022). Multi-channel graph attention autoencoders for disease-related lncRNAs prediction. Brief. Bioinform. 23 (2), bbab604. 10.1093/bib/bbab604 [DOI] [PubMed] [Google Scholar]
- Siegel R. L., Miller K. D., Jemal A. (2016). Cancer statistics, 2016. Ca. Cancer J. Clin. 66 (1), 7–30. 10.3322/caac.21332 [DOI] [PubMed] [Google Scholar]
- Small E. M., Olson E. N. (2011). Pervasive roles of microRNAs in cardiovascular biology. Nature 469 (7330), 336–342. 10.1038/nature09783 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun L., Liu X., Pan B., Hu X., Zhu Y., Su Y., et al. (2020). Serum exosomal miR-122 as a potential diagnostic and prognostic biomarker of colorectal cancer with liver metastasis. J. Cancer 11 (3), 630–637. 10.7150/jca.33022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang X., Luo J., Shen C., Lai Z. (2021). Multi-view multichannel attention graph convolutional network for miRNA–disease association prediction. Brief. Bioinform. 22 (6), bbab174. 10.1093/bib/bbab174 [DOI] [PubMed] [Google Scholar]
- Wang D., Wang J., Lu M., Song F., Cui Q. (2010a). Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics 26 (13), 1644–1650. 10.1093/bioinformatics/btq241 [DOI] [PubMed] [Google Scholar]
- Wang Y. X., Zhang X. Y., Zhang B. F., Yang C. Q., Chen X. M., Gao H. J. (2010b). Initial study of microRNA expression profiles of colonic cancer without lymph node metastasis. J. Dig. Dis. 11 (1), 50–54. 10.1111/j.1751-2980.2009.00413.x [DOI] [PubMed] [Google Scholar]
- Wei H., Liu B. (2020). iCircDA-MF: identification of circRNA-disease associations based on matrix factorization. Brief. Bioinform. 21 (4), 1356–1367. 10.1093/bib/bbz057 [DOI] [PubMed] [Google Scholar]
- Xiao Q., Luo J., Liang C., Cai J., Ding P. (2018). A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations. Bioinformatics 34 (2), 239–248. 10.1093/bioinformatics/btx545 [DOI] [PubMed] [Google Scholar]
- Xiao Q., Zhang N., Luo J., Dai J., Tang X. (2021). Adaptive multi-source multi-view latent feature learning for inferring potential disease-associated miRNAs. Brief. Bioinform. 22 (2), 2043–2057. 10.1093/bib/bbaa028 [DOI] [PubMed] [Google Scholar]
- Xu J., Li C.-X., Lv J.-Y., Li Y.-S., Xiao Y., Shao T.-T., et al. (2011). Prioritizing candidate disease miRNAs by topological features in the miRNA target–dysregulated network: Case study of prostate cancer. Mol. Cancer Ther. 10 (10), 1857–1866. 10.1158/1535-7163.MCT-11-0055 [DOI] [PubMed] [Google Scholar]
- Xu J., Li Y. (2006). Discovering disease-genes by topological features in human protein–protein interaction network. Bioinformatics 22 (22), 2800–2805. 10.1093/bioinformatics/btl467 [DOI] [PubMed] [Google Scholar]
- Xuan P., Han K., Guo M., Guo Y., Li J., Ding J., et al. (2013). Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PloS one 8 (8), e70204. 10.1371/journal.pone.0070204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Z., Wu L., Wang A., Tang W., Zhao Y., Zhao H., et al. (2017). dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers. Nucleic Acids Res. 45 (D1), D812–D818. 10.1093/nar/gkw1079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- You Z.-H., Huang Z.-A., Zhu Z., Yan G.-Y., Li Z.-W., Wen Z., et al. (2017). Pbmda: A novel and effective path-based computational model for miRNA-disease association prediction. PLoS Comput. Biol. 13 (3), e1005455. 10.1371/journal.pcbi.1005455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu Z., Huang F., Zhao X., Xiao W., Zhang W. (2021). Predicting drug–disease associations through layer attention graph convolutional network. Brief. Bioinform. 22 (4), bbaa243. 10.1093/bib/bbaa243 [DOI] [PubMed] [Google Scholar]
- Zhang S., Ma A., Zhao J., Xu D., Ma Q., Wang Y. (2022). Assessing deep learning methods in cis-regulatory motif finding based on genomic sequencing data. Brief. Bioinform. 23 (1), bbab374. 10.1093/bib/bbab374 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Z.-C., Zhang X.-F., Wu M., Ou-Yang L., Zhao X.-M., Li X.-L. (2020). A graph regularized generalized matrix factorization model for predicting links in biomedical bipartite networks. Bioinformatics 36 (11), 3474–3481. 10.1093/bioinformatics/btaa157 [DOI] [PubMed] [Google Scholar]
- Zhao Y., Chen X., Yin J. (2019). Adaptive boosting-based computational model for predicting potential miRNA-disease associations. Bioinformatics 35 (22), 4730–4738. 10.1093/bioinformatics/btz297 [DOI] [PubMed] [Google Scholar]
- Zhou J., Cui G., Hu S., Zhang Z., Yang C., Liu Z., et al. (2020). Graph neural networks: A review of methods and applications. AI Open 1, 57–81. 10.1016/j.aiopen.2021.01.001 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.