Abstract
The accurate prediction of potential associations between microRNAs (miRNAs) and small molecule (SM) drugs can enhance our knowledge of how SM cures endogenous miRNA-related diseases. Given that traditional methods for predicting SM-miRNA associations are time-consuming and arduous, a number of computational models have been proposed to anticipate the potential SM–miRNA associations. However, several of these strategies failed to eliminate noise from the known SM-miRNA association information or failed to prioritize the most significant known SM-miRNA associations. Therefore, we proposed a model of Graph Convolutional Network with Layer Attention mechanism for SM-MiRNA Association prediction (GCNLASMMA). Firstly, we obtained the new SM-miRNA associations by matrix decomposition. The new SM-miRNA associations, as well as the integrated SM similarity and miRNA similarity were subsequently incorporated into a heterogeneous network. Finally, a graph convolutional network with an attention mechanism was used to compute the reconstructed SM-miRNA association matrix. Furthermore, four types of cross validations and two types of case studies were performed to assess the performance of GCNLASMMA. In cross validation, global Leave-One-Out Cross Validation (LOOCV), miRNA-fixed LOOCV, SM-fixed LOOCV and 5-fold cross-validation achieved excellent performance. Numerous hypothesized associations in case studies were confirmed by experimental literatures. All of these results confirmed that GCNLASMMA is a trustworthy association inference method.
Keywords: microRNA, small molecule, deep learning, association prediction, matrix decomposition
1 Introduction
As a form of non-coding RNA (ncRNA), MicroRNA (miRNA), is roughly 22 nucleotides in length (Bartel, 2004; Hammond, 2015; Lu and Rothenberg, 2018). Lin-4 was the first human miRNA identified in1993 by Lee et al. in Caenorhabditis elegans (Lee et al., 1993; Wightman et al., 1993). With the advent of high-throughput sequencing technologies, an increasing number of miRNAs with important functions in human gene expression have been identified (Denzler et al., 2016; Tagliafierro et al., 2017; Thomou et al., 2017; Gam et al., 2018; Ghini et al., 2018; Liu et al., 2018). Specifically, miRNAs can attach to the 3′ UnTranslated Region (3’ UTR) of target messenger RNAs (mRNAs) via base-pairing to control the degradation of target mRNAs and limit the translation of target mRNAs, hence regulating gene expression (Gorbea et al., 2017). In the control of target mRNA gene expression by miRNA, one miRNA may regulate many target mRNAs, or numerous miRNAs regulate one target mRNA (Saikia et al., 2020; Iwata et al., 2021; Zhong et al., 2021). Several studies demonstrated the role of miRNAs in the maturation of immune cells (Kumar Kingsley and Vishnu Bhat, 2017). Since the profound impact of miRNAs on biological development became apparent, numerous miRNA types have been identified to be involved in biological evolutionary processes (Rupaimoole and Slack, 2017; Cristino et al., 2019).
Small Molecule (SM) drugs are mostly composed of molecules with molecular weights typically fewer than 1,000 g/mol. More than 98 percent of today’s drugs are SMs (Geng and Craig, 2021). The development of SMs that target miRNAs is a current trend in drug research (Dai and Tan, 2015; Yu et al., 2020). In previous drug development, protein enzymes and receptors were typically employed as therapeutic targets. Over 80 percent of drug development was intimately tied to protein enzymes and receptors (Deyle et al., 2017; Yekkirala et al., 2017; Nair et al., 2018; Lai-Kwon et al., 2021). In recent years, more scientific experiments have proven inextricable linkages between SMs and miRNAs (Healy et al., 2012; Monroig et al., 2015; Haniff et al., 2021). When miRNAs fail to regulate the gene expression of an organism, specific disorders such as cardiovascular diseases, neurological diseases and cancers may develop (Kumari et al., 2018; Xia et al., 2019; Dragomir et al., 2021). In addition, SMs are effective in regulating miRNA dysregulation to treat linked endogenous disorders, and numerous SMs have been created for clinical therapy of these diseases (Dragomir et al., 2021).
The development of novel SMs is facilitated by the accurate identification of miRNA-related SMs. Recent studies have focused on discovering possible associations between SMs and miRNAs (Chen et al., 2021; Li et al., 2021; Wang et al., 2021). Early identification approaches used high-throughput screening methods, such as mass spectrometry, fluorescence and reporter genes (Seth et al., 2005; Parsons et al., 2009; Carnevali et al., 2010; Chen et al., 2012). The most frequent method for discovering potential SM-miRNA associations is the reporter genes. On the basis of the reporter genes, a functional novel drug screening method capable of screening lead compounds was proposed. By substituting biomacromolecules with tiny organic compounds, the screening process for drugs could be expedited dramatically. The use of tiny organic compounds throughout the screening procedure could provide information on the functional responses of cells. (Wen et al., 2015). In drug screening research, luciferase reporter genes satisfy the requirements for high sensitivity, target specificity and high throughput (Thorne et al., 2010).
However, it was discovered that biological screening approaches are stochastic and time-consuming. With the proliferation of bioinformatics databases, the number of known SM-miRNA associations increased, as did the calculational methodologies for SM and miRNA similarity. Consequently, machine learning techniques obtained more precise prediction outcomes (Qu et al., 2019). Bioinformaticians have begun to employ machine learning techniques to predict probable SM-miRNA associations to circumvent time-consuming and labor-intensive biological investigations (Wang and Chen, 2019; Wang et al., 2019).
Among the previous methods for predicting probable SM-miRNA associations, (Qu et al., 2018), developed a model titled Triple Layer Heterogeneous Network based Small Molecule-MiRNA Association prediction (TLHNSMMA). TLHNSMMA first merged the known SM-miRNA associations, SM similarity and miRNA similarity into a three-layer heterogeneous network. The three-layer heterogeneous graph was then implemented with an iterative updating algorithm. Finally, the reconstructed SM-miRNA association matrix was obtained using an iterative propagation approach that made extensive use of global data. Based on the establishment of a three-layer SM-miRNA heterogeneous network, (Liu et al., 2020), suggested a novel model for potential SM-miRNA association prediction called Random Walk with Negative Samples (RWNS). Firstly, RWNS obtained integrated similarities of SM and miRNA. Then, Liu et al. devised a Credible Negative Sample extraction method (CNSMiRS) to extract plausible negative SM-miRNA samples under the premise that dissimilar SMs/miRNAs are unlikely to be associated with each other’s related miRNAs/SMs. Finally, the reconstructed SM-miRNA association matrix was obtained by implementing a random walk algorithm on the constructed small molecule-disease-miRNA association network. However, the performance of TLHNSMMA and RWNS is dependent on the known SM-miRNA association adjacency matrix. Consequently, (Yin et al., 2019), suggested a model of Sparse Learning and Heterogeneous Graph Inference for Small Molecule-MiRNA Association prediction (SLHGISMMA). Yin et al. first used matrix decomposition on known SM-miRNA associations to obtain the new SM-miRNA associations. Then, the new SM-miRNA associations, integrated miRNA similarity and integrated SM similarity were incorporated into a heterogeneous network. Finally, the reconstructed SM-miRNA association matrix was obtained using heterogeneous graph inference. Chen et al. (2021) recently proposed the Bounded Nuclear Norm Regularization for SM–miRNA Associations prediction (BNNRSMMA), which treated the problem of potential SM-miRNA association prediction as a matrix complementation problem. In addition, BNNRSMMA included a regularization term to remove the negative effects of data noise.
In recent years, improvements have been made to machine learning techniques, and deep learning has emerged as one of the brightest new stars (Wang et al., 2020). Deep learning has achieved exceptional results in traditional classification tasks, such as handwritten font recognition (Singh et al., 2021), computer vision (Borges Oliveira et al., 2021) and computational biology (Angermueller et al., 2016). In addition, deep learning has substantially affected the field of potential association prediction. For example, zeng et al. proposed a computational framework termed AOPEDF based on drug-target network and deep forest algorithm to predict potential drug-target associations (Zeng et al., 2020). AOPEDF attained excellent performance in identifying molecular targets among known drugs on two external validation datasets by comparison to other machine learning methods. Therefore, we proposed a model of Graph Convolutional Network with Layer Attention mechanism for SM-MiRNA Association prediction (GCNLASMMA). To evaluate the performance of GCNLASMMA, we used two types of cross validation, namely, 5-fold cross-validation and Leave-One-Out Cross Validation (LOOCV). Additionally, we also utilized two types of case studies to confirm the effectiveness of GCNLASMMA in identifying potential miRNAs for investigated SMs. The results showed that GCNLASMMA could accurately and effectively predict the SM-miRNA pairs most likely to be potentially associated.
2 Materials and methods
2.1 SM-miRNA associations
We named two datasets used in our work after dataset1 and dataset2. Eight hundred and thirty-one SMs in dataset1 were downloaded from three databases, namely SM2miR, DrugBank (Knox et al., 2011) and PubChem (Wang et al., 2009). Five hundred and forty-one miRNAs were downloaded from four databases, namely SM2miR, HMDD (Li et al., 2014), miR2Disease (Jiang et al., 2009) and PhenomiR (Ruepp et al., 2010). Six hundred and sixty-four known SM-miRNA associations were downloaded from a database, namely SM2miR V1.0 (Liu et al., 2013). On the basis of dataset1, we removed the SMs and miRNAs that did not constitute any known association. Then, we obtained dataset2 which included 286 different miRNAs, 39 different SMs and 664 known SM-miRNA association pairs. Specifically, the known SM-miRNA association between the SM and the miRNA was stored as follows.
2.2 Integration of SM similarities
The integrated SM similarity was calculated by (Lv et al., 2015). In his method, a total of four SM similarities were used, namely SM side effect similarity (Gottlieb et al., 2011), gene functional consistency-based similarity for SMs (Lv et al., 2012), SM chemical structure similarity (Hattori et al., 2003) and disease phenotype-based similarity for SMs (Gottlieb et al., 2011). In Lv’s article, the side effect properties of SM were first downloaded from SIDe Effect Resource (SIDER) and calculated by Jaccard score to obtain SMs side effect similarities (Gottlieb et al., 2011). The calculation of gene functional consistency-based similarities for SMs was implemented on the target genes of SMs obtained from the DrugBank and Therapeutic Targets Database (TTD) (Liu et al., 2011). The Gene Set Functional Similarity (GSFS) method was given in the previous article (Lv et al., 2012). Specifically, we downloaded the SM chemical structure information. Then, a graph-based method, SIMilar COMPound (SIMCOMP) (Lv et al., 2012), was applied to obtain SMs’ chemical structure similarities. Finally, the disease phenotype-based similarities for SMs were obtained by calculating the data downloaded from the DrugBank and TTD with the Jaccard score method.
After obtaining all four SM similarities, we named them after , , and , respectively. Then, the scores of the four SM similarities were integrated by the following formula,
(1) |
where represents the weights of SM similarities. All of the measures are important in terms of biology. Thus, we set the values of all to 1, which means that each SM similarity made an equal contribution to constituting the integrated SM similarity (Li et al., 2004). Finally, the integrated SM similarity between the and SMs was obtained after normalization as follows.
(2) |
2.3 Integration of miRNA similarities
Two miRNA similarities, gene function consistency-based similarity (Lv et al., 2012) and indication phenotype-based similarity (Gottlieb et al., 2011), were used to obtain integrated miRNA similarity. Specifically, we downloaded the target scores of each miRNA from the database TargetScan (Agarwal et al., 2015) and obtained gene function consistency-based similarity using the GSFS method (Lv et al., 2012). The indication phenotype-based similarity was obtained from the Human MicroRNA Disease Database (HMDD) version 2.0 (v 2.0), miR2Disease and PhenomiR databases using the GSFS method. Then, we combined the gene function consistency-based similarity and the indication phenotype-based similarity using the Jaccard score. Then, we named the two kinds of miRNA similarities after and , respectively. Moreover, the integrated miRNA similarity was obtained by the following equation,
(3) |
where and represent the weights of miRNA similarities. Also, we set the values of and to 1, which means each miRNA similarity made an equal contribution to constituting the integrated miRNA similarity. Finally, the integrated miRNA similarity between the and miRNAs was obtained after normalization as follows.
(4) |
2.4 GCNLASMMA
GCNLASMMA was separated into two steps. The known SM-miRNA association was initially decomposed and reconstructed to obtain the new SM-miRNA association . The reconstructed SM-miRNA association matrix was then obtained by calculating the new SM-miRNA association using a graph convolutional network with an attention mechanism. More specifically, we obtained the new SM-miRNA associations by matrix decomposition. Then, the new SM-miRNA association matrix, integrated SM similarity and integrated miRNA similarity were constructed into a heterogeneous network. Finally, the graph convolutional network with layer attention mechanism was applied to obtain the reconstructed SM-miRNA association matrix. GCNLASMMA is a model of a neural network with more hidden layers than other networks. The multi-layer calculation thoroughly considered the known features and avoided overfitting. Moreover, the attention mechanism extracted significant information from each layer, thereby improving the accuracy of association prediction (Niu et al., 2021). The specific flow chart of GCNLASMMA is shown in Figure 1.
FIGURE 1.
The flow chart of potential SM-miRNA association prediction based on GCNLASMMA. Firstly, the matrix decomposition is applied to obtain the new SM-miRNA associations. Then the new SM-miRNA associations, integrated SM similarity and integrated miRNA similarity are constructed into an SM-miRNA association heterogeneous network. Finally, a graph convolutional network with layer attention mechanism is applied to obtain the reconstructed SM-miRNA association matrix.
2.4.1 Matrix decomposition
The existence of noise in known SM-miRNA associations tends to reduce prediction accuracy. Prior research has demonstrated that hidden features with considerable value can be extracted by applying dimension-reduction and noise-reduction to the data (Vidal, 2011). A low-rank matrix is a tool for efficiently obtaining hidden features with significant values (Peng et al., 2012). Therefore, we used matrix decomposition to learn a low-rank matrix from the known SM-miRNA association . The decomposition of was performed as follows:
(5) |
Since the above equation contains an infinite number of solutions, we applied the constraint to turn it into:
(6) |
where , . In Eq. 6, the nuclear norm and sparse norm were applied to constrain and , which allowed and to be low-rank and sparse matrices, respectively. The balance parameter of low-rank and sparse matrices was set to 0.1. According to earlier research, if in Eq. 6 is transformed into an identity matrix, then the model is degenerated to the Robust Principal Component Analysis (RPCA), a convex optimization problem with constraints (Chandrasekaran et al., 2009).
(7) |
Based on the previous work (Meng et al., 2014), Eq. 7 can be converted into an unconstrained optimization problem. Therefore, the problem can be resolved using the Exact Augmented Lagrange Multipliers (EALM) algorithm.
(8) |
In Eq. 8, the penalty parameter . According to the Inexact Augmented Lagrange Multipliers (IALM) algorithm (See Table 1), we fixed other variables and solved the minimum value of , and by updating the Lagrange multipliers and . Moreover, we defined and as the solution of Eq. 8. represents the similarity matrix of miRNA or SM. represents the noise matrix. Then, the new SM-miRNA association was expressed as:
(9) |
TABLE 1.
The illustration of the IALM algorithm.
Algorithm: Inexact augmented lagrange multipliers |
---|
Input: Known SM-miRNA association matrix and |
Initialize: |
While true |
1. Fix others and |
2. Fix others and |
3. Fix others and |
4. Update ; |
5. Update |
If and |
End while |
Output: and |
2.4.2 SM-miRNA heterogeneous network
In this study, the new SM-miRNA association , integrated SM similarity and integrated miRNA similarity were combined into a heterogeneous network. There would be a known association between the SM and the miRNA if element in equaled 1. represented the integrated similarity between the SM and the SM. represented the integrated similarity between the miRNA and the miRNA. The specific equation of the heterogeneous network construction is as follows:
(10) |
where represents the transpose matrix of . In Eq. 10, we normalized the similarity matrix of SM and miRNA by and , respectively. Specifically, and .
2.4.3 Graph convolutional network
As classic network models, Long-Short Term Memory (LSTM) and Convolution Neural Network (CNN) are only applicable to grid-structured data. Nevertheless, the Graph Convolutional Network (GCN) can manage data with generalized topological graph structures and deeply explore the features of the data (Habib and Qureshi, 2020). In this paper, we constructed GCNLASMMA, which is a model for graph convolution of biological information. Specifically, GCN was implemented on the SM-miRNA heterogeneous network that was constructed by the known SM-miRNA associations, SM similarities and miRNA similarities. GCN is a neural network structure consisting of an input layer, an output layer and many hidden layers that can represent nodes in a low-dimensional manner. Each hidden layer of GCN takes the output of the previous layer as input. The graph convolutional network propagation rule is as follows:
(11) |
In Eq. 11, and denote the embeddings of nodes in the and layers, respectively. is a diagonal matrix of input graph , represents the trainable weight matrix with a layer-specific value, denotes the nonlinear activation function.
In the encoder part, to learn low-dimensional representations of miRNAs and SMs, we combined the new SM-miRNA association, integrated SM similarity and integrated miRNA similarity into SM-miRNA association heterogeneous network . Firstly, we set a penalty factor in the input graph during the propagation process as follows:
(12) |
Then, we initialized the input layer embeddings as:
(13) |
In this way, we obtained the propagation formula for the first layer from Eqs 11, 13:
(14) |
In Eq. 12, is a weight matrix that acts only between the input layer and the first hidden layer. is the first-layer embeddings of the heterogeneous network , is the dimension of the embeddings. Similarly, the propagation rules for the subsequent layers of the GCN encoder followed Eq. 11, where . After iterations, we obtained embeddings from different graph convolution layers. Exponential linear elements were used as nonlinear activation functions in the graph convolution layer, which sped up the learning process and significantly improved the generalization performance.
In addition, we tried several different combinations of parameters from the range , . By adjusting the parameters empirically, we set the dimensions of embeddings , the number of layers , the initial learning rate of optimizer , the total training epochs , the two dropout rates and , the penalty factor on both dataset1 and dataset2.
2.4.4 Layer attention mechanism
In addition, the layer attention mechanism was added to this model by introducing an attention mechanism between each layer and storing the position information in . As a help for the attention mechanism, we extracted the pertinent information straight from the source data when constructing the embeddings of each layer output during the decoding process. Through this mechanism, we obtained the final SM embeddings and final miRNA embeddings from the fully connected layer: , where represents the final embeddings of miRNA, is the final embeddings of SM. The neural network automatically adjusted the value of by the initial input value . Finally, we obtained the reconstructed SM-miRNA association matrix by an activation function as follows,
(15) |
where is a trainable matrix. The corresponding element is the potential correlation score between miRNA and SM .
3 Results
To evaluate the performance of GCNLASMMA, we used two types of cross validation, namely, 5-fold cross-validation and Leave-One-Out Cross Validation (LOOCV). The two different datasets include the same known 664 SM-miRNA associations. Specifically, dataset 1 has 831 SMs and 541 miRNAs. On the basis of dataset1, we removed the SMs and miRNAs that did not constitute any known association. Then, we obtained dataset2 which has only 286 different miRNAs, 39 different SMs. In this study, the Area Under the receiver operating characteristic Curves (AUCs) obtained under 5-fold cross-validation based on dataset1 and dataset2 were 0.9721 ± 0.0018 and 0.8393 ± 0.0047, respectively. The global AUC and local AUC obtained under LOOCV based on dataset1 were 0.9751 (global LOOCV), 0.9746 (miRNA-fixed LOOCV) and 0.5014 (SM-fixed LOOCV), respectively. Based on dataset2, the AUCs of GCNLASMMA were 0.8504 (global LOOCV), 0.8490 (miRNA-fixed LOOCV) and 0.6398 (SM-fixed LOOCV), respectively. Additionally, we utilized two types of case studies to confirm the effectiveness of GCNLASMMA in identifying potential miRNAs for investigated SMs. Specifically, GCNLASMMA has predicted the potential miRNAs associated with 5-Fluorouracil (5-Fu, CID: 3385), 5-Aza-2′-deoxycytidine (5-Aza-CdR, CID: 451668) and 17β-Estradiol (E2, CID: 5757). For 5-Fu, the results showed that 9, 16 and 39 out of the top 10, 20 and 50 potential related miRNAs in the first type of case studies, 8, 15 and 39 out of the top 10, 20 and 50 potential related miRNAs in the second type of case studies were validated in other literature or databases, respectively. For 5-Aza-CdR, the results showed that 8, 13 and 26 out of the top 10, 20 and 50 potential related miRNAs in the first type of case studies, 8, 14 and 28 out of the top 10, 20 and 50 potential related miRNAs in the second type of case studies were validated in other literature or databases, respectively. For E2, the results showed that 6, 14 and 29 out of the top 10, 20 and 50 potential related miRNAs in the first type of case studies, 4, 11 and 29 out of the top 10, 20 and 50 potential related miRNAs in the second type of case studies were validated in other literature or databases, respectively.
3.1 Performance evaluation
In 5-fold cross-validation, all known SM-miRNA associations were randomly separated into five subsets of nearly comparable size. Then, each subset was in turn considered as the test sample, and the rest four subsets were treated as training samples. Moreover, all unknown SM-miRNA pairs were regarded as candidate samples. Subsequently, we obtained a predicted association score matrix by GCNLASMMA, and ranked the scores of each test sample against those of the candidate samples. This partition-prediction-ranking procedure was repeated 100 times to obtain a sound estimate of the mean and variance of GCNLASMMA’s prediction accuracy. Finally, the prediction of a test sample was deemed successful if the sample’s rank was higher than the given threshold. Therefore, we utilized the threshold to calculated the false positive rate (FPR, 1-specificity) and the true positive rate (TPR, sensitivity). The FPR and TPR represented the percentage of candidate samples that lower than the threshold and the percentage of test samples that higher than the threshold, respectively. Then, we regarded FPR and TPR as horizontal and vertical axis. The Receiver Operating Characteristic (ROC) curve were plotted. Finally, we attained the Area Under the ROC Curve (AUC) by computing the area under the ROC curves. In this investigation, GCNLASMMA achieved the AUCs of 0.9721 ± 0.0018 and 0.8393 ± 0.0047 under 5-fold cross-validation based on dataset1 and dataset2, respectively.
LOOCV was further classified as either global and local. Then, the local-LOOCV was subdivided into miRNA-fixed LOOCV and SM-fixed LOOCV. In LOOCV, each known SM-miRNA association was in turn considered to be the test sample and the others were treated as the training samples. Moreover, all unknown SM-miRNA pairs were treated as candidate samples. In miRNA-fixed LOOCV and SM-fixed LOOCV, test samples and training samples were chosen similarly. However, in SM-fixed LOOCV, only unknown SM-miRNA pairs containing the selected SM were regarded as candidate samples. Similarly, in miRNA-fixed LOOCV, candidate samples only included those involving the chosen miRNA. Then, we ranked the score of the test sample against those of the candidate samples. Finally, the prediction of a test sample was deemed successful if the rank of this test sample was higher than the given threshold. Based on dataset1, GCNLASMMA attained the AUCs of 0.9751, 0.9746 and 0.5014 under global LOOCV, miRNA-fixed LOOCV and SM-fixed LOOCV, respectively. Based on dataset2, GCNLASMMA attained the AUCs of 0.8504, 0.8490 and 0.6398 under global LOOCV, miRNA-fixed LOOCV and SM-fixed LOOCV, respectively.
The AUC comparison figures based on dataset1 (dataset2) were plotted to determine the differences between GCNLASMMA and other models’ outcomes. AUC = 0.5 would suggest that the model was only capable of random prediction, whereas AUC = 1 would indicate that all test samples were accurately predicted. Figure 2 demonstrates that the results of GCNLASMMA under global LOOCV are significantly better than that of SMiR-NBI. Figures 3, 4 show that the results of GCNLASMMA under miRNA-fixed local LOOCV and SM-fixed local LOOCV were significantly better than those of SLHGISMMA and SMiR-NBI. Furthermore, the AUC of miRNA-fixed local LOOCV based on dataset1 is 0.9746, which means almost all potential SM-miRNA associations in dataset1 were predicted successfully.
FIGURE 2.
The left half of the figure shows the comparison of GCNLASMMA with two comparison algorithms under global LOOCV based on dataset1. As a result, GCNLASMMA, SLHGISMMA and SMiR-NBI achieve AUCs of 0.9751, 0.9273 and 0.8843, respectively. The right half of the figure shows the comparison of GCNLASMMA with two comparison algorithms under global LOOCV based on dataset2. As a result, GCNLASMMA, SLHGISMMA and SMiR-NBI achieve AUCs of 0.8504, 0.7897 and 0.7264, respectively.
FIGURE 3.
The left half of the figure shows the comparison of GCNLASMMA with two comparison algorithms under miRNA-fixed LOOCV based on dataset1. As a result, GCNLASMMA, SLHGISMMA and SMiR-NBI achieve AUCs of 0.9746, 0.9553 and 0.8837, respectively. The right half of the figure shows the comparison of GCNLASMMA with two comparison algorithms under miRNA-fixed LOOCV based on dataset2. As a result, GCNLASMMA, SLHGISMMA and SMiR-NBI achieve AUCs of 0.8490, 0.8106 and 0.7846, respectively.
FIGURE 4.
The left half of the figure shows the comparison of GCNLASMMA with two comparison algorithms under SM-fixed LOOCV based on dataset1. As a result, GCNLASMMA, SLHGISMMA and SMiR-NBI achieve AUCs of 0.5014, 0.7702 and 0.7497, respectively. The right half of the figure shows the comparison of GCNLASMMA with two comparison algorithms under SM-fixed LOOCV based on dataset2. As a result, GCNLASMMA, SLHGISMMA and SMiR-NBI achieve AUCs of 0.6398, 0.6565 and 0.6100, respectively.
3.2 Case studies
To further illustrate the GCNLASMMA’s applicability to identify potential miRNAs, we conducted two types of case studies on three essential SMs, namely 5-Fluorouracil (5-Fu, CID: 3385), 5-Aza-2′-deoxycytidine (5-Aza-CdR, CID: 451668) and 17β-Estradiol (E2, CID: 5757). On the basis of all known SM-miRNA associations, the first type was applied to forecast potential miRNAs for investigated SMs. As the training set, we utilized the known SM-miRNA associations from dataset1. Then, for each investigated SM, we ranked all candidate miRNAs according on their predicted scores. The second type was used to forecast potential miRNAs for investigated SMs without any known SM-miRNA association. Therefore, we removed all verified associations related to the investigated SMs before the prediction and ranked them as the first type of case studies. After ranking all candidate miRNAs for each investigated SM based on their predicted scores, the top 50 predicted miRNAs were picked out and verified in other literature or databases. Moreover, we selected 10, 20 and 50 associations randomly from all potential associations to further demonstrate the validity of GCNLASMMA. The results show that only 0, 0 and 2 out of random 10, 20 and 50 associations are confirmed in other literature or databases (See Table 2), which significantly worse than the top 10, 20 and 50 miRNAs related to investigated SMs.
TABLE 2.
Validation of the random 50 SM-miRNAs associations. The first column records the random 1–25 associations. The second column records the random 26–50 associations.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 4116 | hsa-mir-329-2 | unconfirmed | CID 2662 | hsa-mir-330 | unconfirmed |
CID 60726 | hsa-mir-216b | unconfirmed | CID 7028 | hsa-mir-592 | unconfirmed |
CID 4760 | hsa-mir-520c | unconfirmed | CID 5656 | hsa-mir-646 | 32083545 |
CID 3052 | hsa-mir-193a | unconfirmed | CID 3520 | hsa-mir-1266 | unconfirmed |
CID 444036 | hsa-mir-199a-2 | unconfirmed | CID 43008 | hsa-mir-519a-1 | unconfirmed |
CID 3198 | hsa-mir-216a | unconfirmed | CID 3343 | hsa-mir-1469 | unconfirmed |
CID 157922 | hsa-mir-1260a | unconfirmed | CID 5566 | hsa-mir-548a-3 | unconfirmed |
CID 3698 | hsa-mir-2110 | unconfirmed | CID 5493444 | hsa-mir-1285-2 | unconfirmed |
CID 4212 | hsa-mir-219-2 | unconfirmed | CID 60843 | hsa-let-7d | unconfirmed |
CID 8223 | hsa-mir-98 | unconfirmed | CID 110635 | hsa-mir-216b | unconfirmed |
CID 19861 | hsa-mir-659 | unconfirmed | CID 2801 | hsa-mir-744 | unconfirmed |
CID 71329 | hsa-mir-100 | unconfirmed | CID 216239 | hsa-mir-1273e | unconfirmed |
CID 47641 | hsa-mir-150 | unconfirmed | CID 71398 | hsa-mir-526a-1 | unconfirmed |
CID 443980 | hsa-mir-760 | unconfirmed | CID 4201 | hsa-mir-153-2 | unconfirmed |
CID 5574 | hsa-mir-512-2 | unconfirmed | CID 5281040 | hsa-mir-548a-2 | unconfirmed |
CID 8969 | hsa-mir-543 | unconfirmed | CID 444020 | hsa-mir-320a | unconfirmed |
CID 5282415 | hsa-mir-619 | unconfirmed | CID 3025 | hsa-mir-24-1 | unconfirmed |
CID 65833 | hsa-mir-760 | unconfirmed | CID 3019 | hsa-mir-1226 | unconfirmed |
CID 1775 | hsa-mir-520f | unconfirmed | CID 1125 | hsa-mir-27a | unconfirmed |
CID 3749 | hsa-mir-1285-2 | unconfirmed | CID 1349907 | hsa-mir-642a | unconfirmed |
CID 2905 | hsa-mir-96 | unconfirmed | CID 656719 | hsa-mir-611 | unconfirmed |
CID 3180 | hsa-mir-148a | unconfirmed | CID 2795 | hsa-mir-711 | unconfirmed |
CID 5566 | hsa-mir-646 | unconfirmed | CID 23994 | hsa-mir-614 | unconfirmed |
CID 4212 | hsa-mir-18a | 31063487 | CID 4099 | hsa-mir-708 | unconfirmed |
CID 82146 | hsa-mir-490 | unconfirmed | CID 5281106 | hsa-mir-1302-6 | unconfirmed |
3.2.1 5-Fu
5-Fu, one of the earliest anticancer drugs, can be fully absorbed by tumor cells. Moreover, 5-Fu can decrease tumor cell proliferation by interfering with the formation of DeoxyriboNucleic Acid (DNA) and RiboNucleic Acid (RNA) in tumor cells. It has been demonstrated that 5-Fu has considerable inhibitory effects on various cancer cells. Therefore, 5-Fu is frequently used as a positive control in anticancer drug effect experiments and clinical adjuvant treatment of gastric cancer (Longley et al., 2003). The first type of case studies’ results show that 9, 16 and 39 out of the top 10, 20 and 50 potential 5-Fu-associated miRNAs are confirmed in other literature or databases (See Table 3). The second type of case studies’ results show that 8, 15 and 39 out of the top 10, 20 and 50 potential 5-Fu-associated miRNAs are confirmed in other literature or databases (See Table 4). For example, 5-Fu is the most common chemotherapeutic agent for colorectal cancer. On the one hand, over-expression of hsa-miR-23a causes the resistance to 5-Fu in microsatellite instability colorectal cancer, which results in a diminished effect of 5-Fu chemotherapy (Shang et al., 2014). On the other hand, Ectopic expression of hsa-miR-23a increased the viability and survival of microsatellite stability colorectal cancer cells, thereby leading to the apoptosis of colorectal cancer cells (Li et al., 2015).
TABLE 3.
Validation of the top 50 miRNAs associated with 5-Fu in the first type of case studies. The first column records the top 1–25 related miRNAs. The second column records the top 26–50 related miRNAs.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 3385 | hsa-miR-151a | 23220571 | CID 3385 | hsa-miR-126 | 26062749 |
CID 3385 | hsa-miR-195 | 21947305 | CID 3385 | hsa-miR-128-1 | 23220571 |
CID 3385 | hsa-let-7d | 23220571 | CID 3385 | hsa-miR-337 | unconfirmed |
CID 3385 | hsa-miR-195 | 21947305 | CID 3385 | hsa-miR-181c | unconfirmed |
CID 3385 | hsa-miR-125a | 23220571 | CID 3385 | hsa-miR-30c-1 | unconfirmed |
CID 3385 | hsa-miR-345 | unconfirmed | CID 3385 | hsa-miR-27a | 23220571 |
CID 3385 | hsa-miR-16-1 | 26198104 | CID 3385 | hsa-let-7a-1 | 23220571 |
CID 3385 | hsa-miR-24-1 | 26198104 | CID 3385 | hsa-miR-139 | 27173050 |
CID 3385 | hsa-miR-23b | 23220571 | CID 3385 | hsa-miR-302b | 26457704 |
CID 3385 | hsa-miR-1226 | 26198104 | CID 3385 | hsa-let-7b | 25789066 |
CID 3385 | hsa-miR-151a | 23220571 | CID 3385 | hsa-miR-26b | 23220571 |
CID 3385 | hsa-miR-132 | 23220571 | CID 3385 | hsa-miR-221 | 27501171 |
CID 3385 | hsa-125b-1 | unconfirmed | CID 3385 | hsa-miR-338 | 28928082 |
CID 3385 | hsa-let-7e | 23220571 | CID 3385 | hsa-miR-130a | unconfirmed |
CID 3385 | hsa-miR-19a | 23220571 | CID 3385 | hsa-miR-10b | 22322955 |
CID 3385 | hsa-miR-181a-1 | unconfirmed | CID 3385 | hsa-miR-204 | 27095441 |
CID 3385 | hsa-miR-181b-1 | unconfirmed | CID 3385 | hsa-miR-26a-1 | unconfirmed |
CID 3385 | hsa-miR-25 | 23220571 | CID 3385 | hsa-miR-92a-1 | 23220571 |
CID 3385 | hsa-miR-106a | 23220571 | CID 3385 | hsa-miR-299 | 31786874 |
CID 3385 | hsa-miR-200c | 23220571 | CID 3385 | hsa-miR-107 | 26636340 |
CID 3385 | hsa-miR-22 | 25449431 | CID 3385 | hsa-miR-181a-2 | 24462870 |
CID 3385 | hsa-miR-20a | 23220571 | CID 3385 | hsa-miR-205 | 24396484 |
CID 3385 | hsa-let-7d | 23220571 | CID 3385 | hsa-miR-23a | 23220571 |
CID 3385 | hsa-miR-34b | unconfirmed | CID 3385 | hsa-miR-199b | unconfirmed |
CID 3385 | hsa-miR-205 | 24396484 | CID 3385 | hsa-miR-93 | 23220571 |
TABLE 4.
Validation of the top 50 miRNAs associated with 5-Fu in the second type of case studies. The first column records the top 1–25 related miRNAs. The second column records the top 26–50 related miRNAs.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 3385 | hsa-miR-151a | 23220571 | CID 3385 | hsa-miR-195 | 21947305 |
CID 3385 | hsa-let-7d | 23220571 | CID 3385 | hsa-miR-27a | 23220571 |
CID 3385 | hsa-miR-205 | 24396484 | CID 3385 | hsa-miR-204 | 27095441 |
CID 3385 | hsa-miR-181a-2 | 24462870 | CID 3385 | hsa-miR-181a-1 | unconfirmed |
CID 3385 | hsa-miR-23a | 23220571 | CID 3385 | hsa-miR-25 | 23220571 |
CID 3385 | hsa-miR-1226 | 26198104 | CID 3385 | hsa-miR-199b | unconfirmed |
CID 3385 | hsa-miR-181c | unconfirmed | CID 3385 | hsa-miR-139 | 27173050 |
CID 3385 | hsa-miR-151a | 23220571 | CID 3385 | hsa-miR-195 | 21947305 |
CID 3385 | hsa-miR-26a-1 | unconfirmed | CID 3385 | hsa-miR-132 | 23220571 |
CID 3385 | hsa-miR-26b | 23220571 | CID 3385 | hsa-miR-20a | 23220571 |
CID 3385 | hsa-miR-130a | unconfirmed | CID 3385 | hsa-miR-126 | 26062749 |
CID 3385 | hsa-miR-345 | unconfirmed | CID 3385 | hsa-125b-1 | unconfirmed |
CID 3385 | hsa-miR-128-1 | 23220571 | CID 3385 | hsa-miR-200c | 23220571 |
CID 3385 | hsa-let-7d | 23220571 | CID 3385 | hsa-miR-299 | 31786874 |
CID 3385 | hsa-miR-181b-1 | unconfirmed | CID 3385 | hsa-miR-30c-1 | unconfirmed |
CID 3385 | hsa-miR-205 | 24396484 | CID 3385 | hsa-miR-24-1 | 26198104 |
CID 3385 | hsa-miR-125a | 23220571 | CID 3385 | hsa-miR-93 | 23220571 |
CID 3385 | hsa-miR-22 | 25449431 | CID 3385 | hsa-let-7e | 23220571 |
CID 3385 | hsa-miR-16-1 | 26198104 | CID 3385 | hsa-let-7b | 25789066 |
CID 3385 | hsa-miR-106a | 23220571 | CID 3385 | hsa-miR-221 | 27501171 |
CID 3385 | hsa-miR-23b | 23220571 | CID 3385 | hsa-miR-19a | 23220571 |
CID 3385 | hsa-miR-338 | 28928082 | CID 3385 | hsa-miR-92a-1 | 23220571 |
CID 3385 | hsa-miR-10b | 22322955 | CID 3385 | hsa-miR-302b | 26457704 |
CID 3385 | hsa-let-7a-1 | 23220571 | CID 3385 | hsa-miR-107 | 26636340 |
CID 3385 | hsa-miR-337 | unconfirmed | CID 3385 | hsa-miR-34b | unconfirmed |
3.2.2 5-Aza-CdR
5- Aza-CdR can bind to DNA methyltransferases to reduce methylation levels, reducing the biological activity of methyltransferase inhibitors and regulating gene expression. In clinical usage, 5-Aza-CdR is frequently used in clinical settings to treat diseases caused by gene variants (Do Amaral et al., 2019). Additionally, 5-Aza-CdR can suppress tumor cell proliferation via demethylation, making it one of the most potent inhibitors currently available in vitro (Lemaire et al., 2008). Meanwhile, 5-Aza-CdR can enhance the sensitivity of targeted drugs in non-small cell lung cancer chemotherapy, inhibit cell proliferation, accelerate the apoptosis of cancer cells, induce cell differentiation and activate quiescent anticancer cells in the human body. The first type of case studies’ results show that 8, 13 and 26 out of the top 10, 20 and 50 potential 5-Aza-CdR-associated miRNAs are confirmed in other literature or databases (See Table 5). The second type of case studies’ results show that 8, 14 and 28 out of the top 10, 20 and 50 potential 5-Aza-CdR-associated miRNAs are confirmed in other literature or databases (See Table 6). For example, quantitative methylation-specific Polymerase Chain Reaction analysis showed hypermethylation of the choline phosphoglyceride island adjacent to hsa-let-7e, and demethylation treatment with 5-Aza-CdR or transfection of pYr-let-7e-shRNA plasmid containing unmethylated hsa-let-7e DNA sequence could restore hsa-let-7e expression and partly reduce the chemoresistance (Cai et al., 2013).
TABLE 5.
Validation of the top 50 miRNAs associated with 5-Aza-CdR in the first type of case studies. The first column records the top 1–25 related miRNAs. The second column records the top 26–50 related miRNAs.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 451668 | hsa-miR-20a | 23220571 | CID 451668 | hsa-miR-30a | unconfirmed |
CID 451668 | hsa-miR-320a | 26198104 | CID 451668 | hsa-miR-107 | 23220571 |
CID 451668 | hsa-miR-125a | 23220571 | CID 451668 | hsa-miR-199b | 24659709 |
CID 451668 | hsa-miR-182 | 23220571 | CID 451668 | hsa-let-7a-1 | unconfirmed |
CID 451668 | hsa-miR-204 | unconfirmed | CID 451668 | hsa-miR-92a-1 | unconfirmed |
CID 451668 | hsa-miR-200b | 23626803 | CID 451668 | hsa-miR-181a-1 | 23220571 |
CID 451668 | hsa-miR-23a | unconfirmed | CID 451668 | hsa-let-7e | 22053057 |
CID 451668 | hsa-let-7f-1 | 23220571 | CID 451668 | hsa-miR-26a-1 | unconfirmed |
CID 451668 | hsa-let-7b | 26708866 | CID 451668 | hsa-miR-1233-1 | unconfirmed |
CID 451668 | hsa-miR-200c | 23626803 | CID 451668 | hsa-miR-130a | 23220571 |
CID 451668 | hsa-miR-25 | 23220571 | CID 451668 | hsa-miR-30c-1 | unconfirmed |
CID 451668 | hsa-miR-128-1 | 27705931 | CID 451668 | hsa-miR-22 | 23220571 |
CID 451668 | hsa-miR-145 | 26198104 | CID 451668 | hsa-miR-301a | unconfirmed |
CID 451668 | hsa-miR-221 | unconfirmed | CID 451668 | hsa-let-7g | 23220571 |
CID 451668 | hsa-miR-19b-1 | unconfirmed | CID 451668 | hsa-miR-195 | 23333942 |
CID 451668 | hsa-miR-197 | unconfirmed | CID 451668 | hsa-miR-302b | unconfirmed |
CID 451668 | hsa-let-7i | 23220571 | CID 451668 | hsa-miR-26b | unconfirmed |
CID 451668 | hsa-miR-181b-1 | unconfirmed | CID 451668 | hsa-miR-205 | unconfirmed |
CID 451668 | hsa-miR-338 | unconfirmed | CID 451668 | hsa-miR-218-1 | unconfirmed |
CID 451668 | hsa-let-7d | 26802971 | CID 451668 | hsa-miR-93 | 23220571 |
CID 451668 | hsa-miR-139 | unconfirmed | CID 451668 | hsa-miR-124-1 | unconfirmed |
CID 451668 | hsa-miR-328 | unconfirmed | CID 451668 | hsa-miR-15b | unconfirmed |
CID 451668 | hsa-miR-126 | 23220571 | CID 451668 | hsa-miR-10b | unconfirmed |
CID 451668 | hsa-miR-17 | 23220571 | CID 451668 | hsa-miR-128-2 | unconfirmed |
CID 451668 | hsa-miR-19a | 23220571 | CID 451668 | hsa-miR-27a | 23220571 |
TABLE 6.
Validation of the top 50 miRNAs associated with 5-Aza-CdR in the second type of case studies. The first column records the top 1–25 related miRNAs. The second column records the top 26–50 related miRNAs.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 451668 | hsa-miR-20a | 23220571 | CID 451668 | hsa-miR-92a-1 | unconfirmed |
CID 451668 | hsa-miR-181b-1 | unconfirmed | CID 451668 | hsa-miR-125a | 23220571 |
CID 451668 | hsa-miR-205 | unconfirmed | CID 451668 | hsa-let-7b | 26708866 |
CID 451668 | hsa-miR-19a | 23220571 | CID 451668 | hsa-miR-302b | unconfirmed |
CID 451668 | hsa-miR-181a-1 | 23220571 | CID 451668 | hsa-miR-30a | unconfirmed |
CID 451668 | hsa-miR-130a | 23220571 | CID 451668 | hsa-miR-23b | 23220571 |
CID 451668 | hsa-let-7g | 23220571 | CID 451668 | hsa-miR-199b | 24659709 |
CID 451668 | hsa-miR-200b | 23626803 | CID 451668 | hsa-miR-128-2 | unconfirmed |
CID 451668 | hsa-miR-126 | 23220571 | CID 451668 | hsa-miR-15b | unconfirmed |
CID 451668 | hsa-miR-320a | 26198104 | CID 451668 | hsa-miR-124-1 | unconfirmed |
CID 451668 | hsa-miR-30c-1 | unconfirmed | CID 451668 | hsa-miR-26b | unconfirmed |
CID 451668 | hsa-miR-328 | unconfirmed | CID 451668 | hsa-miR-128-1 | 27705931 |
CID 451668 | hsa-let-7e | 22053057 | CID 451668 | hsa-let-7a-1 | unconfirmed |
CID 451668 | hsa-miR-10b | unconfirmed | CID 451668 | hsa-miR-218-1 | unconfirmed |
CID 451668 | hsa-let-7f-1 | 23220571 | CID 451668 | hsa-miR-200c | 23626803 |
CID 451668 | hsa-miR-221 | unconfirmed | CID 451668 | hsa-miR-26a-1 | unconfirmed |
CID 451668 | hsa-miR-182 | 23220571 | CID 451668 | hsa-miR-338 | unconfirmed |
CID 451668 | hsa-let-7i | 23220571 | CID 451668 | hsa-miR-93 | 23220571 |
CID 451668 | hsa-miR-195 | 23333942 | CID 451668 | hsa-miR-139 | unconfirmed |
CID 451668 | hsa-miR-27a | 23220571 | CID 451668 | hsa-miR-145 | 26198104 |
CID 451668 | hsa-miR-204 | unconfirmed | CID 451668 | hsa-miR-107 | 23220571 |
CID 451668 | hsa-miR-25 | 23220571 | CID 451668 | hsa-let-7d | 26802971 |
CID 451668 | hsa-miR-23a | unconfirmed | CID 451668 | hsa-miR-19b-1 | unconfirmed |
CID 451668 | hsa-let-7f-1 | 23220571 | CID 451668 | hsa-miR-22 | 23220571 |
CID 451668 | hsa-miR-17 | 23220571 | CID 451668 | hsa-miR-197 | unconfirmed |
3.2.3 E2
In addition to stimulating the growth and maintenance of the reproductive system, E2 exerts protective effects on cardiovascular and other organs. Specifically, E2 can reduce blood cholesterol levels by decreasing Low-Density Lipoprotein (LDL), increasing High-Density Lipoprotein (HDL) and boosting apolipoprotein content (Oh et al., 2019). Moreover, researchers are paying more attention to the anti-inflammatory, antioxidant and anti-apoptotic properties of E2 on cardiovascular diseases such as coronary heart disease and atherosclerosis, are getting more attention from researchers (Tse et al., 1999; Rachoń et al., 2002). The first type of case studies’ results show that 6, 14 and 29 out of the top 10, 20 and 50 potential E2-associated miRNAs are confirmed in other literature or databases (See Table 7). The second type of case studies’ results show that 4, 11 and 29 out of the top 10, 20 and 50 potential E2-associated miRNAs are confirmed in other literature or databases (See Table 8). For example, hsa-miR-23a could be negatively regulated by E2 in both myocardium and cultured cardiomyocytes. Moreover, hsa-miR-23a could directly down-regulate peroxisome proliferator-activated receptor γ coactivator-alpha (PGC-1α) expression in cardiomyocytes via binding to its 3′-untranslated regions, which implied that hsa-miR-23a could be critical for the down-regulation of PGC-1α under E2 deficiency (Sun et al., 2014).
TABLE 7.
Validation of the top 50 miRNAs associated with E2 in the first type of case studies. The first column records the top 1–25 related miRNAs. The second column records the top 26–50 related miRNAs.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 5757 | hsa-miR-183 | unconfirmed | CID 5757 | hsa-miR-181b-1 | unconfirmed |
CID 5757 | hsa-let-7g | 23220571 | CID 5757 | hsa-miR-19b-1 | unconfirmed |
CID 5757 | hsa-miR-181a-2 | unconfirmed | CID 5757 | hsa-miR-141 | unconfirmed |
CID 5757 | hsa-miR-125a | 21914226 | CID 5757 | hsa-miR-15a | unconfirmed |
CID 5757 | hsa-miR-107 | 23220571 | CID 5757 | hsa-miR-17 | 23220571 |
CID 5757 | hsa-miR-26b | 24735615 | CID 5757 | hsa-miR-10b | 23220571 |
CID 5757 | hsa-miR-19a | 29416771 | CID 5757 | hsa-miR-30a | 29331043 |
CID 5757 | hsa-miR-195 | unconfirmed | CID 5757 | hsa-let-7f-1 | 23220571 |
CID 5757 | hsa-miR-128-2 | 23220571 | CID 5757 | hsa-miR-302b | 23220571 |
CID 5757 | hsa-miR-181a-1 | unconfirmed | CID 5757 | hsa-miR-199b | unconfirmed |
CID 5757 | hsa-miR-128-1 | 23220571 | CID 5757 | hsa-miR-181c | unconfirmed |
CID 5757 | hsa-miR-130a | unconfirmed | CID 5757 | hsa-miR-106b | 28422740 |
CID 5757 | hsa-miR-338 | 22996663 | CID 5757 | hsa-miR-23a | 23220571 |
CID 5757 | hsa-let-7e | 23220571 | CID 5757 | hsa-miR-9-2 | 23220571 |
CID 5757 | hsa-miR-20a | 21914226 | CID 5757 | hsa-miR-182 | 28678802 |
CID 5757 | hsa-miR-200c | 23220571 | CID 5757 | hsa-miR-139 | unconfirmed |
CID 5757 | hsa-miR-27a | 23220571 | CID 5757 | hsa-let-7b | 23220571 |
CID 5757 | hsa-miR-200b | 23220571 | CID 5757 | hsa-miR-25 | unconfirmed |
CID 5757 | hsa-miR-221 | 21057537 | CID 5757 | hsa-miR-218-1 | unconfirmed |
CID 5757 | hsa-miR-151a | unconfirmed | CID 5757 | hsa-miR-22 | 24715036 |
CID 5757 | hsa-miR-204 | 29789714 | CID 5757 | hsa-miR-15b | 23220571 |
CID 5757 | hsa-miR-106a | unconfirmed | CID 5757 | hsa-miR-130a | unconfirmed |
CID 5757 | hsa-miR-205 | unconfirmed | CID 5757 | hsa-miR-23b | 23220571 |
CID 5757 | hsa-miR-92a-1 | unconfirmed | CID 5757 | hsa-miR-26a-1 | unconfirmed |
CID 5757 | hsa-miR-130b | unconfirmed | CID 5757 | hsa-miR-30c-1 | 23220571 |
TABLE 8.
Validation of the top 50 miRNAs associated with E2 in the second type of case studies. The first column records the top 1–25 related miRNAs. The second column records the top 26–50 related miRNAs.
SM | miRNA | Evidence | SM | miRNA | Evidence |
---|---|---|---|---|---|
CID 5757 | hsa-miR-183 | unconfirmed | CID 5757 | hsa-miR-19a | 29416771 |
CID 5757 | hsa-miR-30c-1 | 23220571 | CID 5757 | hsa-miR-19b-1 | unconfirmed |
CID 5757 | hsa-miR-15a | unconfirmed | CID 5757 | hsa-miR-125a | 21914226 |
CID 5757 | hsa-miR-181a-1 | unconfirmed | CID 5757 | hsa-miR-15b | 23220571 |
CID 5757 | hsa-let-7f-1 | 23220571 | CID 5757 | hsa-miR-128-2 | 23220571 |
CID 5757 | hsa-miR-181b-1 | unconfirmed | CID 5757 | hsa-miR-20a | 21914226 |
CID 5757 | hsa-miR-205 | unconfirmed | CID 5757 | hsa-miR-26b | 24735615 |
CID 5757 | hsa-miR-181a-2 | unconfirmed | CID 5757 | hsa-miR-10b | 23220571 |
CID 5757 | hsa-miR-9-2 | 23220571 | CID 5757 | hsa-miR-181c | unconfirmed |
CID 5757 | hsa-miR-23a | 23220571 | CID 5757 | hsa-miR-22 | 24715036 |
CID 5757 | hsa-miR-128-1 | 23220571 | CID 5757 | hsa-miR-139 | unconfirmed |
CID 5757 | hsa-let-7e | 23220571 | CID 5757 | hsa-miR-106a | unconfirmed |
CID 5757 | hsa-let-7b | 23220571 | CID 5757 | hsa-miR-141 | unconfirmed |
CID 5757 | hsa-miR-130a | unconfirmed | CID 5757 | hsa-let-7g | 23220571 |
CID 5757 | hsa-miR-338 | 22996663 | CID 5757 | hsa-miR-107 | 23220571 |
CID 5757 | hsa-miR-30a | 29331043 | CID 5757 | hsa-miR-23b | 23220571 |
CID 5757 | hsa-miR-302b | 23220571 | CID 5757 | hsa-miR-195 | unconfirmed |
CID 5757 | hsa-miR-130b | unconfirmed | CID 5757 | hsa-miR-27a | 23220571 |
CID 5757 | hsa-miR-106b | 28422740 | CID 5757 | hsa-miR-25 | unconfirmed |
CID 5757 | hsa-miR-199b | unconfirmed | CID 5757 | hsa-miR-204 | 29789714 |
CID 5757 | hsa-miR-200b | 23220571 | CID 5757 | hsa-miR-221 | 21057537 |
CID 5757 | hsa-miR-182 | 28678802 | CID 5757 | hsa-miR-151a | unconfirmed |
CID 5757 | hsa-miR-26a-1 | unconfirmed | CID 5757 | hsa-miR-218-1 | unconfirmed |
CID 5757 | hsa-miR-17 | 23220571 | CID 5757 | hsa-miR-130a | unconfirmed |
CID 5757 | hsa-miR-200c | 23220571 | CID 5757 | hsa-miR-92a-1 | unconfirmed |
4 Discussion
Deep learning offers a wide range of applications in major areas of computer science, such as computer vision, natural language processing and machine translation. More effective models can be obtained by adding hidden layers to standard neural networks. Deep learning also contributes to medication development and precision medicine by predicting potential SM-miRNA associations. Furthermore, deep learning models have more hidden layer nodes than conventional neural networks. The number of hidden layers can even reach ten for extremely complex problems. After multiple layers of calculation, the results of deep learning-based algorithms are often closer to the actual situation than those of traditional machine learning-based algorithms. Initially, we utilized matrix decomposition to reduce noise from known SM-miRNA associations. Then, the layer attention mechanism was introduced to the deep learning model, which significantly improved the performance of our model by integrating the SM-miRNA association feature vectors used for calculation.
GCNLASMMA is a model of a neural network with numerous hidden layers. Multiple layers computations allowed the results to completely consider known features and avoid overfitting. The attention mechanism extracted vital information from each layer of the neural network. Besides, the matrix decomposition module reduced the noise of known SM-miRNA associations, significantly enhancing GCN’s performance. GCNLASMMA was an attempt to identify potential SM-miRNA associations using deep learning. The advantages above enabled GCNLASMMA to accurately anticipate potential SM-miRNA associations.
Deep learning’s spectacular performance is contingent on a vast number of known SM-miRNA associations. The number of known SM-miRNA associations utilized in this investigation was apparently insufficient to fulfill GCNLASMMA. Therefore, the performance of GCNLASMMA was still unsatisfactory. In addition, the parameters used in GCNLASMMA may not be ideal. Moreover, the construction of heterogeneous networks will yield better results if other biological information, such as long non-coding RNA or disease, is utilized. These factors will motivate researchers to develop more effective deep learning models to predict potential SM-miRNA associations using more trustworthy biological datasets.
Data availability statement
The Python code and datasets of GCNLASMMA are publicly available at https://github.com/1054366388/GCNLASMMA.
Author contributions
JZL led the project and supervised the writing. JN did the experiments, searched the literature and wrote the article. XLC and TGN did the subsequent revisions. All authors participated in the revisions and approved the final version of the manuscript.
Funding
This study was supported by the Postgraduate Research and Practice Innovation Program of Jiangsu Province (Grant Number KYCX21_2836).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- Agarwal V., Bell G. W., Nam J. W., Bartel D. P. (2015). Predicting effective microRNA target sites in mammalian mRNAs. Elife 4. 10.7554/eLife.05005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Angermueller C., Pärnamaa T., Parts L., Stegle O. (2016). Deep learning for computational biology. Mol. Syst. Biol. 12 (7), 878. 10.15252/msb.20156651 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartel D. P. (2004). MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 116 (2), 281–297. 10.1016/s0092-8674(04)00045-5 [DOI] [PubMed] [Google Scholar]
- Borges Oliveira D. A., Ribeiro Pereira L. G., Bresolin T., Pontes Ferreira R. E., Reboucas Dorea J. R. (2021). A review of deep learning algorithms for computer vision systems in livestock. Livest. Sci. 253, 104700. 10.1016/j.livsci.2021.104700 [DOI] [Google Scholar]
- Cai J., Yang C., Yang Q., Ding H., Jia J., Guo J., et al. (2013). Deregulation of let-7e in epithelial ovarian cancer promotes the development of resistance to cisplatin. Oncogenesis 2 (10), e75. 10.1038/oncsis.2013.39 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carnevali M., Parsons J., Wyles D. L., Hermann T. (2010). A modular approach to synthetic RNA binders of the hepatitis C virus internal ribosome entry site. ChemBioChem 11 (10), 1364–1367. 10.1002/cbic.201000177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chandrasekaran V., Sanghavi S., Parrilo P. A., Willsky A. S. (2009). Rank-sparsity incoherence for matrix decomposition. SIAM J. Optim. 21 (2), 572–596. 10.1137/090761793 [DOI] [Google Scholar]
- Chen C. Z., Sobczak K., Hoskins J., Southall N., Marugan J. J., Zheng W., et al. (2012). Two high-throughput screening assays for aberrant RNA–protein interactions in myotonic dystrophy type 1. Anal. Bioanal. Chem. 402 (5), 1889–1898. 10.1007/s00216-011-5604-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen X., Zhou C., Wang C. C., Zhao Y. (2021). Predicting potential small molecule-miRNA associations based on bounded nuclear norm regularization. Brief. Bioinform. 22 (6), bbab328. 10.1093/bib/bbab328 [DOI] [PubMed] [Google Scholar]
- Cristino A. S., Nourse J., West R. A., Sabdia M. B., Law S. C., Gunawardana J., et al. (2019). EBV microRNA-BHRF1-2-5p targets the 3'UTR of immune checkpoint ligands PD-L1 and PD-L2. Blood 134 (25), 2261–2270. 10.1182/blood.2019000889 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dai X., Tan C. (2015). Combination of microRNA therapeutics with small-molecule anticancer drugs: Mechanism of action and co-delivery nanocarriers. Adv. Drug Deliv. Rev. 81, 184–197. 10.1016/j.addr.2014.09.010 [DOI] [PubMed] [Google Scholar]
- Denzler R., McGeary S. E., Title A. C., Agarwal V., Bartel D. P., Stoffel M. (2016). Impact of MicroRNA levels, target-site complementarity, and cooperativity on competing endogenous RNA-regulated gene expression. Mol. Cell 64 (3), 565–579. 10.1016/j.molcel.2016.09.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deyle K., Kong X. D., Heinis C. (2017). Phage selection of cyclic peptides for application inresearch and drug development. Acc. Chem. Res. 50 (8), 1866–1874. 10.1021/acs.accounts.7b00184 [DOI] [PubMed] [Google Scholar]
- Do Amaral G., Planello A. C., Borgato G., de Lima D. G., Guimarães G. N., Marqueso M. R., et al. (2019). 5-Aza-CdR promotes partial MGMT demethylation and modifies expression of different genes in oral squamous cell carcinoma. Oral Surg Oral Med Oral Pathol Oral Radiol 127 (5), 424–432. 10.1016/j.oooo.2019.01.006 [DOI] [PubMed] [Google Scholar]
- Dragomir M. P., Knutsen E., Calin G. A. (2021). Classical and noncanonical functions of miRNAs in cancers. Trends Genet. 38, 379–394. 10.1016/j.tig.2021.10.002 [DOI] [PubMed] [Google Scholar]
- Gam J. J., Babb J., Weiss R. (2018). A mixed antagonistic/synergistic miRNA repression model enables accurate predictions of multi-input miRNA sensor activity. Nat. Commun. 9 (1), 2430. 10.1038/s41467-018-04575-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geng B., Craig T. J. (2021). Small molecule drugs for atopic dermatitis, rheumatoid arthritis, and hereditary angioedema. Ann. Allergy Asthma Immunol. 128, 263–268. 10.1016/j.anai.2021.10.015 [DOI] [PubMed] [Google Scholar]
- Ghini F., Rubolino C., Climent M., Simeone I., Marzi M. J., Nicassio F. (2018). Endogenous transcripts control miRNA levels and activity in mammalian cells by target-directed miRNA degradation. Nat. Commun. 9 (1), 3119. 10.1038/s41467-018-05182-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorbea C., Mosbruger T., Cazalla D. (2017). A viral Sm-class RNA base-pairs with mRNAs and recruits microRNAs to inhibit apoptosis. Nature 550 (7675), 275–279. 10.1038/nature24034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gottlieb A., Stein G. Y., Ruppin E., Sharan R. (2011). Predict: A method for inferring novel drug indications with application to personalized medicine. Mol. Syst. Biol. 7, 496. 10.1038/msb.2011.26 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Habib G., Qureshi S. (2020). Optimization and acceleration of convolutional neural networks: A survey. J. King Saud Univ. - Comput. Inf. Sci. 34, 4244. 10.1016/j.jksuci.2020.10.004 [DOI] [Google Scholar]
- Hammond S. M. (2015). An overview of microRNAs. Adv. Drug Deliv. Rev. 87, 3–14. 10.1016/j.addr.2015.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haniff H. S., Liu X., Tong Y., Meyer S. M., Knerr L., Lemurell M., et al. (2021). A structure-specific small molecule inhibits a miRNA-200 family member precursor and reverses a type 2 diabetes phenotype. Cell Chem. Biol. 29, 300–311.e10. 10.1016/j.chembiol.2021.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hattori M., Okuno Y., Goto S., Kanehisa M. (2003). Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. J. Am. Chem. Soc. 125 (39), 11853–11865. 10.1021/ja036030u [DOI] [PubMed] [Google Scholar]
- Healy N., Schiff R., Osborne C. K., Kerin M. (2012). Mirnas: Small molecules, big players in tamoxifen resistance in breast cancer. Int. J. Surg. 10 (8), S4. 10.1016/j.ijsu.2012.06.025 [DOI] [Google Scholar]
- Iwata T., Mizuno N., Nagahara T., Kaneda-Ikeda E., Kajiya M., Sasaki S., et al. (2021). Cytokines regulate stemness of mesenchymal stem cells via miR-628-5p during periodontal regeneration. J. Periodontol. 93, 269–286. 10.1002/jper.21-0064 [DOI] [PubMed] [Google Scholar]
- Jiang Q., Wang Y., Hao Y., Juan L., Teng M., Zhang X., et al. (2009). miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 37, D98–D104. 10.1093/nar/gkn714 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knox C., Law V., Jewison T., Liu P., Ly S., Frolkis A., et al. (2011). DrugBank 3.0: A comprehensive resource for 'omics' research on drugs. Nucleic Acids Res. 39, D1035–D1041. 10.1093/nar/gkq1126 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar Kingsley S. M., Vishnu Bhat B. (2017). Role of MicroRNAs in the development and function of innate immune cells. Int. Rev. Immunol. 36 (3), 154–175. 10.1080/08830185.2017.1284212 [DOI] [PubMed] [Google Scholar]
- Kumari R., Kumar S., Kant R. (2018). Role of circulating miRNAs in the pathophysiology of CVD: As a potential biomarker. Gene Rep. 13, 146–150. 10.1016/j.genrep.2018.10.003 [DOI] [Google Scholar]
- Lai-Kwon J., Tiu C., Pal A., Khurana S., Minchom A. (2021). Moving beyond epidermal growth factor receptor resistance in metastatic non-small cell lung cancer - a drug development perspective. Crit. Rev. Oncol. Hematol. 159, 103225. 10.1016/j.critrevonc.2021.103225 [DOI] [PubMed] [Google Scholar]
- Lee R. C., Feinbaum R. L., Ambros V. (1993). The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell 75 (5), 843–854. 10.1016/0092-8674(93)90529-y [DOI] [PubMed] [Google Scholar]
- Lemaire M., Chabot G. G., Raynal N. J., Momparler L. F., Hurtubise A., Bernstein M. L., et al. (2008). Importance of dose-schedule of 5-aza-2'-deoxycytidine for epigenetic therapy of cancer. BMC Cancer 8, 128. 10.1186/1471-2407-8-128 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X., Rao S., Wang Y., Gong B. (2004). Gene mining: A novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling. Nucleic Acids Res. 32 (9), 2685–2694. 10.1093/nar/gkh563 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Qiu C., Tu J., Geng B., Yang J., Jiang T., et al. (2014). HMDD v2.0: A database for experimentally supported human microRNA and disease associations. Nucleic Acids Res. 42, D1070–D1074. 10.1093/nar/gkt1023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X., Li X., Liao D., Wang X., Wu Z., Nie J., et al. (2015). Elevated microRNA-23a expression enhances the chemoresistance of colorectal cancer cells with microsatellite instability to 5-fluorouracil by directly targeting ABCF1. Curr. Protein Pept. Sci. 16 (4), 301–309. 10.2174/138920371604150429153309 [DOI] [PubMed] [Google Scholar]
- Li J., Peng D., Xie Y., Dai Z., Zou X., Li Z. (2021). Novel potential small molecule-MiRNA-cancer associations prediction model based on fingerprint, sequence, and clinical symptoms. J. Chem. Inf. Model. 61 (5), 2208–2219. 10.1021/acs.jcim.0c01458 [DOI] [PubMed] [Google Scholar]
- Liu X., Zhu F., Ma X., Tao L., Zhang J., Yang S., et al. (2011). The therapeutic target database: An internet resource for the primary targets of approved, clinical trial and experimental drugs. Expert Opin. Ther. Targets 15 (8), 903–912. 10.1517/14728222.2011.586635 [DOI] [PubMed] [Google Scholar]
- Liu X., Wang S., Meng F., Wang J., Zhang Y., Dai E., et al. (2013). SM2miR: A database of the experimentally validated small molecules' effects on microRNA expression. Bioinformatics 29 (3), 409–411. 10.1093/bioinformatics/bts698 [DOI] [PubMed] [Google Scholar]
- Liu R., Lu Z., Gu J., Liu J., Huang E., Liu X., et al. (2018). MicroRNAs 15A and 16-1 activate signaling pathways that mediate chemotaxis of immune regulatory B cells to colorectal tumors. Gastroenterology 154 (3), 637–651. e637. 10.1053/j.gastro.2017.09.045 [DOI] [PubMed] [Google Scholar]
- Liu F., Peng L., Tian G., Yang J., Chen H., Hu Q., et al. (2020). Identifying small molecule-miRNA associations based on credible negative sample selection and random walk. Front. Bioeng. Biotechnol. 8, 131. 10.3389/fbioe.2020.00131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Longley D. B., Harkin D. P., Johnston P. G. (2003). 5-fluorouracil: Mechanisms of action and clinical strategies. Nat. Rev. Cancer 3 (5), 330–338. 10.1038/nrc1074 [DOI] [PubMed] [Google Scholar]
- Lu T. X., Rothenberg M. E. (2018). MicroRNA. J. Allergy Clin. Immunol. 141 (4), 1202–1207. 10.1016/j.jaci.2017.08.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lv S., Li Y., Wang Q., Ning S., Huang T., Wang P., et al. (2012). A novel method to quantify gene set functional association based on gene ontology. J. R. Soc. Interface 9 (70), 1063–1072. 10.1098/rsif.2011.0551 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lv Y., Wang S., Meng F., Yang L., Wang Z., Wang J., et al. (2015). Identifying novel associations between small molecules and miRNAs based on integrated molecular networks. Bioinformatics 31 (22), 3638–3644. 10.1093/bioinformatics/btv417 [DOI] [PubMed] [Google Scholar]
- Meng F., Yang X., Zhou C. (2014). The augmented Lagrange multipliers method for matrix completion from corrupted samplings with application to mixed Gaussian-impulse noise removal. Plos One 9 (9), e108125. 10.1371/journal.pone.0108125 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monroig P. d. C., Chen L., Zhang S., Calin G. A. (2015). Small molecule compounds targeting miRNAs for cancer therapy. Adv. Drug Deliv. Rev. 81, 104–116. 10.1016/j.addr.2014.09.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nair A., Chung H. C., Sun T., Tyagi S., Dobrolecki L. E., Dominguez-Vidana R., et al. (2018). Combinatorial inhibition of PTPN12-regulated receptors leads to a broadly effective therapeutic strategy in triple-negative breast cancer. Nat. Med. 24 (4), 505–511. 10.1038/nm.4507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niu Z., Zhong G., Yu H. (2021). A review on the attention mechanism of deep learning. Neurocomputing 452, 48–62. 10.1016/j.neucom.2021.03.091 [DOI] [Google Scholar]
- Oh J. Y., Choi G. E., Lee H. J., Jung Y. H., Chae C. W., Kim J. S., et al. (2019). 17β-Estradiol protects mesenchymal stem cells against high glucose-induced mitochondrial oxidants production via Nrf2/Sirt3/MnSOD signaling. Free Radic. Biol. Med. 130, 328–342. 10.1016/j.freeradbiomed.2018.11.003 [DOI] [PubMed] [Google Scholar]
- Parsons J., Castaldi M. P., Dutta S., Dibrov S. M., Wyles D. L., Hermann T. (2009). Conformational inhibition of the hepatitis C virus internal ribosome entry site RNA. Nat. Chem. Biol. 5 (11), 823–825. 10.1038/nchembio.217 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peng Y., Ganesh A., Wright J., Xu W., Ma Y. (2012). Rasl: Robust alignment by sparse and low-rank decomposition for linearly correlated images. IEEE Trans. Pattern Anal. Mach. Intell. 34 (11), 2233–2246. 10.1109/TPAMI.2011.282 [DOI] [PubMed] [Google Scholar]
- Qu J., Chen X., Sun Y. Z., Li J. Q., Ming Z. (2018). Inferring potential small molecule-miRNA association based on triple layer heterogeneous network. J. Cheminform. 10 (1), 30. 10.1186/s13321-018-0284-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qu J., Chen X., Sun Y. Z., Zhao Y., Cai S. B., Ming Z., et al. (2019). In silico prediction of small molecule-miRNA associations based on the HeteSim algorithm. Mol. Ther. Nucleic Acids 14, 274–286. 10.1016/j.omtn.2018.12.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rachoń D., Myśliwska J., Suchecka-Rachoń K., Wieckiewicz J., Myśliwski A., Mysliwski A. (2002). Effects of oestrogen deprivation on interleukin-6 production by peripheral blood mononuclear cells of postmenopausal women. J. Endocrinol. 172 (2), 387–395. 10.1677/joe.0.1720387 [DOI] [PubMed] [Google Scholar]
- Ruepp A., Kowarsch A., Schmidl D., Buggenthin F., Brauner B., Dunger I., et al. (2010). PhenomiR: A knowledgebase for microRNA expression in diseases and biological processes. Genome Biol. 11 (1), R6. 10.1186/gb-2010-11-1-r6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rupaimoole R., Slack F. J. (2017). MicroRNA therapeutics: Towards a new era for the management of cancer and other diseases. Nat. Rev. Drug Discov. 16 (3), 203–222. 10.1038/nrd.2016.246 [DOI] [PubMed] [Google Scholar]
- Saikia M., Paul S., Chakraborty S. (2020). Role of microRNA in forming breast carcinoma. Life Sci. 259, 118256. 10.1016/j.lfs.2020.118256 [DOI] [PubMed] [Google Scholar]
- Seth P. P., Miyaji A., Jefferson E. A., Sannes-Lowery K. A., Osgood S. A., Propp S. S., et al. (2005). SAR by MS: Discovery of a new class of RNA-binding small molecules for the hepatitis C virus: Internal ribosome entry site IIA subdomain. J. Med. Chem. 48 (23), 7099–7102. 10.1021/jm050815o [DOI] [PubMed] [Google Scholar]
- Shang J., Yang F., Wang Y., Wang Y., Xue G., Mei Q., et al. (2014). MicroRNA-23a antisense enhances 5-fluorouracil chemosensitivity through APAF-1/caspase-9 apoptotic pathway in colorectal cancer cells. J. Cell. Biochem. 115 (4), 772–784. 10.1002/jcb.24721 [DOI] [PubMed] [Google Scholar]
- Singh S., Sharma A., Chauhan V. K. (2021). Online handwritten Gurmukhi word recognition using fine-tuned Deep Convolutional Neural Network on offline features. Mach. Learn. Appl. 5, 100037. 10.1016/j.mlwa.2021.100037 [DOI] [Google Scholar]
- Sun L. Y., Wang N., Ban T., Sun Y. H., Han Y., Sun L. L., et al. (2014). MicroRNA-23a mediates mitochondrial compromise in estrogen deficiency-induced concentric remodeling via targeting PGC-1α. J. Mol. Cell. Cardiol. 75, 1–11. 10.1016/j.yjmcc.2014.06.012 [DOI] [PubMed] [Google Scholar]
- Tagliafierro L., Glenn O. C., Zamora M. E., Beach T. G., Woltjer R. L., Lutz M. W., et al. (2017). Genetic analysis of α-synuclein 3' untranslated region and its corresponding microRNAs in relation to Parkinson's disease compared to dementia with Lewy bodies. Alzheimers Dement. 13 (11), 1237–1250. 10.1016/j.jalz.2017.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomou T., Mori M. A., Dreyfuss J. M., Konishi M., Sakaguchi M., Wolfrum C., et al. (2017). Adipose-derived circulating miRNAs regulate gene expression in other tissues. Nature 542 (7642), 450–455. 10.1038/nature21365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thorne N., Inglese J., Auld D. S. (2010). Illuminating insights into firefly luciferase and other bioluminescent reporters used in chemical biology. Chem. Biol. 17 (6), 646–657. 10.1016/j.chembiol.2010.05.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tse J., Martin-McNaulty B., Halks-Miller M., Kauser K., DelVecchio V., Vergona R., et al. (1999). Accelerated atherosclerosis and premature calcified cartilaginous metaplasia in the aorta of diabetic male Apo E knockout mice can be prevented by chronic treatment with 17β-estradiol. Atherosclerosis 144 (2), 303–313. 10.1016/S0021-9150(98)00325-6 [DOI] [PubMed] [Google Scholar]
- Vidal R. (2011). Subspace clustering. IEEE Signal Processing Magazine 28 (2), 52–68. 10.1109/msp.2010.939739 [DOI] [Google Scholar]
- Wang C. C., Chen X. (2019). A unified framework for the prediction of small molecule-MicroRNA association based on cross-layer dependency inference on multilayered networks. J. Chem. Inf. Model. 59 (12), 5281–5293. 10.1021/acs.jcim.9b00667 [DOI] [PubMed] [Google Scholar]
- Wang Y., Xiao J., Suzek T. O., Zhang J., Wang J., Bryant S. H. (2009). PubChem: A public information system for analyzing bioactivities of small molecules. Nucleic Acids Res. 37, W623–W633. 10.1093/nar/gkp456 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang C. C., Chen X., Qu J., Sun Y. Z., Li J. Q. (2019). Rfsmma: A new computational model to identify and prioritize potential small molecule-MiRNA associations. J. Chem. Inf. Model. 59 (4), 1668–1679. 10.1021/acs.jcim.9b00129 [DOI] [PubMed] [Google Scholar]
- Wang X., Zhao Y., Pourpanah F. (2020). Recent advances in deep learning. Int. J. Mach. Learn. Cybern. 11 (4), 747–750. 10.1007/s13042-020-01096-5 [DOI] [Google Scholar]
- Wang S. H., Wang C. C., Huang L., Miao L. Y., Chen X. (2021). Dual-Network Collaborative Matrix Factorization for predicting small molecule-miRNA associations. Brief. Bioinform. 23, bbab500. 10.1093/bib/bbab500 [DOI] [PubMed] [Google Scholar]
- Wen D., Danquah M., Chaudhary A. K., Mahato R. I. (2015). Small molecules targeting microRNA for cancer therapy: Promises and obstacles. J. Control. Release 219, 237–247. 10.1016/j.jconrel.2015.08.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wightman B., Ha I., Ruvkun G. (1993). Posttranscriptional regulation of the heterochronic gene lin-14 by lin-4 mediates temporal pattern formation in C. elegans . Cell 75 (5), 855–862. 10.1016/0092-8674(93)90530-4 [DOI] [PubMed] [Google Scholar]
- Xia X., Wang Y., Huang Y., Zhang H., Lu H., Zheng J. C. (2019). Exosomal miRNAs in central nervous system diseases: Biomarkers, pathological mediators, protective factors and therapeutic agents. Prog. Neurobiol. 183, 101694. 10.1016/j.pneurobio.2019.101694 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yekkirala A. S., Roberson D. P., Bean B. P., Woolf C. J. (2017). Breaking barriers to novel analgesic drug development. Nat. Rev. Drug Discov. 16 (8), 545–564. 10.1038/nrd.2017.87 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yin J., Chen X., Wang C. C., Zhao Y., Sun Y. Z. (2019). Prediction of small molecule-MicroRNA associations by sparse learning and heterogeneous graph inference. Mol. Pharm. 16 (7), 3157–3166. 10.1021/acs.molpharmaceut.9b00384 [DOI] [PubMed] [Google Scholar]
- Yu A. M., Choi Y. H., Tu M. J. (2020). RNA drugs and RNA targets for small molecules: Principles, progress, and challenges. Pharmacol. Rev. 72 (4), 862–898. 10.1124/pr.120.019554 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeng X., Zhu S., Hou Y., Zhang P., Li L., Li J., et al. (2020). Network-based prediction of drug-target interactions using an arbitrary-order proximity embedded deep forest. Bioinformatics 36 (9), 2805–2812. 10.1093/bioinformatics/btaa010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhong H., Zhou Y., Xu Q., Yan J., Zhang X., Zhang H., et al. (2021). Low expression of miR-19a-5p is associated with high mRNA expression of diacylglycerol O-acyltransferase 2 (DGAT2) in hybrid tilapia. Genomics 113 (4), 2392–2399. 10.1016/j.ygeno.2021.05.016 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The Python code and datasets of GCNLASMMA are publicly available at https://github.com/1054366388/GCNLASMMA.