Abstract
MicroRNAs are a large class of post-transcriptional regulators that bind to the 3′ untranslated region of messenger RNAs. They play a critical role in many cellular processes and have been linked to the control of signal transduction pathways. Recent studies indicate that microRNAs can function as tumor suppressors or even as oncogenes when aberrantly expressed. For more general insights of disease-associated microRNAs, we analyzed their impact on human signaling pathways from two perspectives. On a global scale, we found a core set of signaling pathways with enriched tissue-specific microRNA targets across diseases. The function of these pathways reflects the affinity of microRNAs to regulate cellular processes associated with apoptosis, proliferation or development. Comparing cancer and non-cancer related microRNAs, we found no significant differences between both groups. To unveil the interaction and regulation of microRNAs on signaling pathways locally, we analyzed the cellular location and process type of disease-associated microRNA targets and proteins. While disease-associated proteins are highly enriched in extracellular components of the pathway, microRNA targets are preferentially located in the nucleus. Moreover, targets of disease-associated microRNAs preferentially exhibit an inhibitory effect within the pathways in contrast to disease proteins. Our analysis provides systematic insights into the interaction of disease-associated microRNAs and signaling pathways and uncovers differences in cellular locations and process types of microRNA targets and disease-associated proteins.
Introduction
MicroRNAs are endogenous, non-protein coding, approximately 22-nucleotide RNA molecules that have recently emerged as post-transcriptional regulators, known to influence diverse cellular processes ranging from stem cell differentiation to apoptosis [1]. They mostly target the 3′ untranslated region of a target mRNA, thereby destabilizing the transcript and inhibiting its translation [2], [3]. While there is evidence [4]–[6] that microRNA expression and maturation is induced by signaling pathways, microRNAs also emerge as regulators of signaling proteins. In zebrafish, miR-9 has been shown to regulate several components of the FGF signaling pathway, and thus controls neurogenesis in the midbrain-hindbrain domain during late embryonic development [7]. In another recent example in fruit fly [8], miR-8 has been identified to target both a transmembrane protein and a transcription factor of the WNT signaling pathway. Ricarte-Filho et al. [9] showed that the RET-pathway is mediated by let-7 which inhibits the activation of the RET/PTC-RAS-BRAF-ERK cascade exemplifying the direct influence of a single microRNA on a submodule of a signaling pathway. Given the generally large number of microRNA targets [10] it is natural to assume that many microRNAs regulate not only a single important pathway protein, but rather coordinate protein levels on a pathway-wide scale. Altered microRNA levels might then result in inaccurate target protein levels, consequently fallacious signal transduction, and potentially a disease phenotype.
From this perspective, it is intriguing to observe that medical sciences increasingly focus on the impact of microRNA-mediated regulatory control on diseases, especially in cancer: microRNAs are intensively used as diagnostic and prognostic disease markers [11], and even appear in first clinical trials [12]. Given the linkages between signaling pathways and microRNA regulation on the one hand, and microRNAs and disease phenotypes on the other, we aim to unveil the connection between phenotypes and pathways induced by microRNA mediated regulatory control.
In this work, we analyzed the tissue-specific regulatory patterns of disease-associated microRNAs in signaling pathways on different scales. Globally, we investigated the enrichment of disease-associated microRNAs on different pathways, and more locally, on the cellular location and process type of target proteins. We used manually annotated data from hundreds of patient studies to estimate the impact of disease-associated microRNAs on signaling pathways. We identified a core set of pathways, homogeneously enriched throughout nearly all diseases. Most of these pathways have been associated with cell growth, proliferation, and apoptosis. However, deregulation of signaling pathways can be induced by diverse factors. Point mutation of central signaling cascade proteins [13] have a severe impact on the information flow as well as any change in the expression pattern of cis or trans regulators. We thus compared the cellular localization and process type of signaling proteins that are microRNA targets with proteins that have been identified as disease-associated. In the following, we show that in contrast to disease proteins, microRNA targets are significantly enriched as inhibitors within the nucleus.
Results
We captured the different entities of our investigation in a multipartite graph. The graph consists of five sets of nodes representing the entities microRNAs, proteins, tissue, diseases, and pathways and links between but not within the set of nodes. Links are given by a prediction tool and four databases. MicroRNAs (as provided by the mirBase database [14]) are linked to diseases and corresponding tissue via the PhenomiR database [15], a manually curated database containing disease-associated microRNAs in human disorders. MicroRNA target transcripts are determined by TargetScanS [10] a prediction tool that shows a high performance on different microRNA target data sets [16]. In addition, we used the tissue atlas provided by Su et al. [17] to filter potential microRNA targets for a specific disease and a given tissue. We unified the set of mRNA transcripts and corresponding proteins to a set of nodes denoted simply as proteins. This set is linked to signaling pathways via the National Cancer Institute Pathway Interaction Database (NCI PID) [18], containing 79 human pathways together with its constituting components. Finally, disease proteins are identified by their KEGG DISEASE annotation [19] (see Methods for a detailed description of the materials used). Figure 1 summarizes the entities and connections used. Notably, similar results were obtained with other microRNA prediction tools and a different set of disease genes, as provided by OMIM [20] (for a detailed discussion see Robustness analysis in File S1).
MicroRNAs induce a core set of signaling pathways across diseases and tissues
We first analyzed the connection between diseases and signaling pathways, mediated by disease-associated microRNAs. In order to project the properties of the multipartite graph onto a disease-pathway correlation, we calculated the enrichment of disease-associated microRNA targets in a particular pathway. We used the tissue annotation in PhenomiR to filter for expressed microRNA targets, as given by the tissue atlas of Su et al. [17]. For a particular disease and a specific pathway, we computed the log odds ratio (LOD score) by dividing the relative number of associated microRNA targets in this pathway and tissue with the expected number, based on the relative number of associated microRNA targets in all signaling pathways given a specific tissue. Disease-pathway interactions with no targets (white fields in the heatmap Figure 2A) were excluded from further analyses (see Methods for a detailed description). We obtained a matrix of LOD scores, where each entry indicates the enrichment or depletion of tissue-specific targets of disease-associated microRNAs in the respective signaling pathway. We ordered this matrix according to a hierarchical clustering along the disease axis and pathway axis, respectively. Two features of the resulting heatmap are remarkable: First, dividing the hierarchical clustering of the signaling pathways into 3 major sub-clusters, we found one cluster (cluster 2; mean LOD = 0.55, variance = 0.008) showing a high enrichment throughout all diseases (see Figure 2A). We define this cluster as the core set of signaling pathways highly enriched with disease-associated microRNA targets. The remaining clusters show a high variance (cluster 3; mean LOD = 0.21, variance = 0.02) and a common depletion of microRNA targets (cluster 1; mean LOD = −0.36, variance = 0.07). Second, the 63 diseases split into two clusters with high and low microRNA-pathway associations. Within the larger of the two clusters, the enrichment of microRNA targets is extremely homogenous. Moreover we performed a multi-scale bootstrap resampling approach (relative sample sizes of bootstrap replication of 20%) [21] to test whether clusters 1–3 are robust against variation in the data. We can reject the hypothesis that the clusters do not exist with a significance level 0.05 indicating that the clusters 1–3 may stably be observed by increasing the number of observations.
All signaling pathways located in the core set are given in Table 1. The functions of these pathways reflect the affinity of microRNAs to regulate cellular processes associated with apoptosis, proliferation or development, as we will outline with three examples. (i) The PDGFa pathway, for example, promotes cell migration, proliferation, and survival [22]–[25]. PDGF expression has been demonstrated in a number of different solid tumors, from glioblastomas to prostate carcinomas. Its biological function varies from autocrine stimulation of cell growth to subtler paracrine interactions involving adjacent stroma or vasculature [26]. (ii) It was recently reported that let-7 has an influence on the RET-pathway by effecting the cell growth and differentiation of papillary thyroid cancer [9]. Ricarte-Filho et al. [9] concluded that let-7 inhibited the activation of the RET/PTC-RAS-BRAF-ERK cascade exemplifying the direct influence of a single microRNA on a submodule of a signaling pathway. (iii) The Reelin pathway has been directly correlated with tumor aggressiveness [27]–[29]. Evangelisti et al. [30] linked this pathway for the first time to cancer by showing the inhibition of Reelin by miR-124a.
Table 1. Core set of signaling pathways with highly enriched microRNA targets.
Pathway | Median LOD | microRNA | Z-score | Z-score |
Rhodopsin | 0.76 | miR-154 | 8.69 | 6.58 |
Botulinum | 0.61 | miR-29b | 8.58 | 8.10 |
TGFBR | 0.61 | miR-216a | 12.20 | 7.10 |
BMP | 0.60 | miR-224 | 9.37 | 7.93 |
IGF1 | 0.59 | miR-375 | 9.39 | 8.12 |
VEGFR3 | 0.57 | miR-422a | 8.29 | 7.89 |
EphrinB/EPHB | 0.57 | miR-422a | 11.44 | 8.06 |
PDGFa | 0.56 | miR-383 | 7.20 | 7.59 |
MET | 0.55 | miR-422a | 10.96 | 7.61 |
EphrinA/EPHA | 0.53 | miR-136 | 8.31 | 8.15 |
RET | 0.52 | miR-422a | 9.04 | 7.24 |
VEGFR1 | 0.51 | miR-422a | 11.72 | 7.82 |
REELIN | 0.51 | miR-197 | 7.76 | 6.86 |
TRKR | 0.49 | miR-335 | 12.94 | 7.88 |
mTOR4 | 0.47 | miR-375 | 7.23 | 7.44 |
EPO | 0.43 | miR-134 | 6.75 | 8.00 |
The Median LOD score is calculated over all diseases for a particular pathway. MicroRNA is the most enriched single microRNA within the corresponding pathway. Z-score was calculated by comparing the median LOD score with the obtained score by a random sampling of microRNA targets. Z-score was calculated by comparing the median LOD score with the obtained score by a random sampling of pathway proteins.
The pathways with the highest negative enrichments, as depleted by disease-associated microRNA targets, are the IL-23 mediated pathway (playing a pivotal role in autoimmunity [31]) and BRAD1, which is associated with cell survival and cell death [32]. Although we found a core set of pathways across diseases, differences between disorders can arise due to different expression levels of the respective microRNAs. The PDGFa pathway for example shows high enrichments across diseases independent of the microRNA prediction tool (see Table S1). We found miR-144 to be highly enriched in the PDGFa pathway. Analyzing the expression profile, we found miR-144 down-regulated in cancer, but up-regulated in Parkinson disease and idiopathic Myelofibrosis. Predicted targets of miR-144 are SRF, a transcription factor activated by PDGFa, and FOS that is thought to have an important role in signal transduction, cell proliferation and differentiation [33]–[35]. This finding shows that although different diseases are associated with the same signaling pathway, differences in the effects of the stimulated pathways can be induced by complementary expression profiles of microRNAs.
As the PhenomiR data set is dominated by cancer-related diseases (60%), we divided the set of diseases into a subset of cancer and non-cancer related microRNAs to study differences between both groups. We found 14 out of 16 pathways of the global core set also in the cancer-specific core set (see Table S2). The core set for the non-cancer related pathways contains 12 pathways that were also found by the global data set, but we also identify also two non-cancer specific pathway enrichments (see Table S3) such as the KIT pathway and the NFB pathway, that is involved in the expression of genes associated with development, cell death, and immune response [36]–[39].
Robustness analysis of the core set of signaling pathways
In order to ensure that our results are not artifacts of the chosen prediction tool, we analyzed the data with four other prediction tools: PicTar [40], Miranda [41], TargetSpy [42], and RNA22 [43]. Different features like conservation of the seed region or binding energies are taken into account to predict microRNA-transcript interactions in each tool. Based on these differences the overlap between the target sets from different tools is generally rather low [44]. We define for each tool the core set of signaling pathways, which are highly enriched by microRNA targets and compare these list with our core set listed in Table 1. The result shows that the signaling pathways in our core set are mostly consistent with different prediction tools (see Table S1). We found 8 out of 16 pathways within the core set of at least 3 different prediction tools.
In order to test the significance of these pathways, we performed a randomization approach, by comparing the median LOD score of these pathways with the median scores obtained by two random samplings. We first sampled 10.000 times pathway proteins keeping the pathway size constant, second, we generated 10.000 times a random microRNA predictor by sampling for each microRNA the corresponding targets. Finally, we calculated a z-score to estimate the significance of each pathway within the core set. We obtained high z-scores for the pathways within the core set independent of the sampling approach (see Table 1). The mean z-score for all pathways is 12.51 (Z-score) and 7.65 (Z-score), respectively.
The enrichment of microRNA targets is summarized in the boxplot in Figure 2B, where the distribution of LOD scores for each pathway is shown. The median LOD scores and their variance for the set of signaling pathways are significantly negatively correlated (Pearson correlation coefficient C = 0.37, , see Figure S1). In contrast to depleted pathways, highly enriched pathways are homogeneously targeted by microRNAs across diseases. This indicates that disease-associated microRNAs in human disorders target a core set of signaling pathways irrespective of the specific disease and tissue.
We ensure that the LOD scores are not trivially biased by the pathway size (C = −0.032, ) and show the respective plot in Figure S2. We noticed that the pathway enrichment is significantly negatively correlated with the number of microRNAs with targets in this pathway (C = −0.31, ), with up to 159 targeting microRNAs in the SMAD2 pathway.
Interaction of disease-associated proteins and microRNA targets
Much effort has been invested in understanding the mechanisms underlying the complex network of factors contributing to human diseases. Databases like OMIM [20], KEGG DISEASE [19], or HGMD [45] link dysfunctional proteins and genetic mutations to human disorders. In order to focus on already confirmed gene-disease interactions, we used the KEGG DISEASE database to study similarities and differences to microRNA targets in signaling pathways. In the following, we analyzed 23 diseases that are both annotated in KEGG DISEASE and PhenomiR (see Methods). In this subset, we analyzed 365 KEGG DISEASE proteins located in the NCI PID signaling pathways and identified 123 (33.7%) proteins as microRNA targets. The current estimation for the amount of microRNA targets in the human genome lies between 30–35% [10], [46]. This implies that there is no higher rate of microRNA targets in the set of disease proteins than expected. In order to study the interplay of disease proteins and microRNA targets, we compared their mapping to NCI PID pathways (see Figure 1). We found that typically, disease-affected proteins are widely distributed over pathways for a particular disease. Focusing on pathways showing a high fraction of disease-associated proteins, we found no correlation of microRNA target enrichment and the fraction of disease-affected signaling proteins (see Figure S3). These findings imply that disease-affected proteins and disease-associated microRNA targets do not prefer a common set of signaling pathways. To elucidate those differences, we changed the scale of our investigation and compare the localization and process type of disease-associated microRNA targets and disease proteins.
MicroRNA targets are preferentially located in the nucleus in contrast to disease proteins
To question whether microRNA targets and KEGG DISEASE proteins differ with respect to their cellular location and process type annotation, we divided the set of signaling proteins according to their NCI PID annotation into four groups: extracellular region, cell membrane, intracellular region, and nucleus. We then estimated the fraction of microRNA targets as well as disease proteins for each group and calculated the LOD enrichment scores (see Methods for a detailed description). Surprisingly, we found opposing patterns of cellular localization for disease-associated proteins and microRNA targets (see Figure 3A). Deregulated microRNAs preferentially target nuclear proteins (LOD = 0.57, ), while disease-associated proteins in the nucleus are underrepresented (LOD = −0.41, ). Therefore, microRNA targets are almost twice more frequently located in the nucleus as compared to disease proteins. Furthermore, proteins located in extracellular region are only weakly controlled (LOD = −0.81, ) by microRNAs. Disease associated proteins showing again a complementary result compared to microRNA targets (LOD = 0.44, ), being more than twice more frequently located in the extracellular region. Proteins located in the cell membrane or intracellular region show no significant differences and enrichments for microRNAs or disease-associations. Comparing these results with the subset of cancer-related microRNAs we obtained the similar finding of a preferred target location in the nucleus. This result shows that preferred location is not based on a disease-specific set but a common pattern, valid for cancer as well as non-cancer related microRNAs (see Figure S4). We repeated the location analysis with different prediction tools and obtained similar results for microRNA targets (see Figure S5). Analyzing microRNA targets located in the nucleus by Gene Ontology, we found 50% of those genes involved in transcriptional regulation. In addition, we used the OMIM database to select disease-associated genes and found again a opposite pattern of cellular localization for OMIM and microRNA targets (see Figure S6).
In contrast to disease proteins, microRNA targets frequently exhibit an inhibitory effect
We sorted the set of signaling proteins into three different groups according to their process type annotation: activating proteins, inhibiting proteins and proteins that can act as either activators or inhibitors, further on denoted as ambivalent. We then counted the number of microRNA targets as well as disease proteins for each group in our signaling pathways and calculated the LOD score. The result shows again a complementary pattern: As shown in Figure 3B, targets of disease-associated microRNAs are preferentially inhibitors (LOD = 1.62, ), whereas only 6 disease-associated proteins (LOD = −2.08, ) show a inhibitory effect. MicroRNA targets are enriched almost 14 times more in inhibiting proteins compared to disease proteins showing a complementary focus. Ambivalent proteins show a strong under-representation for microRNA targets (LOD = −0.96, ), whereas disease-affected proteins are significantly enriched (LOD = 1.26, ). For activators, we found a significant under-representation for both disease proteins (LOD = −0.75, ), and microRNA targets (LOD = −0.60, ), respectively. Again, we found the same result for cancer and non-cancer related microRNA targets indicating a common pattern. Notably, the enrichment of process types of disease proteins remains for the OMIM data set (see Figure S7).
Discussion
In order to study the role of disease-associated microRNAs in pathways, we applied a thorough statistical analysis to a multipartite graph consisting of microRNAs, proteins, diseases, tissue and signaling pathways. We investigated enrichment of disease-associated microRNAs globally on different pathways by considering of tissue-specific transcript expression, and more locally, on the cellular location and process type of target proteins.
We found that the amount of regulatory control mediated by disease-associated microRNAs differs from pathway to pathway. In [47], the authors showed that the targets of a specific microRNA cluster are significantly enriched in multiple pathways. For the majority of diseases, a homogeneous enrichment profile of microRNA targets throughout all pathways emerged. From our analysis of the constituting multipartite graph, we found that pathways are heterogeneously targeted by microRNAs. However, the core set of pathways under strong microRNA control appear to be homogeneously enriched throughout the majority of diseases, since many diseases are linked to a large number of microRNAs. So far, almost two third of the currently known microRNAs are linked via large-scale expression analysis to a phenotype. It is obvious that beside the phenotype responsible microRNAs, many microRNAs are detected as deregulated in human diseases but are not functionally linked to the phenotype.
What could be the biological function of a core set of globally enriched pathways? We showed that these pathways are targets of numerous deregulated microRNAs. One possible hypothesis is that these pathways could serve as disease sensors, transferring the information of erroneous cellular functions via deregulated microRNAs to important output proteins, like cell cycle checkpoints. From this perspective, it is intriguing that most top enriched pathways are associated with apoptotic, proliferation or developmental processes [48]. Entries in the PhenomiR database obtained by patient studies are more than 60% cancer-related diseases. Alterations in the expression or function of genes controlling cell growth and differentiation are considered to be the major cause of cancer. Notably, degenerative disorders like Alzheimer or Parkinson disease show a similar pathway profile compared to cancer-related phenotypes, although often with different direction of microRNA expression.
Presumably, the impact on signaling pathways for disease-associated proteins and microRNA targets differs. However, there might be an interaction between the disease-associated microRNAs and proteins to mediate deregulation of signaling pathways. It would be interesting to evaluate whether a given disease emerges due to protein deregulation caused by mutations with a successive deregulation of microRNAs, or due to deregulated microRNA levels, leading to pathogenic protein levels in turn. For a subset of microRNAs, located in the intron of a host gene, an examination of a common phenotypic effects is possible. Recently, we showed that intronic microRNAs support the regulatory effect of their host genes [49]. Here, we find one disease-associated microRNA-target pair with a common phenotype: both the host gene PTK2 and its intronic microRNA miR-151 are annotated with lung cancer in KEGG DISEASE and PhenomiR, respectively. In this case, the impact on the associated signaling pathways via correlated mir-151 and PTK2 deregulation is probably controlled by a single promoter. To unveil interactions between microRNAs and pathway proteins on a systems level, a much more precise knowledge of microRNA transcriptional regulation is needed.
We analyzed the subcellular location and process type behavior of disease-associated proteins and microRNA targets. Our result on the preferred cellular locations of microRNA targets shows an enrichment of proteins in the nucleus. This finding is in line with a study by Cui et al. [50], who obtained a similar result for the localization of microRNA targets on a much smaller set of signaling networks and microRNAs in mammalian hippocampal CA1 neurons. In addition, we found that disease-associated proteins often constitute the initial players of signaling networks and thus show an opposite pattern to microRNA targets. The deregulation of a single proteins at the cell surface receptor can have a severe impact on the whole signaling information flow stimulated by the receptor. For example, for growth factor receptors, the activation under normal conditions promotes cellular survival, whereas over-expression promotes tumor cell growth [51]. Therefore, cell surface receptors are well suited as drug targets, as diminishing the signal through these receptors has the potential to normalize cellular behavior. The deregulation of a single protein in the intracellular region or the nucleus might influence only a subpart of the signaling network.
A large fraction (50%) of microRNA targets located in the nucleus are involved in transcriptional regulation. It was shown that transcription factors like MYC, JUN, or FOS, have a short mRNA lifetime based on their RNA stability [52], [53]. Within these studies the importance of the 3′ untranslated region for the mRNA stability was mentioned. Thus, microRNAs presumably tune RNA stability in a tissue or stage dependent manner. Deregulated microRNAs changing the stability of transcription factors of a signaling pathway may then lead to malfunction of different cellular processes [54]. Motivated by the affinity of microRNAs to regulate with associated pathways apoptosis, proliferation or development [1], we suppose that the regulation of stability extends to proteins with short half-lives that are required only for limited time in, e.g., cell cycle, growth, or differentiation.
In a recent study, Legewie et al. [55] introduced a set of signal inhibitors with a short mRNA and protein lifetime that are transcriptionally induced upon stimulation. These rapid feedback inhibitors (RFIs) are thought to tune the signal transduction cascades, allow for swift feedback regulation and establish short latency phases after signaling induction. As we found an enrichment of inhibitory proteins targeted by microRNAs, the question arises, if RFI proteins are potential microRNA targets. Using the TargetScanS prediction tool we were able to confirm 18 out of 19 (95%) RFIs as microRNA targets (). We thus assume that the short mRNA lifetime of RFIs can be attributed to the degradation activity promoted by microRNA binding. Inhibiting proteins are preferentially located in the nucleus (see Table S4), whereas activating or ambivalent proteins are randomly distributed in the cellular regions. Interestingly, disease proteins showed a frequent association with ambivalent process type. We assume that for ambivalent proteins, deregulation of the expression levels imparts a more severe effect on signaling cascades as compared to activators or inhibitors alone.
The usage of hypergraphs for a proper representation of interconnected entities in systems biology has been acknowledged recently [56]. Here, we applied a thorough statistical analysis not only to bipartite but to a multipartite graph consisting of microRNAs, proteins, diseases, and signaling pathways in a tissue-specific manner and uncovered the impact of disease-associated microRNAs on human signaling pathways.
Materials and Methods
In this section, we give a detailed overview about the resources and methods, which were used to interconnect the different entities shown in Figure 1.
Human signaling pathway data
Human signaling pathway data was obtained from the National Cancer Institute Pathway Interaction Database (NCI PID) [18], which is a manually curated collection of biomolecular interactions and key cellular processes assembled into signaling pathways. NCI PID holds 128 pathways including 47 sub-networks. We combined all subnetworks with their parent networks to the set of signaling pathways. In addition, we kept all pathways that have more than one predicted microRNA target gene, leading to a final data set of 79 human signaling pathways containing 1573 unique human proteins. The database also provides information on subcellular location terms from the Gene Ontology Consortium. We used this information to divide all subcellular locations into four different groups: extracellular region, cell membrane, intracellular region and nucleus. Finally, location information for 1083 proteins containing 135 extracellular region, 344 cell membrane, 373 intracellular region and 231 proteins located in the nucleus were obtained. In addition, we extracted process type information for each biological process, which can be input, output, positive or negative regulator. In total, there are 1120 interactions of which 765 are activating, 74 inhibiting and 281 proteins acting as activators as well as inhibitors.
Disease-associated microRNAs
Human disease-associated microRNAs were obtained from the PhenomiR database [15]. PhenomiR is a manually curated collection of microRNA-disease associations, containing a total of 11 029 microRNA expression-phenotype relations collected from 542 different experiments. We used patient study data only and obtained 486 disease-associated microRNAs in 83 different diseases including up to 5 subtypes per disorder. For each disease, we take only those microRNA into account, that have at least one target in the specific tissue annotated by PhenomiR and obtained finally 165 different microRNAs in 63 diseases-tissue combinations.
MicroRNA target prediction
Hausser et al. [16] analyzed different features of microRNA targets and showed within their work that TargetScanS has a good performance on different data sets. We used TargetScanS as the main prediction tool but to handle the issue of the unknown reliability of microRNA prediction tools we used several other prediction tools like PicTar, intersection of PicTar and TargetScanS, Miranda, RNA22, and TargetSpy to confirm our results. We used for each method default parameter settings.
MicroRNA targets filtered by tissue expression
As microRNA expression is tissue-specific annotated in PhenomiR, we used the tissue atlas provided by Su et al. [17] to filter potential microRNA targets in a specific tissue. The data was downloaded from the NCBI Gene Expression Omnibus (GEO), and the processed data was used. We mapped the predicted microRNA target transcripts on the tissue atlas and considered a transcript as expressed in a specific tissue, if either one replicate has a present call or both show at least a marginal call, similar to the work of McClintick et al. [57].
Human disease data
Human disease proteins were taken from the KEGG DISEASE database [19]. It associates 5 neurodegenerative disorders, 5 infectious and metabolic disorders and 13 different cancer diseases. Finally, we obtained 909 proteins from 23 different diseases, which are also found in the PhenomiR database. For results obtained by the NCBI OMIM database see Figure S6 and Figure S7.
Pathway profile
Pathway profiles were calculated for all diseases annotated in PhenomiR passing the tissue filter. For each disease-pathway interaction we estimated the enrichment of microRNA targets of disease in pathway defined by a log odds ratio (LOD score):
where is the number of microRNA targets for all disease-associated microRNAs in disease and pathway ; is the number of proteins in pathway ; is the number of microRNA targets for all disease-associated microRNAs in disease over all pathways; : is the number of proteins over all pathways. We use these LOD scores to build up a heatmap using Manhattan distance function and ward clustering. A positive value indicates an enrichments and a negative a depletion. Whenever we identified no target for a particular disease-pathway interaction and therefore the resulting LOD score is . As commonly done, we excluded all cases with for calculating the mean and quantiles for each pathway. In addition, these cases were also excluded from the clustering taking the reduced dimensions into account.
Cellular location analysis
We used the subcellular location annotation of the NPI PID database to estimate the microRNA target enrichment. The enrichment was calculated by the logarithm of base 2 of the odds ratio (LOD score) and its significants was obtained by Fisher's exact test.
Process type analysis
In addition to the subcellular location, the NPI database provides information about specific process types of proteins in signaling processes. We used this information to analyze the interaction between inhibiting as well as activating proteins in signaling processes. Within this analysis we calculated the enrichment of microRNA targets as well as KEGG DISEASE proteins for different process types. The enrichment was calculated by the logarithm of base 2 of the odds ratio (LOD score) and its significants was obtained by Fisher's exact test.
Supporting Information
Acknowledgments
We would like to thank Sebastian Bohl, Jan Krumsiek, and Florian Blöchl for stimulating remarks.
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work was supported by the Federal Ministry of Education and Research (BMBF) and its MedSys initiative (project ‘LungSys’) and the Helmholtz Alliance on Systems Biology (project ‘CoReNe’). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Kloosterman WP, Plasterk RHA. The diverse functions of microRNAs in animal development and disease. Dev Cell. 2006;11:441–450. doi: 10.1016/j.devcel.2006.09.009. [DOI] [PubMed] [Google Scholar]
- 2.Valencia-Sanchez MA, Liu J, Hannon GJ, Parker R. Control of translation and mRNA degradation by miRNAs and siRNAs. Genes Dev. 2006;20:515–524. doi: 10.1101/gad.1399806. [DOI] [PubMed] [Google Scholar]
- 3.Chi SW, Zang JB, Mele A, Darnell RB. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. Nature. 2009;460:479–486. doi: 10.1038/nature08170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.O'Connell RM, Taganov KD, Boldin MP, Cheng G, Baltimore D. MicroRNA-155 is induced during the macrophage inflammatory response. Proc Natl Acad Sci U S A. 2007;104:1604–1609. doi: 10.1073/pnas.0610731104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Davis BN, Hilyard AC, Lagna G, Hata A. SMAD proteins control DROSHA-mediated microRNA maturation. Nature. 2008;454:56–61. doi: 10.1038/nature07086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Suzuki HI, Yamagata K, Sugimoto K, Iwamoto T, Kato S, et al. Modulation of microRNA processing by p53. Nature. 2009;460:529–533. doi: 10.1038/nature08199. [DOI] [PubMed] [Google Scholar]
- 7.Leucht C, Stigloher C, Wizenmann A, Klafke R, Folchert A, et al. MicroRNA-9 directs late organizer activity of the midbrain-hindbrain boundary. Nat Neurosci. 2008 doi: 10.1038/nn.2115. [DOI] [PubMed] [Google Scholar]
- 8.Kennell JA, Gerin I, MacDougald OA, Cadigan KM. The microRNA miR-8 is a conserved negative regulator of Wnt signaling. Proc Natl Acad Sci U S A. 2008;105:15417–15422. doi: 10.1073/pnas.0807763105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ricarte-Filho JCM, Fuziwara CS, Yamashita AS, Rezende E, Da-Silva MJ, et al. Effects of let-7 microRNA on Cell Growth and Differentiation of Papillary Thyroid Cancer. Translational oncology. 2009;2:236–41. doi: 10.1593/tlo.09151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120:15–20. doi: 10.1016/j.cell.2004.12.035. [DOI] [PubMed] [Google Scholar]
- 11.Calin GA, Croce CM. MicroRNA signatures in human cancers. Nat Rev Cancer. 2006;6:857–866. doi: 10.1038/nrc1997. [DOI] [PubMed] [Google Scholar]
- 12.Couzin J. MicroRNAs make big impression in disease after disease. Science. 2008;319:1782–1784. doi: 10.1126/science.319.5871.1782. [DOI] [PubMed] [Google Scholar]
- 13.Jones S, Zhang X, Parsons DW, Lin JCH, Leary RJ, et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–1806. doi: 10.1126/science.1164368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Griffiths-Jones S. The microRNA Registry. Nucleic Acids Res. 2004;32:D109–11. doi: 10.1093/nar/gkh023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ruepp A, Kowarsch A, Schmidl D, Bruggenthin F, Brauner B, et al. PhenomiR: a knowledgebase for microRNA expression in diseases and biological processes. Genome biology. 2010;11:R6. doi: 10.1186/gb-2010-11-1-r6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hausser J, Landthaler M, Jaskiewicz L, Gaidatzis D, Zavolan M. Relative contribution of sequence and structure features to the mRNA binding of Argonaute/EIF2C-miRNA complexes and the degradation of miRNA targets. Genome research. 2009;19:2009–20. doi: 10.1101/gr.091181.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, et al. A gene atlas of the mouse and human protein-encoding transcriptomes. Proceedings of the National Academy of Sciences of the United States of America. 2004;101:6062–7. doi: 10.1073/pnas.0400782101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, et al. PID: the Pathway Interaction Database. Nucleic Acids Res. 2008 doi: 10.1093/nar/gkn653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:480–484. doi: 10.1093/nar/gkm882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hamosh A, Scott AF, Amberger JS, Bocchini CA, Mckusick VA. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2005;33, Database issue, D514-D517 doi: 10.1093/nar/gki033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Shimodaira H. An approximately unbiased test of phylogenetic tree selection. Systematic biology. 2002;51:492–508. doi: 10.1080/10635150290069913. [DOI] [PubMed] [Google Scholar]
- 22.Wu E, Palmer N, Tian Z, Moseman AP, Galdzicki M, et al. Comprehensive dissection of PDGF-PDGFR signaling pathways in PDGFR genetically defined cells. PLoS ONE. 2008;3:e3794. doi: 10.1371/journal.pone.0003794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Heldin CH, Johnsson A, Wennergren S, Wernstedt C, Betsholtz C, et al. A human osteosarcoma cell line secretes a growth factor structurally related to a homodimer of PDGF A-chains. Nature. 1986;319:511–514. doi: 10.1038/319511a0. [DOI] [PubMed] [Google Scholar]
- 24.Li X, Ponten A, Aase K, Karlsson L, Abramsson A, et al. PDGF-C is a new protease-activated ligand for the PDGF alpha-receptor. Nat Cell Biol. 2000;2:302–309. doi: 10.1038/35010579. [DOI] [PubMed] [Google Scholar]
- 25.Ross R, Glomset J, Kariya B, Harker L. A platelet-dependent serum factor that stimulates the proliferation of arterial smooth muscle cells in vitro. Proc Natl Acad Sci U S A. 1974;71:1207–1210. doi: 10.1073/pnas.71.4.1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.George D. Targeting PDGF receptors in cancer–rationales and proof of concept clinical trials. Advances in experimental medicine and biology. 2003;532:141–51. doi: 10.1007/978-1-4615-0081-0_12. [DOI] [PubMed] [Google Scholar]
- 27.Perrone G, Vincenzi B, Zagami M, Santini D, Panteri R, et al. Reelin expression in human prostate cancer: a marker of tumor aggressiveness based on correlation with grade. Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc. 2007;20:344–51. doi: 10.1038/modpathol.3800743. [DOI] [PubMed] [Google Scholar]
- 28.Seigel GM, Hackam AS, Ganguly A, Mandell LM, Gonzalez-Fernandez F. Human embryonic and neuronal stem cell markers in retinoblastoma. Molecular vision. 2007;13:823–32. [PMC free article] [PubMed] [Google Scholar]
- 29.Wang W, Chen JX, Liao R, Deng Q, Zhou JJ, et al. Sequential activation of the MEK-extracellular signal-regulated kinase and MKK3/6-p38 mitogen-activated protein kinase pathways mediates oncogenic ras-induced premature senescence. Mol Cell Biol. 2002;22:3389–3403. doi: 10.1128/MCB.22.10.3389-3403.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Evangelisti C, Florian MC, Massimi I, Dominici C, Giannini G, et al. MiR-128 up-regulation inhibits Reelin and DCX expression and reduces neuroblastoma cell motility and invasiveness. The FASEB journal : official publication of the Federation of American Societies for Experimental Biology. 2009;23:4276–87. doi: 10.1096/fj.09-134965. [DOI] [PubMed] [Google Scholar]
- 31.McKenzie BS, Kastelein RA, Cua DJ. Understanding the IL-23-IL-17 immune pathway. Trends Immunol. 2006;27:17–23. doi: 10.1016/j.it.2005.10.003. [DOI] [PubMed] [Google Scholar]
- 32.Irminger-Finger I, Jefford CE. Is there more to BARD1 than BRCA1? Nature reviews Cancer. 2006;6:382–91. doi: 10.1038/nrc1878. [DOI] [PubMed] [Google Scholar]
- 33.Bruning JC, Gillette JA, Zhao Y, Bjorbaeck C, Kotzka J, et al. Ribosomal subunit kinase-2 is required for growth factor-stimulated transcription of the c-Fos gene. Proc Natl Acad Sci U S A. 2000;97:2462–2467. doi: 10.1073/pnas.97.6.2462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Okazaki K, Sagata N. The Mos/MAP kinase pathway stabilizes c-Fos by phosphorylation and augments its transforming activity in NIH 3T3 cells. EMBO J. 1995;14:5048–5059. doi: 10.1002/j.1460-2075.1995.tb00187.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bossis G, Malnou CE, Farras R, Andermarcher E, Hipskind R, et al. Down-regulation of c-Fos/c-Jun AP-1 dimer activity by sumoylation. Mol Cell Biol. 2005;25:6964–6979. doi: 10.1128/MCB.25.16.6964-6979.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bonizzi G, Karin M. The two NF-kappaB activation pathways and their role in innate and adaptive immunity. Trends Immunol. 2004;25:280–288. doi: 10.1016/j.it.2004.03.008. [DOI] [PubMed] [Google Scholar]
- 37.Perkins ND. Integrating cell-signalling pathways with NF-kappaB and IKK function. Nat Rev Mol Cell Biol. 2007;8:49–62. doi: 10.1038/nrm2083. [DOI] [PubMed] [Google Scholar]
- 38.Chiao PJ, Na R, Niu J, Sclabas GM, Dong Q, et al. Role of Rel/NF-kappaB transcription factors in apoptosis of human hepatocellular carcinoma cells. Cancer. 2002;95:1696–1705. doi: 10.1002/cncr.10829. [DOI] [PubMed] [Google Scholar]
- 39.Gerondakis S, Grossmann M, Nakamura Y, Pohl T, Grumont R. Genetic approaches in mice to understand Rel/NF-kappaB and IkappaB function: transgenics and knockouts. Oncogene. 1999;18:6888–6895. doi: 10.1038/sj.onc.1203236. [DOI] [PubMed] [Google Scholar]
- 40.Krek A, Grün D, Poy MN, Wolf R, Rosenberg L, et al. Combinatorial microRNA target predictions. Nature genetics. 2005;37:495–500. doi: 10.1038/ng1536. [DOI] [PubMed] [Google Scholar]
- 41.John B, Enright AJ, Aravin A, Tuschl T, Sander C, et al. Human microRNA targets. PLoS Biol. 2004;2e363 doi: 10.1371/journal.pbio.0020363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.TargetSpy. 2009. URL http://webclu.bio.wzw.tum.de/targetspy/
- 43.Miranda KC, Huynh T, Tay Y, Ang YS, Tam WL, et al. A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell. 2006;126:1203–1217. doi: 10.1016/j.cell.2006.07.031. [DOI] [PubMed] [Google Scholar]
- 44.Sethupathy P, Megraw M, Hatzigeorgiou AG. A guide through present computational approaches for the identification of mammalian microRNA targets. Nat Methods. 2006;3:881–886. doi: 10.1038/nmeth954. [DOI] [PubMed] [Google Scholar]
- 45.Stenson PD, Ball EV, Mort M, Phillips AD, Shiel JA, et al. Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat. 2003;21:577–581. doi: 10.1002/humu.10212. [DOI] [PubMed] [Google Scholar]
- 46.Friedman RC, Farh KKH, Burge CB, Bartel DP. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 2009;19:92–105. doi: 10.1101/gr.082701.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Xu J, Wong C. A computational screen for mouse signaling pathways targeted by microRNA clusters. RNA. 2008;14:1276–1283. doi: 10.1261/rna.997708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Jovanovic M, Hengartner MO. miRNAs and apoptosis: RNAs to die for. Oncogene. 2006;25:6176–87. doi: 10.1038/sj.onc.1209912. [DOI] [PubMed] [Google Scholar]
- 49.Lutter D, Marr C, Krumsiek J, Lang E, Theis F. Intronic microRNAs support their host genes by mediating synergistic and antagonistic regulatory effects. BMC genomics. 2010;11:224. doi: 10.1186/1471-2164-11-224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cui Q, Yu Z, Purisima EO, Wang E. Principles of microRNA regulation of a human cellular signaling network. Molecular Systems Biology. 2006:1–7. doi: 10.1038/msb4100089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Adams GP, Weiner LM. Monoclonal antibody therapy of cancer. Nat Biotechnol. 2005;23:1147–1157. doi: 10.1038/nbt1137. [DOI] [PubMed] [Google Scholar]
- 52.Yeilding NM, Rehman MT, Lee WM. Identification of sequences in c-myc mRNA that regulate its steady-state levels. Mol Cell Biol. 1996;16:3511–3522. doi: 10.1128/mcb.16.7.3511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kabnick KS, Housman DE. Determinants that contribute to cytoplasmic stability of human c-fos and beta-globin mRNAs are located at several sites in each mRNA. Mol Cell Biol. 1988;8:3244–3250. doi: 10.1128/mcb.8.8.3244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hollams EM, Giles KM, Thomson AM, Leedman PJ. MRNA stability and the control of gene expression: implications for human disease. Neurochem Res. 2002;27:957–980. doi: 10.1023/a:1020992418511. [DOI] [PubMed] [Google Scholar]
- 55.Legewie S, Herzel H, Westerhoff HV, Blüthgen N. Recurrent design patterns in the feedback regulation of the mammalian signalling network. Molecular Systems Biology. 2008:1–7. doi: 10.1038/msb.2008.29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Klamt S, Haus UU, Theis F. Hypergraphs and cellular networks. PLoS Comput Biol. 2009;5:e1000385. doi: 10.1371/journal.pcbi.1000385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.McClintick JN, Edenberg HJ. Effects of filtering by Present call on analysis of microarray experiments. BMC bioinformatics. 2006;7:49. doi: 10.1186/1471-2105-7-49. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.