Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2011 Nov 30;40(Database issue):D198–D204. doi: 10.1093/nar/gkr1159

miRNEST database: an integrative approach in microRNA search and annotation

Michał Wojciech Szcześniak 1,*, Sebastian Deorowicz 2, Jakub Gapski 1, Łukasz Kaczyński 1, Izabela Makałowska 1,*
PMCID: PMC3245016  PMID: 22135287

Abstract

Despite accumulating data on animal and plant microRNAs and their functions, existing public miRNA resources usually collect miRNAs from a very limited number of species. A lot of microRNAs, including those from model organisms, remain undiscovered. As a result there is a continuous need to search for new microRNAs. We present miRNEST (http://mirnest.amu.edu.pl), a comprehensive database of animal, plant and virus microRNAs. The core part of the database is built from our miRNA predictions conducted on Expressed Sequence Tags of 225 animal and 202 plant species. The miRNA search was performed based on sequence similarity and as many as 10 004 miRNA candidates in 221 animal and 199 plant species were discovered. Out of them only 299 have already been deposited in miRBase. Additionally, miRNEST has been integrated with external miRNA data from literature and 13 databases, which includes miRNA sequences, small RNA sequencing data, expression, polymorphisms and targets data as well as links to external miRNA resources, whenever applicable. All this makes miRNEST a considerable miRNA resource in a sense of number of species (544) that integrates a scattered miRNA data into a uniform format with a user-friendly web interface.

INTRODUCTION

Animal and plant miRNA genes are transcribed by RNA polymerase II or III into a primary transcript, called pri-miRNA (1). During initial steps of miRNA biogenesis, pri-miRNA is cut and a hairpin-shaped intermediate, called pre-miRNA, is produced. This process is catalyzed by Drosha in animals (2) and DCL1 (DICER-LIKE 1) in plants (3). Subsequently, a pre-miRNA is specifically cut at stem part of the hairpin and a miRNA/miRNA* duplex with 2-nt overhangs at 3′-ends is released. In animals this process is run by Dicer (2) and in plants it is controlled by DCL1 (3). In cytoplasm, one of duplex components, referred to as mature miRNA, gets incorporated into a riboprotein complex, named RISC (RNA-induced silencing complex) (4). RISC contains a functional unit, which allows regulation of the gene expression based on complementarity of the miRNA and the transcript of targeted gene. There are two modes of the regulation: by cleaving transcripts (5) and by inhibiting translation (6). The first one requires high complementarity between miRNA and targeted transcript and is ubiquitously observed in plants, while in animals it is translation inhibition that constitutes a major mechanism of miRNA action.

miRNAs regulate the expression of thousands of genes in plants and animals and are key players in developmental (7), stress-related (8) and signalling processes (9). A number of miRNAs have been associated with diseases in human, e.g. Alzheimer's disease (10), pancreatic cancer (11), or leukemia (12). Hence, identification of miRNAs and subsequent elucidation of their functions, both in plants and animals, became a critical issue not only in molecular biology but also in medical research and agriculture.

Recently, a number of investigations aimed at the identification of miRNAs have been published. Reported miRNAs were discovered either based on computational (13) or experimental approaches (14). Consequently, the growing number of miRNA studies led to accumulation of miRNA databases. However, many of these databases, like miRO (15) or miROrtho (16), are limited to species of high interest. Other resources are focused on selected taxa, e.g. microPC (13) and PMRD (17) contain only plant miRNAs, CoGemiR (18) consists of miRNAs from selected animal species and Vir-Mir db (19) is dedicated to viruses only. The main miRNA repository database, miRBase (20), although accommodates data from a wide range of species, contains only already published results. Similarly, databases like miRecords (21) or miRNAMap (22) consist exclusively of experimentally verified miRNAs. Contrary, microPC dataset is based solely on computational methods. Therefore, despite of the ever-growing number of miRNA-related resources, there is a lack of a single universal repository and users need to browse through a number of dispersed data sets to collect information related to specific species or miRNA type.

To overcome these limitations, we developed miRNEST as a comprehensive online resource for plant, animal and virus miRNAs. We applied a comparative approach to search for new microRNAs in animal and plant EST sequences from dbEST (23). The approach applied by us made it possible to identify 10 004 miRNA candidates in 221 animal and 199 plant species. To the best of our knowledge, for 236 species no miRNAs have been known before. Besides miRNA identification, we also performed miRNA target search in plant candidates and for 29 species we collected small RNA reads from Gene Expression Omnibus (24). Since our goal was not only to identify new miRNAs but also to develop a resource that would integrate miRNA data scattered across literature and databases, we complemented data resulting from our computational analysis with miRNA sequences from three other databases and two publications. In addition, based on availability, we incorporated the data from twelve resources providing further annotations of miRNAs from selected species. This gives the possibility to access, search and browse data from different resources simultaneously. Altogether the miRNEST database contains 39 122 miRNAs from 544 species. All data are presented in the same format via miRNEST interface and are available at http://mirnest.amu.edu.pl.

Data acquisition

We imported 16 961 known mature miRNA sequences from miRBase (20) and 9212 sequences from PMRD (17), which were used for comparative analysis and identification of conserved miRNAs. EST sequences of 225 animal and 202 plant species were downloaded from dbEST (23). We took into consideration only the species that had at least 10 000 EST sequences. For EST annotation, the UniProtKB/Swiss-Prot protein data set was obtained from UniProt (25), and to remove tRNAs and rRNAs from ESTs and for further miRNA annotation, we obtained ncRNA sequences from RFAM release 9.1 (26). Also, 192 small RNA deep sequencing libraries were downloaded from GEO, Gene Expression Omnibus (24).

In addition to mature miRNA and pre-miRNA data from miRBase and PMRD, we downloaded sequences from microPC (18) and two publications (27,28). The choice of databases and papers was made based on the overlap between resources and the availability of the data. For example, the sets of miRNAs deposited in miRNAMap (22) and RNAdb (29) fully overlapped with miRBase and therefore were not considered for the download. On the other hand, data from resources like miROrtho (16) or GrapeMiRNA (30) could not be downloaded and consequently, were not integrated with miRNEST. Targets data were collected from the largest and most recognized resources as miRDB (31), miRTarBase (32), miRecords (21), PMRD and ASRP (33). We also obtained miRNA expression data from PhenomiR 2.0 (34) and dbDEMC (35), miRNA regulation data and promoters from dPORE-miRNA (36) and PMRD, genomics information from CoGemiR v1.2b (18), miRNA polymorphisms from Patrocles (37) and imprinting data from ncRNAimprint (38). In particular, we took advantage of data available in miRBase, which among others provided us with literature references, links to external databases, genomics data, and served as a source of uniform, non-redundant miRNA nomenclature, which was of great importance when integrating the miRNEST external miRNA data. The list of all databases used for the data assembly is provided in Supplementary Table S1 and the range of miRNA overlap between resources in Supplementary Table S2.

Prediction of miRNAs

The prediction of animal and plant microRNA candidates was performed according to modified algorithm applied by Zhang et al. (39) and Mhuantong et al. (13) (Figure 1). The modifications include removal of sequences with high percentage of low-complexity regions, assembling ESTs into contigs using CAP3 (40) and position-based allowance for substitutions in mature miRNAs. Schema of all data processing steps is presented in Figure 1.

Figure 1.

Figure 1.

A computational pipeline applied for prediction of microRNAs in EST sequences and their annotation. The seven steps in miRNA search part are designed to minimize the false positives rate and provide a high quality set of candidates. In some of them, plant- and animal-specific parameters were applied, as described in the main text. The annotation part serves to provide more data on predicted miRNAs and no candidates are discarded there.

In the first step, mature miRNA sequences from miRBase and PMRD were searched against EST sequences using BLASTN (41). We filtered the BLASTN search results, based on pre-computed distribution of mismatches in mature miRNAs within miRNA homologs in animals and plants (Supplementary Figure S1). Then, EST sequences, clustered based on the similarity to the same miRNA, were assembled into contigs using CAP3. This step is important in plant microRNA prediction where pre-miRNA length occasionally exceeds 600 nucleotides (Supplementary Figure S2). Next, we ran BLASTN search against RFAM non-coding RNA sequences to remove rRNAs and tRNAs (with E-value < 1e-20). After that we searched for low-complexity regions using Dustmasker (42) and removed sequences containing more than 60% of low complexity regions. Then, RNA secondary structures were predicted using RNAfold (43). We filtered the secondary structures and left only the candidates with mature miRNA located in a stem part of the hairpin structure and with no more than five mismatches and two bulges between mature miRNA and the opposite hairpin arm (Supplementary Figure S3). At the same time, we extracted the hairpin sequences out of longer contig or singleton. After that, we performed a BLASTX search against UniProt to remove sequences with high similarity to proteins (E-value < 1e-20). Finally, we removed animal pre-miRNAs with length exceeding 215 nt, which is the maximum length for animal pre-miRNA in miRBase. As a result, we obtained 10 004 miRNA candidates in 221 animal and 199 plant species.

The ratio of sequences removed from the data set at various filtering steps differed between species; examples for Arabidopsis thaliana and Bos taurus are presented in Supplementary Table S3.

miRNA prediction accuracy

Specificity

To assess the specificity of miRNEST algorithm we generated one million random sequences that were subsequently subject to the algorithm. The initial number of BLAST hits, i.e. mature miRNAs against generated sequences, was 231 260. After filtering steps, Dustmasker and BLASTN against RFAM, the number of candidates has decreased substantially to 1120 and only three candidates left after secondary structure check point. The last step, BLAST against UniProt, reduced the number of candidates to two, which produces the false positives ratio of 0.0002% (two per million). This result is comparable with the one calculated in a very similar way by authors of microPC database (13). Applying a similar approach they obtained the ratio of 0.00064%. We have also checked how many of our human miRNA predictions are classified as true miRNAs by microPred (44), a tool for miRNA/non-miRNA classification. This tool was specifically designed for human pre-miRNAs and it classified as miRNAs 75.9% of miRNEST predicted human miRNAs. At the same time, 90.16% of miRBase human pre-miRNAs are considered as true miRNAs by this program.

Sensitivity

To estimate the sensitivity of applied algorithm known pre-miRNAs of B. taurus and A. thaliana, from miRBase database, were subjected to miRNA search. Out of 229 Arabidopsis hairpins, 209 (91.3%) have been recovered. In case of B. taurus, the result was lower and only 61.1% (392 out of 662) of hairpins have been recognized as miRNA by miRNEST algorithm. This is mostly because of the architecture of animal hairpins: most of the miRNAs that have not been recovered bear more than five mismatches and/or two bulges between mature miRNA and the opposite hairpin arm. Nonetheless, we decided to keep this filtering threshold in order to maintain the low false positives ratio. Authors of microPC database, where miRNAs are also predicted based on EST sequences analysis, estimated the sensitivity of their method for all analyzed plant species as 81.3%. We did calculation separately for individual species but on average, the rate of specificity is similar to this obtained in microPC.

Further annotation of predicted miRNAs

Plant mature miRNAs generally show a high level of complementarity with targeted mRNA sequences, thus target identification usually is not a very complex problem here. On the other hand, effective methods for target search in animals heavily rely on the evolutionary conservation of miRNA target site (45). Since such data are unavailable for a great majority of analyzed by us animal sequences, we decided to focus on target search for plant candidates and we used external target data for animal species whenever possible.

The targets were searched for in EST sequences from corresponding species applying standard rules of plant miRNA–mRNA interactions (46). In the final scoring a mismatch was given a score of 1, a wobble (G:U) was given a score of 0.5 and a bulge was given a score of 2.0. All matches with score above 3.5 were discarded. It was also important that positions 10 and 11 of miRNA perfectly matched to its target and that there was no more than 1 mismatch at positions 2 to 9. As a result, we identified targets for 6963 mature miRNAs in 187 plant species. Sequences of all potentially targeted ESTs were checked against UniProt data for functional annotations.

For 29 species, we downloaded small RNA deep sequencing libraries from GEO. The reads were mapped to predicted pre-miRNAs using Bowtie (47). The mapping was performed against both pre-miRNA strands and only one mismatch was allowed. The choice of species for which deep sequencing data were downloaded was solely based on the availability.

Processing of external data

The goal of our project was not only to predict miRNAs in EST sequences but also to build a comprehensive miRNA database. Therefore, we supplemented data set from our computational analyses with the sequence data from three external resources: miRBase, PMRD, microPC and two publications (27,28) reporting miRNAs not deposited in miRBase. All imported sequences were run through our pipeline so that all data are deposited and displayed in identical format. However, filtering steps were turned off to ensure that all external miRNAs would be incorporated into the miRNEST database even if they did not match criteria applied in miRNEST algorithm. Finally, we run reciprocal BLASTN search: each miRNEST pre-miRNA against each other to identify similar miRNAs across data sets.

Web interface

The miRNEST web interface has been divided into five sections that correspond to distinct types of data.

Browse

By clicking Browse in main menu, the user gets access to all miRNA sequences: miRNEST predictions and miRNAs from external sources. For each miRNA record we provide miRNA* sequence, coordinates of miRNA and miRNA* in pre-miRNA, guide strand, number of mismatches and bulges between mature miRNA and the opposite hairpin arm, secondary structures of pre-miRNAs, family assignment, experimental evidence and identical miRNEST, miRBase, PMRD or microPC miRNAs, whenever found (Figure 2). There are also links to miRNEST internal resources providing access to results of BLAST search, target predictions in EST sequences as well as target information from external sources, most similar mature miRNAs, source sequences for miRNAs predicted from ESTs, graphical display of deep sequencing reads mapping as well as data collected from external resources: miRNA genomics data, SNPs, promoters, TFBSs, miRNA-disease association, miRNA polymorphism, expression data, literature references, links to external resources and information on imprinted miRNAs. From here users have also opportunity to run BLASTN searches against selected sources of miRNA sequences and to build ClustalW (48) alignments based on BLASTN results. If species is selected, its full NCBI taxonomy is provided (49). By clicking a taxon, all miRNEST species belonging to the taxon are displayed as active links to miRNA data in corresponding species. Users can also browse through species in a taxonomic tree view.

Figure 2.

Figure 2.

(a) An example of miRNEST record (MNEST002602) (b) by clicking analyse link it is possible to perform BLASTN search and ClustalW analysis. (c) deep sequencing link leads to mapping pattern of small RNA deep sequencing reads.

Search

Using a search option users have the possibility to filter miRNA data by a number of parametres: species, sequence source (miRNEST prediction and/or external sources), mature miRNA sequence or its part, hairpin length, free folding energy, number of allowed mismatches and bulges between mature miRNA and the opposite arm in pre-miRNA hairpin structure, and E-values for BLAST search that was pre-run against UniProt, RFAM, miRBase, PMRD, and microPC. We also incorporated the target search option, where user can search for specific targets by typing mature miRNA sequence and selecting a species of interest. Moreover, users can limit their search to the records that have additional data, downloaded from outside resources, like experimental evidence or target sequences.

Unclassified

Unclassified section provides users with a list of miRNEST predictions that were not classified as potential miRNAs as they violated at least one of the following criteria: E-value for BLASTX search against UniProt >1e-20 or pre-miRNA length for animal candidate ≤215 nucleotides. There are 66 predictions that fall into this section due to the first criterion, 465—the latter and 8 predictions that violence both criteria. Unclassified and Search possess a very similar interface, however some options are unavailable in Unclassified category.

RNA-Seq

This component of the miRNEST database presents small RNA sequencing data that correspond to miRNEST predictions from 29 species (Supplementary Table S4). The user can select species, miRNA of interest, small RNA deep sequencing library, strand and number of allowed mismatches between reads and pre-miRNA. The results are displayed in a graphical form as an alignment of reads to the pre-miRNA. If no library is specified, then unique reads from all libraries from a given species are displayed. Otherwise, the read counts, as transcripts per million, are supplied. RNA-Seq data are also accessible from Browse section through deep sequencing internal link that corresponds to a given miRNA record.

Taxonomy

Taxonomy option provides a user with a phylogenetic tree of species that are included in miRNEST predictions. It presents clickable taxon names (kingdom, phylum, subclass, order, family, genus, and species) along with a number of taxon-specific miRNA families. By clicking on the taxon, the user gains access to more detailed information on the taxon-specific miRNA families and links to corresponding miRNEST records.

Comparison with other miRNA databases

miRNEST is a large collection of miRNA sequences and associated miRNA data (Supplementary Table 1). It encompasses 522 animal and plant species as well as 22 viruses (Supplementary Table S5) and as many as 9429 miRNA predictions can be found uniquely in miRNEST, though a fraction of the predictions overlap with known miRNA data (Supplementary Table S5). For comparison, miRBase (Release 17), the second largest miRNA database collects data from only 153 distinct animals, plants and viruses. As many as 236 miRNEST species were not taken into account in any other miRNA database. These include a number of model species as well as the ones of high interest in agriculture and medical research like Salmo salar, Takifugu rubripes, Actinidia chinensis, Anolis carolinensis, Bos indicus or Trichinella spiralis.

Another distinguishing feature of miRNEST is the extent of usage of small RNA deep sequencing reads. We incorporated data from 192 libraries from 29 species retrieved from GEO and mapped the reads to predicted pre-miRNAs. We carefully selected the libraries in order to make sure that they encompass a wide array of tissues, developmental stages and conditions. For comparison, deepBase (50), which is a platform for annotating and discovering small and long ncRNAs from next-generation sequencing, collects data from 185 small RNA libraries from seven species and in miRBase, Release 17, deep sequencing data has been added for seven species, yet this functionality is now being expanded to more species.

miRNEST also gives access to large-scale target search predictions for 187 plant species that were generated using standard rules for plant miRNA target prediction. For more than half of the species this is the first and only miRNA targets prediction.

Finally, miRNEST predictions are complemented with a wide range of external data retrieved from 13 databases. Our goal was to create a resource that would integrate the miRNA data that are currently scattered across multiple resources and to limit existing necessity of searching multiple databases to investigate a single miRNA or miRNAs from a given species.

AVAILABILITY AND REQUIREMENTS

miRNEST is freely available at http://mirnest.amu.edu.pl. The database was constructed using Hypertext Markup Language (HTML), Cascading Style Sheets (CSS), PHP 5.2.11 (http://www.php.net/), and MySQL 4.0.31 (http://www.mysql.com/). pre-miRNA secondary structures are drawn using Java lightweight applet VARNA (51) which requires the installation of Java plugin.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Figures 1–3, Supplementary Tables 1–5.

FUNDING

Faculty of Biology at Adam Mickiewicz University in Poznan, Poland (PBWB-08/2011); the European Social Fund within the project—‘Fellowships for PhD Students in Wielkopolska Region conducting research in strategic areas supporting regional development’; the Ministry of Science and Higher Education (N N301 160935 to I.M. and N N516 441938 to S. D.). Funding for open access charge: Faculty of Biology at Adam Mickiewicz University in Poznan.

Conflict of interest statement. None declared.

ACKNOWLEDGEMENTS

The authors thank Professor Wojciech Makałowski for proofreading the article and Joanna Ciomborowska, Michał Kabza and Elżbieta Owczarkowska for helpful suggestions on the database web interface.

REFERENCES

  • 1.Cai X, Hagedorn CH, Cullen BR. Human microRNAs are processed from capped, polyadenylated transcripts that can also function as mRNAs. RNA. 2004;10:1957–1966. doi: 10.1261/rna.7135204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Lee Y, Ahn C, Han J, Choi H, Kim J, Yim J, Lee J, Provost P, Rådmark O, Kim S, et al. The nuclear RNase III Drosha initiates microRNA processing. Nature. 2003;425:415–419. doi: 10.1038/nature01957. [DOI] [PubMed] [Google Scholar]
  • 3.Kurihara Y, Takashi Y, Watanabe Y. The interaction between DCL1 and HYL1 is important for efficient and precise processing of pri-miRNA in plant microRNA biogenesis. RNA. 2006;12:206–212. doi: 10.1261/rna.2146906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lau NC, Lim LP, Weinstein EG, Bartel DP. An abundant class of tiny RNAs with probable regulatory roles in Caenorhabditis elegans. Science. 2001;294:858–862. doi: 10.1126/science.1065062. [DOI] [PubMed] [Google Scholar]
  • 5.Reinhart BJ, Weinstein EG, Rhoades MW, Bartel B, Bartel DP. MicroRNAs in plants. Genes Dev. 2002;16:1616–1626. doi: 10.1101/gad.1004402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lai EC. Micro RNAs are complementary to 3' UTR sequence motifs that mediate negative post-transcriptional regulation. Nat. Genet. 2002;30:363–364. doi: 10.1038/ng865. [DOI] [PubMed] [Google Scholar]
  • 7.Kedde M, Agami R. Interplay between microRNAs and RNA-binding proteins determines developmental processes. Cell Cycle. 2008;7:899–903. doi: 10.4161/cc.7.7.5644. [DOI] [PubMed] [Google Scholar]
  • 8.Leung AK, Sharp PA. MicroRNA functions in stress responses. Mol. Cell. 2010;40:205–215. doi: 10.1016/j.molcel.2010.09.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.O'Neill LA, Sheedy FJ, McCoy CE. MicroRNAs: the fine-tuners of Toll-like receptor signalling. Nat. Rev. Immunol. 2011;11:163–175. doi: 10.1038/nri2957. [DOI] [PubMed] [Google Scholar]
  • 10.Yao J, Hennessey T, Flynt A, Lai E, Beal MF, Lin MT. MicroRNA-related cofilin abnormality in Alzheimer's disease. PLoS One. 2010;5:e15546. doi: 10.1371/journal.pone.0015546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Brabletz S, Bajdak K, Meidhof S, Burk U, Niedermann G, Firat E, Wellner U, Dimmler A, Faller G, Schubert J, et al. The ZEB1/miR-200 feedback loop controls Notch signalling in cancer cells. EMBO J. 2011;30:770–782. doi: 10.1038/emboj.2010.349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Schotte D, Pieters R, Den Boer ML. MicroRNAs in acute leukemia: from biological players to clinical contributors. Leukemia. 2011 doi: 10.1038/leu.2011.151. June 24 (doi:10.1038/leu.2011.151; epub ahead of print) [DOI] [PubMed] [Google Scholar]
  • 13.Mhuantong W, Wichadakul D. MicroPC (microPC): a comprehensive resource for predicting and comparing plant microRNAs. BMC Genomics. 2009;10:366. doi: 10.1186/1471-2164-10-366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zhang L, Chia JM, Kumari S, Stein JC, Liu Z, Narechania A, Maher CA, Guill K, McMullen MD, Ware D. A genome-wide characterization of microRNA genes in maize. PLoS Genet. 2009;5:e1000716. doi: 10.1371/journal.pgen.1000716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Laganà A, Forte S, Giudice A, Arena MR, Puglisi PL, Giugno R, Pulvirenti A, Shasha D, Ferro A. miRò: a miRNA knowledge base. Database. 2009;2009:bap008. doi: 10.1093/database/bap008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Gerlach D, Kriventseva EV, Rahman N, Vejnar CE, Zdobnov EM. miROrtho: computational survey of microRNA genes. Nucleic Acids Res. 2009;37:D111–7. doi: 10.1093/nar/gkn707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zhang Z, Yu J, Li D, Zhang Z, Liu F, Zhou X, Wang T, Ling Y, Su Z. PMRD: plant microRNA database. Nucleic Acids Res. 2010;38:D806–813. doi: 10.1093/nar/gkp818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Maselli V, Di Bernardo D, Banfi S. CoGemiR: a comparative genomics microRNA database. BMC Genomics. 2008;9:457. doi: 10.1186/1471-2164-9-457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Li SC, Shiau CK, Lin WC. Vir-Mir db: prediction of viral microRNA candidate hairpins. Nucleic Acids Res. 2008;36:D184–D189. doi: 10.1093/nar/gkm610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kozomara A, Griffiths-Jones S. miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011;39:D152–D157. doi: 10.1093/nar/gkq1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Xiao F, Zuo Z, Cai G, Kang S, Gao X, Li T. miRecords: an integrated resource for microRNA-target interactions. Nucleic Acids Res. 2009;37:D105–D110. doi: 10.1093/nar/gkn851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Hsu PW, Huang HD, Hsu SD, Lin LZ, Tsou AP, Tseng CP, Stadler PF, Washietl S, Hofacker IL. miRNAMap: genomic maps of microRNA genes and their target genes in mammalian genomes. Nucleic Acids Res. 2006;34:D135–D139. doi: 10.1093/nar/gkj135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Boguski MS, Lowe TM, Tolstoshev CM. dbEST—database for “expressed sequence tags”. Nat Genet. 1993;4:332–333. doi: 10.1038/ng0893-332. [DOI] [PubMed] [Google Scholar]
  • 24.Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, et al. NCBI GEO: archive for functional genomics data sets—10 years on. Nucleic Acids Res. 2011;39:D1005–D1010. doi: 10.1093/nar/gkq1184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.The UniProt Consortium. Ongoing and future developments at the universal protein resource. Nucleic Acids Res. 2011;39:D214–D219. doi: 10.1093/nar/gkq1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy SR, Bateman A. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 2005;33:D121–D124. doi: 10.1093/nar/gki081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Huang J, Hao P, Chen H, Hu W, Yan Q, Liu F, Han ZG. Genome-wide identification of Schistosoma japonicum microRNAs using a deep-sequencing approach. PLoS One. 2009;4:e8206. doi: 10.1371/journal.pone.0008206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hao L, Cai P, Jiang N, Wang H, Chen Q. Identification and characterization of microRNAs and endogenous siRNAs in Schistosoma japonicum. BMC Genomics. 2010;11:55. doi: 10.1186/1471-2164-11-55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Pang KC, Stephen S, Dinger ME, Engström PG, Lenhard B, Mattick JS. RNAdb 2.0—an expanded database of mammalian non-coding RNAs. Nucleic Acids Res. 2007;35:D178–D182. doi: 10.1093/nar/gkl926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lazzari B, Caprera A, Cestaro A, Merelli I, Del Corvo M, Fontana P, Milanesi L, Velasco R, Stella A. Ontology-oriented retrieval of putative microRNAs in Vitis vinifera via GrapeMiRNA: a web database of de novo predicted grape microRNAs. BMC Plant Biol. 2009;9:82. doi: 10.1186/1471-2229-9-82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wang X. miRDB: a microRNA target prediction and functional annotation database with a wiki interface. RNA. 2008;14:1012–1017. doi: 10.1261/rna.965408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Hsu SD, Lin FM, Wu WY, Liang C, Huang WC, Chan WL, Tsai WT, Chen GZ, Lee CJ, Chiu CM, et al. miRTarBase: a database curates experimentally validated microRNA-target interactions. Nucleic Acids Res. 2010;39:D163–D169. doi: 10.1093/nar/gkq1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Gustafson AM, Allen E, Givan S, Smith D, Carrington JC, Kasschau KD. ASRP: the Arabidopsis small RNA project database. Nucleic Acids Res. 2005;33:D637–D640. doi: 10.1093/nar/gki127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Ruepp A, Kowarsch A, Schmidl D, Buggenthin F, Brauner B, Dunger I, Fobo G, Frishman G, Montrone C, Theis FJ. PhenomiR: a knowledgebase for microRNA expression in diseases and biological processes. Genome Biol. 2010;11:R6. doi: 10.1186/gb-2010-11-1-r6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Yang Z, Ren F, Liu C, He S, Sun G, Gao Q, Yao L, Zhang Y, Miao R, Cao Y, et al. dbDEMC: a database of differentially expressed miRNAs in human cancers. BMC Genomics. 2010;11(Suppl. 4):S5. doi: 10.1186/1471-2164-11-S4-S5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Schmeier S, Schaefer U, MacPherson CR, Bajic VB. dPORE-miRNA: polymorphic regulation of microRNA genes. PLoS One. 2011;6:e16657. doi: 10.1371/journal.pone.0016657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hiard S, Charlier C, Coppieters W, Georges M, Baurain D. Patrocles: a database of polymorphic miRNA-mediated gene regulation in vertebrates. Nucleic Acids Res. 2010;38:D640–D651. doi: 10.1093/nar/gkp926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Zhang Y, Guan DG, Yang JH, Shao P, Zhou H, Qu LH. ncRNAimprint: a comprehensive database of mammalian imprinted non-coding RNAs. RNA. 2010;16:1889–1901. doi: 10.1261/rna.2226910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zhang BH, Pan XP, Wang QL, Cobb GP, Anderson TA. Identification and characterization of new plant microRNAs using EST analysis. Cell Res. 2005;15:336–360. doi: 10.1038/sj.cr.7290302. [DOI] [PubMed] [Google Scholar]
  • 40.Huang X, Madan A. CAP3: a DNA sequence assembly program. Genome Res. 1999;9:868–877. doi: 10.1101/gr.9.9.868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Morgulis A, Gertz EM, Schäffer AA, Agarwala R. A fast and symmetric DUST implementation to mask low-complexity DNA sequences. J. Comput. Biol. 2006;13:1028–1240. doi: 10.1089/cmb.2006.13.1028. [DOI] [PubMed] [Google Scholar]
  • 43.Hofacker IL, Fontana W, Stadler PF, Bonhoeffer S, Tacker M, Schuster P. Fast folding and comparison of RNA secondary structures. Monatshefte f. Chem. 1994;125:167–188. [Google Scholar]
  • 44.Batuwita R, Palade V. microPred: effective classification of pre-miRNAs for human miRNA gene prediction. Bioinformatics. 2009;25:989–995. doi: 10.1093/bioinformatics/btp107. [DOI] [PubMed] [Google Scholar]
  • 45.Alexiou P, Maragkakis M, Papadopoulos GL, Reczko M, Hatzigeorgiou AG. Lost in translation: an assessment and perspective for computational microRNA target identification. Bioinformatics. 2009;25:3049–3055. doi: 10.1093/bioinformatics/btp565. [DOI] [PubMed] [Google Scholar]
  • 46.Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D. Specific effects of microRNAs on the plant transcriptome. Dev. Cell. 2005;8:517–527. doi: 10.1016/j.devcel.2005.01.018. [DOI] [PubMed] [Google Scholar]
  • 47.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. Clustal W and Clustal X Version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
  • 49.Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2009;37:D26–D31. doi: 10.1093/nar/gkn723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Yang JH, Shao P, Zhou H, Chen YQ, Qu LH. deepBase: a database for deeply annotating and mining deep sequencing data. Nucleic Acids Res. 2010;38:D123–D130. doi: 10.1093/nar/gkp943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Darty K, Denise A, Ponty Y. VARNA: interactive drawing and editing of the RNA secondary structure. Bioinformatics. 2009;25:1974–1975. doi: 10.1093/bioinformatics/btp250. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES