Abstract
We describe an update of MirGeneDB, the manually curated microRNA gene database. Adhering to uniform and consistent criteria for microRNA annotation and nomenclature, we substantially expanded MirGeneDB with 30 additional species representing previously missing metazoan phyla such as sponges, jellyfish, rotifers and flatworms. MirGeneDB 2.1 now consists of 75 species spanning over ∼800 million years of animal evolution, and contains a total number of 16 670 microRNAs from 1549 families. Over 6000 microRNAs were added in this update using ∼550 datasets with ∼7.5 billion sequencing reads. By adding new phylogenetically important species, especially those relevant for the study of whole genome duplication events, and through updating evolutionary nodes of origin for many families and genes, we were able to substantially refine our nomenclature system. All changes are traceable in the specifically developed MirGeneDB version tracker. The performance of read-pages is improved and microRNA expression matrices for all tissues and species are now also downloadable. Altogether, this update represents a significant step toward a complete sampling of all major metazoan phyla, and a widely needed foundation for comparative microRNA genomics and transcriptomics studies. MirGeneDB 2.1 is part of RNAcentral and Elixir Norway, publicly and freely available at http://www.mirgenedb.org/.
INTRODUCTION
With >14 000 publications in 2020 alone, microRNAs, an important group of post-transcriptional gene regulators, continue to dominate the expanding non-coding RNA field. Despite this uninterrupted and increasing interest in microRNAs, a number of serious issues on the quality of microRNA annotations in publicly available repositories (1–14) and the reproducibility of microRNA studies, pervade the field (15). Up to two-thirds of entries for plants (8,16) and animals (10,17) were identified as false-positives, and many animal microRNA complements were found to be incompletely annotated (17). This lack of consistent annotation among microRNA complements across the animal tree of life often resulted in missing data incorrectly interpreted as secondary losses by non-experts (18), raising then questions about the validity of prior studies (19–29) that used microRNAs as phylogenetic markers to explore recalcitrant areas of the metazoan tree (30–33).
To address this and enable comparative microRNA complement analyses across organisms, we previously developed already existing annotation criteria (34) into a next generation sequencing (NGS) strategy to annotate the near-complete microRNA repertoire for any metazoan species (13). Further, we employed a uniform and consistent annotation system for microRNA nomenclature designed to reflect the evolutionary relationships between microRNA genes and family members (10). Using both these annotation and nomenclature systems, we established the microRNA gene database MirGeneDB (https://mirgenedb.org). Initially, this database contained the bona fide microRNA complements of only four species, human, mouse, chicken and zebrafish. Forty one additional species were added with the release of MirGeneDB 2.0, including numerous protostome model systems such as Drosophila and Caenorhabditis (17), as well as several new features including NGS read data representation, isoMir annotations, and known instances of 3′-monouridylation. Nearly 11 000 bona fide and consistently named microRNAs constituting 1275 microRNA families were now included into the database, allowing us to confirm the unique phylogenetic utility of microRNAs in animals given their rare losses during evolution (18). Nonetheless, several animal phyla were missing from this version including basal metazoans such as sponge and cnidarian representatives. In addition, several major clades only contained a single representative species including actinopterygian fish, amphibians, lepidosaurs, and chelicerates, greatly limiting the robustness and applicability of the database.
In order to have a truly metazoan-wide microRNA complement, and a more detailed picture of animal microRNA evolution, we present a substantial update of MirGeneDB, MirGeneDB 2.1. This release includes several new animal phyla and 30 new species, totaling now 75 metazoan representatives, and spanning more than 850 million years of animal evolution (35). This addition of phylogenetically interesting and important species allowed us to substantially improve the resolution of phylogenetic node annotation of microRNA genes and families. With >6000 new microRNA genes, for a total of 16 670 entries grouped in 1549 families, ∼150 new sequencing datasets (∼550 in total), and comprising >7.5 billion small RNA sequencing reads, this update of MirGeneDB further strengthens our database for researchers looking for high quality annotations of model and non-model animal species for developmental, homeostatic, disease, and evolutionary analyses.
EXPANSION OF MirGeneDB
Following our previously described procedure of adding new taxa to MirGeneDB (17), we analyzed more than 150 new datasets that were automatically downloaded and processed using sRNAbench (36) and miRTrace (37), respectively (Supplementary Table S1 for all ∼550 datasets used in MirGeneDB 2.1). These data, along with publicly available genome references, were then used in MirMiner (22) for the annotation of bona fide microRNA genes. In a few special cases, including coelacanth, tuatara and the nautilus, although genomic references exist (38–40), no small RNA read data are currently available, and hence the conserved microRNA repertoires of these species were determined using a classical blast approach of closely related species with default settings.
To have a truly metazoan-wide microRNA complement in MirGeneDB, we included four non-bilaterian species, the two sponges Amphimedon queenslandica and the freshwater sponge Ephydatia muelleri, as well as two cnidarians, the freshwater polyp Hydra vulgaris and the starlet sea anemone Nematostella vectensis (Figure 1, blue). With this inclusion, MirGeneDB now contains the oldest known conserved microRNAs, Nve-Mir-10, a member of the eumetazoan MIR-10 family (22,41), with an estimated age of origin likely older than ∼650 million years, and Mir-2019, a sponge-specific microRNA that is likely at least 750 million years old, only around 30 million years younger than the last common ancestor of all living animals and the oldest known eukaryotic microRNA (42).
We further included 11 additional protostome species (6 spiralians, 5 ecdysozoans) to cover more of this incredibly diverse group (Figure 1, green). For spiralian representatives, we added two new metazoan phyla with one representative each: the rotifer Brachionus plicatilis and the flatworm Schmidtea mediterranea. Further, we substantially expanded the molluscan clade by adding the four cephalopod species Nautilus pompilius, the Hawaiian bobtail squid Euprymna scolopes, the California two-spot octopus Octopus bimaculoides and the common octopus Octopus vulgaris (Zolotarov et al., in prep.). For the ecdysozoan node, we added five new arthropod species. We included the Atlantic horseshoe crab Limulus polyphemus as well as the Arizona bark scorpion Centruroides sculpturatus as two new chelicerate representatives to better characterize the whole genome duplications events found in these lineages (43). We also included the crustacean model system Daphnia magna along with two additional Drosophila species, D. simulans and D. yakuba.
Finally, we added 15 new deuterostome species (Figure 1, red) including a new metazoan clade, the Xenoturbellida Xenoturbella bocki (Schiffer et al., in prep), a second species of the cephalochordates, the European lancelet Branchiostoma lanceolatum, and representatives of the cyclostomes (i.e. jawless vertebrates), the hagfish Eptatretus burgeri and the lamprey Petromyzon marinus (Pascual-Anaya et al., in prep). We further added a second shark species, the Australian ghostshark Callorhinchus milii, and the first representative of the Holostei, the gar Lepisosteus oculatus. Furthermore, we added three additional teleost fish, the pufferfish Tetraodon nigroviridis, the Atlantic cod Gadus morhua, and the Asian swamp eel Monopterus albus. We also added the coelacanth Latimeria chalumnae, an important species to understand tetrapod evolution. For tetrapods, we added altogether five new species, two amphibians, including the frog model system Xenopus laevis and the caecilian Microcaecilia unicolor, and three diapsid representatives, including the Burmese python Python bivittatus, Schlegel's Japanese gecko Gekko japonicus and the tuatara Sphenodon punctatus.
The newly included microRNAs are primarily canonical microRNAs, some with 3′-monouridylation including LET-7-P2 members (‘Group 1’ and ‘Group 2’ of Kim et al. (44) for overview). Nonetheless, we added phylogenetically conserved mirtrons, including the placental-specific Mir-877 and the Caenorhabditis-specific Mir-62, as well as the Drosha-independent 5' capped microRNAs (‘Group 3’ of Kim et al. 2016) of the MIR-320 family, and the DGCR8 exon-encoded MIR-1306. These non-canonical microRNA genes join the only non-canonical microRNA family MIR-451 (‘Group 4’, DICER-independent) that was already included in MirGeneDB 2.0. All non-canonical microRNAs are indicated as such in the ‘Comments’ row.
Despite the additions of these new non-canonical microRNAs, and a complete re-curation of the previously existing microRNA repertoire, the microRNA complements have hardly changed in terms of gene-number (310 previously false negatives were added to and 169 (1.55%) false-positives were removed from the altogether nearly 11 000 genes of MirGeneDB 2.0, see Supplementary Table S2). Therefore, following Bartel (45), most microRNA complements, especially in the case of vertebrates and human in particular, are essentially complete.
UPDATED NOMENCLATURE IN GNATHOSTOMES AND MirGeneDB-Tracker
Given the inclusion of so many new taxa in MirGeneDB 2.1, we were able to more precisely identify the nodes of origin for numerous microRNA families and genes. This did not affect their nomenclature, only the assigned phylogenetic origin. However, taking advantage of recent insights into the whole genome duplication (WGD) events early in gnathostome history, numerous name changes were also made to reflect the origin of suites of microRNA genes (43). As detailed by Simakov et al. (46), vertebrates underwent a single WGD sometime after the split from urochordates, but before the vertebrate last common ancestor (LCA). Then, sometime after this LCA, but before the gnathostome LCA, this genome duplicated again through the hybridization of two species’ genomes. Thus, the early gnathostome genome consisted of four separate paralogous regions or paralogons, each housing a portion of the early gnathostome microRNA repertoire (Figure 2A).
Taking advantage of this historical insight, gnathostome microRNA gene names now reflect the chromosomal history of that particular syntenic region, and thus all genes are consistently named. This nomenclature system is shown in detail for the Mir-17∼92 cluster (Figure 2B). This cluster originally contained 8 genes generated through tandem gene duplication of three distinct families, MIR-17, MIR-19 and MIR-92. The first WGD event resulted in two copies of this cluster followed by the loss of Mir-17-P3a/c on cluster 1, and Mir-19-P1b/d on cluster 2. Then, there was a speciation event resulting in two lineages, what Simakov named α and β. Although there were no losses of any microRNAs in either cluster in the α lineage, the β lineage lost both Mir-17-P2d and Mir-17-P3d. Then, representatives of these two lineages hybridized, bringing together these distinct lineages into a new, now octaploid genome with four separate paralogous regions. After the gnathostome LCA, both the chondrichthyan lineage, as well as the osteichthyan lineage, experienced independent losses, including the entire loss of the 2α cluster in bony fishes. Notice how the new nomenclature system helps not only identity homologous genes with homologous duplication histories in these two taxa (e.g. Mir-17-P1a), but also allows the user to easily identify missing genes like the 2α cluster in bony fishes.
Because originally MirGeneDB identifiers largely just followed miRBase gene names, numerous changes were made in order to go from an effectively random nomenclature system to one that actually tracks the evolutionary history of each microRNA gene generated by these two WGD events. In order to be able to track those name-changes, we have developed a downloadable MirGeneDB version tracker (‘MirGeneDB-tracker’) that will help the user to see the difference between MirGeneDB releases such as name changes, new species, new genes, as well as gene deletions (see Supplementary Table S2).
IMPROVED WEB-INTERFACE
Gene-pages
To fit new species, families, genes and read data, while at the same time reducing the computational footprint of our webpage, we included a read page overview page where all data is shown in a summary representation, without resolving individual samples. In a newly developed drop-down menu, these files can be selected independently, but also, if the user prefers, in the classical and computationally more intensive representation.
Browse and download pages
Navigating through sites with many genes or species, respectively, is now simpler as headers are now frozen when scrolling down through the use of style sheets. Similar to the browse section, rows are now also highlighted in the download-section of the database.
Count matrix-download
Previously in MirGeneDB 2.0, we had introduced a visual representation of a normalized expression heatmap (RPM) of all samples on our browse-section for each species. This was a popular feature, but we did not provide a downloadable version that some users requested. Therefore, for this release, we provide automatically generated csv-formatted versions of these matrices that are downloadable in the browse & download sections, respectively.
Information
We have substantially update the information on “Unique structural features of microRNAs” that are the foundation of our annotation system and the corresponding references. MirGeneDB-tracker file can be downloaded on this page. As before, we provide list of false negative accessions, i.e. microRNAs that we detected in read data, but could not locate in either genome-assembly or genomic traces.
FUTURE DEVELOPMENTS
The establishment of MirGeneDB represents a stable and robust foundation for reproducible microRNA research that overcame a range of curation problems in the microRNA field (10,14,17). With the current update, studies wishing to employ metazoan wide comparative analyses to explore the roles of microRNAs in development and disease (47–49), as well as the evolution of microRNAs and animals themselves (30,50–53), have a much larger range of species available, more easily accessible and with more comprehensive datasets for each species at hand. Our short-term goal will be to focus on specific clades that are either currently not annotated at all, clades that are still poorly represented, or clades for which microRNAs might contribute relevant data for outstanding biological questions. For such groups, we will continue to curate all publicly available data, but we will also generate substantial data ourselves within the MIRevolution framework. We are also looking into the possibilities to incorporate large scale comparative efforts, such as the recent work on all hexapod clades by Ma et al. (54) and large genome sequencing initiatives, into our database. The naming of novel microRNAs might become a priority for the next major release depending on the input by the community. Eventually we hope to have curated representatives of all major metazoan clades including multiple species, along with a large number of high quality and low bias datasets from a comprehensive set of organs, tissues, cell types and developmental time points.
DATA AVAILABILITY
All MirGeneDB data are publicly and freely available under the Creative Commons Zero license. MirGeneDB is part of FAIRsharing.org (55,56). Data is available for bulk download from http://mirgenedb.org/download. Feedback on any aspect of the MirGeneDB database is welcome by email to Bastian.Fromm@uit.no or Kevin.J.Peterson@dartmouth.edu, or via Twitter (@MirGeneDB).
Supplementary Material
ACKNOWLEDGEMENTS
The authors wish to thank Claudia Kutter for helpful comments and suggestions. The RNAcentral consortium (57) and Elixir Norway are acknowledged for including MirGeneDB into their service portfolio.
Contributor Information
Bastian Fromm, The Arctic University Museum of Norway, UiT- The Arctic University of Norway, Tromsø, Norway; Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden.
Eirik Høye, Department of Tumor Biology, Institute for Cancer Research, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway; Institute of Clinical Medicine, University of Oslo, Oslo, Norway.
Diana Domanska, Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway; Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo, Norway.
Xiangfu Zhong, Department of Biosciences and Nutrition, Karolinska Institute, Huddinge, Sweden.
Ernesto Aparicio-Puerta, Department of Genetics, Faculty of Sciences, MNAT Excellence Unit, University of Granada, Granada, Spain; Biotechnology Institute, CIBM, Granada, Spain; Biohealth Research Institute (ibs.GRANADA), University Hospitals of Granada, University of Granada, Granada, Spain.
Vladimir Ovchinnikov, Computational and Molecular Evolutionary Biology Research Group, School of life sciences, Faculty of Medicine and Health Sciences, University of Nottingham, Nottingham, UK.
Sinan U Umu, Department of Research, Cancer Registry of Norway, Oslo, Norway.
Peter J Chabot, Department of Biological Sciences, Dartmouth College, Hanover, USA.
Wenjing Kang, Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden; Science for Life Laboratory, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Solna, Sweden.
Morteza Aslanzadeh, Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden.
Marcel Tarbier, Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden; Science for Life Laboratory, Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Solna, Sweden.
Emilio Mármol-Sánchez, Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden; Centre for Palaeogenetics, Stockholm, Sweden.
Gianvito Urgese, Politecnico di Torino, Torino, Italy.
Morten Johansen, Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway.
Eivind Hovig, Department of Tumor Biology, Institute for Cancer Research, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway; Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway.
Michael Hackenberg, Department of Genetics, Faculty of Sciences, MNAT Excellence Unit, University of Granada, Granada, Spain; Biotechnology Institute, CIBM, Granada, Spain; Biohealth Research Institute (ibs.GRANADA), University Hospitals of Granada, University of Granada, Granada, Spain.
Marc R Friedländer, Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden.
Kevin J Peterson, Department of Biological Sciences, Dartmouth College, Hanover, USA.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
B.F. is supported by the Tromsø forskningsstiftelse (TFS) [20_SG_BF ‘MIRevolution’]; Strategic Research Area (SFO) program of the Swedish Research Council (to V.R.) through Stockholm University (to B.F., W.K., E.M.-S. and M.R.F.); M.R.F. is additionally supported by ERC [758397 ‘miRCell’]; South-Eastern Norway Regional Health Authority support is acknowledged [2018014 to E.H.]; P.J. Chabot is supported by the Junior Scholars Program (Dartmouth College); V.O.’s research funding was awarded to Dr Mary J. O’Connell (Associate Professor) from the School of Life Sciences University of Nottingham; M.H. is supported by the Spanish Government [AGL2017-88702-C2-2-R]; University of Granada [A-BIO-481-UGR18, FEDER 18]; K.J.P. has been supported by the National Science Foundation; NASA Ames; Dartmouth College. Funding for open access charge: Stockholm University.
Conflict of interest statement. None declared.
REFERENCES
- 1. Castellano L., Stebbing J.. Deep sequencing of small RNAs identifies canonical and non-canonical miRNA and endogenous siRNAs in mammalian somatic tissues. Nucleic Acids Res. 2013; 41:3339–3351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Chiang H.R., Schoenfeld L.W., Ruby J.G., Auyeung V.C., Spies N., Baek D., Johnston W.K., Russ C., Luo S., Babiarz J.E.et al.. Mammalian microRNAs: experimental evaluation of novel and previously annotated genes. Genes Dev. 2010; 24:992–1009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Jones-Rhoades M.W. Conservation and divergence in plant microRNAs. Plant Mol. Biol. 2012; 80:3–16. [DOI] [PubMed] [Google Scholar]
- 4. Ludwig N., Becker M., Schumann T., Speer T., Fehlmann T., Keller A., Meese E.. Bias in recent miRBase annotations potentially associated with RNA quality issues. Sci. Rep. 2017; 7:5162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Langenberger D., Bartschat S., Hertel J., Hoffmann S., Tafer H., Stadler P.F.. MicroRNA or Not MicroRNA?. Advances in Bioinformatics and Computational Biology. 2011; Springer Berlin Heidelberg; 1–9. [Google Scholar]
- 6. Meng Y., Shao C., Wang H., Chen M.. Are all the miRBase-registered microRNAs true? A structure- and expression-based re-examination in plants. RNA Biol. 2012; 9:249–253. [DOI] [PubMed] [Google Scholar]
- 7. Tarver J.E., Donoghue P.C., Peterson K.J.. Do miRNAs have a deep evolutionary history?. Bioessays. 2012; 34:857–866. [DOI] [PubMed] [Google Scholar]
- 8. Taylor R.S., Tarver J.E., Hiscock S.J., Donoghue P.C.. Evolutionary history of plant microRNAs. Trends Plant Sci. 2014; 19:175–182. [DOI] [PubMed] [Google Scholar]
- 9. Wang X., Liu X.S.. Systematic curation of miRBase annotation using integrated small RNA high-throughput sequencing data for C. elegans and drosophila. Front. Genet. 2011; 2:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Fromm B., Billipp T., Peck L.E., Johansen M., Tarver J.E., King B.L., Newcomb J.M., Sempere L.F., Flatmark K., Hovig E.et al.. A uniform system for the annotation of vertebrate microRNA genes and the evolution of the human microRNAome. Annu. Rev. Genet. 2015; 49:213–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Axtell M.J., Meyers B.C.. Revisiting criteria for plant MicroRNA annotation in the era of big data. Plant Cell. 2018; 30:272–284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Guo Z., Kuang Z., Wang Y., Zhao Y., Tao Y., Cheng C., Yang J., Lu X., Hao C., Wang T.et al.. PmiREN: a comprehensive encyclopedia of plant miRNAs. Nucleic Acids Res. 2020; 48:D1114–D1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Fromm B., Domanska D., Høye E., Ovchinnikov V., Kang W., Aparicio-Puerta E., Johansen M., Flatmark K., Mathelier A., Hovig E.et al.. MirGeneDB 2.0: the metazoan microRNA complement. Nucleic Acids Res. 2020; 48:D1172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Fromm B., Keller A., Yang X., Friedlander M.R., Peterson K.J., Griffiths-Jones S.. Quo vadis microRNAs?. Trends Genet. 2020; 36:461–463. [DOI] [PubMed] [Google Scholar]
- 15. Witwer K.W., Halushka M.K.. Toward the promise of microRNAs - enhancing reproducibility and rigor in microRNA research. RNA Biol. 2016; 13:1103–1116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Taylor R.S., Tarver J.E., Foroozani A., Donoghue P.C.J.. MicroRNA annotation of plant genomes- Do it right or not at all. Bioessays. 2017; 39:1600113. [DOI] [PubMed] [Google Scholar]
- 17. Fromm B., Domanska D., Høye E., Ovchinnikov V., Kang W., Aparicio-Puerta E., Johansen M., Flatmark K., Mathelier A., Hovig E.et al.. MirGeneDB 2.0: the metazoan microRNA complement. Nucleic Acids Res. 2020; 48:D132–D141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Tarver J.E., Taylor R.S., Puttick M.N., Lloyd G.T., Pett W., Fromm B., Schirrmeister B.E., Pisani D., Peterson K.J., Donoghue P.C.J.. Well-annotated microRNAomes do not evidence pervasive miRNA loss. Genome Biol. Evol. 2018; 10:1457–1470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Hertel J., Lindemeyer M., Missal K., Fried C., Tanzer A., Flamm C., Hofacker I.L., Stadler P.F.Students of Bioinformatics Computer Labs 2004 and 2005 . The expansion of the metazoan microRNA repertoire. BMC Genomics. 2006; 7:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Sempere L.F., Cole C.N., McPeek M.A., Peterson K.J.. The phylogenetic distribution of metazoan microRNAs: insights into evolutionary complexity and constraint. J. Exp. Zool. B Mol. Dev. Evol. 2006; 306:575–588. [DOI] [PubMed] [Google Scholar]
- 21. Heimberg A.M., Sempere L.F., Moy V.N., Donoghue P.C., Peterson K.J.. MicroRNAs and the advent of vertebrate morphological complexity. Proc. Natl. Acad. Sci. U.S.A. 2008; 105:2946–2950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Wheeler B.M., Heimberg A.M., Moy V.N., Sperling E.A., Holstein T.W., Heber S., Peterson K.J.. The deep evolution of metazoan microRNAs. Evol. Dev. 2009; 11:50–68. [DOI] [PubMed] [Google Scholar]
- 23. Sperling E.A., Vinther J., Moy V.N., Wheeler B.M., Semon M., Briggs D.E., Peterson K.J.. MicroRNAs resolve an apparent conflict between annelid systematics and their fossil record. Proc. Biol. Sci. 2009; 276:4315–4322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Heimberg A.M., Cowper-Sal-lari R., Semon M., Donoghue P.C., Peterson K.J.. microRNAs reveal the interrelationships of hagfish, lampreys, and gnathostomes and the nature of the ancestral vertebrate. Proc. Natl. Acad. Sci. U.S.A. 2010; 107:19379–19383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Philippe H., Brinkmann H., Copley R.R., Moroz L.L., Nakano H., Poustka A.J., Wallberg A., Peterson K.J., Telford M.J.. Acoelomorph flatworms are deuterostomes related to Xenoturbella. Nature. 2011; 470:255–258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Helm C., Bernhart S.H., Höner zu Siederdissen C., Nickel B., Bleidorn C.. Deep sequencing of small RNAs confirms an annelid affinity of Myzostomida. Mol. Phylogenet. Evol. 2012; 64:198–203. [DOI] [PubMed] [Google Scholar]
- 27. Lyson T.R., Sperling E.A., Heimberg A.M., Gauthier J.A., King B.L., Peterson K.J.. MicroRNAs support a turtle + lizard clade. Biol. Lett. 2012; 8:104–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Fromm B., Worren M.M., Hahn C., Hovig E., Bachmann L.. Substantial loss of conserved and gain of novel MicroRNA families in flatworms. Mol. Biol. Evol. 2013; 30:2619–2628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Field D.J., Gauthier J.A., King B.L., Pisani D., Lyson T.R., Peterson K.J.. Toward consilience in reptile phylogeny: miRNAs support an archosaur, not lepidosaur, affinity for turtles. Evol. Dev. 2014; 16:189–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Thomson R.C., Plachetzki D.C., Mahler D.L., Moore B.R.. A critical appraisal of the use of microRNA data in phylogenetics. Proc. Natl. Acad. Sci. U.S.A. 2014; 111:E3659–E3668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Dunn C.W. Reconsidering the phylogenetic utility of miRNA in animals. Proc. Natl. Acad. Sci. U.S.A. 2014; 111:12576–12577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Dunn C.W., Giribet G., Edgecombe G.D., Hejnol A.. Animal phylogeny and its evolutionary implications. Annu. Rev. Ecol. Evol. Syst. 2014; 45:371–395. [Google Scholar]
- 33. Cannon J.T., Kocot K.M., Waits D.S., Weese D.A., Swalla B.J., Santos S.R., Halanych K.M.. Phylogenomic resolution of the hemichordate and echinoderm clade. Curr. Biol. 2014; 24:2827–2832. [DOI] [PubMed] [Google Scholar]
- 34. Ambros V., Bartel B., Bartel D.P., Burge C.B., Carrington J.C., Chen X., Dreyfuss G., Eddy S.R., Griffiths-Jones S., Marshall M.et al.. A uniform system for microRNA annotation. RNA. 2003; 9:277–279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Erwin D.H., Laflamme M., Tweedt S.M., Sperling E.A., Pisani D., Peterson K.J.. The Cambrian conundrum: early divergence and later ecological success in the early history of animals. Science. 2011; 334:1091–1097. [DOI] [PubMed] [Google Scholar]
- 36. Aparicio-Puerta E., Lebrón R., Rueda A., Gómez-Martín C., Giannoukakos S., Jaspez D., Medina J.M., Zubkovic A., Jurak I., Fromm B.et al.. sRNAbench and sRNAtoolbox 2019: intuitive fast small RNA profiling and differential expression. Nucleic Acids Res. 2019; 47:W530–W535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Kang W., Eldfjell Y., Fromm B., Estivill X., Biryukova I., Friedländer M.R.. miRTrace reveals the organismal origins of microRNA sequencing data. Genome Biol. 2018; 19:213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Gemmell N.J., Rutherford K., Prost S., Tollis M., Winter D., Macey J.R., Adelson D.L., Suh A., Bertozzi T., Grau J.H.et al.. The tuatara genome reveals ancient features of amniote evolution. Nature. 2020; 584:403–409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Amemiya C.T., Alföldi J., Lee A.P., Fan S., Philippe H., MacCallum I., Braasch I., Manousaki T., Schneider I., Rohner N.et al.. The African coelacanth genome provides insights into tetrapod evolution. Nature. 2013; 496:311–316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Zhang Y., Mao F., Mu H., Huang M., Bao Y., Wang L., Wong N.-K., Xiao S., Dai H., Xiang Z.et al.. The genome of Nautilus pompilius illuminates eye evolution and biomineralization. Nat. Ecol. Evol. 2021; 5:927–938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Grimson A., Srivastava M., Fahey B., Woodcroft B.J., Chiang H.R., King N., Degnan B.M., Rokhsar D.S., Bartel D.P.. Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals. Nature. 2008; 455:1193–1197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Sperling E.A., Robinson J.M., Pisani D., Peterson K.J.. Where's the glass? Biomarkers, molecular clocks, and microRNAs suggest a 200-Myr missing Precambrian fossil record of siliceous sponge spicules. Geobiology. 2010; 8:24–36. [DOI] [PubMed] [Google Scholar]
- 43. Peterson K.J., Beavan A., Chabot P., McPeek M.L., Pisani D., Fromm B., Simakov O.. microRNAs as indicators into the causes and consequences of whole genome duplication events. 2021; bioRxiv doi:02 September 2021, preprint: not peer reviewed 10.1101/2021.09.01.458616. [DOI] [PMC free article] [PubMed]
- 44. Kim Y.-K., Kim B., Kim V.N.. Re-evaluation of the roles of DROSHA, Export in 5, and DICER in microRNA biogenesis. Proc. Natl. Acad. Sci. U.S.A. 2016; 113:E1881–E1889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Bartel D.P. Metazoan MicroRNAs. Cell. 2018; 173:20–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Simakov O., Marlétaz F., Yue J.-X., O’Connell B., Jenkins J., Brandt A., Calef R., Tung C.-H., Huang T.-K., Schmutz J.et al.. Deeply conserved synteny resolves early events in vertebrate evolution. Nat. Ecol. Evol. 2020; 4:820–830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Giraldez A.J., Cinalli R.M., Glasner M.E., Enright A.J., Thomson J.M., Baskerville S., Hammond S.M., Bartel D.P., Schier A.F.. MicroRNAs regulate brain morphogenesis in zebrafish. Science. 2005; 308:833–838. [DOI] [PubMed] [Google Scholar]
- 48. Song J.L., Stoeckius M., Maaskola J., Friedländer M., Stepicheva N., Juliano C., Lebedeva S., Thompson W., Rajewsky N., Wessel G.M.. Select microRNAs are essential for early development in the sea urchin. Dev. Biol. 2012; 362:104–113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. DeVeale B., Swindlehurst-Chan J., Blelloch R.. The roles of microRNAs in mouse development. Nat. Rev. Genet. 2021; 22:307–323. [DOI] [PubMed] [Google Scholar]
- 50. Sarkies P., Selkirk M.E., Jones J.T., Blok V., Boothby T., Goldstein B., Hanelt B., Ardila-Garcia A., Fast N.M., Schiffer P.M.et al.. Ancient and novel small RNA pathways compensate for the loss of piRNAs in multiple independent nematode lineages. PLoS Biol. 2015; 13:e1002061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Leite D.J., Ninova M., Hilbrant M., Arif S., Griffiths-Jones S., Ronshaugen M., McGregor A.P.. Pervasive microRNA duplication in chelicerates: insights from the embryonic microRNA repertoire of the spider Parasteatoda tepidariorum. Genome Biol. Evol. 2016; 8:2133–2144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Desvignes T., Sydes J., Montfort J., Bobe J., Postlethwait J.H.. Evolution after whole-genome duplication: teleost MicroRNAs. Mol. Biol. Evol. 2021; 38:3308–3331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Lewis S.H., Quarles K.A., Yang Y., Tanguy M., Frézal L., Smith S.A., Sharma P.P., Cordaux R., Gilbert C., Giraud I.et al.. Pan-arthropod analysis reveals somatic piRNAs as an ancestral defence against transposable elements. Nat. Ecol. Evol. 2018; 2:174–181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Ma X., He K., Shi Z., Li M., Li F., Chen X.-X.. Large-scale annotation and evolution analysis of MiRNA in insects. Genome Biol. Evol. 2021; 13:evab083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. FAIRsharing Team FAIRsharing record for: MirGeneDB.org. 2018; 10.25504/FAIRSHARING.QXSGVF. [DOI] [Google Scholar]
- 56. Sansone S.-A., McQuilton P., Rocca-Serra P., Gonzalez-Beltran A., Izzo M., Lister A.L., Thurston M.FAIRsharing Community. . FAIRsharing as a community approach to standards, repositories and policies. Nat. Biotechnol. 2019; 37:358–367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. RNAcentral Consortium RNAcentral 2021: secondary structure integration, improved sequence search and new member databases. Nucleic Acids Res. 2021; 49:D212–D220. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All MirGeneDB data are publicly and freely available under the Creative Commons Zero license. MirGeneDB is part of FAIRsharing.org (55,56). Data is available for bulk download from http://mirgenedb.org/download. Feedback on any aspect of the MirGeneDB database is welcome by email to Bastian.Fromm@uit.no or Kevin.J.Peterson@dartmouth.edu, or via Twitter (@MirGeneDB).