Abstract
Small non-coding RNAs include powerful regulators of gene expression, transposon mobility and virus activity. Among the various categories, mature microRNAs (miRNAs) guide the translational repression and decay of several targeted mRNAs. The biogenesis of miRNAs depends on few gene products, essentially conserved from basal to higher metazoans, whose protein domains allow specific interactions with dsRNA. Here, we report the identification of key genes responsible of the miRNA biogenesis in 32 bivalves, with particular attention to the aquaculture species Mytilus galloprovincialis and Crassostrea gigas. In detail, we have identified and phylogenetically compared eight evolutionary conserved proteins: DROSHA, DGCR8, EXP5, RAN, DICER TARBP2, AGO and PIWI. In mussels, we recognized several other proteins participating in the miRNA biogenesis or in the subsequent RNA silencing. According to digital expression analysis, these genes display low and not inducible expression levels in adult mussels and oysters whereas they are considerably expressed during development. As miRNAs play an important role also in the antiviral responses, knowledge on their production and regulative effects can shed light on essential molecular processes and provide new hints for disease prevention in bivalves.
Keywords: miRNA biogenesis, Mytilus galloprovincialis, Crassostrea gigas, Bivalves, RNAi
Introduction
Different types of non-coding RNAs (ncRNAs) have gained attention for their powerful regulatory action on eukaryotic genes and other genetic elements (Carninci et al., 2005; Mortimer, Kidwell & Doudna, 2014). The process known as RNA interference (RNAi) exemplifies an evolutionary conserved mechanism of gene silencing based on small guide RNAs and specific interacting proteins (Tomoyasu et al., 2008; Gammon & Mello, 2015). Silencing RNAs (siRNAs) and microRNAs (miRNAs) take part to the same control machinery whereas Piwi-interacting RNAs (piRNAs) peculiarly silence germ-line transposons, among other roles (Théron et al., 2014; Iwasaki, Siomi & Siomi, 2015). Long noncoding RNAs (lncRNAs) in their normal or mutated forms can widely influence physiological and pathological processes, as multiple lines of evidence indicate their involvement in chromosome inactivation and epigenetic modifications, control of mRNA decay and translation, and DNA sequestration of transcription factors (Huarte, 2015; Ruan, 2015). More recently, circular RNAs have been identified as a group of competing endogenous RNAs whose effects in the miRNA function and transcriptional/post-transcriptional regulation are now matter of study (Qu et al., 2015).
miRNAs are single-stranded RNA molecules of around 22 nucleotides, presenting conserved structural features and able to modulate the expression of eukaryotic genes by inhibition of mRNA translation or enhancement of mRNA decay (Ambros, 2003; Bartel, 2004; Tarver, Donoghue & Peterson, 2012). Up to now, diversified sets of miRNAs have been detected in five eukaryotic taxa (eumetazoans, silicisponges, vascular plants, Clamydomonas and Ectocarpus spp.) while they are apparently absent in protists (Grimson et al., 2008; Tarver et al., 2015). Depending on the annotation procedure, the number of human miRNAs varies from 523 to 1,881 miRNA precursors, as reported in MirGeneDB (Fromm et al., 2015) or in miRBase v. 21 (Kozomara, Griffiths & Jones, 2014), respectively. Overall, human miRNAs could target 30–60% of the transcribed genes (John et al., 2004; Sand et al., 2012), with implications in cell differentiation (Berezikov et al., 2005), cell death (Xu et al., 2015), stress responses (Mendell & Olson, 2012) and diseases (Huang et al., 2014; Min & Chan, 2015).
The miRNA biogenesis starts from pri-miRNA transcripts, mostly generated from RNA polymerase II in form of long non-coding RNAs and able to form a hairpin subsequently recognized by the so called microprocessor complex. DROSHA, a double-stranded RNA-specific ribonuclease III, and the RNA binding protein Di-George syndrome Critical Region gene 8 (DGCR8) are the microprocessor’s core proteins which allow interactions with the DDX5 helicase, the RNA binding protein Lin-28 and hnRNP A1, among other elements (Jean-Philippe, Paz & Caputi, 2013; Hong et al., 2013). During the recognition of pri-miRNAs at the dsRNA-ssRNA junction, DGCR8 acts as a crucial molecular anchor and directs DROSHA to cleave 11 bp away from the junction, with consequent release of hairpin-shaped pre-miRNAs (Denli et al., 2004). Pre-miRNAs are firstly exported to the cytoplasm via the Exportin5 (XPO5) by interaction with the small GTPase RAN; then, they are further processed by the RISC loading complex, composed by the endoribonuclease DICER, the RNA binding protein TARBP2 and Argonaute proteins (MacRae et al., 2008; Miyoshi et al., 2009). The evolutionary conserved Argonaute proteins are specialized in binding small RNAs and exist in several isoforms, with AGO and PIWI representing two distinct subclades (Tolia & JoshuaTor, 2007; Ender & Meister, 2010).
AGOs select the ‘guide’ miRNA strand necessary for targeted gene silencing and, therefore, are responsible for final miRNA maturation. Several other proteins have been demonstrated to cooperate in miRNA processing and functions (Ender & Meister, 2010). In fact, AGOs operate transcriptional repression and cause mRNA decay by interacting with the GW-rich N-terminal region of GW182, a protein associated with cellular P-bodies (Van Kouwenhove, Kedde & Agami, 2011). Other proteins involved in the mRNA turnover (CAF1, PABPC1, eIF4G; CCR4-NOT and PAN2-PAN3 deadenylation complexes; in human somatic cells, also the decapping complex DCP1-DCP2 and at least four helicases, DDX5, DDX6, DDX17 and DDX42) may cooperate with the AGO-GW182 complex to reduce the mRNA translation efficiency (Nottrott, Simard & Richter, 2006; Fabian & Sonenberg, 2012).
Unlike AGOs, the PIWI proteins specifically interact with piRNAs to participate in the germline specification, gametogenesis, transposon silencing and in the maintenance of genome integrity (Carmell et al., 2007; Malone & Hannon, 2009; Ghildiyal & Zamore, 2009; Siomi et al., 2011). The piRNA mechanism of action is not so well defined but probably it involves the arginine methyl-transferase PRMT5, tudor domain-containing proteins (TDRDs) and the Maelstrom protein (MAEL) (Sokolova et al., 2011).
With the widespread and cost-effective use of Next Generation Sequencing (NGS) technologies, miRNAs have been deeply explored in non-model organisms, including bacteria (Xu et al., 2014), plants (Rhee, Chae & Kim, 2015) and viruses (Kincaid & Sullivan, 2012; Diebel et al., 2015). The basic set of genes involved in the miRNA biogenesis, and related protein interactions, are well known in mammals (Lau & MacRae, 2009), and also in other metazoans like Cnidaria (Moran et al., 2013), Platyhelminthes (Resch & Palakodeti, 2012) and insects (Lucas & Raikhel, 2013; Hussain & Asgari, 2014). Regarding mollusks, lists of miRNAs have been reported for a few species (Jiao et al., 2014; Chen et al., 2014; Martín-Gómez et al., 2014; Zhou et al., 2014), miRNA families have been investigated in the limpet genome (Kenny et al., 2015) and one study has considered bivalve DICER sequences for phylogenetic analysis (Gao et al., 2014). A general overview on the bivalve miRNA biogenesis complements is still lacking, so we took advantage of several genomic and transcriptomic datasets available for Lophotrochozoa (GIGA Community of Scientists, 2014) to identify and characterize the core elements involved in the miRNA formation pathway in Mytilus and Crassostrea spp. and other bivalves.
Materials & Methods
Sequences coding for proteins centrally involved in the miRNA pathway, namely DROSHA, DGCR8, XPO5, RAN, DICER, TARBP2, AGO and PIWI, have been methodically identified in the genomes and transcriptomes of M. galloprovincialis (Mg) and C. gigas (Cg) as well as in other bivalve and non-bivalve species (66 species, listed in Table 1).
Table 1. Organisms included in the present work.
Phylum | Species | Sequence origin | Ref | Used ID | DROSHA | DGCR8 | XPO5 | RAN | DICER | TARBP2 | AGO or PIWI |
---|---|---|---|---|---|---|---|---|---|---|---|
Ctenophora | Pleurobrachia bachei | G | EM | Ple_bac | No | No | 1 | No | 1 | No | 4 |
Mnemiopsis leidyi | G | EM | Mne_ lei | No | No | 1 | 1 | 1 | No | 4 | |
Porifera | Amphimedon queenslandica | G | EM | Aq | No | 1 | 1 | 1 | 2 | No | 2 |
Placozoa | Trichoplax adhaerens | G | EM | Tri_ adh | No | No | 1 | 1 | 2 | No | 1 |
Cnidaria | Nematostella vectensis | G | EM | Nvec | 1 | 1 | 1 | 1 | 2 | No | 4 |
Porites australiensis | T | TSA | Por_aus | 1 | 1 | 1 | 1 | 1 | No | 6 | |
Anthopleura elegantissima | T | TSA | Ant_ ele | 1 | 1 | 1 | 1 | 1 | No | 6 | |
Nematoda | Caenorhabditis elegans | G | M | Ce | 1 | 1 | 1 | 1 | 1 | 23 | |
Arthropoda | Daphnia pulex | G | EM | Dap_ pul | 1 | 1 | 1 | 1 | 2 | 1 | 10 |
Culex quinquefasciatus | G | EM | Cq | 1 | 1 | 1 | 1 | 2 | 1 | 4 | |
Drosophila melanogaster | G | EM | Dm | 1 | 1 | 1 | 1 | 2 | 1 | 4 | |
Nasonia vitripennis | G | M | Nv | 1 | 1 | 1 | 1 | 2 | 1 | 4 | |
Tribolium castaneum | G | M | Tc | 1 | 1 | 1 | 1 | 2 | 1 | 4 | |
Apis mellifera | G | EM | Am | 1 | 1 | 1 | 1 | 2 | 1 | 4 | |
Lasioglossum albipes | G | K | La | 1 | 1 | 1 | 1 | 2 | 1 | 4 | |
Acyrthosiphon pisum | G | A | Ap | 1 | 1 | 1 | 1 | 2 | 1 | 15 | |
Platyhelmintes | Schistosoma mansoni | G | GD | Sch_ man | 1 | 1 | 2 | 1 | 2 | 1 | 3 |
Schmidtea mediterranea | G | SG | Sch_med | 1 | 1 | 1 | 2 | 1 | 4 | ||
Rotifera | Adineta vaga | G | V | Adi_vag | 1 | 1 | 1 | 1 | 1 | 1 | 4 |
Brachiopoda | Lingula anatina | G | L | Lin_ ana | 1 | 1 | 1 | 1 | 1 | 3 | |
Annelida | Capitella telata | G | EM | Ct | 1 | 1 | 1 | 1 | 1 | 1 | 3 |
Helobdella robusta | G | EM | Hel_ rob | 1 | 1 | 1 | 1 | 1 | 1 | 4 | |
Cephalopoda | Octopus bimaculoides | G | M | Oct_ bim | 1 | 1 | 1 | 1 | 1 | 1 | 4 |
Mollusca | Aplysia californica | G | B | Ac | 1 | 1 | 1 | 3 | 1 | 1 | 4 |
Lottia gigantea | G | M | Lg | 1 | 1 | 1 | 1 | 1 | 1 | 3 | |
Mytilus galloprovincialis | T | Local | Mg | 1 | 1 | 1 | 1 | 1 | 1 | 3 | |
Mytilus edulis | T | Local | Me | 1 | 1 | 1 | 1 | 1 | 2 | ||
Mytilus californianus | T | Local | Mc | 1 | 1 | 2 | |||||
Mytilus trossulus | T | Local | Mt | 1 | |||||||
Anadara trapezia | T | Local | At | 1 | 1 | ||||||
Tegillarca granosa | T | Local | Tg | 1 | |||||||
Bathymodiolus azoricus | T | Local | Ba | 1 | |||||||
Perna viridis | T | Local | Pv | 1 | 1 | 1 | 1 | 2 | |||
Ennucula tenuis | T | Local | Et | 1 | 1 | 1 | |||||
Crassostrea corteziensis | T | Local | Cc | 1 | 1 | 1 | 1 | 4 | |||
Crassostrea gigas | G | EM | Cg | 1 | 1 | 1 | 1 | 1 | 1 | 4 | |
Crassostrea hongkongensis | T | Local | Ch | 1 | 1 | 3 | |||||
Crassostrea virginica | T | Local | Cv | 1 | 4 | ||||||
Crassostrea angulata | T | local | Ca | 1 | 1 | 1 | 1 | 2 | |||
Ostrea chilensis | T | Local | Oc | 1 | |||||||
Ostrea edulis | T | Local | Oe | 1 | 2 | ||||||
Ostrea lurida | T | local | Ol | 1 | 1 | ||||||
Ostreola stentina | T | Local | Os | 1 | |||||||
Saccostrea glomerata | T | Local | Sg | 1 | |||||||
Argopecten irradians | T | Local | Ai | 1 | 1 | ||||||
Mizuhopecten yessoensis | T | Local | My | 1 | 1 | 1 | 1 | 2 | |||
Pecten maximus | T | Local | Pm | 1 | 2 | ||||||
Pinctada fucata | G | F | Pf | 1 | 1 | 1 | 1 | ||||
Solemya velum | T | Local | Sv | 1 | 1 | 1 | 1 | 1 | 3 | ||
Elliptio complanata | T | Local | Ec | 1 | 1 | 1 | 1 | 1 | |||
Pyganodon grandis | T | Local | Pg | 1 | 1 | 2 | |||||
Uniomerus tetralasmus | T | Local | Ut | 1 | 1 | 3 | |||||
Villosa lienosa | T | Local | Vl | 1 | 1 | ||||||
Corbicula fluminea | T | local | Cf | 1 | 1 | ||||||
Meretrix meretrix | T | local | Mm | 1 | 2 | ||||||
Ruditapes decussatus | T | local | Rd | 1 | 1 | ||||||
Ruditapes philippinarum | T | local | Rp | 1 | 1 | ||||||
Echinodermata | Strongylocentrotus purpuratus | G | M | Sp | 1 | 1 | 1 | 1 | 1 | 1 | 3 |
Hemichordata | Saccoglossus kowalevskii | G | M | Sk | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Chordata | Homo sapiens | G | M | Hs | 1 | 1 | 1 | 1 | 1 | 1 | 8 |
Ciona intestinalis | G | M | Ci | 1 | 1 | 1 | 1 | 1 | 1 | 3 | |
Branchiostoma floridae | G | M | Bf | 1 | 1 | 1 | 1 | 1 | 1 | 7 | |
Oncorhynchus mykiss | G | O | Om | 1 | 1 | 1 | 1 | 1 | 1 | 5 | |
Danio rerio | G | M | Dr | 1 | 1 | 1 | 1 | 1 | 1 | 5 | |
Streptophyta | Arabidopsis thaliana | G | P | At | No | No | 1 | 1 | 4 | No | 10 |
Populus trichocarpa | G | P | Pt | No | No | 1 | 1 | 5 | No | 11 |
Notes.
Abbreviations:
- A
- Aphidbase
- B
- broadinstitute.org/ftp/pub/assemblies/invertebrates/aplysia/
- EM
- Ensambl Metazoa v.29
- F
- Takeuchi et al. (2012) DNA Res. 19(2): 117–130;
- G
- Genome
- GD
- GeneDB
- K
- Kocher et al. (2013) Genome Biology 14 (12): R142
- L
- Lou et al. (2015) Nat Commun. 8; 6:8301
- M
- Metazome v3.0
- O
- Berthelot et al. (2014) Nat Commun. 22; 5: 3657
- P
- Phytozome 11
- SG
- SmedGD
- T
- Transcriptome
- TSA
- NCBI Transcriptome shotgun assembly
- V
- Genoscope
Sequence retrieval and analysis
The Mg WGS project (ID APJB000000000.1(Nguyen, Hayes & Ingram, 2014)) and the Cg genome draft (GCA_000297895 (Zhang et al., 2012)) were retrieved from GenBank, whereas the oyster genome annotations were obtained from Ensembl Metazoa release 29 (http://metazoa.ensembl.org/Crassostrea_gigas/Info/Index). A Mg reference transcriptome was produced using 18,788 ESTs of mixed tissues previously obtained by Sanger sequencing (Venier et al., 2009) and 453 million reads obtained by paired-end (2 × 100 bp) Illumina Hiseq2000 sequencing of digestive gland from North Adriatic Sea mussels (ID: PRJNA88481) (Gerdol et al., 2014), and from haemocytes, gills, mantle and muscle of Spanish mussels (ID: SRP033481) (Moreira et al., 2015). The quality of the sequencing readout was evaluated by the FastQC suite (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) discarding the reads with PHRED quality below 20 and presenting more than two ambiguous nucleotides. De-novo assembly was performed with Trinity, release 2013-08-14 (Grabherr et al., 2011), setting the minimum contig length at 200 bp and using default settings. Subsequently, protein coding sequences (cds) were predicted with Transdecoder (Grabherr et al., 2011). Transcriptomic reads of 30 bivalve spp. (Cg plus other 29 species) were retrieved from the SRA archive and assembled as described above (details in File S1). The protein predictions of further 33 organisms were directly retrieved from public repositories or extracted from the corresponding genome releases. The NCBI transcriptome shotgun assembly (TSA) database was interrogated to retrieve hits for two additional cnidarians, Porites australiensis and Anthopleura elegantissima (Table 1).
Protein domain searches
To investigate the presence of eight key proteins of miRNA biogenesis (DROSHA, DGCR8, XPO5, RAN, DICER, TARBP2, AGO and PIWI), we downloaded their predictive HMM from PFAM v.27 (listed in Table 2) and we scanned the sequence datasets with HMMer v3.1 (Eddy, 2011) applying a cut-off E-value of 0.01. To achieve a meaningful comparison of proteins from different organisms, we retained only hits presenting all diagnostic domains. Moreover, we identified several mussel transcripts related to protein interactions occurring in the miRNA biogenesis. To identify such proteins, we retrieved from PFAM the diagnostic domains of human homologs (listed in Table 3) and we scanned their presence in the Mg transcriptome as described above. Protein domain organization was reconstructed using SMART (Letunic, Doerks & Bork, 2012).
Table 2. Key proteins of the miRNA biogenesis with their structural domains.
Process step | Protein name | Domains |
---|---|---|
Microprocessor complex | DROSHA | 2× RIBO III + 1× DSRM |
DGCR8 | 1× WW + 2× DSRM | |
Moving to cytoplasm | XPO5 | 1× XPO1 |
RAN | 1× RAN | |
RISC loading complex | DICER | PDB + Helicase + DICER + PAZ + 2× RIBO + DSRM |
TARBP2 | 3× DSRM | |
Final miRNA maturation | AGO | DUF + PAZ + PIWI |
PIWI-like | PAZ + PIWI |
Table 3. miRNA biogenesis proteins of Mytilus galloprovincialis.
Protein name | GenBank ID | Transcript length (bp) | Protein length (aa) | Identified domain(s) | Annotation (first hit) | E-value (e∧) | Similarity (%) | |
---|---|---|---|---|---|---|---|---|
Key miRNA biogenesis proteins | MgDROSHA | KT447251 | 4,384 | 1,377 | 2× RIBO III + 1× DSRM | Ribonuclease 3-like (Crassostrea gigas) | 0 | 67 |
MgDGCR8 | KT447252 | 2,483 | 728 | 1× WW + 2× DSRM | Microprocessor complex subunit DGCR8-like (Crassostrea gigas) | 0 | 50 | |
MgXPO5 | KT447259 | 3,875 | 1,201 | XPO1 | Exportin-5-like (Crassostrea gigas) | 0 | 55 | |
MgRAN | KT447254 | 1,113 | 214 | RAN | GTP-binding nuclear protein Ran (Crassostrea gigas) | −143 | 93 | |
MgDICER | KT447258 | 6,013 | 1,850 | PDB + Helicase + DICER + PAZ + 2× RIBO + DSRM | Endoribonuclease Dicer-like (Crassostrea gigas) | 0 | 58 | |
MgTARBP2 | KT447253 | 7,583 | 321 | 3× DSRM | Probable RISC-loading complex subunit (Crassostrea gigas) | −143 | 69 | |
MgAGO | KT447257 | 3,337 | 892 | DUF + PAZ + PIWI | Protein argonaute-2-like (Crassostrea gigas) | 0 | 84 | |
MgPIWIa | KT447255 | 2,686 | 867 | PAZ + PIWI | Piwi-like protein 1 (Crassostrea gigas) | 0 | 75 | |
MgPIWIb | KT447256 | 3,603 | 948 | PAZ + PIWI | Piwi-like protein 2 (Hydra vulgaris) | 0 | 59 | |
Other interacting proteins | MgGW182 | KT447250 | 3,825 | 1,274 | UBA + RRM | Trinucleotide repeat-containing gene 6C protein-like (Crassostrea gigas) | 0 | 45 |
MgCNOT1 | KT694355 | 5,373 | 1,791 | DUF3819 + NOT1 | CCR4-NOT transcription complex subunit 1-like (Crassostrea gigas) | 0 | 69 | |
MgCNOT2 | KT694357 | 864 | 288 | NOT2_ 3_ 5 | CCR4-NOT transcription complex subunit 2 (Pinctada fucata) | −156 | 82 | |
MgCNOT3 | KT694358 | 2,142 | 714 | NOT3 + NOT2_ 3_ 5 | CCR4-NOT transcription complex subunit 3-like (Crassostrea gigas) | 0 | 97 | |
MgCNOT6 | KT694359 | 2,592 | 864 | Exo_ endo_ phos | Uncharacterized protein LOC105348954 isoform X1 (Crassostrea gigas) | 0 | 71 | |
MgCNOT7 | KT694360 | 897 | 299 | CAF1 | CCR4-NOT transcription complex subunit 7-like (Crassostrea gigas) | 0 | 84 | |
MgCNOT9 | KT694361 | 927 | 309 | RCD1 | Cell differentiation protein RCD1 homolog (Crassostrea gigas) | 0 | 93 | |
MgCNOT10 | KT694356 | 2,133 | 711 | TPR_ 1 | CCR4-NOT transcription complex subunit 10-like (Crassostrea gigas) | 0 | 71 | |
MgDDX5 | KT694371 | 1,740 | 538 | DEAD + Helic | ATP-dependent RNA helicase DDX5 (Crassostrea gigas) | 0 | 75 | |
MgDDX6 | KT694372 | 1,332 | 443 | DEAD + Helic | ATP-dependent RNA helicase me31b (Crassostrea gigas) | 0 | 88 | |
MgDDX20 | KT694373 | 1,836 | 612 | DEAD + Helic | ATP-dependent RNA helicase DDX20 (Crassostrea gigas) | 0 | 77 | |
MgDDX42 | KT694374 | 2,196 | 731 | DEAD + Helic | ATP-dependent RNA helicase DDX42 (Crassostrea gigas) | 0 | 72 | |
MgPABP | KT694365 | 1,881 | 627 | 4× RRM + PABP | polyadenylate-binding protein 4 (Hydra vulgaris) | 0 | 74 | |
MgeIF4G | KT694364 | 5,019 | 1,672 | MIF4G + MA3 + W2 | eukaryotic translation initiation factor 4 gamma (Crassostrea gigas) | 0 | 57 | |
MgPAN2 | KT694367 | 3,606 | 1,202 | UCH_ 1 + RNase_T | PAB-dependent poly(A)-specific ribonuclease subunit PAN2 (Lingula anatina) | 0 | 72 | |
MgPAN3 | KT694368 | 2,334 | 778 | None | PAB-dependent poly(A)-specific ribonuclease subunit PAN3 (Lingula anatina) | 0 | 67 | |
MgDCP1 | KT694362 | 1,611 | 536 | DCP1 | mRNA-decapping enzyme 1A-like (Crassostrea gigas) | −126 | 73 | |
MgDCP2 | KT694363 | 1,313 | 385 | DCP2 + NUDIX | m7GpppN-mRNA hydrolase (Lingula anatina) | −117 | 67 | |
MgPRMT5 | KT694369 | 1,893 | 631 | PRMT5 | protein arginine N-methyltransferase 5-like (Crassostrea gigas) | 0 | 72 | |
MgTudor-11 | KT694370 | 2,682 | 894 | 4× SNc + TUDOR | Hypothetical protein mRNA (Lottia gigantea) | 0 | 73 | |
MgMaelstrom | KT694366 | 1,321 | 404 | HMG + MAEL | Protein maelstrom (Crassostrea gigas) | −155 | 62 |
Gene structure analysis
We used the transcript sequences of DROSHA, DGCR8, XPO5, DICER and TARBP2 as blast queries against all Mg genomic contigs (blastn) in order to recover the related gene structures. Positive hits having e-value lower than 10−20 were extracted and assembled on the corresponding transcript, used as backbone. RNA-seq read mappings with adapted parameters (CLC Genomic Workbench large gap mapping tool, with similarity and length fraction set at 0.9) allowed us to ascertain the correct gene assembly. Homolog gene structures were retrieved by interrogating genomic browsers, like Metazome v.3 (for C. intestinalis, B. floridae, D. rerio, S. kowalevskii, S. purpuratus, N. vectensis, T. castaneum, L. gigantea, O. bimaculoides, C. elegans and H. sapiens) and Ensembl Metazoa v.29 (for C. gigas, C. quinquefasciatus, D. melanogaster, N. vitripennis, A. mellifera, A. queenslandica, P. bachei, M. leidyi, T. adhaerens, N. vectensis, D. pulex, S. mansoni, S. mediterranea, A. vaga, L. anatine, H. robusta and C. telata) or by local blastn against the downloaded genomes (A. pisum and L. albipes).
Phylogenetic analysis
The inferred protein sequences were aligned using MUSCLE, release 2014-05-29 (Edgar, 2004). Subsequently, the fasta alignments were analyzed using Gblocks v.0.91 (Castresana, 2000) to extract conserved positions (positions common to 51% of the locally aligned sequences). Trees were built using neighbor joining or maximus likelihood clustering methods with 1,000 bootstrap replicates. Bayesian phylogenies were reconstructed using MrBayes v.3.2.5 (Ronquist et al., 2012), with GTR substitution evolutionary model with gamma-distributed rate variation across sites, evaluating the convergence after 1,000,000 runs (0.5 was considered as cut-off value). Trees were visualized and edited with FigTree v1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/).
Digital expression analysis
To analyze the expression of the selected genes in Cg and Mg RNA datasets, we retrieved all available RNA-seq samples from the NCBI SRA archive. For Cg, we analyzed 123 Illumina RNA-seq samples related to adult tissues or developmental stages. For Mg, we analyzed 13 RNA samples from gills (1), digestive gland (6), haemocytes (2), mantle (2) and muscle (2). Overall, we included in the expression analysis 2,271 and 453M reads for Cg and Mg, respectively (File S2). The trimmed reads were mapped to Cg and Mg genes using the CLC Genomics Workbench v.8.0 (Qiagen, Hilden, Germany) mapping tool, with length and similarity fractions set at 0.75 and 0.95, respectively, and mismatch/insertion/deletion penalties at 3/3/3. The number of uniquely mapped reads of each dataset were counted and used to calculate digital expression values as TPM (Transcripts Per Kilobase Million mapped reads) as described by (Wagner, Kin & Lynch, 2013), considering 3 TPMs as lower detection limit.
Results
Mussel transcripts related to the miRNA biogenesis
We identified Mytilus galloprovincialis transcripts involved in the miRNA biogenesis by systematic searches of diagnostic domains (Table 2) in a transcriptome assembly produced from 453 million Illumina reads. Thus, we recovered nine transcripts coding for DROSHA, DGCR8, XPO5, RAN, DICER, TARBP2 and for three Argonaute genes (one Ago and two Piwi-like proteins, Table 3). We also identified 21 mussel proteins expected to play a role in the miRNA maturation or involved in RNAi processes (File S3).
Figure 1 relates the general process of eukaryotic miRNA biogenesis to the mussel proteins identified in this work. MgDROSHA and MgDGCR8 are expected to start the maturation of pri-miRNAs produced by RNA polymerase II. MgDROSHA codes for a 1,377 aa length protein containing all the canonical domains (2 RIBOc domains in positions 959–1,093 and 1,139–1,271 and one DSRM domain in position 1,278–1,351) whereas MgDGCR8 is a 728 aa length protein having one WW domain in position 229–258, necessary for the interaction with DROSHA, and two DSRM domains (positions 472–536 and 578–642) necessary for pri-miRNA binding. MgXPO5 is expected to cooperate with MgRAN in the pre-miRNA cytoplasmic translocation. MgRAN encodes a 214 aa protein whereas MgXPO5 has a length of 1,201 aa and includes two 5’ conserved domains (IBN_N and Xpo1) and one conserved region necessary for the interaction with interleukin enhancer-binding factor 3 (position 525–562). In mussels, the RISC complex uploading pre-miRNAs is defined by the endoribonuclease MgDICER (1,850 aa) and MgTARBP2 (321 aa). Like in Lophotrocozoa, mussel DICER is encoded by a unique gene and contains the seven canonical domains, namely two helicase domains, one DICER-dimer domain, one PAZ, two RIBOc and a final DSRM domain. MgTARBP2 displays three DSRM domains in positions 9–73, 101–166 and 249–314. Moreover, M. galloprovincialis possess three argonaute proteins ranging from 861 to 941 aa in length and representative of one AGO (DUF1785, PAZ and PIWI domains) and two PIWI-like proteins (PAZ and PIWI domains). We considered the above mentioned gene products as the key complement of the miRNA biogenesis.
Among the possible interacting proteins, we identified MgGW182, a transcript encoding a protein shorter than the human counterparts but holding all the features considered significant for its interaction with AGOs and the CCR4-NOT complex. In fact, MgGW182 possesses 19 N-terminal GW stretches, followed by one UBA domain, a Q-rich region (M domain) and a C-terminal RNA recognition motif (RRM domain). Moreover, we recognized a C-terminal conserved site known as PAM2 (Kozlov et al., 2010), expected to interact with the poly(A) binding protein 1 (MgPABPC1) through the MLLE motif and inhibit the mRNA translation by interfering with the mRNA circularization process (Piao et al., 2010; Van Kouwenhove, Kedde & Agami, 2011). In the mussel transcriptome, we also found putative homologs for a number of CNOT complex proteins (CNOT1, 2, 3, 6, 7, 9, and 10), for the eukaryotic translation initiation factor 4 gamma, 1 eIF4G, PAB-dependent poly(A)-specific ribonuclease subunits PAN2, PAN3, the decapping complex proteins DCP1 and DCP2, and several RNA helicases demonstrated to be crucial in the miRNA maturation (DDX5) and RNAi (DDX5- 6- 20 and 42). Finally, we recognized the putative mussel homologs of protein arginine methyltransferase 5 (MgPRMT5), tudor domain containing protein (MgTDRD-11) and maelstrom spermatogenic transposon silencer (MgMAEL).
Mussel genes related to the miRNA biogenesis
Taking advantage of mussel WGS data (Nguyen, Hayes & Ingram, 2014) we investigated the organization of the main genes involved in the mussel miRNA biogenesis. Fragmentation of the genomic mussel assembly (2.3 million contigs; 700 bp on average) and considerable dimension of the analyzed genes (9.6–17.6 kbp gene size in the case of Cg) prevented the recovery of the full gene sequences. Nevertheless, we can describe the complete gene structures of DROSHA, DGCR8, EXP5, DICER and TARBP2 (i.e., five of eight searched sequences) whose length varies between 7.5 and 27 kbp, confirmed by the back-mapping of 115,377 Illumina paired reads (Fig. 2, File S4). Moreover, these mussel genes showed a remarkable conservation in terms of exon number when compared with a selection of homolog genes from deuterostome and protostome organisms (Table 4).
Table 4. Number of exons of five key miRNA biogenesis genes.
Species | Homo sapiens | Ciona intestinalis | Branchiostoma floridae | Danio rerio | Saccoglossus kowalevskii | Strongylocentrotus purpuratus | Nematostella vectensis | Amphimedon queenslandica | Caenorhabditis elegans | Capitella telata | Culex quinquefasciatus | Drosophila melanogaster | Nasonia vitripennis | Tribolium castaneum | Apis mellifera | Lasioglossus albipes | Acyrthosiphon pisum | Lottia gigantea | Crassostrea gigas | Mytilus galloprovincialis |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
ID | Hs | Ci | Bf | Dr | Sk | Sp | Nvec | Aq | Ce | Ct | Cq | Dm | Nv | Tc | Am | La | Ap | Lg | Cg | Mg |
DROSHA | 27 | 24 | 29 | 17 | 24 | 20 | 13 | 14 | 6 | 28 | 3 | 3 | 11 | 9 | 13 | 23 | 1 | 23 | 30 | 23 |
DGCR8 | 14 | 10 | 15 | 10 | 15 | 13 | 7 | No | 11 | 18 | 4 | 5 | 11 | 6 | 6 | 6 | 2 | 11 | 18 | 16 |
XPO5 | 32 | 4 | 8 | 21 | 29 | 31 | 2 | 28 | No | 30 | 9 | 2 | 11 | 10 | 9 | 10 | 1 | 32 | 34 | 31 |
DICER | 27 | 23 | 26 | 17 | 14 | 19 | 12 | 10 | 26 | 13 | 7 | 8 | 5 | 9 | 29 | 33 | 19 | 16 | 19 | 18 |
TARBP2 | 9 | 1 | 6 | 9 | 6 | 2 | No | No | 11 | 8 | 4 | 5 | 7 | 7 | 7 | 6 | 7 | 6 | 7 | 6 |
Transcripts related to the miRNA biogenesis in bivalve spp
To identify the miRNA biogenesis complements in marine mollusks, we used homologous genes retrieved from the genomes of C. gigas, L. gigantea and A. californica. Since the C. gigas genome includes annotations only for the cds regions, we exploited full-length transcripts obtained from a locally assembled oyster transcriptome to expand the genome annotations in this species. In particular, we updated the annotation of CgDGCR8 and CgDICER and we added new annotations for CgPIWI-1 (CGI_10008757: genomic contig JH815696, position 184178–187825) and CgTARBP2 (JH818440, 414703–419857).
Since many marine bivalve spp. do not have at present a sequenced genome, we used publicly available RNA-seq data to build 29 specie-specific transcriptome assemblies and retrieve the homologous sequences of interest. After domain searching, we carefully considered the high number of positive hits to retain only proteins including all the expected protein features. Thus, we retrieved 132 complete hits from marine mollusks: 10 DROSHAs, 9 DGCR8s, 14 XPO5s, 34 RANs, 7 DICERs, 13 TARBP2s and 45 Argonaute-like proteins, the latter classified in 13 AGO and 32 PIWI proteins by phylogenetic analysis (Table 1, File S5).
Phylogenetic analysis of the miRNA biogenesis proteins
The inferred sequences of single miRNA biogenesis proteins were aligned together with those retrieved from 34 sequenced genomes. Here, we report the phylogenetic analysis of the five proteins centrally involved in the miRNA biogenesis, namely DROSHA, DGCR8, DICER, TARBP2 and AGOs (File S5 includes all protein sequences). We back-traced the presence of a canonical DROSHA up to Cnidaria, although we found only incomplete hits in Porifera and Placozoa and the genomes of Ctenophora spp. lack of both DROSHA and DGCR8, as reported by other authors (Maxwell et al., 2012). The DROSHA sequences from Cnidaria’s appeared as general outgroup whereas those of Chordata clustered as outgroup of the other protostomes. DROSHAs from Mollusca and Arthropoda clustered consistently with the different taxa whereas those from Platyhelmintes, Rotifera, Brachiopoda and Annelida grouped together, with DROSHA from Caenorhabditis elegans (Nematoda) being the most far-related (Fig. 3A). Contrary to DROSHA, we identified a complete DGCR8 also in the Porifera Amphimedon queenslandica, suggesting that also DROSHA should be present in this taxa. Following phylogenetic analysis, we highlighted Cnidaria and Porifera proteins as outgroup, with mollusks (and Annelida) clustering with Arthropoda and more distantly Platyhelmintes and Rotifera hits. The Chordata sequences clustered as a separate group (Fig. 3B).
The finding of putative DICER sequences in Ctenophora spp. supports the presence of this gene through the whole Opisthokonta evolution (Maxwell et al., 2012). Also plants possess DICER homologues which occur in different copy number among taxa: two genes in Porifera, Placozoa, Cnidaria, Platyhelminthes and Arthropoda (with the exception of D. pulex that possess three genes); four genes in plants like A. thaliana and P. trichocarpa and one gene in Ctenophora, Rotifera, Cephalopoda Mollusca and Chordata. Moreover, the presence of DICER was reported in some Protozoa and fungi (Mukherjee, Campos & Kolaczkowski, 2013). Phylogenetic analyses, separate insect DICER-2,plant DICERs from DICER-1. DICER-1 clade shows a consistent clustering of Arthropoda, Mollusca and Chordata hits, whereas some branches of basal metazoans and Platyhelminthes are not well resolved (Fig. 3C). Likewise, the phylogenetic tree regarding TARBP2 displays a clear cut-off between the proteins of mollusks, chordates and arthropods (Fig. 3D). We back-traced the miRNA cytoplasm export complex composed by RAN and XPO5 in all analyzed metazoans. Both RAN and XPO5 represent widely expressed sequences since we found them also in transcriptome assemblies, although with suboptimal sequence coverage.
Several AGO and Piwi proteins can be present in individual organisms and, in fact, we identified a total of 235 proteins. Whereas humans possess eight proteins, we found four proteins in the majority of the analyzed insect spp. (with the exception of 15 proteins in A. pisum) and three or four different proteins in bivalve spp. Also, basal Metazoa possess Argonaute-like sequences: four in the genomes of Ctenophora and Cnidaria spp., one in the Placozoa T. adhaerens and two in A. queenslandica. The case of C. elegans is remarkable since it holds several Argonaute gene families and at least 24 proteins (Hoogstrate et al., 2014). In agreement with other phylogenetic studies (Swarts et al., 2014), the Argonaute proteins from plants and the majority of those from C. elegans formed distinct clades and, moreover, a clear separation was evident between AGO and PIWI proteins. Bivalve protein sequences clustered always separately forming one cluster for AGO-like hits and two clusters for PIWI-like proteins (Fig. 4).
Digital expression analysis of mussel and oyster miRNA biogenesis genes
We used the 13 Mg and 124 Cg RNA-seq samples to evaluate the expression levels of miRNA biogenesis genes in different tissues and conditions. Based on total mapped reads, we computed TPM values and we used elongation factor 1 α (El1α) as normalizer housekeeping gene to compare the expression level of the different genes in each sample.
For Mg, the sequence analysis indicated a scarce basal expression of the genes mentioned above in five adult tissues: gill, digestive gland, haemolymph, muscle and mantle (below 2% of El1α, except for DDX5, RAN and CNOT9). Mantle and muscle appeared the most responsive tissues whereas haemolymph was the least responsive one. In particular, the genes that we considered as the core components of miRNA biogenesis were expressed at levels below 0.5% of El1α (File S6).
For Cg, we analyzed a considerable number of RNA-seq libraries representative of adult tissues (85) and developmental stages (39) (File S6). In adult oysters we observed low basal expression, as detected in the mussel samples. In fact, none of the experimental conditions reported for the analyzed RNA-seq samples influenced substantially the expression of the core miRNA pathway genes (expression levels below 2% of El1α), with the exception of the high levels of CgPIWI-1 levels in male and female gonads (around 3.5%, Fig. 5). Conversely, most of the miRNA biogenesis genes were expressed at remarkable levels during the early stages of the oyster development: mainly from two cells to the rotary movement and, for some genes, also in the next developmental stages until D-shaped larvae, with no detectable signals afterward in spat and juveniles. Hence, these genes are particularly active in the early development, in particular one AGO (CGI_10020511) and two PIWI transcripts from the egg to trocophora (Fig. 5). In the same developmental stages we also noticed a remarkable expression of the key miRNA genes, with the co-expression of DROSHA and DGCR8 evident in all the analyzed samples.
Discussion
Small RNAs are important regulators of the gene expression, as recognized in various model and non-model organisms (Kim et al., 2014a; Kim et al., 2014b; Martini et al., 2014; Hussain & Asgari, 2014; Sahoo et al., 2014; Britton et al., 2014; Poole et al., 2014; Solofoharivelo et al., 2014), including some bivalves (Jiao et al., 2014; Zhou et al., 2014). In addition to the identification of miRNAs, a general comprehension of the miRNA biogenesis in itself is also significant (Grimson et al., 2008; Wu et al., 2011; Moran et al., 2013). However, the main genes involved in miRNA formation in bivalves have not been described and characterized so far. In this study, we have provided an overview on the miRNA biogenesis complements in bivalves spp., with particular attention to M. galloprovincialis and C. gigas. To the best of our knowledge, we report for the first time the presence of a complete miRNA biogenesis pathway in M. galloprovincialis, the full-length transcript sequences of DICER, DGCR8, XPO5, RAN, DROSHA, TARBP2, three Argonaute genes and the identification of many other components that are candidate miRNA complement-interacting proteins such as MgGW182. By using local transcriptome assemblies, we identified these genes also in many other marine bivalves. The general low expression levels of these transcripts in the adult tissues of both M. galloprovincialis and C. gigas, and the considerable gene size, have probably prevented a previous identification of full-length sequences in not-well-covered bivalve transcriptomes. In fact, we obtained complete transcript sequences only from sequenced genomes or highly-covered transcriptomes whereas in other transcriptome assemblies we retrieved only few complete sequences. Overall, we have analyzed 523 miRNA complement sequences, 145 of them belonging to marine mollusks and displaying a consistent sequence clustering (Ostreoida and Mytiloida proteins generated two distinct clades, located always as sister group of arthropods).
However, the copy number of Argonaute genes somewhat differs among bivalves, as C. gigas and A. californica genomes coding for four proteins (2 AGO and 2 PIWI proteins) whereas M. galloprovincialis and L. gigantea possess three proteins (1 AGO and 2 PIWIs). We also highlighted the over-expression of the miRNA biogenesis genes during the first phases of the oyster development. A genome protection mechanism based on piRNA expression during early developmental stages is well known in mammals (Malone & Hannon, 2009; Kim et al., 2014a; Kim et al., 2014b) but such mechanism has not been reported in bivalves and additional investigations are necessary.
Finally, the identification of several mussel proteins either necessary or cooperative in the miRNA biogenesis, supports the existence of a complete and functional miRNA pathway in mussels and, probably, in other bivalves. Up to now, protein–protein or protein-RNA interaction data are not available for bivalve spp. and these topics may represent a direction of work in the future. Meanwhile, the expression analyses of miRNA biogenesis genes coupled with the identification of the miRNAs expressed in naturally infected and laboratory-treated bivalves could provide both validation and new insights on these interesting processes.
Supplemental Information
Funding Statement
This work is supported by PRIN 2010-11 (20109XZEPR) and FP7-KBBE-2010-4-266157 (BIVALIFE). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare there are no competing interests.
Author Contributions
Umberto Rosani conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables.
Alberto Pallavicini conceived and designed the experiments, wrote the paper, reviewed drafts of the paper.
Paola Venier conceived and designed the experiments, contributed reagents/materials/analysis tools, wrote the paper, reviewed drafts of the paper.
DNA Deposition
The following information was supplied regarding the deposition of DNA sequences:
Genbank: KT447251, KT447252, KT447259, KT447254, KT447258, KT447253, KT447257, KT447255, KT447256, KT447250, KT694355, KT694356, KT694357, KT694358, KT694359, KT694360, KT694361, KT694362, KT694363, KT694364, KT694365, KT694366, KT694367, KT694368, KT694369, KT694370, KT694371, KT694372, KT694373, KT694374.
Data Availability
The following information was supplied regarding data availability:
The research in this article did not generate any raw data.
References
- Ambros (2003).Ambros V. MicroRNA pathways in flies and worms: growth, death, fat, stress, and timing. Cell. 2003;113(6):673–676. doi: 10.1016/S0092-8674(03)00428-8. [DOI] [PubMed] [Google Scholar]
- Bartel (2004).Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116(2):281–297. doi: 10.1016/S0092-8674(04)00045-5. [DOI] [PubMed] [Google Scholar]
- Berezikov et al. (2005).Berezikov E, Guryev V, Van de Belt J, Wienholds E, Plasterk RHA, Cuppen E. Phylogenetic shadowing and computational identification of human microRNA genes. Cell. 2005;120(1):21–24. doi: 10.1016/j.cell.2004.12.031. [DOI] [PubMed] [Google Scholar]
- Britton et al. (2014).Britton C, Winter AD, Gillan V, Devaney E. microRNAs of parasitic helminths—identification, characterization and potential as drug targets. International Journal for Parasitology: Drugs and Drug Resistance. 2014;4(2):85–94. doi: 10.1016/j.ijpddr.2014.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carmell et al. (2007).Carmell MA, Girard A, Van de Kant HJG, Bourc’his D, Bestor TH, De Rooij DG, Hannon GJ. MIWI2 is essential for spermatogenesis and repression of transposons in the mouse male germline. Developmental Cell. 2007;12(4):503–514. doi: 10.1016/j.devcel.2007.03.001. [DOI] [PubMed] [Google Scholar]
- Carninci et al. (2005).Carninci P, The FANTOM Consortium The transcriptional landscape of the mammalian genome. Science. 2005;309(5740):1559–1563. doi: 10.1126/science.1112014. [DOI] [PubMed] [Google Scholar]
- Castresana (2000).Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular Biology and Evolution. 2000;17(4):540–552. doi: 10.1093/oxfordjournals.molbev.a026334. [DOI] [PubMed] [Google Scholar]
- Chen et al. (2014).Chen G, Zhang C, Jiang F, Wang Y, Xu Z, Wang C. Bioinformatics analysis of hemocyte miRNAs of scallop chlamys farreri against acute viral necrobiotic virus (AVNV) Fish & Shellfish Immunology. 2014;37(1):75–86. doi: 10.1016/j.fsi.2014.01.002. [DOI] [PubMed] [Google Scholar]
- Denli et al. (2004).Denli AM, Tops BBJ, Plasterk RHA, Ketting RF, Hannon GJ. Processing of primary microRNAs by the microprocessor complex. Nature. 2004;432(7014):231–235. doi: 10.1038/nature03049. [DOI] [PubMed] [Google Scholar]
- Diebel et al. (2015).Diebel KW, Oko LM, Medina EM, Niemeyer BF, Warren CJ, Claypool DJ, Tibbetts SA, Cool CD, Clambey ET, Van Dyk LF. Gammaherpesvirus small noncoding RNAs are bifunctional elements that regulate infection and contribute to virulence in vivo. mBio. 2015;6(1):e01670–e01614. doi: 10.1128/mBio.01670-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eddy (2011).Eddy SR. Accelerated profile HMM searches. PLoS Computational Biology. 2011;7(10):e1763. doi: 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar (2004).Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32(5):1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ender & Meister (2010).Ender C, Meister G. Argonaute proteins at a glance. Journal of Cell Science. 2010;123(11):1819–1823. doi: 10.1242/jcs.055210. [DOI] [PubMed] [Google Scholar]
- Fromm et al. (2015).Fromm B, Billipp T, Peck LE, Johansen M, Tarver JE, King BL, Newcomb JM, Sempere LF, Flatmark K, Hovig E, Peterson KJ. A uniform system for the annotation of vertebrate microRNA genes and the evolution of the human microRNAome. Annual Review of Genetics. 2015;49(1):213–242. doi: 10.1146/annurev-genet-120213-092023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fabian & Sonenberg (2012).Fabian MR, Sonenberg N. The mechanics of miRNA-mediated gene silencing: a look under the hood of miRISC. Nature Structural & Molecular Biology. 2012;19(6):586–593. doi: 10.1038/nsmb.2296. [DOI] [PubMed] [Google Scholar]
- Gammon & Mello (2015).Gammon DB, Mello CC. RNA interference-mediated antiviral defense in insects. Current Opinion in Insect Science. 2015;8(April):111–120. doi: 10.1016/j.cois.2015.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao et al. (2014).Gao Z, Wang M, Blair D, Zheng Y, Dou Y. Phylogenetic analysis of the endoribonuclease dicer family. PLoS ONE. 2014;9(4):e1763. doi: 10.1371/journal.pone.0095350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerdol et al. (2014).Gerdol M, De Moro G, Manfrin C, Milandri A, Riccardi E, Beran A, Venier P, Pallavicini A. RNA Sequencing and de novo assembly of the digestive gland transcriptome in Mytilus galloprovincialis fed with toxinogenic and non-toxic strains of alexandrium minutum. BMC Research Notes. 2014;7:722. doi: 10.1186/1756-0500-7-722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghildiyal & Zamore (2009).Ghildiyal M, Zamore PD. Small silencing RNAs: an expanding universe. Nature Reviews Genetics. 2009;10(2):94–108. doi: 10.1038/nrg2504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- GIGACommunityofScientists (2014).GIGA Community of Scientists The global invertebrate genomics alliance (GIGA): developing community resources to study diverse invertebrate genomes. The Journal of Heredity. 2014;105(1):1–18. doi: 10.1093/jhered/est084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grabherr et al. (2011).Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, Di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nature Biotechnology. 2011;29(7):644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grimson et al. (2008).Grimson A, Srivastava M, Fahey B, Woodcroft BJ, Chiang HR, King N, Degnan BM, Rokhsar DS, Bartel DP. Early origins and evolution of microRNAs and piwi-interacting RNAs in animals. Nature. 2008;455(7217):1193–1197. doi: 10.1038/nature07415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hong et al. (2013).Hong S, Noh H, Chen H, Padia R, Pan ZK, Su S-B, Jing Q, Ding H-F, Huang S. Signaling by p38 MAPK stimulates nuclear localization of the microprocessor component p68 for processing of selected primary microRNAs. Science Signaling. 2013;6(266):ra16. doi: 10.1126/scisignal.2003706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoogstrate et al. (2014).Hoogstrate SW, Volkers RJM, Sterken MG, Kammenga JE, Snoek LB. Nematode endogenous small RNA pathways. Worm. 2014;3(March) doi: 10.4161/worm.28234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang et al. (2014).Huang J-T, Wang J, Srivastava V, Sen S, Liu S-M. MicroRNA machinery genes as novel biomarkers for cancer. Frontiers in Oncology. 2014;4:13. doi: 10.3389/fonc.2014.00113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huarte (2015).Huarte M. The emerging role of lncRNAs in cancer. Nature Medicine. 2015;21(11):1253–1261. doi: 10.1038/nm.3981. [DOI] [PubMed] [Google Scholar]
- Hussain & Asgari (2014).Hussain M, Asgari S. MicroRNAs as mediators of insect host–pathogen interactions and immunity. Journal of Insect Physiology. 2014;70:151–158. doi: 10.1016/j.jinsphys.2014.08.003. [DOI] [PubMed] [Google Scholar]
- Iwasaki, Siomi & Siomi (2015).Iwasaki YW, Siomi MC, Siomi H. PIWI-interacting RNA: its biogenesis and functions. Annual Review of Biochemistry. 2015;84:405–433. doi: 10.1146/annurev-biochem-060614-034258. [DOI] [PubMed] [Google Scholar]
- Jean-Philippe, Paz & Caputi (2013).Jean-Philippe J, Paz S, Caputi M. hnRNP A1: the Swiss army knife of gene expression. International Journal of Molecular Sciences. 2013;14(9):18999–19024. doi: 10.3390/ijms140918999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiao et al. (2014).Jiao Y, Zheng Z, Du X, Wang Q, Huang R, Deng Y, Shi S, Zhao X. Identification and characterization of MicroRNAs in pearl oyster pinctada martensii by solexa deep sequencing. Marine Biotechnology. 2014;16(1):54–62. doi: 10.1007/s10126-013-9528-x. [DOI] [PubMed] [Google Scholar]
- John et al. (2004).John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS. Human microRNA targets. PLoS Biology. 2004;2(11):e363. doi: 10.1371/journal.pbio.0020363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kapinas & Delany (2011).Kapinas K, Delany AM. MicroRNA biogenesis and regulation of bone remodeling. Arthritis Research & Therapy. 2011;13(3) doi: 10.1186/ar3325. Article 220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kenny et al. (2015).Kenny NJ, Namigai EKO, Marlétaz F, Hui JHL, Shimeld SM. Draft genome assemblies and predicted microRNA complements of the intertidal lophotrochozoans patella vulgata (Mollusca, Patellogastropoda) and spirobranchus (Pomatoceros) lamarcki (Annelida, Serpulida) Marine Genomics. 2015;24:139–146. doi: 10.1016/j.margen.2015.07.004. [DOI] [PubMed] [Google Scholar]
- Kim et al. (2014b).Kim S, Günesdogan U, Zylicz JJ, Hackett JA, Cougot D, Bao S, Lee C, Dietmann S, Allen GE, Sengupta R, Surani MA. PRMT5 protects genomic integrity during global DNA demethylation in primordial germ cells and preimplantation embryos. Molecular Cell. 2014b;56(4):564–579. doi: 10.1016/j.molcel.2014.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim et al. (2014a).Kim M-C, Lee S-W, Ryu D-Y, Cui F-J, Bhak J, Kim Y. Identification and characterization of microRNAs in normal equine tissues by next generation sequencing. PLoS ONE. 2014a;9(4):e1763. doi: 10.1371/journal.pone.0093662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kincaid & Sullivan (2012).Kincaid RP, Sullivan CS. Virus-encoded microRNAs: an overview and a look to the future. PLoS Pathogens. 2012;8(12):e1763. doi: 10.1371/journal.ppat.1003018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozlov et al. (2010).Kozlov G, Safaee N, Rosenauer A, Gehring K. Structural basis of binding of P-body-associated proteins GW182 and Ataxin-2 by the Mlle domain of poly(A)-binding protein. The Journal of Biological Chemistry. 2010;285(18):13599–13606. doi: 10.1074/jbc.M109.089540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozomara & Griffiths-Jones (2014).Kozomara A, Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Research. 2014;42:D68–D73. doi: 10.1093/nar/gkt1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lau & MacRae (2009).Lau P-W, MacRae IJ. The molecular machines that mediate microRNA maturation. Journal of Cellular and Molecular Medicine. 2009;13(1):54–60. doi: 10.1111/j.1582-4934.2008.00520.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Letunic, Doerks & Bork (2012).Letunic I, Doerks T, Bork P. SMART 7: recent updates to the protein domain annotation resource. Nucleic Acids Research. 2012;40(D1):D302–D305. doi: 10.1093/nar/gkr931. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lucas & Raikhel (2013).Lucas K, Raikhel AS. Insect microRNAs: biogenesis, expression profiling and biological functions. Insect Biochemistry and Molecular Biology. 2013;43(1):24–38. doi: 10.1016/j.ibmb.2012.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacRae et al. (2008).MacRae IJ, Ma E, Zhou M, Robinson CV, Doudna JA. In vitro reconstitution of the human RISC-loading complex. Proceedings of the National Academy of Sciences of the United States of America. 2008;105(2):512–517. doi: 10.1073/pnas.0710869105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malone & Hannon (2009).Malone CD, Hannon GJ. Small RNAs as guardians of the genome. Cell. 2009;136(4):656–668. doi: 10.1016/j.cell.2009.01.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martín-Gómez et al. (2014).Martín-Gómez L, Villalba A, Kerkhoven RH, Abollo E. Role of microRNAs in the immunity process of the flat oyster ostrea edulis against bonamiosis. Infection, Genetics and Evolution. 2014;27(October):40–50. doi: 10.1016/j.meegid.2014.06.026. [DOI] [PubMed] [Google Scholar]
- Martini et al. (2014).Martini P, Sales G, Brugiolo M, Gandaglia A, Naso F, De Pittà C, Spina M, Gerosa G, Chemello F, Romualdi C, Cagnin S, Lanfranchi G. Tissue-specific expression and regulatory networks of pig microRNAome. PLoS ONE. 2014;9(4):e1763. doi: 10.1371/journal.pone.0089755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maxwell et al. (2012).Maxwell EK, Ryan JF, Schnitzler CE, Browne WE, Baxevanis AD. MicroRNAs and essential components of the microRNA processing machinery are not encoded in the genome of the ctenophore Mnemiopsis leidyi. BMC Genomics. 2012;13:714. doi: 10.1186/1471-2164-13-714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mendell & Olson (2012).Mendell JT, Olson EN. MicroRNAs in stress signaling and human disease. Cell. 2012;148(6):1172–1187. doi: 10.1016/j.cell.2012.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Min & Chan (2015).Min P-K, Chan SY. The biology of circulating microRNAs in cardiovascular disease. European Journal of Clinical Investigation. 2015;45:860–874. doi: 10.1111/eci.12475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyoshi et al. (2009).Miyoshi K, Okada TN, Siomi H, Siomi MC. Characterization of the miRNA-RISC loading complex and miRNA-RISC formed in the Drosophila miRNA pathway. RNA. 2009;15(7):1282–1291. doi: 10.1261/rna.1541209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moran et al. (2013).Moran Y, Praher D, Fredman D, Technau U. The evolution of microRNA pathway protein components in Cnidaria. Molecular Biology and Evolution. 2013;30(12):2541–2552. doi: 10.1093/molbev/mst159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moreira et al. (2015).Moreira R, Pereiro P, Canchaya C, Posada D, Figueras A, Novoa B. RNA-Seq in mytilus galloprovincialis: comparative transcriptomics and expression profiles among different tissues. BMC Genomics. 2015;16(1):728. doi: 10.1186/s12864-015-1817-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mortimer, Kidwell & Doudna (2014).Mortimer SA, Kidwell MA, Doudna JA. Insights into RNA structure and function from genome-wide studies. Nature Reviews. Genetics. 2014;15(7):469–479. doi: 10.1038/nrg3681. [DOI] [PubMed] [Google Scholar]
- Mukherjee, Campos & Kolaczkowski (2013).Mukherjee K, Campos H, Kolaczkowski B. Evolution of animal and plant dicers: early parallel duplications and recurrent adaptation of antiviral RNA binding in plants. Molecular Biology and Evolution. 2013;30(3):627–641. doi: 10.1093/molbev/mss263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen, Hayes & Ingram (2014).Nguyen TTT, Hayes BJ, Ingram BA. Genetic parameters and response to selection in blue mussel (mytilus galloprovincialis) using a SNP-based pedigree. Aquaculture. 2014;420–421(January):295–301. doi: 10.1016/j.aquaculture.2013.11.021. [DOI] [Google Scholar]
- Nottrott, Simard & Richter (2006).Nottrott S, Simard MJ, Richter JD. Human Let-7a miRNA blocks protein production on actively translating polyribosomes. Nature Structural & Molecular Biology. 2006;13(12):1108–1114. doi: 10.1038/nsmb1173. [DOI] [PubMed] [Google Scholar]
- Piao et al. (2010).Piao X, Zhang X, Wu L, Belasco JG. CCR4-NOT deadenylates mRNA associated with RNA-induced silencing complexes in human cells. Molecular and Cellular Biology. 2010;30(6):1486–1494. doi: 10.1128/MCB.01481-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poole et al. (2014).Poole CB, Gu W, Kumar S, Jin J, Davis PJ, Bauche D, McReynolds LA. Diversity and expression of microRNAs in the filarial parasite, brugia malayi. PLoS ONE. 2014;9(5) doi: 10.1371/journal.pone.0096498. e96498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qu et al. (2015).Qu S, Yang X, Li X, Wang J, Gao Y, Shang R, Sun W, Dou K, Li H. Circular RNA: a new star of noncoding RNAs. Cancer Letters. 2015;365:141–148. doi: 10.1016/j.canlet.2015.06.003. [DOI] [PubMed] [Google Scholar]
- Resch & Palakodeti (2012).Resch AM, Palakodeti D. Small RNA pathways in schmidtea mediterranea. The International Journal of Developmental Biology. 2012;56(1–3):67–74. doi: 10.1387/ijdb.113436ar. [DOI] [PubMed] [Google Scholar]
- Rhee, Chae & Kim (2015).Rhee S, Chae H, Kim S. PlantMirnaT: miRNA and mRNA integrated analysis fully utilizing characteristics of plant sequencing data. Methods. 2015;83:80–87. doi: 10.1016/j.ymeth.2015.04.003. [DOI] [PubMed] [Google Scholar]
- Ronquist et al. (2012).Ronquist F, Teslenko M, Van der Mark P, Ayres D, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic Biology. 2012;61(3):539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruan (2015).Ruan X. Long noncoding RNA central of glucose homeostasis. Journal of Cellular Biochemistry. 2015 doi: 10.1002/jcb.25427. Epub ahead of print Nov 26 2015. [DOI] [PubMed] [Google Scholar]
- Sahoo et al. (2014).Sahoo G, Ansari MD, Dikhit M, Gupta N, Rana S, Das P. Computational identification of microRNA-like elements in leishmania major. MicroRNA. 2014;2(3):225–230. doi: 10.2174/2211536602666131203232422. [DOI] [PubMed] [Google Scholar]
- Sand et al. (2012).Sand M, Skrygan M, Georgas D, Sand D, Gambichler T, Altmeyer P, Bechara FG. The miRNA machinery in primary cutaneous malignant melanoma, cutaneous malignant melanoma metastases and benign melanocytic nevi. Cell and Tissue Research. 2012;350(1):119–126. doi: 10.1007/s00441-012-1446-0. [DOI] [PubMed] [Google Scholar]
- Siomi et al. (2011).Siomi MC, Sato K, Pezic D, Aravin AA. PIWI-interacting small RNAs: the vanguard of genome defence. Nature Reviews Molecular Cell Biology. 2011;12(4):246–258. doi: 10.1038/nrm3089. [DOI] [PubMed] [Google Scholar]
- Sokolova et al. (2011).Sokolova OA, Iakushev EI, Stoliarenko AD, Mikhaleva EA, Gvozdev VA, Klenov MS. The interplay of transposon silencing genes in the Drosophila melanogaster germline. Molekuliarnaia Biologiia. 2011;45(4):633–641. [PubMed] [Google Scholar]
- Solofoharivelo et al. (2014).Solofoharivelo MC, Van der Walt AP, Stephan D, Burger JT, Murray SL. MicroRNAs in fruit trees: discovery, diversity and future research directions. Plant Biology. 2014;16(5):856–865. doi: 10.1111/plb.12153. [DOI] [PubMed] [Google Scholar]
- Swarts et al. (2014).Swarts DC, Makarova K, Wang Y, Nakanishi K, Ketting RF, Koonin EV, Patel DJ, Van der Oost J. The evolutionary journey of argonaute proteins. Nature Structural & Molecular Biology. 2014;21(9):743–753. doi: 10.1038/nsmb.2879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarver et al. (2015).Tarver JE, Cormier A, Pinzón N, Taylor RS, Carré W, Strittmatter M, Seitz H, Coelho SM, Cock JM. microRNAs and the evolution of complex multicellularity: identification of a large, diverse complement of microRNAs in the brown alga ectocarpus. Nucleic Acids Research. 2015;43:6384–6398. doi: 10.1093/nar/gkv578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarver, Donoghue & Peterson (2012).Tarver JE, Donoghue PCJ, Peterson KJ. Do miRNAs have a deep evolutionary history? BioEssays. 2012;34(10):857–866. doi: 10.1002/bies.201200055. [DOI] [PubMed] [Google Scholar]
- Théron et al. (2014).Théron E, Dennis C, Brasset E, Vaury C. Distinct features of the piRNA Pathway in somatic and germ cells: from piRNA cluster transcription to piRNA processing and amplification. Mobile DNA. 2014;5(1):28. doi: 10.1186/s13100-014-0028-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tolia & JoshuaTor (2007).Tolia NH, Joshua-Tor L. Slicer and the argonautes. Nature Chemical Biology. 2007;3(1):36–43. doi: 10.1038/nchembio848. [DOI] [PubMed] [Google Scholar]
- Tomoyasu et al. (2008).Tomoyasu Y, Miller SC, Tomita S, Schoppmeier M, Grossmann D, Bucher G. Exploring systemic RNA interference in insects: a genome-wide survey for RNAi genes in tribolium. Genome Biology. 2008;9(1) doi: 10.1186/gb-2008-9-1-r10. Article R10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Kouwenhove, Kedde & Agami (2011).Van Kouwenhove M, Kedde M, Agami R. MicroRNA regulation by RNA-binding proteins and its implications for cancer. Nature Reviews Cancer. 2011;11(9):644–656. doi: 10.1038/nrc3107. [DOI] [PubMed] [Google Scholar]
- Venier et al. (2009).Venier P, De Pittà C, Bernante F, Varotto L, De Nardi B, Bovo G, Roch P, Novoa B, Figueras A, Pallavicini A, Lanfranchi G. MytiBase: a knowledgebase of mussel (M. galloprovincialis) transcribed sequences. BMC Genomics. 2009;10(February):72. doi: 10.1186/1471-2164-10-72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagner, Kin & Lynch (2013).Wagner GP, Kin K, Lynch VJ. A model based criterion for gene expression calls using RNA-seq data. Theory in Biosciences Theorie in Den Biowissenschaften. 2013;132(3):159–164. doi: 10.1007/s12064-013-0178-3. [DOI] [PubMed] [Google Scholar]
- Wu et al. (2011).Wu B, Jin M, Gong J, Du X, Bai Z. Dynamic evolution of CIKS (TRAF3IP2/Act1) in metazoans. Developmental & Comparative Immunology. 2011;35(11):1186–1192. doi: 10.1016/j.dci.2011.03.027. [DOI] [PubMed] [Google Scholar]
- Xu et al. (2014).Xu M-J, Qiu S-B, Nisbet AJ, Fu J-H, Shao C-C, Zhu X-Q. Global characterization of microRNAs in trichomonas gallinae. Parasites & Vectors. 2014;7 doi: 10.1186/1756-3305-7-99. Article 99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu et al. (2015).Xu C, Zhang L, Li H, Liu Z, Duan L, Lu C. MiRNA-1469 promotes lung cancer cells apoptosis through targeting STAT5a. American Journal of Cancer Research. 2015;5(3):1180–1189. [PMC free article] [PubMed] [Google Scholar]
- Zhang et al. (2012).Zhang G, Fang X, Guo X, Li L, Luo R, Xu F, Yang P, Zhang L, Wang X, Qi H, Xiong Z, Que H, Xie Y, Holland PWH, Paps J, Zhu Y, Wu F, Chen Y, Wang J, Peng C, Meng J, Yang L, Liu J, Wen B, Zhang N, Huang Z, Zhu Q, Feng Y, Mount A, Hedgecock D, Xu Z, Liu Y, Domazet-Lošo T, Du Y, Sun X, Zhang S, Liu B, Cheng P, Jiang X, Li J, Fan D, Wang W, Fu W, Wang T, Wang B, Zhang J, Peng Z, Li Y, Li N, Wang J, Chen M, He Y, Tan F, Song X, Zheng Q, Huang R, Yang H, Du X, Chen L, Yang M, Gaffney PM, Wang S, Luo L, She Z, Ming Y, Huang W, Zhang S, Huang B, Zhang Y, Qu T, Ni P, Miao G, Wang J, Wang Q, Steinberg CEW, Wang H, Li N, Qian L, Zhang G, Li Y, Yang H, Liu X, Wang J, Yin Y, Wang J. The oyster genome reveals stress adaptation and complexity of shell formation. Nature. 2012;490(7418):49–54. doi: 10.1038/nature11413. [DOI] [PubMed] [Google Scholar]
- Zhou et al. (2014).Zhou Z, Wang L, Song L, Liu R, Zhang H, Huang M, Chen H. The identification and characteristics of immune-related microRNAs in haemocytes of oyster Crassostrea gigas. PLoS ONE. 2014;9(2):e1763. doi: 10.1371/journal.pone.0088397. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
The research in this article did not generate any raw data.