Skip to main content
PeerJ logoLink to PeerJ
. 2015 Oct 20;3:e1332. doi: 10.7717/peerj.1332

miRMOD: a tool for identification and analysis of 5′ and 3′ miRNA modifications in Next Generation Sequencing small RNA data

Abhinav Kaushik 1,#, Shradha Saraf 1,#, Sunil K Mukherjee 2, Dinesh Gupta 1,
Editor: Elena Papaleo
PMCID: PMC4662591  PMID: 26623179

Abstract

In the past decade, the microRNAs (miRNAs) have emerged to be important regulators of gene expression across various species. Several studies have confirmed different types of post-transcriptional modifications at terminal ends of miRNAs. The reports indicate that miRNA modifications are conserved and functionally significant as it may affect miRNA stability and ability to bind mRNA targets, hence affecting target gene repression. Next Generation Sequencing (NGS) of the small RNA (sRNA) provides an efficient and reliable method to explore miRNA modifications. The need for dedicated software, especially for users with little knowledge of computers, to determine and analyze miRNA modifications in sRNA NGS data, motivated us to develop miRMOD. miRMOD is a user-friendly, Microsoft Windows and Graphical User Interface (GUI) based tool for identification and analysis of 5′ and 3′ miRNA modifications (non-templated nucleotide additions and trimming) in sRNA NGS data. In addition to identification of miRNA modifications, the tool also predicts and compares the targets of query and modified miRNAs. In order to compare binding affinities for the same target, miRMOD utilizes minimum free energies of the miRNA:target and modified-miRNA:target interactions. Comparisons of the binding energies may guide experimental exploration of miRNA post-transcriptional modifications. The tool is available as a stand-alone package to overcome large data transfer problems commonly faced in web-based high-throughput (HT) sequencing data analysis tools. miRMOD package is freely available at http://bioinfo.icgeb.res.in/miRMOD.

Keywords: 3 and 5 modifications, miRNA, Next Generation Sequencing (NGS), Target alteration, Non-templated additions, Trimming

Introduction

miRNAs are a class of small non-coding RNAs (ncRNAs) of size 20–24 bases, acting as post-transcriptional regulators of gene expression (Bartel, 2009). miRNAs and transcription factors are important components of gene regulatory networks (Gu, Zhang & Wang, 2012). Thus, any molecular modification in the regulators of integrated networks produces several downstream effects (Gu & Xuan, 2013; Wang, Gu & Li, 2014). It is established in plants as well as animal cells that different post-transcriptional processes modify miRNAs after biogenesis. The modifications processes include trimming, addition or substitution of miRNA nucleotides (Ebhardt et al., 2009; Kim, Heo & Kim, 2010). Modified miRNA with terminal additions may be of two types, namely templated or non-templated depending on its alignment or non-alignment with the reference genome, respectively. The non-templated additions (NTA) are biologically relevant and physiologically regulated processes mediated by enzymes (Wyman et al., 2011). Although, the functional significance of such post-transcriptional modifications is yet to be extensively explored, few studies have concluded that the modifications play an important role in miRNA stability and affects miRNA-target binding and hence the efficiency of target repression. For example, in humans-the terminal adenylation of miR-122 enhances the miRNA stability while terminal uridylation of miR-26A reduces its ability to inhibit its target (Jones et al., 2009; Katoh et al., 2009). In general, adenylation increases miRNA stability, whereas uridylation promotes miRNA degradation (Baccarini et al., 2011; Ibrahim et al., 2010; Katoh et al., 2009; Lu, Sun & Chiang, 2009). However, in few exceptional cases, as in humans and drosophila, both the 3′ adenylation and uridylation promote miRNA degradation (Ameres et al., 2010). It is known that amongst the two miRNA terminal modification types, the 3′ end modifications are more common as compared to the 5′ end modifications (Ryan, Robles & Harris, 2010). The less prevalent 5′ end modifications are of great interest as the modifications can modify the seed sequence of its reference miRNA and may alter target binding or even change target mRNAs (Ameres & Zamore, 2013). The 5′ modifications are also reported to be a widely conserved phenomenon, though its functional significance is yet to be explored (Ryan, Robles & Harris, 2010; Wu et al., 2007). Most of the reported miRNA modification studies primarily focus on terminal adenylation and uridylation, however there are fewer reports on modifications with cytosine and guanine (Aravin & Tuschl, 2005; Burroughs et al., 2010; Morin et al., 2008). Previous studies suggest that post-transcriptional modification in miRNAs is a highly regulated phenomenon and few miRNAs are comparatively more frequently subjected to modification (Wyman et al., 2011). Therefore, local and global analysis of miRNA modifications can enhance our understanding of miRNA medicated cellular gene regulatory mechanisms. The availability of affordable high throughput (HT) small RNA (sRNA) sequencing techniques has made it possible to determine and analyze modified sRNA sequences on a global scale. However, analyzing such HT data is not only a computational challenge but also relatively more difficult for biologists with non-computational academic background. Moreover, the literature survey revealed that currently very few computational tools are available to specifically study different types of miRNA modifications in sRNA NGS datasets (see Table 1). For example, isomiRex (Sablok et al., 2013) determines only the miRNA variants that align with the precursor miRNA sequences, whereas tools like miRanalyzer (Hackenberg et al., 2009) and miRGator (Cho et al., 2012) searches only 3′ non-templated modifications. SeqBuster is limited to determination of 3′ mofications and 5′ trimming (Pantano, Estivill & Marti, 2010). CPSS (Zhang et al., 2012) and Chimira (Vitsios & Enright, 2015) servers processes datasets for limited number of species only. Another tool, isomiRID (De Oliveira, Christoff & Margis, 2013), reports modified miRNA list without distinguishing templated and non-templated modifications. IsomiRage (Muller, Marzi & Nicassio, 2014) is a tool for quantifying user defined miRNA modifications in a small RNA NGS. Similar to the existing tools discussed above, the databases like YM500v2 (Cheng et al., 2013) also performs isomiR searches; however, it is restricted to data related to human cancers. Apart from the above-mentioned limitations, the web-based tools and databases also have technical and computational limitations like restriction of size and type of input files. The output files generated by most of the tools for analysis of miRNA modifications are simple text or html files, which more often requires manual efforts to obtain relevant statistics. Moreover, none of the tools focuses on the effect of modifications on miRNA-target interactions- an obvious and expected translational effect of such modifications.

Table 1.

Comparison of miRMOD with other existing tools.

5′ end modification 3′ end modification Graphical display Target alteration analysis Stand alone
Addition Trimming Addition Trimming
miRanalyzer No Yes Yes Yes No No Noa
miRGator No No Yes Yes No No Nob
CPSS Yes Yes Yes Yes Yesc No Noa
isomiRID Yes Yes Yes Yes No No Yes
SeqBuster No Yes Yes Yes Yesc No Yes
miRMOD Yes Yes Yes Yes Yes Yes Yes

Notes.

a

Restricted to selected species only.

b

File size limit <20 MB.

c

Limited features only.

The limitations of the existing tools motivated us to develop miRMOD package, which not only identifies and analyzes miRNA modifications (non-templated additions as well as trimming) but also performs local and global analysis of changes associated with modifications. miRMOD also provides an optional feature to predict the effect of miRNA modifications on miRNA-target affinities, which is consequential for its biological function.

Materials and Methods

Input file generation

Figure 1 is a summary illustration of miRMOD pipeline. miRMOD tool essentially requires three input files- namely, a fasta file of miRNA sequences, a fasta file of sRNA reads and Bowtie generated alignment file (Langmead et al., 2009). Prior to miRMOD run, the sRNA input file for Bowtie and miRMOD is required to be reformatted with the ‘prepare_input’ program, packaged with miRMOD. The ‘prepare_input’ module simply converts input file (fasta/fastq/tab separated format) into a miRMOD accepted preprocessed fasta format (No normalization is performed). The preprocessed file (output from ‘prepare_input’) should be used for Bowtie alignment with its reference genome or pre-miRNAs, using user-defined parameters. miRMOD package is bundled with three sample files, including a fasta file containing mature miRNA sequences in the human genome (2578 sequences; source: miRBase version 20).

Figure 1.

Figure 1

miRMOD workflow.

Design and algorithm

The miRMOD algorithm uses the following criteria in order to search nucleotide additions/trimming in NGS datasets. The first criterion is that the input miRNA sequences must perfectly align with the corresponding reference genome. The second criterion is that the miRNAs reads with mono or poly-nucleotide additions at the 5′ or 3′ end of a miRNA sequence must not align with the reference genome.

For each miRNA, a Z-score is calculated to measure its relative tendency to get modified under given miRMOD run parameters. The score is calculated using the following formula:

Z-score=siμSσS

where, si represents the score for ith miRNA and defined as the product of number of modifications and total number of modified reads observed for a given miRNA. μ(S) represents the mean value of all miRNA scores (S) and σ(S) represents standard deviation of S.

Putative target sequences may also be specified as an optional input in order to compare target-binding affinities of query miRNA sequences with that of modified miRNAs. Target prediction is performed by RNAhybrid (Kruger & Rehmsmeier, 2006) using default parameters, followed by calculation of energy change and alteration in the target sites due to the modifications. miRMOD results are organized in different windows and tabs.

Sample datasets analysis and evaluation

In order to evaluate and check the robustness of miRMOD, we downloaded clean NGS reads of 15 libraries from a GEO entry with accession number GSE21279 (SRA: SRP002272). The reads of size 19–25 nucleotides and abundance equal or greater than 2 were selected for miRMOD analysis. The fasta formatted clean reads were converted into a miRMOD readable format, using miRMOD ‘prepare_input’. The ‘prepare_input’ processed reads were then mapped to the reference human genome using Bowtie (Langmead et al., 2009) . Only known miRNA sequences (2578 miRNA sequences, miRBASE v20) from each dataset libraries are used as the query sequences in order to identify miRNA modifications. The miRMOD output of miRNA modifications for all the 15 libraries was compared and validated with the modifications previously reported for the test case (Li et al., 2012).

Results

miRMOD facilitates identification and analysis of miRNA modifications in sRNA HT data. miRMOD identifies single and multiple additions/deletions of terminal nucleotides in miRNA reads. miRMOD analysis of already reported datasets illustrates the utility of the tool and helped evaluation of the package. Tabular display of output produced by miRMOD helps in visualization and analysis of miRNA modifications at global as well as local level. For each library, the first tab in the main miRMOD output summarizes the results in 3 sub-sections. The first sub-section displays the gross summary results of modified miRNAs in the dataset with SRA ID: SRX018958 (252 reads in this dataset), with their total number of modifications (1,699 reads) and the miRNA in the input dataset with the highest number of modified reads (Fig. 2). The length distribution graph shows that mono-nucleotide modifications are much higher than polynucleotide modifications. The second section of the tab gives the distribution of modified reads in four different types of modifications (3′ addition: 26, 75,808, 5′ addition: 8,512, 3′ trimming: 19, 81,380 and 5′ trimming: 70,237) and a graphical display of the results. The section also displays the percentage of modified reads present in input dataset (in this case it is 40%). The third section of the first tab summarizes the abundance of different type of sRNA reads i.e., miRNAs and the corresponding modified reads present in the input library, facilitating a global analysis. The ‘Details’ tab (Fig. 3B) gives comprehensive information about different types of modifications found for each query miRNA along with its modified form. The ‘Frequency table’ tab provides the percentage occurrence of each type of mono or poly-nucleotide modifications in the input dataset (Fig. 3B). A more detailed description about the utility and unique capabilities of miRMOD are discussed later with appropriate examples and comparisons.

Figure 2.

Figure 2

miRMOD main output page for the sample library SRX108958.

Figure 3.

Figure 3

miRMOD output from the additional tabs.

miRMOD accepts fasta sequences of putative or already established miRNA targets (e.g., 3′ UTR) to predict and compare minimum free energy and target site changes in miRNA:target and modified-miRNA:target interactions. This kind of analysis facilitates the prediction of modifications leading to target alteration, or strength of possible binding (Ryan, Robles & Harris, 2010).

miRMOD performance

We compared the miRMOD analysis of 3′ nucleotide additions with the results reported by Li and coworkers (Li et al., 2012), see ‘Sample dataset analysis and evaluation’ for details of miRMOD run. Notably, using miRMOD we were able to identify all 3′ additions (only non-templated) reported by them in each of the 15 libraries. Additionally, miRMOD also determines several other terminal modifications i.e., 3′ trimming, 5′ additions and 5′ trimming, not reported by them. Thus, miRMOD can help in identification and analysis of different types of novel modifications. We calculated the percentage of single (A, T, G and C) and di-nucleotide (A,T,G,C), 3′ and 5′ additions as well as trimming in each of the sample libraries (Tables 2 and 3). The percentage addition of ‘A’ at the 3′ miRNA end is the highest among all the other nucleotide additions in the libraries, followed by the ‘AT’ and ‘T’. Apart from ‘A’ and ‘T’ additions, miRMOD assisted analysis also revealed significant percentages of ‘C’ and ‘G’ nucleotide additions or miRNA trimmings, in addition to what is already reported in the publication related to the datasets (Li et al., 2012). The frequency of miR122-5p 3′ addition with adenylation is comparatively higher than that of any other miRNA in the libraries, in agreement with the previous reports (Li et al., 2012). As expected, the proportion of ‘C’ and ‘G’ addition at 3′ end is significantly less than ‘A’ or ‘T’ additions. However, the ‘C’ and ‘G’ nucleotides are trimmed mainly at 3′ end except in the sample SRX018961, where ‘A’ dominates all the other mono-nucleotide trimmings. Additionally, we also analyzed the nucleotide additions and trimming at the 5′ ends of miRNA sequences in all the sample libraries. It may be noted that there are comparatively few reports of 5′ modifications in the literature. The miRMOD analysis of the 15 libraries analyzed by us reveal that ‘C’ is the most frequently added nucleotide at the miRNA 5′ ends, followed by ‘A’ and ‘T’, in the order of decreasing frequencies. In the case of 5′ trimming, mono-nucleotide deletion of ‘C’ followed by ‘T’ surpasses all other nucleotide deletions. Thus, we found that ‘C’ is an important nucleotide involved in the 5′ end miRNA modification. The miRMOD analysis reveals that the most common terminal miRNA modification is 3′ additions, followed by 3′ trimming, 5′ trimming and 5′ additions, in the descending order of frequencies. As reported in previous studies, 3′ additions also include AA, TT and TA in significant proportion, which is also confirmed by miRMOD analysis of the dataset (Burroughs et al., 2010; Li et al., 2012). Apart from mono or di nucleotide modifications, miRMOD also determines polynucleotide additions and deletions at the miRNA terminal ends (Table S1); however, these form a very small fraction of the overall modifications.

Table 2.

Frequency table of 3′ and 5′ additions.

(A) Frequency table of 3′ addition
Library A AT T AA TT CA G TA C GT GA AG GG Others
SRX018957 69.84 9.05 3.92 4.76 1.32 0.41 0.62 0.75 0.32 0.19 0.13 0.22 0.02 8.46
SRX018958 74.39 7.8 3.72 3.05 1.25 0.32 0.6 0.55 0.18 0.17 0.08 0.1 0.01 7.8
SRX018959 81.1 5.76 2.93 2.83 1.08 0.24 0.6 0.45 0.17 0.13 0.08 0.09 0 4.53
SRX018960 76.98 5.68 5.26 4.27 0.79 0.55 0.58 0.37 0.35 0.2 0.11 0.18 0.01 4.65
SRX018961 65.5 4.94 12.43 4.67 1.39 2.69 0.84 0.59 0.4 0.35 0.3 0.29 0.06 5.54
SRX018962 72.49 8.35 6.08 3.65 1.02 0.42 0.69 0.56 1.27 0.18 0.17 0.16 0.12 4.82
SRX018963 70.66 9.98 5.88 3.55 1.06 0.46 0.77 0.59 1.07 0.17 0.11 0.14 0.06 5.5
SRX018964 64.92 11.2 6.92 4.03 1.55 0.62 0.76 0.73 1 0.2 0.14 0.17 0.13 7.64
SRX018965 70.08 9.45 4.97 4.27 0.8 0.93 1.04 0.56 1.23 0.21 0.4 0.3 1.46 4.3
SRX018966 77.6 6.05 3.66 3.35 1.34 0.5 0.53 0.62 0.14 0.16 0.08 0.1 0.01 5.85
SRX018967 80.8 5.8 2.74 3.61 0.58 0.37 0.81 0.49 0.13 0.12 0.11 0.15 0.01 4.28
SRX018968 81.68 4.1 3.46 3.64 1.01 0.6 0.52 0.57 0.25 0.16 0.21 0.14 0.02 3.62
SRX018969 75.93 7.02 4.63 2.48 1.34 0.49 0.62 0.48 0.18 0.19 0.08 0.07 0.01 6.5
SRX018970 76.57 5.62 2.67 2.65 1.84 0.22 0.46 0.47 0.16 0.15 0.07 0.09 0.01 9.01
SRX018971 54.78 9.49 5.87 9.64 1.86 3.46 1.11 1.7 0.48 0.9 0.39 0.23 0.01 10.08
Average 72.89 7.35 5.01 4.03 1.22 0.82 0.7 0.63 0.49 0.23 0.16 0.16 0.13 6.17

Table 3.

Frequency table of 3′ and 5′ trimmings.

(A) Frequency table of 3′ trimming
Library G TG A C T GA TT GT CA CC GC TA AA CT AG CG Others
SRX018957 38 14.4 5.69 11.4 13.3 0.49 3.17 0.26 0.03 0.33 0.2 0.45 0.37 0.24 0.16 0.04 11.5
SRX018958 37.1 7.38 12.2 12 7.27 2.55 1.76 1.08 0.76 0.9 0.21 0.35 0.48 0.22 0.14 0.12 15.56
SRX018959 45.6 7.87 8.13 10.3 6.64 2.48 1.63 1.05 0.82 0.74 0.2 0.24 0.36 0.16 0.14 0.08 13.61
SRX018960 31.6 14.9 19.6 10.6 5.67 0.64 1.41 0.76 0.7 0.24 0.43 0.79 0.58 0.22 0.2 0.01 11.62
SRX018961 20.3 23 22.3 8.27 6 1.65 1.95 0.79 0.58 0.28 0.45 0.92 0.33 0.25 0.18 0.03 12.8
SRX018962 33.3 17.8 9.58 7.49 5.51 1.39 2.06 2.25 1.04 0.35 0.63 0.37 0.4 0.2 0.23 0.03 17.35
SRX018963 30.6 15.4 12.8 7.65 8.75 1.23 2.41 1.78 1.02 0.38 0.54 0.54 0.39 0.22 0.21 0.05 16.12
SRX018964 33 13.2 12.3 8.85 8.82 1.34 2.12 1.55 1.02 0.37 0.63 0.42 0.5 0.28 0.18 0.05 15.36
SRX018965 34.4 12.8 8.28 8.65 3.9 2.8 0.48 5.93 1.93 0.47 0.54 0.15 0.31 0.9 0.32 0.31 17.87
SRX018966 42.6 8.14 10.7 8.5 8.49 4.06 1.98 1.11 0.75 0.74 0.25 0.28 0.3 0.27 0.11 0.16 11.62
SRX018967 50.5 7.96 4.69 13.8 4.57 4.42 1.08 0.76 0.81 0.94 0.43 0.1 0.1 0.13 0.07 0.08 9.59
SRX018968 40.8 11.2 7.52 5.14 8.62 4.04 3 1.66 0.79 0.52 0.22 0.31 0.29 0.37 0.19 0.21 15.21
SRX018969 45.8 13.2 7.02 8.51 4.66 3.89 1.34 1.32 0.58 0.72 0.21 0.24 0.27 0.19 0.11 0.12 11.82
SRX018970 41.9 7.62 12.8 9.04 6.1 3.59 1.4 1.23 0.81 0.71 0.22 0.39 0.51 0.3 0.15 0.21 13.07
SRX018971 15 3.36 11 11.5 12.9 13.9 2.9 2.26 2.29 1.02 0.45 0.06 0.21 0.67 1.34 0.35 20.81
Average 36 11.9 11 9.43 7.41 3.23 1.91 1.59 0.93 0.58 0.38 0.37 0.36 0.31 0.25 0.12 14.26

Another interesting observation is that miRMOD determined very few miRNAs (around 1–5) with terminal trimming that transforms it to another previously classified miRNA. For example, miR-548k is converted to miR-548av-5p in the sample SRX108971, due to removal of TCGT at the 3′ end. Although the percentage of such miRNA conversion is very low in the studied dataset, the studies on miRNA transformation in other sequencing datasets and experiments could be interesting.

We also observed a repetitive nature of modifications through the analysis of the 15 datasets. 1012 modifications are common amongst the 15 datasets. Few modified sequences in the datasets are present in higher abundance for example ‘1056282’ for has-miR-122-5p. Such a large number of repetitions of modified sequences rejects the probability that the sequences are mere artifacts and also suggests that such conserved modifications may be functionally significant and call for experimental scrutiny.

Comparison with other existing tools

Compared with the other tools, miRMOD is a user-friendly and easy to install, moreover miRMOD installation requires least software dependencies (see miRMOD manual). Unlike the currently available webservers and tools for miRNA modification analysis, miRMOD has the advantage that large dataset transfer over the Internet is not required and the analysis pipeline is species independent. The tool determines trimming and non-templated nucleotide additions at both terminal ends of miRNAs in sRNA NGS datasets. It also determines the percentage occurrence of each modification in input dataset, per miRNA modification score along with ‘miRNA to miRNA’ conversion.

The utility of miRMOD is best illustrated by a comparison of miRMOD analysis with other existing tools (Table 1). Comparison with CPSS tool requires detailed comparison, as it determines 5′ as well as 3′ trimming apart from addition of non-templated nucleotides. Amongst the other existing tools, miRGator requires input files in BAM format, with file size restriction of 20 MB. However, SeqBuster is a standalone tool which has several software dependencies required for its installation. miRMOD is free from any of the mentioned limitations and equipped with several unique features to facilitate and comprehensive analysis of miRNA modification in user defined inputs.

For comparison with CPSS, we used the sample dataset of human fetal ovary, available from CPSS website. Using miRMOD, we found that 355 miRNAs were modified from 668 miRNAs present in the dataset (Fig. 3A). The different types of modifications (5′ and 3′ non-templated additions and trimming with abundance ≥10, miRMOD default read count threshold) present in the CPSS output file were identified by miRMOD too. In the test library, hsa-let-7a-5p has the highest number of miRNA modifications with 485556 modified reads. Apart from this, the 3′ trimming (72%) is the most common modification type for the given sample dataset. The detailed result section further elaborated the summary for local analysis by separating the reads with different type of modifications (5′ or 3′) in a “TreeView” format as shown in Fig. 3B. This enabled us to infer the most prominent modifications for a selected modified miRNA without re-analyzing the predicted modifications as compared to the CPSS result. In the next section of miRMOD analysis, frequency table helped us in comparing the percentage occurrence of different modifications in the test dataset (Fig. 3C). This information can help us in identifying the most common nucleotides involved in modifications in the dataset and is also a unique miRMOD feature, not available in other tools. For example ‘GTT’ is a preferred nucleotides modification for 3′ trimming whereas ‘A’ is the most preferred modification for 3′ addition over the other types of modifications in the sample dataset. Another unique miRMOD feature ‘miRNA2miRNA conversion’ helps in identification of miRNA(s) that are trimmed and transformed into previously classified miRNA(s) from the list of miRNA(s) submitted by the user. miRMOD performs miRNA scoring and prioritization based on its tendency to modify and produce modified reads (Fig. 3D, see ‘Materials and Methods’), a unique feature of miRMOD tool. Thus, the information like which miRNAs has the higher tendency for modification as compared to other miRNAs is swiftly retrievable from the dataset using miRMOD, unlike other prediction tools. Moreover, the optional and novel miRMOD feature of Target Variation Analysis (TVA) allows multi-way capturing of highly modified miRNAs and its modification(s) to explore if selected modification is playing any role in altering the miRNA-target interactions (Fig. 3E). Thus, miRMOD offers several unique features for sRNA data analysis, which makes it highly useful tool for modification analysis.

miRMOD evaluation with experimentally validated miRNA modifications

We used miRMOD to identify experimentally validated miRNA modifications from a publicly available dataset of human neural dataset (SRA: SRX548700) (Tan et al., 2014). Not surprisingly, miRMOD successfully identified hsa-miR-9-1 miRNA sequence as well as its experimentally verified modified miRNA ‘CTTTGGTTATCTAGCTGTATGA’ (generated as a result of 5′ trimming) in the sRNA dataset. Apart from the above-mentioned modification, two additional modified sequences were also reported by miRMOD for hsa-miR-9-1 viz. 3′ trimming and 3′ addition of ‘A’ nucleotide (see Supplemental Information 1). The execution time for complete analysis of a given miRNA using the benchmark dataset was ∼15 min.

Conclusions

miRMOD is a Windows-based standalone specialized tool to identify and analyze non-templated nucleotide additions and trimming at both the termini of the miRNA sequences in sRNA NGS data. The tool provides useful statistics about various types of miRNA modifications along with its frequencies, differential miRNA-modified miRNA abundance and frequencies of the modified bases found in each of the modifications. miRMOD also analyzes differences in miRNA target interactions, as a result if miRNA modifications. The miRMOD analysis gives a global as well as micro overview of miRNA modifications in sRNA NGS datasets. miRMOD is available as a GUI based as well as cross-platform command-line package, to generate user-friendly graphical representations for miRNA modifications in a NGS sRNA dataset, and its effect on miRNA-target binding affinity.

Availability and Requirements

Project name: miRMOD

Project home page: http://bioinfo.icgeb.res.in/miRMOD

Operating Systems: Windows XP, Windows 7, Windows 8 (recommended)

Programming language: C#

Other requirements: .NET framework 4 and RNAhybrid (optional)

License: Free

Supplemental Information

Table S1. Frequency table of all types of modifications in SRA dataset: SRP002272 .
DOI: 10.7717/peerj.1332/supp-1
Supplemental Information 1. miRMOD sample session file.
DOI: 10.7717/peerj.1332/supp-2

Funding Statement

This work was supported by the Department of Biotechnology (DBT, India) grant for funding the research and Bioinformatics Infrastructure Facility (BIF) at ICGEB (BT/BI/25/001/2006 and BT/PR628/AGR/36/674/2011). SS received a DBT-BINC fellowship and AK received a Council for Scientific and Industrial Research (CSIR), India fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Abhinav Kaushik and Shradha Saraf performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, performed the computation work, reviewed drafts of the paper.

Sunil K. Mukherjee conceived and designed the experiments.

Dinesh Gupta conceived and designed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, performed the computation work.

Data Availability

The following information was supplied regarding data availability:

The source code and installation instructions are available at http://bioinfo.icgeb.res.in/miRMOD.

References

  • Ameres et al. (2010).Ameres SL, Horwich MD, Hung JH, Xu J, Ghildiyal M, Weng Z, Zamore PD. Target RNA-directed trimming and tailing of small silencing RNAs. Science. 2010;328:1534–1539. doi: 10.1126/science.1187058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ameres & Zamore (2013).Ameres SL, Zamore PD. Diversifying microRNA sequence and function. Nature Reviews Molecular Cell Biology. 2013;14:475–488. doi: 10.1038/nrm3611. [DOI] [PubMed] [Google Scholar]
  • Aravin & Tuschl (2005).Aravin A, Tuschl T. Identification and characterization of small RNAs involved in RNA silencing. FEBS Letters. 2005;579:5830–5840. doi: 10.1016/j.febslet.2005.08.009. [DOI] [PubMed] [Google Scholar]
  • Baccarini et al. (2011).Baccarini A, Chauhan H, Gardner TJ, Jayaprakash AD, Sachidanandam R, Brown BD. Kinetic analysis reveals the fate of a microRNA following target regulation in mammalian cells. Current Biology. 2011;21:369–376. doi: 10.1016/j.cub.2011.01.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Bartel (2009).Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136:215–233. doi: 10.1016/j.cell.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Burroughs et al. (2010).Burroughs AM, Ando Y, De Hoon MJ, Tomaru Y, Nishibu T, Ukekawa R, Funakoshi T, Kurokawa T, Suzuki H, Hayashizaki Y, Daub CO. A comprehensive survey of 3′ animal miRNA modification events and a possible role for 3′ adenylation in modulating miRNA targeting effectiveness. Genome Research. 2010;20:1398–1410. doi: 10.1101/gr.106054.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cheng et al. (2013).Cheng WC, Chung IF, Huang TS, Chang ST, Sun HJ, Tsai CF, Liang ML, Wong TT, Wang HW. YM500: a small RNA sequencing (smRNA-seq) database for microRNA research. Nucleic Acids Research. 2013;41:D285–D294. doi: 10.1093/nar/gks1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Cho et al. (2012).Cho S, Jang I, Jun Y, Yoon S, Ko M, Kwon Y, Choi I, Chang H, Ryu D, Lee B, Kim VN, Kim W, Lee S. MiRGator v3.0: a microRNA portal for deep sequencing, expression profiling and mRNA targeting. Nucleic Acids Research. 2012;41:D252–D257. doi: 10.1093/nar/gks1168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • De Oliveira, Christoff & Margis (2013).De Oliveira LF, Christoff AP, Margis R. isomiRID: a framework to identify microRNA isoforms. Bioinformatics. 2013;29:2521–2523. doi: 10.1093/bioinformatics/btt424. [DOI] [PubMed] [Google Scholar]
  • Ebhardt et al. (2009).Ebhardt HA, Tsang HH, Dai DC, Liu Y, Bostan B, Fahlman RP. Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications. Nucleic Acids Research. 2009;37:2461–2470. doi: 10.1093/nar/gkp093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gu & Xuan (2013).Gu J, Xuan Z. Inferring the perturbed microRNA regulatory networks in cancer using hierarchical gene co-expression signatures. PLoS ONE. 2013;8:e1332. doi: 10.1371/journal.pone.0081032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Gu, Zhang & Wang (2012).Gu Z, Zhang C, Wang J. Gene regulation is governed by a core network in hepatocellular carcinoma. BMC Systems Biology. 2012;6:32. doi: 10.1186/1752-0509-6-32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Hackenberg et al. (2009).Hackenberg M, Sturm M, Langenberger D, Falcon-Perez JM, Aransay AM. miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments. Nucleic Acids Research. 2009;37:W68–W76. doi: 10.1093/nar/gkp347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ibrahim et al. (2010).Ibrahim F, Rymarquis LA, Kim EJ, Becker J, Balassa E, Green PJ, Cerutti H. Uridylation of mature miRNAs and siRNAs by the MUT68 nucleotidyltransferase promotes their degradation in Chlamydomonas. Proceedings of the National Academy of Sciences of the United States of America. 2010;107:3906–3911. doi: 10.1073/pnas.0912632107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Jones et al. (2009).Jones MR, Quinton LJ, Blahna MT, Neilson JR, Fu S, Ivanov AR, Wolf DA, Mizgerd JP. Zcchc11-dependent uridylation of microRNA directs cytokine expression. Nature Cell Biology. 2009;11:1157–1163. doi: 10.1038/ncb1931. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Katoh et al. (2009).Katoh T, Sakaguchi Y, Miyauchi K, Suzuki T, Kashiwabara S, Baba T. Selective stabilization of mammalian microRNAs by 3′ adenylation mediated by the cytoplasmic poly(A) polymerase GLD-2. Genes and Development. 2009;23:433–438. doi: 10.1101/gad.1761509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Kim, Heo & Kim (2010).Kim YK, Heo I, Kim VN. Modifications of small RNAs and their associated proteins. Cell. 2010;143:703–709. doi: 10.1016/j.cell.2010.11.018. [DOI] [PubMed] [Google Scholar]
  • Kruger & Rehmsmeier (2006).Kruger J, Rehmsmeier M. RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Research. 2006;34:W451–W454. doi: 10.1093/nar/gkl243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Langmead et al. (2009).Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Li et al. (2012).Li SC, Tsai KW, Pan HW, Jeng YM, Ho MR, Li WH. MicroRNA 3′ end nucleotide modification patterns and arm selection preference in liver tissues. BMC Systems Biology. 2012;6(Suppl 2):S14. doi: 10.1186/1752-0509-6-S2-S14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Lu, Sun & Chiang (2009).Lu S, Sun YH, Chiang VL. Adenylation of plant miRNAs. Nucleic Acids Research. 2009;37:1878–1885. doi: 10.1093/nar/gkp031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Morin et al. (2008).Morin RD, O’Connor MD, Griffith M, Kuchenbauer F, Delaney A, Prabhu AL, Zhao Y, McDonald H, Zeng T, Hirst M, Eaves CJ, Marra MA. Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Research. 2008;18:610–621. doi: 10.1101/gr.7179508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Muller, Marzi & Nicassio (2014).Muller H, Marzi MJ, Nicassio F. IsomiRage: from functional classification to differential expression of miRNA isoforms. Frontiers in Bioengineering and Biotechnology. 2014;2 doi: 10.3389/fbioe.2014.00038. Article 38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Pantano, Estivill & Marti (2010).Pantano L, Estivill X, Marti E. SeqBuster, a bioinformatic tool for the processing and analysis of small RNAs datasets, reveals ubiquitous miRNA modifications in human embryonic cells. Nucleic Acids Research. 2010;38:e1332. doi: 10.1093/nar/gkp1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ryan, Robles & Harris (2010).Ryan BM, Robles AI, Harris CC. Genetic variation in microRNA networks: the implications for cancer research. Nature Reviews Cancer. 2010;10:389–402. doi: 10.1038/nrc2867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Sablok et al. (2013).Sablok G, Milev I, Minkov G, Minkov I, Varotto C, Yahubyan G, Baev V. isomiRex: web-based identification of microRNAs, isomiR variations and differential expression using next-generation sequencing datasets. FEBS Letters. 2013;587:2629–2634. doi: 10.1016/j.febslet.2013.06.047. [DOI] [PubMed] [Google Scholar]
  • Tan et al. (2014).Tan GC, Chan E, Molnar A, Sarkar R, Alexieva D, Isa IM, Robinson S, Zhang S, Ellis P, Langford CF, Guillot PV, Chandrashekran A, Fisk NM, Castellano L, Meister G, Winston RM, Cui W, Baulcombe D, Dibb NJ. 5′ isomiR variation is of functional and evolutionary importance. Nucleic Acids Research. 2014;42:9424–9435. doi: 10.1093/nar/gku656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Vitsios & Enright (2015).Vitsios DM, Enright AJ. Chimira: analysis of small RNA sequencing data and microRNA modifications. Bioinformatics. 2015;31(20):3365–3367. doi: 10.1093/bioinformatics/btv380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wang, Gu & Li (2014).Wang T, Gu J, Li Y. Inferring the perturbed microRNA regulatory networks from gene expression data using a network propagation based method. BMC Bioinformatics. 2014;15:255. doi: 10.1186/1471-2105-15-255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wu et al. (2007).Wu H, Neilson JR, Kumar P, Manocha M, Shankar P, Sharp PA, Manjunath N. miRNA profiling of naive, effector and memory CD8 T cells. PLoS ONE. 2007;2:e1332. doi: 10.1371/journal.pone.0001020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Wyman et al. (2011).Wyman SK, Knouf EC, Parkin RK, Fritz BR, Lin DW, Dennis LM, Krouse MA, Webster PJ, Tewari M. Post-transcriptional generation of miRNA variants by multiple nucleotidyl transferases contributes to miRNA transcriptome complexity. Genome Research. 2011;21:1450–1461. doi: 10.1101/gr.118059.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhang et al. (2012).Zhang Y, Xu B, Yang Y, Ban R, Zhang H, Jiang X, Cooke HJ, Xue Y, Shi Q. CPSS: a computational platform for the analysis of small RNA deep sequencing data. Bioinformatics. 2012;28:1925–1927. doi: 10.1093/bioinformatics/bts282. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S1. Frequency table of all types of modifications in SRA dataset: SRP002272 .
DOI: 10.7717/peerj.1332/supp-1
Supplemental Information 1. miRMOD sample session file.
DOI: 10.7717/peerj.1332/supp-2

Data Availability Statement

The following information was supplied regarding data availability:

The source code and installation instructions are available at http://bioinfo.icgeb.res.in/miRMOD.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES