Abstract
Genetic variants in or near miRNA genes can have profound effects on miRNA expression and targeting. As user-friendly software for the impact prediction of miRNA variants on a large scale is still lacking, we created a tool called miRVaS. miRVaS automates this prediction by annotating the location of the variant relative to functional regions within the miRNA hairpin (seed, mature, loop, hairpin arm, flanks) and by annotating all predicted structural changes within the miRNA due to the variant. In addition, the tool defines the most important region that is predicted to have structural changes and calculates a conservation score that is indicative of the reliability of the structure prediction. The output is presented in a tab-separated file, which enables fast screening, and in an html file, which allows visual comparison between wild-type and variant structures. All separate images are provided for downstream use. Finally, we tested two different approaches on a small test set of published functionally validated genetic variants for their capacity to predict the impact of variants on miRNA expression.
INTRODUCTION
MicroRNAs (miRNAs) are short non-protein-coding RNAs that regulate expression of their target genes mainly by translational repression or mRNA destabilization (1). Targeting of metazoan miRNAs is mostly defined by incomplete complementarity of the mature miRNA sequence to transcripts; the seed region, nt 2–7 of the mature miRNA, is considered to be a major determinant for target recognition (2). As a consequence miRNAs can target a large set of mRNAs (3) and mRNAs can be regulated by several miRNAs, leading to a complex regulation network (4).
Genes encoding miRNAs can be located individually or in clusters and can be intergenic or can be located within host genes. In the canonical biogenesis pathway, the primary transcript (pri-miRNA) is cleaved sequentially to release the mature miRNA(s). In the first processing step, which takes place in the nucleus, the Microprocessor, consisting of a complex between the ribonuclease Drosha and its cofactor DGCR8, cleaves the pri-miRNA to free the precursor miRNA (pre-miRNA) (5–7). The cleavage site of the Microprocessor is determined by ssRNA–dsRNA junctions at both ends of the miRNA hairpin embedded within the pri-miRNA and the distances between the two junctions. The complex cleaves at a position that is located approximately 11 nt from the junction between the lower stem and the flanking regions and approximately 22 nt from the junction between the upper stem and the terminal loop of the hairpin (8–10). In addition to the importance of the hairpin structure within the primary transcript, sequence motifs were also shown to play a role in Drosha processing (11). The pre-miRNA is transported to the cytosol by Exportin 5 (12). In the second maturation step, Dicer cleaves off the terminal loop after recognizing both ends of the pre-miRNA, resulting in a mature miRNA duplex (13–15). After the duplex is loaded into an Argonaute protein, one of the two strands is discarded. The remaining mature miRNA strand guides Argonaute to its target mRNAs (16).
Many proteins regulate the maturation process of miRNAs by inhibiting or enhancing the action of Drosha or Dicer or by affecting the stability of miRNAs (17). At both processing steps the hairpin can be cleaved at multiple positions, thereby creating alternative mature miRNAs, called isomiRs, differing in length and sequence. IsomiRs can also be generated by addition or removal of nucleotides of the 3′ end of the mature miRNA, by editing of the mature miRNA sequence or due to the presence of genetic variants (18). An overview of miRNA biogenesis and its regulation can be found in the review by Ha and Kim (19).
Genetic variants in or near miRNA genes can affect expression and function of the nearby miRNA gene. Variants in the mature miRNA sequence can result in altered target specificity, as was demonstrated by a variant in the seed sequence of MIR627 (20). Mutations in miRNA genes have been shown to cause human disease, such as mutations in the seed sequences (5p and 3p) of MIR96 causing non-syndromic hearing loss (21,22). These mutations lower the expression of the mature miRNA. For one mutation it was shown that this is due to a change in the secondary structure of the hairpin, as a double mutant with restored predicted secondary structure also had restored expression levels (22). A similar structure dependent change in expression was found for a variant in the mature sequence of MIR125 (23). Variants located outside the mature miRNA or hairpin sequence can also result in altered expression of the mature miRNA, as was shown for different variants just outside the precursor sequence of MIR137 (24,25) and for a variant located 199 nt downstream of the MIR124-1 hairpin sequence (26).
Several approaches exist to predict the secondary structure of an RNA molecule. The most commonly used method is the prediction of the optimal structure based on minimal free energy (MFE) (27). This method presumes that there is a single optimal conformation and that the methods and thermodynamic parameters used are without error. Other potential or competing structures can be taken into account by computing a partition function, which sums the contribution of all structures at equilibrium, weighted by their probabilities (28). The maximal expected accuracy (MEA) structure is the structure that maximizes the expected base-pair accuracy based on this partition function (29). The ensemble of possible secondary structures can also be represented by the centroid structure prediction: the structure that has the smallest total base-pair distance to the sampled structures of that ensemble (30,31).
When assessing the potential impact of a variant on the miRNA secondary structure for a large number of variants, such as those generated using massively parallel sequencing, using the currently available tools requires a lot of manual work. Online versions of mfold and RNAfold (32,33), commonly used tools for this type of predictions, require individual RNA sequences as input for each wild-type and variant miRNA. Results then need to be visually compared and interpreted by the researcher for each variant separately. The miRNASNP database website offers a tool to predict secondary structures of miRNAs with variants (34). However this also requires submission of wild-type and variant sequences and visual comparison and cannot be used for batch searches. To automate these time-consuming and error-prone manual processes, we developed miRVaS, a tool that, starting from a list of variants, determines the location of the variant relative to functional regions within the miRNA hairpin and that predicts the impact of the variant on the secondary structure of the miRNA.
MATERIALS AND METHODS
RNA structure prediction
RNAfold (35) (version 2.1.5) is used by miRVaS to predict the secondary structure of RNA sequences. It is run with parameters to calculate the MFE structure, the partition function and MEA and centroid structures (-p and –MEA), adding dangling energies for the bases adjacent to a helix on both sides (-d2). No postscript figure is generated (–noPS).
miRNA database file generation
miRNA database files for the test set were generated by extracting genomic coordinates, hairpin and mature miRNA sequences from miRBase (36) v20 and the human genome file (hg19). Loop regions were extracted from the miRBase v20 miRNA.str file. In cases where the mature miRNA sequence overlaps with the predicted loop region, the loop annotation was shifted outside the mature region for the miRNA database file.
Visualization
RNA secondary structures are visualized using VARNA (37) (version 3.9) using the naview algorithm (-algorithm naview). Output is generated in png and svg format. The svg files are post-processed by miRVaS to allow automatic scaling to the page size.
Illustration of miRVaS using test set
In order to test the usefulness of miRVaS annotation and to test parameters that may predict an effect on miRNA expression due to genetic variants, we collected a list of miRNA variants of which the effect on the expression of the nearby miRNA has been investigated in cell lines. The studies used had to fulfil following criteria for inclusion: (i) genetic variant located within or near miRNA gene (< 0.5 kb), (ii) variant and wild-type miRNA constructs expressed in a cell line (only one variant per construct), (iii) expression of the relevant miRNA in the cell line was assessed by qPCR or northern blot and (iv) genomic location of the variant and alternative allele could be derived from the text. In cases where the location of the variant was given relative to miRNA precursors, without a reference to the miRBase version used or sequences in the text, we searched the sequence of the oldest version of miRBase containing the miRNA gene to deduct the genomic coordinates of the variant. Genomic coordinates of variants were extracted from 26 papers. The resulting variant list (in zero based half open coordinates format) was used as an input for miRVaS, together with the human genome file (hg19) and the miRNA database file based on miRBase v20. Predictions were run with up- and downstream flanks of 100 nt surrounding the hairpin. Calls were calculated based on reported results (categorized as changed expression or no change in expression) and based on miRVaS output (ΔG or highest structural impact region).
For the ΔΔG (defined as ΔG of variant structure – ΔG of wild-type structure) approach, we tested whether a variant that alters the stability of the predicted (MFE) structure affects expression of the miRNA in vivo. Following calls were made for the ‘|ΔΔG| > 0 strategy’: true positive (TP): |ΔΔG| > 0 and the variant induces the expression of the miRNA to change in vivo; true negative (TN): |ΔΔG| = 0 and the variant does not induce the expression of the miRNA to change in vivo; false positive (FP): |ΔΔG| > 0 and no expression change in vivo; false negative (FN): |ΔΔG| = 0 and expression changes in vivo. The ‘density interval’ was determined by plotting the density of the ΔΔG for variants inducing expression change in vivo and variants inducing no expression change (using ggplot2 (38)) and determining the intersection points of the curves. Calls were made as follows: TP: ΔΔG < −2.103226 or ΔΔG > 4.324731 (intersection points of density curves) and the variant induces an expression change in vivo; TN: ΔΔG ≥ −2.103226 and ΔΔG ≤ 4.324731 and no expression change in vivo; FP: ΔΔG < −2.103226 or ΔΔG > 4.324731 and no expression change in vivo; FN: ΔΔG ≥ −2.103226 and ΔΔG ≤ 4.324731 and expression changes in vivo.
For the hairpin approach, we tested whether variants causing a predicted structural change within the hairpin (i.e. the pre-miRNA plus a flanking stem region) affect the expression of the miRNA. Following calls were made for each structure representation (centroid, MEA, MFE): TP: structural change with the highest impact is located within the hairpin and the expression of the miRNA changes in vivo; TN: structural change with the highest impact is confined to a region outside the hairpin and no expression change in vivo; FP: structural change with the highest impact is located within the hairpin and no expression change in vivo; FN: structural change with the highest impact is confined to a region outside the hairpin and expression changes in vivo.
Because predictions were run with flanks of 100 nt, variants located further than 100 nt from the hairpin were treated as negatives, i.e. for the ΔΔG approach with the ‘|ΔΔG| > 0 strategy’: |ΔΔG| = 0, ‘density interval strategy’: ΔΔG ≥ −2.103226 and ΔΔG ≤ 4.324731 or for the hairpin approach: structural change with the highest impact is located outside the hairpin. Sensitivity was defined as TP/(TP+FN)*100; specificity as TN/(TN+FP)*100. For statistical analysis a two-tailed Fisher's exact test was performed.
RESULTS
miRVaS predicts the impact of genetic variants on miRNAs. It annotates the location of the variant relative to functional regions within the miRNA hairpin (seed, mature, loop, hairpin arm, flanks) based on an accompanying database of miRNA genes with all functional regions. Database files with human miRNAs based on miRBase (36) are included in the download from the miRVaS website. These databases, e.g. for other species, can also be generated by a database conversion tool implemented within miRVaS. Next, the impact of the variant on the secondary structure of the miRNA is predicted by extraction of the sequence of the miRNA hairpin and flanking regions from the genome sequence, introduction of the variant in the sequence and subsequent prediction of the structure of both wild-type and variant sequences using RNAfold (35). The resulting string form structures are compared, after realignment in case of insertions or deletions. The location of all bases that changed secondary structure (paired versus unpaired) is annotated relative to the functional regions within the miRNA. Comparison and annotation is done for centroid, MEA and MFE structure representations.
In addition to impact prediction, miRVaS also creates html files with drawings of the predicted structures so that the changes can be visually compared. It uses VARNA (37) to create secondary structure drawings highlighting: (i) the genetic variant, (ii) important regions within the miRNA (hairpin, mature miRNA, seed, terminal loop) and (iii) the nucleotides with structural changes between the wild-type and variant miRNA. miRVaS can thus be used for batch predictions of the impact of genetic variants, generating tabular output, html files and separate image files. The tool can be installed locally and can be operated using a graphical user interface or via command line. Alternatively, users can access an online version of miRVaS for impact prediction of human miRNA gene variants (http://mirvas.bioinf.be/analyse.html).
Input
miRVaS requires three input files: (i) a variant file in tab-delimited format with the genomic coordinates (zero based half open format), reference and alternative allele of the variants or in VCF format; (ii) a genome file and (iii) a miRNA database file describing all functional regions within the miRNAs. A detailed description of these files is provided in the miRVaS manual. The human genome file (build hg19, build hg38) and a miRNA database file based on miRBase (v20, v21) (36) are distributed with the software. miRVaS includes a database conversion utility to convert gff3 files, as provided by miRBase, into the correct format. This can be used to add newer versions of miRBase, miRNA data of other species or novel miRNA data from other sources.
For most miRNAs the exact length of the primary transcript is unknown and hence setting the correct flank sizes surrounding the miRNA hairpin is not possible. Accordingly, the user can choose the flank size used for the predictions by setting the parameters Upstream flank (5p) and Downstream flank (3p). Default miRVaS settings are up- and downstream flanks of 100 nt, based on the suggestion that to establish overexpression of mature miRNAs in vivo, approximately 100 nt of flanking regions surrounding the hairpin should be included in the miRNA construct (39).
Output
miRVaS creates output for all miRNA genes in the miRNA database file that are located in the vicinity of the specified variants, separated in different rows for the same variant. As such, the user does not need to know upfront whether the tested variants are located near a miRNA gene (or which one), which can be useful when working with large variant lists. Firstly, a tab-separated file is created containing variant location annotation relative to functional regions within the miRNA, annotation of structural changes for centroid, MEA and MFE predictions, delta free energies (ΔG) of reference and variant structures and the frequency of the MFE structure in the thermodynamic ensemble for all variants specified in the input file. Additional features are described below. Secondly, an html file is made containing links to all structure predictions with, for each variant, the centroid, MEA and MFE predictions for reference and variant next to each other, allowing easy visual comparison. Functional regions are highlighted in different colours: in orange: seed region; in magenta: mature miRNA; in dark blue: terminal loop; in cyan: hairpin. The variant is indicated in red, structural changes are indicated by black bases. For insertions and deletions, the bases surrounding the inserted or deleted region are also in red in the images to clearly indicate the position of change (Figure 1). Thirdly, a directory containing separate html files per variant and image files for all predictions in png and svg format is generated.
Figure 1.
Visual output of miRVaS. The left figure in each panel represents the wild-type miRNA, the right figure represents the same miRNA with the variant. Orange regions: seed sequence, magenta: mature miRNA, dark blue: terminal loop, cyan: hairpin. The variant is highlighted in red in the wild-type and the variant miRNA, bases with structural changes are highlighted in black. (A) Structure prediction for a SNP within the seed sequence. Up- and downstream flanks used for the prediction: 20 nt/20 nt. The SNP enlarges an interior loop. (B) Structure prediction for a deletion of one base within the flanking region. Up- and downstream flanks used for the prediction: 30nt/40nt. The deletion does not change the secondary structure drastically. (C) Structure prediction for an insertion of two bases in the flanking region. Up- and downstream flanks used for the prediction: 75 nt/75 nt. This insertion causes structural changes in the lower hairpin arms and in the flanking regions.
Annotation
Location and structure annotations in the output are provided in the following format: region (reference ± start:stop), which is illustrated in Figure 2. The first part of the annotation—the region—indicates in which functional segment of the miRNA gene the sequence change and/or structural change is located. The part between parentheses indicates the position of the change within this functional segment: it states the position (start, stop) starting from the border with the flanking region indicated by reference.
Figure 2.
Annotation scheme for variant location and structural changes. Annotations are made in the format: region (reference ± start:stop). A detailed description of the annotation can be found in the Results section. (A) Schematic overview of the miRNA hairpin including flanks. Functional regions used for annotation are highlighted according to the colour scheme used by miRVaS. The length of the flanking region is determined by the user. (B) A schematic view of structural and sequence changes in a miRNA gene with two mature sequences (5p and 3p). Black bars depict regions with structural change, sequence changes are highlighted in red. Regions are denoted in the upper part of the figure and correspond to regions in Panel A. Reference regions are shown below the hairpin (a: arm, m: mature, l: loop). (C) A schematic view of structural and sequence changes in a miRNA gene with one mature sequence (5p). (D) Annotation of examples of sequence and structural changes that are depicted in Panels B and C: (i) a structural change in the 5p flanking region covering bases 50–40 upstream of the hairpin arm; (ii) a sequence change in the lower 5p hairpin arm, 10 bases upstream of the 5p mature, (iii) a structural change in the upper 5p hairpin arm, 4 bases downstream of the mature 5p; (iv) a structural change in the loop region, 6 bases downstream of the upper 5p arm; (v) a structural change in the upper 3p hairpin arm, 5 bases upstream of the mature 3p; (vi) a structural change spanning the end of the mature 3p (bases 19–22 downstream of the upper 3p arm, base 22 is the last base of the 3p mature sequence) and the beginning of the lower 3p arm (1st–3rd base downstream of the mature 3p); (vii) a sequence change outside the given flank region, 88 bases downstream of the lower 3p hairpin arm; (viii) a structural change within the seed sequence, 3rd base downstream of the lower 5p hairpin arm; (ix) a sequence change within the 5p mature sequence, 16 bases downstream of the lower 5p hairpin arm; (x) a structural change within the 3p hairpin arm, 15 bases downstream of the calculated loop region.
The different regions are: (i) upstream/downstream: outside the given flank sizes for the tested miRNA, but less than 2000 nt outside any miRNA hairpin (in this case, location annotation is given, but no structure predictions can be done); (ii) flank: within the given flanking region; (iii) arm: within the hairpin; (iv) loop: within the terminal loop; (v) mature: within the mature miRNA sequence. For changes within nt 2–7 of the mature sequence, the mature region annotation is appended with ‘seed’.
The reference regions are: a: hairpin arm, m: mature region, l: loop. When a change reaches or crosses the end or the start of a region, an ‘e’ is appended to the base position. All annotations are relative to the orientation of the gene.
Additional features
Next to the ΔG energies and frequencies of the structures calculated by RNAfold, and the provided location annotation, structural impact annotation and visualization, we devised two other parameters that may be of use to researchers who want to prioritize variants with a potential functional impact.
Conservation. As the miRNA structure is of paramount importance in the biogenesis of the mature miRNA, it is unlikely that the hairpin structure would be completely absent or largely changed in vivo. When the predicted secondary structure of the hairpin changes by using different flank sizes, this could reflect the unreliability of the secondary structure prediction. To assess the extent of conservation of the secondary structure of the miRNA hairpin, miRVaS calculates whether the hairpin secondary structure for a prediction with specific flank sizes is the same as the prediction of the hairpin structure without any flank. When the hairpin is not conserved, the prediction is marked as ‘changed’, along with the number of changed bases within the hairpin.
Highest structural impact prediction
miRVaS annotates and highlights structural changes that are predicted to occur in the miRNA when the variant is introduced in the sequence. To enable the user to quickly assess for a list of variants where the impact for each of the miRNAs is located, miRVaS displays the ‘most important’ region that is structurally changed due to the presence of the variant, with importance defined from most to least important: seed > mature > arm > loop > flank. This information is given in tabular as well as the html output for centroid, MEA and MFE predictions in the ‘highest impact’ fields.
An example of miRVaS tabular output for a variant in MIR125 and a description of the different fields can be found in Table 1.
Table 1. miRVaS tabular output: description of the different fields.
chromosome | chr19 | Genomic location of the variant in zero based half open coordinates, type of variant, reference and alternative allele. Type can be snp (SNP), ins (insertion) or del (deletion). |
begin | 52196527 | |
end | 52196528 | |
type | snp | |
ref | G | |
alt | T | |
mir_location | mature5p(a+8) | Location of the variant relative to the functional regions within the tested miRNA. |
Example: the variant is located in the 5p mature sequence, 8 nt downstream of the lower 5p hairpin arm. | ||
mir_name | hsa-mir-125a | Tested miRNA. |
MFE_highest_impact* | seed | Highest region in which a structural change is present for the MFE structure. Order: seed > mature > arm > loop > flank. |
Example: the highest region with structural changes is the seed region. | ||
MFE_impact* | mature5p(a+2,5,8:9)seed&mature3p(a+14:15,19:20) | Describes all structural changes in the variant structure compared to the reference structure for the MFE structure. |
Example: structural changes are within the 5p mature and seed sequence (positions 2, 5 and 8–9, counting from the lower 5p hairpin arm) and within the 3p mature sequence (positions 14–15 and 19–20, counting from the upper 3p hairpin arm). | ||
ref_mfefreq | 0.000285697 | Frequency of the MFE structure for the reference allele. |
var_mfefreq | 0.0000634247 | Frequency of the MFE structure for the variant allele. |
MFE_conservation* | conserved | Whether the hairpin structure is conserved with given flank sizes compared to when using no flanks. When not conserved, the number of bases with changed structure is given. |
Example: the structure of the hairpin is conserved. | ||
ref_MFE_deltaG* | −91.2 | Free energy of the MFE structure prediction for the reference allele (kcal/mol). |
var_MFE_deltaG* | −85.5 | Free energy of the MFE structure prediction for the variant allele (kcal/mol). |
As an example, the output for a variant within MIR125 is described (for up- and downstream flanks of 100 nt). Rows and columns are transposed for clarity. *Columns are also calculated for centroid and MEA predictions (not shown).
Illustration of miRVaS using functionally studied test cases
As shown in Figure 1 and Table 1, miRVaS provides a wealth of information that can be used to assess the potential functional impact of miRNA gene variants. It can be difficult to assess what kind of structural change may likely result in an expression change in vivo. For this reason we tested two different approaches to predict the impact of variants using a set of functionally investigated variants as a reference. To obtain this test set, we searched the literature for human genetic variants in or near miRNA genes for which the effect on the expression of the nearby miRNA was functionally investigated. For 41 genetic variants the impact of the variant on the expression of the 32 nearby miRNA(s) was studied in cell lines (21–23,25,26,40–60). The set of validated miRNA gene variants to date is still very limited, and is in line with the range of variants that was reported by Gong et al. (61). Finding the correct flank sizes introduces a problem, as for most miRNA genes the exact primary transcript lengths are unknown. Due to the fact that several validated variants are located outside the pre-miRNA and that it has been suggested to include flanks of approximately 100 nt surrounding the hairpin when establishing overexpression of miRNAs (39), we ran miRVaS with flank sizes of 100 nt.
The first approach we tested to predict variant-induced miRNA expression changes, the ‘ΔΔG approach’, is based on the hypothesis proposed by Gong et al. (61). The original hypothesis poses that variants inducing destabilization of the hairpin reduce the expression of the mature miRNA in vivo, while variants that increase the stability increase the expression of the mature miRNA. Because we are interested in whether a variant may affect expression of the miRNA or not, regardless of the direction, we tested whether a variant that affects the stability of the predicted structure (in any direction, |ΔΔG| > 0) affects the expression of the miRNA in vivo. However, this approach does not seem to be a good predictive model, as the ΔΔG and the absence or presence of expression changes are not associated (P = 0.3825, Table 2, Supplementary Table S1).
Table 2. Performance of miRVaS determined by the test set for the ΔΔG approach with two different ΔΔG strategies (|ΔΔG| > 0 or density interval).
|ΔΔG| > 0 | Density interval | |
---|---|---|
TP | 25 | 11 |
FP | 11 | 0 |
TN | 3 | 14 |
FN | 3 | 17 |
Sensitivity (%) | 89 | 39 |
Specificity (%) | 21 | 100 |
P-value | 0.3825 | 0.0075 |
P-value: two-tailed P-value of Fisher's exact test.
The ‘|ΔΔG| > 0 strategy’ may be too strict as small deviations in ΔG may not have an effect on expression, but a large test set is not available for optimization of this parameter. For this reason we plotted the densities of the ΔΔG for variants that induce a miRNA expression change in vivo and variants that have no effect on miRNA expression (Supplementary Figure S1). Though the density plots largely overlap, the plot for the variants that affect miRNA expression is more spread. Therefore, we used the intersection points of the curves as thresholds (‘density interval strategy’). This strategy performed better (P = 0.0075, Table 2, Supplementary Table S1). Although the specificity is now much higher (100%), this is at the cost of a very low sensitivity (39%).
As an alternative, we devised an approach that takes into account the location of the predicted structural change, i.e. the ‘hairpin approach’, and tested whether it may be useful in the impact prediction of variants. In this approach, we assessed whether a variant that induces a predicted structural impact change within the hairpin (calculated by miRVaS as the ‘highest structural impact’ parameter) results in expression changes in vivo. The miRNA hairpins deposited in miRBase contain the pre-miRNA and a small part of the flanking region (on average a hairpin stem of 10 nt outside the pre-miRNA). This point, the ssRNA–dsRNA junction approximately 11 nt upstream of the pre-miRNA, together with the ssRNA–dsRNA junction at the terminal loop region, is recognized by the Microprocessor to be able cleave the pre-miRNA at the correct position (8–10). As the structure of the hairpin is paramount for its correct processing, structural changes within the hairpin are likely to influence the expression of the miRNA by affecting Drosha or Dicer processing. Structural changes that are only located outside this region are less likely to affect Microprocessor processing, though it is a possibility, for example for structural changes just outside the hairpin that affect pri-miRNA recognition. To explore and compare the effects of using alternative structure prediction methods, we tested this approach on the results of the centroid, MEA and MFE structure predictions. Despite being a simplified model, the hairpin approach turns out to be a good predictive model in our limited test set, especially when using the MFE structure prediction (P = 0.0008, Table 3, Supplementary Table S2). The sensitivity and specificity for the MFE structure are 71% and 86%, respectively. However, due to the limited number of variants that are currently functionally validated, we do not consider this result definitive, until larger variant sets become available to train and validate the software on. Consequently, this approach is not included in the miRVaS software as a direct prediction. Nevertheless, we do support the use of this approach, as it simplifies variant prioritization and performs well on the currently available test set.
Table 3. Performance of miRVaS determined by the test set for the hairpin approach.
CEN | MEA | MFE | |
---|---|---|---|
TP | 19 | 19 | 20 |
FP | 2 | 2 | 2 |
TN | 12 | 12 | 12 |
FN | 9 | 9 | 8 |
Sensitivity (%) | 68 | 68 | 71 |
Specificity (%) | 86 | 86 | 86 |
P-value | 0.0025 | 0.0025 | 0.0008 |
CEN: centroid, MEA: maximal expected accuracy, MFE: minimal free energy.
DISCUSSION
Several online databases gather information on variants in or near miRNAs, such as miRvar (62), miRNA SNiPer (63), PolymiRTS (64) and miRNASNP (34). The miRNASNP database is an extensive resource that includes annotation of variant location and presents secondary structure predictions for wild-type and variant miRNAs. It also presents a call about the potential impact of the variant on the expression of the miRNA based on the difference in free energy between the wild-type and variant structures. However, it only does this for variants within the pre-miRNA and the database cannot be used for batch searches, making investigations of multiple variants laborious. For novel variants, databases cannot be used to extract variant information. Commonly used online available tools for secondary structure impact prediction of miRNA variants are mfold and RNAfold (32,33). However, these tools are not designed to assess the effects of variants on (mi)RNAs and as such require the user to prepare wild-type and variant sequences manually and run the program separately for each. The resulting structures then need to be compared and interpreted manually for each miRNA. The miRNASNP website includes, next to tools to assess the impact of miRNA seed variants and 3′ UTR variants on miRNA targeting, a tool for secondary structure impact prediction for wild-type and variant (precursor) miRNAs, but here again the user needs to prepare and submit individual sequences and compare the results (34). The RNAsnp web server is focused on prediction of RNAs with variants as well (65). Users can supply genomic coordinates of the query sequence and of one or more variants of interest within that sequence. However, insertions and deletions cannot be tested and the user still needs to run the analysis for each miRNA separately and compare the results. Thus, using these tools in the context of miRNA gene variant impact prediction is labor-intensive and error-prone, especially when more than one variant needs to be investigated, for example when exploring the numerous variants obtained with massively parallel sequencing platforms. Furthermore, the majority of these tools are not designed specifically for miRNA research and as such it is up to the user to determine where functionally important regions are located within the structure prediction. Though a tool for automation of prediction of the impact of batches of (novel and known) variants in the 3′ UTR of target genes on miRNA binding does exist (mrSNP (66)), such a user-convenient tool is still lacking for the secondary structure prediction of miRNAs with variants. To meet these needs, we developed miRVaS.
miRVaS greatly facilitates and speeds up the evaluation of the impact of genetic variants on miRNAs. The tool annotates the location of the variant relative to functional regions within the miRNA gene, describes all structural changes between the structure predictions of the miRNA with and without the variant and generates visual output, while eliminating the need to prepare FASTA files. The tool can run large numbers of genetic variants simultaneously, thereby significantly reducing the manual workload for the user. Furthermore, the user does not need to know which miRNAs are located in the vicinity of the variant, as the tool explores the potential impact on any nearby miRNA gene(s). Therefore, the tool is of great use to assess and prioritize variant lists originating from genome sequencing or RNA sequencing experiments, as no prior filtering or analysis needs to be done to assess which genes may be affected by the variants. Of note, the use of miRVaS is not limited to variants within the human genome or to miRNA genes deposited in miRBase, but can be expanded to miRNAs in other species and to novel miRNA genes, depending on the research focus.
To date only a limited number of published studies have described the effect of a genetic variant on the expression of the nearby miRNA gene and so the set of available variants used for testing miRVaS was small. We believe it would be beneficial to the community if information about functionally validated miRNA gene variants would be included in existing miRNA gene variant databases. Bias towards the publication of positive results (i.e. variants that affect miRNA expression) in the test set can be expected. Nevertheless, we did find negative results (variants with no effect on the expression of the nearby miRNA gene), including a study where none of the tested variants had an effect on expression (42) and four studies reporting a variant with no effect on expression (22,44,52,60), but most studies only reported positive results. In addition, negative reported results are more uncertain as they may result from differences too small to be picked up by the technologies used (e.g. northern blot), or because under the conditions studied, no effects could be seen, but that effects are visible under other conditions. This complicated picture is illustrated well by rs2910164 located in MIR146A: Jazdzewski et al. found a difference in expression of miR-146a due to the variant in U2OS cells, Shen et al. also found a difference in expression, but in the opposite direction in MCF-7 cells, while a study by Lung et al. did not find changed expression of miR-146a in 293FT cells (43,45,55).
Based on the currently available set of genetic variants for which the impact on the miRNA expression was tested in cell lines, we illustrated the use of miRVaS. Two different approaches were tested to interpret the results. The first approach, based on the difference in free energy in the secondary structure prediction induced by the variant, did not perform well. The second approach was based on the hypothesis that structural changes within the hairpin may result in in vivo expression changes. Though this is a simplified approach, and though it is very likely that the impact of variants inducing large structural changes just outside the hairpin region will be missed, when applied to our test cases, this assumption did perform well. Of the three different structure representations, the MFE structure scored best. As such, this result is a validation of the usefulness of the MFE structure as the most commonly used structure representation in the context of miRNA variant impact prediction. Due to the lack of an extensive test set to train the software and the lack of full understanding of the biogenesis of miRNAs, these results cannot be considered definitive. Therefore the ‘hairpin approach’ is not incorporated into the tool as a direct prediction, although we do believe it is a good approach to prioritize variants based on the results of the test set. To allow investigation of alternative assumptions or a multistage approach, miRVaS presents all gathered data, such as location of the variant, all structural changes and a highest impact parameter, but the final decision of the relevance of the variants is up to the researcher.
Several assumptions could be used to determine which variants are most likely to have a functional impact on miRNAs. A first possible focus is to look at the variant location annotation, as variants within the mature or seed sequence, whether or not they are predicted to change the structure of the hairpin, might change the targeting of the miRNA. Next, the researcher can use the highest impact classifications to determine whether the impact is located within or outside the hairpin and assume that a structural change within the hairpin is more likely to have an effect on expression than a change confined to a region outside the hairpin. Since the structure representations are predicted, it is important that the researcher can assess the reliability of the structure predictions. This can be done by taking into account the calculated conservation parameter, for instance by filtering out predictions with a lot of changes within the hairpin when using a specific flank size, or by looking for consistent changes when running miRVaS with different flank sizes or between different structure representations.
Functional validation of the effect of variants on miRNAs is expensive and time-consuming. With the increasing, massively parallel sequencing driven, identification of genetic variants near and in miRNA genes, it is crucial to predict upfront their functional effect, enabling selection of the most interesting variants for further detailed functional studies.
AVAILABILITY
miRVaS can be downloaded from http://mirvas.bioinf.be/ or can be used directly on the website. miRNA database files (based on miRBase v20, v21), human genome files (hg19, hg38), the variant file for the test set (‘testset_input.tsv’) and the manual can also be accessed from this website.
Supplementary Material
Acknowledgments
We gratefully acknowledge Dr Ligia Mateiu for helpful discussions regarding the calculations in the ‘Illustration of miRVaS’ section. We thank Dr Bart Smets, Dr Ligia Mateiu, Jan Verheijen, Eline Wauters, Svenn D'Hert and Jolien Roovers for helpful discussion concerning the annotation scheme.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Research Foundation—Flanders [G027410N to J.D.F.]; University Research Fund [UA-KP BOF/MS-2013 6264 to M.S.]. Funding for open access charge: Research Foundation—Flanders [G027410N].
Conflict of interest statement. None declared.
REFERENCES
- 1.Filipowicz W., Bhattacharyya S.N., Sonenberg N. Mechanisms of post-transcriptional regulation by microRNAs: are the answers in sight? Nat. Rev. Genet. 2008;9:102–114. doi: 10.1038/nrg2290. [DOI] [PubMed] [Google Scholar]
- 2.Bartel D.P. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136:215–233. doi: 10.1016/j.cell.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lim L.P., Lau N.C., Garrett-Engele P., Grimson A., Schelter J.M., Castle J., Bartel D.P., Linsley P.S., Johnson J.M. Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs. Nature. 2005;433:769–773. doi: 10.1038/nature03315. [DOI] [PubMed] [Google Scholar]
- 4.Peter M.E. Targeting of mRNAs by multiple miRNAs: the next step. Oncogene. 2010;29:2161–2164. doi: 10.1038/onc.2010.59. [DOI] [PubMed] [Google Scholar]
- 5.Lee Y., Ahn C., Han J., Choi H., Kim J., Yim J., Lee J., Provost P., Rådmark O., Kim S., et al. The nuclear RNase III Drosha initiates microRNA processing. Nature. 2003;425:415–419. doi: 10.1038/nature01957. [DOI] [PubMed] [Google Scholar]
- 6.Han J., Lee Y., Yeom K.-H., Kim Y.-K., Jin H., Kim V.N. The Drosha-DGCR8 complex in primary microRNA processing. Genes Dev. 2004;18:3016–3027. doi: 10.1101/gad.1262504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yeom K.-H., Lee Y., Han J., Suh M.R., Kim V.N. Characterization of DGCR8/Pasha, the essential cofactor for Drosha in primary miRNA processing. Nucleic Acids Res. 2006;34:4622–4629. doi: 10.1093/nar/gkl458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zeng Y., Yi R., Cullen B.R. Recognition and cleavage of primary microRNA precursors by the nuclear processing enzyme Drosha. EMBO J. 2005;24:138–148. doi: 10.1038/sj.emboj.7600491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Han J., Lee Y., Yeom K.-H., Nam J.-W., Heo I., Rhee J.-K., Sohn S.Y., Cho Y., Zhang B.-T., Kim V.N. Molecular basis for the recognition of primary microRNAs by the Drosha-DGCR8 complex. Cell. 2006;125:887–901. doi: 10.1016/j.cell.2006.03.043. [DOI] [PubMed] [Google Scholar]
- 10.Ma H., Wu Y., Choi J.-G., Wu H. Lower and upper stem-single-stranded RNA junctions together determine the Drosha cleavage site. Proc. Natl. Acad. Sci. U.S.A. 2013;110:20687–20692. doi: 10.1073/pnas.1311639110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Auyeung V.C., Ulitsky I., McGeary S.E., Bartel D.P. Beyond secondary structure: primary-sequence determinants license pri-miRNA hairpins for processing. Cell. 2013;152:844–858. doi: 10.1016/j.cell.2013.01.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Yi R., Qin Y., Macara I.G., Cullen B.R. Exportin-5 mediates the nuclear export of pre-microRNAs and short hairpin RNAs. Genes Dev. 2003;17:3011–3016. doi: 10.1101/gad.1158803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Grishok A., Pasquinelli A.E., Conte D., Li N., Parrish S., Ha I., Baillie D.L., Fire A., Ruvkun G., Mello C.C. Genes and mechanisms related to RNA interference regulate expression of the small temporal RNAs that control C. elegans developmental timing. Cell. 2001;106:23–34. doi: 10.1016/s0092-8674(01)00431-7. [DOI] [PubMed] [Google Scholar]
- 14.Hutvágner G., McLachlan J., Pasquinelli A.E., Bálint E., Tuschl T., Zamore P.D. A cellular function for the RNA-interference enzyme Dicer in the maturation of the let-7 small temporal RNA. Science. 2001;293:834–838. doi: 10.1126/science.1062961. [DOI] [PubMed] [Google Scholar]
- 15.Park J.-E., Heo I., Tian Y., Simanshu D.K., Chang H., Jee D., Patel D.J., Kim V.N. Dicer recognizes the 5′ end of RNA for efficient and accurate processing. Nature. 2011;475:201–205. doi: 10.1038/nature10198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Czech B., Hannon G.J. Small RNA sorting: matchmaking for Argonautes. Nat. Rev. Genet. 2011;12:19–31. doi: 10.1038/nrg2916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Siomi H., Siomi M.C. Posttranscriptional regulation of microRNA biogenesis in animals. Mol. Cell. 2010;38:323–332. doi: 10.1016/j.molcel.2010.03.013. [DOI] [PubMed] [Google Scholar]
- 18.Neilsen C.T., Goodall G.J., Bracken C.P. IsomiRs–the overlooked repertoire in the dynamic microRNAome. Trends Genet. 2012;28:544–549. doi: 10.1016/j.tig.2012.07.005. [DOI] [PubMed] [Google Scholar]
- 19.Ha M., Kim V.N. Regulation of microRNA biogenesis. Nat. Rev. Mol. Cell Biol. 2014;15:509–524. doi: 10.1038/nrm3838. [DOI] [PubMed] [Google Scholar]
- 20.Gong J., Tong Y., Zhang H.-M., Wang K., Hu T., Shan G., Sun J., Guo A.-Y. Genome-wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis. Hum. Mutat. 2012;33:254–263. doi: 10.1002/humu.21641. [DOI] [PubMed] [Google Scholar]
- 21.Mencía A., Modamio-Høybjør S., Redshaw N., Morín M., Mayo-Merino F., Olavarrieta L., Aguirre L.A., del Castillo I., Steel K.P., Dalmay T., et al. Mutations in the seed region of human miR-96 are responsible for nonsyndromic progressive hearing loss. Nat. Genet. 2009;41:609–613. doi: 10.1038/ng.355. [DOI] [PubMed] [Google Scholar]
- 22.Soldà G., Robusto M., Primignani P., Castorina P., Benzoni E., Cesarani A., Ambrosetti U., Asselta R., Duga S. A novel mutation within the MIR96 gene causes non-syndromic inherited hearing loss in an Italian family by altering pre-miRNA processing. Hum. Mol. Genet. 2012;21:577–585. doi: 10.1093/hmg/ddr493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Duan R., Pak C., Jin P. Single nucleotide polymorphism associated with mature miR-125a alters the processing of pri-miRNA. Hum. Mol. Genet. 2007;16:1124–1131. doi: 10.1093/hmg/ddm062. [DOI] [PubMed] [Google Scholar]
- 24.Bemis L.T., Chen R., Amato C.M., Classen E.H., Robinson S.E., Coffey D.G., Erickson P.F., Shellman Y.G., Robinson W.A. MicroRNA-137 targets microphthalmia-associated transcription factor in melanoma cell lines. Cancer Res. 2008;68:1362–1368. doi: 10.1158/0008-5472.CAN-07-2912. [DOI] [PubMed] [Google Scholar]
- 25.Strazisar M., Cammaerts S., van der Ven K., Forero D.A., Lenaerts A.-S., Nordin A., Almeida-Souza L., Genovese G., Timmerman V., Liekens A., et al. MIR137 variants identified in psychiatric patients affect synaptogenesis and neuronal transmission gene sets. Mol. Psychiatry. 2015;20:472–481. doi: 10.1038/mp.2014.53. [DOI] [PubMed] [Google Scholar]
- 26.Qi L., Hu Y., Zhan Y., Wang J., Wang B.-B., Xia H.-F., Ma X. A SNP site in pri-miR-124 changes mature miR-124 expression but no contribution to Alzheimer's disease in a Mongolian population. Neurosci. Lett. 2012;515:1–6. doi: 10.1016/j.neulet.2012.02.061. [DOI] [PubMed] [Google Scholar]
- 27.Zuker M., Sankoff D. RNA secondary structures and their prediction. Bull. Math. Biol. 1984;46:591–621. [Google Scholar]
- 28.McCaskill J.S. The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers. 1990;29:1105–1119. doi: 10.1002/bip.360290621. [DOI] [PubMed] [Google Scholar]
- 29.Lu Z.J., Gloor J.W., Mathews D.H. Improved RNA secondary structure prediction by maximizing expected pair accuracy. RNA. 2009;15:1805–1813. doi: 10.1261/rna.1643609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ding Y., Lawrence C.E. A statistical sampling algorithm for RNA secondary structure prediction. Nucleic Acids Res. 2003;31:7280–7301. doi: 10.1093/nar/gkg938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ding Y., Chan C.Y., Lawrence C.E. RNA secondary structure prediction by centroids in a Boltzmann weighted ensemble. RNA. 2005;11:1157–1166. doi: 10.1261/rna.2500605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 2003;31:3406–3415. doi: 10.1093/nar/gkg595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Gruber A.R., Lorenz R., Bernhart S.H., Neuböck R., Hofacker I.L. The Vienna RNA websuite. Nucleic Acids Res. 2008;36:W70–W74. doi: 10.1093/nar/gkn188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Gong J., Liu C., Liu W., Wu Y., Ma Z., Chen H., Guo A.-Y. An update of miRNASNP database for better SNP selection by GWAS data, miRNA expression and online tools. Database J. Biol. Databases Curation. 2015:bav029. doi: 10.1093/database/bav029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lorenz R., Bernhart S.H., Höner Zu Siederdissen C., Tafer H., Flamm C., Stadler P.F., Hofacker I.L. ViennaRNA Package 2.0. Algorithms Mol. Biol. 2011;6:26. doi: 10.1186/1748-7188-6-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kozomara A., Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014;42:D68–D73. doi: 10.1093/nar/gkt1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Darty K., Denise A., Ponty Y. VARNA: interactive drawing and editing of the RNA secondary structure. Bioinforma. Oxf. Engl. 2009;25:1974–1975. doi: 10.1093/bioinformatics/btp250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Wickham H. ggplot2: elegant graphics for data analysis. NY: Springer; 2009. [Google Scholar]
- 39.Chen C.-Z., Li L., Lodish H.F., Bartel D.P. MicroRNAs modulate hematopoietic lineage differentiation. Science. 2004;303:83–86. doi: 10.1126/science.1091903. [DOI] [PubMed] [Google Scholar]
- 40.Gao X., Yang L., Ma Y., Yang J., Zhang G., Huang G., Huang Q., Chen L., Fu F., Chen Y., et al. No association of functional variant in pri-miR-218 and risk of congenital heart disease in a Chinese population. Gene. 2013;523:173–177. doi: 10.1016/j.gene.2013.03.119. [DOI] [PubMed] [Google Scholar]
- 41.Calin G.A., Ferracin M., Cimmino A., Di Leva G., Shimizu M., Wojcik S.E., Iorio M.V., Visone R., Sever N.I., Fabbri M., et al. A MicroRNA signature associated with prognosis and progression in chronic lymphocytic leukemia. N. Engl. J. Med. 2005;353:1793–1801. doi: 10.1056/NEJMoa050995. [DOI] [PubMed] [Google Scholar]
- 42.Diederichs S., Haber D.A. Sequence variations of microRNAs in human cancer: alterations in predicted secondary structure do not affect processing. Cancer Res. 2006;66:6097–6104. doi: 10.1158/0008-5472.CAN-06-0537. [DOI] [PubMed] [Google Scholar]
- 43.Shen J., Ambrosone C.B., DiCioccio R.A., Odunsi K., Lele S.B., Zhao H. A functional polymorphism in the miR-146a gene and age of familial breast/ovarian cancer diagnosis. Carcinogenesis. 2008;29:1963–1966. doi: 10.1093/carcin/bgn172. [DOI] [PubMed] [Google Scholar]
- 44.Wu M., Jolicoeur N., Li Z., Zhang L., Fortin Y., L'Abbe D., Yu Z., Shen S.-H. Genetic variations of microRNAs in human cancer and their effects on the expression of miRNAs. Carcinogenesis. 2008;29:1710–1716. doi: 10.1093/carcin/bgn073. [DOI] [PubMed] [Google Scholar]
- 45.Jazdzewski K., Murray E.L., Franssila K., Jarzab B., Schoenberg D.R., de la Chapelle A. Common SNP in pre-miR-146a decreases mature miR expression and predisposes to papillary thyroid carcinoma. Proc. Natl. Acad. Sci. U.S.A. 2008;105:7269–7274. doi: 10.1073/pnas.0802682105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Feng J., Sun G., Yan J., Noltner K., Li W., Buzin C.H., Longmate J., Heston L.L., Rossi J., Sommer S.S. Evidence for X-chromosomal schizophrenia associated with microRNA alterations. PloS One. 2009;4:e6121. doi: 10.1371/journal.pone.0006121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Sun G., Yan J., Noltner K., Feng J., Li H., Sarkis D.A., Sommer S.S., Rossi J.J. SNPs in human miRNA genes affect biogenesis and function. RNA. 2009;15:1640–1651. doi: 10.1261/rna.1560209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Shen J., DiCioccio R., Odunsi K., Lele S.B., Zhao H. Novel genetic variants in miR-191 gene and familial ovarian cancer. BMC Cancer. 2010;10:47. doi: 10.1186/1471-2407-10-47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Harnprasopwat R., Ha D., Toyoshima T., Lodish H., Tojo A., Kotani A. Alteration of processing induced by a single nucleotide polymorphism in pri-miR-126. Biochem. Biophys. Res. Commun. 2010;399:117–122. doi: 10.1016/j.bbrc.2010.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Li L., Meng T., Jia Z., Zhu G., Shi B. Single nucleotide polymorphism associated with nonsyndromic cleft palate influences the processing of miR-140. Am. J. Med. Genet. A. 2010;152:856–862. doi: 10.1002/ajmg.a.33236. [DOI] [PubMed] [Google Scholar]
- 51.Li L., Zhu G.-Q., Meng T., Shi J.-Y., Wu J., Xu X., Shi B. Biological and epidemiological evidence of interaction of infant genotypes at Rs7205289 and maternal passive smoking in cleft palate. Am. J. Med. Genet. A. 2011;155:2940–2948. doi: 10.1002/ajmg.a.34254. [DOI] [PubMed] [Google Scholar]
- 52.Hu Y., Liu C.-M., Qi L., He T.-Z., Shi-Guo L., Hao C.-J., Cui Y., Zhang N., Xia H.-F., Ma X. Two common SNPs in pri-miR-125a alter the mature miRNA expression and associate with recurrent pregnancy loss in a Han-Chinese population. RNA Biol. 2011;8:861–872. doi: 10.4161/rna.8.5.16034. [DOI] [PubMed] [Google Scholar]
- 53.Tu H.-F., Liu C.-J., Chang C.-L., Wang P.-W., Kao S.-Y., Yang C.-C., Yu E.-H., Lin S.-C., Chang K.-W. The association between genetic polymorphism and the processing efficiency of miR-149 affects the prognosis of patients with head and neck squamous cell carcinoma. PloS One. 2012;7:e51606. doi: 10.1371/journal.pone.0051606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hoffman A.E., Liu R., Fu A., Zheng T., Slack F., Zhu Y. Targetome profiling, pathway analysis and genetic association study implicate miR-202 in lymphomagenesis. Cancer Epidemiol. Biomark. Prev. 2013;22:327–336. doi: 10.1158/1055-9965.EPI-12-1131-T. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Lung R.W.-M., Wang X., Tong J.H.-M., Chau S.-L., Lau K.-M., Cheng S.-H., Woo J.K.-S., Woo J., Leung P.-C., Ng M.H.-L., et al. A single nucleotide polymorphism in microRNA-146a is associated with the risk for nasopharyngeal carcinoma. Mol. Carcinog. 2013;52(Suppl. 1):E28–E38. doi: 10.1002/mc.21937. [DOI] [PubMed] [Google Scholar]
- 56.Lv H., Pei J., Liu H., Wang H., Liu J. A polymorphism site in the pre-miR-34a coding region reduces miR-34a expression and promotes osteosarcoma cell proliferation and migration. Mol. Med. Rep. 2014;10:2912–2916. doi: 10.3892/mmr.2014.2582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Fu A., Hoffman A.E., Liu R., Jacobs D.I., Zheng T., Zhu Y. Targetome profiling and functional genetics implicate miR-618 in lymphomagenesis. Epigenetics. 2014;9:730–737. doi: 10.4161/epi.27996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Zhang A.-Q., Gu W., Zeng L., Zhang L.-Y., Du D.-Y., Zhang M., Hao J., Yue C.-L., Jiang J. Genetic variants of microRNA sequences and susceptibility to sepsis in patients with major blunt trauma. Ann. Surg. 2015;261:189–196. doi: 10.1097/SLA.0000000000000687. [DOI] [PubMed] [Google Scholar]
- 59.Id Said B., Malkin D. A functional variant in miR-605 modifies the age of onset in Li-Fraumeni syndrome. Cancer Genet. 2015;208:47–51. doi: 10.1016/j.cancergen.2014.12.003. [DOI] [PubMed] [Google Scholar]
- 60.Conte I., Hadfield K.D., Barbato S., Carrella S., Pizzo M., Bhat R.S., Carissimo A., Karali M., Porter L.F., Urquhart J., et al. MiR-204 is responsible for inherited retinal dystrophy associated with ocular coloboma. Proc. Natl. Acad. Sci. U.S.A. 2015;112:E3236–E3245. doi: 10.1073/pnas.1401464112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Gong J., Tong Y., Zhang H.-M., Wang K., Hu T., Shan G., Sun J., Guo A.-Y. Genome-wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis. Hum. Mutat. 2012;33:254–263. doi: 10.1002/humu.21641. [DOI] [PubMed] [Google Scholar]
- 62.Bhartiya D., Laddha S.V., Mukhopadhyay A., Scaria V. miRvar: a comprehensive database for genomic variations in microRNAs. Hum. Mutat. 2011;32:E2226–E2245. doi: 10.1002/humu.21482. [DOI] [PubMed] [Google Scholar]
- 63.Zorc M., Obsteter J., Dovc P., Kunej T. Genetic variability of MicroRNA genes in 15 animal species. J. Genomics. 2015;3:51–56. doi: 10.7150/jgen.11246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Bhattacharya A., Ziebarth J.D., Cui Y. PolymiRTS Database 3.0: linking polymorphisms in microRNAs and their target sites with human diseases and biological pathways. Nucleic Acids Res. 2014;42:D86–D91. doi: 10.1093/nar/gkt1028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Sabarinathan R., Tafer H., Seemann S.E., Hofacker I.L., Stadler P.F., Gorodkin J. The RNAsnp web server: predicting SNP effects on local RNA secondary structure. Nucleic Acids Res. 2013;41:W475–W479. doi: 10.1093/nar/gkt291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Deveci M., Catalyürek U.V., Toland A.E. mrSNP: software to detect SNP effects on microRNA binding. BMC Bioinformatics. 2014;15:73. doi: 10.1186/1471-2105-15-73. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.