Abstract
Editing in microRNAs, particularly in seed can significantly alter the choice of their target genes. We show that out of 13 different human tissues, different regions of brain showed higher adenosine to inosine (A-to-I) editing in mature miRNAs. These events were enriched in seed sequence (73.33%), which was not observed for cytosine to uracil (17.86%) editing. More than half of the edited miRNAs showed increased stability, 72.7% of which had ΔΔG values less than −6.0 Kcal/mole and for all of them the edited adenosines mis-paired with cytosines on the pre-miRNA structure. A seed-editing event in hsa-miR-411 (with A – C mismatch) lead to increased expression of the mature form compared to the unedited version in cell culture experiments. Further, small RNA sequencing of GBM patients identified significant miRNA hypoediting which correlated with downregulation of ADAR2 both in metadata and qRT-PCR based validation. Twenty-two significant (11 novel) A-to-I hypoediting events were identified in GBM samples. This study highlights the importance of specific sequence and structural requirements of pre-miRNA for editing along with a suggestive crucial role for ADAR2. Enrichment of A-to-I editing in seed sequence highlights this as an important layer for genomic regulation in health and disease, especially in human brain.
Introduction
RNA molecules undergo multiple post-transcriptional modifications1 for performing diverse functions. Various efforts have been made for identification2, 3 and understanding the significance of these modifications4, 5. RNA editing – the most well studied modification - changes the information encoded by the genome and adds complexity to the gene regulatory networks6–8. The predominant editing event, adenosine-to-inosine (A-to-I) is mediated by ADAR (Adenosine deaminase acting on RNA) family members which acts on double-stranded RNA (dsRNA) with or without a perfect complementarity9. With the advent of next generation sequencing multiple groups have devised experimental10, 11 as well as computational12, 13 approaches to identify genome-wide A-to-I editing events in RNA. For protein-coding transcripts A-to-I editing is essential for normal development14, 15 and is enriched in the brain16. A-to-I modification happens more promiscuously within perfect dsRNA substrates, deaminating up to 50% of the adenosine residues17 whereas internal mismatches and bulges in dsRNA substrates is associated with ADAR selectivity18.
Another form of canonical RNA editing event involves cytosine to uracil (C-to-U) deamination15 mediated by APOBEC1 (apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 1). APOBEC1 mediated editing events provide tissue specificity and diversity for ApoB mRNAs19 but deregulation of APOBEC1 can also bring about devastating phenotype like cancer20.
MicroRNAs (miRNAs) are ~22 nucleotide long, non-coding RNA which usually down regulate gene expression by binding to the 3′-untranslated region (3′-UTR) of mRNAs21. Bases 2–8 (seed region) from the 5′-end of the mature miRNA are critical determinants of target complementarity22. Premature forms of a miRNA, being a dsRNA molecule, can undergo A-to-I editing at different stages of biogenesis affecting it’s maturation and expression9, 23. A recent paper has shown that ADAR1 can bind to miRNAs in its primary, precursor and mature forms, where binding to the primary miRNA was found to be the highest24. A-to-I editing in miRNAs can affect its cleavage in the nucleus25 or cytoplasm26 and might also result in altered target genes. MiRNA editing has been shown to be important in tissue specific regulation in normal brain27. A recent study has also shown that A-to-I editing in miRNA increases during development, by analysing different developmental stages of mouse brain28.
There is a considerable body of literature for A-to-I editing events in miRNAs27, 29, 30. Recently, studies have also started reporting importance of C-to-U editing in miRNAs31, 32. However, for both these canonical miRNA editing types, the tissue specific spectrum in normal human tissues remains to be seen. In addition, currently there is no consensus on effect of editing at pri/pre level on processing and expression of mature miRNAs. There are reports that indicate both enhanced33, 34 and reduced25, 26, 35 processing and expression upon editing.
In this study we have performed a massively parallel sequencing based large-scale analysis for both A-to-I and C-to-U editing on human miRNAs across 13 different tissues. We explored the positional bias of these events and the role of editing in pri-miRNA on mature miRNA expression. Further, editing in different parts of the brain from same individuals were analyzed to look for intra-individual variability and compared with the scenario in brain from patients of glioblastoma multiforme.
Results
A-to-I editing in miRNAs are enriched in seed sequence in diverse human tissues
We have analysed >1 billion sequences from 50 small RNA sequencing experiments representing 13 diverse healthy human tissues (Supplemental Table S1) and identified 60 and 56 non-redundant A-to-I and C-to-U editing events, respectively (Supplemental Table S2). A-to-I editing levels within mature miRNAs were found to be the highest in prefrontal cortex followed by total RNA from brain (Fig. 1A) whereas for C-to-U, liver revealed higher editing (Supplemental Figure S1). Prefrontal cortex harbored 30 non-redundant A-to-I sites (4.63% of the total expressed miRNAs; average of six independent experiments), 11 of which were found in all six samples (Supplemental Figure S2A). Total RNA from brain had 24 A-to-I sites (3.47% of the total expressed miRNAs; average of three independent experiments) out of which eight sites were found in all three samples (Supplemental Figure S2B). Amongst other tissues editing was found to be higher in lung (3.16%; average of six independent experiments; Fig. 1) with 23 non-redundant sites, eight of which were shared in all six samples (Supplemental Figure S2C). Such consistent editing events across multiple samples for other tissues were also found. A detailed list of all A-to-I and C-to-U editing events in all tissues is provided in Supplemental Table S2.
Thirteen out of 60 non-redundant A-to-I editing events (21.66%) were located on chromosome 14. Ten out of these 13 were in miRNAs belonging to the large miR-379/miR-656 cluster alone (henceforth C14MC). For all tissues analysed the fraction of edited miRNAs from C14MC was significantly higher compared to the random chance namely the miRnome average (p < 0.008; Fig. 1B).
We focused on editing events occurring within the seed sequence of mature miRNAs and found 73.33% (44/60) such sites, whereas such a bias was not observed for C-to-U editing (Fig. 2A; p = 6.66 × 10−9). Interestingly, 43% (19/44) of the A-to-I events were located in 4th and 5th positions of the seed without any specific bias for adenosine in those positions (Fig. 2B and Supplemental Figure S3).
Neuron rich frontal cortex has more A-to-I editing events than corpus callosum of the same individuals
We have performed small RNA sequencing using massively parallel sequencing methods in paired samples of frontal cortex (FC) and corpus callosum (CC) from six individuals. We had exome sequence (DNA) data available for four of these pairs. The exome data was used to rule out the possibility that the A-to-I editing sites were due to variations at the DNA level of the specific individuals. Out of all sites, 75% (33/44) and 70.58% (24/34) were captured by the exome data, in FC and CC, respectively (Supplemental Figure S4A and B). Two observed A > G variations were found independently in FC and CC DNA and hence were excluded from the editing dataset. Over-laying the sites with dbSNP (build 142) identified two SNPs, one in FC (rs515924, A-to-G) and the other (rs554506562) both in FC and CC, which were also excluded. After filtering out these variations at the DNA level, the extent of editing was significantly higher in FC than CC (p = 0.015; Fig. 3A) as well as for all other tissues (p = 0.02 with respect to lung). Our analysis revealed a total of 157 and 122 mature miRNA A-to-I sites in FC and CC, respectively (41 and 32 non-redundant sites in FC and CC, respectively; Supplemental Table S2). As observed for other tissues, in FC and CC also we found an enrichment of editing events within the seed region (70.73% in FC and 75% for CC, Supplemental Table S2). More than one-third of them were present in all six samples for FC and CC. The novel A-to-I miRNA editing events detected in our study is provided in Supplemental Table S3. We did not find significant difference (p = 0.54) in the levels of C-to-U editing between FC and CC samples (Supplemental Figure S4C and D and Supplemental Figure S5).
An event in let-7e showed tissue specificity as well as intra-individual differences in miRNA editing. Hsa-let-7e was expressed in all tissues, while an editing event in its seed sequence was exclusively found in FC (4 out of 6, up to 4.76% edited, Supplemental Table S2) but not in the corpus callosum of the same individuals which has been validated using targeted SnaPshot experiments (Supplemental Figure S6). The edited form potentially targets three genes whereas the unedited form has five targets with none in common (Supplemental Table S4). Apart from these there were five other editing events, which were specific to FC samples and one event specific to CC samples (Supplemental Table S4).
Since ADAR family members are known to mediate the A-to-I editing events, we checked the expression of ADAR1 and ADAR2 in FC and CC by quantitative PCR. Interestingly, for five sample-pairs tested, ADAR2 expression levels were significantly higher in FC than CC (p = 0.008; Fig. 3B) in line with increased editing levels in FC (Fig. 3A). Such a significant expression difference was not observed for ADAR1 (p = 0.63, Fig. 3B).
A-to-I (and not C-to-U) edited miRNAs are predictably more stable than their unedited forms
Using RNA secondary structure analysis we observed that 55% of the edited sites in pre-miRNAs (33/60; Supplemental Table S5A) show an increased stability whereas 16.67% (10/60) show no change in free energy and the rest show a minor decrease in stability (ΔΔG is 0.1 to 0.9 Kcal/mole). 72.7% (24/33) of the edited sites in pre-miRNAs that gain on stability has ΔΔG values less than −6.0 Kcal/mole (Fig. 4A). Only six out of 56 C-to-U edited events (10.71% compared to 55% for A-to-I) were predicted to have increased stability (ΔΔG < 0 kcal/mole) upon editing (Supplementary Figure S7 and Supplementary Table S5B). As ADAR specifically edits double stranded RNA (dsRNA) substrates, we checked the sequence context of the pre-miRNA hairpins. For all the miRNAs that gained stability upon editing (ΔΔG < −6.0 Kcal/mole) the edited adenosines had mis-paired cytosines on the pre-miRNA hairpin structure (Fig. 4B and Supplemental Table S5A). This was not observed for the edited sites that did not provide a stability advantage based on free energy change (Supplemental Table S5A). To validate the altered minimum free energy levels upon editing, we randomly selected 18 miRNAs to check in the three-dimensional structure. Nine out of the 10 candidates that gained in stability in 2D analysis showed the same trend in three-dimensional analysis (Supplemental Table S5A). All these miRNAs had an A-C mismatch in the pre-miRNA hairpin structure pointing towards the importance of this mismatch in conferring stability upon editing.
Figure 5 shows the structural features of unedited and edited forms of miR-1301 as a representative example in both two and three dimensional space. The edited species makes 4 hydrogen bonds with guanine while the unedited species has only 1 with adenine (the total hydrogen bonds was 113 and 122, in unedited and edited structures, respectively). The decrease in Gibb’s free energy in 2D is reflected by more compact and ordered structure in 3D (Fig. 5C) with corresponding change in its entropy profile (Fig. 5B).
ADAR2 downregulation and miRNA hypoediting in Glioblastoma Multiforme
In order to check the expression of ADARs in a disease scenario, we have analyzed data available in the TCGA server for 593 GBM samples and controls. The expression of ADAR2 was significantly downregulated (after bonferroni correction, p = 0.004; Supplemental Figure S8) in GBM. In line with the results described above such significant downregulation was not observed for ADAR1 in the TCGA data (p = 0.2; Supplemental Figure S8).
We observed significant hypoediting of mature miRNAs in a set of five GBM samples that we sequenced compared to the six FC and six CC samples used as controls (2.06% in GBM, 4.43% in FC and 3.38% in control; p = 0.009 and p = 0.004, respectively; Fig. 6A). These GBM samples also showed significant downregulation for ADAR2 (p = 0.002 and p = 0.009; Fig. 6B) which most likely have resulted in the observed hypoediting. When checked for individual miRNAs in GBM samples that were edited in FC and CC (total 29 non-redundant events in at least 3 FC and CC samples), 22 (75.86%) showed significant hypoediting (Wilcoxon-two-tailed, p < 0.05) or no editing in GBM. Eleven out of these 22 events were novel miRNA hypoediting events in GBM (Table 1). The hypoediting for three miRNAs was also confirmed by SnaPshot experiments (Fig. 7A and Supplemental Figure S9). Sixteen out of 22 events were within the seed sequence and would result in drastic redirection of targets with the maximum target overlap before and after editing being only 7.53% (Table 1). For eight of these hypoedited miRNAs the editing level was below the level of detection in all GBM samples and amongst others we observed up to 5-fold reduction of the level of editing in GBM (e.g. for hsa-miR-411, Table 1).
Table 1.
miRNAs | aPresence in samples | bMedian editing (in %) | Seed editing event | Position in precursor | Target Prediction | cOverlap (%) |
---|---|---|---|---|---|---|
Before/After editing | ||||||
hsa-mir-598-3p | 6/5/0 | 0.49/0.34/0 | Yes | 62 | 11/9 | 0 (0) |
hsa-mir-376a-1-5p | 6/6/0 | 11.24/8.43/0 | Yes | 9 | 131/166 | 4 (3.05) |
hsa-mir-337-3p* | 4/1/0 | 4.21/−/0 | Yes | 66 | 146/197 | 11 (7.53) |
hsa-mir-376c-3p | 6/5/2 | 3.72/1.9/− | Yes | 48 | 156/192 | 11 (7.05) |
hsa-mir-1301-3p#,* | 6/6/2 | 7.59/3.94/− | Yes | 52 | 230/7 | 2 (0.87) |
hsa-mir-421 | 6/6/3 | 1.40/0.61/0.57 | Yes | 54 | 271/4 | 1 (0.37) |
hsa-mir-99b-3p* | 6/6/2 | 3.61/1.65/− | Yes | 47 | 33/21 | 0 (0) |
hsa-mir-641-5p | 6/6/3 | 5.62/7.08/3.35 | Yes | 18 | 355/128 | 11 (3.1) |
hsa-let-7e-3p* | 4/0/0 | 2.09/0/0 | Yes | 57 | 5/3 | 0 (0) |
hsa-mir-1251-5p#,* | 4/4/0 | 11.98/11.87/0 | Yes | 10 | 58/305 | 4 (6.9) |
hsa-mir-381-3p# | 6/6/5 | 6.87/7.15/3.07 | Yes | 52 | 638/302 | 48 (7.52) |
hsa-mir-411-5p | 6/6/5 | 27.57/30.85/5.71 | Yes | 20 | 64/58 | 0 (0) |
hsa-mir-130a-3p | 5/5/0 | 0.76/0.97/0 | Yes | 56 | 724/172 | 27 (3.73) |
hsa-mir-151a-3p | 6/6/3 | 2.87/1.26/0.48 | Yes | 49 | 76/77 | 3 (3.95) |
hsa-let-7d-3p | 6/4/0 | 0.70/0.31/0 | Yes | 66 | 9/1 | 0 (0) |
hsa-mir-27b-3p* | 5/0/0 | 0.25/0/0 | Yes | 64 | 921/8 | 2 (0.22) |
hsa-mir-301b-3p* | 5/2/1 | 1.24/−/− | No | 63 | NA | NA |
hsa-mir-340-3p* | 4/0/0 | 1.19/0/0 | No | 70 | NA | NA |
hsa-mir-377-3p* | 4/3/0 | 8.10/4.6/0 | No | 54 | NA | NA |
hsa-mir-539-5p* | 6/6/0 | 2.23/1.62/0 | No | 18 | NA | NA |
hsa-mir-889-3p* | 5/5/0 | 0.62/0.41/0 | No | 62 | NA | NA |
hsa-mir-99a-5p | 6/6/4 | 3.40/1.44/0.30 | No | 13 | NA | NA |
22 miRNAs were found to be significantly hypoedited in GBM (wilcoxon-two-tailed, p < 0.05).
aPresence in samples (FC/CC/GBM).
bMedian editing (FC/CC/GBM).
cPercentage overlap was calculated by overlapped targets/targets before editing.
NA, not applicable because they were not seed editing events.
“−”Indicates miRNAs where median value could not be calculated.
The asterisk (*) represents 11 novel miRNAs found to be hypoedited in GBM in this study.
“#”Indicates events validated by SNPShot (data in Fig. 7A and Supplemental Figure S9).
We investigated differentially expressed miRNAs in GBM compared to FC samples from the miRnome-wide data and identified 263 and 292 up- and downregulated miRNAs, respectively (303 and 268 up- and downregulated miRNAs when compared with CC, Supplemental Table S6A and B). When checked for the altered expression of the hypoedited miRNAs, we found 45.45% (10/22) to be downregulated in GBM (Table 1 and Supplemental Table S6A). Downregulation of four miRNAs found to be hypoedited and downregulated in GBM compared to both FC and CC samples were further validated by qRT-PCR (Fig. 7B and Supplemental Table S7). Seven pre-miRNAs out of these 10 were predicted to have increased structural stability upon editing (ΔΔG values < −6.0 Kcal/mole) and all of them had an A-C mismatch in the pre-miRNA hairpin structure (Supplemental Table S5A). To test the hypothesis that such a gain in stability can positively influence pri-miRNA processing – we have selected miR-411 as a representative miRNA, based on its editing across all tissues studied, an A-C mismatch at the editing site, seed sequence editing and its being one of the miRNAs showing downregulation and hypoediting in GBM. In cell based assays it was found that the expression level of mature miR-411 (edited) was 3.5 fold higher than the un-edited version (Fig. 8). The results were in agreement when compared with the empty vector construct (Supplemental Figure S10).
We found miRnome-wide hypoediting also for C-to-U editing events in GBM compared to FC and CC samples (Supplemental Figure S11) and identified 10 novel hypoediting events (Supplemental Table S8) in GBM.
Discussion
A-to-I editing in miRNA, or its absence, has been shown to have major consequences for cellular outcomes27, 36. We embarked on a large-scale analysis of A-to-I editing in mature miRNAs from small RNA sequencing data across 13 different human tissues. Overall we observed that adenosines flanked by a uracil (5′) and guanine (3′) were more prone to get edited (Supplemental Figure S12), which is in agreement with recent reports29, 34, 37, 38. MiRNAs where the edited adenosine was part of the UAG motif showed significantly higher levels of editing than those, which did not have this sequence context (average values 10.21% vs. 3.05%, one-tailed t-test, p = 0.02). Notably, in almost all possible motifs where the adenine residue was preceded by a guanine – it was not edited (Supplemental Figure S12D).
The ADAR mediated deamination requires the substrate to be double stranded9. We analysed the base-pairing context of the edited adenosines in the hair-pin structure of the pre-miRNA – which revealed a bias for a mis-pair with cytosine (Fig. 4B). An editing at these sites will create a better base-pairing of cytosine with guanine (inosine), predictably enhancing the stability of the dsRNA – also supported by other studies29, 39. This enhanced stability was observed at the level of both 2D and 3D structures of the precursor miRNAs (Figs 4A and 5). Overall, the precursors that gained maximum stability upon editing (ΔΔG ≤ −6.0 Kcal/mole) showed higher levels of editing (average editing 6.94%) compared to those, which did not have any gain in free energy (ΔΔG ≥ 0, average editing 2.55%, one-tailed t-test; p = 0.03). We have also shown for miR-411 in cell based assays that the edited form was significantly more expressed than the unedited version (Fig. 8 and Supplemental Figure S10). Recently, similar observations have been made where edited miR-497* (also with A-C mismatch) showed increased processing by Drosha and was found to be the most downregulated in ADAR2 deficient mice33. It is reported that for a pre-miRNA secondary structure a perfect hairpin might be a barrier for efficient maturation process and/or a promiscuous editing17, 40. However, our study along with others33 indicates that a local mismatch (A-C), which can be removed through editing (I/G-to-C), can actually lead to a better processing for specific miRNAs. The functional link between ADAR group of enzymes and editing vis-à-vis processing of miRNA needs to be studied in greater depth. A recent paper has shown ADAR2 overexpression alters expression of large number of miRNAs35. The miRNAs showing reduced expression were mostly onco-miRs, where an editing event can be protective by negatively affecting processing to mature miRNAs35. Similarly, it will be important to identify tumor suppressor miRNAs where an editing event can lead to a natural protection by positively affecting the miRNA maturation. Interestingly, in our study, miR-411 is a potential tumor suppressor miRNA41 where an editing event leads to increased processing – indicating such possibilities in a larger scale.
ADAR family members, especially ADAR1 and ADAR2 have been shown to have both overlapping and unique editing targets for mRNAs42 and miRNAs27, 42, 43. It has been recently shown that ADAR2 expression correlated more with the mRNA editing levels in cerebellum and prefrontal cortex than it did with ADAR144 – which is also supported in case of miRNA by others29, 33 and by this study. We observed significantly higher expression of ADAR2 in FC compared to the corresponding CC samples (not observed for ADAR1), hinting that one of the many possible reasons for higher editing in FC can be due to higher expression of ADAR2. It remains to be seen whether functional requirement of higher editing levels in post-mitotic cells of the adult brain (e.g. neurons) induces increased expression of the editing enzyme.
We have studied glioblastoma multiforme (GBM) as a representative disease of the brain to understand the possible dysregulation of miRNA editing in disease. Small RNA sequencing of five GBM samples revealed overall hypoediting of miRNA compared to six FC and six CC samples and identified 22 (11 novel) hypoediting events in GBM. Interestingly, eight of these 22 hypoedited miRNAs belonged to the miR-379/miR-656 cluster. It has been suggested that miRNAs in C14MC results from a large polycistron45 and it possibly functions as a single tumor suppressor gene41. Whether an increased editing adds to this cluster’s functional diversity will be revealed by further studies. We also observed downregulation of ADAR2 in GBM samples both in meta-data (TCGA dataset, Supplemental Figure S8) and qRT-PCR based validation (Fig. 7B) in our samples.
Through this study we have been able to show that in both healthy and diseased state, miRNA editing is an important layer of information with specific sequence and structural preferences – especially in the human brain. Such varied layers of regulatory parameters in the biological systems makes them inherently complex and intricately interconnected – achieving remarkable plasticity that is required to dynamically evolve in situations hitherto unknown.
Materials and Methods
Samples
Portions of Frontal cortex (Grey matter) and Corpus Callosum (White matter) were obtained from post-mortem samples of road accident victims. The samples were obtained from NIMHANS Brain Bank, Bangalore, India. GBM samples were obtained from AIIMS, New Delhi, India. The samples were collected according to the Helsinki Declaration and the ethical review board of All India Institute of Medical Sciences, Delhi, India approved the project. Sample collection, characterization and storage were done as described previously41. Diagnosis and grading of tumor samples were done as per 2007 WHO classification.
The details of the samples used in the study are provided in Supplemental Table S1.
RNA isolation, library preparation and sequencing
Total RNA was isolated from Frontal Cortex, Corpus Callosum and GBM samples using miRvana miRNA isolation kit (Ambion, USA) as per the manufacturer’s instructions. Libraries were prepared using Illumina’s TruSeq Small RNA Sample prep Kit following the manufacture’s protocol. Cluster generation and sequencing was done on Illumina HiSeq2000 using standard Illumina sequencing workflow with the multiplexing option. Two samples were loaded in one lane and 50 bases single-end sequencing was done. The in-house small RNA sequencing data is deposited at the sequence read archive (SRA ID: SRP063390).
Data analysis
Data analysis was done using the published pipeline29 with the default parameters. Briefly, the 3′ adapters were removed from the sequencing reads. Reads shorter than 15 bases and longer than 28 bases were discarded. The trimmed reads were aligned against the genome using Bowtie 0.12.7 allowing 1 mismatch. Reads were mapped to pre-miRNA sequences (from miRBase 20) and binomial statistics was used to filter-out sequencing errors. Only significant (p-value < 0.05, post Bonferroni correction) modifications were considered. miRNAs with a minimum read-count 10 was considered. The software and detail description is available at <tau.ac.il/~elieis/miR_editing>.
dbSNP build 142 and exome sequencing data was used to filter out variations at the DNA level.
Percentage of A-to-I editing was calculated in the following way:
Percentage A-to-I editing = (Number of edited miRNAs/Total miRNAs expressed with greater than equal to 10 read counts) * 100. The same was followed for C-to-U editing.
Publicly available small RNA sequencing data
The detailed list of the public domain sequencing data is provided in Supplemental Table S1. The same pipeline was used for both in-house and publicly available dataset for calling miRNA editing events.
DNA isolation and Exome Sequencing
DNA was isolated from brain tissues using Omniprep Genomic DNA isolation kit (G-Biosciences, USA) as per manufacturer’s protocol. Exome capture was done using Illumina TruSeq Exome capture kit. 100 base pair paired end sequencing was done using Illumina HiSeq 2000 (Ilumina, USA). The exome sequence data is publicly available at the sequence read archive (SRA ID: SRP045655).
Exome sequencing data analysis
Raw data was checked for per base quality score and reads having 80% bases with phred quality score 30 and greater were carried forward for downstream analysis and rest were discarded.
Alignment
Reads were aligned to the reference genome (hg19) using BWA (version 0.6.1)46 allowing for 2 mismatches. More than 98% percent of the data was aligned to reference for each sample. Data was also checked for PCR duplicates and the same were removed.
Variation Calling
Genome Analysis Toolkit version 1.547 along with Samtools (version 0.1.18)48 was used to call variations from all the paired samples (CC vs. FC). Variants with >90% strand bias were removed, minimum base quality score was kept at 20 and minimum mapping quality of a read was kept at 40.
Target prediction
The target prediction for unedited and edited miRNAs was done using TargetScanHuman 5.2 Custom (http://www.targetscan.org/vert_50/seedmatch.html) with the 7-mer seed sequence from 2 to 8 nucleotides as an input49.
Motif analysis
Ten bases up-stream and down-stream of the edited adenosine was taken for the motif analysis using WebLogo (http://weblogo.berkeley.edu/logo.cgi)50 and the frequency of bases at each position was calculated. The motif analysis was done for all the 60 miRNAs found to be edited in our study.
To look for tri-nucleotide sequence enrichment within pre-miRNAs, 1872 human pre-miRNAs was downloaded from miRBase v20. Three bases sliding window was taken to analyse enriched tri-nucleotides within pre-miRNAs using an in-house perl program.
Two dimensional (2D) and three dimensional (3D) structural analyses
Minimum free energy (MFE; ΔG) for the unedited and edited miRNAs was predicted at the 2D and 3D level. For secondary structure analysis RNAfold WebServer (http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi)51 was used to calculate the partition function and base pairing probability matrix in addition to the MFE structure. MFE is calculated using a loop-based energy model (using Turner model for Energy Parameters) and McCaskill’s algorithm52 for the secondary structures contributing towards the minimum free energy in the RNA by summing the contributing free energies from the loops at 37 °C. The unedited and the edited pre-miRNAs were given as inputs for ΔG prediction. ΔΔG was calculated by subtracting ΔG of unedited from ΔG of edited pre-miRNAs.
For 3D structural analysis, 18 candidate miRNAs were chosen (Supplemental Table S5A) and the optimal secondary structure in dot-bracket notation (from RNAfold WebServer) was used as structural constrains in the MC-Fold | MC-Sym pipeline53. Structural constrains forces certain nucleotides to be either paired or unpaired and will restrict the conformational search space. The advantage of using a dual approach is that it shall use the best secondary model from the first method and feed it as a template to guide and further predict the new secondary structure using the mcfold algorithm. The final tertiary structure of the pre-miRNA was predicted using mc-fold mc-sym pipeline. The energy-minimized model was obtained using the method’s scoring function, which calculates the base pairing energy contribution by reducing the nucleotides into cyclic motifs ΔΔG of the tertiary structure was calculated in the same way as that of the secondary structure.
Expression analysis of ADAR1 and ADAR2
1 micro-gram of RNA was converted into cDNA using High-Capacity cDNA Reverse transcription kit (Thermo Fischer Scientific) as per manufacturer’s protocol in a reaction volume of 20 μl. qRT-PCR (in duplicates) of ADAR1 and ADAR2 was done in five pairs of FC and CC samples. qRT-PCR was not performed in one pair due to limited amount of sample. The expression level of B2M was used for normalization. Statistical significance was calculated on the basis of two-tailed paired t-test.
We have downloaded the mRNA expression data of 593 GBM patients and 10 controls (Agilent G4502A array) from TCGA data portal and analyzed expression, which was lowess normalized data (level3). Differential expression of mRNA between patients and controls was determined using a non-parametric two-tailed wilcoxon test. A Bonferroni correction was done to find out the list of significant differentially expressed mRNAs (p < 0.05, post-correction). The significant (two-tailed paired t-test) down regulation of ADAR2 was validated in five GBM samples compared to five FC and five CC samples by qRT-PCR.
Differential miRNA editing and expression in GBM
Small RNA sequencing was done for five GBM samples. The status of miRNAs found to be edited in at least three FC and CC samples (29 non-redundant events) were compared with the GBM samples and differentially edited miRNAs were identified using two-tailed Wilcoxon test (p < 0.05).
DESeq2 package in R54 was used to identify differential expressed miRNAs in GBM compared to FC and CC samples. Two-step filtering was done to reduce false positives; an adjusted p-value (p < 0.1) cut-off followed by a fold change value of ±1.5 was used for further analysis. The data for DESeq2 (Supplemental Table S6A and B) represents the fold change in log2 scale (≥0.60 for upregulation and ≤−0.60 for downregulation).
Hypoedited and downregulated miRNAs in GBM
Hypoedited miRNAs were validated using SnaPshot reaction. Downregulation of miRNAs were validated using qRT-PCR. Briefly 0.4 micro-gram of RNA was converted into cDNA using QuantiMir kit (System Biosciences, USA) as per manufacturer’s protocol in a reaction volume of 10 μl. qRT-PCR (in duplicates) of hypoedited miRNAs were done in five GBM and five CC samples. U6 snRNA expression level was used for normalization.
Overexpression in HEK293T cell line
A genomic region encompassing mir-411 was cloned in a modified pRIP vector (673 base-pairs). Site-directed mutagenesis was performed using QuikChange Site-Directed Mutagenesis Kit (Agilent) following manufacturer’s protocol. The unedited and edited constructs were separately transfected for overexpression in HEK293T cell line in three biological replicates. Cells were harvested 48 hours post-transfection and mirVana miRNA Isolation Kit (Thermo Fisher Scientific) was used to isolate total RNA. 1 micro-gram of total RNA was treated with DNase I, RNase free (Thermo Fisher Scientific) to get rid of genomic DNA contamination. DNase I treated RNA was used to make cDNA using High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific) and QuantiMir Kit (System Biosciences) following manufacturer’s protocol. qRT-PCR was done to quantify the levels of Pri-, Pre- and mature mir-411. Relative fold change of edited miR-411 over unedited was calculated using 2−ΔΔCt method.
Electronic supplementary material
Acknowledgements
We acknowledge the Human Brain Bank at the National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India for providing the FC and CC samples. We thank Mr. Tony Jose and Dr. Mohammed Faruq for their technical help during the sample preparation and sequencing run of the small RNA libraries. We thank Mr. Asgar Hussain Ansari for helping us in depositing the raw sequencing data in sequence read archive (SRA). We also thank Dr. Dhurjhoti Saha for his technical help in cell culture overexpression experiment. The work is funded by the Council of Scientific and Industrial Research (CSIR), Government of India (Grant Number- BSC-0123). In addition, BSC-0121 (computing facility) is also acknowledged for central facility. Deepanjan Paul acknowledges CSIR for fellowship.
Author Contributions
Deepanjan Paul has performed all the miRNA related experiments, analyzed the data and written the manuscript. Ashis Narayan Sinha, Debasish Mukherjee and Saurabh V. Laddha had helped in setting up the study including establishment of the analysis pipeline. Arjun Ray has performed the structural analyses of miRNA editing. Megha Lal has performed the differential expression analysis. Subhashree Nayak has helped in procurement and processing of the GBM samples with the supervision of Ashish Suri and Chitra Sarkar. Anchal Sharma has contributed in analyzing exome data. Bharati Mehani has contributed in overall bioinformatics support for the transcriptome analysis. Arijit Mukhopadhyay has designed and led the study including arranging for the required funds. All authors have read and approved the manuscript.
Competing Interests
The authors declare that they have no competing interests.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at doi:10.1038/s41598-017-02397-6
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Cantara WA, et al. The RNA Modification Database, RNAMDB: 2011 update. Nucleic acids research. 2011;39:D195–201. doi: 10.1093/nar/gkq1028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Meyer KD, et al. Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell. 2012;149:1635–1646. doi: 10.1016/j.cell.2012.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Chen W, Xing P, Zou Q. Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines. Scientific reports. 2017;7:40242. doi: 10.1038/srep40242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Suzuki T, Nagao A. Human mitochondrial diseases caused by lack of taurine modification in mitochondrial tRNAs. Wiley interdisciplinary reviews. RNA. 2011;2:376–386. doi: 10.1002/wrna.65. [DOI] [PubMed] [Google Scholar]
- 5.Meyer KD, Jaffrey SR. The dynamic epitranscriptome: N6-methyladenosine and gene expression control. Nature reviews. Molecular cell biology. 2014;15:313–326. doi: 10.1038/nrm3785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Bass BL. RNA editing by adenosine deaminases that act on RNA. Annual review of biochemistry. 2002;71:817–846. doi: 10.1146/annurev.biochem.71.110601.135501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Blanc V, Davidson NO. C-to-U RNA editing: mechanisms leading to genetic diversity. The Journal of biological chemistry. 2003;278:1395–1398. doi: 10.1074/jbc.R200024200. [DOI] [PubMed] [Google Scholar]
- 8.Farajollahi S, Maas S. Molecular diversity through RNA editing: a balancing act. Trends in genetics: TIG. 2010;26:221–230. doi: 10.1016/j.tig.2010.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Nishikura K. Functions and regulation of RNA editing by ADAR deaminases. Annu Rev Biochem. 2010;79:321–349. doi: 10.1146/annurev-biochem-060208-105251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bahn JH, et al. Accurate identification of A-to-I RNA editing in human by transcriptome sequencing. Genome research. 2012;22:142–150. doi: 10.1101/gr.124107.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sakurai M, et al. A biochemical landscape of A-to-I RNA editing in the human brain transcriptome. Genome research. 2014;24:522–534. doi: 10.1101/gr.162537.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chen W, Feng P, Ding H, Lin H. PAI: Predicting adenosine to inosine editing sites by using pseudo nucleotide compositions. Scientific reports. 2016;6:35123. doi: 10.1038/srep35123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chen W, et al. iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences. Oncotarget. 2017;8:4208–4217. doi: 10.18632/oncotarget.13758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Horsch M, et al. Requirement of the RNA-editing enzyme ADAR2 for normal physiology in mice. The Journal of biological chemistry. 2011;286:18614–18622. doi: 10.1074/jbc.M110.200881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Keegan LP, Gallo A, O’Connell MA. The many roles of an RNA editor. Nature reviews. Genetics. 2001;2:869–878. doi: 10.1038/35098584. [DOI] [PubMed] [Google Scholar]
- 16.Paul MS, Bass BL. Inosine exists in mRNA at tissue-specific levels and is most abundant in brain mRNA. The EMBO journal. 1998;17:1120–1127. doi: 10.1093/emboj/17.4.1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Polson AG, Bass BL. Preferential selection of adenosines for modification by double-stranded RNA adenosine deaminase. EMBO J. 1994;13:5701–5711. doi: 10.1002/j.1460-2075.1994.tb06908.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kallman AM, Sahlin M, Ohman M. ADAR2 A–>I editing: site selectivity and editing efficiency are separate events. Nucleic acids research. 2003;31:4874–4881. doi: 10.1093/nar/gkg681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Powell LM, et al. A novel form of tissue-specific RNA processing produces apolipoprotein-B48 in intestine. Cell. 1987;50:831–840. doi: 10.1016/0092-8674(87)90510-1. [DOI] [PubMed] [Google Scholar]
- 20.Yamanaka S, et al. Apolipoprotein B mRNA-editing protein induces hepatocellular carcinoma and dysplasia in transgenic animals. Proceedings of the National Academy of Sciences of the United States of America. 1995;92:8483–8487. doi: 10.1073/pnas.92.18.8483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ambros V. The functions of animal microRNAs. Nature. 2004;431:350–355. doi: 10.1038/nature02871. [DOI] [PubMed] [Google Scholar]
- 22.Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116:281–297. doi: 10.1016/S0092-8674(04)00045-5. [DOI] [PubMed] [Google Scholar]
- 23.Ohman M. A-to-I editing challenger or ally to the microRNA process. Biochimie. 2007;89:1171–1176. doi: 10.1016/j.biochi.2007.06.002. [DOI] [PubMed] [Google Scholar]
- 24.Bahn JH, et al. Genomic analysis of ADAR1 binding and its involvement in multiple RNA processing pathways. Nature communications. 2015;6:6355. doi: 10.1038/ncomms7355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yang W, et al. Modulation of microRNA processing and expression through RNA editing by ADAR deaminases. Nature structural & molecular biology. 2006;13:13–21. doi: 10.1038/nsmb1041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kawahara Y, Zinshteyn B, Chendrimada TP, Shiekhattar R, Nishikura K. RNA editing of the microRNA-151 precursor blocks cleavage by the Dicer-TRBP complex. EMBO reports. 2007;8:763–769. doi: 10.1038/sj.embor.7401011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kawahara Y, et al. Redirection of silencing targets by adenosine-to-inosine editing of miRNAs. Science. 2007;315:1137–1140. doi: 10.1126/science.1138050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ekdahl Y, Farahani HS, Behm M, Lagergren J, Ohman M. A-to-I editing of microRNAs in the mammalian brain increases during development. Genome research. 2012;22:1477–1487. doi: 10.1101/gr.131912.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Alon S, et al. Systematic identification of edited microRNAs in the human brain. Genome Res. 2012;22:1533–1540. doi: 10.1101/gr.131573.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Warnefors M, Liechti A, Halbert J, Valloton D, Kaessmann H. Conserved microRNA editing in mammalian evolution, development and disease. Genome Biol. 2014;15:R83. doi: 10.1186/gb-2014-15-6-r83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Negi V, et al. Altered expression and editing of miRNA-100 regulates iTreg differentiation. Nucleic Acids Res. 2015;43:8057–8065. doi: 10.1093/nar/gkv752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zheng Y, et al. Accurate detection for a wide range of mutation and editing sites of microRNAs from small RNA high-throughput sequencing profiles. Nucleic acids research. 2016;44:e123. doi: 10.1093/nar/gkw471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Vesely C, et al. ADAR2 induces reproducible changes in sequence and abundance of mature microRNAs in the mouse brain. Nucleic acids research. 2014;42:12155–12168. doi: 10.1093/nar/gku844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kawahara Y, et al. Frequency and fate of microRNA editing in human brain. Nucleic acids research. 2008;36:5270–5280. doi: 10.1093/nar/gkn479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tomaselli S, et al. Modulation of microRNA editing, expression and processing by ADAR2 deaminase in glioblastoma. Genome biology. 2015;16:5. doi: 10.1186/s13059-014-0575-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Choudhury Y, et al. Attenuated adenosine-to-inosine editing of microRNA-376a* promotes invasiveness of glioblastoma cells. The Journal of clinical investigation. 2012;122:4059–4076. doi: 10.1172/JCI62925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Pinto Y, Cohen HY, Levanon EY. Mammalian conserved ADAR targets comprise only a small fragment of the human editosome. Genome biology. 2014;15:R5. doi: 10.1186/gb-2014-15-1-r5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lehmann KA, Bass BL. Double-stranded RNA adenosine deaminases ADAR1 and ADAR2 have overlapping specificities. Biochemistry. 2000;39:12875–12884. doi: 10.1021/bi001383g. [DOI] [PubMed] [Google Scholar]
- 39.Wong SK, Sato S, Lazinski DW. Substrate recognition by ADAR1 and ADAR2. RNA. 2001;7:846–858. doi: 10.1017/S135583820101007X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Feng Y, Zhang X, Graves P, Zeng Y. A comprehensive analysis of precursor microRNA cleavage by human Dicer. RNA. 2012;18:2083–2092. doi: 10.1261/rna.033688.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Laddha SV, et al. Genome-wide analysis reveals downregulation of miR-379/miR-656 cluster in human cancers. Biol Direct. 2013;8:10. doi: 10.1186/1745-6150-8-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Nishikura K. A-to-I editing of coding and non-coding RNAs by ADARs. Nature reviews. Molecular cell biology. 2016;17:83–96. doi: 10.1038/nrm.2015.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Heale BS, et al. Editing independent effects of ADARs on the miRNA/siRNA pathways. The EMBO journal. 2009;28:3145–3156. doi: 10.1038/emboj.2009.244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chen JY, et al. RNA editome in rhesus macaque shaped by purifying selection. PLoS genetics. 2014;10:e1004274. doi: 10.1371/journal.pgen.1004274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Fiore R, et al. Mef2-mediated transcription of the miR379-410 cluster regulates activity-dependent dendritogenesis by fine-tuning Pumilio2 protein levels. The EMBO journal. 2009;28:697–710. doi: 10.1038/emboj.2009.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome research. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Lewis BP, Shih IH, Jones-Rhoades MW, Bartel DP, Burge CB. Prediction of mammalian microRNA targets. Cell. 2003;115:787–798. doi: 10.1016/S0092-8674(03)01018-3. [DOI] [PubMed] [Google Scholar]
- 50.Crooks GE, Hon G, Chandonia JM, Brenner SE. WebLogo: a sequence logo generator. Genome research. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lorenz R, et al. ViennaRNA Package 2.0. Algorithms for molecular biology: AMB. 2011;6:26. doi: 10.1186/1748-7188-6-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zuker M, Stiegler P. Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic acids research. 1981;9:133–148. doi: 10.1093/nar/9.1.133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Parisien M, Major F. The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data. Nature. 2008;452:51–55. doi: 10.1038/nature06684. [DOI] [PubMed] [Google Scholar]
- 54.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.