Abstract
We previously developed TRIBE, a method for the identification of cell-specific RNA-binding protein targets. TRIBE expresses an RBP of interest fused to the catalytic domain (cd) of the RNA-editing enzyme ADAR and performs adenosine-to-inosine editing on RNA targets of the RBP. However, target identification is limited by the low editing efficiency of the ADARcd. Here we describe HyperTRIBE, which carries a previously characterized hyperactive mutation (E488Q) of the ADARcd. HyperTRIBE identifies dramatically more editing sites, many of which are also edited by TRIBE but at a much lower editing frequency. HyperTRIBE therefore more faithfully recapitulates the known binding specificity of its RBP than TRIBE. In addition, separating RNA binding from the enhanced editing activity of the HyperTRIBE ADAR catalytic domain sheds light on the mechanism of ADARcd editing as well as the enhanced activity of the HyperADARcd.
Keywords: RNA-binding protein, RNA editing
INTRODUCTION
Pre-mRNAs and mRNAs are subject to many post-transcriptional regulatory events, which are mediated by RNA-binding proteins (RBPs). RBPs have been found to be essential for pre-mRNA splicing, 3′ end formation, mRNA translocation from the nucleus to the cytoplasm, and translation, among other processes (Zhao et al. 1999; Jansen 2001; Witten and Ule 2011; Szostak and Gebauer 2013; Gerstberger et al. 2014). Numerous human diseases have been linked to RBPs, e.g., amyotrophic lateral sclerosis (ALS) (Nussbacher et al. 2015). Identifying the RNA targets of an RBP is therefore not only crucial for deciphering its function but also an important aspect of deciphering RBP-related human diseases.
The current standard technique for identifying RBP targets in vivo is CLIP (cross-linking and immunoprecipitation) and its variants (Ule et al. 2005; Hafner et al. 2010). These methods use UV light to covalently link the RBP to its targets, immunoprecipitate the RNA–protein complex, and then purify and identify the covalently bound RNA. CLIP has definite advantages, for example, the ability to identify the exact RBP binding sites on RNA, but there is also a disadvantage: its requirement for large amounts of material to make an extract and do immunoprecipitations (Darnell 2010). It is therefore often difficult to perform CLIP in a cell-specific manner.
We recently developed TRIBE (targets of RNA-binding proteins identified by editing) to study RBP targets in specific cells, especially in small numbers of circadian neurons of the Drosophila adult brain. This method expresses in specific cells a fusion protein of an RBP and the catalytic domain of Drosophila ADAR (McMahon et al. 2016). ADAR is a well-conserved RNA-editing enzyme. It deaminates adenosine to inosine, which is read by ribosomes and reverse transcriptase as guanosine (Basilio et al. 1962; Bass and Weintraub 1988). ADAR consists of two modular parts, double-strand RNA-binding motifs (dsRBMs), and a catalytic domain (Kim et al. 1994; O'Connell et al. 1995; Macbeth et al. 2005). TRIBE replaces the dsRBMs of the Drosophila ADAR enzyme with the RBP of interest (McMahon et al. 2016). Fusion protein binding and mRNA specificity are therefore driven by the RBP due to the absence of the dsRBMs. TRIBE-dependent editing sites are identified by deep sequencing of RNA extracted from cells of interest. The editing percentage of any mRNA nucleotide is the number of adenosines edited to inosines divided by total number of reads at that site. In the initial TRIBE paper (McMahon et al. 2016), we used a conservative criterion to minimize false positives, namely, a site is only counted if it has an editing percentage >10%, and a read coverage greater than 20, i.e., at least two editing events/20 reads in both biological replicates.
Although we were successful in identifying bona fide targets of three RBPs (the Drosophila HnRNP protein Hrp48, the nuclear protein NonA and FMRP, the Drosophila version of the mammalian Fragile X protein), the number of targets identified by TRIBE in tissue culture was substantially reduced compared to CLIP data with the same RBP (McMahon et al. 2016). Even assuming some CLIP false positives (Darnell 2010; Lambert et al. 2014), TRIBE may still be experiencing a high false-negative problem. This could be due in part to innate editing specificity of the ADARcd. It preferentially edits adenosines bordered by 5′ uridines and 3′ guanosines, i.e., a UAG sequence (Lehmann and Bass 2000; Eggington et al. 2011), which is also observed in TRIBE (McMahon et al. 2016). The ADARcd also prefers to edit adenosines surrounded by a double-stranded region (Bass and Weintraub 1988; Kim et al. 1994; O'Connell et al. 1995; Eggington et al. 2011; Matthews et al. 2016). Consequently, a substantial fraction of RBP-target mRNAs may go unidentified.
To enhance the efficiency and/or reduce the specificity of the ADARcd, we turned our attention to a mutational screen directed at the catalytic domain of human ADAR2 (Kuttan and Bass 2012). It identified a “hyperactive” E488Q mutation, which was reported to have these precise characteristics. Ideally, incorporating this hyperactive E488Q mutation into TRIBE (HyperTRIBE) could reduce its false-negative rate.
Hrp48 HyperTRIBE, which carries this E488Q mutation within the Drosophila ADARcd (dADARcd), indeed identifies dramatically more editing sites than Hrp48 TRIBE. Many of these sites correspond to below-threshold TRIBE editing targets. This indicates that they are bona fide targets, which are edited more efficiently by HyperTRIBE. They also overlap much more successfully with CLIP data, indicating that HyperTRIBE has a much reduced false-negative problem and suggesting that it more faithfully recapitulates the known binding specificity of its RBP than TRIBE. The HyperTRIBE data also have mechanistic implications about the TRIBE fusion protein strategy as well as about the function of the ADARcd and its E488Q variant.
RESULTS
The original Hrp48 TRIBE construct was made by fusing coding DNA for the Hrp48 protein followed by a short linker and then dADARcd (henceforth called TRIBE). To make the companion HyperTRIBE construct, we first identified the dADARcd glutamate corresponding to human ADAR2 E488. That residue as well as the surrounding sequence is highly conserved between hADAR2cd and the dADARcd (data not shown), suggesting that it should function similarly in the Drosophila enzyme. We then introduced the E488Q mutation into the original Hrp48 TRIBE construct via QuikChange site-directed mutagenesis. We had no difficulty making a stable S2 cell line expressing Hrp48 TRIBE (McMahon et al. 2016) but failed with Hrp48 HyperTRIBE (henceforth called HyperTRIBE), perhaps because of the greatly enhanced editing frequency (see below). Most experiments were therefore performed by transiently expressing fusion proteins in Drosophila S2 cells together with GFP and sorting GFP-positive cells by FACS. Editing sites were always defined as the sites in common between two experiments, i.e., >10% editing and >20 reads/nucleotide in both biological replicates (McMahon et al. 2016). A third replicate of TRIBE minimally reduced the number of common sites, i.e., 80% were present in the third replicate (data not shown). Transient and stable expression of TRIBE in S2 cells detected comparable target gene numbers (200–300), ∼40% of which were identical (Supplemental Fig. S1). This rather low rate of overlap is likely due to the substantial difference between transiently and stably expressing cells.
Expression of HyperTRIBE resulted in approximately 20× the number of editing events compared to TRIBE (Fig. 1A). Expression of the ADARcd with E488Q mutation alone does not increase the number of editing events above the endogenous level of S2 cells (Fig. 1A). This is despite the fact that the HyperADARcd is stable and expressed at comparable levels to those of the other TRIBE constructs (data not shown), so most if not all editing by HyperTRIBE—like editing by regular TRIBE—requires the RNA-binding ability of the fused RBP (McMahon et al. 2016).
The ratio of HyperTRIBE-edited genes compared to TRIBE-edited genes is only eight (Fig. 1A), threefold lower than the editing site ratio, indicating a substantial increase in the number of edited sites per gene in HyperTRIBE. Indeed, HyperTRIBE generates many more multiple-edited genes with a median of three edited sites per gene compared to a median of one for TRIBE (Fig. 1B). Some HyperTRIBE editing sites are near the original TRIBE sites (Fig. 1C), suggesting that the higher editing rate of HyperTRIBE is due in part to its ability to edit multiple adenosines near the original ADARcd interacting region. Moreover, the data show that HyperTRIBE unique editing sites that are on the same molecule as common sites—within a single RNA-seq read—have a higher editing percentage than all unique editing sites (Supplemental Fig. S2). This indicates that the HyperADARcd may edit additional, nearby adenosines without fully releasing the mRNA.
To extend the enhanced editing of HyperTRIBE to Drosophila, we performed cell-sorting experiments on flies expressing HyperTRIBE in all adult fly brain neurons using the elav-gsg-Gal4 driver (Abruzzi et al. 2015; McMahon et al. 2016). Similar to the tissue culture result, HyperTRIBE exhibits an 11-fold increase in the number of edited sites and a fivefold increase in the number of edited genes compared to TRIBE (Fig. 1D).
We next compared the HyperTRIBE editing data with our previous CLIP data as well as with regular TRIBE editing data in Drosophila S2 cells. HyperTRIBE not only identifies 282 (97%) of the edited sites and 220 (98%) of the edited genes identified by TRIBE (Fig. 2A,B), but these data also correlate well with the CLIP results: 73% of the HyperTRIBE sites are identified by CLIP (78% for the TRIBE sites) (Fig. 2B). However, there is a striking difference of 66% versus 4% in the ability of HyperTRIBE and TRIBE to recognize CLIP-identified genes, respectively (Fig. 2B; Supplemental Fig. S3). This distinction is due to the many fewer sites and genes identified by TRIBE, indicating that HyperTRIBE significantly lowers the TRIBE false-negative rate and thereby provides a much more complete binding signature of the RBP. We obtained similar results with FMRP HyperTRIBE (Supplemental Fig. S4), another RBP assayed in the original TRIBE paper, indicating that the higher efficiency of HyperTRIBE is not limited to Hrp48.
Because only a minimal seven amino acid linker was used between Hrp48 and the ADARcd in the original TRIBE paper (McMahon et al. 2016) and in the HyperTRIBE assays shown to this point, we investigated whether expanding the linker and altering its character would impact HyperTRIBE editing. We imagined that a longer and more flexible linker might increase editing, perhaps even dramatically, or it might decrease editing if proximity to the target RNA was important. Editing efficiency is indeed decreased but only about twofold, even with a 200 AA flexible linker (Fig. 2C). This indicates that the precise relationship between the RBP and the ADARcd is not critical, also suggesting that other means of delivering the ADARcd to RNA should be possible, for example, via protein dimerization schemes (Stankunas et al. 2003).
To further address possible reasons for the additional HyperTRIBE editing, we compared the editing frequencies of the sites identified by both HyperTRIBE and TRIBE. We divided the HyperTRIBE editing sites into two categories, unique sites and common sites; the latter were also identified by TRIBE. Although HyperTRIBE edits its unique sites at similar frequencies to the much smaller number of TRIBE-edited sites, the common sites experience much higher editing frequencies with HyperTRIBE than with TRIBE (Fig. 3A). This is true not only on average but also when the sites are examined individually (Fig. 3B). This indicates that common sites are also preferred HyperTRIBE substrates.
Is it possible that the unique sites are also quantitatively rather than qualitatively different between HyperTRIBE and TRIBE? This suggests that many of them might be edited by TRIBE but below the required threshold. Indeed, there are 4017 adenosines that meet this criterion, i.e., at least one editing event for each specific adenosine in both replicates but with <10% editing percentage. The correspondence between replicates for these adenosines and HyperTRIBE editing sites is highly significant (Z-test performed, P-value ≈ 0), and more than 30% of these 4017 adenosines correspond to HyperTRIBE editing sites (Fig. 3C). This number is much greater than the number of C-to-T editing/mutations, which serve as control events (data not shown). Notably, these 4017 below-threshold editing sites occur in 3473 different genes, which overlap well with genes identified by Hrp48 HyperTRIBE and Hrp48-ADARcd CLIP (Fig. 3D). We conclude that below-threshold TRIBE-edited adenosines make a significant contribution to the extra HyperTRIBE editing sites and that much of the distinction between HyperTRIBE and TRIBE is quantitative.
The ADARcd with the hyperactive mutation (E488Q) has been shown to have less neighboring sequence preference surrounding the edited adenosine (Kuttan and Bass 2012). Not surprisingly perhaps, this preference for 5′ uridines and 3′ guanosines is also reduced for the sites identified by HyperTRIBE compared to TRIBE (Fig. 4A). This result indicates that much if not most of this neighboring sequence preference derives from the ADARcd, despite an indication that the 3′ guanosine preference might derive from the ADAR dsRBDs (Stefl et al. 2010). Although binding could be sequential, i.e., first dsRBD recognition followed by a conformational change and ADARcd recognition, the ADARcd appears to contain most of the specificity (Matthews et al. 2016).
We also calculated the preference of TRIBE and HyperTRIBE for local double-stranded structure surrounding the edited adenosines. Based on the preference of Hrp48 for mRNA 3′UTR regions, random 3′UTR sequences of the same length and centered on an adenosine were used as a control. This strategy was similar to that used previously for TRIBE sites (McMahon et al. 2016), but many more sites were included in this calculation.
As previously observed for TRIBE (McMahon et al. 2016), there is a strong HyperTRIBE preference for double-stranded structure surrounding a relatively unstructured (bulged) adenosine, features that are absent from the control 3′UTR sequences (Fig. 4B). Notably, the 3′ side of the TRIBE editing sites was more structured than the 5′ side. This asymmetry was not present in the HyperTRIBE sites, indicating a less stringent requirement of HyperTRIBE for substrate structure on the 3′ side of the editing sites (see Discussion). Not surprisingly, the below-threshold sites appear similar to the HyperTRIBE sites (Fig. 4B) despite the higher HyperTRIBE editing frequency.
To address the edited sites distribution in different mRNA regions, we compared the fraction of sites between coding, 5′UTR and 3′UTR sequences. This distribution is an indication of RBP specificity since Hrp48 preferentially binds to mRNA 3′UTR regions with CLIP as well as with TRIBE assays (McMahon et al. 2016). When normalized for read coverage in the different mRNA regions, HyperTRIBE still preferentially edits 3′UTR sites but with a 3.5-fold preference, somewhat less than the fivefold preference of TRIBE (Supplemental Fig. S5). Moreover, coding sequence (CDS) editing preference has increased in HyperTRIBE compared to TRIBE, from 0.4 to 0.7. This is probably due at least in part to the enhanced ability of HyperTRIBE to edit more efficiently adenosines further from the RBP binding sites.
To address this issue directly, we analyzed the distance of the edited sites from the Hrp48-ADARcd CLIP peaks. They presumably locate the positions of RBP binding. The HyperTRIBE-edited sites are indeed located further away from the CLIP peaks comparing to the TRIBE sites (Fig. 5A). For example, HyperTRIBE has only 40% of its sites located within 100 nt of the CLIP peaks versus 60% for TRIBE, whereas the fraction of edited sites further than 500 nt from the CLIP peaks is 29% and 15%, respectively (Fig. 5A). Presumably, more of the distant sites are poorly edited by TRIBE and then fall below the required threshold (see below). Not surprisingly perhaps, these below-threshold TRIBE sites are also located further from the CLIP peaks than regular (above-threshold) TRIBE sites (Fig. 5A). Proximity presumably contributes to a higher editing frequency.
A structural comparison between edited sites at different distances from the CLIP peaks is revealing. The closer sites, <100 nt away from the CLIP peaks, have a relaxed structural requirement compared to more distant sites (Fig. 5B). Because these distance effects should not impact local RNA structure, they suggest that RNA looping contributes quantitatively to the Kd of ADARcd binding (Fig. 6; see Discussion) and that local structure serves to localize the ADARcd moiety rather than only for dsRBD binding or for editing site selection (Stephens et al. 2004; Eggington et al. 2011). In other words, editing sites that are closer to the Hrp48 binding sites are more efficiently edited and can tolerate weaker ADARcd binding than sites that are further away. The recent identification of a conserved RNA-binding loop within hADAR2cd (Matthews et al. 2016) also indicates that the ADARcd normally complements the dsRBDs and contributes to substrate binding, perhaps by stabilizing the RNA–protein complex.
DISCUSSION
Our recently developed TRIBE method (McMahon et al. 2016) expresses in vivo a chimeric protein, which is a fusion of an RBP of interest to the catalytic domain of ADAR. TRIBE performs adenosine-to-inosine editing on RNAs recognized by the RNA-binding moiety of the protein, i.e., the RBP. However, TRIBE-mediated editing is quite selective and probably gives rise to a high false-negative rate for identification of RBP target RNAs. We present here the in vivo characterization of HyperTRIBE, which carries the hyperactive E488Q mutation within the ADARcd.
The HyperTRIBE results overlap much more successfully than TRIBE results with our previously published Hrp48 CLIP results. These CLIP data were very similar, whether from endogenous Hrp48 or from overexpressed Hrp48 TRIBE fusion protein (McMahon et al. 2016), indicating that the TRIBE protein interacts with RNA similarly to endogenous Hrp48. The much better overlap suggests that HyperTRIBE has a much reduced false-negative problem compared to TRIBE. A different HyperTRIBE fusion protein, containing the RBP FMRP, also has more editing sites and genes compared to regular TRIBE with this RBP (Supplemental Fig. S4; McMahon et al. 2016). Importantly, perfect overlap with CLIP data is not necessarily expected, i.e., iCLIP data may identify its own set of false positives, meaning RNAs of little interest. They could, for example, reflect low affinity, transient interactions with transcripts that cross-link efficiently to the RBP.
Might there be a comparable source of HyperTRIBE false positives? Activity by the HyperADARcd alone is a likely candidate, and we are aware of editing by the human HyperADAR1-cd in other systems (Wang et al. 2015). However, we never obtained any substantial editing with expression of only the Drosophila HyperADARcd (e.g., Fig. 1A), despite comparable expression levels to the HyperTRIBE fusion protein (data not shown).
A conservative threshold of 10% editing was initially established to ensure that the TRIBE assay was not impacted by substantial numbers of false positives (McMahon et al. 2016). We began this HyperTRIBE study with the same 10% threshold, so that the new data could be directly compared with our previous TRIBE data. Strikingly, HyperTRIBE identifies many more sites and genes above this threshold, consistent with the original characterization of the E488Q mutation in yeast (Kuttan and Bass 2012). More surprising was the realization that many of the new HyperTRIBE >10% editing sites correspond to proper TRIBE editing sites but with editing frequencies well below the 10% threshold. The correspondence of these sites not only with HyperTRIBE sites but between replicate TRIBE experiments further indicates that they are genuine editing sites. Three conclusions from these data are (i) HyperTRIBE edits more efficiently than TRIBE; (ii) even very low editing frequencies identify true TRIBE editing sites if they are identified in replicate experiments; and (iii) there are many more bona fide TRIBE editing events than are identified with conservative thresholds. Nonetheless, many TRIBE sites will require substantial sequencing depth to be identified. Much of this increased sequencing can probably be avoided by using HyperTRIBE, which should more generally be a superior approach for the identification of RBP targets in mammalian as well as fly cells and neurons.
Nonetheless, the fraction of edited adenosines is low even for HyperTRIBE, presumably still reflecting the sequence and structural requirements of the ADARcd, i.e., the nearest neighbor sequence preference and the double-stranded character surrounding a bulged adenosine. This structural landscape is very different from that of random 3′UTR sequences (Fig. 4B), suggesting that the ability to form a local intramolecular helix is important for HyperTRIBE as well as for TRIBE. Nonetheless, HyperTRIBE has a clear reduction in these sequence and structural requirements (Fig. 4A,B). The relative lack of structure on the 3′ side of the HyperTRIBE and the below-threshold editing sites is particularly striking (Fig. 4B). The asymmetry is probably less important for HyperTRIBE editing, which may reflect differences in rate-limiting steps, e.g., the ADARcd binding affinity/off-rate is more important for the slower TRIBE enzyme (see below).
The distance effects (Fig. 5A) are presumably a simple consequence of RNA looping between the RBP binding site and the edited region. Similar to the effect of increased loop size on the stability of a RNA hairpin, the stability of a weak protein–RNA interaction should be negatively impacted by a bigger loop size, i.e., a greater distance between the RBP binding site and the RNA-editing substrate region. Taken together with the relaxed structural constraints on editing sites close to the Hrp48 binding sites (Fig. 5B), the distance effects suggest that the binding affinity of the ADARcd to the substrate region impacts editing efficiency.
Importantly, looping provides a large positive effect, by increasing the local substrate concentration and overcoming an RNA substrate–ADARcd interaction that is otherwise too weak to generate substantial numbers of editing events in vivo (Fig. 6). The RBP moiety should provide all of the mRNA specificity and almost all of the RNA affinity, for HyperTRIBE as well as for TRIBE, whereas the ADARcd and its binding specificity determine the precise mRNA regions and adenosine(s) that will be edited. The HyperTRIBE data more generally indicate that the broad enhancement of editing frequency by the E488Q mutation can be successfully divorced from substrate binding; this enhancement probably requires that the ADARcd–adenosine interaction is thermodynamically supported by a much more stable RBP–mRNA interaction. A long dwell time of the RBP can also allow the ADARcd to sample different mRNA regions and even different RNA configurations, i.e., the local helix surrounding the substrate adenosine (Fig. 6) may be in dynamic equilibrium with less structured configurations but briefly stabilized by binding of the ADARcd.
As previously discussed (Kuttan and Bass 2012; Matthews et al. 2016), the E488Q mutation either enhances base flipping, perhaps by enhanced amino acid insertion by the HyperTRIBE ADARcd into the RNA A helix, or it has a much higher rate of successful editing/flipping events. Notably, the ADARcd loop that occupies the displaced A during flipping (Matthews et al. 2016) contains amino acid 488. Better binding of the HyperTRIBE ADARcd to its substrate region might enhance the transition state lifetime and therefore editing efficiency (Kuttan and Bass 2012; Matthews et al. 2016). This is further suggested by the impact of distance from the CLIP sites on editing efficiency as well as on the structural requirement surrounding the edited adenosine (Fig. 5). A positive relationship between editing efficiency and substrate binding is also indicated by the enhanced structural requirement of TRIBE sites as compared to below-threshold or HyperTRIBE sites (Fig. 4B). Indeed, we speculate that enhanced editing of the common sites by TRIBE, which is comparable to the editing of the HyperTRIBE unique sites (Fig. 3A), is achieved by better TRIBE binding to these common sites because of the increased structure on the 3′ side of these edited adenosines (Fig. 4B).
MATERIALS AND METHODS
Molecular biology
RBP-ADARcd with E488Q mutation was created by performing Quikchange Site-directed Mutagenesis on pMT-RBP-ADARcd-V5 plasmid (McMahon et al. 2016). Primers 5′-TCGAGTCCGGTCA GGGGACGATTCC and 5′-GGAATCGTCCCCTGACCGGACTC GA were used to induce point mutation to the underlined nucleotide. Fifteen AA, 50 AA, and 100 AA flexible linkers and 15 AA, 50 AA rigid linkers (Amet et al. 2009) were chemically synthesized by Integrated DNA Technologies, Inc. and cloned into pMT-Hrp48-ADARcd-E488Q-V5 plasmid using Gibson Assembly from NEB. The other linkers were created by PCR duplicating the fragment and cloning. Transient expression of TRIBE constructs was performed by cotransfecting pMT TRIBE plasmids with pActin-EGFP to Drosophila S2 cells using Cellfectin II from Thermo Fisher Scientific. Cells were allowed 48 h after transfection for adequate expression of GFP before sorting with a BD FACSAria II machine for GFP-positive cells. Total RNA was extracted from the sorted cells with TRIzol LS reagent. TRIBE protein expression was induced with copper sulfate 24 h before FACS sorting. Expression of all fusion proteins was assayed by transient expression in S2 cells and western blot against V5 tag (Invitrogen, 46-1157). TRIBE stable cell lines used in this paper are the same as the original TRIBE paper (McMahon et al. 2016). GFP-labeled neurons are manually selected by using a glass micropipette from dissected, digested, and triturated fly brains as previously described (Abruzzi et al. 2015; McMahon et al. 2016).
Standard Illumina TruSeq RNA Library Kit was used to construct an RNA-seq library from S2 cells. Manually sorted cells were subjected to RNA-seq library protocol as previously described (Abruzzi et al. 2015; McMahon et al. 2016). All libraries were sequenced by Illumina NextSeq 500 sequencing system using NextSeq High Output Kit v2 (75 cycles). Each sample was covered by ∼20 million raw reads.
RNA-editing analysis
The criteria for RNA-editing events were as follows: (i) The nucleotide is covered by a minimum of 20 reads in each replicate; (ii) more than 80% of genomic DNA reads at this nucleotide are A with zero G (use the reverse complement if annotated gene is in the reverse strand); (iii) a minimum of 10% G is observed at this site in mRNA (or C for the reverse strand). Genomic DNA of S2 cells and background fly strain were sequenced to identify and exclude possible polymorphisms on the DNA level. RNA sequencing data were analyzed as previously described (Rodriguez et al. 2012; McMahon et al. 2016), with minor modifications. Background editing sites found in samples expressing Hyper-ADARcd alone were subtracted from the TRIBE identified editing sites both in S2 cells and in fly neurons. Overlap of editing sites from two data sets was identified using “bedtools intersect” with parameters “-f 0.9 -r”.
Quantification of RNA sequencing reads distribution was performed with read_distribution.py script in RSeQC v2.3.7 (Wang et al. 2012). RNA structure folding analysis was carried out with UNAFold (Markham and Zuker 2008) on flanking sequences of Hrp48 TRIBE editing sites, Hyper TRIBE editing sites, TRIBE below-threshold editing sites, and random 3′ UTR sites centered around an “A,” which excludes any genes with Hrp48 CLIP or Hyper TRIBE signal. One thousand sites were blindly selected in each pool, except 221 sites were selected from TRIBE edit sites due to its small size. In order to remove oversampling of any region, only one site was considered per gene. A flanking region of 250 nt both 5′ and 3′ to the editing site or random site was folded with UNAFold parameters (hybrid-ss-min --suffix DAT --mfold=5,8,200 --noisolate). Base-pairing was counted in the predicted minimum free energy (MFE) and predicted suboptimal structures. A profile of double-strandedness was created for each sequence, which was then averaged over all 500 sequences and plotted.
Fly lines
HyperTRIBE injection plasmid was generated by site-directed mutagenesis of pJFRC7-20xUAS-Hrp48-ADARcd-V5 plasmid to introduce the E488Q mutation. The Hyper-ADARcd only construct was generated in the same way. The transgenes were injected by Rainbow Transgenic Flies, Inc., UAS-RBP-ADARcd-V5; UAS-eGFP flies (Bloomington stock center #1522) were crossed to the elav-gsg-Gal4 driver line to allow adult-only expression of the fusion proteins in all neurons due to the lethality of constitutive pan-neuronal expression (Osterwalder et al. 2001; McMahon et al. 2016). Prior to manual cell sorting, food containing RU486 (0.2 µg/mL, Sigma-Aldrich) was used to induce transgene expression in young flies (∼3 d old) for 3 d.
DATA DEPOSITION
The accession number for the raw sequencing data and processed RNA-editing tracks reported in this paper is GSE102814 (NCBI GEO). The scripts used in the analysis are available on GitHub (https://github.com/rosbashlab/TRIBE).
SUPPLEMENTAL MATERIAL
Supplemental material is available for this article.
COMPETING INTEREST STATEMENT
The authors declare that a PCT patent application (PCT patent application no.: PCT/US2016/05425) has been filed based on the HyperTRIBE method described in this paper.
Supplementary Material
ACKNOWLEDGMENTS
Special thanks are due to Aoife McMahon, a prior laboratory member, for her important input into this work. We thank Joshua Rosenthal, Brenda Bass, Amy Lee, and Aoife McMahon for helpful comments on a very early version of this manuscript as well as current Rosbash laboratory members for comments and discussion. The work was supported by the Howard Hughes Medical Institute and by a National Institutes of Health EUREKA grant (DA037721).
Author contributions: W.X., M.R., and Aoife McMahon are responsible for conceptualization; W.X. performed the investigation; R.R. contributed to software; W.X. and R.R. contributed to formal analysis; W.X. and R.R. contributed to visualization; W.X. and M.R. wrote the original draft; M.R. acquired funding.
Footnotes
Article is online at http://www.rnajournal.org/cgi/doi/10.1261/rna.064691.117.
Freely available online through the RNA Open Access option.
REFERENCES
- Abruzzi K, Chen X, Nagoshi E, Zadina A, Rosbash M. 2015. RNA-seq profiling of small numbers of Drosophila neurons. Methods Enzymol 551: 369–386. [DOI] [PubMed] [Google Scholar]
- Amet N, Lee HF, Shen WC. 2009. Insertion of the designed helical linker led to increased expression of tf-based fusion proteins. Pharm Res 26: 523–528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Basilio C, Wahba AJ, Lengyel P, Speyer JF, Ochoa S. 1962. Synthetic polynucleotides and the amino acid code. V. Proc Natl Acad Sci 48: 613–616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bass BL, Weintraub H. 1988. An unwinding activity that covalently modifies its double-stranded RNA substrate. Cell 55: 1089–1098. [DOI] [PubMed] [Google Scholar]
- Darnell RB. 2010. HITS-CLIP: panoramic views of protein-RNA regulation in living cells. Wiley Interdiscip Rev RNA 1: 266–286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eggington JM, Greene T, Bass BL. 2011. Predicting sites of ADAR editing in double-stranded RNA. Nat Commun 2: 319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerstberger S, Hafner M, Tuschl T. 2014. A census of human RNA-binding proteins. Nat Rev Genet 15: 829–845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano M Jr, Jungkamp AC, Munschauer M, et al. 2010. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell 141: 129–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jansen RP. 2001. mRNA localization: message on the move. Nat Rev Mol Cell Biol 2: 247–256. [DOI] [PubMed] [Google Scholar]
- Kim U, Wang Y, Sanford T, Zeng Y, Nishikura K. 1994. Molecular cloning of cDNA for double-stranded RNA adenosine deaminase, a candidate enzyme for nuclear RNA editing. Proc Natl Acad Sci 91: 11457–11461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuttan A, Bass BL. 2012. Mechanistic insights into editing-site specificity of ADARs. Proc Natl Acad Sci 109: E3295–E3304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lambert N, Robertson A, Jangi M, McGeary S, Sharp PA, Burge CB. 2014. RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. Mol Cell 54: 887–900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lehmann KA, Bass BL. 2000. Double-stranded RNA adenosine deaminases ADAR1 and ADAR2 have overlapping specificities. Biochemistry 39: 12875–12884. [DOI] [PubMed] [Google Scholar]
- Macbeth MR, Schubert HL, Vandemark AP, Lingam AT, Hill CP, Bass BL. 2005. Inositol hexakisphosphate is bound in the ADAR2 core and required for RNA editing. Science 309: 1534–1539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Markham NR, Zuker M. 2008. UNAFold: software for nucleic acid folding and hybridization. Methods Mol Biol 453: 3–31. [DOI] [PubMed] [Google Scholar]
- Matthews MM, Thomas JM, Zheng Y, Tran K, Phelps KJ, Scott AI, Havel J, Fisher AJ, Beal PA. 2016. Structures of human ADAR2 bound to dsRNA reveal base-flipping mechanism and basis for site selectivity. Nat Struct Mol Biol 23: 426–433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McMahon AC, Rahman R, Jin H, Shen JL, Fieldsend A, Luo W, Rosbash M. 2016. TRIBE: hijacking an RNA-editing enzyme to identify cell-specific targets of RNA-binding proteins. Cell 165: 742–753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nussbacher JK, Batra R, Lagier-Tourenne C, Yeo GW. 2015. RNA-binding proteins in neurodegeneration: seq and you shall receive. Trends Neurosci 38: 226–236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O'Connell MA, Krause S, Higuchi M, Hsuan JJ, Totty NF, Jenny A, Keller W. 1995. Cloning of cDNAs encoding mammalian double-stranded RNA-specific adenosine deaminase. Mol Cell Biol 15: 1389–1397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osterwalder T, Yoon KS, White BH, Keshishian H. 2001. A conditional tissue-specific transgene expression system using inducible GAL4. Proc Natl Acad Sci 98: 12596–12601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rodriguez J, Menet JS, Rosbash M. 2012. Nascent-seq indicates widespread cotranscriptional RNA editing in Drosophila. Mol Cell 47: 27–37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stankunas K, Bayle JH, Gestwicki JE, Lin YM, Wandless TJ, Crabtree GR. 2003. Conditional protein alleles using knockin mice and a chemical inducer of dimerization. Mol Cell 12: 1615–1624. [DOI] [PubMed] [Google Scholar]
- Stefl R, Oberstrass FC, Hood JL, Jourdan M, Zimmermann M, Skrisovska L, Maris C, Peng L, Hofr C, Emeson RB, et al. 2010. The solution structure of the ADAR2 dsRBM-RNA complex reveals a sequence-specific readout of the minor groove. Cell 143: 225–237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stephens OM, Haudenschild BL, Beal PA. 2004. The binding selectivity of ADAR2's dsRBMs contributes to RNA-editing selectivity. Chem Biol 11: 1239–1250. [DOI] [PubMed] [Google Scholar]
- Szostak E, Gebauer F. 2013. Translational control by 3′-UTR-binding proteins. Brief Funct Genomics 12: 58–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ule J, Jensen K, Mele A, Darnell RB. 2005. CLIP: a method for identifying protein-RNA interaction sites in living cells. Methods 37: 376–386. [DOI] [PubMed] [Google Scholar]
- Wang L, Wang S, Li W. 2012. RSeQC: quality control of RNA-seq experiments. Bioinformatics 28: 2184–2185. [DOI] [PubMed] [Google Scholar]
- Wang Y, Havel J, Beal PA. 2015. A phenotypic screen for functional mutants of human adenosine deaminase acting on RNA 1. ACS Chem Biol 10: 2512–2519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Witten JT, Ule J. 2011. Understanding splicing regulation through RNA splicing maps. Trends Genet 27: 89–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao J, Hyman LE, Moore C. 1999. Formation of mRNA 3′ ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis. Microbiol Mol Biol Rev 63: 405–445. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.