Abstract
Millions of adenosines are deaminated throughout the transcriptome by ADAR1 and/or ADAR2 at varying levels, raising the question of what are the determinants guiding substrate specificity and how these differ between the two enzymes. We monitor how secondary structure modulates ADAR2 vs ADAR1 substrate selectivity, on the basis of systematic probing of thousands of synthetic sequences transfected into cell lines expressing exclusively ADAR1 or ADAR2. Both enzymes induce symmetric, strand-specific editing, yet with distinct offsets with respect to structural disruptions: −26 nt for ADAR2 and −35 nt for ADAR1. We unravel the basis for these differences in offsets through mutants, domain-swaps, and ADAR homologs, and find it to be encoded by the differential RNA binding domain (RBD) architecture. Finally, we demonstrate that this offset-enhanced editing can allow an improved design of ADAR2-recruiting therapeutics, with proof-of-concept experiments demonstrating increased on-target and potentially decreased off-target editing.
Subject terms: RNA, Transcriptomics, Transcriptomics
Human ADAR1 and ADAR2 edit millions of adenosines transcriptome-wide, altering RNA structure. Here the authors show that variations in RNA binding domains influence site-specific editing, enhancing ADAR2-targeted therapeutics.
Introduction
Millions of adenosines are deaminated into inosines transcriptome-wide1,2, catalyzed by two deaminating enzymes, ADAR1 (ADAR) and ADAR2 (ADARB1). Inosine is perceived as guanosine by the internal cellular machinery, and hence editing can result in protein recoding3–7, alternative splicing8–10, and alterations in targeting and maturation of microRNA11,12. In parallel, editing can also alter the RNA secondary structure, and in doing so modulate the immunogenicity of self and viral RNAs within cells13–17. In accordance with the wide distribution of edited sites, abnormal dysregulation of A-to-I deamination has been associated with a broad spectrum of human diseases18, and targeting of ADAR enzymes is an emerging therapeutic strategy in cancer19.
Different adenosines throughout the transcriptome are edited at dramatically different efficiencies (or not at all), begging the question of what governs enzymatic selectivity towards specific targets. Understanding the rules guiding these two enzymes to their diverse targets is of intense interest not only from biological and pathological perspectives, but also from therapeutic ones. In recent years, unraveling the rules dictating deamination via the ADAR enzymes has accrued substantial interest in the context of ongoing efforts to achieve targeted mRNA editing. Targeted editing is emerging as a therapeutic modality that may potentially offer a safer alternative to correct single-nucleotide mutations20 in comparison to CRISPR-mediated DNA editing. Diverse approaches have been implemented in recent years to recruit ADAR enzymes towards specific substrates21–25. Although successful, these attempts often resulted only in partial efficiencies and in some cases also with considerable off-target effects26. Improving our understanding of the rules guiding inosine formation and of the factors determining enzyme specificity will pave the path toward the development of both more optimal editors and improved guides.
Studies exploring the targeting efficiencies of ADAR1 and ADAR2 have revealed several general principles. First, the specificity of these two enzymes is only partially overlapping27,28, suggesting differences in the selectivity of these two enzymes. Second, RNA secondary structure plays a critical role. ADAR1 targets are nearly inevitably within long double-stranded RNAs29, and hence highly enriched in repetitive elements such as Alu and long interspersed elements30–32. ADAR2 targets tend to be in duplex regions interrupted by mismatches or loops33–36, and in-vitro work has shown that distal bulges can, at times, impact editing efficiency34. Yet, the structural rules governing editing—which would be of critical importance for predictive models—are not understood. In addition, for both ADAR1 and ADAR2, A-C mismatch harboring targets are particularly prone to undergo editing37. Finally, the sequence also plays a role in target selectivity. In-vitro editing assays with artificial RNA duplexes revealed that ADAR1 and ADAR2 prefer to edit adenosines depleted of G’s at the position preceding the target and show some bias for a G downstream of the target edited site35,38,39.
The factors underlying the differences in specificity between ADAR1 and ADAR2 are understood only to a limited extent. These two paralog proteins, which likely evolved via a gene duplication event roughly 700 million years ago40, differ in their protein domain architecture. The catalytic domain, present on both ADARs, was shown to play a role in the definition of selectivity37,41. In addition to the catalytic domain, human ADAR1 contains either one or two Zα domains (dependent on the isoform) and three RNA binding domains (RBDs) whereas human ADAR2 contains two RBDs but no Zα domains. The RBDs participate in dsRNA substrate recognition and RNA binding42, and were suggested to partially mediate ADAR selectivity via both sequence-specific and non-specific mechanisms34,43. The Z domains, binding left-handed nucleic acids, have been implicated in allowing co-transcriptional binding of ADAR1 to nascent RNAs44,45 and more recently in preventing Z-RNA dependent activation of pathogenic interferon by Z-DNA binding protein 1 (ZBP1)46–48. Whether they have a role in defining substrate specificity is unclear.
To systematically dissect how substrate selectivity by ADAR1 is governed by secondary structure, we previously screened ADAR1-mediated editing across thousands of sequence variants, which had been designed to systematically perturb the secondary structure along two highly double-stranded backbones. We discovered that the introduction of structural disruptions within an otherwise perfect double-stranded RNA structure gives rise to robust and predictable ADAR1-mediated editing at a fixed offset of 35 base pairs (bp) upstream from the disruption49. Whether structural disruptions of ADAR2 targets lead to editing at a fixed offset, and what the mechanistic basis for this offset is, remains unknown.
Here, we systematically monitor how secondary structure modulates ADAR2 substrate selectivity, on the basis of systematic probing of thousands of synthetic sequences transfected into ADAR1-deleted cell lines exogenously expressing ADAR2. We find that similarly to ADAR1, structural disruptions give rise to symmetric, strand-specific induced editing at a fixed offset. However, in contrast to ADAR1 acting at a −35 bp offset, in the case of ADAR2, structural disruptions give rise to induced editing at an offset of −26 bp. We dissect the basis for the differences in offset between ADAR1 and ADAR2 via diverse mutants, domain-swaps and ADAR evolutionary homologs. We uncover that the difference in the offset is encoded by the differential RNA binding domain architecture of the two ADARs, yet that it is not determined by the number of RBDs. We demonstrate that this understanding of ADAR2 specificity can allow an improved design of ADAR2-recruiting therapeutics, yielding increased on-target editing, with some evidence also for reduced off-target editing. Our findings provide comprehensive insight into the features determining ADAR2 substrate selectivity and into the roles of the RNA binding domains of ADAR1 and ADAR2 in mediating differential targeting, and should facilitate the design of improved ADAR2-recruiting therapeutics.
Results
Screening of ADAR2 substrates
We sought to systematically compare the targeting specificity of ADAR2 to its ADAR1 counterpart. Toward this goal, we employed a pool of thousands of sequence variants that we had previously designed to probe the specificity of ADAR1, described in49. In brief, these sequence variants are based on two distinct backbones folding into a perfect hairpin structure: the endogenous mouse B2 element, serving as a more natural editing target, and a sequence complementary to the 3’ UTR of the fluorescent reporter mNeonGreen (mNG) transcript, serving as a completely synthetic target. In both cases, this hairpin consists of a 146-nt long stem and a 46-nt long loop (Fig. 1A). For each of these two backbones, we previously designed and synthesized roughly two thousand sequence variants systematically perturbing the hairpin structure via random structural disruptions, systematic incorporation of single, double, or random mismatches, the introduction of pyrimidine-rich bulges, and systematic shortening or elongation of the stem (Fig. 1B). These perturbations were all designed to take place in the ‘lower’ arm of the stem structure, whereas the ‘upper’ arm remained constant. We transfected each oligo library into ADAR1-knockout HEK293T cells (in which ADAR2 is not expressed)50, alongside a plasmid expressing either ADAR2 or ADAR1 or neither of the two (‘No-ADAR’), as a negative control. Subsequently, RNA was extracted, and the constant upper arm of each construct was reverse transcribed, PCR-amplified, and sequenced (Fig. 1C).
All B2 and mNG constructs were detected across all treatments with a mean coverage of ~4000 reads per barcode per sample across all conditions (ADAR1, ADAR2, and No-ADAR). No editing was observed in ADAR1 KO cells transfected with the No-ADAR vector, corroborating that all deamination activity is triggered by the two exogenously overexpressed ADAR enzymes (Fig. 1D). In ADAR-expressing cells, editing percentages between technical duplicates were highly reproducible (r > 0.99, P < 2.2e-16 for all treatments) (Fig. 1E & Figure S1A). In addition, editing measurements were independent of barcode identity, as was assessed by comparing editing levels at a subset of identical sequences with distinct barcodes (Fig. 1F & Figure S1B).
The editing patterns in the B2 and mNG constructs following ADAR1 overexpression were well correlated with ones observed in WT HEK293T cells in which ADAR1 is expressed at endogenous levels (Fig. 1D), suggesting that ADAR overexpression is a valid approach for interrogating the rules defining the substrate specificities of ADAR enzymes. The editing patterns following overexpression of ADAR1 and ADAR2 were substantially less correlated, in line with previous reports indicating their only partially overlapping target specificity27,28 (Fig. 1D). We also note that ADAR2 overexpression gave rise to higher levels of editing in comparison to ADAR1, in line with previous reports51. Western blot analysis demonstrated elevated expression levels of ADAR2 compared to ADAR1 within our system, indicating that the differences in absolute levels may be attributed to the increased expression of ADAR2 (Figure S1C). In mNG constructs, a median of ~22 out of 44 adenosines per molecule was edited in ADAR2-overexpressing cells, in comparison to ~18 in ADAR1-overexpressing counterparts (Fig. 1G). This trend was even more pronounced, with ~3 and ~12 out of 41 edited sites per molecule in B2 constructs in ADAR1- and ADAR2-overexpressing cells, respectively.
As an additional quality control, we assessed editing levels across a series of constructs in which the double-stranded stem was randomly disrupted to varying levels. Consistent with our expectations, we found that editing by both ADAR1 and ADAR2 was continuously disrupted with progressive disruption of the secondary structure (Fig. 1H & Figure S1D). ADAR2 was slightly more resilient to the introduction of structural disruptions, consistent with previous studies showing that ADAR2 can efficiently edit shorter double-stranded substrates than ADAR138,52. Collectively, these analyses establish that the two synthetic constructs and their perturbed counterparts are edited by both ADAR1 and ADAR2, yet these two enzymes are associated with both varying levels and different patterns of editing.
ADAR2-mediated editing is induced 26 nt upstream of structural disruptions
We next sought to assess whether structural disruptions within dsRNAs induce ADAR2-mediated editing at a fixed offset, given our previous discoveries of a −35 bp offset for ADAR149. To explore this, we analyzed the series of constructs into which we had systematically introduced secondary-structure disrupting sequences—either in the form of mismatches or of bulges—throughout the stem. Indeed, in both ADAR1- and ADAR2-overexpressing cells, increased editing levels were observed at a fixed offset (Fig. 2A–C & Figure S2A). In the case of ADAR1-overexpressing cells, we recapitulated our previous observations of increased editing levels 35 bp upstream and 30 bp downstream of structural disruptions (Fig. 2B, C)49. In contrast, in ADAR2-overexpressing cells, structural disruptions led to increased editing levels in a window between 23-31 nt upstream of the structural disruption, peaking at position −26 nt (Fig. 2B, C). Though the magnitude of the increase in editing levels at position −26 following ADAR2 overexpression was lower than the increase at position −35 following ADAR1 overexpression (~1.3–1.5 mean fold at position −26 in comparison to ~3.3-6 mean fold at position −35), the phenomenon was reproducibly observed across the two different constructs as well as using different forms of structural disruption including mismatches of varying lengths (Fig. 2B & Figure S2B–G) and pyrimidine-rich bulges (Fig. 2C & Figure S3A–C). The increase in editing levels at position −35 and −26 in ADAR1- and ADAR2-overexpressing cells, respectively, was dependent on the size of the mismatch, with the highest median editing increase observed in constructs carrying 3 nucleotide mismatches (Fig. 2D). In parallel, the introduction of mismatches also led to a reproducible negative signal (indicative of adenosines resistant to editing) that was distributed in a complex—yet highly reproducible—manner with respect to the structural mismatches. The negative signal extended between positions −26 and +29. The signal was at its minimum at position 0 and +1 for ADAR1 and ADAR2, respectively, consistent with previous reports53, with two local maximums at positions −9 and positions +6/+7 in both ADAR1 and ADAR2, and an additional ADAR2-specific local maximum at position +15 (Fig. 2B).
In the case of ADAR1, we had previously found that structural disruptions led to a symmetric induction of editing, resulting in induced editing 35 bp upstream of the structural disruption on the ‘upper’ arm of the dsRNA, and in parallel also resulting in induced editing 35 bp upstream of the structural disruption on the ‘lower’ arm49. Given that all results obtained thus far had only been on the basis of sequencing of the ‘upper’ (and invariable) arm, we next amplified and sequenced the ‘lower’ variable arm of each B2 construct from ADAR1-KO HEK293T cells overexpressing ADAR2 (Fig. 2E). A prominent peak 26 bp upstream from the structural disruption was observed on the opposite strand (Fig. 2F), indicating that the induction of editing by human ADAR2 is symmetric and orientation-dependent at a fixed interval as was the case for ADAR1, but in this case 26 nt upstream from structural disruptions.
Differences in editing offsets among ADARs are mediated by double-stranded RNA binding domains
We next sought to understand why structural disruptions led to editing at an offset of 35 nt in the case of ADAR1, but of 26 nt in the case of ADAR2. To explore whether the offset was dictated by the catalytic domain of the two ADAR enzymes or by the RBDs, we designed two ADAR variants, by swapping the RBD domains among the ADARs: (1) An ‘ADAR2-RBDs_ADAR1-deaminase’ variant, harboring the catalytic domain of ADAR1 fused to the two RBDs originating from ADAR2, and (2) An ‘ADAR1-RBDs_ADAR2-deaminase’ variant, harboring the catalytic domain of ADAR2 fused to the three RBDs originating from ADAR1. We next used the above-described human ADAR1-depleted system, into which we transfected B2 and mNG oligo libraries along with plasmids overexpressing these two ADAR variants. The two hybrids gave rise to deamination activity on both B2 and mNG positive control constructs (Figure S4A, B), albeit at substantially reduced levels in comparison to the WT counterparts (Fig. 3A). Remarkably, we found that the offset size segregated with RBDs: ‘ADAR1-RBDs_ADAR2-deaminase’ showed induced editing levels at position −35, recapitulating the patterns observed in WT ADAR1 expressing cells. In parallel, ‘ADAR2-RBDs_ADAR1-deaminase’ exhibited induced activity at roughly −30 nt, as had similarly been observed for ADAR2 (Fig. 3B, C; Figure S5A, B; Figure S5E, F). Thus, these findings suggest that the size of the offset is encoded by the differential RBD architecture.
How do the different RBDs give rise to differential offsets? We hypothesized that the RBDs might serve as molecular rulers and that the size of the offset might scale roughly linearly with the number of RBDs. Under this scenario the offset in ADAR2 with respect to ADAR1 might reflect the loss of one RBD in ADAR2, harboring 2 RBDs, in comparison to ADAR1, harboring 3 RBDs. To test this hypothesis, we designed two ADAR2 variants harboring only a single RBD by either maintaining only the first or only the second RBD, with the anticipation that these might lead to an offset potentially even smaller than −26 nt. Both mutants were active within cells, albeit at drastically different levels (Figs. 3A, S4A, S4B), with the mutant harboring only the first RBD exhibiting very low levels of activity, in contrast to the RBD2-harboring mutant that gave rise to higher levels of editing than WT ADAR2, consistent with54. Nonetheless, in both cases, the size of the offset remained fixed at roughly −26, similar to WT ADAR2 (Fig. 3D; Figure S5C, S5G). These findings thus suggest that the size of the offset is not determined by the number of RBDs.
The above results left open the possibility that the effect of the number of RBDs might be threshold-dependent. Under such a scenario, one or two RBDs might invariably give rise to an offset of −26, but never to an offset of −35, for which a third RBD would be required. To test this possibility, we selected two additional ADAR homologs from Suricata suricatta and Octopus vulgaris, harboring one and two RBDs, respectively, to assess whether these invariably gave rise to editing at an offset of −26 nt. The two ADAR enzymes elicited deamination activity on B2 or mNG positive control constructs (Figure S4A, B), albeit at varying levels and with differences in substrate selectivity (Fig. 3A). Interestingly, the two ADAR1 homologs gave rise to different offsets: suricata ADAR gave rise to an offset of −35 similar to human ADAR1, whereas octopus ADAR displayed a peak at position −28, similar to human ADAR2 (Fig. 3E; Figure S5D, H). Collectively, these findings thus establish that while the size of the offsets is encoded within the RBD architecture, it is not encoded in the number of RBD domains either in a linear or a threshold-dependent manner, and instead it appears to be an inherent property that can be encoded even within a single RBD (see Discussion).
Evidence for offset-mediated editing is observed in endogenous targets in human tissues
Our conclusions thus far relied on offsets observed on synthetic constructs in human cell lines, leaving open the question of whether these findings are of relevance to editing at endogenous targets in human tissues. To explore this question, we set out to assemble a dataset of all endogenously edited sites within transposable elements harboring a resolvable secondary structure. Given the uncertainty, in most cases, regarding which elements base pair with each other, we limited our analysis to a subset of 624 edited transposable elements stringently selected based on having only a single transposable element of opposite orientation in close proximity. We then predicted the secondary structures of each of these elements, and retrieved editing levels for each adenosine forming part of the structures. Editing levels were calculated based on surveying the entirety of the GTEX dataset (spanning 9125 samples from 47 tissues, collected from 548 donors). On the basis of this collection, we next annotated each of the 40 nt flanking each editing site as being either closed (if it resided in a stem) or open (if it resided in a loop or in a bulge). Finally, we generated an editing metaplot, wherein we calculated the extent to which structuredness within a window of 80-nt centered around an editing site impacted editing levels. Remarkably, this analysis revealed two regions in which an open structure led to increased editing: A major window 35 nt downstream of the edited site and a more minor window 30 nt upstream of the edited site (Figure S6A). This analysis is thus fully consistent with our analyses on ADAR1, and demonstrates that the −35/30 nt rule is not only discernible in synthetic constructs in cell lines, but also apparent in an unbiased analysis in tissue-derived RNA. We furthermore found these results to be reproducible also when confining them to specific tissues (not shown), again pointing at their generality.
In the above analysis, we did not observe evidence for offset-mediated editing also at position −26, which is consistent with the fact that most targets within transposable elements are mediated by ADAR1, rather than by ADAR2. Drastically fewer editing sites dependent exclusively on ADAR2 were characterized, and thus it was not possible to conduct the above analysis for ADAR2-associated editing sites. To assess whether we could find evidence for offset-mediated editing of ADAR2, we considered a set of 10 sequences which harbor highly curated well-established targets of ADAR228. Among those sequences prediction of secondary structure could be established with high confidence for 7 sequences. An examination of ADAR2-mediated editing sites within these targets revealed that in 5 of the 7 most structures (in FLNA, GRM4, FLNB, and NOVA1), the highly edited target harbored a mismatch 26 nt away (Fig. S6B). Moreover, in the cases of GRIA4, a mismatch was present at position +27, whereas in SON it was present at position +25. While this evidence is more anecdotal in nature, it again points to the generality of our results and their relevance to editing at endogenous targets.
26-bp offset rule can improve the efficiency of ADAR2-mediated targeted editing
To explore whether the recently identified −26 nt rule of ADAR2 might lend itself to improved design of ADAR recruiting therapeutics, we designed ADAR-recruiting RNAs (arRNA) to elicit editing on five distinct targets harboring distinct consensus motifs located on four different endogenous transcripts: PPIB-ORF:UAG, GAPDH-UTR:UAG, SMAD4-UTR:CAG, PPIB-UTR:UAG and STAT1-ORF:UAU via recruitment of exogenously expressed ADAR2. For each of these targets, we designed three arRNA constructs: (1) a 151-nt long arRNA containing a C opposite to the target A located between two 75-nt stretches which are perfectly complementary to the endogenous transcript. Such constructs were used in23,55 and served as a positive control; (2) an arRNA as in (1) but harboring 3-bp mismatch 26 or 27 nt upstream from the target adenosine, and (3) an empty vector serving as a negative control (Fig. 4A). Consistent with our expectations, we found that in 2 of the 5 cases (GAPDH-UTR and SMAD4-UTR), the 3-nt disruptions significantly increased editing levels with respect to the positive controls, and in a third case (PPIB-ORF) the same trend was observed albeit it did not pass statistical significance (Fig. 4B). The relatively low increase in these cases as well as the absence of an increase in the two remaining cases are consistent with the relatively mild effect size of induced editing at position −26 (Fig. 3B) and may be suggestive of context-specificity remaining to be uncovered.
In some clinical contexts, it could potentially be beneficial to induce editing only in cells expressing one of the two ADAR enzymes, for instance in order to achieve selectivity in brain tissues in which ADAR2 is expressed. Given the different offsets at which ADAR1 and ADAR2 induce editing, we sought to assess whether this could be leveraged to achieve such selective editing. Indeed, we found that an arRNA with a structural disruption at an offset of 35 nt selectively induced editing by ADAR1, and not by ADAR2, in comparison to a positive control lacking a structural disruption (Fig. 4C, D). Conversely, an arRNA with a structural disruption at a 26 nt offset selectively induced editing by ADAR2, and not by ADAR1 (Fig. 4D). These results thus suggest that engineered structural disruptions at fixed offsets can be utilized to tune the relative susceptibility of targets to editing via ADAR1 vs ADAR2.
Finally, we sought to assess whether the introduction of structural disruptions at a 26 bp offset would not only increase on-target editing levels but also decrease off-target levels. To assess this, we amplicon-sequenced the GAPDH amplicon following targeted editing via either the ‘positive control’ or the ‘Mismatch 27’ arRNA. In this analysis, we only identified a single adenosine that was edited at levels exceeding 2% across either of these two samples. Remarkably, this position was edited at levels of 6.13% in the positive control sample, which decreased to 1.05% in the ‘Mismatch 27’ samples (Fig. 4E, F). This off-target site resided 26 nt downstream of the targeted adenosine, and therefore the reduced editing levels in the ‘Mismatch 27’ sample are likely a direct consequence of this position no longer being base-paired in the ‘Mismatch 27’ arRNA. With the caveat of only relying on a single off-target site, these findings suggest that rationally designed structural disruptions within arRNAs can be designed to both increase on-target rates and decrease off-target ones.
Overall, these findings lend support to the observations that structural disruptions lead to increased ADAR2-mediated editing at a fixed offset and provide a proof of principle that this rule can allow improved recruitment of ADAR2 towards target adenosines in therapeutic settings.
Characterization of ADAR1 and ADAR2 sequence selectivity across diverse ADAR variants
The experimental design of the oligo-array libraries employed in this study had been primarily geared towards interrogating the impact of RNA secondary structure on editing. Nonetheless, the availability of measurements of editing levels across distinct sites and in varying sequence contexts allowed investigating the impact of sequence on editing, and the extent to which this varied across the eight ADAR variants interrogated here.
We found that across all ADAR enzymes, the position immediately upstream of the edited site was depleted of G at the upstream position, consistent with38,39,56. The position immediately downstream displayed less of a bias, consistent with35,38,39,57. (Fig. 5A). We next explored the extent to which the identity of nucleotides opposite of the target adenosine impacted editing across the different ADAR variants. We found that editing by all ADAR variants was induced when a C was introduced opposite of the target A (Fig. 5B). Introduction of A-A or A-G mismatches opposite of the edited site both substantially decreased editing at the targeted position and gave rise to increased editing at a −26 bp offset in ADAR2-expressing cells (Fig. 5B). Finally, we extended this analysis to mismatches occurring in the vicinity of the edited site. This analysis revealed that editing at adenosines in a GA context (underlined A is edited) tends to be substantially higher when the nucleotide opposite of the G at position −1 is a guanosine, and even more so an adenosine (Fig. 5C). We further found induced levels of editing when a 3-nt mismatch was centered around an edited site in a ‘GA’ context (Fig. 5D). These findings were consistently observed across all ADAR variants (Figure S7A, B). The facts that these sequence preferences are independent of the RBD domain structure and that they occur at sites that are in physical interaction with the deaminase domain suggest that these sequence preferences are an inherent property of the ADAR deaminase domains and that they are shared across ADAR1 and ADAR2 homologs.
Given that adenosines in GA contexts are typically edited at low efficiencies, we sought to investigate whether editing in GA contexts could be induced via the introduction of arRNAs designed to harbor a G-G or a G-A mismatch at position −1, or via guides introducing a 3-nt mismatch at the edited site. Indeed, we found that an arRNA harboring a G-A mismatch yielded the highest editing level within a SMAD4-ORF target, followed by arRNA harboring a G-G mismatch, whereas the fully complementary arRNA yielded background levels of editing (Fig. 5E, F). These findings are in line with reports by58,59. In parallel, we also found that introduction of such GA mismatches, while increasing on-target effects, was also associated with increased off-target effects at an offset of 26 nt (Figure S7C). Collectively, our findings establish how editing at target sites can be induced either by introducing mismatches at a relatively distant fixed offset via a mechanism impacting recognition through the RBDs, or in close vicinity to the target site via a mechanism likely impacting recognition through the deaminase domain.
Discussion
Despite widespread interest in unraveling the determinants guiding the selectivity of ADAR1 and ADAR2, these have remained poorly understood and to a considerable extent unpredictable. It has been previously suggested that the basis for selectivity resides within mismatches37, bulges, loopsַ53, and long-range tertiary pseudoknots60,61. Such structural elements are evolutionarily conserved60,62 suggesting that the secondary35 and tertiary RNA structures63 play an important role in regulating the editing efficiency and specificity. Accordingly, mismatches and bulges have also been included in the design of prior arRNA recruiting modalities22,55. Yet, the rules governing such selectivity—e.g., where do structural mismatches contribute to editing? When are they prohibitive?—have remained poorly understood. Our study contributes two key insights to our understanding: First, we establish a simple rule, namely that structural disruptions of diverse types (bulges, mismatches) will give rise to induced ADAR2-mediated editing at a fixed offset of 26 bp upstream of the disruption, contrasting with ADAR1 which induces editing at a 35 bp offset. Second, we uncover that these distinct offsets by the two ADARs are encoded via the distinct RBD domains of the two enzymes. Importantly, our analyses of GTEX-based data confirm the relevance of offset-mediated editing to endogenous sites within human tissues.
Our work uncovers interesting commonalities and differences between the two ADAR enzymes. Activity by both enzymes is induced at a fixed offset from structural disruptions. In both cases, there is substantial evidence for symmetricity, as is evident from comparing the upper and lower strand editing levels. Moreover, in both cases, the induction of editing is orientation-specific, with editing being induced on both strands upstream of the structural disruption. However, the size of the offset is different (−35 vs −26 nt). In addition, the magnitude of induction is also different, with more dramatic effects being typically observed for ADAR1 than for ADAR2. Finally, for ADAR1 in addition to the major peak at −35, we had also observed a more minor peak in editing activity 30 bp downstream of the edited site. We do not observe such a downstream peak for ADAR2. This may either reflect a difference in the mechanism driving induced editing, or the lower dynamic ranges which may limit us from clearly observing such a secondary, more minor peak for ADAR2.
A major question left open by our study is the basis for the different offsets of ADAR1 and ADAR2. While based on the RBD swapping experiments it is clearly encoded by the RBD architecture, we rule out that this is a function of the number of RBDs, as offsets of 26 and 35 nt are achieved by variants and mutants with a distinct number of RBDs. Another possibility is that the difference in offset is not due to the difference in domains, but to the difference in the size of the linker between the RBD and the deaminase domain. However, we can largely rule out this possibility as well, because in our RBD swapping experiments between ADAR1 and ADAR2 we had maintained the original linkers, and the offset sizes segregated with the RBDs. Given that single amino acids in the RBD were shown to be important in RNA recognition and binding43, it is possible that the basis for the difference in selectivity between the two enzymes lies within such individual changes. Dissecting this systematically via genetic approaches is rendered challenging, given that mutations within RBDs oftentimes also abolish editing. Indeed, six additional RBD-disrupting ADAR mutants that we generated over the course of this study (data not shown) failed to show any substantial editing activity, consistent also with previous observations64. Our inability to obtain active mutants harboring only the catalytic domains and none of the dsRBDs (data not shown) also prevents us from formally ruling out the possibility that part of the offset might be directly mediated by the dsRBD itself. We anticipate that the structural dissection of these two enzymes bound to RNA targets will contribute towards answering this question.
In attempting to understand the basis for a 26 nt offset of ADAR2, we found two potentially relevant clues in the literature. First, in a structural study of the Glu receptor target in complex with the ADAR2 RBDs, each of the two domains was found to associate with 12-14 nt. Thus, 26 nt is well within the range of the size that would be protected by two RBD43. While our findings suggest that a 26 nt offset can also be maintained via ADAR mutants and variants harboring a single RBD, they do leave open the possibility that an offset of 26 nt could be the combined outcome of the RBDs of two ADAR enzymes acting as a dimer, given that both ADAR1 and ADAR2 act as homodimers65–68. Second, our studies resonate to some extent with findings that ADAR substrates are distributed periodically at ~50 bp intervals from each other69. Given that we find editing induced 26 nt upstream of structural disruptions on the top strand, but also at 26 nt upstream of the disruption on the lower strand (Fig. 2F), and given our previous observations on editing symmetricity49, it is tempting to speculate that structural disruptions could serve as a mechanism spacing edited sites at ~52 bp intervals from each other. However, in the cited study69 the same intervals were observed for ADAR1 and ADAR2, whereas different intervals would be predicted for ADAR1 vs ADAR2 based on such a model and our findings, and thus it is unclear to us whether these findings are mechanistically related.
In our study, we also perform proof-of-principle experiments demonstrating that our improved understanding of editing specificity by the two ADAR enzymes lends itself towards the improved design of ADAR recruiting RNA sequences. We demonstrate that the offsets at a fixed distance can enhance on-target editing levels at the specified targets, potentially reduce off-target editing, and can provide some level of control over which of the two enzymes mediates it. While the effect sizes obtained in our hands are in most cases relatively modest, we anticipate that they might potentially be boosted, if combined with more potent arRNAs, such as chemically modified ones22.
Our study also suffers from several limitations. First, it was primarily geared to dissect relative differences in substrate selectivity between ADAR1 and ADAR2, rather than absolute differences. Given that it relies on overexpression of the two enzymes, and to unequal levels, such absolute differences—involving comparison of the same target across experiments, rather than of different targets within a single sample—need to be interpreted with caution. Second, our study was conducted using a synthetic set of sequences within a human cell line. While this raises questions as to its relevance on endogenous substrates in human cells and tissues, our ability to confirm offset-mediated editing in GTEX cell lines strongly supports the relevance of our conclusions. Third, our study was primarily designed to monitor the impact of structural perturbations on editing. While we do also utilize our measurements to interrogate the role played by sequence (Fig. 5), the sequence space that we sample is limited, which may potentially skew the results. While our results in this section generally are consistent with prior literature, we do not observe a clear enrichment for a G downstream of the edited site (Fig. 5A), in contrast to previous studies35,38,39. This may reflect the limited, monitored sequence contexts in our study- only 29 AG dinucleotides were monitored, 11 of which (38%) in a ‘GAG’ context, whereby it is known (and also observed here) that an upstream G plays an inhibitory role on editing. Finally, given that editing is a product of both sequence and structural preferences, it will be important to simultaneously perturb both, in order to understand to what extent these two dimensions act in an additive or synergistic manner. This dimension could not be profiled using our current design.
Collectively, our findings shed light on the mechanisms underlying the only partially overlapping target spectrum of ADAR1 and ADAR2, while advancing our technical toolkit to target these two enzymes towards clinically relevant targets.
Methods
ADAR plasmid generation
Full-length human ADAR2 (UniProt: P78563-2), ADAR2 RBD1 Deaminase, and ADAR2 RBD2 Deaminase coding sequences were amplified from the AAVS1-hADAR2, pYES-DEST52-hADAR2-dRBM1-Deaminase domain, and pYES-DEST52-hADAR2-dRBM2-Deaminase domain plasmids, respectively, using primers that included XbaI and EcoRI sites. Full-length human ADAR1 (UniProt: P55265-5) was amplified from the AAVS1-hADAR1 plasmid by primers containing XbaI and HindIII sites. All of those PCR products (primers in Supplementary Data 1) were subsequently digested and ligated into the corresponding restriction sites of the digested pcDNA3.1(-) vector. Additionally, both recombinant ADAR constructs contain a FLAG tag (peptide sequence DYKDDDDK) at their N-terminus.
For designing ADAR1-ADAR2 hybrid plasmids, the ADAR1-pcDNA3.1(-) and ADAR2-pcDNA3.1(-) plasmids were used as templates for PCR reactions (primers in Supplementary Data 1) using Phusion® Hot Start II DNA polymerase (Thermo Fisher Scientific). The gel-purified DNA fragments were assembled according to Gibson Assembly® Master Mix (NEB). The assembled products were transformed using a Gibson Assembly Cloning Kit (NEB), and all constructs were confirmed via Sanger sequencing on PCR-based positive clones (primers in Supplementary Data 1). Final ADAR-plasmid-containing clones were grown in ampicillin-supplemented LB liquid media, and DNA was extracted according to the QIAprep Spin Miniprep Kit (QIAGEN).
The pTwist CMV vectors containing the human-codon optimized sequences of ADAR from Octopus vulgaris (UniProt: A0A6P7SCW6_OCTVU), and Suricata suricatta (UniProt: A0A673T544_SURSU) were ordered from Twist Bioscience. Bacteria from glycerol stocks were inoculated and grown in ampicillin-supplemented LB liquid media, and plasmid DNA was extracted as previously mentioned.
Transient transfections
ADAR1-knockout HEK293T cells70, which were obtained as gift from lab of Prof. Dan Stetson, were grown (37 °C, 5% CO2) in Gibco Dulbecco’s Modified Eagle Medium (DMEM) supplemented with 10% fetal bovine serum, 1% Penicillin and Streptomycin, and 4ug/ml Puromycin. 5 × 105 cells were plated on a 6-well plate so that cells reached 70–90% confluency at the time of the second transfection. 24 and 48 hours after cell seeding, 1.6 μg of ADAR-expressing pcDNA3.1(-) plasmid and 4 μg B2 or mNG library DNA were transfected respectively according to Lipofectamine® 2000 DNA Transfection Reagent Protocol (Thermo Fisher Scientific). 24 hours later, cell harvesting was performed.
RNA processing and library preparation
Total RNA was extracted using Nucleozol (Macherey-Nagel), poly-A selected using oligo dT-beads (Dynabeads mRNA DIRECT Kit life tech), and DNase treated (Thermo Fisher Scientific). The upper constant arms or lower variable arms including 8 nucleotide barcodes of constructs were reverse transcribed, PCR amplified (primers in Supplementary Data 1), and sequenced using NovaSeq 6000 SP Reagent Kit v1.5 (300 cycles).
Data analysis of NGS data
Fastq files were assessed by a custom R script (Supplementary Data 2). The read-filtering process removed reads containing wrong start and end, lacking the established barcodes, and misaligning at adenosine positions. Read 1 and 2 were merged into a single sequence by custom truncation and matching. For each barcode, the editing percentage was quantified as (G/(A + G))*100 at each adenosine position. Δ editing was calculated as the difference of editing levels at adenosine positions between each structurally altered sequence and perfect-double stranded construct, respectively.
Western blotting
Lysates were harvested 48 h after transfection as previously described. Cells were washed twice with ice-cold PBS and harvested by centrifugation for 20 min at 200 g and 4 °C. Pellets were re-suspended in RIPA lysis buffer (150 mM NaCl, 50 mM Tris/HCl pH 8, 1% NP-40, 0.5% sodium deoxycholate, 0.1% SDS) supplemented with complete protease inhibitor cocktail (Roche). Lysates were incubated for 10 min on ice and subsequently centrifuged for 10 min at 13.000 g and 4 °C. 5x SDS-PAGE protein loading buffer was added to the supernatant. The samples were then incubated at 95 °C for 5 minutes. Tris-Glycine SDS-PAGE and wet immunoblotting were performed for subsequent analysis. Antibodies used were anti-mouse FLAG (1:2000, Sigma, F3165), anti-goat Actin HRP (1:5000, Jackson, 805-035-180) and anti-mouse HRP (1:5000, abcam, ab97040).
GTEX analysis of ADAR1 and ADAR2 targets
To explore the effect of mismatches in specific locations within the dsRNA structure on ADAR1-mediated editing levels, we retrieved a list of Alu elements from the human genome using the Repeatmasker71 track of annotation downloaded from the UCSC genome browser72. We then filtered for pairs of Alu elements that were (1) located in UTRs, (2) oppositely oriented, (3) within 2000 nt from each other, and (4) lacked any other Alu elements in the window. The dsRNA structures of the Alu pairs were predicted using the FOLD73 program from the RNAStructure package73 with default parameters. The bpRNA tool was used to assign structural elements such as loop, bulge and stems to each nucleotide in the structure74. The editing level for each adenosine in those structures was calculated using pooled reads from the GTEX database75 (9125 samples in total). Only adenosines exhibiting editing levels above 3% were considered for further analysis. We then considered a window of 80 nt centered around each editing site, and annotated each position within this window—for each editing site—as being either closed (stem structure) or open (loop or bulge). For each position within this window, we then divided all editing sites into ones that harbor an open structure at that position, versus a closed one, and calculated the difference in mean editing levels. Statistical significance was determined using a t-test.
For ADAR2 analysis on the basis of the GTEX dataset, we began with 17 editing sites identified as ADAR2 targets in mice28. We were able to identify to map and retrieve editing levels on the basis of4 for 10 of the 17 targets. Genomic sequences of different lengths (500, 1000, 1500, 2000, and 3000 bp) from both sides of the editing sites were extracted, and the dsRNA structures were predicted using the FOLD program from the RNAStructure package73 with default parameters. Structures that included the highest number of additional editing sites previously described in these genomic locations4 with the lowest free energy were chosen for further analysis. As a minimal support for the relevance of the structure, we required that at least two editing sites be found across the predicted structure, retaining 8 of the 10 targets. In our analyses, we considered only the most highly edited site in each structure, and filtered out one case where this site was at the end of a stem/loop structure (and where the +26 position was a position predicted to base pair with a position upstream of the site). The bpRNA tool74 was used to assign structural elements such as loop, bulge and stems to each nucleotide in the structure.
Target RNA editing by recruiting exogenous ADAR2 using plasmid-born arRNAs
Plasmid construction
Gene fragments containing arRNAs and KpnI sites were ordered from Twist Bioscience. All sequences were KpnI-digested and cloned into the digested EPB104 backbone (Addgene plasmid # 68369) with transcription of arRNA driven by a U6 promoter. The list of TWIST gene fragments is described in Supplementary Data 1. Additionally, the pDECKO-mCherry plasmids expressing ‘Positive ctrl’, ‘Empty Vector’ and ‘Mismatch 35’ arRNAs were retrieved from49.
Transient transfections
5 × 105 cells were plated on a 6-well plate so that cells reached 70–90% confluency at the time of transfection. 24 hours after cell seeding, 1ug of ADAR1- or ADAR2-expressing pcDNA3.1(-) plasmid, 0.1 ug of pEGFP-N1 plasmid (for assessment of transfection efficiency), and 3ug of the corresponding arRNA-expressing plasmid were transfected according to Lipofectamine® 2000 DNA Transfection Reagent Protocol. 24 hours later, the medium was changed, and 12 hours later, cells were harvested.
RNA processing and editing quantification
RNA isolation, DNase digestion, and reverse transcription were performed using NucleoZOL (Macherey-Nagel), Amplification Grade DNase I (Thermo Fisher Scientific), and MultiScribe Reverse Transcriptase cDNA synthesis kit (Thermo Fisher Scientific), respectively. The subsequent PCR with KAPA HiFi HotStart ReadyMix (Roche) was performed using transcript-specific primers (Supplementary Data 1). Finally, A-to-I editing within the target mRNA was determined via Sanger sequencing (Supplementary Data 1) and the quantitative analysis using the EditR tool76 and MultiEditR77.
GAPDH amplicon library preparation and analysis of sequencing data
Editing elicited by GAPDH-targeting arRNAs was quantified using Amplicon Illumina Sequencing. Total RNA was poly-A selected using oligo dT-beads (Dynabeads mRNA DIRECT Kit life tech), and DNase-treated (Thermo Fisher Scientific). The target UTR editing region was reverse transcribed, PCR amplified (primers in Supplementary Data 1), and sequenced on the Illumina Novaseq platform. Data was analyzed by a custom R script. Reads containing wrong starting and ending sequences, and GAPDH-unaligned reads were filtered out. The editing percentage was quantified as (G/(A + G)) *100 at the target adenosine position.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
On behalf of S.S., this project has been funded by the Israel Science Foundation (ISF) under grant 913/21-[Schwartz]. Additionally, E.Y.L. received support from the Foundation Fighting Blindness and the ISF.
Author contributions
S.S., M.Z., and E.Y.L. conceived the study. M.Z. and M.W. performed the experiments with valuable assistance from A.U. and R.N. Data analysis was carried out by M.Z., S.S., Z.R., and E.Y.L., with support from A.U. The manuscript was collaboratively written by S.S., M.Z., and M.W., with significant contributions from A.U., R.N., S.B.A., and E.Y.L.
Peer review
Peer review information
Nature Communications thanks Meng How Tan and the other, anonymous, reviewers for their contribution to the peer review of this work. A peer review file is available.
Data availability
The NGS data generated in this study have been deposited in the NCBI BioProject database under accession code ID PRJNA943413. The raw data regarding expression of B2 and mNG constructs in WT HEK293T cells49 used in this study are available in the Gene Expression Omnibus database under accession code GSE155490. For the analysis of ADAR1 and ADAR2 endogenous targets, data were retrieved from the GTEX database75.
Code availability
The R script for analyzing NGS sequencing data from the B2 and mNG oligo libraries is accessible as Supplementary Data 2.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-43633-0.
References
- 1.Ramaswami G, Li JB. RADAR: a rigorously annotated database of A-to-I RNA editing. Nucleic Acids Res. 2014;42:D109–D113. doi: 10.1093/nar/gkt996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Porath HT, Carmi S, Levanon EY. A genome-wide map of hyper-edited RNA reveals numerous new sites. Nat. Commun. 2014;5:4726. doi: 10.1038/ncomms5726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bazak L, et al. A-to-I RNA editing occurs at over a hundred million genomic sites, located in a majority of human genes. Genome Res. 2014;24:365–376. doi: 10.1101/gr.164749.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gabay O, et al. Landscape of adenosine-to-inosine RNA recoding across human tissues. Nat. Commun. 2022;13:1184. doi: 10.1038/s41467-022-28841-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Sommer B, Köhler M, Sprengel R, Seeburg PH. RNA editing in brain controls a determinant of ion flow in glutamate-gated channels. Cell. 1991;67:11–19. doi: 10.1016/0092-8674(91)90568-J. [DOI] [PubMed] [Google Scholar]
- 6.Burns CM, et al. Regulation of serotonin-2C receptor G-protein coupling by RNA editing. Nature. 1997;387:303–308. doi: 10.1038/387303a0. [DOI] [PubMed] [Google Scholar]
- 7.Hoopengardner B, Bhalla T, Staber C, Reenan R. Nervous system targets of RNA editing identified by comparative genomics. Science. 2003;301:832–836. doi: 10.1126/science.1086763. [DOI] [PubMed] [Google Scholar]
- 8.Lev-Maor G, et al. RNA-editing-mediated exon evolution. Genome Biol. 2007;8:R29. doi: 10.1186/gb-2007-8-2-r29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Solomon O, et al. Global regulation of alternative splicing by adenosine deaminase acting on RNA (ADAR) RNA. 2013;19:591–604. doi: 10.1261/rna.038042.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tang SJ, et al. Cis- and trans-regulations of pre-mRNA splicing by RNA editing enzymes influence cancer development. Nat. Commun. 2020;11:799. doi: 10.1038/s41467-020-14621-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kawahara Y, et al. Redirection of silencing targets by adenosine-to-inosine editing of miRNAs. Science. 2007;315:1137–1140. doi: 10.1126/science.1138050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Tomaselli S, et al. ADAR enzyme and miRNA story: a nucleotide that can make the difference. Int. J. Mol. Sci. 2013;14:22796–22816. doi: 10.3390/ijms141122796. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liddicoat BJ, et al. RNA editing by ADAR1 prevents MDA5 sensing of endogenous dsRNA as nonself. Science. 2015;349:1115–1120. doi: 10.1126/science.aac7049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Yang S, et al. Adenosine deaminase acting on RNA 1 limits RIG-I RNA detection and suppresses IFN production responding to viral and endogenous RNAs. J. Immunol. 2014;193:3436–3445. doi: 10.4049/jimmunol.1401136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Samuel CE. Adenosine deaminase acting on RNA (ADAR1), a suppressor of double-stranded RNA–triggered innate immune responses. J. Biol. Chem. 2019;294:1710–1720. doi: 10.1074/jbc.TM118.004166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mannion NM, et al. The RNA-editing enzyme ADAR1 controls innate immune responses to RNA. Cell Rep. 2014;9:1482–1494. doi: 10.1016/j.celrep.2014.10.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Liddicoat BJ, Chalk AM, Walkley CR. ADAR1, inosine and the immune sensing system: distinguishing self from non-self. Wiley Interdiscip. Rev. RNA. 2016;7:157–172. doi: 10.1002/wrna.1322. [DOI] [PubMed] [Google Scholar]
- 18.Slotkin W, Nishikura K. Adenosine-to-inosine RNA editing and human disease. Genome Med. 2013;5:105. doi: 10.1186/gm508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ishizuka JJ, et al. Loss of ADAR1 in tumours overcomes resistance to immune checkpoint blockade. Nature. 2019;565:43–48. doi: 10.1038/s41586-018-0768-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Aquino-Jarquin G. Novel Engineered Programmable Systems for ADAR-mediated RNA editing. Mol. Ther. Nucleic Acids. 2020;19:1065–1072. doi: 10.1016/j.omtn.2019.12.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fukuda M, et al. Construction of a guide-RNA for site-directed RNA mutagenesis utilising intracellular A-to-I RNA editing. Sci. Rep. 2017;7:41478. doi: 10.1038/srep41478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Merkle T, et al. Precise RNA editing by recruiting endogenous ADARs with antisense oligonucleotides. Nat. Biotechnol. 2019;37:133–138. doi: 10.1038/s41587-019-0013-6. [DOI] [PubMed] [Google Scholar]
- 23.Qu L, et al. Programmable RNA editing by recruiting endogenous ADAR using engineered RNAs. Nat. Biotechnol. 2019;37:1059–1069. doi: 10.1038/s41587-019-0178-z. [DOI] [PubMed] [Google Scholar]
- 24.Katrekar D, et al. In vivo RNA editing of point mutations via RNA-guided adenosine deaminases. Nat. Methods. 2019;16:239–242. doi: 10.1038/s41592-019-0323-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Montiel-Gonzalez MF, Vallecillo-Viejo I, Yudowski GA, Rosenthal JJC. Correction of mutations within the cystic fibrosis transmembrane conductance regulator by site-directed RNA editing. Proc. Natl Acad. Sci. USA. 2013;110:18285–18290. doi: 10.1073/pnas.1306243110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Buchumenski I, et al. Global quantification exposes abundant low-level off-target activity by base editors. Genome Res. 2021;31:2354–2361. doi: 10.1101/gr.275770.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Tan MH, et al. Dynamic landscape and regulation of RNA editing in mammals. Nature. 2017;550:249–254. doi: 10.1038/nature24041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chalk AM, Taylor S, Heraud-Farlow JE, Walkley CR. The majority of A-to-I RNA editing is not required for mammalian homeostasis. Genome Biol. 2019;20:1–14. doi: 10.1186/s13059-019-1873-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Morse DP, Aruscavage PJ, Bass BL. RNA hairpins in noncoding regions of human brain and Caenorhabditis elegans mRNA are edited by adenosine deaminases that act on RNA. Proc. Natl Acad. Sci. USA. 2002;99:7906–7911. doi: 10.1073/pnas.112704299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Levanon EY, et al. Systematic identification of abundant A-to-I editing sites in the human transcriptome. Nat. Biotechnol. 2004;22:1001–1005. doi: 10.1038/nbt996. [DOI] [PubMed] [Google Scholar]
- 31.Athanasiadis A, Rich A, Maas S. Widespread A-to-I RNA editing of Alu-containing mRNAs in the human transcriptome. PLoS Biol. 2004;2:e391. doi: 10.1371/journal.pbio.0020391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hundley HA, Bass BL. ADAR editing in double-stranded UTRs and other noncoding RNA sequences. Trends Biochem. Sci. 2010;35:377–383. doi: 10.1016/j.tibs.2010.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Herb A, Higuchi M, Sprengel R, Seeburg PH. Q/R site editing in kainate receptor GluR5 and GluR6 pre-mRNAs requires distant intronic sequences. Proc. Natl Acad. Sci. USA. 1996;93:1875–1880. doi: 10.1073/pnas.93.5.1875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Källman AM, Sahlin M, Ohman M. ADAR2 A->I editing: site selectivity and editing efficiency are separate events. Nucleic Acids Res. 2003;31:4874–4881. doi: 10.1093/nar/gkg681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Dawson TR, Sansam CL, Emeson RB. Structure and sequence determinants required for the RNA editing of ADAR2 substrates. J. Biol. Chem. 2004;279:4941–4951. doi: 10.1074/jbc.M310068200. [DOI] [PubMed] [Google Scholar]
- 36.Ohman M, Källman AM, Bass BL. In vitro analysis of the binding of ADAR2 to the pre-mRNA encoding the GluR-B R/G site. RNA. 2000;6:687–697. doi: 10.1017/S1355838200000200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wong SK, Sato S, Lazinski DW. Substrate recognition by ADAR1 and ADAR2. RNA. 2001;7:846–858. doi: 10.1017/S135583820101007X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lehmann KA, Bass BL. Double-stranded RNA adenosine deaminases ADAR1 and ADAR2 have overlapping specificities. Biochemistry. 2000;39:12875–12884. doi: 10.1021/bi001383g. [DOI] [PubMed] [Google Scholar]
- 39.Eggington JM, Greene T, Bass BL. Predicting sites of ADAR editing in double-stranded RNA. Nat. Commun. 2011;2:319. doi: 10.1038/ncomms1324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Grice LF, Degnan BM. The origin of the ADAR gene family and animal RNA editing. BMC Evol. Biol. 2015;15:4. doi: 10.1186/s12862-015-0279-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Wang Y, Park S, Beal PA. Selective Recognition of RNA Substrates by ADAR Deaminase Domains. Biochemistry. 2018;57:1640–1651. doi: 10.1021/acs.biochem.7b01100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Goodman RA, Macbeth MR, Beal PA. ADAR proteins: structure and catalytic mechanism. Curr. Top. Microbiol. Immunol. 2012;353:1–33. doi: 10.1007/82_2011_144. [DOI] [PubMed] [Google Scholar]
- 43.Stefl R, et al. The solution structure of the ADAR2 dsRBM-RNA complex reveals a sequence-specific readout of the minor groove. Cell. 2010;143:225–237. doi: 10.1016/j.cell.2010.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Herbert A, et al. A Z-DNA binding domain present in the human editing enzyme, double-stranded RNA adenosine deaminase. Proc. Natl Acad. Sci. USA. 1997;94:8421–8426. doi: 10.1073/pnas.94.16.8421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Barraud P, Allain FH-T. ADAR proteins: double-stranded RNA and Z-DNA binding domains. Curr. Top. Microbiol. Immunol. 2012;353:35–60. doi: 10.1007/82_2011_145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hubbard NW, et al. ADAR1 mutation causes ZBP1-dependent immunopathology. Nature. 2022;607:769–775. doi: 10.1038/s41586-022-04896-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Jiao H, et al. ADAR1 averts fatal type I interferon induction by ZBP1. Nature. 2022;607:776–783. doi: 10.1038/s41586-022-04878-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.de Reuver R, et al. ADAR1 prevents autoinflammation by suppressing spontaneous ZBP1 activation. Nature. 2022;607:784–789. doi: 10.1038/s41586-022-04974-w. [DOI] [PubMed] [Google Scholar]
- 49.Uzonyi A, et al. Deciphering the principles of the RNA editing code via large-scale systematic probing. Mol. Cell. 2021;81:2374–2387.e3. doi: 10.1016/j.molcel.2021.03.024. [DOI] [PubMed] [Google Scholar]
- 50.Schaffer AA, et al. The cell line A-to-I RNA editing catalogue. Nucleic Acids Res. 2020;48:5849–5858. doi: 10.1093/nar/gkaa305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hajji K, et al. ADAR2 enzymes: efficient site-specific RNA editors with gene therapy aspirations. RNA. 2022;28:1281–1297. doi: 10.1261/rna.079266.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Bhalla T, Rosenthal JJC, Holmgren M, Reenan R. Control of human potassium channel inactivation by editing of a small mRNA hairpin. Nat. Struct. Mol. Biol. 2004;11:950–956. doi: 10.1038/nsmb825. [DOI] [PubMed] [Google Scholar]
- 53.Lehmann KA, Bass BL. The importance of internal loops within RNA substrates of ADAR1. J. Mol. Biol. 1999;291:1–13. doi: 10.1006/jmbi.1999.2914. [DOI] [PubMed] [Google Scholar]
- 54.Macbeth MR, Lingam AT, Bass BL. Evidence for auto-inhibition by the N terminus of hADAR2 and activation by dsRNA binding. RNA. 2004;10:1563–1571. doi: 10.1261/rna.7920904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Yi Z, et al. Engineered circular ADAR-recruiting RNAs increase the efficiency and fidelity of RNA editing in vitro and in vivo. Nat. Biotechnol. 2022;40:946–955. doi: 10.1038/s41587-021-01180-3. [DOI] [PubMed] [Google Scholar]
- 56.Morse DP, Bass BL. Long RNA hairpins that contain inosine are present in Caenorhabditis elegans poly(A)+ RNA. Proc. Natl Acad. Sci. USA. 1999;96:6048–6053. doi: 10.1073/pnas.96.11.6048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Polson AG, Bass BL. Preferential selection of adenosines for modification by double-stranded RNA adenosine deaminase. EMBO J. 1994;13:5701–5711. doi: 10.1002/j.1460-2075.1994.tb06908.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Schneider MF, Wettengel J, Hoffmann PC, Stafforst T. Optimal guideRNAs for re-directing deaminase activity of hADAR1 and hADAR2 in trans. Nucleic Acids Res. 2014;42:e87. doi: 10.1093/nar/gku272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Doherty EE, et al. ADAR activation by inducing a syn conformation at guanosine adjacent to an editing site. Nucleic Acids Res. 2022;50:10857–10868. doi: 10.1093/nar/gkac897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Reenan RA. Molecular determinants and guided evolution of species-specific RNA editing. Nature. 2005;434:409–413. doi: 10.1038/nature03364. [DOI] [PubMed] [Google Scholar]
- 61.Rieder LE, Staber CJ, Hoopengardner B, Reenan RA. Tertiary structural elements determine the extent and specificity of messenger RNA editing. Nat. Commun. 2013;4:2232. doi: 10.1038/ncomms3232. [DOI] [PubMed] [Google Scholar]
- 62.Aruscavage PJ, Bass BL. A phylogenetic analysis reveals an unusual sequence conservation within introns involved in RNA editing. RNA. 2000;6:257–269. doi: 10.1017/S1355838200991921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Tian N, et al. A structural determinant required for RNA editing. Nucleic Acids Res. 2011;39:5669–5681. doi: 10.1093/nar/gkr144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Valente L, Nishikura K. RNA binding-independent dimerization of adenosine deaminases acting on RNA and dominant negative effects of nonfunctional subunits on dimer functions. J. Biol. Chem. 2007;282:16054–16061. doi: 10.1074/jbc.M611392200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Jaikaran DCJ, Collins CH, MacMillan AM. Adenosine to inosine editing by ADAR2 requires formation of a ternary complex on the GluR-B R/G site. J. Biol. Chem. 2002;277:37624–37629. doi: 10.1074/jbc.M204126200. [DOI] [PubMed] [Google Scholar]
- 66.Cho D-SC, et al. Requirement of dimerization for RNA editing activity of adenosine deaminases acting on RNA. J. Biol. Chem. 2003;278:17093–17102. doi: 10.1074/jbc.M213127200. [DOI] [PubMed] [Google Scholar]
- 67.Poulsen H, et al. Dimerization of ADAR2 is mediated by the double-stranded RNA binding domain. RNA. 2006;12:1350–1360. doi: 10.1261/rna.2314406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Thuy-Boun AS, et al. Asymmetric dimerization of adenosine deaminase acting on RNA facilitates substrate recognition. Nucleic Acids Res. 2020;48:7958–7972. doi: 10.1093/nar/gkaa532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Song Y, et al. irCLASH reveals RNA substrates recognized by human ADARs. Nat. Struct. Mol. Biol. 2020;27:351–362. doi: 10.1038/s41594-020-0398-4. [DOI] [PubMed] [Google Scholar]
- 70.Pestal K, et al. Isoforms of RNA-Editing Enzyme ADAR1 independently control nucleic acid sensor MDA5-driven autoimmunity and multi-organ development. Immunity. 2015;43:933–944. doi: 10.1016/j.immuni.2015.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Smit, A. F. A., Hubley, R. & Green, P. Repeat-Masker Open-3.0. (2013–2015).
- 72.Kent WJ, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Reuter JS, Mathews DH. RNAstructure: software for RNA secondary structure prediction and analysis. BMC Bioinforma. 2010;11:129. doi: 10.1186/1471-2105-11-129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Danaee P, et al. bpRNA: large-scale automated annotation and analysis of RNA secondary structure. Nucleic Acids Res. 2018;46:5381–5394. doi: 10.1093/nar/gky285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Lonsdale J, et al. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Kluesner MG, et al. EditR: A method to quantify base editing from Sanger Sequencing. CRISPR J. 2018;1:239–250. doi: 10.1089/crispr.2018.0014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Kluesner MG, et al. MultiEditR: The first tool for the detection and quantification of RNA editing from Sanger sequencing demonstrates comparable fidelity to RNA-seq. Mol. Ther. Nucleic Acids. 2021;25:515–523. doi: 10.1016/j.omtn.2021.07.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The NGS data generated in this study have been deposited in the NCBI BioProject database under accession code ID PRJNA943413. The raw data regarding expression of B2 and mNG constructs in WT HEK293T cells49 used in this study are available in the Gene Expression Omnibus database under accession code GSE155490. For the analysis of ADAR1 and ADAR2 endogenous targets, data were retrieved from the GTEX database75.
The R script for analyzing NGS sequencing data from the B2 and mNG oligo libraries is accessible as Supplementary Data 2.