Abstract
N4-Acetylcytidine (ac4C) is a post-transcriptional modification of RNA that is conserved across all domains of life. All characterized sites of ac4C in eukaryotic RNA occur in the central nucleotide of a 5′-CCG-3′ consensus sequence. However, the thermodynamic consequences of cytidine acetylation in this context have never been assessed due to its challenging synthesis. Here, we report the synthesis and biophysical characterization of ac4C in its endogenous eukaryotic sequence context. First, we develop a synthetic route to homogeneous RNAs containing electrophilic acetyl groups. Next, we use thermal denaturation to interrogate the biochemical effects of ac4C on duplex stability and mismatch discrimination in a native sequence found in human rRNA. Finally, we demonstrate the ability of this chemistry to incorporate ac4C into the complex modification landscape of human tRNA and use duplex melting to highlight an enforcing role for ac4C in this unique sequence context. By enabling ex vivo biophysical analyses of nucleic acid acetylation in its physiological sequence context, these studies establish a chemical foundation for understanding the function of a universally conserved nucleobase in biology and disease.
INTRODUCTION
N4-Acetylcytidine (ac4C) is a modified RNA nucleobase that is universally conserved among all domains of life (Figure 1a).1 Cytidine acetylation was first identified in eukaryotic transfer RNAs (tRNAs) in the 1960s.2,3 Subsequent quantitative mapping studies have defined helices 34 and 45 of 18S ribosomal RNA (rRNA) and the D-stem of tRNASer and tRNALeu as the dominant sites of ac4C in eukaryotes.4–6 In humans, cytidine acetylation is catalyzed by the essential RNA acetyltransferase enzyme NAT10, which works in concert with protein and snoRNA adapters to address its distinct targets.4,5 Dysregulation of NAT10 has been associated with many diseases, including premature aging syndromes and cancer.7,8 Precisely why ac4C is so highly conserved in eukaryotic rRNA and tRNA remains unknown.
Knowledge of the molecular effects of ac4C largely derives from modeling and structural studies of the free nucleoside.9,10 Crystallographic data indicate the N4-acetyl group in ac4C prefers a conformation in which it is oriented proximal to cytidine C5, reflecting the influence of a weak C–H···O interaction formed between the acetamide carbonyl oxygen and pyrimidine C5 C–H (Figure 1b).15,16 This ordered structure is compatible with canonical base-pairing, as it places the bulk of the acetyl group toward the major groove of duplex RNA. N4-Acetylation also stabilizes the C3′-endo conformation of cytidine’s ribose sugar, an effect common to other rRNA modifications such as pseudouridine and 2′-O-methylation.11 These features were recently corroborated in a series of high resolution cryo-EM structures of eukaryotic and archaeal ribosomes.6,12,13
Every specific site of ac4C thus far localized in human RNA occurs at the central base of a 5′-CCG-3′ consensus sequence (Figure 1c).14 An identical 5′-CCG-3′ sequence is acetylated in members of the archaeal order Thermococcales, whose RNA harbors the most ac4C of any organism yet characterized on Earth.15,16 Interestingly, many cytidines that are dynamically acetylated in response to temperature in Thermococcales occur at the stem of hairpin structures adjacent to the loop, suggestive of a role for ac4C in enforcing duplex stability.6 Biophysical analyses of ac4C would provide foundational data as to its role in biology and disease. However, ac4C has yet to be characterized in any physiologically relevant sequence context due to a lack of methods to site-specifically introduce it into RNA.
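The consensus described above lends itself to a simple sequence scan. The following is a minimal sketch that lists every cytidine sitting at the center of a 5′-CCG-3′ trinucleotide; the function name and the example sequence are hypothetical, and a motif match only marks a candidate position, since the text does not claim that every CCG is acetylated.

```python
def central_c_positions(rna: str) -> list:
    """Return 1-based positions of the central C of every 5'-CCG-3' motif."""
    seq = rna.upper().replace("T", "U")  # accept DNA-style input as well
    positions = []
    for i in range(len(seq) - 2):
        # i is the 0-based index of the first C of the trinucleotide,
        # so the central C is at 0-based i + 1, i.e. 1-based i + 2.
        if seq[i:i + 3] == "CCG":
            positions.append(i + 2)
    return positions


# Hypothetical sequence used only to exercise the function.
example_rna = "GGCCGAUCCGU"
candidate_sites = central_c_positions(example_rna)
print(candidate_sites)
```

Reporting 1-based positions matches the numbering convention used for sites such as C1842, but the mapping from a fragment to full-length rRNA numbering is left to the user.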
Previous studies have incorporated ac4C into RNA enzymatically using in vitro transcription.17,18 While this approach facilitates many applications, it results in a non-physiological, homogeneous replacement of every templated cytidine with an ac4C. Combining in vitro transcription with ligation provides a potential route to incorporate ac4C into a subset of sites;19,20 however, these methods are not well-suited to producing the short homogeneous nucleic acids required for biophysical studies and would require ligation at the modified nucleotide to incorporate ac4C into its physiological 5′-CCG-3′ context, a step that to date is unprecedented. Conventional protocols for solid-phase synthesis of RNA oligonucleotides are similarly incapable of producing ac4C RNA.21 This is because these methods employ N4-acetylation to protect the exocyclic amine of cytidine during iterative coupling and deprotection steps and thus have been designed (even in the case of “fast-deprotecting” phenoxyacyl protection)22 to remove this modification during nucleobase deprotection or upon nucleophilic cleavage from ester-linked resins (Figure 2a). Despite being listed as a potential component of nucleic acid therapeutics in many patent applications,23 the synthesis and characterization of ac4C at defined positions in RNA have never been reported. The synthesis of site-specific acetylated RNA is a prerequisite for understanding the biological role of ac4C and applying it as a functional element in nucleic acid therapeutics. These opportunities highlight the need for a synthetic route.
RESULTS AND DISCUSSION
An Orthogonal Protection Strategy Compatible with Cytidine Acetylation.
Site-specific incorporation of cytidine acetylation into RNA requires (i) a protecting group for the exocyclic nucleobase nitrogens and (ii) a solid-phase support linkage that can be cleaved without removing the N4-acetyl group of ac4C (Figure 2a). To address the first criterion, we were inspired by prior syntheses of O-acetylated RNAs, which like ac4C are sensitive to nucleophilic cleavage.24,25 These studies employed N-cyanoethyl O-carbamate (N-ceoc)26 nucleobases that could be deprotected using the non-nucleophilic base 1,8-diazabicyclo[5.4.0]undec-7-ene (DBU). To determine the orthogonality of N-ceoc protection and N4-acetylation, we analyzed the compatibility of DBU and ac4C using a series of simple model substrates (Figure 2b). Over 4 h, DBU cleanly removed the N-ceoc group from ceoc-C, while leaving N4-acetylation intact (Figure 2c). However, degradation was observed in the presence of morpholine (10% v/v) (Figure S1). The exquisite sensitivity of ac4C to nucleophilic cleavage reagents differentiates this study from prior work, which was able to use morpholine to scavenge acrylonitrile during the synthesis of O-acetylated RNAs.24 These studies define an orthogonal condition for the protection and deprotection of ac4C-containing RNA.
Synthesis of Building Blocks and Solid-Phase Support for Site-Specific ac4C RNA Synthesis.
To develop our strategy in an oligonucleotide context, we next synthesized N-ceoc-protected phosphoramidites of adenosine, cytidine, and guanosine via a 3′,5′-cyclic silyl-protected strategy (Figure 3, top).27 Briefly, parent nucleosides were first protected at the ribose sugar via treatment with di-tert-butylsilyl bistriflate, followed by addition of tert-butyldimethylsilyl chloride and imidazole. In the case of cytidine, the pyrimidine ring was protonated using one equivalent of triflic acid prior to addition of the silyl bistriflate. After protection of ribose, the exocyclic nitrogens were carbamoylated using ceoc-carbonyl-N-methylimidazolium chloride. In the case of guanosine, the O6 was first protected via the Mitsunobu reaction with (4-nitrophenyl)ethanol prior to carbamoylation at N2 using ceoc-chloroformate. Selective removal of the 3′,5′-cyclic silyl ether was achieved using HF-pyridine in dichloromethane. Finally, regioselective introduction of dimethoxytrityl at the 5′ position, followed by phosphitylation, yielded A, C, and G phosphoramidite monomers in sufficient yields for solid-phase synthesis.
Next, we sought to devise a solid-phase support that could release RNA oligonucleotides without deacetylating ac4C. Given the incompatibility of ac4C with nucleophiles, we chose to pursue a photocleavable approach.28,29 We hypothesized that a nitroveratryl-based linker may be optimal for this purpose, allowing for mild cleavage upon irradiation at 365 nm while minimizing photochemical reactions of RNA caused by shorter wavelength UV light (Figure 3, bottom). This necessity is underscored by ac4C’s relatively red-shifted absorbance (λmax = 302 nm) and previously observed photochemistry.30,31 Synthesis of the linker began with alkylation of vanillin followed by trifluoroacetic acid-mediated nitration. Reduction of the aldehyde afforded nitroveratryl alcohol, which was further protected with dimethoxytrityl chloride in pyridine to yield the elaborated linker. Subsequent deprotection and coupling to long-chain alkylamine derivatized controlled pore glass (LCAA-CPG) provided access to photocleavable solid support.
Synthetic Optimization Enables Site-Specific Incorporation of ac4C in RNA.
With these reagents in hand, we set out to establish conditions for the synthesis of ac4C RNA oligomers (Figure 3a). These studies employed standard phosphoramidite coupling time (6 min), coupling reagents (ETT), oxidation conditions (I2, pyridine, H2O), and deblocking reagent (3% TCA in DCM). To avoid reaction of acetic anhydride with N-protected exocyclic amines, conventional 5′-OH capping was omitted. This step was further determined to be dispensable based on production of similar amounts of full-length RNA in uncapped and pivalic anhydride-capped samples (Figure S2).32 However, while solid-phase synthesis proved straightforward, successfully isolating homogeneous N4-acetylated RNA required several innovations compared to previous approaches. First, to avoid nucleobase alkylation by acrylonitrile during N-ceoc removal, we developed an on-column deprotection scheme (Figure 4a, optimization #1). This protocol passes DBU (0.5 M in acetonitrile) over the nascent RNA oligomer on solid support to efficiently deprotect N-ceoc bases while limiting their exposure to acrylonitrile, thus obviating the need for a nucleophilic scavenger such as morpholine. Second, to maximize yields of ac4C-containing RNA, we identified photolysis conditions that efficiently cleave product from solid support but minimize undesired N4-deacetylation (Figure 4a, optimization #2). Initial experiments using a model RNA (5′-UU(ac4C)UUp-3′) indicated ~47% of ac4C was deacetylated during photolysis or 2′-O-TBS removal (Figure S3). Addition of Hünig’s base to the desilylation reaction modestly impeded deacetylation (47% to 35%, optimization #3). Changing the photolysis solvent to buffered acetonitrile had a more profound effect, reducing the extent of deacetylation to less than 5% (Figure S3). Elimination of these deacetylation products greatly facilitates the synthesis of ac4C-containing RNAs by both improving yield and streamlining purification.
Finally, inspired by the work of Sekine et al.,33 we tested the synthesis using an N-unprotected guanosine phosphoramidite (Figure 4b). The use of this synthetically facile monomer further improved accumulation of full-length RNA products. Combining these innovations led to the identification of high proportions of desired products in crude cleavage reactions (Figure S4, Figure 4c), which could be further purified using polyacrylamide electrophoresis (PAGE) to yield pure ac4C-containing RNAs (Figure 4d). Overall, these studies define an effective solid-phase synthetic route to RNAs containing the endogenous electrophilic base modification ac4C.
Synthesis and Characterization of ac4C in an Endogenous 5′-CCG-3′ Sequence Context: 18S rRNA.
Human small subunit (SSU) 18S rRNA contains two high stoichiometry sites of ac4C (C1280 and C1842), each of which is embedded in a fully base-paired 5′-CCG-3′ sequence (Figure 1b–c).4–6 To study cytidine acetylation in this context, we synthesized an RNA decamer corresponding to the ac4C-containing strand of SSU helix 45 (Table 1). Annealing to a complementary RNA enabled analysis of ac4C’s effects on duplex stability and mismatch discrimination via UV-melting experiments. Consistent with its presence in hyperthermophile RNA,6 pilot analyses found ac4C is not labile upon heating (Figure S5). Thermal denaturation curves were analyzed by both nonlinear regression and van’t Hoff plots to extract thermodynamic constants (Table 1, Supporting Information). Agreement between these two analysis methods supports a two-state denaturation model for all duplexes analyzed. Focusing first on a fully complementary duplex (Table 1, entry 1), cytidine acetylation was found to have an overall stabilizing effect on duplex RNA (ΔTmac4C v. C = +1.7 °C). The free energy change caused by ac4C is accounted for by increased enthalpy upon duplex formation relative to cytidine RNA. The only previous study of N4-acetylcytidine in a hybridized oligonucleotide context was in a polyuridine DNA and observed a smaller increase in melting temperature (ΔTmac4C v. C = +0.4 °C).34 Further study will be required to determine whether this difference reflects unique experimental conditions or a stabilizing effect of ac4C on its evolutionarily conserved RNA sequence context. Duplexes containing a mismatch across from cytidine or ac4C each exhibited reduced melting temperatures relative to their matched counterparts (Table 1, entries 2–4). Overall, the effects of ac4C on mismatch discrimination (ΔΔTmC–G v. C–A) are small and within the error of our experimental measurement.
Taking into account the average stabilities of match and mismatch, substitution of cytidine with ac4C appears to slightly discriminate against a C–A mismatch (ΔΔTm = −0.4 °C) and increase tolerance for a C–U mismatch (ΔΔTm = +2.0 °C). The increased C–A mismatch discrimination and C–U tolerance appear to be enthalpically driven (ΔΔH ac4C–A v. C–A = +13 kcal/mol and ΔΔH ac4C–U v. C–U = −11 kcal/mol, from the ΔH values in Table 1). This could reflect the electron-withdrawing effect of the N4-acetyl group, which disfavors tautomerization (required for C–A pairing) and may render N4-H a more effective hydrogen bond donor. Improved C–A mismatch discrimination by ac4C is consistent with prior studies of E. coli tRNAMet, where incorporation of N4-acetylation has been shown to prevent misreading of AUA codons.35,36 The potential for ac4C to engage in noncanonical pairing with uridine has not been previously described but is anecdotally supported by the recent cryo-EM visualization of this base pair in an archaeal ribosome (Figure S6).6
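The ΔΔTm values quoted above are differences of differences, which a short calculation makes explicit. The sketch below recomputes them from the melting temperatures reported in Table 1, assuming the first value of each pair belongs to the cytidine duplex and the second to the ac4C duplex (an assignment consistent with the reported ΔTm signs); the dictionary layout and function names are illustrative only.

```python
# Melting temperatures (deg C, 5 uM) transcribed from Table 1.
# Assumed order within each tuple: (cytidine duplex, ac4C duplex).
TM_TABLE = {
    "C-G": (65.9, 67.6),
    "C-A": (46.2, 47.5),
    "C-U": (43.0, 46.7),
    "C-C": (42.9, 45.2),
}


def delta_tm(pair: str) -> float:
    """Effect of acetylation on Tm for one pairing: Tm(ac4C) - Tm(C)."""
    tm_c, tm_ac4c = TM_TABLE[pair]
    return round(tm_ac4c - tm_c, 1)


def delta_delta_tm(mismatch: str, match: str = "C-G") -> float:
    """Extra (de)stabilization of a mismatch relative to the matched duplex."""
    return round(delta_tm(mismatch) - delta_tm(match), 1)


for pair in ("C-A", "C-U", "C-C"):
    print(pair, delta_tm(pair), delta_delta_tm(pair))
```

A negative ΔΔTm means acetylation helps the mismatch less than it helps the matched duplex (better discrimination), and a positive value means the mismatch is tolerated more. Rounding to one decimal place mirrors the precision of the reported temperatures.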
Table 1.
entry | variable pair | nucleotide | Tm (°C) [5 μM] | ΔTm (°C) (ac4C − C) | ΔG (kcal/mol) | ΔS (eu) | ΔH (kcal/mol) |
---|---|---|---|---|---|---|---|
1 | C-G | C | 65.9 | | −16.6 ± 0.2 | −265 ± 7 | −101 ± 3 |
1 | C-G | ac4C | 67.6 | +1.7 | −18.6 ± 0.6 | −281 ± 16 | −106 ± 5 |
2 | C-A | C | 46.2 | | −11.5 ± 0.8 | −283 ± 18 | −99 ± 5 |
2 | C-A | ac4C | 47.5 | +1.3 | −11.5 ± 1.1 | −239 ± 39 | −86 ± 13 |
3 | C-U | C | 43.0 | | −10.7 ± 0.4 | −238 ± 51 | −85 ± 15 |
3 | C-U | ac4C | 46.7 | +3.7 | −11.5 ± 0.5 | −279 ± 60 | −96 ± 21 |
4 | C-C | C | 42.9 | | −11.0 ± 0.2 | −234 ± 32 | −81 ± 12 |
4 | C-C | ac4C | 45.2 | +2.3 | −11.4 ± 0.4 | −267 ± 65 | −94 ± 20 |
5 | G•U | C | 60.9 | | −15.3 ± 0.7 | −283 ± 5 | −103 ± 20 |
5 | G•U | ac4C | 64.0 | +3.1 | −16.3 ± 1.1 | −291 ± 5 | −107 ± 19 |
6 | G•U (−1) | C | 56.0 | | −14.4 ± 0.9 | −320 ± 53 | −114 ± 17 |
6 | G•U (−1) | ac4C | 57.7 | +1.7 | −15.1 ± 0.8 | −258 ± 29 | −95 ± 10 |
Duplexes were designed to test the effect of ac4C on canonical base-pairing, mismatch discrimination, and compatibility with adjacent G•U wobble pairs (n = 3). ΔTm = Tm(ac4C) − Tm(C), each measured at 5 μM. Exemplary melting curves and van’t Hoff plots are provided in the Supporting Information.
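The van’t Hoff analysis referred to above can be illustrated with a minimal sketch. It assumes the single-curve form of the analysis, in which ln K is regressed against 1/T so that the slope equals −ΔH/R and the intercept equals ΔS/R; the text does not state which form was used, and the function names and synthetic inputs below are illustrative assumptions rather than the authors' code or data.

```python
R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def ln_k(delta_h: float, delta_s: float, temp_k: float) -> float:
    """ln K for a two-state transition: ln K = -dH/(R*T) + dS/R."""
    return -delta_h / (R_KCAL * temp_k) + delta_s / R_KCAL


def linear_fit(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def vant_hoff(temps_k, ln_k_values):
    """Recover (dH in kcal/mol, dS in kcal/(mol*K)) from ln K versus 1/T."""
    slope, intercept = linear_fit([1.0 / t for t in temps_k], ln_k_values)
    return -slope * R_KCAL, intercept * R_KCAL


# Synthetic, noise-free input generated from assumed parameters
# (dH = -101 kcal/mol, dS = -265 eu), chosen only to exercise the fit.
temps = [330.0, 333.0, 336.0, 339.0, 342.0, 345.0]
synthetic = [ln_k(-101.0, -0.265, t) for t in temps]
fit_dh, fit_ds = vant_hoff(temps, synthetic)
print(round(fit_dh, 3), round(fit_ds * 1000, 1))  # dH in kcal/mol, dS in eu
```

With real melting data, ln K would first be derived from the fraction of duplex at each temperature, and agreement between this linear estimate and a nonlinear curve fit is what supports a two-state model.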
Both known sites of cytidine acetylation in human rRNA reside two bases from a G•U wobble base pair (Figure 1b–c).4–6 Given the significance of G•U pairing to RNA structure,37 we next set out to determine how ac4C alters the stability of duplex RNAs containing this element. Cytidine and ac4C duplexes were prepared containing a G•U pair +2 bp from ac4C, effectively replicating the stem sequence found in helix 45 of human 18S rRNA. Once again, RNA duplexes containing ac4C were found to be more stable than those containing cytidine (ΔTmac4C v. C = +3.1 °C, entry 5). Cytidine acetylation also stabilizes duplexes containing a G•U pair directly proximal to the modified nucleotide (entry 6), albeit to a lesser extent. Differences in basal stability confound a quantitative comparison of G•U versus fully complementary RNA duplexes (entries 1 and 5). However, the observation that ac4C is slightly more stabilizing in the G•U duplex (3.1 °C vs 1.7 °C) indicates the high compatibility of this modification with adjacent noncanonical RNA base pairs. Moreover, these studies provide the first empirical evidence that site-specific cytidine acetylation can enforce RNA structure in a physiologically relevant sequence context.
Synthesis of ac4C in a Complex Modification Landscape: tRNASer.
Eukaryotic tRNASer constitutes the first site of cytidine acetylation ever characterized.2,3 Deposition of ac4C in eukaryotic tRNAs occurs at C12 of the D-arm, is exclusive to tRNASer/Leu (Figure 5a), and requires both a cytidine acetyltransferase (Nat10 in humans; Kre33 in yeast) and an adapter protein (Thumpd1 in humans; Tan1 in yeast).5,38 Deletion of yeast Tan1 causes loss of tRNASer, rapid decay of tRNASerAGA, and growth defects at elevated temperatures.39 This could indicate a critical role for ac4C in enforcing tRNASer structure or, alternatively, reflect an ac4C-independent effect caused by loss of Tan1. Emphasizing the need to consider this latter possibility, previous studies have found proteins that carry out tRNA modifications can aid tRNA maturation independent of their catalytic activity.40 Differentiating between these scenarios would be greatly aided by the ability to isolate the biophysical effects of ac4C in the unique context of the tRNA D-arm. This led us to ask the following question: does cytidine acetylation alter the stability of this tRNASer substructure?
Human and yeast tRNASerCGA share an identical sequence and modification profile in their D-arm, which is composed of a 4-bp stem that contains an internal ac4C–G and an 8-nt D-loop with three dihydrouridines (D) and one 2′-O-methylguanosine (Gm).1,3,41 Its synthesis presents a challenge due to its length (16 nt) and the presence of an additional labile nucleobase, D, that is prone to ring-opening. Previous studies have shown D-containing RNA can be obtained using phenoxyacyl-protected nucleobases.42 This led us to hypothesize that the even gentler N-ceoc protecting group strategy would be compatible with D, while also facilitating incorporation of ac4C (and Gm) into the hypermodified tRNASer hairpin. To obtain the necessary building blocks, 5′-O-DMT-protected phosphoramidite monomers of D and Gm were synthesized. The synthesis of protected D used an adaptation of a previously reported method (Figure S7),42 while the Gm monomer was readily obtained via nucleobase deprotection of a commercial starting material in a single step.43 These materials were then applied in combination with the previously described building blocks using the optimized solid-phase protocol to synthesize tRNASer D-arm models containing either C or ac4C at the C12 position. Analysis of crude reaction products revealed higher amounts of truncation products during the synthesis of tRNA hairpins relative to our rRNA-derived 10-mer, consistent with its longer linear sequence (Figure 5b). PAGE purification, and in the case of ac4C subsequent HPLC-purification, yielded the desired C- and ac4C-containing tRNASer D-arm hairpins in quantities sufficient for biophysical characterization (Figure 5c).
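The link between oligonucleotide length and truncation products noted above follows from the multiplicative nature of stepwise coupling. The sketch below estimates the fraction of chains that reach full length for a given number of couplings; the 98% per-step efficiency is a hypothetical value chosen for illustration, not a figure reported here, and the function name is likewise an assumption.

```python
def full_length_fraction(coupling_efficiency: float, n_couplings: int) -> float:
    """Fraction of chains that complete every coupling step.

    Assumes each coupling succeeds independently with the same probability,
    so the overall fraction is efficiency ** n_couplings.
    """
    if not 0.0 <= coupling_efficiency <= 1.0:
        raise ValueError("coupling_efficiency must be between 0 and 1")
    if n_couplings < 0:
        raise ValueError("n_couplings must be non-negative")
    return coupling_efficiency ** n_couplings


HYPOTHETICAL_EFFICIENCY = 0.98  # illustrative per-step value, not from the text

for length in (10, 16):
    # One coupling per nucleotide is assumed, since the first nucleotide is
    # also coupled to the linker-derivatized support.
    fraction = full_length_fraction(HYPOTHETICAL_EFFICIENCY, length)
    print(f"{length}-mer: {fraction:.3f} full length, {1 - fraction:.3f} truncated")
```

Under this simple model a 16-mer always accumulates more truncated material than a 10-mer at the same per-step efficiency, which is the qualitative trend described for the tRNA hairpin.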
Melting temperature measurements were performed at higher concentrations to account for the short stem structure and presence of nonaromatic nucleobases in the tRNA hairpin. Thermal denaturation analysis revealed a clear helix to coil transition for the ac4C-containing RNA at 71.1 ± 0.4 °C, while the nonacetylated hairpin melted at 62.9 ± 0.7 °C (ΔTmac4C v. C = +8.2 °C) (Table 2). This represents a stabilization of ~1 kcal/mol, similar in magnitude to the free energy change caused by inserting pseudouridine into a base-paired duplex.44 The UV-melting profile of tRNASer was not sensitive to concentration, consistent with a unimolecular (hairpin) as opposed to a bimolecular (duplex) process. Across evolution, serine and leucine tRNAs are characterized by two unique elements: a variable region of more than 10 nucleotides and a conserved purine–purine (G13•A23) pair.45 Of note, these features converge on the tertiary structure of the tRNA formed by the D-arm. Our studies suggest ac4C may constitute a third distinct functional element in eukaryotic tRNASer and tRNALeu and support the plausibility of a mechanism whereby cytidine acetylation regulates tRNA half-life and overall fitness by modulating the structural dynamics of these noncoding RNAs at elevated temperatures (Figure 6).
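The concentration test described above rests on a standard two-state result: for a unimolecular hairpin, Tm = ΔH/ΔS and does not depend on strand concentration, whereas for a non-self-complementary duplex, Tm = ΔH/(ΔS + R ln(Ct/4)) and rises with total strand concentration Ct. The sketch below evaluates both expressions; the thermodynamic parameters and concentrations are hypothetical values chosen for illustration, not fits to the data in Table 2.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def tm_hairpin(delta_h: float, delta_s: float) -> float:
    """Two-state Tm (K) of a unimolecular hairpin; no concentration term."""
    return delta_h / delta_s


def tm_duplex(delta_h: float, delta_s: float, total_strand_molar: float) -> float:
    """Two-state Tm (K) of a non-self-complementary duplex at total strand
    concentration Ct (mol/L), with both strands at equal concentration."""
    return delta_h / (delta_s + R_KCAL * math.log(total_strand_molar / 4.0))


# Hypothetical parameters: dH in kcal/mol, dS in kcal/(mol*K).
DH, DS = -100.0, -0.270

for ct in (5e-6, 50e-6):
    duplex_c = tm_duplex(DH, DS, ct) - 273.15
    hairpin_c = tm_hairpin(DH, DS) - 273.15
    print(f"Ct = {ct:.0e} M: duplex Tm = {duplex_c:.1f} C, hairpin Tm = {hairpin_c:.1f} C")
```

A tenfold change in concentration shifts the duplex Tm by several degrees in this model while leaving the hairpin Tm unchanged, which is why a concentration-insensitive melting profile points to an intramolecular transition.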
Table 2.
variable | Tm (°C) [5 μM] | ΔTm (°C) (ac4C) | ΔG (kcal/mol) | ΔS (eu) | ΔH (kcal/mol) |
---|---|---|---|---|---|
C | 62.9 | | −2.4 ± 0.1 | −86 ± 3 | −29 ± 1 |
ac4C | 71.1 | +8.2 | −3.4 ± 0.2 | −89 ± 7 | −32 ± 3 |
Left: Schematic of tRNASer, showing the site of ac4C at C12 in the stem of the D-loop. Right: Curves fit to melting data of the C- and ac4C-containing human tRNASer D-arm hairpins. Bottom: Thermodynamic parameters obtained from UV-melting experiments (n = 3). Full melting curves are provided in the Supporting Information.
CONCLUSION
Recent evidence linking Nat10 to disease has invigorated the study of cytidine acetylation in RNA. However, the precise effects of ac4C on nucleic acid structure and function remain unknown.17 Here, we report the synthesis and evaluation of N4-acetylcytidine in its physiological RNA sequence context. Systematic development of a mild non-nucleophilic RNA synthesis enabled the preparation of homogeneous ac4C oligonucleotides. In a duplex RNA based on helix 45 of human 18S rRNA, we find that ac4C increases C–G base pair stability, an effect that is slightly augmented by the presence of a physiological G•U pair proximal to the acetylated 5′-CCG-3′. Our synthetic method also provides access to the hypermodified D-arm hairpin of eukaryotic tRNASer, and we find that ac4C is highly stabilizing in this context (ΔTmac4C v. C = +8.2 °C). Previous studies have shown that destabilization of the D-stem can propagate in a zipper-like manner toward the anticodon arm,46 triggering disruption of the tRNA tertiary structure and recognition by decay machinery.47 By providing empirical evidence that D-arm stabilization is highly dependent on ac4C at elevated temperatures, our studies differentiate the catalytic and noncatalytic functions of the Kre33/Tan1 complex and provide a molecular rationale for why Δtan1 strains exhibit decreased tRNASer and reduced fitness under environmental stress.
Clarifying the thermodynamic consequences of ac4C in these physiologically relevant sequence contexts also raises new questions. First, how does ac4C stabilize duplex RNA? Comparative cryo-EM analyses of an archaeal ribosome with >100 sites of ac4C did not observe large perturbations of hydrogen-bonding when this modification was deleted (Figure S8), albeit at limited resolution. One source of stability may come from the exocyclic acetyl group of ac4C, which projects into the major groove of duplex RNA and has been hypothesized to contribute to binding enthalpy by serving as a stable covalent replacement for ordered waters at elevated temperatures.6 Another analogy may be found in 5-formylcytidine, which presents an exocyclic hydrogen bonding network toward the C–H edge that is similar to the favored conformation of ac4C and has been shown to increase base-stacking.48 Simple modeling of a 5′-CCG-3′ RNA duplex reveals the potential for ac4C to similarly improve stacking with the upstream nucleotide (Figure S9), which may provide an additional enthalpic contribution. Understanding the molecular basis for ac4C-dependent duplex stabilization will benefit from additional biophysical interrogation and high resolution structural analyses, both of which will be facilitated by our method.
A second question is why cytidine acetylation is restricted to tRNASer and tRNALeu. As noted above, these species are distinguished from other tRNAs by large variable regions (>10 nt) and the presence of a purine–purine (G•A) pair in the D-stem.45 Here, we suggest two hypotheses. These tRNAs could be uniquely recognized as substrates by the Nat10/Thumpd1 (Kre33/Tan1) complex. Alternatively, these tRNAs could be uniquely susceptible to modification-dependent structural stabilization, given the rare occurrence of a noncanonical G•A pair directly adjacent to ac4C.49,50 Previous studies have demonstrated the G•A interaction is highly sensitive to sequence context.51 Understanding how RNA modifications affect adjacent noncanonical base pairs is an important question posed by our observations, which future applications of site-specific ac4C synthesis will seek to address. Our studies of the D-arm also emphasize a limitation of our strategy, which is the inaccessibility of full-length tRNAs to purely synthetic methods. In addition to genetic analyses,47 ligation-based strategies have proven useful in stitching together differentially modified fragments into full-length tRNAs to study their functional effect.52 The products of our synthetic route should prove readily integrable with such methods.
Finally, we anticipate the synthetic method described here will enable several additional exciting applications. Besides the D-arm of eukaryotic tRNASer/Leu, hypermodified ac4C-containing RNAs are also present in the bacterial anticodon arm and the archaeal ribosome (Figure S10).35,53 The latter also contains 2′-O-methylated ac4C (ac4Cm), the so-called “most conformationally rigid nucleobase” present in nature.54 Our methods should facilitate pioneering biophysical and structural analyses of these modification-rich contexts. In addition, the ability to site-specifically incorporate ac4C opens the door to exploring its effects in functional nucleic acids, including short guide RNAs, short interfering RNAs, and antisense oligonucleotides. A recent study found homogeneous replacement of cytidine with ac4C reduced the immunogenicity of synthetic mRNAs,18 suggesting this modification’s potential therapeutic utility. We envision our synthetic route as being amenable to incorporation of diverse N4-acylated cytidines55 as well as other electrophilic nucleobases,56 further extending the chemical functionalities that may be explored in these applications. By illuminating how ac4C influences duplex RNA stability in physiological sequence contexts, this chemistry provides a foundation for understanding and exploiting cytidine acetylation as a novel regulatory element in biology, biotechnology, and disease.
ACKNOWLEDGMENTS
The authors thank Prof. Moran Shalev-Benami (Weizmann Institute) and Prof. Marc Greenberg (Johns Hopkins University) for helpful discussions. We thank the Biophysics Resource in the Center for Structural Biology, Center for Cancer Research, NCI at Frederick for assistance with high resolution LC-MS characterization and UV-melting studies. Figures were created with the help of Biorender.com. This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research (ZIA-BC011488-05).
Footnotes
The authors declare no competing financial interest.
ASSOCIATED CONTENT
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/jacs.1c11985.
Figure S1, DBU treatment of model ac4C compound and extended HPLC traces; Figure S2, MALDI-TOF analysis of product distribution in crude RNA synthesis products; Figure S3, optimization of RNA deprotection; Figure S4, PAGE purification and MALDI of ac4C decamer; Figure S5, heat stability of ac4C’s N4-acetyl group; Figure S6, ac4C•U wobble pair; Figure S7, synthesis of dihydrouridine phosphoramidite; Figure S8, structural overlay of T. kodakarensis rRNA helix 45 and zoomed-in overlay of ac4C–G and C–G base pairs in wild-type and TkNat10 KO (ac4C-less) T. kodakarensis rRNA helix 45; Figure S9, potential base stacking interactions of ac4C; Figure S10, examples of additional hypermodified RNA contexts that contain ac4C; experimental section, and materials and methods (PDF)
Complete contact information is available at: https://pubs.acs.org/10.1021/jacs.1c11985
REFERENCES
- (1).Boccaletto P; Machnicka MA; Purta E; Piątkowski P; Baginśki B; Wirecki TK; de Crécy-Lagard V; Ross R; Limbach PA; Kotter A; Helm M; Bujnicki JM MODOMICS: A Database of RNA Modification Pathways. 2017 Update. Nucleic Acids Res. 2018, 46, D303–D307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (2).Zachau HG; Dütting D; Feldmann H Nucleotide Sequences of Two Serine-Specific Transfer Ribonucleic Acids. Angew. Chem., Int. Ed. Engl. 1966, 5 (4), 422. [DOI] [PubMed] [Google Scholar]
- (3).Staehelin M; Rogg H; Baguley BC; Ginsberg T; Wehrli W Structure of a Mammalian Serine tRNA. Nature. 1968, 219, 1363–1365. [DOI] [PubMed] [Google Scholar]
- (4).Ito S; Horikawa S; Suzuki T; Kawauchi H; Tanaka Y; Suzuki T; Suzuki T Human NAT10 Is an ATP-Dependent RNA Acetyltransferase Responsible for N4-Acetylcytidine Formation in 18 S Ribosomal RNA (rRNA). J. Biol. Chem. 2014, 289 (52), 35724–35730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (5).Sharma S; Langhendries J-L; Watzinger P; Kötter P; Entian K-D; Lafontaine DLJ Yeast Kre33 and Human NAT10 Are Conserved 18S rRNA Cytosine Acetyltransferases That Modify tRNAs Assisted by the Adaptor Tan1/THUMPD1. Nucleic Acids Res. 2015, 43 (4), 2242–2258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (6).Sas-Chen A; Thomas JM; Matzov D; Taoka M; Nance KD; Nir R; Bryson KM; Shachar R; Liman GLS; Burkhart BW; Gamage ST; Nobe Y; Briney CA; Levy MJ; Fuchs RT; Robb GB; Hartmann J; Sharma S; Lin Q; Florens L; Washburn MP; Isobe T; Santangelo TJ; Shalev-Benami M; Meier JL; Schwartz S Dynamic RNA Acetylation Revealed by Quantitative Cross-Evolutionary Mapping. Nature 2020, 583 (7817), 638–643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Larrieu D; Britton S; Demir M; Rodriguez R; Jackson SP Chemical Inhibition of NAT10 Corrects Defects of Laminopathic Cells. Science 2014, 344 (6183), 527–532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Tschida BR; Temiz NA; Kuka TP; Lee LA; Riordan JD; Tierrablanca CA; Hullsiek R; Wagner S; Hudson WA; Linden MA; Amin K; Beckmann PJ; Heuer RA; Sarver AL; Yang JD; Roberts LR; Nadeau JH; Dupuy AJ; Keng VW; Largaespada DA Insertional Mutagenesis in Mice Identifies Drivers of Steatosis-Associated Hepatic Tumors. Cancer Res. 2017, 77 (23), 6576–6588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Parthasarathy R; Ginell SL; De NC; Chheda GB Conformation of N4-Acetylcytidine, a Modified Nucleoside of tRNA, and Stereochemistry of Codon-Anticodon Interaction. Biochem. Biophys. Res. Commun. 1978, 83 (2), 657–663. [DOI] [PubMed] [Google Scholar]
- (10).Kumbhar BV; Kamble AD; Sonawane KD Conformational Preferences of Modified Nucleoside N(4)-Acetylcytidine, ac4C Occur at “Wobble” 34th Position in the Anticodon Loop of tRNA. Cell Biochem. Biophys. 2013, 66 (3), 797–816. [DOI] [PubMed] [Google Scholar]
- (11).Harcourt EM; Kietrys AM; Kool ET Chemical and Structural Effects of Base Modifications in Messenger RNA. Nature 2017, 541 (7637), 339–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Natchiar SK; Myasnikov AG; Kratzat H; Hazemann I; Klaholz BP Visualization of Chemical Modifications in the Human 80S Ribosome Structure. Nature 2017, 551 (7681), 472–477.
- (13).Coureux P-D; Lazennec-Schurdevin C; Bourcier S; Mechulam Y; Schmitt E Cryo-EM Study of an Archaeal 30S Initiation Complex Gives Insights into Evolution of Translation Initiation. Commun. Biol. 2020, 3 (1), 58.
- (14).Thalalla Gamage S; Sas-Chen A; Schwartz S; Meier JL Quantitative Nucleotide Resolution Profiling of RNA Cytidine Acetylation by ac4C-Seq. Nat. Protoc. 2021, 16 (4), 2286–2307.
- (15).Orita I; Futatsuishi R; Adachi K; Ohira T; Kaneko A; Minowa K; Suzuki M; Tamura T; Nakamura S; Imanaka T; Suzuki T; Fukui T Random Mutagenesis of a Hyperthermophilic Archaeon Identified tRNA Modifications Associated with Cellular Hyperthermotolerance. Nucleic Acids Res. 2019, 47 (4), 1964–1976.
- (16).Kowalak JA; Dalluge JJ; McCloskey JA; Stetter KO The Role of Posttranscriptional Modification in Stabilization of Transfer RNA from Hyperthermophiles. Biochemistry 1994, 33 (25), 7869–7876.
- (17).Sinclair WR; Arango D; Shrimp JH; Zengeya TT; Thomas JM; Montgomery DC; Fox SD; Andresson T; Oberdoerffer S; Meier JL Profiling Cytidine Acetylation with Specific Affinity and Reactivity. ACS Chem. Biol. 2017, 12 (12), 2922–2926.
- (18).Nance KD; Gamage ST; Alam MM; Yang A; Levy MJ; Link CN; Florens L; Washburn MP; Gu S; Oppenheim JJ; Meier JL Cytidine Acetylation Yields a Hypoinflammatory Synthetic Messenger RNA. Cell Chem. Biol. 2021, DOI: 10.1016/j.chembiol.2021.07.003.
- (19).Maroney PA; Chamnongpol S; Souret F; Nilsen TW Direct Detection of Small RNAs Using Splinted Ligation. Nat. Protoc. 2008, 3, 279–287.
- (20).Li Y; Fin A; McCoy L; Tor Y Polymerase-Mediated Site-Specific Incorporation of a Synthetic Fluorescent Isomorphic G Surrogate into RNA. Angew. Chem. 2017, 129, 1323–1327.
- (21).Roy S; Caruthers M Synthesis of DNA/RNA and Their Analogs via Phosphoramidite and H-Phosphonate Chemistries. Molecules 2013, 18 (11), 14268–14284.
- (22).Schulhof JC; Molko D; Teoule R The Final Deprotection Step in Oligonucleotide Synthesis Is Reduced to a Mild and Rapid Ammonia Treatment by Using Labile Base-Protecting Groups. Nucleic Acids Res. 1987, 15 (2), 397–416.
- (23).Hu B; Zhong L; Weng Y; Peng L; Huang Y; Zhao Y; Liang X-J Therapeutic siRNA: State of the Art. Signal Transduct. Target. Ther. 2020, 5, 101.
- (24).Xu J; Duffy CD; Chan CKW; Sutherland JD Solid-Phase Synthesis and Hybridization Behavior of Partially 2′/3′-O-Acetylated RNA Oligonucleotides. J. Org. Chem. 2014, 79, 3311–3326.
- (25).Bowler FR; Chan CKW; Duffy CD; Gerland B; Islam S; Powner MW; Sutherland JD; Xu J Prebiotically Plausible Oligoribonucleotide Ligation Facilitated by Chemoselective Acetylation. Nat. Chem. 2013, 5 (5), 383–389.
- (26).Merk C; Reiner T; Kvasyuk E; Pfleiderer W Nucleotides, Part LXVII, The 2-Cyanoethyl and (2-Cyanoethoxy)carbonyl Group for Base Protection in Nucleoside and Nucleotide Chemistry. Helv. Chim. Acta 2000, 83, 3198–3210.
- (27).Serebryany V; Beigelman L An Efficient Preparation of Protected Ribonucleosides for Phosphoramidite RNA Synthesis. Tetrahedron Lett. 2002, 43, 1983–1985.
- (28).Greenberg MM Photochemical Cleavage of Oligonucleotides from Solid Phase Supports. Tetrahedron Lett. 1993, 34, 251–254.
- (29).McMinn DL; Greenberg MM Novel Solid Phase Synthesis Supports for the Preparation of Oligonucleotides Containing 3′-Alkyl Amines. Tetrahedron 1996, 52, 3827–3840.
- (30).Miller N; Cerutti P The Synthesis of N4-Acetyl-3,4,5,6-Tetrahydrocytidine and Copolymers of Cytidylic Acid and N4-Acetyl-3,4,5,6-Tetrahydrocytidylic Acid. J. Am. Chem. Soc. 1967, 89, 2767–2768.
- (31).Helene C; Douzou P; Michelson AM Energy Transfer in Dinucleotides. Proc. Natl. Acad. Sci. U. S. A. 1966, 55, 376–381.
- (32).Zhu Q; Delaney MO; Greenberg MM Observation and Elimination of N-Acetylation of Oligonucleotides Prepared Using Fast-Deprotecting Phosphoramidites and Ultra-Mild Deprotection. Bioorg. Med. Chem. Lett. 2001, 11 (9), 1105–1107.
- (33).Ohkubo A; Sakamoto K; Miyata K-I; Taguchi H; Seio K; Sekine M Convenient Synthesis of N-Unprotected Deoxynucleoside 3′-Phosphoramidite Building Blocks by Selective Deacylation of N-Acylated Species and Their Facile Conversion to Other N-Functionalized Derivatives. Org. Lett. 2005, 7 (24), 5389–5392.
- (34).Wada T; Kobori A; Kawahara S-I; Sekine M Synthesis and Properties of Oligodeoxyribonucleotides Containing 4-N-Acetylcytosine Bases. Tetrahedron Lett. 1998, 39, 6907–6910.
- (35).Stern L; Schulman LH The Role of the Minor Base N4-Acetylcytidine in the Function of the Escherichia Coli Noninitiator Methionine Transfer RNA. J. Biol. Chem. 1978, 253 (17), 6132–6139.
- (36).Ohashi Z; Murao K; Yahagi T; Von Minden DL; McCloskey JA; Nishimura S Characterization of C+ Located in the First Position of the Anticodon of Escherichia Coli tRNAMet as N4-Acetylcytidine. Biochim. Biophys. Acta 1972, 262 (2), 209–213.
- (37).Varani G; McClain WH The G·U Wobble Base Pair. EMBO Rep. 2000, 1, 18–23.
- (38).Johansson MJO The Saccharomyces Cerevisiae TAN1 Gene Is Required for N4-Acetylcytidine Formation in tRNA. RNA 2004, 10, 712–719.
- (39).Dewe JM; Whipple JM; Chernyakov I; Jaramillo LN; Phizicky EM The Yeast Rapid tRNA Decay Pathway Competes with Elongation Factor 1A for Substrate tRNAs and Acts on tRNAs Lacking One or More of Several Modifications. RNA 2012, 18 (10), 1886–1896.
- (40).Roovers M A Primordial RNA Modification Enzyme: The Case of tRNA (m1A) Methyltransferase. Nucleic Acids Res. 2004, 32, 465–476.
- (41).Jühling F; Mörl M; Hartmann RK; Sprinzl M; Stadler PF; Pütz J tRNAdb 2009: Compilation of tRNA Sequences and tRNA Genes. Nucleic Acids Res. 2009, 37 (Database issue), D159–D162.
- (42).Dyubankova N; Sochacka E; Kraszewska K; Nawrot B; Herdewijn P; Lescrinier E Contribution of Dihydrouridine in Folding of the D-Arm in tRNA. Org. Biomol. Chem. 2015, 13 (17), 4960–4966.
- (43).Ohkubo A; Kuwayama Y; Kudo T; Tsunoda H; Seio K; Sekine M O-Selective Condensation Using P–N Bond Cleavage in RNA Synthesis without Base Protection. Org. Lett. 2008, 10 (13), 2793–2796.
- (44).Meroueh M; Grohar PJ; Qiu J; SantaLucia J Jr; Scaringe SA; Chow CS Unique Structural and Stabilizing Roles for the Individual Pseudouridine Residues in the 1920 Region of Escherichia Coli 23S rRNA. Nucleic Acids Res. 2000, 28 (10), 2075–2083.
- (45).Giegé R; Puglisi JD; Florentz C tRNA Structure and Aminoacylation Efficiency. Prog. Nucleic Acid Res. Mol. Biol. 1993, 45, 129–206.
- (46).Bonnefond L; Florentz C; Giegé R; Rudinger-Thirion J Decreased Aminoacylation in Pathology-Related Mutants of Mitochondrial tRNATyr Is Associated with Structural Perturbations in tRNA Architecture. RNA 2008, 14 (4), 641–648.
- (47).Guy MP; Young DL; Payea MJ; Zhang X; Kon Y; Dean KM; Grayhack EJ; Mathews DH; Fields S; Phizicky EM Identification of the Determinants of tRNA Function and Susceptibility to Rapid tRNA Decay by High-Throughput in Vivo Analysis. Genes Dev. 2014, 28 (15), 1721–1732.
- (48).Wang R; Luo Z; He K; Delaney MO; Chen D; Sheng J Base Pairing and Structural Insights into the 5-Formylcytosine in RNA Duplex. Nucleic Acids Res. 2016, 44 (10), 4968–4977.
- (49).Dock-Bregeon AC; Westhof E; Giegé R; Moras D Solution Structure of a tRNA with a Large Variable Region: Yeast tRNASer. J. Mol. Biol. 1989, 206 (4), 707–722.
- (50).Biou V; Yaremchuk A; Tukalo M; Cusack S The 2.9 Å Crystal Structure of T. thermophilus Seryl-tRNA Synthetase Complexed with tRNA(Ser). Science 1994, 263 (5152), 1404–1410.
- (51).Schroeder S; Kim J; Turner DH G·A and U·U Mismatches Can Stabilize RNA Internal Loops of Three Nucleotides. Biochemistry 1996, 35 (50), 16105–16109.
- (52).Kobitski AY; Hengesbach M; Seidu-Larry S; Dammertz K; Chow CS; van Aerschot A; Ulrich Nienhaus G; Helm M Single-Molecule FRET Reveals a Cooperative Effect of Two Methyl Group Modifications in the Folding of Human Mitochondrial tRNALys. Chem. Biol. 2011, 18, 928–936.
- (53).Taniguchi T; Miyauchi K; Sakaguchi Y; Yamashita S; Soma A; Tomita K; Suzuki T Acetate-Dependent tRNA Acetylation Required for Decoding Fidelity in Protein Synthesis. Nat. Chem. Biol. 2018, 14 (11), 1010–1020.
- (54).Bruenger E; Kowalak JA; Kuchino Y; McCloskey JA; Mizushima H; Stetter KO; Crain PF 5S rRNA Modification in the Hyperthermophilic Archaea Sulfolobus Solfataricus and Pyrodictium Occultum. FASEB J. 1993, 7 (1), 196–200.
- (55).Wada T; Kobori A; Kawahara S-I; Sekine M Synthesis and Hybridization Ability of Oligodeoxyribonucleotides Incorporating N-Acyldeoxycytidine Derivatives. Eur. J. Org. Chem. 2001, 2001, 4583.
- (56).Dai W; Li A; Yu NJ; Nguyen T; Leach RW; Wühr M; Kleiner RE Activity-Based RNA-Modifying Enzyme Probing Reveals DUS3L-Mediated Dihydrouridylation. Nat. Chem. Biol. 2021, 17, 1178.