Summary
Geranylation is a hydrophobic modification discovered in several bacteria tRNAs with the function of promoting codon bias during translation. However, why nature selects this C10-geranyl group remains a question. We conduct synthesis, UV-thermal denaturation, and molecular simulation studies in RNA duplexes and reveal possible reasons behind this natural selection. Among methyl-(C1), dimethylallyl-(C5), geranyl-(C10), and farnesyl-(C15) modified 2-thiouridines, only geranyl-group promotes U:G over U:A pair. Molecular simulation shows all the modified terpene groups point to the minor groove of RNA duplexes. The discrimination between U:G and U:A pairs derives from the difference in hydrogen bonding and interactions of the chain with the hydrophobic area in the minor groove. Geranyl group has perfect length to discriminate U:G and U:A pairs, whereas the others are either too long or too short to achieve the same behavior. This work indicates that geranyl group cannot be replaced by other terpene groups in promoting codon-specificity.
Subject Areas: Biochemistry, Molecular Structure, Structural Biology
Graphical Abstract

Highlights
-
•
Geranyl group on ges2U has perfect length to enhance binding to G over A in RNA
-
•
Other ges2U analogues has no pairing preference between G or A
-
•
MD shows the molecular dynamic behavior of different terpene chains in RNA duplex
Biochemistry; Molecular Structure; Structural Biology
Introduction
Nucleic acids are naturally modified to diversify their structures and functions. There are more than 160 chemical modifications that have been reported in RNA systems. Among all the RNA species, tRNA contains the highest number (over 90) of modifications (Cantara et al., 2011; Machnicka et al., 2013). RNA modifications have been frontier research topics for their significance in exploring fundamental biochemistry and developing new therapeutics (Calore et al., 2015; Sobczak et al., 2003; Torres et al., 2014). Modifications on tRNA have been reported to be involved in many diseases including virus infections and cancers (Torres et al., 2014). They have been shown to regulate gene expression, fine-tune tRNA structure, enhance ribosomal binding, and affect the fidelity and efficiency of codon-anticodon recognition (Hou et al., 2015; Nachtergaele and He, 2017; Torres et al., 2014). The wobble position 34 or the first anti-codon position is frequently modified with a wide range of chemical groups that can affect the codon-anticodon interactions and fine-tune the translational efficiency and fidelity (Dumelin et al., 2012; Hou et al., 2015; Rozov et al., 2016; Urbonavicius et al., 2001). One of the common modification catalogs on uridine at wobble position is 2-thiouridine (s2U, Figure 1(2)) derivatives including 5-methylaminomethyl-2-thiouridine (mnm5s2U: Figure 1(3)) and 5-carboxymethylamino-methyl-2-thiouridine (cmnm5s2U: Figure 1(4)). It has been shown that modifications on position 5 of uridine enhance the ribosomal binding and provide stronger codon recognition to both adenosine (A) and guanosine (G) (Hou et al., 2015; Rozov et al., 2016).
Figure 1.
Chemical Structure of Uridine Analogues
Uridine (U,1), 2-thiouridine (s2U, 2), 5-methylaminomethyl-2-thiouridine (mnm5s2U, 3), 5-carboxylmethylaminomethyl-2-thiouridine (cmnm5s2U, 4), geranylated 2-thiouridine (ges2U, 5), 5-methylaminomethyl-S-geranyl-2-thiouridine (mnm5ges2U, 6), and 5-carboxylmethylaminomethyl-S-geranyl-2-thiouridine (cmnm5ges2U, 7).
Thiolation at position 2 of uridine has been discovered on 16 out of 60 natural modified uridines (Cantara et al., 2011) and has been shown to increase the binding specificity of U to A while decreasing the recognition of G (Kumar and Davis, 1997; Larsen et al., 2015; Strobel and Cech, 1995). In addition, the sulfur atom at position 2 of uridine could be further replaced by the selenium atom, catalyzed by the selenouridine synthase (SelU, also called MnmH) (Veres and Stadtman, 1994). Interestingly, the same enzyme can also install geranyl group onto the sulfur atom at the position 2 using geranyl-pyrophosphate as the substrate (Figure 2) (Dumelin et al., 2012; Jager et al., 2016; Sierant et al., 2016; Wang et al., 2017). The geranyl group has been found in the wobble position (U34) of tRNAs specific for glutamate, glutamine, and lysine at the frequency of ∼7% in many bacteria such as Escherichia coli, Enterobacter aerogenes, Pseudomonas aeruginosa, and Salmonella enterica var. Typhimurium. The geranylated-tRNALys has been reported to reduce −1 frameshifting during the translation of E. coli, whereas geranylated-tRNAGlu promotes the codon bias of GAG to GAA (Dumelin et al., 2012). It is a fundamentally interesting question that how SelU conducts the bifunctions of both selenation and geranylation. It has been reported that the geranylation level was reduced as the level of selenium substrate exceeded 10 nM (Dumelin et al., 2012), which could be interpreted as geranyl-2-thiouridine being an intermediate of the transformation process from 2-thiouridine to 2-selenouridine (Figure 2) (Sierant et al., 2016, 2018). However, the geranylated uridine may play roles in certain biological processes such as tRNA localization, translational regulation, and cellular stress response, in addition to merely being the intermediate of selenouridine. We previously synthesized the geranyl-modified DNA and RNA oligonucleotides and studied their base pairing stability and specificity. The geranyl group was found to promote the T/U:G pairing over the others in both DNA (ges2T) (Wang et al., 2015) and RNA (ges2U) duplexes (Wang et al., 2016).
Figure 2.
Proposed Catalytic Scheme of Selenouridine Synthase (SelU)
Sulfur atom of 2-thiouridine (2-thioU) can be functionalized by SelU in the presence of geranyl pyrophosphate, resulting in the geranyl-2-thiouridine (A), which can be further converted into 2-selenouridine (B) with selenophosphate. The X group at position 5 represents either the mnm- or cmnm- shown in the compound 6 and 7 of Figure 1.
The addition of terpene group on sulfur atom at position 2 of uridine generates a new H-bonding pattern by switching the N3 from an H-bond donor (HD) to an acceptor (HA), resulting in the gain of second H-bond when pairing with guanosine (Figure 3B) and the loss of an H-bond when pairing with adenosine (Figure 3A). However, one would expect that irrespective of the length of the hydrophobic terpene group, the addition of just a methyl group at the same position (the sulfur atom of 2-thiouridine) would result in the same base pairing pattern, leading to the question, why does nature modify tRNA only with geranyl group when all the other possible terpene chains with different lengths are also available in cells? We have previously demonstrated the effect of terpene chain on the base pairing stability and specificity in the context of DNA duplexes containing geranyl-2-thiothymidine analogues (Haruehanroengra et al., 2017). In DNA-duplex, we found that at least a ten-carbon chain is necessary to achieve the base pairing specificity of xT:dG over xT:dA pairing. Both methyl and dimethylallyl groups at the same position showed no discrimination between these two pairs. Interestingly, farnesyl group showed similar specificity, but with slightly lower duplex stability compared with the geranyl one. Our molecular dynamic simulations showed that the xT:dA-containing duplexes are destabilized by the hydrophobic terpene groups. The longer the terpene modification, the more destabilization caused by the hydrophobic group by pushing the opposite nucleotides out of the duplex context, thereby weakening the stacking interactions and widening the DNA loop. In case of xT:dG-containing duplexes, however, both geranyl and farnesyl groups fit into the minor groove and compensate the overall stability loss of duplexes, whereas methyl and dimethylallyl groups are too short to interact with the hydrophobic area nor achieve the same effect.
Figure 3.
Proposed Base Pairing Patterns
The geranyl-2-thiouridine analogues (xU) with different length of carbon chains (R-group) pair with A, G, C, and U, respectively (A–D).
Considering the structural differences between DNA and RNA duplexes, the geometries and interactions between the long terpene chains and RNA minor groove could be different from the DNA one. In this study, we applied the same set of terpene analogues, including methyl- (C1), dimethylallyl- (C5), geranyl- (C10), and farnesyl- (C15) modified 2-thiouridines into RNA duplexes, targeting to mimic the codon-anticodon interactions and further elucidate the significance of natural geranyl modification in promoting xU:G pair over xU:A in tRNA system. By annealing with the complementary RNA strands containing either canonical base pair or the mismatched non-canonical ones, we obtained the base pairing specificity and stability information of each modification via thermal denaturation studies. It is interesting to observe that the same hydrophobic terpene chain has different effects on duplex stability in the contexts of DNA and RNA. The geranyl group was shown to promote the pairing of ges2U:G over ges2U:A pair, which is consistent with DNA duplexes, whereas the base pairing discrimination between G and A in RNA context was not observed in farnesyl-modified RNA, unlike the case in DNA duplexes. We also performed molecular dynamic simulations to understand the molecular behavior of each hydrophobic terpene group in RNA duplexes containing xU:A or xU:G respectively and quantified their interactions and dynamics by employing the hydration patterns, time-series data of hydrogen bonding, base-stacking, and the locations of the modification.
Results
Synthesis of S-geranyl-2-thiouridine Analogues and the RNA Oligonucleotides
As shown in Scheme 1, the synthesis started from the commercially available 5′-DMTr-2′-TBDMS-2-thiouridine phosphoramidite. The terpene modification including methyl, dimethylallyl, and farnesyl-groups were functionalized at the 2-thio position on the uracil under basic condition to generate the key compound 2a-c in high yields. The 31P-NMR spectra showed that the phosphoramidite group of the product 2a-c is quite stable under this reaction condition (Figures S2, S5, and S8). The modified phosphoramidites were further confirmed by 1H-NMR and QTOF-MS (Figures S1–S9). The phosphoramidites were then incorporated into RNA oligonucleotides. The oligonucleotides were processed as described in Transparent Methods section and finally were verified by HR-MS. It has also been demonstrated previously that these terpene groups are stable with the standard reagents and conditions of solid-phase synthesis (Haruehanroengra et al., 2017).
Scheme 1.
Synthesis of Geranyl-2-thiouridine Analogues Phosphoramidites
(i) Methyl iodine, N,N-diisopropylethylamine (DIEA), methanol (MeOH), room temperature (a); dimethylallyl bromide, DIEA, MeOH, RT (b); farnesyl bromide, DIEA, MeOH, RT (c). The overall yields are above 90%.
Thermal Denaturation and Base Pairing Studies of RNA Duplexes Containing Modified Uridines
In this study, we used the 12mer RNA-duplex as the model, 5′-GGACU(xU)CUGCAG-3′ and 5′-CUGCAG(Y)AGUCC-3′, where X:Y pair is either canonical U:A or non-canonical U: C/G/U pairs (Figure 4), which is consistent to our previous study on geranyl-RNA (Wang et al., 2016). The UV-melting temperatures (Tm) of xU-containing RNA duplexes were compared with the native counterparts (Figure 5). The corresponding melting temperatures and thermodynamic data are summarized in Table 1. All the modifications at sulfur position on 2-thiouridine (xs2U) result in the decrease of Tm values, indicating that the addition of terpene groups on 2-thio position reduced the overall base pairing stability of the duplex, consistent with what we observed in the DNA context. However, the effect of terpene chain length on base pairing specificity seems quite different. Overall, studying the UV melting temperatures of RNA duplexes containing these uridine analogues (mes2U, dmas2U, ges2U, and fas2U) revealed the importance of geranyl group on base pairing selection in RNA system.
Figure 4.
Modified RNA Strand Containing S-geranyl-2-thiouridine Derivatives xU
(A) The strand containing S-methyl-2-thiouridine (mes2U), S-dimethylallyl-2-thiouridine (dmas2U), S-geranyl-2-thiouridine (ges2U), and S-farnesyl-2-thiouridine (fas2U).
(B) The RNA complementary strand for thermal denaturation study.
Figure 5.
Normalized UV-Melting Curves of RNA Duplexes
(A–E) The duplex [5′-GGACU(xU)CUGCAG-3’] and [5′-CUGCAG(Y)AGUCC-3’] contain (xU) pairing with Y, with the xU representing (A) native uridine, (B) S-methyl-2-thiouridine (mes2U), (C) S-dimethylallyl-2-thiouridine (dmas2U), (D) S-geranyl-2-thiouridine (ges2U), and (E) S-farnesyl-2-thiouridine (fas2U).
(F) Summary of the melting temperatures. The curves represent the average data from four ramps of heating and cooling process.
Table 1.
RNA Duplex Stability and Base Pairing Specificity
| Entry | Base Pairs |
Tm (°C)a | ΔTm(°C)b | -ΔG°37 (kcal/mol)c | -ΔH°37 (kcal/mol)c | -ΔS°37 (cal/K∗mol)c | |
|---|---|---|---|---|---|---|---|
| X | Y | ||||||
| 1 | U | A | 62.5 | 16.6 | 107.6 | 293.3 | |
| 2 | U | U | 53.3 | −9.2 | 14.0 | 112.8 | 318.6 |
| 3 | U | G | 59.6 | −3.0 | 16.0 | 111.8 | 308.9 |
| 4 | U | C | 50.9 | −11.7 | 12.6 | 100.3 | 282.8 |
| 5 | mes2U | A | 39.7 | 9.1 | 48.8 | 127.8 | |
| 6 | mes2U | U | 40.4 | 0.7 | 9.1 | 45.8 | 118.4 |
| 7 | mes2U | G | 39.9 | 0.1 | 9.1 | 41.6 | 104.5 |
| 8 | mes2U | C | 41.2 | 0.7 | 9.4 | 58.8 | 159.5 |
| 9 | dmas2U | A | 57.0c | 14.1 | 78.3 | 207.1 | |
| 10 | dmas2U | U | 47.3 | −9.8 | 10.2 | 57.4 | 152.0 |
| 11 | dmas2U | G | 55.1 | −1.9 | 14.1 | 102.6 | 285.5 |
| 12 | dmas2U | C | 45.8 | −11.2 | 10.5 | 66.5 | 180.4 |
| 13 | ges2U | A | 34.5 | 7.6 | 52.5 | 144.6 | |
| 14 | ges2U | U | 35.9 | 1.4 | 7.95 | 44.1 | 116.5 |
| 15 | ges2U | G | 44.2 | 9.7 | 10.1 | 93.6 | 269.1 |
| 16 | ges2U | C | 37.3 | 2.9 | 8.2 | 100.8 | 298.7 |
| 17 | fas2U | A | 51.9 | 10.9 | 50.7 | 128.1 | |
| 18 | fas2U | U | 42.2 | −9.8 | 9.3 | 44.6 | 114.0 |
| 19 | fas2U | G | 50.3 | −1.6 | 10.4 | 53.4 | 138.5 |
| 20 | fas2U | C | 41.6 | −10.3 | 9.3 | 54.2 | 144.6 |
The duplex contains methyl-2thiouridine (mes2U), dimethylallyl-2-thiouridine (dmas2U), and geranyl-2-thiouridine (ges2U) and farnesyl-2-thiouridine (fas2U), respectively in the context of 12nt-RNA: 5′-GGACU(xU)CUGCAG-3’] and [5′-CUGCAG(Y)AGUCC-3′].
The Tm were measured in 10 mM sodium phosphate buffer containing 100 mM NaCl (pH 7.0) and repeated four times.
ΔTm values are relative to the duplex with xU-A pair of the same modification (x).
Tm was estimated from the average of two-stage melting temperatures.
As shown in Table 1, in the mes2U-containing duplexes, the replacement of oxygen at position 2 by methylated sulfur dramatically decreased melting temperatures (Tm) of duplexes containing mes2U:A, mes2U:U, mes2U:G, and mes2U:C when compared with native U:A by 22.8°C, 12.9°C, 19.7°C, and 9.7°C, corresponding to an increased ΔG° of 7.5, 4.9, 7.0, and 3.3 kcal/mol, respectively. Similarly, in the dmas2U-containing duplexes, the Tm decreased by 5.5°C, 4.5°C, 6.0°C, and 5.1°C, with the increased ΔG° of 2.5, 2.0, 3.8, and 2.1 kcal/mol, respectively. For the farnesyl modification, the Tm of fas2U duplexes were lower than its native counterparts by 10.6°C, 11.2°C, 9.3°C, and 9.2°C, respectively, corresponding to an increased ΔG° of 5.7, 4.7, 5.6, and 3.3 kcal/mol. If we compare the relative Tm of both U:A and U:G pairs containing duplexes with different lengths of modification, as shown in Figure 5F, while the RNA duplexes were stabilized by dimethylallyl group comparing to the methyl one, the stability dropped again when the terpene chain grew longer to C-10 geranyl group and recovered back with a C-15 farnesyl modification. It is also worthwhile to mention that the melting of dmas2U:A duplex showed a two-stage curve, indicating some specific interactions that this group may have with RNA duplexes to form certain stable transition intermediate structures during the duplex dissociation with heating.
We also reported the melting temperature of internal 2-thiouridine pairing with A/U/G/C (Figure S15). Our results agree with the earlier reports that 2-thiouridine stabilizes the pairing with A and U while slightly destabilizing the pairing with G (Kumar and Davis, 1997; Larsen et al., 2015), suggesting that the alteration of base pairing specificity observed earlier attributes to the addition of terpene groups on 2-thiouridine. The wobble position on tRNA is less discriminatory to achieve the degeneracy of the genetic code; however, it is naturally modified with the most diversity of chemical groups, which are known to regulate the translation efficiency. For example, s2U is commonly found on the wobble position of the anti-codon ending with U. It helps stabilizing the pairing of adenosine and destabilizing the pairing of guanosine, providing the codon bias during the translation process. Based on our study, the base pairing specificity can be switched to the opposite trend with the addition of geranyl group on sulfur atom of 2-thiouridine. In addition, we confirm that the terpene length of the geranyl is crucial for base pairing discrimination of ges2U:G over ges2U:A because other terpene groups (methyl, dimethylallyl, and farnesyl) could not provide such discrimination despite of the similarity of H-bond pattern. It is fascinating how nature selectively decorates the wobble position with various types of modifications as the tools to fine-tune the base recognition during translation. Moreover, the base pairing specificity studies of RNA with artificial groups such as methyl, dimethylallyl, geranyl, and farnesyl on 2-thiouridine can be applied to the development of DNA-/RNA-based therapeutics such as antisense and siRNA oligonucleotides (Reviews, 2007).
Molecular Simulation
The molecular dynamics simulation studies were subsequently performed to study more mechanistic insights into these 2-thiouridine-bonded terpene chains with various lengths in RNA duplexes with either U:A or U:G pairs. Figure 6 shows the surface maps of the RNA colored based on their contacts with the modifications, thereby identifying their likely location in duplex context. Both of the canonical and modified bases are firmly held in place through stacking and base pairing interactions, and the hydrophobic chain behaves largely similarly when the opposite base is adenine or guanine. Unlike the DNA, the minor groove of the RNA is more hydrophilic due to the larger solvent accessible surface area and the presence of the polar 2′ hydroxyl group, which is not accessible to the hydrophobic chain of the modification. Therefore, the modification is largely seen to interact with the backbone, including C5′ and C4′ atoms of the sugar. Although the contact with the duplex increases with chain length as expected, it is interesting to note that the modification in the A-paired system shows preferential interaction to the 3′ side of the modified base, while being more promiscuous in the G-paired system (both the 5′ and 3′ sides).
Figure 6.
Surface Representation of the RNA Duplexes
The strands are colored according to average number of contacts between the modification and the rest of the 12nt-RNA (low to high: blue to red) containing xU:A (A–D) and xU:G (E–H). The sulfur atom is shown in yellow and the hydrophobic chain of the modifications is shown in green.
Considering the importance of hydration (Siegfried et al., 2010) in the interaction pattern, we quantified the hydration change of RNA in the presence of different modifications by calculating the average number of water molecules in the first hydration shell (<4 Angstroms) of the RNA heavy atoms. The inset in Figure 7 shows the RNA heavy atoms colored with respect to the observed hydration behavior. As expected, the hydrophilic backbone shows stronger hydration (red to white) than the major and minor grooves (white to blue). Note that the major groove is not visible in this orientation. Interestingly, there is significant dehydration in the minor groove proximal to the local of the modification (highlighted by the stars), as shown by the distinct blue patch, which increases with the hydrophobic chain length, but saturates with the geranyl modification. To quantify this change, we calculated the average hydration for the whole RNA by averaging between all the heavy atoms. As expected, the hydration is non-monotonic and is the least for the geranyl containing RNA. This observation is also consistent with the preferential minor groove binding of the geranyl group driven by hydrophobicity.
Figure 7.
Overall Hydration Index
Quantified by average number of water molecules in the first hydration shell (<4 Angstroms) of the heavy atoms in the RNA. The inset shows the RNA duplex in surface form and colored by hydration index (from blue to white to red).
We next calculated the average number of hydrogen bonds between uridine and the modified uridines with the opposite adenine and guanine nucleobases (Figure 8). Although the guanine maintains roughly two hydrogen bonds with uridine and the modified derivatives, which is expected from Figure 3, the bonding between the modified uridines and adenosine is observed to be very weak, with the number of hydrogen bonds dropping significantly to roughly 0.5 H-bond across the modifications. Interestingly, although the overall H-bonding behavior is similar between RNA and DNA duplexes, the stacking interactions are stronger in RNA. Figure 9 shows the probability of stacking distances between the adenine and guanine with their 5′ and 3′ neighbors. Similar to the DNA systems, in the case of guanine pairing, with the addition of the modification, the stacking interactions are strengthened toward the 5′ side as evidenced by the decreased stacking distance and remains strengthened with subsequent increase in the chain length. The distances are unchanged for guanine in the 3′ side. However, in the case of adenine, unlike DNA, the stacking interactions are preserved in all the cases, even in those where the hydrogen bonding interactions are significantly weakened. Even though the stacking distances increase for the A paired systems with both 5′ and 3′ bases, they still remain within stacking distance. These results indicate that, although the hydrogen bonding with adenine is weakened by the modifications, the stacking interactions is largely preserved, which is a stark difference compared with the DNA systems.
Figure 8.
Average Number of Hydrogen Bonds
From the pairs of Xs2U to G (black) or A (red) when X is terpene modification methyl (meS2U), dimethylallyl (dmaS2U), geranyl (geS2U), and farnesyl (faS2U).
Figure 9.
Probability Distributions of Stacking Distances between Adenine or Guanine and the Neighboring Base
Defined by the geometric centers of the heavy atoms constituting the ring of the nucleobases.
(A and B) Show the distributions for Adenine with the 5′ and 3′ neighbors, respectively.
(C and D) Show the same for Guanine.
In addition to the conformational preference of the modification in the context of the duplex, we quantified its dynamics from the time-series data of the location of the modification. Specifically, after fitting the trajectory to the reference structure (by minimizing RMSD), we generated the time-series data for the location of the terminal terpene group, with the exception of the methyl modification, for which the whole methyl group was used. The mass center of the constituting carbon atoms was used to define the location of the terpene group. The time-series data are then used to calculate the auto-correlation function (ACF), using the following formula (Equation 1), where the x vector denotes the time-dependent xyz coordinates of the modification, and ρ(t) is the time-series average.
| (Equation 1) |
The ACF for all the systems with modifications are shown in Figure 10. For the A-paired systems, the dynamics of the modification is as reflected in the ACF follows a monotonic trend: faster to slower with the increase in the size of the modification. This trend is likely due to the increase in the preferential interaction of the modification to the 3′ side with the increased chain length. However, for the G-paired systems, there is a reversal in this overall trend as seen in a non-monotonic behavior. The dimethylallyl modification possesses the longest correlation time, indicating a possible stronger preference for that group to interact with the duplex. With further increase in the modification chain length, the correlation time decreases, indicating faster dynamics. Although the contact maps show increase in interactions with chain length, the ACF curves for the G-paired systems indicate faster dynamics for larger chains, suggesting multiple modes (5′ and 3′ orientations) of interactions for the longer chains.
Figure 10.
Autocorrelation Functions of the Position Vectors of the Modifications
The position vector is defined as follows: methyl carbon for methyl meS2U and the mass center of the terpene carbons for the other modifications.
Discussion
Why does nature use the geranyl modification in the anti-codon stem-loop of the tRNA? Although this is a truly challenging question and needs more thorough biological investigations, from our experimental and simulation studies, we find some interesting difference in structure and stability between the various modifications even though they are expected to form the same base pairing pattern. We analyzed the effect of the terpene chain length on the base pairing discrimination especially between U:A and U:G pairs. It turns out that geranyl is the only group that can promote the binding difference between G and A within these terpene modifications. In our previous DNA duplex studies, we observed that both the geranylated and farnesylated 2-thiothymidines prefer to pair with G over A because the long terpene groups can further stabilize the DNA-duplexes by binding to the minor groove in DNA duplexes when they pair with G (Haruehanroengra et al., 2017). In the meantime, the fas2T:A pair was destabilized by the farnesyl group, generating the distinct discrimination between fas2T base pairing with G and A. However, this is not the case in RNA duplex. Although farnesyl group in fas2U:A could stabilize the overall RNA duplexes as shown in the increasing of melting temperature when compared with geranyl modification (Table 1, entry 13 and 17), this 15-carbon chain does not result in the base pairing discrimination between A and G in RNA system. The global shape of DNA or RNA duplexes should be the key factor in the behavioral change of the hydrophobic terpene groups. RNA duplexes are generally wider in diameter and shorter in length than DNA duplexes, providing the stronger van der Waals stacking force between nucleobases. Moreover, the addition of 2′-hydroxyl group of RNA provides a more hydrophilic environment in the minor groove of RNA duplexes.
Analyzing the molecular behavior of terpene chain provides the explanation to the stability and specificity studies. It is clear that the hydrophobic modifications were pointed into the minor groove for both A and G as the paired base albeit with different orientations. In the case of S-methyl-2-thiouridine-containing RNA duplexes, there is no significant interaction of methyl group in the minor groove (Figures 6A and 6E) due to its small size, suggesting the similar stability of mes2U:A and mes2U:G RNA duplexes. The increasing stability of RNA duplexes when the hydrophobic chain length increased from 1C to 5C (dimethylallyl group) is consistent with the enhanced interaction of the modification in the contact maps shown in Figures 6B and 6F. With an increase in the length of hydrophobic chains, as in the dimethylallyl group, there is enhanced interaction with the 4′ and 5′ carbons. The hydrophobic and van der Waals interactions between the dimethylallyl group and those hydrophobic regions of duplexes can enhance the overall stability of the RNA duplexes, consistent with the increase of the melting temperature when the terpene chain grew from 1C (methyl) to 5C (dimethylallyl). With further increase in chain length, there is a non-monotonic trend in the melting temperature, in contrast to the contact maps, which suggests stronger interaction for the geranyl group over the dimethylallyl group in both A-paired and G-paired systems. However, the stability of the farnesyl modification over the geranyl group is consistent with the enhanced interaction of the hydrophobic chain. The decrease in stability for the duplexes with geranyl group, especially for the A-paired system, might be a quirk of that chain length. We find that intrachain backbone contacts start to appear when the chain length increases from 5C to 10C, which might detract from the duplex-stabilizing interchain contacts. However, such intrachain contacts can coexist with the interchain ones with the longer chain, as in the case of the farnesyl group (15C). In addition, the interaction modes of the geranyl and farnesyl modifications are also different in the A-paired and G-paired systems. Although the hydrophobic chains interact preferentially with the 3′ side in the A-paired system, the interaction is more promiscuous in the G-paired system. This behavior is also consistent with our hydration pattern studies and the faster dynamics exhibited by the modified groups in the G-paired systems. Therefore, we speculate that the promiscuous interaction of the geranyl group in combination with the balance of intra- and interchain contacts render the maximal differentiating capability in its pairing with A and G.
It is important to note that dmas2U:A pair showed a two-states binding. At this point we could not provide a rational explanation of how the dimethylallyl imparts an intermediate state to the duplex. However, geranyl group is chemically less reactive compared with dimethylallyl that contains the allylic proton, which could be one of the reasons why geranyl group was evolutionary selected to promote wobble pairing to G. Collectively, geranyl group seems to be the right “sweet point” for introducing the pairing to G over the canonical base pairing A in ges2U-RNA duplexes context. This fundamental structural study of terpene chain on the effect of base pairing specificity and stability provides not only the understanding of the naturally existed modification but also alternative modifications for future RNA therapeutic development.
Limitations of the Study
The effect of terpene chain length observed in this study is in the context of RNA duplexes where the internal modified uridine is located at the center of an artificial 12-nt RNA strand. This study does not address the effects of these geranyl-analogues in different RNA sequences. In nature, geranyl group presents at the wobble position of the tRNA anticodon stem loop where the interaction situation between the modified uridine and mRNA is different. The presence of surrounding ribosomal binding further complicates this interaction mode. Therefore, it will be very interesting to check the overall effect of geranyl analogues in the natural context, as well as to explore the impacts of these modifications on the translation efficiency, which could lead to potential therapeutic development. In addition, the molecular simulation (MD) studies had been performed with a lack of existing structural confirmation. Crystal or NMR structures will shed light on how these different length of terpene chains actually interact with the RNA duplexes and explain the two transitions state that has been observed in the case of dmas2U pairing to A.
Conclusion
The systematic study on the effect of chain length of geranyl analogues on 2-thiouridine in the context of RNA duplexes revealed that geranyl group has the proper terpene chain length (C10) that promotes the binding of geranylated 2-thiouridine to guanosine (ges2U:G) over adenosine (ges2U:A) in RNA duplexes context. The hydrophobic modifications on 2-thiouridine theoretically altered the hydrogen bond pattern in the similar way despite of the length of terpene chains. Thermal denaturation study, however, revealed that methyl (C1), dimethylallyl (C5) and farnesyl (C15) showed no discrimination between G and A base pairing but geranyl (C10) group promoted the U:G over U:A pair. Interestingly, dimethylallyl presented two-states binding with adenosine pair (dmas2U:A) causing uncertainty on the base pairing preference. Therefore, among the four different chain lengths from C1 to C15, only geranyl group provided reliable base pairing preference of G over A. The molecular dynamic simulation provided detailed molecular behavior of the hydrophobic modification on 2-thiouridine. In RNA context, when the modified uridine pairs to G, two H-bonds were formed, whereas theoretical 0.5 H-bonds were formed when pairing with A. The modified uridines still align in the RNA duplex because of the base-stacking force. However, the difference in the orientation of the modified uracil base affected the possible interaction of the hydrophobic chain on the sulfur atom to the hydrophobic area of the minor groove. Collectively, this study indicates the importance of decorating the geranyl group on tRNA might stem from the length of the terpene chain to perform its function. It is significant for nature to selectively use geranyl modification to fine-tune and promote the base pairing specificity of geranyl-uridine to G over A and to achieve the biological significance such as reducing the −1 frameshifting. Moreover, geranylation in RNA could be a relic of the RNA world where this modification may have enhanced the structural and functional diversity of RNA prior to the emergence of proteins. More investigation is underway to understand if geranyl-modification on tRNA performs other significant biological functions in processes such as ribosomal binding, amino acylation, cellular stress responses, and RNA translocation.
Resource Availability
Lead Contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Jia Sheng (jsheng@albany.edu).
Materials Availability
There are no unique materials available.
Data and Code Availability
The original NMR and MS spectra, HPLC profile, Tm curves are available in the Supplemental Information. The forcefield parameters generated for the modifications, as described in the Methods section, are provided in GROMACS format. Simulation data was generated and stored at the HPC hardware housed at the Data Center in UAlbany and are available for sharing upon request to the corresponding author.
Methods
All methods can be found in the accompanying Transparent Methods supplemental file.
Acknowledgments
This work was supported by The National Science Foundation (MCB-1715234 to J.S. and MRI grant 1726724) and University at Albany, State University of New York. A.A.C acknowledges support from NIH grant R35GM133469. We thank Cen Chen, Dr. Zhen Huang, Dr. Leah Seebald, Dr. Maksim Royzen, and Dr. Daniele Fabris for their help in mass spectrometry analysis. This work used computing resources from the Extreme Science and Engineering Discovery Environment (XSEDE).
Author Contributions
Conceptualization, J.S.; Methodology, P.H., S.V., S.V.R., and A.A.C.; Investigation, P.H., S.M., and M.D.S.; Writing—Original Draft, P.H., S.V.R., and J.S.; Writing—Review & Editing, P.H. and J.S.; Funding Acquisition, J.S.; Resources, J.S., S.V.R., and A.A.C.; Supervision, J.S.
Declaration of Interests
All authors declare no conflict of interest.
Published: December 18, 2020
Footnotes
Supplemental Information can be found online at https://doi.org/10.1016/j.isci.2020.101866.
Supplemental Information
References
- Calore M., De Windt L.J., Rampazzo A. Genetics meets epigenetics: genetic variants that modulate noncoding RNA in cardiovascular diseases. J. Mol. Cell. Cardiol. 2015;89(Pt A):27–34. doi: 10.1016/j.yjmcc.2015.10.028. [DOI] [PubMed] [Google Scholar]
- Cantara W.A., Crain P.F., Rozenski J., McCloskey J.A., Harris K.A., Zhang X., Vendeix F.A., Fabris D., Agris P.F. The RNA modification database, RNAMDB: 2011 update. Nucleic Acids Res. 2011;39:D195–D201. doi: 10.1093/nar/gkq1028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dumelin C.E., Chen Y., Leconte A.M., Chen Y.G., Liu D.R. Discovery and biological characterization of geranylated RNA in bacteria. Nat. Chem. Biol. 2012;8:913–919. doi: 10.1038/nchembio.1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haruehanroengra P., Vangaveti S., Ranganathan S.V., Wang R., Chen A., Sheng J. Nature's selection of geranyl group as a tRNA modification: the effects of chain length on base-pairing specificity. ACS Chem. Biol. 2017;12:1504–1513. doi: 10.1021/acschembio.7b00108. [DOI] [PubMed] [Google Scholar]
- Hou Y.M., Gamper H., Yang W. Post-transcriptional modifications to tRNA--a response to the genetic code degeneracy. RNA. 2015;21:642–644. doi: 10.1261/rna.049825.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jager G., Chen P., Bjork G.R. Transfer RNA bound to MnmH protein is enriched with geranylated tRNA--A possible intermediate in its selenation? PLoS One. 2016;11:e0153488. doi: 10.1371/journal.pone.0153488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar R.K., Davis D.R. Synthesis and studies on the effect of 2-thiouridine and 4-thiouridine on sugar conformation and RNA duplex stability. Nucleic Acids Res. 1997;25:1272–1280. doi: 10.1093/nar/25.6.1272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Larsen A.T., Fahrenbach A.C., Sheng J., Pian J., Szostak J.W. Thermodynamic insights into 2-thiouridine-enhanced RNA hybridization. Nucleic Acids Res. 2015;43:7675–7687. doi: 10.1093/nar/gkv761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Machnicka M.A., Milanowska K., Osman Oglou O., Purta E., Kurkowska M., Olchowik A., Januszewski W., Kalinowski S., Dunin-Horkawicz S., Rother K.M. MODOMICS: a database of RNA modification pathways--2013 update. Nucleic Acids Res. 2013;41:D262–D267. doi: 10.1093/nar/gks1007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nachtergaele S., He C. The emerging biology of RNA post-transcriptional modifications. RNA Biol. 2017;14:156–163. doi: 10.1080/15476286.2016.1267096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reviews A. Annual review of biophysics and biomolecular structure. Annu. Rev. Biophys. Biomol. Struct. 2007;36:1–505. [Google Scholar]
- Rozov A., Demeshkina N., Khusainov I., Westhof E., Yusupov M., Yusupova G. Novel base-pairing interactions at the tRNA wobble position crucial for accurate reading of the genetic code. Nat. Commun. 2016;7:10457. doi: 10.1038/ncomms10457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegfried N., Kierzek R., Bevilacqua P. Role of unsatisfied hydrogen bond acceptors in RNA energetics and specificity. J. Am. Chem. Soc. 2010;132:5342–5344. doi: 10.1021/ja9107726. [DOI] [PubMed] [Google Scholar]
- Sierant M., Leszczynska G., Sadowska K., Komar P., Radzikowska C.E., Sochacka E., Nawrot B. Escherichia coli tRNA 2-selenouridine synthase (SelU) converts S2U-RNA to Se2U-RNA via S-geranylated-intermediate. FEBS Lett. 2018;592:2248–2258. doi: 10.1002/1873-3468.13124. [DOI] [PubMed] [Google Scholar]
- Sierant M., Leszczynska G., Sadowska K., Dziergowska A., Rozanski M., Sochacka E., Nawrot B. S-Geranyl-2-thiouridine wobble nucleosides of bacterial tRNAs; chemical and enzymatic synthesis of S-geranylated-RNAs and their physicochemical characterization. Nucleic Acids Res. 2016;44:10986–10998. doi: 10.1093/nar/gkw727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sobczak K., de Mezer M., Michlewski G., Krol J., Krzyzosiak W.J. RNA structure of trinucleotide repeats associated with human neurological diseases. Nucleic Acids Res. 2003;31:5469–5482. doi: 10.1093/nar/gkg766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strobel S.A., Cech T.R. Minor groove recognition of the conserved G.U pair at the Tetrahymena ribozyme reaction site. Science. 1995;267:675–679. doi: 10.1126/science.7839142. [DOI] [PubMed] [Google Scholar]
- Torres A.G., Batlle E., Ribas de Pouplana L. Role of tRNA modifications in human diseases. Trends Mol. Med. 2014;20:306–314. doi: 10.1016/j.molmed.2014.01.008. [DOI] [PubMed] [Google Scholar]
- Urbonavicius J., Qian Q., Durand J.M., Hagervall T.G., Bjork G.R. Improvement of reading frame maintenance is a common function for several tRNA modifications. EMBO J. 2001;20:4863–4873. doi: 10.1093/emboj/20.17.4863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Veres Z., Stadtman T.C. A purified selenophosphate-dependent enzyme from Salmonella typhimurium catalyzes the replacement of sulfur in 2-thiouridine residues in tRNAs with selenium. Proc. Natl. Acad. Sci. U S A. 1994;91:8092–8096. doi: 10.1073/pnas.91.17.8092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang R., Ranganathan S.V., Basanta-Sanchez M., Shen F., Chen A., Sheng J. Synthesis and base pairing studies of geranylated 2-thiothymidine, a natural variant of thymidine. Chem. Commun. (Camb.) 2015;51:16369–16372. doi: 10.1039/c5cc07479g. [DOI] [PubMed] [Google Scholar]
- Wang R., Vangaveti S., Ranganathan S.V., Basanta-Sanchez M., Haruehanroengra P., Chen A., Sheng J. Synthesis, base pairing and structure studies of geranylated RNA. Nucleic Acids Res. 2016;44:6036–6045. doi: 10.1093/nar/gkw544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang R., Haruehanroengra P., Sheng J. Synthesis of geranyl-2-thiouridine-modified RNA. Curr. Protoc. Nucleic Acid Chem. 2017;68:4.72.1–4.72.13. doi: 10.1002/cpnc.22. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original NMR and MS spectra, HPLC profile, Tm curves are available in the Supplemental Information. The forcefield parameters generated for the modifications, as described in the Methods section, are provided in GROMACS format. Simulation data was generated and stored at the HPC hardware housed at the Data Center in UAlbany and are available for sharing upon request to the corresponding author.











