Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2022 Aug 5;23(18):e202200352. doi: 10.1002/cbic.202200352

Synthesis of a Peptidoyl RNA Hairpin via a Combination of Solid‐Phase and Template‐Directed Chain Assembly

Jennifer Bremer 1, Christian Richter 2, Harald Schwalbe 2, Clemens Richert 1,
PMCID: PMC9542650  PMID: 35867587

Abstract

Peptidoyl RNAs are the products of ribosome‐free, single‐nucleotide translation. They contain a peptide in the backbone of the oligoribonucleotide and are interesting from a synthetic and a bioorganic point of view. A synthesis of a stabilized version of peptidoyl RNA, with an amide bond between the C‐terminus of a peptide and a 3′‐amino‐2′,3′‐dideoxynucleoside in the RNA chain was developed. The preferred synthetic route used an N‐Teoc‐protected aminonucleoside support and involved a solution‐phase coupling of the amino‐terminal oligonucleotide to a dipeptido dinucleotide. Exploratory UV‐melting and NMR analysis of the hairpin 5′‐UUGGCGAAAGCdC‐LeuLeu‐AA‐3′ indicated that the peptide‐linked RNA segments do not fold in a cooperative fashion. The synthetic access to doubly RNA‐linked peptides on a scale sufficient for structural biology opens the door to the exploration of their structural and biochemical properties.

Keywords: NMR spectroscopy, oligonucleotides, peptides, RNA, translation


A synthetic access to peptidoyl RNA, where a peptide segment is doubly linked to RNA strands, is described. Peptidoyl RNAs are a fascinating species that may have played a role in the origin of translation.

graphic file with name CBIC-23-0-g001.jpg

Introduction

Covalently linked hybrids of peptides and nucleic acids are fascinating target molecules that are rich in functional groups and stereogenic centers. Their synthesis combines challenges of peptide and oligonucleotide chemistry.[ 1 , 2 ] Each of the biopolymers has its own set of side reactions during chain assembly. Further, their intrinsic reactivity requires conditions for deprotection and purification that are orthogonal in many respects.[ 3 , 4 ] The synthetic challenge is particularly acute for hybrids containing RNA as nucleic acid, since RNA is more labile than DNA and gives lower yields in solid‐phase synthesis. [5]

Several groups have reported successfully overcoming the synthetic challenges of preparing hybrid molecules that contain RNA and a peptide. They all used covalent linkages more stable than the ester bond found in the peptidyl tRNA intermediates of translation.[ 6 , 7 , 8 ] Other synthetic constructs were prepared using squarate‐diester‐mediated coupling, [9] boronate‐forming reactions, [10] or native ligation. [11] Our own synthetic work has focused on phosphoramidate‐linked peptide‐RNA hybrids called ′peptido RNA′, for which a solution‐phase coupling approach has recently been reported. [12]

Peptido RNAs contain the phosphoramidate bond between the 5′‐terminal phosphate of the oligoribonucleotide chain and the N‐terminus of the peptide.[ 13 , 14 ] This leaves the C‐terminus of the peptide free for coupling to the 3′‐terminus of another nucleic acid strand, such as the bond found in the peptidyl RNAs of translation. Template‐directed coupling reactions were indeed observed by us when dipeptido dinucleotides were allowed to react with 3′‐aminoterminal strands on an RNA template. [15] The resulting doubly linked species were named ′peptidoyl RNAs′ to indicate that they contain the covalent links found in peptido and in peptidyl RNAs.

The rate of formation of peptidoyl RNAs was found to depend strongly on the relative position of the two strands on the template and on the amino acid residues of the dipeptide. [15] Further, the reaction was fastest when no gap was left between the double helices formed by the peptide‐bearing strand and the strand coupling at its 3′‐terminus. Even more surprisingly, the sterically demanding leucinylleucine dipeptide sequence (LeuLeu) gave the fastest coupling, not GlyGly with its unencumbered chain. [15] A similar finding was made when dipeptidoyl RNAs were allowed to form in ribosome‐free single‐ nucleotide translation assays, in which a 3′‐aminoacylated transfer‐nucleotide couples to a 3′‐aminoacidyl primer on an RNA strand acting as primitive mRNA. [16] Again, sterically demanding amino acid residues like Val and Leu were among the species reacting fast and efficiently. These unexpected findings were particularly intriguing because of the relevance for the origin of the genetic code[ 17 , 18 , 19 , 20 ] and biological homochirality.[ 21 , 22 ]

We therefore initiated a project aimed at gaining insights into the three‐dimensional structure of peptidoyl RNAs. Building on our earlier work involving NMR‐monitored binding studies with primer‐template systems,[ 23 , 24 , 25 ] we designed hairpin 1 (Figure 1) as our target molecule. This peptidoyl hairpin features a LeuLeu dipeptide in the backbone of the nucleic acid strand and a GAAA tetraloop[ 26 , 27 ] to connect the termini of what would otherwise be a primer and a template. It is small enough to be studied by NMR without having to resort to isotopically enriched building blocks. Here we report the synthesis of 1, together with the results of an exploratory study on its folding properties.

Figure 1.

Figure 1

Structure of target molecule 1.

Results and Discussion

It was decided that the assembly of the RNA portion of 1 was to occur via solid‐phase chain assembly, using the TBDMS method. [28] The dipeptide H‐Leu‐Leu‐OH is available from a commercial source, obviating the need to prepare it by Boc‐based solution‐phase synthesis. The main synthetic issue was thus to set up the amide and the phosphoramidate linkages. Figure 2a shows key structures of three synthetic strategies that were explored experimentally.

Figure 2.

Figure 2

Retrosynthetic considerations and key intermediates of synthetic paths for the preparation of peptidoyl RNA hairpin 1.

Three different approaches for the synthesis of peptidoyl RNA were tested. The first approach involved the assembly of the 5′‐terminal segment, including the dipeptide at its 3’‐terminus (2), and coupling this species to dinucleotide 3 (path I, Figure 2). To this end, an N‐trimethylsilylethyloxycarbonyl (Teoc) protected leucinylleucine was coupled to the 3’‐position of 3′‐amino‐2′,3′‐dideoxycytidine, and the resulting nucleoside was immobilized via a linker to its N4 position on controlled‐pore glass (cpg), followed by RNA chain assembly. The Teoc group was known to be stable during the steps of phosphoramidite‐based chain assembly and to be removable with TBAF. [29] While the overall yield of the peptidyl nucleoside was 17 %, and the loading of the support was 10 μmol/g, the final coupling of 2 to a preactivated dinucleotide gave no more than traces of 1, as determined by MALDI‐TOF analysis of the reaction mixture. Attempts to improve the yield were unsuccessful, and path I was not pursued further.

The second approach used path II of Figure 2. Here, the N‐Teoc protected 3′‐dipeptidyl 3′‐amino‐2′,3′‐dideoxycytidine residue was N4 acetylated and 5′ DMT‐protected, followed by removal of the Teoc group with TBAF, to give nucleosidic building block 4. Separately, an AA dimer was synthesized by automated RNA synthesis and then 5′‐phosphitylated, followed by partial hydrolysis with dicyanoimidazole/water to obtain solid‐support bound H‐phosphonate 5. The phosphoramidate linkage was established by redox condensation using CBr4 and NEt3, as previously described for DNA. [30] Unfortunately, no more than a loading of 3.4 μmol per gram of cpg was achieved, so that subsequent elaboration of the 5′‐terminal RNA segment by solid‐phase synthesis gave too little product for NMR spectroscopy.

The preferred synthesis followed path III of Figure 2. It uses a template‐directed coupling of a dipeptidoyl dinucleotide to 3′‐aminoterminal hairpin 6 as the final step. The elaboration of 6 via solid support 7 is shown in Scheme 1. As mentioned for the earlier two paths, above, 3′‐amino‐2′,3′‐dideoxycytidine (8) was the starting point. The aminonucleoside was Teoc‐protected to give 9, which was then tritylated with DMT−Cl in modest yield to afford doubly protected 10. The remaining N4 amino group of the nucleobase was then used as anchoring point for immobilization on controlled‐pore glass, using the methodology of Brown and coworkers. [31] To accomplish the immobilization of the nucleoside, long‐chain alkylamine controlled pore glass (lcaa‐cpg) was first treated with succinic anhydride in the presence of DMAP to give 11 and then with 10 in the presence of DIC/HOBt. The nucleoside loading of 7 was found to be 17 μmol/g by trityl assay, and chain assembly by automatic synthesis proceeded uneventfully to give hairpin 6 after basic deprotection followed by Teoc removal with fluoride.

Scheme 1.

Scheme 1

Synthesis of 3′‐aminoterminal hairpin 6. Conditions: a) 1. Teoc‐succinimide, DIPEA, DMF/H2O; b) DMT−Cl, DMAP, pyridine; c) DIC, HOBt, pentachlorophenol, pyridine; d)  1. RNA chain assembly with commercial phosphoramidite building blocks, BPG=ABz, CAc or GiBu, 2. AMA, 55 °C, 3. TBAF, THF.

The synthesis of the dipeptido dinucleotide portion was performed using a modification of the method of Räuchle et al., [12] as shown in Scheme 2. Ribonucleotide dimer 3 was converted to HOBt ester 12, which was then coupled with leucinylleucine to yield 13 in 48 % overall yield. Coupling between the hairpin and the activated dipeptido dinucleotide to form peptidoyl RNA 1 was then performed in template‐directed fashion in aqueous condensation buffer at 0 °C. [13] That the coupling of 13 to a 3′‐aminoterminal nucleotide is a template‐directed reaction had previously been demonstrated experimentally (see Figure 3 in Ref. [15]). While monitoring in the present case indicated well over 50 % conversion to product, purification proved challenging. The current protocol for HPLC, using a gradient of acetonitrile in ammonium bicarbonate buffer on a C18 phase gave a yield of 18 % of spectroscopically pure 1. Figure 3 shows a MALDI‐TOF mass spectrum and an excerpt of the one‐dimensional 1H‐NMR spectrum of our target compound.

Scheme 2.

Scheme 2

Elaboration of dipeptido dinucleotide 13, and assembly of hairpin 1 by coupling with amino‐terminal 2. a) HOBt, EDC, pH 6, followed by precipitation; b) leucinylleucine, H2O, pH 8.3; c) EDC in 200 mM NaCl, 50 mM MOPS, pH 7.4, 0 °C.

Figure 3.

Figure 3

Characterization of 1. A) MALDI‐ToF mass spectrum, and B) region of the 1H‐NMR spectrum in D2O, 50 mM phosphate buffer, 100 mM NaCl, pH 7.0, 900 MHz, 25 °C showing resonances from the riboses and H5 protons of pyrimidines.

Since the NMR spectrum showed well resolved signals and a dominant conformation, a series of two‐dimensional NMR spectra was measured for 1, 6 and unmodified RNA hairpin sequence 5′‐UUGGCGAAAGCCAA‐3′ (14) as control, allowing for assignment of key resonances for all residues of 1. Figure 4 shows an excerpt of the NOESY spectrum with assignments indicated on the margins. Additional spectra can be found in the Supporting Information. Well‐resolved and sufficiently intense cross peaks were observed for the core RNA portion of the hairpin (G3‐dC12) to be identified as folded according to our design, with the expected base pairing for this part of hairpin 1 at room temperature (25 °C). Two intense and one weak imino resonance were observed in the low‐field region of spectra measured in H2O/D2O (9 : 1), as expected for a stem with two shielded and one exposed G : C base pair (see Figure S18, Supporting Information).

Figure 4.

Figure 4

Section of the NOESY‐spectrum of 1 in D2O, 50 mM phosphate 100 mM, NaCl, pH 7.0, 900 MHz, 25 °C, and a mixing time of 300 ms showing cross peaks between anomeric protons and H2’ protons used for assignment.

For the peptide segment, the situation was less clear. For the 3’‐terminal region, with its weakly pairing UU/AA dinucleotides and the LeuLeu portion, a single, stably folded conformation would probably not lead to the combination of cross peaks observed (Figure 5). Both internucleotide NOESY cross peaks between dC12 and A13, and between the amino acid residues of LeuLeu and A13 were observed. The former are expected for an uninterrupted, continuously stacked double helix, whereas the latter point to a partially unwound helix. Further, attempts to generate a three‐dimensional structure by restrained torsion angle molecular dynamics, using the available NOE constraints and protocols previously applied to similar short helices,[ 32 , 33 , 34 ] did not give any violation‐free structures despite numerous computational runs. Therefore, we suspected that 1 was disordered or incompletely folded in its terminal region at 25 °C.

Figure 5.

Figure 5

Excerpts of 1H‐NOESY NMR spectrum of 1 showing cross peaks from amino acid residues of LeuLeu or the sugar of dC12 to a nucleotide of the 3’‐terminal dinucleotide, with NOEs indicated as grey double‐headed arrows in the structural drawings. A) NOEs to H8 of A13, and B) NOEs to H5’/5’’ of A13.

This hypothesis was corroborated by the results of a UV‐melting study. When hairpin 1 was subjected to thermal denaturation, a biphasic curve was obtained (Figure 6A). The lower transition had its point of inflection at 21 °C, as seen in the maximum of the first derivative, and the higher temperature transition gave a melting point of 70 °C. By comparison, unmodified RNA hairpin 5′‐UUGGCGAAAGCCAA‐3′ (14) showed a single transition, with a melting point of 83 °C, as expected for a cooperative folding/disassembly process involving the entire stem, including the terminal UU/AA segment. (Figure 6B). Melting curves at other salt concentrations, as well as melting curves with pruned versions of 1, including 6, confirmed these findings and are presented in chapter 3 of the Supporting Information. Biphasic transitions were observed for 1, independent of the NaCl content, and a single transition was found for 14. Together they indicate that the core portion of 1, including its GAAA tetraloop, folds into a very stable motif, but that the LeuLeu dipeptide segments prevents the AA:UU duplex from folding cooperatively with the remainder of the hairpin when the dipeptide is inserted in the RNA backbone. Instead, the terminal segment now melts independently, with the transition occurring at or below room temperature. Line broadening observed in the 1H NMR spectra for the H8 signals of A13 and A14 upon cooling from 25 °C to 3 °C (Figure S19, Supporting Information) again confirm this hypothesis.

Figure 6.

Figure 6

UV‐Melting curves of hairpins with first derivatives shown as grey broken lines. A) Peptidoyl hairpin 1, and B) control hairpin 5′‐UUGGCGAAAGCCAA‐3′ (14). Conditions: 5 μM hairpin, in 50 mM phosphate buffer, pH 7, measured at 260 nm and a heating rate of 1 °C/min; first derivatives generated in QtiPlot.

These findings are interesting because they indicate that reactions anchoring the C‐terminus of peptidoyl RNA readily produce products in template‐directed fashion that are much less stable thermodynamically than an uninterrupted double helix. [15] Likewise, the dipeptidoyl RNA products of single‐nucleotide translation with sterically demanding aliphatic amino acid residues [16] may not be formed due to product‐development control. Rather, their formation with faster rates than those of less sterically demanding residues [16] may be the consequence of kinetic control.

Conclusions

The results presented here show how RNA‐peptide hybrid molecules with the peptide embedded in the backbone of the RNA can be prepared on a scale suitable for structural biology studies by a combination of solid‐phase chain assembly and a template‐directed peptide coupling in aqueous solution. The method should allow for the preparation of a range of different sequences and chain lengths, allowing for an exploration of the chemical and structural properties of peptidoyl RNAs, and their possible role in the prebiotic phase of evolution. This may include structural studies employing X‐ray crystallography. One of the questions that will be interesting to address is whether other peptide sequences have a smaller duplex‐destabilizing effect than the LeuLeu dipeptide studied here. Studies on DNA with amino acids incorporated in the backbone suggest that this may be the case. [35] Independent of prebiotically motivated studies, though, our results demonstrate the power of template‐directed reactions in synthetic chemistry, complementing better‐known usages of such reactions in enzyme‐free genetic copying. [36]

Experimental Section

Protocols, synthetic schemes, and additional spectra can be found in the Supporting Information.

Conflict of interest

The authors declare no conflict of interest.

1.

Supporting information

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supporting Information

Acknowledgements

The authors thank Franziska Welsch for experimental contributions during an early phase of the project. This work was supported by Volkswagen Foundation (grant Az 92 768) and Deutsche Forschungsgemeinschaft (DFG) project ID 364653263‐TRR 235. Work at BMRZ is supported by the State of Hesse. Open Access funding enabled and organized by Projekt DEAL.

Dedicated to Ulf Diederichsen

J. Bremer, C. Richter, H. Schwalbe, C. Richert, ChemBioChem 2022, 23, e202200352.

Data Availability Statement

The data that support the findings of this study are available in the supplementary material of this article.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supporting Information

Data Availability Statement

The data that support the findings of this study are available in the supplementary material of this article.


Articles from Chembiochem are provided here courtesy of Wiley

RESOURCES