Skip to main content
eLife logoLink to eLife
. 2018 May 15;7:e35255. doi: 10.7554/eLife.35255

Ribozyme-catalysed RNA synthesis using triplet building blocks

James Attwater 1, Aditya Raguram 1,, Alexey S Morgunov 1, Edoardo Gianni 1, Philipp Holliger 1,
Editor: Anna Marie Pyle2
PMCID: PMC6003772  PMID: 29759114

Abstract

RNA-catalyzed RNA replication is widely believed to have supported a primordial biology. However, RNA catalysis is dependent upon RNA folding, and this yields structures that can block replication of such RNAs. To address this apparent paradox, we have re-examined the building blocks used for RNA replication. We report RNA-catalysed RNA synthesis on structured templates when using trinucleotide triphosphates (triplets) as substrates, catalysed by a general and accurate triplet polymerase ribozyme that emerged from in vitro evolution as a mutualistic RNA heterodimer. The triplets cooperatively invaded and unraveled even highly stable RNA secondary structures, and support non-canonical primer-free and bidirectional modes of RNA synthesis and replication. Triplet substrates thus resolve a central incongruity of RNA replication, and here allow the ribozyme to synthesise its own catalytic subunit ‘+’ and ‘–’ strands in segments and assemble them into a new active ribozyme.

Research organism: None

eLife digest

Life as we know it relies on three types of molecules: DNA, which stores genetic information; proteins that carry out the chemical reactions necessary for life; and RNA, which relays information between the two. However, some scientists think that before life adopted DNA and proteins, it relied primarily on RNA.

Like DNA, strands of RNA contain genetic data. Yet, some RNA strands can also fold to form ribozymes, 3D structures that could have guided life’s chemical processes the way proteins do now. For early life to be built on RNA, though, this molecule must have had the ability to make copies of itself.

This duplication is a chemical reaction that could be driven by an ‘RNA replicase’ ribozyme. RNA strands are made of four different letters attached to each other in a specific order. When RNA is copied, one strand acts as a template, and a replicase ribozyme would accurately guide which letters are added to the strand under construction. However, no replicase ribozyme has been observed in existing life forms; this has led scientists to try to artificially create RNA replicase ribozymes that could copy themselves.

Until now, the best approaches have assumed that a replicase would add building blocks formed of a single letter one by one to grow a new strand. Yet, although ribozymes can be made to copy straight RNA templates this way, folded RNA templates – including the replicase ribozyme itself – impede copying. In this apparent paradox, a ribozyme needs to fold to copy RNA, but when folded, is itself copied poorly. Here, Attwater et al. wondered if choosing different building blocks might overcome this contradiction.

Biochemical techniques were used to engineer a ribozyme that copies RNA strands by adding letters not one-by-one, but three-by-three. Using three-letter ‘triplet’ building blocks, this new ribozyme can copy various folded RNA strands, including the active part of its own sequence.

This is because triplet building blocks have different, and sometimes unexpected, chemical properties compared to single-letter blocks. For example, these triplets work together to bind tightly to RNA strands and unravel structures that block RNA copying.

All life on Earth today uses a triplet RNA code to make proteins from DNA, and these experiments showed how RNA triplets might have helped RNA sustain early life forms. Further work is now needed to improve the ribozyme designed by Attwater et al. for efficient self-copying.

Introduction

The premise that some RNA sequences can catalyse and template their own replication - reciprocally synthesizing their own ‘+’ and ‘–’ strands - underpins current thinking about early genetic systems (Crick, 1968; Orgel, 1968; Szostak et al., 2001). Any ancient ribozyme with such RNA replicase capability seems to be lost, but efforts are ongoing to recreate RNA self-replication in the laboratory (Martin et al., 2015) as a critical test of the ‘RNA world’ hypothesis (Gilbert, 1986). Early on, derivatives of naturally occurring self-splicing introns (Doudna et al., 1991; Green and Szostak, 1992; Hayden and Lehman, 2006) as well as later in vitro evolved ligase ribozymes (Lincoln and Joyce, 2009; Sczepanski and Joyce, 2014) were shown to be able to assemble one of their own strands from cognate constituent RNA segments. However, a critical drawback of such systems is their need for specific preformed building blocks of at least eight nucleotides (nt) average length, limiting their potential for open-ended evolution, and precluding their replication from pools of random-sequence oligonucleotide substrates (Green and Szostak, 1992; Doudna et al., 1993).

In a contrasting approach, RNA polymerase ribozymes (RPRs) have been developed that can use general monomer building blocks (ribonucleoside 5’ triphosphates (NTPs)) in RNA-templated RNA synthesis (Johnston et al., 2001; Zaher and Unrau, 2007; Wochner et al., 2011; Attwater et al., 2013b; Horning and Joyce, 2016), akin to the activity of modern proteinaceous polymerases. However, even the most highly-evolved RPRs (Horning and Joyce, 2016) are substantially impeded by template secondary structures. Such structures are ubiquitous in larger, functional RNAs (including the RPRs themselves) and generally indispensable for function. The strong inhibitory role of this central feature of RNA leads to an antagonism between the degree to which an RNA sequence is able to fold into a defined three-dimensional structure to encode function (such as catalysis) and the ease with which it can be replicated (Boza et al., 2014). This ostensible ‘structure vs. replication’ paradox would have placed stringent probability constraints on the emergence of an RNA replicase and generally impeded the ability of RNA to function as an early genetic polymer.

We wondered whether this paradox might be avoided through a re-consideration of plausible building blocks for early RNA replication. Models of non-enzymatic polymerisation of all four activated ribonucleotides – the presumed source of the first RNA sequences – yield pools of di-, tri- and tetranucleotide etc. length oligonucleotides (in decreasing abundance) dominating the population alongside longer products (Monnard et al., 2003). Here, we have examined whether substrates of such lengths can support RNA-catalyzed RNA replication, by developing a ribozyme capable of iterative templated ligation of 5’-triphosphorylated RNA trinucleotides (henceforth called triplets). This heterodimeric triplet polymerase ribozyme demonstrated a striking capacity to copy a wide range of RNA sequences, including highly structured, previously intractable RNA templates, as well as its own catalytic domain and encoding template in segments. Its characterization revealed emergent properties of triplet-based RNA synthesis, including cooperative invasion and unraveling of stable RNA structures by triplet substrates, bi-directional (both 5’−3’ and 3’−5’) and primer-free (triplet-initiated) RNA synthesis, and fidelity augmented by systemic properties of the random triplet pools.

Results

In vitro evolution of triplet polymerase activity

We set out to explore the potential of short RNA oligonucleotides as substrates for RNA-catalyzed RNA replication. To do this, we required a ribozyme capable of general, iterative RNA-templated oligonucleotide ligation. Previously-described RNA polymerase ribozymes such as the ‘Z’ RPR (Wochner et al., 2011) can use NTPs to iteratively extend a primer hybridized to an RNA template, but do not accommodate oligonucleotides bound downstream of the primer or accept them as substrates. However, we detected a weak templated ligation activity in a truncated version of the Z RPR comprising its catalytic core domain (Zcore) (Figure 1a), which supported incorporation of oligonucleotide substrates as short as three nt (Figure 1—figure supplement 1) when incubated in the eutectic phase of water ice (Attwater et al., 2010; Mutschler et al., 2015).

Figure 1. Monomer polymerisation and triplet polymerisation.

(A) Scheme outlining initial derivation of a triplet polymerase activity from a mononucleotide polymerase ribozyme via directed evolution. Z RPR truncation effects are shown in Figure 1—figure supplement 1, the selection cycle is outlined in Figure 1—figure supplement 2, and the selection conditions of rounds 1–7 are listed in Figure 1—source data 1. Below, modes of action and secondary structures of the mononucleotide polymerase ribozyme (Z RPR) and a triplet polymerase ribozyme (0core), both depicted surrounding primer (tan)/template (grey) duplexes with a mononucleoside triphosphate (NTP) or trinucleotide triphosphate (triplet) substrate present (red). Here, the templates are hybridised to the ribozyme upstream of the primer binding site, flexibly tethered to enhance local concentration and activity (via L repeats of an AACA sequence, for example L = 5 in templates SR1-4 below). Z RPR residues comprising its catalytic core (Zcore) are black; mutations in 0core arising from directed evolution of Zcore are in teal. (B) Primer extension by the Z RPR using monomers (1 mM NTPs) or by 0core using triplets (5 μM pppAUA and pppCGC), on a series of 6-nucleotide repeat templates (SR1-4, examples below) with escalating secondary structure potential that quenches Z RPR activity beyond the shortest template SR1 (−7˚C ice 17 days, 0.5 μM/RNA). Extension by the triplet polymerase ribozyme 0core can overcome these structure tendencies up to the longest template SR4. (C) Triplet concentration dependence of extension using templates SR1 (grey circles) and SR3 (black triangles) by 0core (pppAUA and pppCGC, 0.1 μM of primer A10, template and ribozyme, −7˚C supercooled 15 days, ± s.d., n = 3); shown below is a model of cooperative triplet-mediated unfolding of template SR3 structure to explain the sigmoidal triplet concentration dependence (red curve) of extension upon it. Numerical values are supplied in Figure 1—source data 2.

Figure 1—source data 1. Selection conditions of rounds 1–7.
DOI: 10.7554/eLife.35255.006
Figure 1—source data 2. Triplet concentration-dependent extension values.
DOI: 10.7554/eLife.35255.007

Figure 1.

Figure 1—figure supplement 1. Templated ligase activity from a mononucleotide polymerase.

Figure 1—figure supplement 1.

Extension of primer A11 on template HTI by the Z RPR (Wochner et al., 2011) and a 3’ truncation of it (Zcore, Figure 1a) at −7˚C in ice for 8 days (0.5 μM of each RNA), adding as substrates either a long oligonucleotide (‘11 nt’, pppGCGAAGCGUGU at 0.5 μM), a triplet (‘3 nt’, pppGCG at 5 μM), or NTPs (‘1 nt’, at 1 mM each). Gel densitometry was used to calculate the percentage of primer extended (shown below each lane). Z RPR has only a very limited template ligation activity (slightly exceeding background nonenzymatic ligation [Rohatgi et al., 1996]). In contrast, 3’ truncation (which abolished mononucleotide polymerase activity) accommodated and enhanced templated ligation.
Figure 1—figure supplement 2. Selection scheme for in vitro evolution of triplet polymerase activity.

Figure 1—figure supplement 2.

Illustration of the in vitro evolution strategy to enrich iterative templated ligase activity from libraries of selection constructs. Centre, an example selection construct is shown comprising Zcore with a 3’ 30 nt random domain followed by a fixed hairpin-forming reverse transcription site (1). This construct is flexibly tethered at its 5’ in cis (Tagami et al., 2017; Attwater et al., 2010) (via L repeats (3-8) of a flexible AACA linker) to a template region (grey) bound to 5’-dual biotin modified primer (tan). This construct requires successive polymerisation of two pppUCG triplet substrates (red) in −7˚C ice followed by self-ligation to its triphosphorylated 5’ terminus to attach itself to the biotinylated primer (2). Active triplet polymerases are then identified and size-purified by denaturing PAGE to deplete unreacted ribozymes and primer (3), then captured on streptavidin beads and subjected to a denaturing wash to ensure covalent linkage to primer. Remaining dual-purified bead-linked ribozymes were then reverse transcribed (4). cDNAs were eluted with NaOH and neutralised, before acting as templates for ‘+’ strand DNA synthesis from 5’ biotinylated ‘rescue’ oligonucleotides that only prime correctly on the complete synthesised sequence (5), ensuring that constructs that deviated from the specified templated triplet polymerization fail to prime on cDNA preventing their ‘+’ strand synthesis. ‘+’ strands were then captured on fresh streptavidin beads, stripped of cDNAs, reamplified and diversified to generate selection construct for a new round of selection (6). See Materials and Methods for protocol details and Figure 1—source data 1, Figure 2—source data 1, Figure 4—source data 1 and Supplementary file 3 for oligonucleotides used in selections.

To be able to properly examine such RNA trinucleotide triphosphates (triplets) as replication substrates, we first sought to convert Zcore into an effective triplet polymerase ribozyme using in vitro evolution. We devised a selection strategy that required iterative templated triplet ligation by ribozymes to achieve their covalent linkage to a tagged primer (Figure 1—figure supplement 2). This enables their recovery, amplification and mutagenesis before further rounds of selection to enrich the selection pool in improved triplet polymerase ribozyme variants.

We initiated selections from a library of 1.5 × 1015 Zcore variants with a new random 3’ N30 region under eutectic phase conditions that increase RNA half-life and enhance ribozyme activity (Attwater et al., 2010;Attwater et al., 2013b). After 7 rounds of in-ice evolution, one-quarter of the selection pool comprised an improved ribozyme (type 0). Its core domain (0core, Figure 1a) could catalyse the iterative polymerization of multiple triplets allowing us to begin to investigate the properties of triplet-based RNA replication.

Significantly, we found that 0core could catalyze triplet polymerisation on a series of structured templates, which had proven intractable to the parental Z RPR (Figure 1b). Here, primer extension exhibited a steep sigmoidal dependence upon triplet concentrations (Figure 1c), suggestive of a cooperative invasion and unraveling of template secondary structures by the triplet substrates themselves. Although still inefficient, the fact that the nascent activity of the 0core ribozyme could already copy templates that had confounded an established RPR encouraged us to continue to seek improved triplet polymerase ribozymes to leverage this substrate behaviour.

Emergence of cooperativity and characterisation of a ribozyme heterodimer

We continued selections for a further 14 rounds. At this point, the type 0 ribozyme had gone extinct, replaced by six new types of RNA each characterised by a unique 3’ domain (Figure 2a, Figure 2—figure supplement 1). Type 1 RNAs were the most abundant, comprising ~50% of pool sequences, but mysteriously were catalytically inactive with diverse mutations in their core domains. In contrast, the type 2–6 RNAs all displayed triplet polymerase activity, but fell short of the polyclonal activity of the selection pool (Figure 2—figure supplement 2). To attempt to explain this discrepancy, we explored potential interactions among the different pool lineages, and found that addition of an equimolar amount of type 1 RNA substantially enhanced triplet polymerase activity of all the other ribozyme types 2–6 (Figure 2b).

Figure 2. Emergence of cooperativity during in vitro evolution.

(A) Composition of the round 21 selection pool as a % of total pool sequences; selection conditions of rounds 8–21 are listed in Figure 2—source data 1. Secondary structures of ribozyme type 1–6 archetypes are shown in Figure 2—figure supplement 1, and comparison of their activities to that of the polyclonal selection pool is shown in Figure 2—figure supplement 2. (B) Primer extension with triplets by these emergent triplet polymerase ribozyme types 2–6 (‘Rz’, alongside the starting Zcore ribozyme with ‘cap+’ sequence from selection, see Figure 3a), alone or with added truncated type 1 (+1, see Figure 3a) which boosted their triplet polymerase activities (0.5 μM Rzs/A10 primer/SR3 template, 5 μM pppAUA and pppCGC, −7˚C ice 16 hr).

Figure 2—source data 1. Selection conditions of rounds 8–21.
DOI: 10.7554/eLife.35255.011

Figure 2.

Figure 2—figure supplement 1. Secondary structures of type 1–6 ribozymes.

Figure 2—figure supplement 1.

The archetypes shown are active truncations of the most common/enriched variants of each type. Types 2–6 are depicted hybridised to a flexibly tethered primer (tan)/template (grey)/substrate (red) duplex analogous to the selection construct (Figure 1—figure supplement 2), supporting extension on the SR1-4 templates (Figure 2b). Coloured bases deviate from Zcore or derive from the random sequence region. Type 1, the most common type in the output, displayed a high degree of sporadic mutation in the catalytic core domain, consistent with a loss of catalytic function. Shown here are the four most common core mutations, which were each present in >45% of type 1 sequences.
Figure 2—figure supplement 2. Clonal versus polyclonal activity.

Figure 2—figure supplement 2.

Type 1 and 2 RNAs, and the indicated polyclonal ribozyme pools late in the selection, were transcribed as selection constructs generated with Tri8AUAM (then annealed to primer A10). Their abilities to incorporate eight pppAUA triplets then self-ligate in this selection construct context were compared (left, 0.1 μM each RNA, 2 μM pppAUA, in −7˚C ice for 5 days; right, 0.2 μM each RNA, 2 μM pppAUA and pppAUG3’d, in −7˚C ice for 9 days). Left: Despite being the most active type (Figure 2b), type two activity fell far short of the round 19 pool polyclonal activity. Right: The incorporation of wobble pairing 3’-deoxy ‘terminator’ triplet pppAUG3’d (see results section on fidelity, yielding a faster-migrating band than incorporation of the cognate pppAUA triplet) was decreased sharply after round 19, as calculated by densitometry and averaging amongst the ligation junctions in each lane, indicative of enrichment of higher fidelity triplet polymerases in the selection pool.

Dissecting type 1 RNA function, we found that 5’ truncation of the region that previously contacted the primer/template duplex (Shechner et al., 2009) did not affect its cofactor activity (Figure 3a, Figure 3—figure supplement 1). As judged by gel mobility shift (Figure 3b) and activity enhancement (Figure 3—figure supplement 1), type 1 RNA appears to form a 1:1 heterodimeric complex directly with active triplet polymerase ribozymes. Our attention was drawn to their selection construct-derived 5’ hairpin elements, which differed between active triplet polymerases (‘cap+’, Figure 3a, Figure 2—figure supplement 1) and the most common type 1 variants in the selection pool where this hairpin had acquired a mutation (yielding ‘cap’, Figure 3a). ‘cap–’ was dispensible for type 1’s cofactor activity, but when replacing ‘cap+’ in active triplet polymerases it abolished both their activity enhancement by type 1 (Figure 3—figure supplement 1) and complex formation (Figure 3b). This points to the ‘cap+’ hairpin as the critical site of interaction with type 1; ‘cap–’ in type 1 presumably served to deter its homodimerisation during selection.

Figure 3. Heterodimer formation and behaviour.

(A) Secondary structures of the most common type 1 and type 5 clones from the selection, with in vitro-selected 3’ domain and core mutations coloured orange (type 1) or blue (type 5). 5’ truncation of type 1 (faded, including its putative primer/template interacting region), yielding the minimal type 1 variant ‘1’, maintained its enhancement activity (see Figure 3—figure supplement 1). The effects of transplanting the indicated 5’ hairpin ‘cap+’ element from type 5 to other ribozymes are shown in Figure 3—figure supplement 2. The inset shows type 5 ‘cap+’ hairpin element alteration to ‘cap–’ (yielding type 5cap-). (B) Gel mobility shift characteristic of complex formation resulting from mixing of type 5 ribozyme with type 1 RNA (equimolar, or with the indicated equivalents). Type 5cap- loses this shift and its susceptibility to type 1 activity enhancement (Figure 3—figure supplement 1). Below, shifted band intensities with increasing type 1 addition are plotted (quantified relative to the indicated type 5+1 lane intensities, n = 4 ± s.d.), signifying 1:1 heterodimer formation; numerical values are supplied in Figure 3—source data 1. (C) Type 1 enhancement allows type 5 variants to synthesise long RNAs using triphosphorylated oligonucleotide (11 nt) or short triplet (3 nt) substrates (Sub, 3.6 or 5 μM each, substrate sequences in grey beside lanes; 0.4 μM primer A11/template I-8, 2 μM each Rz, −7˚C ice for 16 days). This activity is independent of template tethering (Wochner et al., 2011), as comparable synthesis is achieved by versions of type 5 whose 5’ regions allow or avoid hybridisation to the template (type 5cis or type 5trans respectively, schematic above, sequences in Supplementary file 1). The average extent of ligation at the end of the reaction amongst all junctions in a lane is shown beneath each lane.

Figure 3—source data 1. Relative intensities of shifted bands when varying type 1 equivalents.
DOI: 10.7554/eLife.35255.015

Figure 3.

Figure 3—figure supplement 1. Parameters of type 1 activity enhancement.

Figure 3—figure supplement 1.

(A) PAGE of primer extension reactions comprising combinations of type 5 and type 1 variants (0.2 μM primer A10, template SR3, and type 5 variant, here annealed together and combined with 0.2 μM of separately-annealed type 1 variant after buffer addition; −7˚C in ice for 63 hr, 2 μM each pppAUA, pppAUG3’d, pppCGC). Full-length type 1 enhanced the activity of type 5 as did 5’ truncated type 1 (‘1’, Figure 3a) without its duplex-interacting A-minor motif (Shechner et al., 2009) and its ‘cap−’ sequence (originally present on some type 1 sequences in the selection). Replacing the type 5 ribozyme’s selected 5’ ‘cap+’ sequence with ‘cap-’ (type 5cap-) abolished its enhancement by type 1 RNA. (B) Stoichiometry of type 1 enhancement of type 3 activity (0.5 μM primer A10, template SR3, 2 μM each pppAUA, pppGUA3’d, pppCGC in −7˚C ice for 20 hr); quantification of average extension per primer in each lane is consistent with the 1:1 stoichiometry implied by type 1’s ~ 50% abundance in the round 21 selection pool.
Figure 3—figure supplement 2. Type 1 enhancement of parental ribozymes.

Figure 3—figure supplement 2.

Left, transplanting the conserved 5’ ‘cap+’ sequence from types 2–6 allows the Zcore but not the 0core ribozyme triplet polymerase activity to be enhanced by type 1; shown are PAGE of extensions of primer A10 on template CCCMisAUG by the indicated ribozyme cores (5 μM of pppAUG and pppCCC, 0.5 μM of each RNA with type 1 annealed and added separately, −7˚C in ice for 22 hr). Right, RNA polymerase activity of Z RPR using NTPs is enhanced by type 1 addition, but only when modified with the 5’ ‘cap+’ sequence (Z RPRcap+), extending primer A10 on tethered template HTI (0.5 μM of each RNA, 4 mM of each NTP, at 4˚C for 68 hr).

Indeed, transplanting the ‘cap+’ element could make the parental ribozymes (Zcore and Z RPR) receptive to activity enhancement by type 1 RNA (Figure 2b, Figure 3—figure supplement 2). The catalytically inert type 1 RNA thus represents a general, mutualistic RNA species. This molecular symbiont appears to have emerged spontaneously during in vitro evolution by forming a heterodimeric holoenzyme with triplet polymerase ribozymes, enhancing their activity to boost the recovery prospects of both complex components.

In complex with type 5 (the fastest enriching triplet polymerase ribozyme in the final selection pool), type 1 boosts polymerization of triplets (or longer oligonucleotides) to enable synthesis of long RNAs (Figure 3c). Here, it became apparent that type 1 also obviates the need for ribozyme-template tethering. Due to their poor affinity for primer/template duplex (Lawrence and Bartel, 2003), RPRs generally depend upon such tethering to template (Attwater et al., 2010; Wochner et al., 2011; Horning and Joyce, 2016), which enhances local ribozyme concentration and promotes formation of the RPR-primer/template holoenzyme (Attwater et al., 2010; Attwater et al., 2013a). In contrast, the triplet polymerase heterodimer appears to have a capacity for true intermolecular, sequence-general interaction with primer-template duplexes, which enables holoenzyme formation and copying of RNA templates without requiring specific ribozyme-template hybridization sites.

Secondary structure invasion by triplet substrates

We performed an additional five rounds of in vitro evolution to further evolve the type 5 triplet polymerase ribozyme (now in the presence of truncated type 1 RNA), diversifying the previously-fixed 3’ domain reverse transcription primer binding sequence. This reselection yielded a shorter final heterodimeric triplet polymerase holoenzyme, hereafter termed ‘t5+1’ (Figure 4). This robust triplet polymerase activity now proved suitable for exploring the scope and potential of triplet-based RNA replication.

Figure 4. A trans-acting heterodimeric triplet polymerase.

Top, scheme outlining derivation of the final t5+1 triplet polymerase archetype from the type 1 and type 5 RNAs (shown in Figure 3a) by reselection using the conditions in Figure 4—source data 1. Below, the secondary structure of this ribozyme heterodimer, 135 nt (1) and 153 nt (t5) long, is depicted operating in trans on a non-tethered primer/template duplex. Type 5 3’ domain bases that re-emerged after randomisation during reselection are coloured black in the t5 3’ domain. Ribozyme development is summarized in Figure 4—figure supplement 1; all ribozyme sequences are listed in Supplementary file 1.

Figure 4—source data 1. Selection conditions of rounds 1–5 of the reselection.
DOI: 10.7554/eLife.35255.017

Figure 4.

Figure 4—figure supplement 1. Summary of ribozyme development in this work.

Figure 4—figure supplement 1.

Left, map of triplet polymerase selection and engineering. Ribozyme sequences are listed in Supplementary file 1. Right, secondary structures of the central ribozymes in this work.

As a first examination of t5+1 activity, we revisited triplet-based RNA synthesis on structured templates. To provide a stringent test of template structure inhibition, we now examined hairpin-containing templates (4S, 6S, 8S) with increasing RNA hairpin stability and estimated TMs of up to 93˚C (8S). The latter had previously strongly arrested even the most advanced mononucleotide RPRs at higher temperatures (Horning and Joyce, 2016). However, using triplets as substrates t5+1 robustly copied all of these (Figure 5a), even when templates were pre-folded allowing RNA secondary structures to form prior to triplet addition (Figure 5—figure supplement 1). The triplet concentration-dependent cooperative structure invasion and unraveling (previously observed with the simpler 0core domain and partly wobble-paired RNA template structures [Figure 1b,c]) was recapitulated with t5+1 and the highly stable 8S hairpin template (Figure 5—figure supplement 2). In contrast, dinucleotide triphosphate substrates yielded extension only up to the structured region (Figure 5—figure supplement 2).

Figure 5. Triplet-mediated structured and functional template copying.

(A) Extension on three structured hairpin templates (4S, 6S, 8S) with increasing stability. Top, the mfold-predicted (Markham and Zuker, 2005) structure and TM of the most stable 8S template; below, primer extensions on these templates by t5+1 triplet polymerase (with 5 μM each encoded triplet) or type 1-enhanced Z polymerase ribozyme (Z RPRcap+ +1, with 1 mM each NTP) (2 μM ribozyme, 0.5 μM 4S, 6S or 8S template and primer A9, −7˚C ice 25 days). The self-complementary region in each template is indicated between each pair of lanes (shaded by triplet), with the encoded triplet substrate sequences at the left (in grey, with 5’ template overhangs in brackets). Syntheses using different substrate compositions and concentrations are shown in Figure 5—figure supplement 1 and Figure 5—figure supplement 2. While all hairpin templates are robustly copied by t5+1, synthesis by the Z RPR is completely arrested by the 6S and 8S hairpins. (B) Synthesis of the broccoli aptamer. The native secondary structure is shown above (Tan: bases from ‘+’ strand synthesis primer. Green: bases from triplets. Outlined green: primer binding site for the ‘−’ strand synthesis). Below left, t5+1-catalysed synthesis of fluorescent broccoli aptamer, run alongside standard (Broc+, synthesized by in vitro transcription), and stained for RNA with SYBR Gold (magenta) or folded with DFHBI-1T ligand (green fluorescence) (2 μM t5+1, 1 μM BBrc10/TBrc, 5 μM each triplet (in grey), −7˚C ice 22 days). Below right, ‘−’ strand synthesis on Broc + standard (0.5 μM without ligand in ribozyme extension buffer, 0.5 μM FBrcb6 primer, 2 μM t5+1, 5 μM each triplet (in grey), −7˚C ice 38 days). t5+1 is able to synthesise both full-length functional (fluorescent) Broccoli ‘+’ and encoding ‘–’ strands.

Figure 5.

Figure 5—figure supplement 1. Substrates for structured template copying.

Figure 5—figure supplement 1.

Primer extension reactions are shown using sets of defined triplet substrates (above the lanes), or random pools of all 64 triplets (pppNNN), each at the indicated concentrations on three templates with progressively longer, more stable secondary structure elements (right, TMs estimated using [Zuker, 2003]) (0.5 μM primer A9, 0.5 μM 4S, 6S, 8S templates, 2 μM t5+1,–7˚C ice for 25 days). Structure invasion and primer extension across the structured template region (beginning at the dotted red line on the image) occurred in all cases (with increased efficiency at higher substrate concentrations for the 8S template (5 µM)), including extensions where template secondary structures were allowed to form in the absence of triplets (*, triplets added last after 30 min supercooled at −7˚C).
Figure 5—figure supplement 2. Substrate concentration dependence of structured template copying.

Figure 5—figure supplement 2.

Extension on the 8S structured template with a defined set of complementary triplets (left) or dinucleotides (right), at the indicated substrate concentrations (0.5 μM primer A9, template 8S, 2 μM t5+1, –7 ˚C ice for 25 days). Left, quantification of the extent of extension into the structured template region below each lane (measured by the fraction of third triplet addition to primers already extended by two) recapitulates a sigmoidal relationship with triplet concentration. Right, dinucleotides (like mononucleotides) exhibit negligible extension into the template hairpin structure.

We began to explore whether triplet-based RNA synthesis by t5+1 might exhibit the generality required not just for synthesis of arbitrary structured sequences, but for replication of functional sequences (requiring synthesis of both ‘+’ and ‘−’ strands). Encouragingly, t5+1 could synthesise both a functional fluorescent ‘+’ strand of the 52 nt Broccoli RNA aptamer (Filonov et al., 2014) and its encoding ‘−’ strand template from their 13 (+) and 12 (−) different constitutive triplets (Figure 5b).

Ribozyme sequence self-synthesis and assembly

We next turned to the critical test of generality: could triplet substrates allow self-synthesis? As t5+1 currently lacks the efficiency to synthesise RNAs its own length, we divided the catalytic t5 ribozyme into five segments α, β, γ, δ and ε. This segmentation strategy (akin to that used by some RNA viruses e.g. influenza) could reduce tertiary structures (Doudna et al., 1991; Mutschler et al., 2015) and ease product separation during RNA replication (Szostak, 2012). Starting from ~8 nt RNA primers, t5+1 achieved synthesis of the β+, γ+, and δ+ segments from their constitutive triplets as well as all of the ‘−’ strand segments α-, β-, γ-, δ- and ε-, but required some triplets pre-linked (as e.g. hexa- or nonanucleotides) for synthesis of full-length α+ and ε+ segments (Figure 6a).

Figure 6. Ribozyme self-synthesis and assembly of its own catalytic domain.

(A) t5+1-catalysed syntheses of the five catalytic domain ‘+’ and ‘−’ segments via triplet extension of primers (grey) in −7˚C ice. Triplets are coloured by segment and shown alongside the lanes; longer oligonucleotide substrates (faded) were provided for α+ and ε+ syntheses to combat ribozyme-template pairing as shown in Figure 6—figure supplement 2 and Figure 6—figure supplement 3. The triplets were supplied at 5 μM (α+ to ε+), 10 μM (β to ε), or 20 μM (α) each, with 0.5 μM primer/template (P/T; 1 μM for Fβ6/Tβ) and oligonucleotides equimolar to template sites. Use of substrates of more heterogenous compositions and lengths is shown in Figure 6—figure supplement 4 and Figure 6—figure supplement 5, respectively. Densitometry gave yields of full-length products (boxed, by % of total primer), and a geometric mean of the final extents of ligation across all 70 junctions in this self-synthesis context (78%). These segment sequences derive from t5b, a t5 variant with a neutral signature mutation (Supplementary file 1). (B) Secondary structure representation of a t5 catalytic domain (αβ+/γδε+, t5b sequence), formed via non-covalent assembly of t5+1-synthesised ‘+’ strand fragments in Figure 6—figure supplement 1, coloured by segment and synthesis substrate as in (A). (C) Activity of ribozyme-synthesised αβ+/γδε+ (B), compared to protein-synthesised αβ+/γδε+ and full-length t5b equivalents. These were assayed for synthesis of a δ- strand segment on a ribozyme-synthesised δ+ template, with added in vitro transcribed type 1 (2 μM each Rz, 5 μM triplets, 0.5 μM P/T, −7˚C 0.25× ice 10 days). The ribozyme-synthesized and assembled αβ+/γδε+ ribozyme is as active as in vitro transcribed equivalents, and can efficiently utilize ribozyme-synthesized RNA (δ+) as a template.

Figure 6.

Figure 6—figure supplement 1. Ribozyme catalytic domain self-synthesis and assembly.

Figure 6—figure supplement 1.

Top, conditions and yields for self-synthesis and assembly of the catalytic t5 domain (as αβ+ and γδε+ fragments, Figure 6b) by t5+1. To obtain maximal amounts of fully synthesized and assembled t5b ribozyme for activity testing (Figure 6c), we implemented t5 ‘+’ strand synthesis using sequence-specified triplet substrates for segment syntheses (as shown). The synthesis scale corresponds to the limiting component present. *: Unlike other segment ligation steps, the δε ligation was carried out using freeze-thaw cycling (Mutschler et al., 2015) (2.5 hr at −30˚C, 21 hr at −7˚C, then 0.5 hr at 37˚C), which modestly improved yield. Below, scheme of t5+1-catalysed t5 synthesis and assembly reactions. PAGE separations of syntheses (xt) alongside purified reference segments and fragments (in bold) were stained with SYBR Gold to quantify boxed full-length products. These were excised for use in subsequent assembly steps (illustrated by dashed black arrows). Synthetic schemes of colour-coded segments are shown beside corresponding lanes, denoting ‘+’ strand synthesis (bold primer and dashes) on ‘–’ strand template. Fully synthesized segments are shown as bold lines. Assembly reactions involve use of fully synthesized segments as the final substrate in a primer extension reaction (α + β, γ + δε) or direct templated ligation of synthesized segments (δ + ε). The αβ+ and γδε+ fragments associate spontaneously to form an active triplet polymerase ribozyme (Figure 6b), tested for activity in Figure 6c. The synthesised segment sequences were derived from the t5b variant of t5, which differs by one neutral signature mutation from t5 in the α segment, and by another neutral signature mutation from t5a in the ε segment (mutations highlighted in Supplementary file 1); these neutral mutations, not present in the t5 and t5a used to synthesise the segments (above), allowed verification of the synthetic origin of the product fragments by sequencing, which revealed the correct signature mutations in each fragment ruling out contamination of the synthesized ‘+’ strand products by the synthesizing (‘+’ strand) triplet polymerase ribozyme.
Figure 6—figure supplement 2. Substrate competition attenuates inhibitory ε+- pairing during self-synthesis.

Figure 6—figure supplement 2.

(A) Longer oligonucleotide substrates are required to maximise synthesis of full-length ε+ segment (0.5 µM Fε9 primer/Tε template/oligonucleotide substrate and 5 µM each triplet (left, omitted where equivalent oligonucleotide substrate was present), 13 days in −7˚C ice). Replacing two triplets with a preformed hexanucleotide substrate (pppUGAAUG) boosts full-length ε+ product synthesis by t5+1 (left panel), but not by the type 6s+1 ribozyme - with a different accessory domain (see Figure 2—figure supplement 1, sequence in Supplementary file 1) - which synthesizes ε+ independent of this hexanucleotide (right panel, allowing some full-length ε+ synthesis with only triplets). This likely reflects unfavourable competition between the t5 ribozyme’s own ε+ domain and triplet substrates for complementary pairing to the ε- RNA used as template. Consistent with this hypothesis, increasing t5+1 concentrations with this template is counterproductive (middle panel). For both ribozymes, using a preformed nonanucleotide substrate pppUUUUUCAUG at the end of the template boosted full-length product over use of a pppUUCAUG hexanucleotide +pppUUU triplet, or just the three constituent triplets (S = 9 vs. 6/3 vs. 3/3/3); this is not due to the individual triplet sequences as t5+1 can efficiently incorporate both pppUUU and pppUUC (see Figure 5b, Figure 6a). (B) Doubling triplet concentrations to 10 µM can attenuate t5+1’s requirement for the pppUGAAUG hexanucleotide substrate during ε+ synthesis. This suggests that at higher triplet concentrations they successfully compete with this part of t5 for ε- template hybridisation (0.5 µM Fε9 primer/Tε template/longer oligonucleotide substrates, 13 days in −7˚C ice). With a fourfold excess of primer/template duplex and substrates (right lane), the amount of full-length segment generated exceeds the ribozyme added, evidence of multiple turnover of full-length ε+ product.
Figure 6—figure supplement 3. Ribozyme stabilisation attenuates inhibitory δ+- pairing during self-synthesis.

Figure 6—figure supplement 3.

Left, δ+ synthesis using 5 μM of each complementary triplet substrate (−hex) or 1 μM of pppUGACAU (+hex) replacing the final two triplets (0.25 μM primer Fδ7/template HTδ, 0.5 μM each ribozyme, 6 days in −7˚C ice). Type 5 with the initially isolated ε domain (type 5s) needs hexamer substrate for δ+ synthesis, but with the reselected ε domain (t5a) does not. See Supplementary file 1 for ribozyme sequences. Right, model of substrate/ribozyme competition for template binding. The region at the end of the δ segment exhibits irregular pairing in the initially isolated ε domain (above), and appears vulnerable to δ- template pairing during δ synthesis with triplet substrates. Reselection strengthened base-pairing in the corresponding ε domain stem (below), replacing a G-U wobble with a cognate G-C pair and a U.U mispair with a cognate U-A pair, stabilizing the δ - ε domain junction and potentially reducing competition with triplets during δ segment synthesis.
Figure 6—figure supplement 4. Ribozyme segment synthesis with random substrate pools.

Figure 6—figure supplement 4.

For maximum self-synthesis yield, we had used specific triplet substrate sets. Here we compare synthesis by 0.5 µM t5+1 of the five t5 ‘+’ segments (30 days in −7˚C ice) using specific triplets vs. random triplet pools or reduced G-content random triplet pools. Reactions included 5 µM of each triplet in specific triplet sets (‘tri’, as in Figure 6a+’ syntheses), random pppNNN, or a low-G pppNNN. The low-G pppNNN had the same overall triplet concentration as pppNNN, but individual triplets were five-fold less common for each constituent G (~10/2/0.4/0.08 µM for 0/1/2/3 Gs per triplet; some primer/template (P/T) concentrations were reduced here to ensure excess substrate over template). Synthesis is compared with and without some longer oligonucleotide substrates (indicated below, equimolar to template sites, replacing the corresponding triplets in ‘tri’ substrate mixes). Importantly, for all ‘+’ strand segments, full-length products are generated using pppNNN with yields approaching those obtained when using specific triplets (calculated by densitometry, above the gel); intriguingly the low-G pppNNN pools often gave superior yields, possibly helped by the higher concentrations of weaker-binding AU-rich triplets therein.
Figure 6—figure supplement 5. Ribozyme segment synthesis with mixed length substrate pools.

Figure 6—figure supplement 5.

δ+ (left) or γ+ (right) syntheses were performed (using 0.5 µM each of t5+1 and the primer/templates above, 15 days in −7˚C ice) with their constitutive triplet sets alone (*, see Figure 6a) or the random-sequence substrate mixes indicated. Introduction of dinucleotide and mononucleotide substrates (pppNN, pppN i.e. NTPs) decreases full-length product band intensity but not ligations performed (here quantified whilst assuming each extension product was formed from the fewest substrates possible), with increasing numbers of extension products deviating from the starting triplet register. Dinucleotides and mononucleotides appear poor substrates in the absence of triplets.

Operating across 70 distinct ligation junctions in these reactions including AU-rich sequences, t5+1demonstrates the sequence generality for self-synthesis using triplet substrates. Notably, the average extent of ligation per junction during synthesis of t5 ‘+’ and ‘−’ strands (78%) was similar to that observed when t5+1 used an unstructured model template (74%, Figure 3c) upon which the parental Z and other RPRs excel (Attwater et al., 2013b; Horning and Joyce, 2016).

At this point, we tested whether the broad oligonucleotide ligation capacity of t5+1 (Figure 3c) might allow assembly of synthesised ‘+’ strand segments. Indeed, t5+1 could assemble these into αβ+ and γδε+ fragments, guided only by partially overlapping ‘−’ strands (Figure 6b, Figure 6—figure supplement 1). Through non-covalent association (Vaish et al., 2003; Mutschler et al., 2015), the ribozyme-synthesised αβ+ and γδε+ fragments spontaneously reconstituted a new catalytically active triplet polymerase ribozyme (with in vitro transcribed type 1 RNA). We found that this synthesis product could regenerate fresh δ- segment using t5+1 ribozyme-synthesised δ+ (left over from ribozyme assembly) as a template (Figure 6c), recapitulating elements of a self-replication cycle. However, while the t5+1 ribozyme displays a nascent capacity for templated synthesis of its own catalytic domain ‘+’ strands (and ‘−’ strands), efficiency of both segment synthesis and assembly will need to be increased significantly to realise a full self-replication cycle (which would also require synthesis and replication of the type 1 subunit).

Primer-free RNA synthesis

Templated ‘+’ strand self-synthesis is a central element of ribozyme self-replication. However, a limitation of our above strategy in the context of triplet-based self-replication is the continued requirement for some pre-synthesized longer oligonucleotides to act as primers and occasional substrates (together providing here the equivalent of ~25% of triplet junctions pre-ligated). In particular, some specific oligonucleotide substrates were required for efficient synthesis of α+ and ε+ segments to compete out inhibitory mutual hybridisation between ‘−’ strand template and corresponding ‘+’ strand unstructured elements in the t5 ribozyme (Figure 6—figure supplement 2). In vitro selections that stabilise the ribozyme tertiary structure (Figure 6—figure supplement 3) may contribute to attenuating this requirement. Additionally, more concentrated triplet substrates can successfully compete with ribozyme unstructured elements for hybridization to ‘−’ strand templates (Figure 6—figure supplement 2).

The majority of specific oligonucleotides, however, were provided as primers to initiate syntheses, as required by all RPRs akin to the activity of replicative polymerases in biology. As a consequence of this, the priming sequence would effectively be excluded from evolution during replication. Furthermore, RNA oligonucleotides able to act as specific primers are unlikely to be prevalent in prebiotic substrate pools, and their depletion during successive replication cycles could lead to loss of sequence at genome ends. This ‘primer problem’ has previously been noted in the context of nonenzymatic replication (Szostak, 2012) as one of the fundamental obstacles to RNA self-replication.

Unexpectedly, triplet substrates provide a route to bypass the ‘primer problem’. We observed that t5+1 can extend primers bidirectionally, in both the canonical 5’−3’ as well as the reverse 3’−5’ directions (Figure 7a). This allows not only completion of RNA synthesis from either template end but also initiation from anywhere along a template, potentially allowing non-classical hierarchical or distributive RNA replication schemes as previously proposed (Szostak, 2011). Given this flexibility, we wondered if t5+1 even had a requirement for a primer oligonucleotide. Indeed, this triplet polymerase could achieve ‘primer free’ RNA synthesis (whereby synthesis is presumably initiated by ligation of adjacent triplets anywhere on the template), as exemplified here for the β+ segment (Figure 7b), as well as ‘primer free’ RNA replication as shown for the ‘+’ and ‘−’ strands of the γ segment, which can be replicated using triplets alone (Figure 7c).

Figure 7. Triplet-initiated template sequence copying.

Figure 7.

(A) Extension by t5+1 of fluorescein-labelled primers bound to either the 3’ (A10) or 5’ (pppba) ends of a template (T8GAA, 10 μM pppGAA, 0.5 μM/RNA, −7˚C ice 69 hr), demonstrating extension in either 5’−3’ (A10) or 3’−5’ (pppba) directions. (B) Synthesis of β+ on Tβ template via t5+1-catalysed polymerisation of the substrates indicated on the right (2 μM t5+1, 5 μM each triplet, 0.5 μM template (lane 2: 0.5 μM hexanucleotide), −7˚C ice 9 days). Extension products in lanes 2–5 were eluted from template, PAGE-separated and SYBR-Gold stained alongside in vitro transcribed full-length segment control (lane 1). Lane 3 shows full-length synthesis of β+ segment from triplets alone. (C) Triplet-based replication of the γ segment. Top left, synthesis of γ+ from the indicated substrates (1 μM t5+1, 5 μM each triplet, 2 μM TγHP template (lane 2: 2 μM Bioγ7 primer), −7˚C ice 7 days). Biotinylated extension products in lanes 2–4 were isolated from template, PAGE-separated and SYBR-Gold stained alongside in vitro transcribed staining marker (Mγ+m1, lane 1). This indicated 10% yield (per template) of full-length synthesis of γ+ segment from triplets alone (the final band in lane 4), which was purified for use as a template in γ - segment synthesis (bottom right, 1 μM t5+1, 5 μM each triplet, 0.05 μM template, with 0.05% Tween-20, –7˚C ice 27 days). Extension products in lanes 5 and 6 were eluted from template, PAGE-separated and SYBR-Gold stained alongside in vitro transcribed full-length segment control (Mγ -m1, lane 7). This indicated 6% yield (per template) of full-length synthesis of γ - segment from triplets alone (the final band in lane 6).

Thus, the capacity of triplet substrates to pre-organise themselves on a template not only enables replication of structured templates but also allows complete copying of some RNA sequences exclusively from triplet building blocks, suggesting an alternative to the canonical end-primed replication strategies inspired by PCR. Such a ribozyme operating in a more distributive polymerisation mode might be able to replicate RNA sequences directly from the putative pools of short random RNA oligonucleotides furnished by prebiotic chemistry.

Fidelity of triplet-based RNA synthesis

Next, we investigated the consequences of using analogues of such prebiotic pools as a source of substrates for the t5+1 triplet polymerase ribozyme. Random sequence triplet pools (‘pppNNN’, comprising equimolar amounts of all 64 triplets) could be used as substrates by t5+1 in segment syntheses in place of defined triplet sets (Figure 6—figure supplement 4). Furthermore, extension activity remained robust upon pool supplementation with noncanonical dinucleotide and mononucleotide substrates (Figure 6—figure supplement 5).

However, a replicase must incorporate the correct template-complementary substrate from random sequence pools, or genetic information may become irretrievably corrupted during replication (Eigen, 1971). Sequence fidelity is therefore a critical parameter of RNA replication. The fidelity challenge is exacerbated in triplet-based RNA replication by the need to discriminate between 64 distinct substrates; indeed, a previous investigation into the incorporation of individual trinucleotides indicated that misincorporations could outstrip cognate incorporation for some triplets (Doudna et al., 1993).

In order to assess the fidelity of triplet polymerase ribozymes of widely differing activity, we identified the triplets incorporated from random pppNNN triplet pools using 12 different compositionally representative N′N′N′ triplet sequences as templates. These were examined in a consistent sequence context (5’-GGG-N′N′N′-GGG-3’) and collated, which allowed an estimation of ribozyme misincorporation tendencies. On average, the starting Zcore ribozyme exhibited ~91% fidelity per position (Figure 8a), lower than that described for RPRs (92% – 97% [Attwater et al., 2013b; Horning and Joyce, 2016]). Furthermore, its accuracy exhibited a pronounced downward gradient from the first (5’) to the third (3’) triplet position, highlighting escalating risks to fidelity of synthesis founded on longer building blocks.

Figure 8. Fidelity of ribozyme-catalysed triplet polymerisation.

(A) To estimate its fidelity, each ribozyme was provided with an equimolar mix of all 64 triplet substrates (pppNNN at 5 μM each) for primer extension using templates containing twelve representative trinucleotide sequences (N′N′N′). Deep sequencing of extension products identified the triplets added opposite each template trinucleotide, yielding position-specific error tendencies; the overall fidelity was calculated as a geometric mean of positional errors at each triplet position (n and s.d. of this value shown for ribozymes assayed multiple times, see Figure 8—source data 1 for analysis of collated errors). The triplet polymerases exhibit diverse fidelity profiles; fidelity profiles of other type 5 variants are shown in Figure 8—figure supplement 1. (B) Collation of error rates by base type and position for type 5 with (t5+1) and without (αβγδ+1) the ε fidelity domain. Positional and overall fidelities are calculated as geometric means (see Figure 8—source data 1); individual positional fidelities are plotted in Figure 8—figure supplement 2. (C) Schematic summary of effects of triplet minor groove modification upon the fidelity phenotype. In the depicted trinucleotide RNA duplex segment, spheres represent minor groove groups potentially available for hydrogen bonding in a sequence-general manner. For three of these groups (highlighted in black), we assayed whether their modification in substrates (2’F = 2’ fluoro, 2S = 2-thio) affected the fidelity domain’s mismatch discrimination capabilities (detailed in Figure 8—figure supplement 3, with data and calculations in Figure 8—source data 2). These groups are labelled with the fraction of fidelity phenotype retained (ϕ) when discriminating between the indicated modified substrates. Colour reflects the impact of that group’s modification upon the fidelity phenotype, with red denoting a strong disruptive effect, and yellow weak or negligible effects.

Figure 8—source data 1. Analysis of collated errors by ribozymes in the fidelity assay.
elife-35255-fig8-data1.xlsx (158.8KB, xlsx)
DOI: 10.7554/eLife.35255.033
Figure 8—source data 2. Calculation of residual fidelity phenotypes in Figure 8c.
DOI: 10.7554/eLife.35255.034

Figure 8.

Figure 8—figure supplement 1. Fidelity of type 5 variants.

Figure 8—figure supplement 1.

Positional error rates and fidelities (determined as in Figure 8a) for type 5 ribozyme variants; the overall fidelity was calculated as a geometric mean of positional errors at each triplet position (n and s.d. of this value shown for ribozymes assayed multiple times, see Figure 8—source data 1). Fidelity of the initial type 5 isolate (tethered to the assay templates) is modestly improved in the absence of type 1. However, with type 1, fidelity is improved when operating fully in trans (type5s +1). Reselection and stabilisation of the ε domain yields a further fidelity improvement (t5+1) but ε truncation (αβγδ+1) reverts the pattern of error tendencies along the triplet towards that of the starting core. See Supplementary file 1 for ribozyme sequences.
Figure 8—figure supplement 2. Fidelity domain influence upon template- and position-specific error rates.

Figure 8—figure supplement 2.

At each position in each of the twelve triplet templates (top) tested in the fidelity assay (Figure 8a), three different errors are possible after analysis of sequence composition of added triplets (see Figure 8—source data 1). Errors are represented by discs, coloured by mutation (e.g. A to G: green; A to C: blue; G to C: blue). Disc size is proportional to error frequency without the fidelity domain (its occurrence using αβγδ+1). The effect of the ε fidelity domain upon these error tendencies is plotted on the y-axis (by comparison of error rates between t5+1 and αβγδ+1). Many of the most significant errors (using αβγδ+1) exhibit the greatest proportional reductions with the fidelity domain, including substantial (>10 fold) reductions in several misincorporations at the third position. Improvements in fidelity are variable but focused on the second and third triplet positions, while the first triplet position shows an increase specifically in A to G mutations, indicative of a tolerance for G-U wobble pairing.
Figure 8—figure supplement 3. Determination of residual fidelity phenotype when using minor groove-modified substrates.

Figure 8—figure supplement 3.

Top: framework for estimation of residual fidelity phenotype (ϕ). Shown are model logistic curves for incorporation of a triplet, for example GAU (at a fixed concentration, opposite 3’-AUC-5’), versus increasing concentrations of a mismatching triplet, for example GGU. A ribozyme ‘+’ with a fidelity function (e.g. t5a, blue curve) will incorporate equal amounts of the matched and mismatched triplets at a different concentration of the mismatched triplet compared to a ribozyme ‘−’ without a fidelity function (e.g. αβγδ, red curve). x+ and x represent the lns of these concentrations for + and −; their separation (x+ - x) is a proxy for the strength of the ‘+’ fidelity phenotype. We measured the relative incorporation (W) of triplet pairs by the two triplet polymerases at test concentrations (t) of mismatched triplet, chosen to maximise the difference in the ribozymes’ resulting fractional incorporations (marked by a blue dot vs. a red dot on respective curves). If a triplet modification (*, right) interferes with mismatch discrimination by ‘+’, it would shift the ‘+’ relative incorporation curve (blue) towards that of ‘−’ (red), reducing the difference in fractional incorporation. Assuming curve steepness (k) remains constant, the residual phenotype (ϕ) for that modification is described by the new separation (x+* - x−*) as a proportion of the original separation (x+ - x); numerical values and calculations from the measurements described below are supplied in Figure 8—source data 2. Middle: measurements of ratios of incorporation vs. misincorporation (W) for unmodified and modified triplet pairs (at the indicated test concentrations) by the t5a +1 triplet polymerase compared to the ε fidelity domain-truncated αβγδ+1 (at 0.5 μM each). The expected triplet additions are in grey along the left of each gel; the average W from n independent experiments, calculated via densitometry of products containing slower-migrating G misincorporations, is shown below each lane along with the average ϕ. Primers (P), templates (T) and additional triplets were at 0.5/0.5/5 μM, incubated for three days in −7˚C ice. Left: presence of a 2-thiouracil (2SU) at the third position of the triplet abolishes the fidelity domain’s ability to discriminate against a second position wobble pair (P: A10, T: CCCMisGAU, +5 μM pppCCC). Centre-left: the same third position modification (2SU) also abolished the fidelity domain’s preference for G misincorporation at the first position (P: Fγ7, T: TγAGU, +5 μM HOCUG). Centre-right: in contrast, a 2SU at the first position of the triplet exerted no clear influence upon third position wobble discrimination (P: A10, T: CCCMisUGA, +5 μM pppCCC). Right: replacement of a 2’ hydroxyl group with a 2’ fluoro (2’F) at the first triplet position likewise had no effect upon third position mismatch discrimination (P: Fγ7, T: TγAGU, +5 μM HOCUG). Below: Measurement of fidelity phenotype in the presence or absence of a downstream triplet. Residual phenotypes indicate attenuation or abolishment of third position (left panel, P Fγ7, T TγAGU), second position (middle panel, P Fγ7, T TγGAU), and third position (right panel, P Fγ7, T TγAGU) fidelity effects from downstream triplet absence. No effects are seen (upon third position discrimination) from the presence or absence of a 5’ triphosphate on the downstream triplet (ϕppp). In the absence of a triplet bound downstream on the template, the fidelity phenotype is severely compromised, suggesting this adjacent triplet:template duplex plays a critical role in positioning the fidelity domain relative to the incoming triplet.

To investigate if ribozymes could exhibit higher triplet incorporation fidelity, we had included a persistent adaptive pressure for fidelity during in vitro evolution, spiking in an excess of mispairing 3’-deoxy ‘terminator’ triplets from round nine onwards, precluding recovery of ribozymes that incorporated these mispairs (Figure 2—source data 1, Figure 4—source data 1, Figure 2—figure supplement 2). This yielded reshaped and improved fidelity profiles in the ‘surviving’ type 2–6 ribozymes (Figure 8a). Notably, the final t5+1 ribozyme achieves an average positional fidelity of 97.4% using pppNNN in this sequence context, higher than the best RPR fidelity with NTPs under comparable eutectic conditions (Attwater et al., 2013b). Deep sequencing of internal triplet positions of a defined sequence (β+ segment) synthesised by t5+1 using pppNNN indicated similar aggregate fidelity could be achieved during longer product synthesis excluding the final triplet (Table 1).

Table 1. Sequencing of ribozyme-synthesised β+ segment.

Shown are the individual base fidelities (%) along the β+ sequences (top) synthesised by t5+1, using the six specific triplets (tri), or random (pppNNN) or compositionally-biased random (low-G pppNNN, see Figure 6—figure supplement 4) substrate pools, from Fβ6 primer (the first six positions at the left) with template Tβ (1 μM each RNA, 13 days −7˚C ice). For their sequencing, extension products were eluted from templates, and full-length products were gel-purified, ligated to adaptor, reverse-transcribed and PCR amplified. For compositional analysis, a small percentage of unrelated amplified products were excluded (those with >9 mutations vs. the expected β+ sequence; similar levels were excluded if a > 6 mutation threshold was applied, 0.2%/0.2–3.7%/4.2% and 3.7%/3.8% for tri & pppNNN and low-G pppNNN). These sequences mostly appeared to derive from off-target priming and extension of Fβ6 on the ribozyme in the presence of pppNNN. The sequencing of products generated from specific triplets provides an estimate of background error arising from amplification and sequencing. The final triplet constitutes an error hot-spot - likely to mutate to a more mutationally stable triplet during self-replication - exacerbated in pppNNN samples by the inability of the fidelity domain to operate in the absence of a downstream triplet (Figure 8—figure supplement 3). The geometric average of internal triplet position fidelities is used to gauge overall t5+1 fidelity during RNA synthesis. While overall fidelity drops from defined to random triplets (98.8 to 96.7%), much of this loss in fidelity can be recovered by adjusting the triplet composition to a low-G random pool, where reductions in G-U wobble pairing more than compensate for increases in rarer misincorporations opposite template C.

G C C A U C A A A G C U U G A G A G C A U C U U Internal triplets’
average:
10 μM each tri: 93.3 99.4 99.5 97.8 99.8 99.4 99.6 97.9 99.1 99.3 99.5 99.1 99.8 98.9 99.5 98.2 96.0 96.3 98.79
10 μM each PPPNNN: 92.8 97.0 98.8 99.2 99.5 99.3 99.1 97.8 98.6 99.4 99.1 98.7 94.3 81.4 96.6 97.4 59.8 42.3 96.65
10 μM average, low-G PPPNNN: 97.3 98.2 99.5 97.3 99.7 99.4 99.6 97.8 99.0 99.3 98.9 98.8 97.6 97.3 98.8 97.4 73.2 43.2 98.56

Molecular basis of triplet polymerase ribozyme fidelity

Having established that accurate triplet-based copying is possible (in at least some sequence contexts), we sought to understand how the triplet polymerase ribozyme achieves it. Investigating the fidelity contributions of different t5+1 ribozyme components, we found that the type 1 RNA cofactor did not contribute; rather, fidelity gains appeared to be mediated by the newly-evolved t5 ‘ε’ 3’-domain, as its deletion (yielding the truncated ‘αβγδ’ ribozyme) reverted the fidelity profile towards that of Zcore (Figure 8—figure supplement 1). Presence of the ε domain did not uniformly increase fidelity, but selectively reduced the most acute errors at the second and third triplet positions (with over 10-fold reductions for some errors, Figure 8b, Figure 8—figure supplement 2). Overall error rates at the second and third triplet positions were reduced by 4-fold and 9-fold compared to Zcore (Figure 8a), though increased (1.3-fold) at the first triplet position due to a localised asymmetric tolerance of G:U wobble pairing (Figure 8b). The ε domain fidelity function is contingent upon the presence of a downstream triplet, operating only with basal fidelity for final triplet incorporation (Figure 8—figure supplement 3).

Dissecting the molecular determinants of the fidelity phenotype, we found that using triplet substrates modified at the third position with a 2-thiouracil in place of a uracil (disrupting minor groove hydrogen bonding capabilities) rendered the ε fidelity domain unable to discriminate mismatches (Figure 8c, Figure 8—figure supplement 3). Previously, a similar replacement of a uracil 2-keto group with a 2-thio modification had been shown to impair Z RPR activity when present upstream in the primer/template region (Attwater et al., 2013a), where Z is thought to rely upon sequence-general minor groove contacts through an ‘A-minor’ motif (Shechner et al., 2009). Modification at the third triplet position reverts ε’s divergent effects on fidelity at the adjacent second and the distal first triplet positions (Figure 8—figure supplement 3); disruption of this minor groove contact site thus abolishes overall ε fidelity domain operation. ε sensitivity to minor groove composition may be critical to its recognition of cognate Watson-Crick base pairs, reminiscent of Tetrahymena group I intron folding (Battle and Doudna, 2002) and the decoding centre of the ribosome (which also tolerates wobble pairing at the analogous (5’) triplet position) (Ogle et al., 2001).

Systems-level properties of triplet pools

An important contribution to triplet fidelity also appears to arise from unexpected behaviours of the triplet substrates themselves. We observed that in some direct pair-mispair triplet contests, inclusion of their complementary triplets caused a striking (~3 fold) drop in misincorporation errors (Figure 9). A potential explanation may arise from differential formation of triplet:anti-triplet dimers in the reaction: for example, more extensive pppGCC:pppGGC (than pppACC:pppGGU) dimer formation would selectively reduce the effective concentration of free pppGCC vs. pppACC upon inclusion of their complementary pppGGC and pppGGU.

Figure 9. Substrate pool interactions improve triplet fidelity.

Applying the fidelity assay (Figure 8a, using t5+1) to single templates with only an encoded triplet and a mispairing one as substrates (at 5 μM each), we observed that relative mispair incorporation was proportionally reduced (by 61% (left) and 73% (right)) upon introduction of complementary triplets. Using all 64 triplets (pppNNN) has an analogous effect upon these pair/mispair comparisons with fidelity progressively improved upon increasing overall pppNNN concentrations, with examples of effects on other triplets and overall fidelity presented in Figure 9—figure supplement 1, and comprehensive error rates and ratios in Figure 9—source data 1.

Figure 9—source data 1. Collated error rates and ratios at different substrate concentrations.
DOI: 10.7554/eLife.35255.038

Figure 9.

Figure 9—figure supplement 1. Influence of random triplet pool concentrations upon triplet misincorporation tendencies.

Figure 9—figure supplement 1.

Left: 0.2 µM t5+1 was tested in the fidelity assay at a series of pppNNN concentrations. The observed overall error rate (black circles) in these reactions halved as pppNNN concentrations were increased to 7 µM each (after which extension activity began to reduce and error rates increase). Diverse patterns were observed in the concentration dependence of individual triplet positional errors, of which a selection are displayed here (×) coloured according to whether they raise (red-yellow) or lower (blue) the GC content of the expected triplet, and coloured more intensely for higher GC content triplets. Right: For a broader set of errors with 0.5 µM t5+1 (those with >0.3% incidence at 5 µM pppNNN, x-axes) the changes in their incidences from raising concentrations of each pppNNN from 2 to 10 µM substrate are plotted on the y-axes. The frequencies of several mutations that increase triplet GC content in GC-rich triplets are reduced at 10 µM pppNNN concentrations (top panel). Conversely, the frequencies of some mutations that reduce GC content are increased at 10 µM pppNNN concentrations (bottom panel); these suggest relative depletion of available GC rich triplets as overall triplet concentrations rise. Such pool behaviour can explain the reduction of overall error rates in ~5 µM pppNNN triplet pools, as misincorporations that increase GC content (e.g. G opposite U) are responsible for more errors than those that decrease it (bottom left). Values and ratios for a wider range of errors are supplied in Figure 9—source data 1.

These pairwise reductions were recapitulated in the presence of random pppNNN substrate pools (Figure 9). Indeed, counterintuitively, raising pppNNN concentrations from 0.5 to 5 μM each almost halved the overall error rate (Figure 9—figure supplement 1). Although diverse effects upon individual misincorporations were observed, this fidelity enhancement was driven by pronounced reductions in errors where the mismatched triplet has a high GC content compared to the cognate triplet, including common G-U wobble mispairs (Figure 9—figure supplement 1). Dimer formation among pppNNN substrate pools would be expected to selectively buffer the free concentrations of the more strongly-pairing GC-rich triplets, which could promote both fidelity and sequence generality through normalization of triplet availability against template (and complementary triplet) binding strength.

Indeed, more efficient, higher fidelity segment synthesis was observed when partially mimicking this outcome using an pppNNN pool formulated with a reduced G content (Figure 6—figure supplement 4, Table 1). In a prebiotic scenario, substrate pool composition would have been determined by the abundance and nontemplated polymerization tendencies of the different nucleotides; large biases in these could skew triplet compositions or deplete a triplet (resulting in mismatch incorporation). However, the potential for replication to proceed in different triplet registers may provide a degree of resilience towards such biases.

Discussion

Here, we describe the discovery and characterization of a ribozyme (t5+1) with a robust ability to polymerize RNA trinucleotide triphosphate (triplet) substrates. Unusually, this triplet polymerase ribozyme comprises a heterodimer of a catalytic triplet polymerase subunit (t5) and a non-catalytic RNA cofactor (type 1), which enhances triplet polymerase activity and abrogates the need for template tethering. Such a quaternary structure - involving a heterodimer of a full-length and a truncated subunit - is reminiscent of the processivity factors of some proteinaceous polymerases such as the heterodimeric p66/p51 HIV reverse transcriptase holoenzyme (Huang et al., 1992). There are multiple examples of dimerization in RNA evolution - such as the VS ribozyme (Suslov et al., 2015), retroviral RNA genome dimerization (Paillart et al., 2004), in vitro evolved heterodimeric RNA liposome binders (Vlassov et al., 2001), and recently the homodimeric CORN fluorescent RNA aptamer (Warner et al., 2017). However, the spontaneous emergence of a general, mutualistic RNA cofactor has not previously been observed for ribozymes and may suggest an underappreciated dimension to the evolutionary dynamics of ribozyme pools under stringent adaptive pressures. Indeed, the extinction of previously dominant species in the selection that were unable to benefit from type 1 enhancement (e.g. type 0, see Figure 3—figure supplement 2) and succession with cooperative RNA species (Vaidya et al., 2012) illustrates the potential for such symbioses to shape RNA molecular ecologies.

The t5+1 ribozyme’s principal current shortcoming is its low catalytic efficiency. In the optimal context for mononucleotide polymerase ribozymes, this triplet polymerase heterodimer yields ~4 fold more unligated junctions than the RPR tC19Z (Attwater et al., 2013b), which itself is 240-fold slower than the currently most advanced RPR 24–3 (Horning and Joyce, 2016). Yet despite this modest catalytic power, t5+1 displays much enhanced generality in RNA synthesis and now achieves both copying of previously intractable structured RNA templates, and templated synthesis and assembly of an active ‘+’ strand copy of its catalytic domain, suggesting key contributions of the triplet substrates themselves.

Indeed, one of the main findings of our work are the compelling advantages that triplet substrates appear to offer for sequence general RNA replication. For instance, when binding templates, triplets incur a lower entropic cost per position compared to canonical mononucleotides (thus aiding copying of sequences rich in weakly pairing A and U bases), with particularly helpful stability contributions from intra-triplet base stacking (Eigen, 1971). Furthermore, energetically favourable inter-triplet stacking interactions appear to instigate cooperative binding and unfolding of even highly stable RNA template structures (Figures 1b and 5a) upon reaching the required substrate concentration threshold. In our work, this process is aided by the cold temperature and solute concentration effects of eutectic ice phase formation (Attwater et al., 2010; Mutschler et al., 2015). Counterintuitively, a general solution to the copying of structured RNAs arises not from conditions that disfavour base-pairing (which would also hinder substrate binding), but rather from conditions that promote it.

Together these favourable molecular traits serve to pre-organize the template towards a double-stranded RNA duplex with triplet junctions poised for ligation. A triplet/template duplex presents a more ordered, regular target for sequence-general ribozyme docking (by e.g. the ε domain) than a single stranded template (variably prone to secondary structure formation or sequence-specific interactions with the ribozyme [Wochner et al., 2011]). Such general duplex interactions also underlie other notable features observed in our triplet-based RNA synthesis such as in trans template binding (Figure 3c) as well as the capacity for bidirectional (5’−3’/3’−5’) and primer-free RNA synthesis (Figure 7).

Contrary to expectations RNA-catalyzed triplet polymerisation can proceed with a fidelity matching or exceeding even the best mononucleotide RNA polymerase ribozymes (Attwater et al., 2013b; Horning and Joyce, 2016). t5+1 ribozyme fidelity is due to both a readout of cognate minor groove interactions by the ribozyme ε domain (Figure 8) and an unanticipated fidelity boost arising from systems-level properties of triplet pools, that appear to normalize the availability of free triplet (and potentially longer oligonucleotide) substrates against their base-pairing strength (Figure 9). Though further work will be required to characterize triplet pool properties, they likely involve formation of cognate or near-cognate triplet:anti-triplet interaction networks, as formation of tRNA dimers via cognate anticodon:anticodon interactions has been observed in a similar concentration range (Eisinger and Gross, 1975).

While phylogenetically unrelated, mechanistic analogies between the triplet polymerase ribozyme and the ribosome are apparent. Both are RNA heterodimers that operate in a triplet register along a single-stranded RNA template, whilst enforcing a minor-groove mediated pattern of triplet or anticodon readout (including tolerance of 5’ wobble pairing), suggestive of convergent adaptive solutions to the challenges of replication and decoding. It has long been speculated that the decoding centre of the small ribosomal subunit might have had its origins in an ancestral RNA replicase, but the implied triplet-based character of such a replicase was conspicuously discordant with modern mononucleotide-based replication (Weiss and Cherry, 1993; Poole et al., 1998; Noller, 2012). The utility of triplets as substrates for RNA synthesis and self-synthesis described herein suggests that these early ideas deserve to be reconsidered. In the context of initial uncorrelated evolution of the small and large ribosomal subunits (Petrov et al., 2015), it is tempting to speculate that an early reliance upon triplets in RNA replication could have inadvertently supplied a decoding center for translation.

In conclusion, the unexpected emergent properties of triplets – including cooperative binding and unfolding of structured RNA templates, enhanced incorporation of AU-rich substrates, and error attenuation (resulting from triplet pool interaction networks) – argue that short RNA oligonucleotides may represent predisposed substrates for RNA-catalyzed RNA replication. Some of these benefits might also extend to codon/anticodon dynamics in early translation, and to the non-enzymatic replication of RNA (Szostak, 2012), where downstream trinucleotides have recently been shown to enhance incorporation of preceding activated mononucleotides both through stacking and positioning effects (Vogel et al., 2005; Zhang et al., 2018) and the formation of a highly reactive intermediate (Prywes et al., 2016; O'Flaherty et al., 2018). Taken together, the interaction of triplet substrate pools with RNA templates promotes uncoupling of an RNA’s sequence (i.e. information content, and associated folding tendencies) from its replicability, thereby enhancing RNA’s capacity to serve as an informational polymer.

Materials and methods

Templated RNA-catalysed RNA synthesis

Standard ribozyme activity assays (modified where specified) comprise 5 pmol of each ribozyme annealed in 2.5 μl water (80˚C 2 min, 17˚C 10 min), with 2 μl of 1 M MgCl2 and 0.5 μl of 1 M tris•HCl pH 8.3 (at 25˚C, pH raised to 9.2 at −7˚C) then added on ice, and left for >5 min to ensure folding. This was added to 5 pmol each of primer and template and 50 pmol of each triplet pre-annealed in 5 μl water, then frozen on dry ice (10 min) and incubated at −7˚C in a R4 series TC120 refrigerated cooling bath (Grant (Shepreth, UK)) to allow eutectic phase formation and reaction.

Final pre-freezing concentrations of components are displayed throughout (in this example, yielding 0.5 μM ribozyme/primer/template, 5 μM each triplet, 200 mM MgCl2, 50 mM tris•HCl pH 8.3). Supercooled reactions (Figure 1c) remained liquid by omitting the dry-ice freezing step, maintaining these concentrations. Ice crystal formation upon eutectic phase equilibration, however, concentrates all solutes ~4–5 fold (Attwater et al., 2010) to their final operational levels and cooling elevates tris-buffered pH to ~9.2.

Some substrate mixes (e.g. pppNNN) led to a higher final reaction volume, but eutectic phase equilibration restored standard operational concentrations, also applicable to the four-fold-diluted extensions with the fragmented ribozyme (Figure 6c). These used 2 pmol each ribozyme/fragment annealed in 3.25 μl 62 mM MgCl2, 15 mM tris•HCl pH 8.3 (37˚C 5 min, ramped to 4˚C at 0.1˚C/s, 4˚C 10 min), with pre-annealed primer/template/substrates (0.5/0.5/5 pmol) added in 0.75 μl water. These reactions, and preparative syntheses (Figure 6—figure supplement 1, Figure 7c), were supercooled at −7˚C followed by ice crystal addition for quick freezing and optimal activity.

Figure 1c extensions were set up by adding buffer, then RNAs (preannealed together, 0.1 μM final concentrations) to triplets. RNAs for ε+ syntheses were chilled on ice instead of annealing, with ribozyme/MgCl2/tris•HCl pH 8.3 mixed with the other RNAs at −7˚C. Oligonucleotide substrates were added equimolar to template binding sites in the primer/template/substrate anneal. NTPs, on the other hand, were added with the MgCl2/tris•HCl pH 8.3 to the ribozyme polymerase.

Extension product separation

At the end of standard incubations, reactions were thawed and 2 μl aliquots added to stop buffer (1 μl 0.44 M EDTA (pH 7.4), with urea to a 6 M final concentration and a 10–20 fold molar excess over template of complementary competing oligonucleotide (see Supplementary file 3) to prevent long product/template reannealing). Samples were denatured (94˚C 5 min) and RNAs separated by 8 M urea 1 × TBE denaturing PAGE.

To avoid using potentially confounding competing oligonucleotide when purifying extension products, reactions with a biotinylated primer or template (stopped as above) could be purified by bead capture using MyOne C1 (Invitrogen) streptavidin-coated paramagnetic microbeads (using 5 μg pre-washed beads per pmol biotinylated RNA) in 0.5 × − 0.8 × bead buffer (BB: 200 mM NaCl, 10 mM tris•HCl pH 7.4 (at 25˚C), 1 mM EDTA, 0.1% Tween-20). After washing twice in BB to remove unbound components, beads were incubated (1 min) in 25 mM NaOH, 1 mM EDTA, 0.05% Tween-20 to denature the duplexes (Horning and Joyce, 2016). To recover biotinylated extension products (e.g. Figure 5b left panels, Figure 6—figure supplement 1 αβ+/γδε+, Figure 7c left panel) the supernatant was discarded, and beads were washed first in BB with 200 mM tris•HCl pH 7.4, then in BB, then heated (94˚C 4 min) in 95% formamide, 10 mM EDTA to release primers for urea-PAGE. To recover extension products bound to biotinylated templates (e.g. Figure 6—figure supplement 1 β+, δ+, ε+, Figure 7c right panel) the supernatant was removed, neutralized with 500 mM tris•HCl pH 7.4, spin-concentrated using Ultracel 3K filters (MerckMillipore, UK), recovered and denatured in 6M urea/10 mM EDTA before urea-PAGE. β+ synthesised in Figure 6—figure supplement 1 was not spin-concentrated, leading to a lower recovery yield; δε+ synthesis was denatured directly from the ligation reaction in 60% formamide with excess EDTA.

For gel mobility shift assays (Figure 3b), ribozymes were mixed at 0.5 μM, pre-annealed and buffer added on ice as for extension reactions, then mixed with 5 × loading buffer (50% glycerol, 250 mM tris•HCl pH 8.3, 125 mM MgCl2) for separation by native PAGE (0.5 × TB, 8% 59:1 acrylamide:bisacrylamide, 25 mM MgCl2, run in a Hoefer SE600 Chroma (ThermoFisher, Waltham, USA) (upper chamber: 0.5 × TB 50 mM NaOAc, lower chamber: 0.5 × TB 25 mM Mg(OAc)2) kept at 4˚C in a circulator bath for 6–8 hr at 10 W), then SYBR Gold stained as below.

Extension product detection, quantification and purification

Fluorescent primer extension products were detected using the appropriate laser wavelength on a Typhoon Trio scanner (GE Healthcare (GE) (Chicago, USA)); gel densitometry allowed quantification of RNA synthesis efficiency. Gel contrasts in figures were linearly adjusted to optimize display of bands of differing intensities.

The gel in Figure 5b (middle panel) was washed thrice (5 min) in water, incubated with 10 μM DFHBI-1T ligand in buffer for 20 min to fold full-length broccoli aptamer (as in [Filonov et al., 2015]) and scanned. The ligand was then eluted in three 1 × TBE washes (leaving negligible background fluoresence), and stained in 1 × TBE with SYBR Gold (1:10000), washed again, and re-scanned to detect all RNA products (left panel); scans were aligned via an adjacent Cy5-labelled primer extension lane (not shown).

Full-length product yields in the Figure 6—figure supplement 1 plus-strand syntheses were calculated by running samples of bead-eluted products (or raw reaction for δε+) alongside known amounts of the positive controls indicated, followed by SYBR-Gold staining. To purify, bead-eluted products were run similarly, and excised using UV shadowing. Products were then eluted from the gel fragments in 10 mM tris•HCl pH 7.4, and Spin-X column filtrate (Costar (Sigma-Aldrich, UK)) precipitated in 75% ethanol with 1 μl 1% glycogen carrier (omitted for β+). Recovered full-length product yields were calculated similarly to reaction yields for αβ+/δε+/γδε+, or using A260s for β+, δ+, ε+.

Fragment sequencing

Purified ribozyme- and TGK-synthesized αβ+/γδε+ fragments were sequenced by first ligating a 3’ adaptor (10 U/μl T4 RNA Ligase 2 truncated KQ in 1 × RNA ligase buffer (New England Biolabs (NEB), (Ipswich, USA)) with 15% PEG-8000 and 2 μM AdeHDVLig at 10˚C overnight). These reactions were bound to MyOne C1 microbeads (ThermoFisher (Invitrogen)), washed with BB to remove unligated adaptor, and reverse transcribed (50˚C 30 min) with 1 μM HDVrec primer using Superscript III (Invitrogen). Beads were washed again then PCR amplified (five cycles with a 40˚C annealing step, then 20 cycles with a 50˚C annealing step) using GoTaq HotStart master mix (Promega (Madison, USA)) and 0.8 μM each of primers P3HDV, and P5Xα8 or P5Xγ7, for high-throughput sequencing (Illumina (San Diego, USA) MiSeq or HiSeq) after PCR product agarose gel purification. β+ syntheses’ cDNAs were amplified with P3HDV and P5Xβ6.

Fidelity assay

To estimate RNA synthesis fidelity, ribozymes extended primers using pppNNN on templates encoding CCC-XXX-CCC, where XXX were 12 different triplet sequences evenly exploring base composition and distribution (see Supplementary file 3; for XXX = ACC, template encodes CCC-ACC-UCC to avoid a terminal run of Gs).

Each primer/template pair (0.45/0.525 pmol per reaction) was annealed in 4 mM MgCl2, 1 mM tris•HCl pH 7.4 (80˚C 2 min, ramped to 4˚C at 0.1˚C/s, then kept on ice). The 12 pairs were combined in 0.27M MgCl2/67 mM tris•HCl pH 8.3 on ice to discourage primer-template assortment (of which sequencing later revealed negligible levels). 36 pmol of each triplet in pppNNN (equivalent to 5 μM final concentration after considering eutectic phase equilibration effects upon this more dilute reaction) were added to a reaction vessel in 10.8 μl water, to which 5.4 μl of the primer/template/buffer mix was added followed by 7.2 pmol of ribozyme pre-annealed (80˚C 2 min 17˚C 10 min, ice >5 min) in 1.8 μl water (f.c. equivalent 1 μM, in excess over the 0.875 μM template to which some ribozymes could tether to enhance extension). Reactions were frozen and incubated (7 days at −7˚C) as described above.

Reactions were stopped with 3.6 μl 0.44 M EDTA and 10.5 pmol of each template's competing oligonucleotide (migrating above product, with marker mutations to ensure exclusion), denatured with 6 M urea, and urea-PAGE separated. After alignment with a fluorescence scan of the gel, a region of the sample lane corresponding to primers extended by +4 to +14 nt was excised (encompassing 2–4 triplet additions), and extension products were eluted, precipitated in 77% ethanol with 1 μl 1% glycogen carrier, washed in 85% ethanol and resuspended in water.

These extension products were 3’ adaptor ligated as for fragment sequencing. Products were reverse transcribed (0.2 × adaptor ligation reaction, 1 μM HDVrec primer in Superscript III reaction, 50˚C 30 min) and then PCR-amplified (1/30th reverse transcription mix, 0.8 μM each of primers P3HDV and P5GGGX) for sequencing as above (yielding 2 × 105 – 4 × 106 sequences per ribozyme assay).

After processing and 3’ adaptor trimming, sequences corresponding to primer extended by CCC +1–3 additional triplets were collated for analysis. Variations in upstream primer sequences (see Supplementary file 3) allowed the partner template to be identified for each sequenced product; the triplet incorporated after the first CCC was counted. Separately, 10 μl extensions by t5+1 of each primer/template alone with its encoded triplet and pppCCC (and pppUCC for the ACC pair) were combined for purification and sequencing as above, to allow isolation of the ribozyme-mediated errors resulting from inclusion of the other 62 (61 for ACC) triplets in the reaction (versus errors from sequencing, recombination etc.). The counts of cognate triplet (C) and each error triplet (E) in the positive control (p) reduced error counts in the experimental samples (x) to yield ribozyme-mediated error counts (Er) thusly: Er = Ex - Ep*(Cx/Cp) (not reducing Ex below 0, and reallocating all reductions to Cr; pppCCC counts (and pppUCC for the ACC template) remained uncorrected).

For each template, counts were then collated at the first/second/third positions to yield base-specific mutation rates for each position (Figure 8—figure supplement 2, Figure 8—source data 1). Across the 12 triplets, A, C, G, and U were encoded at each position three times; linear averages were calculated to map the position’s error profile (Figure 8b) and geometric means of the four nucleobases yielded the position’s overall fidelity (Figure 8a, Figure 8—figure supplement 1).

Triphosphorylated triplet synthesis

Triplets (and some other short oligonucleotides) were prepared from NTPs by T7 RNA polymerase run-off transcription of a 5’ single-stranded DNA overhang downstream of a DNA duplex T7 promoter sequence. In most cases, the 5’ overhang encoded (was the reverse complement of) the desired oligonucleotide. These oligonucleotides were short enough to synthesise during the abortive initiation stage of transcription, attenuating sequence constraints on the first bases of the transcript. However, T7 RNA polymerase exhibited tendencies to skip the first (or even second) base (most severe for U > C > A > G before second position purines: encoding CGU yielded some pppGU, encoding UAC yielded just pppAC) or use oligonucleotides generated during transcription to re-initiate (e.g. encoding GAG yielded pppGAGAG, encoding AAA yielded pppA6-9, encoding UCC yielded pppCCC, encoding CGC yielded some pppGCGC; this tendency was most severe when the oligonucleotide could be accommodated opposite the final template bases of the promoter).

These tendencies could be subverted by encoding additional first bases (usually without providing the corresponding NTP). This initiated the oligonucleotide at the second position where skipping tendencies were lower (e.g. encoding CUAG without CTP yielded pppUAG, encoding UUAC yielded some pppUAC), and reduced recruitment as initiators of products with bases not complementary to the introduced first position template base (e.g. encoding CGAG without CTP yielded pppGAG, encoding CAA without CTP yielded pppAA and pppAAA, encoding AUCC without ATP yielded pppUCC, encoding UCGC without UTP yielded pppCGC).

Each 30 μl transcription reaction contained 72 nmol of each desired product base as an NTP (Roche) (e.g. for pppUCC, 72 nmol UTP, 144 nmol CTP) in 1 × MegaShortScript kit buffer with 1.5 μl MegaShortScript T7 enzyme (ThermoFisher). Also present were 15 pmol of each DNA oligonucleotide forming the transcription duplex target (see Supplementary file 2). The reactions were incubated overnight at 37˚C, stopped with 3 μl 0.44 M EDTA and 17 μl 10 M urea, and separated by electrophoresis (35 W, 4.5 hr) on a 35 × 18 × 0.15 cm 30% 19:1 acrylamide:bis-acrylamide 3 M urea tris-borate gel. Products were identified through their relative migrations (reflecting overall composition, fastest to slowest: C > U≈A > G) by UV shadowing. Triplet bands were excised and eluted overnight in 10 mM tris•HCl pH 7.4, and filtrate (Spin-X) precipitated with 0.3 M sodium acetate pH 5.5 in 85% ethanol. Pellets were washed in 85% ethanol, resuspended in water, and UV absorbances measured with a Nanodrop ND-1000 spectrophotometer (ThermoFisher). Oligocalc (Kibbe, 2007) was used to calculate sequence-specific concentrations and yields. pppNNN was generated by combination of equal amounts of each of the 64 triplet stocks in a lo-bind microcentrifuge tube (Eppendorf (Hamburg, Germany)).

3’-deoxy triphosphorylated ‘terminator’ triplets were transcribed as above but using a 3’ deoxynucleoside 5’ triphosphate (Trilink biotechnologies) for the last position, migrating faster during PAGE than the equivalent all-RNA triplet. Triplets with 2-thiouridine residues were transcribed as for their corresponding U, replacing UTP with U2STP (Jena Bioscience (Jena, Germany)); incorporation and migration were similar between the two, and their concentrations were calculated from A260nm by comparison to the A260nm of mixtures of the component ribonucleotides with UTP vs. U2STP. Triplets with 2’-fluoro, 2’-deoxy positions could also be transcribed, with lower efficiency, by substituting the corresponding triphosphate (Trilink Biotechnologies). The biotinylated pppGAU–Bio triplet used in γ segment synthesis (Figure 7c) was transcribed as for pppGAU, replacing UTP with biotin-16-aminoallyluridine-5’-triphosphate (Trilink Biotechnologies (San Diego, USA)), quantified via by comparison to the A290nm of mixtures of the component ribonucleotides.

Longer triphosphorylated oligonucleotides used in ribozyme self-synthesis were generated similarly, but using ~200 ng of fully double stranded DNA as a template. Candidate product bands were purified and the desired oligonucleotide identified by ribozyme-catalysed in-frame incorporation and, for some, fragment sequencing.

RNA oligonucleotide/ribozyme preparation

Transcriptions were performed on ~15 ng/μl dsDNA using MegaShortScript enzyme and buffer (ThermoFisher) with 7.8 mM of each NTP, or, to yield a 5’ monophosphate on the product to avoid aberrant ligation, 10 mM GMP (guanosine monophosphate) and 2 mM of each NTP (‘GMP transcription’).

dsDNA templates for some of these (in Supplementary file 3) were generated (‘fill-in’) using three cycles of mutual extension (GoTaq HotStart, Promega) between the associated DNA oligonucleotide and 5T7 (or, where indicated, HDVrt for defined 3’ terminus formation [Schürer et al., 2002]) followed by column purification (QiaQuick, Qiagen).

Some 5’ biotinylated RNAs were synthesized using the TGK polymerase (Cozens et al., 2012) (56 μg/ml, in 1 × Thermopol buffer (NEB) supplemented with 3 mM MgCl2) to extend 5’ biotinylated RNA primers (0.75 μM) on DNA templates (1 μM) using 2.5 mM of each NTP (94˚C 30 s, 45˚C 2 min, 65˚C 30 min, 45˚C 2 min, 65˚C 30 min, then all repeated). Biotinylated products were bead-purified as above.

3’ biotinylation of RNAs was achieved in two stages: 3’ azidylation (at 2 μM with 25 U/μl yeast poly-A polymerase (ThermoFisher) and 0.5 mM 2’-azido-2’-deoxycytidine triphosphate (Trilink Biotechnologies) for 1 hr at 37˚C) with subsequent acidic phenol/chloroform extraction and 75% ethanol precipitation, then copper-catalysed biotin-(PEG)4-alkyne (ThermoFisher) cycloaddition (Winz et al., 2012) with subsequent 75% ethanol precipitation followed by resuspension and buffer exchange in Ultracel 3K filters (Amicon) to remove residual biotin-alkyne.

Selection library synthesis

Round one libraries were synthesised by mutual extension of 4 nmol of oligonucleotides 1baN30 and 1GMPfo or 1GTPfo at 1 μM each in 1 × isothermal amplification buffer (NEB) with 250 μM each dNTP, annealed (80˚C 3 min, 65˚C 5 min) before addition of 0.4 U/μl Bst 2.0 (NEB) and 30 min 65˚C incubation.

After purification, 375 μg of each DNA (~1.5 × 1015 molecules) were transcribed in 5 ml transcription reactions (36 mM tris•HCl pH 7.9 (at 25˚C), 1.8 mM spermidine, 9 mM DTT, 10.8 mM MgCl2, 2 mM each NTP, 1% 10 × MegaShortScript buffer, 2% 1:9 MegaShortScript:NEB T7 RNA polymerase, 37˚C overnight). These were treated with DNase, acid phenol/chloroform extracted and 73% ethanol precipitated prior to urea-PAGE purification, elution, filtering (Spin-X) and re-precipitation, yielding the 1GTP Zcore selection construct (Supplementary file 1). 10 mM GMP was present in transcriptions of the 1GMP construct, and for future transcriptions of the GMP construct selection branch and rounds 8–18; round 19–21 and reselection libraries were transcribed without GMP. Most subsequent selection rounds were transcribed in 1/10th scale transcriptions with 15 μg of DNA (~6 × 1013 molecules) derived from amplification of recovered PCR products (see later).

For round 8, 700 pmol DNA was formed, with Tri3CUUQ amplifying round seven merged output (50 pmol), round seven merged output recombined by StEP (Zhao and Zha, 2006) (200 pmol), and 0core ribozyme with the starting 3’ N30 library domain added (450 pmol, but extinct at the end of selection). DNA encoding type 5s amplified with AACAt5s was used to generate reselection libraries by PCR amplification using primers TriGAA7GAAM and T5ba13N/T5ba20N/T5ba28N; 5 pmol of the three dsDNA products were transcribed to generate reselection constructs.

In vitro evolution cycle

An outline of the selection strategy is shown in Figure 1—figure supplement 2, with detailed lists of selection oligonucleotides and extension parameters in Figure 1—source data 1, Figure 2—source data 1, Figure 4—source data 1 and Supplementary file 3. First, selection construct was annealed with equimolar dual-5’ biotinylated primer in water (80˚C 2–4 min, 17˚C 10 min), then chilled extension buffer and triplets were added before freezing and −7˚C incubation.

At the end of incubation the reaction was thawed on ice. To link the primer 3’ hydroxyl to the 5’ monophosphate of GMP constructs, selection constructs were buffer-exchanged directly after thawing using a PD-10 column (GE) in a cold room, into 3 ml ligation mix (optimised to prevent ligation over gaps) (2 mM MgCl2, 50 mM tris•HCl pH 7.4, 0.1 mM ATP, 1 mM DTT, 2 μM HOGCG (Rounds 1–7) or 2 μM HOCUG (Rounds 8–18) with 30 U/ml T4 RNA Ligase 2 (NEB)). After incubation at 4˚C for 1 hr, these were stopped with 2.2 mM EDTA and acid phenol/chloroform treated.

Constructs were then precipitated with glycogen carrier and 0.3 M sodium acetate in isopropanol (55%) before resuspension and denaturation (94˚C 4 min, in 6M urea 10 mM EDTA with a 3 × excess of competing oligonucleotide against the primer). For the reselection rounds, constructs were then treated with polynucleotide kinase (NEB) before denaturation to resolve the HDV-derived 2’, 3’-cyclic phosphates and allow later adaptor ligation.

Constructs were urea-PAGE separated alongside FITC-labelled RNA markers equivalent to successfully ligated constructs. The marker-adjacent gel region in the construct lane was excised, excluding the bulk unreacted construct (judged by UV shadowing). Biotinylated (primer-linked) constructs were eluted overnight into BB with 100 μg MyOne C1 beads. After 30 μm filtering (Partec Celltrics(Wolflabs (York, UK))) of the supernatant to remove gel fragments, the beads were washed in BB then denaturing buffer (8 M urea, 50 mM tris•HCl pH 7.4, 1 mM EDTA, 0.1% Tween-20, 10 μM competing oligonucleotide, 60˚C 2 min) to confirm covalent linkage of construct to primer, before further BB washing and transfer (to a fresh microcentrifuge tube to minimize downstream contamination). At this stage in the reselection, 3’ adaptors were then ligated to bead-bound constructs as above for 2 hr (with buffer/enzyme added after bead resuspension in other reaction components including 0.04% Tween-20), and beads BB washed and transferred again.

Bead-bound constructs were now reverse-transcribed using 1 μM RTri (or HDVRec for the reselection) by resuspension in a Superscript III reaction with added 0.02% Tween-20 (50˚C 30 min). Beads were BB washed and the RNA-bound cDNA 3’ end blocked by incubation with terminal deoxynucleotidyl transferase (ThermoFisher) and 0.2 mM dideoxy-ATP (TriLink) with 0.02% added Tween-20 (37˚C 30 min), and beads were BB washed and transferred again.

cDNAs were eluted (10 μl 0.1 M NaOH 0.1% Tween-20 20 min), neutralized and plus strands regenerated with 0.2 μM rescue oligonucleotide in an IsoAmp II universal tHDA kit (NEB) reaction (65˚C 60 min) to read through the structured product region. Whilst at this temperature, reactions were stopped with 5 mM EDTA and one volume of BB with 50–100 μg beads to bind the nascent biotinylated plus strands at room temperature. These beads were then BB washed, NaOH washed again to discard cDNAs (and recover only correctly-primed plus strands), and washed and transferred again.

Each 50 μg of beads were then subjected to plus strand recovery PCR in a 100 μl GoTaq HotStart reaction with 0.5 μM each RTri (or HDVrec for reselection) and RecInt (rounds 1–5)/RecIntQ (rounds 6–9)/RecIntL (rounds 10–14, 19–21 and reselection)/RecIntQL (rounds 15–18). The product was agarose size-purified, A260nm quantified and added to construct synthesis PCR in 3 × molar amount of the anticipated recovered RNA (judged by test extensions) that yielded it.

This final PCR for construct transcription in the subsequent selection round used 1 μM of the indicated construct synthesis primer, plus 1 μM RTri (or HDVrt for the reselection), in GoTaq HotStart reactions or (where indicated in source data) the GeneMorph II kit for mutagenesis (Agilent (Santa Clara, USA)).

Conditions for selections are included as source data 1 for Figures 1, 2 and 4. Numerical data for Figures 1c and 3b are included in Figure 1—source data 2 and Figure 3—source data 1 respectively. Numerical data and calculations for Figure 8 and its Figure 8—figure supplement 1 and 2 are supplied as Figure 8—source data 1. Numerical data and calculations for Figure 8—figure supplement 3 are supplied as Figure 8—source data 2. A more extensive selection of substrate concentration-dependent error rates is supplied in Figure 9—source data 1. Sequences of ribozymes, triplet synthesis templates, and oligonucleotides used in this study are supplied in Supplementary files 1, 2 and 3 respectively.

Acknowledgements

We thank S Thomsen, M McKie and M Sharrock for important discussions, and BT Porebski and SL James for assistance with sequencing analysis. This work was supported by a Homerton College junior research fellowship (JA.), a Harvard-Cambridge summer fellowship (AR.) and by the Medical Research Council (JA., EG., AM., PH.; programme no. MC_U105178804).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Philipp Holliger, Email: ph1@mrc-lmb.cam.ac.uk.

Anna Marie Pyle, Yale University, United States.

Funding Information

This paper was supported by the following grant:

  • Medical Research Council MC_U105178804 to Philipp Holliger.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Investigation, Methodology, Writing—original draft.

Acquired and analysed minor groove modification effect data.

Characterised initial Zcore activity.

Analysed ribozyme heterodimer formation by gel mobility shift.

Conceptualization, Supervision, Methodology, Writing—original draft.

Additional files

Supplementary file 1. Ribozyme sequences.

All sequences are written in a 5’-to-3’ direction, and generated by GMP transcription of the corresponding PCR-generated dsDNA of the sequence downstream of 5T7 sequence duplex. All transcripts were PAGE-purified. HDV ribozyme sequences (blue) were transcribed in series with reselected type 5 ribozymes and cleave themselves off during transcription (Schürer et al., 2002) to yield precise 3’ ends with 2’, 3’-cyclic phosphates; the presence of this group did not affect type 5 activity. Sequences corresponding to the 5’ ‘cap+’ or ‘cap−’ regions from the selection, presenting a target for type 1 interaction, are coloured light green, with alternative arbitrary inert 5’ hairpin-forming sequences in dark green. Single-stranded sequences capable of hybridization with sites at the 3’ (or 5’, type 5cis) ends of certain templates to endow flexible ribozyme-template duplex tethering (Wochner et al., 2011; Attwater et al., 2010) are coloured yellow. Accessory domains 3’ of the catalytic core are in bold.

elife-35255-supp1.docx (147.1KB, docx)
DOI: 10.7554/eLife.35255.039
Supplementary file 2. Transcription of triplets.

Each target was transcribed from oligonucleotides 5T7 (Supplementary file 3) and 5’-(var)-TATAGTGAGTCGTATTAATTTCGCGGGCGAGATCGATC-3’, where the (var) overhang encodes (is the DNA reverse complement of) the sequence indicated below. Guide yields: >15 nmol = ***, 10–15 nmol = **, 5–10 nmol = *, <5 nmol = ~. †: These transcriptions yield one main product suitable for use as markers to identify comigrating triplets with a similar G/AU/C content.

elife-35255-supp2.docx (111.6KB, docx)
DOI: 10.7554/eLife.35255.040
Supplementary file 3. Oligonucleotide sequences.

All sequences are written in a 5’-to-3’ direction. DNA sequences are coloured grey. RNA sequences are coloured black, with the exception of hammerhead ribozyme (brown) and HDV ribozyme (blue) sequences transcribed in series that cleave themselves off during transcription (Schürer et al., 2002) to yield precise 5’ and 3’ ends respectively. All RNAs were denaturing PAGE-purified, and DNAs were not, unless otherwise noted (‘(non)-GP’). Competing oligonucleotides, listed beneath templates, were purified using RNeasy columns (Qiagen). Primer and oligonucleotide binding sites on templates are underlined.

elife-35255-supp3.docx (151.1KB, docx)
DOI: 10.7554/eLife.35255.041
Transparent reporting form
DOI: 10.7554/eLife.35255.042

Data availability

All data generated during this study and collated sequencing results are included in the manuscript and its supporting files. Source data files have been provided for Figures 1-4, 8 and 9.

References

  1. Attwater J, Tagami S, Kimoto M, Butler K, Kool ET, Wengel J, Herdewijn P, Hirao I, Holliger P. Chemical fidelity of an RNA polymerase ribozyme. Chemical Science. 2013a;4:2804. doi: 10.1039/c3sc50574j. [DOI] [Google Scholar]
  2. Attwater J, Wochner A, Holliger P. In-ice evolution of RNA polymerase ribozyme activity. Nature Chemistry. 2013b;5:1011–1018. doi: 10.1038/nchem.1781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Attwater J, Wochner A, Pinheiro VB, Coulson A, Holliger P. Ice as a protocellular medium for RNA replication. Nature Communications. 2010;1:1–8. doi: 10.1038/ncomms1076. [DOI] [PubMed] [Google Scholar]
  4. Battle DJ, Doudna JA. Specificity of RNA-RNA helix recognition. PNAS. 2002;99:11676–11681. doi: 10.1073/pnas.182221799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Boza G, Szilágyi A, Kun Á, Santos M, Szathmáry E. Evolution of the division of labor between genes and enzymes in the RNA world. PLoS Computational Biology. 2014;10:e1003936. doi: 10.1371/journal.pcbi.1003936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cozens C, Pinheiro VB, Vaisman A, Woodgate R, Holliger P. A short adaptive path from DNA to RNA polymerases. PNAS. 2012;109:8067–8072. doi: 10.1073/pnas.1120964109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Crick FH. The origin of the genetic code. Journal of Molecular Biology. 1968;38:367–379. doi: 10.1016/0022-2836(68)90392-6. [DOI] [PubMed] [Google Scholar]
  8. Doudna JA, Couture S, Szostak JW. A multisubunit ribozyme that is a catalyst of and template for complementary strand RNA synthesis. Science. 1991;251:1605–1608. doi: 10.1126/science.1707185. [DOI] [PubMed] [Google Scholar]
  9. Doudna JA, Usman N, Szostak JW. Ribozyme-catalyzed primer extension by trinucleotides: a model for the RNA-catalyzed replication of RNA. Biochemistry. 1993;32:2111–2115. doi: 10.1021/bi00059a032. [DOI] [PubMed] [Google Scholar]
  10. Eigen M. Selforganization of matter and the evolution of biological macromolecules. Die Naturwissenschaften. 1971;58:465–523. doi: 10.1007/BF00623322. [DOI] [PubMed] [Google Scholar]
  11. Eisinger J, Gross N. Conformers, dimers, and anticodon complexes of Transfer-Rna-Glu2 (Escherichia-Coli) Biochemistry. 1975;14:4031–4041. doi: 10.1021/bi00689a016. [DOI] [Google Scholar]
  12. Filonov GS, Kam CW, Song W, Jaffrey SR. In-gel imaging of RNA processing using broccoli reveals optimal aptamer expression strategies. Chemistry & Biology. 2015;22:649–660. doi: 10.1016/j.chembiol.2015.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Filonov GS, Moon JD, Svensen N, Jaffrey SR. Broccoli: rapid selection of an RNA mimic of green fluorescent protein by fluorescence-based selection and directed evolution. Journal of the. 2014;136:16299–16308. doi: 10.1021/ja508478x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gilbert W. Origin of life: the RNA world. Nature. 1986;319:618. doi: 10.1038/319618a0. [DOI] [Google Scholar]
  15. Green R, Szostak JW. Selection of a ribozyme that functions as a superior template in a self-copying reaction. Science. 1992;258:1910–1915. doi: 10.1126/science.1470913. [DOI] [PubMed] [Google Scholar]
  16. Hayden EJ, Lehman N. Self-assembly of a group I intron from inactive oligonucleotide fragments. Chemistry & Biology. 2006;13:909–918. doi: 10.1016/j.chembiol.2006.06.014. [DOI] [PubMed] [Google Scholar]
  17. Horning DP, Joyce GF. Amplification of RNA by an RNA polymerase ribozyme. PNAS. 2016;113:9786–9791. doi: 10.1073/pnas.1610103113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Huang SC, Smith JR, Moen LK. Contribution of the p51 subunit of HIV-1 reverse transcriptase to enzyme processivity. Biochemical and Biophysical Research Communications. 1992;184:986–992. doi: 10.1016/0006-291X(92)90688-H. [DOI] [PubMed] [Google Scholar]
  19. Johnston WK, Unrau PJ, Lawrence MS, Glasner ME, Bartel DP. RNA-catalyzed RNA polymerization: accurate and general RNA-templated primer extension. Science. 2001;292:1319–1325. doi: 10.1126/science.1060786. [DOI] [PubMed] [Google Scholar]
  20. Kibbe WA. OligoCalc: an online oligonucleotide properties calculator. Nucleic Acids Research. 2007;35:W43–W46. doi: 10.1093/nar/gkm234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Lawrence MS, Bartel DP. Processivity of ribozyme-catalyzed RNA polymerization. Biochemistry. 2003;42:8748–8755. doi: 10.1021/bi034228l. [DOI] [PubMed] [Google Scholar]
  22. Lincoln TA, Joyce GF. Self-sustained replication of an RNA enzyme. Science. 2009;323:1229–1232. doi: 10.1126/science.1167856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Markham NR, Zuker M. DINAMelt web server for nucleic acid melting prediction. Nucleic Acids Research. 2005;33:W577–W581. doi: 10.1093/nar/gki591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Martin LL, Unrau PJ, Müller UF. RNA synthesis by in vitro selected ribozymes for recreating an RNA world. Life. 2015;5:247–268. doi: 10.3390/life5010247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Monnard PA, Kanavarioti A, Deamer DW. Eutectic phase polymerization of activated ribonucleotide mixtures yields quasi-equimolar incorporation of purine and pyrimidine nucleobases. Journal of the American Chemical Society. 2003;125:13734–13740. doi: 10.1021/ja036465h. [DOI] [PubMed] [Google Scholar]
  26. Mutschler H, Wochner A, Holliger P. Freeze-thaw cycles as drivers of complex ribozyme assembly. Nature Chemistry. 2015;7:502–508. doi: 10.1038/nchem.2251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Noller HF. Evolution of protein synthesis from an RNA world. Cold Spring Harbor Perspectives in Biology. 2012;4:a003681. doi: 10.1101/cshperspect.a003681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. O'Flaherty DK, Kamat NP, Mirza FN, Li L, Prywes N, Szostak JW. Copying of Mixed-Sequence RNA Templates inside Model Protocells. Journal of the American Chemical Society. 2018;140:5171–5178. doi: 10.1021/jacs.8b00639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ogle JM, Brodersen DE, Clemons WM, Tarry MJ, Carter AP, Ramakrishnan V. Recognition of cognate transfer RNA by the 30S ribosomal subunit. Science. 2001;292:897–902. doi: 10.1126/science.1060612. [DOI] [PubMed] [Google Scholar]
  30. Orgel LE. Evolution of the genetic apparatus. Journal of Molecular Biology. 1968;38:381–393. doi: 10.1016/0022-2836(68)90393-8. [DOI] [PubMed] [Google Scholar]
  31. Paillart JC, Shehu-Xhilaga M, Marquet R, Mak J. Dimerization of retroviral RNA genomes: an inseparable pair. Nature Reviews Microbiology. 2004;2:461–472. doi: 10.1038/nrmicro903. [DOI] [PubMed] [Google Scholar]
  32. Petrov AS, Gulen B, Norris AM, Kovacs NA, Bernier CR, Lanier KA, Fox GE, Harvey SC, Wartell RM, Hud NV, Williams LD. History of the ribosome and the origin of translation. PNAS. 2015;112:15396–15401. doi: 10.1073/pnas.1509761112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Poole AM, Jeffares DC, Penny D. The path from the RNA world. Journal of Molecular Evolution. 1998;46:1–17. doi: 10.1007/PL00006275. [DOI] [PubMed] [Google Scholar]
  34. Prywes N, Blain JC, Del Frate F, Szostak JW. Nonenzymatic copying of RNA templates containing all four letters is catalyzed by activated oligonucleotides. eLife. 2016;5:e17756. doi: 10.7554/eLife.17756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Rohatgi R, Bartel DP, Szostak JW. Kinetic and mechanistic analysis of nonenzymatic, template-directed oligoribonucleotide ligation. Journal of the American Chemical Society. 1996;118:3332–3339. doi: 10.1021/ja953712b. [DOI] [PubMed] [Google Scholar]
  36. Schürer H, Lang K, Schuster J, Mörl M. A universal method to produce in vitro transcripts with homogeneous 3' ends. Nucleic Acids Research. 2002;30:e56. doi: 10.1093/nar/gnf055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Sczepanski JT, Joyce GF. A cross-chiral RNA polymerase ribozyme. Nature. 2014;515:440–442. doi: 10.1038/nature13900. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Shechner DM, Grant RA, Bagby SC, Koldobskaya Y, Piccirilli JA, Bartel DP. Crystal structure of the catalytic core of an RNA-polymerase ribozyme. Science. 2009;326:1271–1275. doi: 10.1126/science.1174676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Suslov NB, DasGupta S, Huang H, Fuller JR, Lilley DM, Rice PA, Piccirilli JA. Crystal structure of the Varkud satellite ribozyme. Nature Chemical Biology. 2015;11:840–846. doi: 10.1038/nchembio.1929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Szostak JW, Bartel DP, Luisi PL. Synthesizing life. Nature. 2001;409:387–390. doi: 10.1038/35053176. [DOI] [PubMed] [Google Scholar]
  41. Szostak JW. An optimal degree of physical and chemical heterogeneity for the origin of life? Philosophical Transactions of the Royal Society B: Biological Sciences. 2011;366:2894–2901. doi: 10.1098/rstb.2011.0140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Szostak JW. The eightfold path to non-enzymatic RNA replication. Journal of Systems Chemistry. 2012;3:2. doi: 10.1186/1759-2208-3-2. [DOI] [Google Scholar]
  43. Tagami S, Attwater J, Holliger P. Simple peptides derived from the ribosomal core potentiate RNA polymerase ribozyme function. Nature Chemistry. 2017;9:325–332. doi: 10.1038/nchem.2739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Vaidya N, Manapat ML, Chen IA, Xulvi-Brunet R, Hayden EJ, Lehman N. Spontaneous network formation among cooperative RNA replicators. Nature. 2012;491:72–77. doi: 10.1038/nature11549. [DOI] [PubMed] [Google Scholar]
  45. Vaish NK, Jadhav VR, Kossen K, Pasko C, Andrews LE, McSwiggen JA, Polisky B, Seiwert SD. Zeptomole detection of a viral nucleic acid using a target-activated ribozyme. RNA. 2003;9:1058–1072. doi: 10.1261/rna.5760703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Vlassov A, Khvorova A, Yarus M. Binding and disruption of phospholipid bilayers by supramolecular RNA complexes. PNAS. 2001;98:7706–7711. doi: 10.1073/pnas.141041098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Vogel SR, Deck C, Richert C. Accelerating chemical replication steps of RNA involving activated ribonucleotides and downstream-binding elements. Chemical Communications. 2005;39:4922–4924. doi: 10.1039/b510775j. [DOI] [PubMed] [Google Scholar]
  48. Warner KD, Sjekloća L, Song W, Filonov GS, Jaffrey SR, Ferré-D'Amaré AR. A homodimer interface without base pairs in an RNA mimic of red fluorescent protein. Nature Chemical Biology. 2017;13:1195–1201. doi: 10.1038/nchembio.2475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Weiss R. Cherry J. Speculations on the origin of ribosomal translocation. In: Gesteland R. F, editor; Atkins J. F, editor. The RNA World. 1st ed 1993. [Google Scholar]
  50. Winz ML, Samanta A, Benzinger D, Jäschke A. Site-specific terminal and internal labeling of RNA by poly(A) polymerase tailing and copper-catalyzed or copper-free strain-promoted click chemistry. Nucleic Acids Research. 2012;40:e78. doi: 10.1093/nar/gks062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Wochner A, Attwater J, Coulson A, Holliger P. Ribozyme-catalyzed transcription of an active ribozyme. Science. 2011;332:209–212. doi: 10.1126/science.1200752. [DOI] [PubMed] [Google Scholar]
  52. Zaher HS, Unrau PJ. Selection of an improved RNA polymerase ribozyme with superior extension and fidelity. RNA. 2007;13:1017–1026. doi: 10.1261/rna.548807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zhang W, Tam CP, Zhou L, Oh SS, Wang J, Szostak JW. Structural rationale for the enhanced catalysis of nonenzymatic RNA primer extension by a Downstream oligonucleotide. Journal of the American Chemical Society. 2018;140:2829–2840. doi: 10.1021/jacs.7b11750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Zhao H, Zha W. In vitro 'sexual' evolution through the PCR-based staggered extension process (StEP) Nature Protocols. 2006;1:1865–1871. doi: 10.1038/nprot.2006.309. [DOI] [PubMed] [Google Scholar]
  55. Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Research. 2003;31:3406–3415. doi: 10.1093/nar/gkg595. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Anna Marie Pyle1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Ribozyme-catalysed RNA synthesis using triplet building blocks" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and James Manley as the Senior Editor. The following individual involved in review of your submission has agreed to reveal his identity: Nebojsa Janjic (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission. The most important thing to improve is the written clarity of the manuscript. There was a lot of agreement between the reviewers' comments so we attach their full reviews below for your attention. All their concerns should be attended to.

Reviewer #1:

Attwater et al. report an advance in efforts to develop a robust self-replicating RNA by presenting an example of an RNA polymerase ribozyme (RPR) capable of transcribing RNA from structured RNA templates of substantial length with fairly good fidelity. This RPR uses a trinucleotide building block (in contrast to NTPs used by standard polymerases), that the authors speculate could have implications in the evolution of mRNA decoding by the ribosome.

The major findings from this work are as follows:

1) The authors exploit the ligase activity in the catalytic core of their previously reported RPR Z and evolve a suite of catalytic sequences capable of extending an RNA primer using RNA triplets as substrates. Through further iterations of in-vitro selection for triplet dependent RPRs, an RNA sequence emerged without any catalytic activity itself, but it enhanced triplet extension by these newly evolved RPR sequences (named t2 through t6) through a bimolecular interaction between the catalytically proficient sequence. Further rounds of selection using the cofactor t1 resulted in the most efficient triplet RPR, t5+1 which was used for downstream studies. This RPR could use structured hairpins as templates, even those with Tm >90oC that were intractable to previous RPRs that use NTPs as substrates. The authors demonstrate the efficiency of this RPR by transcribing both + and – strands of the Broccoli aptamer, a structured and fluorescent RNA.

2) The authors used t5+1 RPR to replicate fragments of itself (both + and – strands), using triplet building blocks (and in some cases hexamers). The complete active RPR could be reconstituted from the five individual components of the RPR transcribed by the same RPR. Although this replicase ribozyme is still not able to replicate itself in its entirely, it does give rise to a functioning replicase by using building blocks as small as trimers, Previous efforts had primarily used the ligase activity of the RPRs to assemble oligonucleotides representing their sequence to assemble a 'replica' of itself. Considering the challenges of self-replication, even incremental advances in the right direction are significant.

Overall this work introduces a new RPR system that brings the field closer to a self-replicating system, which is one of the cornerstones of Origin of life research. The authors explore properties of this replicase including fidelity and catalytic efficiency and explain the molecular basis for these properties. This work also introduces trinucleotides as plausible substrates on which ancient RNA-based replicases could have acted. Considering the relative ease of non-enzymatic polymerization of monomers into triplets, the presence of these triplets in a prebiotic setting seems plausible. The current work is detailed and the authors have explored multiple facets of the new system they have setup. This new RPR presents a major advance in terms of its ability to create a fully functional copy of itself (albeit in fragments). As such, this would be a great addition to the current understanding of RNA-based self-replication systems.

1) Overall, the manuscript was very difficult to follow. It is up to the authors to make their presentation as simple and direct as possible. Given the availability of supporting information, there is no reason why the authors could not add schemes where appropriate to clarify their presentation, logic, and workflow.

2) Regarding the dimerization, it is not clear how the authors implicated the 5'-hairpin to begin with, leading to the experiment that showing that a single mutation in that loop inhibits dimerization.

3) The assertion that the two RNAs form a 1:1 complex is not quantitatively substantiated from Figure 3B.

4) Considering their success in replicating structured hairpins and achieving primer free synthesis of an 18 nt strand (β+), it would be interesting (though not necessary for publication) to see how much of the Broccoli RNA could be transcribed without the addition of a primer sequence. If this is not possible, the authors could demonstrate the complete replication of a functional RNA of tractable length, like the Hammerhead ribozyme, to illustrate the ability of their trinucleotide RPR to replicate entire sequences of functional RNAs by exclusively using triplets as substrates.

5) The fidelity analysis is based on testing 12 of the possible 64 trinucleotides, so one cannot be convinced of the generality of their observations and inferences. The authors need to underscore this in the discussion of fidelity. The authors show that increased concentrations of triplets help fidelity, however in the absence of the 'correct' triplet, it is likely that RPR will incorporate 'mismatch' triplets. The authors should comment on this aspect.

Reviewer #2:

1) In this paper, the authors demonstrate interesting and novel results that provide further evidence for RNA's capacity to act as prebiotic genetic material. The key feature associated with this study involved using trinucleotide triphosphates (triplets) rather than NTPs as substrates for a novel RNA polymerase ribozyme. The use of triplets allowed structured templates to be unfolded and in some cases avoided the need for primers, circumventing an ongoing obstacle to RNA self-replication. This study thus makes some significant steps towards identifying a truly self-replicating prebiotic biopolymer. Literature precedents are well-discussed, and the results are interpreted in a detailed and convincing manner. I would nonetheless encourage the authors to try to make these important results as accessible as possible to readers who are unfamiliar with in vitro evolution.

Specific comments:

2) Subsection “in vitro evolution of triplet polymerase activity”, last paragraph. The use of "in-ice evolution" (a technique developed by the authors) is reported. A sentence could be added describing why this technique is being used / why it is necessary.

3) For certain experiments (e.g. subsection “Fidelity of triplet-based RNA synthesis”), random triplet pools were used. What kind of triplet pools were used in other sequence-copying experiments? Were they non-random? This needs to be more clearly articulated.

4) "… we found that using modified substrates with a disrupted minor groove hydrogen bond acceptor at the 3rd position…" – What is the modification? This should be stated in the body-text.

5) What is the scope of substrates for the t5+1 ribozyme? Triplets reportedly incur a lower entropic cost than NTPs (Discussion, third paragraph), but what would happen when a mixture of the two were used? Would both the NTPs and triplets be incorporated?

If these points are satisfactorily addressed, then I am in favour of acceptance.

Reviewer #3:

Attwater et al. used an in vitro evolution method to enhance the trace amount of ligase activity with short (3-nucleotide) substrates inherent in the truncated version of the Z RPR ribozyme (called "Zcore"). This was achieved by adding a random region of 30 nucleotides at the 3' end of Zcore and a template at the 5' end, followed by selection for variants able to extend a primer hybridized to the 5' template with two 5'-triphosphate triplets and ultimately ligate the extended primer to the 5' terminal triphosphate. After 7 rounds of activity enrichment, the authors identified a sequence that represented 25% of the activity-enriched pool called "type 0" that, upon truncation to its minimal active sequence ("0core"), was able to catalyze multiple cycles of templated ligation with triplet substrates, including the ability to extend thought double-stranded regions of the template by cooperative invasion through the secondary structure of the template (this was not achievable with monomer substrates, NTPs).

After additional 14 rounds selection, the composition of the pool showed considerable changes: type 0 sequence had disappeared and was replaced by six new types of sequences, types 1-7. Remarkably, the most prevalent type 1 sequence (50% of the pool) was completely inactive and types 2-6 were less active than the activity enriched pool. This mystery was solved by the observation that a truncated version of the type 1 sequence, while being inactive, can form augment the activity of type 2-6 ribozymes by forming heterodimeric complexes. In the presence of type 1 sequence, the best stand-alone ribozyme, type 5, no longer required the template to be tethered to the 5' end of the type 5 ribosome, in a remarkable example of an intermolecular complex between the dimeric type 1:type5 ribozyme, the untethered primer template, and the triplet substrates. Further optimization of the type 5 ribozyme by re-randomization of the previously fixed 3' end sequences and reselection in the presence of minimal type 1 sequence (called "1") resulted in the final heterodimeric holoenzyme called "t5+1. t5+1 was able to catalyze the extension through templates with Tm values of 93 ⁰C, and was capable of generating both the functionally active Broccoli RNA aptamer and its complementary template. Finally, t5+1 was able to replicate its own t5 sequence split into five segments, mostly from triplets, with some help from longer substrates (hexamers) needed for two of the segments, thus falling short of having the capacity for a full replication cycle, but nonetheless, exhibiting impressive catalytic efficiency.

The surprising observation that t5+1 can extend primers in both the canonical 5'-3' as well as the reverse 3'-5' direction raises the potential for primer extension from short sequences in a primer-free manner, therefore potentially solving the "primer problem" (depletion of longer but presumably sparse primers needed for replication in prebiotic conditions). Impressive overall positional fidelity of more than 97% was achieved by counter-selection for mispairing triplets with a clever inclusion of excess 3'-deoxy terminator triplets during late rounds of selection. Attwater et al. mapped the region responsible for fidelity to the 3' domain of t5. Further reduction in mis-incorporation comes from cognate triplet-triplet interactions have a self-correcting feature of countering the effect of triplets with strong base-pairing tendency.

The authors make a strong case that triplets represent relatively short and yet powerful substrates able to invade highly stable structural features in RNA templates. The evolution of a heterodimeric ribozyme capable of generating functional RNAs with triplet substrates represents a major accomplishment that advances the plausibility of RNA-catalyzed RNA replication in prebiotic environment. Despite its considerable length, as well as its remarkable breadth and scope, the manuscript reads like a riveting novel. It is a tour de force scientific accomplishment, and as such, it will be of considerable interested to a wide audience of readers of eLife. [Editors’ note: this reviewer’s minor comments have not been included.]

eLife. 2018 May 15;7:e35255. doi: 10.7554/eLife.35255.045

Author response


Reviewer #1:

[…] 1) Overall, the manuscript was very difficult to follow. It is up to the authors to make their presentation as simple and direct as possible. Given the availability of supporting information, there is no reason why the authors could not add schemes where appropriate to clarify their presentation, logic, and workflow.

We apologize for the poor presentation and have made several changes to enhance clarity. These include textual changes to reduce ambiguity and lay out our reasoning more thoroughly as well as explanatory schemes of ribozyme activity in Figure 1 and Figure 1—figure supplement 1. Furthermore, we have redesigned Figures 3 and 7, and now provide an additional dataset presenting substrate concentration-dependent error rates in a more straightforward manner in Figure 9—figure supplement 1.

The manuscript structure is complicated by the multiple stages of RNA evolution and characterization involved in the project, and to help navigate the progress of our ribozyme development, we now provide the new Figure 4—figure supplement 1 detailing all the stages of evolution and engineering of the different triplet polymerase ribozymes to provide a “roadmap”.

2) Regarding the dimerization, it is not clear how the authors implicated the 5'-hairpin to begin with, leading to the experiment that showing that a single mutation in that loop inhibits dimerization.

We welcome the opportunity to clarify this. We had originally noticed the ‘cap-’ mutation in the most abundant isolated type 1 clones, and when testing this in different constructs observed that it quenched type 1 enhancement pointing towards the 5’ hairpin region as the critical interaction site. We now refer to this in the manuscript text.

Our knowledge about this dimerization interface is currently limited and we have therefore generally refrained from speculation. However, it seems possible that the ‘cap-’ mutation was enriched in these type 1 sequences to diminish homodimerisation of type 1 that may interfere with heterodimerisation with active triplet polymerase in the selection pool.

3) The assertion that the two RNAs form a 1:1 complex is not quantitatively substantiated from Figure 3B.

We now include quantitative data on the stoichiometry of the complex as observed by gel shift (EMSA) titration in Figure 3B, and activity tests in Figure 3—figure supplement 1. These support a 1:1 stoichiometric complex in the active ribozyme as expected from the final selection pool composition.

4) Considering their success in replicating structured hairpins and achieving primer free synthesis of an 18 nt strand (β+), it would be interesting (though not necessary for publication) to see how much of the Broccoli RNA could be transcribed without the addition of a primer sequence. If this is not possible, the authors could demonstrate the complete replication of a functional RNA of tractable length, like the Hammerhead ribozyme, to illustrate the ability of their trinucleotide RPR to replicate entire sequences of functional RNAs by exclusively using triplets as substrates.

The reviewer highlights an important challenge that such ribozymes must ultimately face – copying without primers both strands of functional RNAs to achieve their complete replication using only short oligonucleotide substrates. With the current triplet polymerase ribozyme, while primer-initiated synthesis is almost general (as we show by synthesizing both + and – strands of the Broccoli RNA aptamer), we cannot currently achieve primer-free synthesis of all sequences. The latter includes Broccoli where yields of full-length RNA are poor using triplets alone. This may reflect inefficient initiation of synthesis or poor upstream inclusion of some 5’ terminal triplets.

Nevertheless, the current triplet polymerase ribozyme can achieve this for some functional sequences, most relevantly from t5 itself. To demonstrate this, we have now included data on the primer-free synthesis and replication of the Υ segment. Specifically, we describe primer-free synthesis using triplets alone of the γ+ strand segment with sufficient yield to use this ribozyme-synthesized RNA as the template for primer-free γ- strand synthesis, demonstrating complete primer-free sequence replication using only triplets. These new data are now included as Figure 7C, replacing the (now in our view redundant) primer-free γ+ syntheses.

5) The fidelity analysis is based on testing 12 of the possible 64 trinucleotides, so one cannot be convinced of the generality of their observations and inferences. The authors need to underscore this in the discussion of fidelity. The authors show that increased concentrations of triplets help fidelity, however in the absence of the 'correct' triplet, it is likely that RPR will incorporate 'mismatch' triplets. The authors should comment on this aspect.

We measured fidelity of triplet incorporation with all64 triplet substrates present but in the context of 12 defined template triplet sequences flanked by conserved CCC incorporation sites. These were chosen to be compositionally balanced, including both GC-rich and AU-rich template triplets, to provide fair estimates of ribozyme fidelity, as well as an effective tool to compare the fidelities of different ribozymes.

This assay was designed to allow us to separate incorporation fidelity (i.e. discrimination among 64 triplets for decoding a defined anti-triplet) from each ribozyme’s extension capabilities by providing a uniform, tractable sequence context.

The important qualification regarding this assay, we feel, is therefore that while it provides an accurate representation of fidelity in a defined sequence context (triplet-triplet junction), it may not provide a comprehensive picture of triplet polymerase fidelity performance in all sequence contexts. We now provide this caveat in the text.

Reviewer #2:

1) In this paper, the authors demonstrate interesting and novel results that provide further evidence for RNA's capacity to act as prebiotic genetic material. The key feature associated with this study involved using trinucleotide triphosphates (triplets) rather than NTPs as substrates for a novel RNA polymerase ribozyme. The use of triplets allowed structured templates to be unfolded and in some cases avoided the need for primers, circumventing an ongoing obstacle to RNA self-replication. This study thus makes some significant steps towards identifying a truly self-replicating prebiotic biopolymer. Literature precedents are well-discussed, and the results are interpreted in a detailed and convincing manner. I would nonetheless encourage the authors to try to make these important results as accessible as possible to readers who are unfamiliar with in vitro evolution.

As discussed above, we have made a significant number of changes to the manuscript in order to present our results more clearly.

These include textual changes to improve clarity and remove ambiguities. We have also made an effort to lay out our experimental progress and reasoning more thoroughly, using explanatory schemes of ribozyme activity in Figure 1 and Figure 1—figure supplement 1, redesigning Figures 3 and 7, and providing an additional more straightforward dataset in Figure 9—figure supplement 1. We hope this will make the manuscript easier to read and more accessible to the general reader.

Specific comments:

2) Subsection “in vitro evolution of triplet polymerase activity”, last paragraph. The use of "in-ice evolution" (a technique developed by the authors) is reported. A sentence could be added describing why this technique is being used / why it is necessary.

We have expanded our description of the mechanisms of in vitro evolution and discuss in more detail the reasoning behind our use of eutectic ice phases and in-ice evolution. As the properties of ice as a medium for RNA-catalyzed RNA synthesis, replication and evolution are discussed at length in the cited references, a fairly brief discussion must suffice. We hope nevertheless that this will now clearly illustrate our rationale in using this technique in our work.

3) For certain experiments (e.g. subsection “Fidelity of triplet-based RNA synthesis”), random triplet pools were used. What kind of triplet pools were used in other sequence-copying experiments? Were they non-random? This needs to be more clearly articulated.

We generally list the triplet substrates used for different experiments but apologise for any ambiguity.

In general for preparative syntheses we have avoided using pools of random triplets, providing instead a set of defined triplets as specified by the template sequence to minimise substrate consumption. We have however verified that random triplet pools can be used for key triplet based RNA synthesis experiments including hairpin invasion (see Figure 5—figure supplement 1) and ribozyme segment synthesis (see Figure 6—figure supplements 4 and 5), as well as in experiments on ribozyme fidelity (see e.g. Figure 8). We now explicitly state the substrate composition in the text and figure legends for all experiments described.

4) "… we found that using modified substrates with a disrupted minor groove hydrogen bond acceptor at the 3rd position…" – What is the modification? This should be stated in the body-text.

We apologize for this omission and have amended this. We now state in the text that uracil was replaced by 2-thio uracil in the triplets.

5) What is the scope of substrates for the t5+1 ribozyme? Triplets reportedly incur a lower entropic cost than NTPs (Discussion, third paragraph), but what would happen when a mixture of the two were used? Would both the NTPs and triplets be incorporated?

This is an interesting suggestion considering that abiotically-generated substrate pools would likely include a mixture of different length substrates including mono- and dinucleotides. We have performed additional experiments to explore this and now include data on their incorporation during segment synthesis in Figure 6—figure supplement 5.

In summary, t5+1 does incorporate both dinucleotide triphosphates and NTPs, but with reduced efficiency. Furthermore, they appear to serve as poor substrates for segment synthesis with triplets incorporated much more efficiently. Increasing amounts of NTP and dinucleotide substrates vs. triplets (up to a total 4:2:1 ratio) do lead to diversification of extension products outside the initial triplet register, but do not notably affect the total amount of ligation performed.

As seen in Figure 6—figure supplement 2, more length-heterogenous substrate pools (including longer oligomers) may offer both advantages and disadvantages, but we feel that a more systematic investigation of diverse substrate pools and their effect on yield and fidelity of RNA-catalyzed RNA replication goes beyond the scope of the current manuscript.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 1—source data 1. Selection conditions of rounds 1–7.
    DOI: 10.7554/eLife.35255.006
    Figure 1—source data 2. Triplet concentration-dependent extension values.
    DOI: 10.7554/eLife.35255.007
    Figure 2—source data 1. Selection conditions of rounds 8–21.
    DOI: 10.7554/eLife.35255.011
    Figure 3—source data 1. Relative intensities of shifted bands when varying type 1 equivalents.
    DOI: 10.7554/eLife.35255.015
    Figure 4—source data 1. Selection conditions of rounds 1–5 of the reselection.
    DOI: 10.7554/eLife.35255.017
    Figure 8—source data 1. Analysis of collated errors by ribozymes in the fidelity assay.
    elife-35255-fig8-data1.xlsx (158.8KB, xlsx)
    DOI: 10.7554/eLife.35255.033
    Figure 8—source data 2. Calculation of residual fidelity phenotypes in Figure 8c.
    DOI: 10.7554/eLife.35255.034
    Figure 9—source data 1. Collated error rates and ratios at different substrate concentrations.
    DOI: 10.7554/eLife.35255.038
    Supplementary file 1. Ribozyme sequences.

    All sequences are written in a 5’-to-3’ direction, and generated by GMP transcription of the corresponding PCR-generated dsDNA of the sequence downstream of 5T7 sequence duplex. All transcripts were PAGE-purified. HDV ribozyme sequences (blue) were transcribed in series with reselected type 5 ribozymes and cleave themselves off during transcription (Schürer et al., 2002) to yield precise 3’ ends with 2’, 3’-cyclic phosphates; the presence of this group did not affect type 5 activity. Sequences corresponding to the 5’ ‘cap+’ or ‘cap−’ regions from the selection, presenting a target for type 1 interaction, are coloured light green, with alternative arbitrary inert 5’ hairpin-forming sequences in dark green. Single-stranded sequences capable of hybridization with sites at the 3’ (or 5’, type 5cis) ends of certain templates to endow flexible ribozyme-template duplex tethering (Wochner et al., 2011; Attwater et al., 2010) are coloured yellow. Accessory domains 3’ of the catalytic core are in bold.

    elife-35255-supp1.docx (147.1KB, docx)
    DOI: 10.7554/eLife.35255.039
    Supplementary file 2. Transcription of triplets.

    Each target was transcribed from oligonucleotides 5T7 (Supplementary file 3) and 5’-(var)-TATAGTGAGTCGTATTAATTTCGCGGGCGAGATCGATC-3’, where the (var) overhang encodes (is the DNA reverse complement of) the sequence indicated below. Guide yields: >15 nmol = ***, 10–15 nmol = **, 5–10 nmol = *, <5 nmol = ~. †: These transcriptions yield one main product suitable for use as markers to identify comigrating triplets with a similar G/AU/C content.

    elife-35255-supp2.docx (111.6KB, docx)
    DOI: 10.7554/eLife.35255.040
    Supplementary file 3. Oligonucleotide sequences.

    All sequences are written in a 5’-to-3’ direction. DNA sequences are coloured grey. RNA sequences are coloured black, with the exception of hammerhead ribozyme (brown) and HDV ribozyme (blue) sequences transcribed in series that cleave themselves off during transcription (Schürer et al., 2002) to yield precise 5’ and 3’ ends respectively. All RNAs were denaturing PAGE-purified, and DNAs were not, unless otherwise noted (‘(non)-GP’). Competing oligonucleotides, listed beneath templates, were purified using RNeasy columns (Qiagen). Primer and oligonucleotide binding sites on templates are underlined.

    elife-35255-supp3.docx (151.1KB, docx)
    DOI: 10.7554/eLife.35255.041
    Transparent reporting form
    DOI: 10.7554/eLife.35255.042

    Data Availability Statement

    All data generated during this study and collated sequencing results are included in the manuscript and its supporting files. Source data files have been provided for Figures 1-4, 8 and 9.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES