Abstract
Because tRNA is the core biological intellectual property that was necessary to evolve translation systems, tRNAomes, ribosomes, aminoacyl-tRNA synthetases, and the genetic code, the evolution of tRNA is the core story in evolution of life on earth. We have previously described the evolution of type-I tRNAs. Here, we use the same model to describe the evolution of type-II tRNAs, with expanded V loops. The models are strongly supported by inspection of typical tRNA diagrams, measuring lengths of V loop expansions, and analyzing the homology of V loop sequences to tRNA acceptor stems. Models for tRNA evolution provide a pathway for the inanimate-to-animate transition and for the evolution of translation systems, the genetic code, and cellular life.
Keywords: type-II tRNA, V loop, V loop-acceptor stem homology, minihelices, evolution of tRNA, evolution of translation systems
1. Introduction
Ribosomes, mRNA, translation systems, genetic coding, and aminoacyl tRNA synthetases (aaRS enzymes; i.e., SerRS) evolved around cloverleaf tRNA. The evolution of tRNA, therefore, is the central problem in understanding evolution of life on earth. A model was determined for the evolution of type-I tRNAs, lacking a V loop expansion (Figure 1) [1,2]. The model is based on the ligation of three 31-nt minihelices followed by two symmetrical 9-nt deletions within ligated 3′- and 5′-acceptor stems. The model posits that type-I and type-II V loops are homologous to acceptor stems.
Cloverleaf tRNA evolved from short, defined genetic segments. A 31-nt minihelix is a 17-nt microhelix flanked 5′- and 3′- by 7-nt acceptor stems. Acceptor stems are based on a GCG repeat and its CGC complement, so the primordial tRNA acceptor stems are 5′-GCGGCGG-3′ (5′-As; As for acceptor stem) and 5′-CCGCCGC-3′ (3′-As). Ligation of a primordial 3′-As and 5′-As, therefore, gives the 14-nt sequence 5′-CCGCCGCGCGGCGG-3′. In generating type-I tRNAs, symmetrical 9-nt deletions leave the sequences 5′-GGCGG-3′ (5′-As*; As* for acceptor stem remnant) and 5′-CCGCC-3′ (3′-As*). The 5′-As* sequence is the last 5-nt of what others describe as the D loop, but which we identify as an acceptor stem remnant [1,2]. The 3′-As* sequence represents the primordial 5-nt V loop sequence for type-I tRNAs.
The 17-nt D-loop microhelix is based on a UAGCC repeat, 5′-UAGCCUAGCCUGGCCUA-3′. The G for A substitution in the third UAGCC repeat allows intercalation of D loop G19 between T loop A60 and A61 in cloverleaf tRNA (Figure 2) [1]. The numbering of tRNAs in this paper follows our adjusted numbering system, based on a D loop that lacks deletions from tRNAPri (the primordial cloverleaf tRNA). Our numbering, therefore, may vary from what some readers might expect by +3 nt after the D loop (i.e., the anticodon wobble position is listed here as 37 rather than 34).
The structure of the anticodon loop and strong interactions of the D loop, T loop, and V loop make tRNA a relatively stiff and efficient adapter for translation. The anticodon (Ac loop) and T loop microhelices derive from a stem-loop-stem sequence very similar to 5′-CCGGGUUCAAAACCCGG-3′. The CCGGG and CCCGG complementary stems are strongly supported by sequence analysis [1,2]. For tRNAPri, there is slight sequence ambiguity within the 7-nt loops, which, significantly, form a U-turn after the second U (between loop positions 2 and 3). The U-turn within the 7-nt Ac loop is necessary to present a 3-nt anticodon to support a 3-nt genetic code [1]. Without the 7-nt U-turn loop, a 3-nt genetic code would not be possible. In the anticodon loop, loop bases 3–7 stack within the loop as if in a helix, making the 7-nt U-turn Ac loop a compact loop to support a relatively stiff adapter. The T loop has the same 7-nt U-turn loop as the Ac loop, but intercalation of D loop G19 between T loop A60 and A61 lifts A61 to fill the loop, flipping A62 and U63 out of the T loop [1]. Interestingly, A62 and U63 participate in a stack of nucleotide bases that are part of the D loop-V loop-T loop interaction (extending to the “elbow”).
The 3-minihelix model is supported by inspection of archaeal tRNAs from ancient species such as Pyrococcus furiosis, Staphylothermus marinus, and Pyrobaculum aerophilum (Figure 2) [1,3]. There is some controversy in the literature about whether archaea or bacteria are closer relatives to the last universal common ancestor (LUCA), but, in terms of translation systems and tRNA, ancient archaea are clearly most similar to LUCA [4]. This can easily be shown by observing typical tRNA diagrams [3]. Very clearly, the CCGGG and CCCGG Ac loop and T loop stems are conserved, demonstrating that the Ac loop and the T loop are homologs. Because the Ac loop and T loop are homologs, no model based on only two minihelices can account for tRNA evolution [1]. Models based on two minihelices require splitting the Ac stem-loop-stem in two to compare tRNA halves, which is inconsistent with Ac loop and T loop homology, and which is evident from inspection (Figure 2) [1].
The GCG and CGC repeats make up acceptor stems and acceptor stem remnants. The UAGCC repeats in D loop sequences are also apparent. These patterns begin to degrade in bacterial tRNAs with evolution. Significantly, ancient archaeal tRNAs were generated from highly-ordered sequences, repeats, and inverted repeats (i.e., to form stem-loop-stems). Cloverleaf tRNA, therefore, evolved from an ordered and repetitive sequence, identified in some ancient archaea, to a more chaotic sequence in more derived archaea and bacteria.
2. Results
2.1. A Model for Evolution of Type-II tRNAs
Figure 1 shows a model for evolution of type-I and type-II tRNAs. The model for type-II tRNAs posits that the primordial length of the V loop expansion is 14 nt (7 nt (3′-As) + 7 nt (5′-As)). The model further posits homology of V loops with acceptor stems and acceptor stem remnants. Because archaeal tRNAs are more similar to LUCA tRNAs than are bacterial tRNAs, initially, archaeal tRNAs were collected and compared. In archaea, with rare exceptions, only tRNALeu and tRNASer are type-II tRNAs. We find that expanded and 5-nt V loops are misaligned in tRNAdb and gtRNA databases [3,5,6]. In those databases, V loops were aligned to optimize sequence similarities, introducing inappropriate gaps, rather than, as we align them here, by evolutionary comparisons and secondary structures.
2.2. Archaeal tRNAs with Expanded V Loops
Because V loops are variable in length, they are numbered V1 to VN, in which N = length of the V loop. For archaeal tRNALeu, N = 14, typically, as expected from the model (Figure 1). For archaeal tRNASer, N = 16, typically. Analysis of tRNAomes (all of the tRNAs for an organism displayed as an evolutionary tree and rooted to tRNAPri) indicates that tRNALeu (N = 14) evolves to tRNASer (N = 16), indicating that V loop expansions are derived from N = 14 (Figure 1) [4]. We posit that the initial length of an expanded V loop was N = 14, and that longer and shorter V loop expansions are generated by the insertion or deletion of bases most often located approximately to the middle of the V loop.
V loops in cloverleaf tRNA are under different selection pressures than acceptor stems. In Figure 3, some of these interactions are highlighted. Figure 3A shows a set of stacked bases stabilizing interactions of the D loop, V loop, and T loop. Figure 3B–D shows some details of interactions. In archaeal tRNALeu and tRNASer, U V1 is selected to form a G29~U V1 wobble base pair, and C VN is selected to form a reverse Watson-Crick base pair (G15:C VN), termed the “Levitt” base pair (Figure 3C) [7,8]. In archaea, G15 is often modified to archaeosine, which stabilizes the G15 (archaeosine):C VN interaction, particularly in the presence of Mg2+. Typical secondary structures of expanded V loops are selected to be different for tRNALeu and tRNASer, so that aaRS enzymes make few errors charging tRNALeu, tRNASer, and other tRNAs. Similarly, in type-I archaeal tRNAs, a G29~U V1 wobble pair and a G15:C V5 reverse Watson-Crick Levitt base pair are selected strongly.
Figure 4 shows V loop expansions in archaea. As noted above, Pyrococcus is an ancient archaeal family with significant similarity to LUCA tRNAs [4]. Three Pyrococcus species are compared for tRNALeu (Figure 4A) and tRNASer (Figure 4B). Using typical tRNA diagrams, for tRNALeu, N = 14, typically. For tRNASer, N = 15, typically. The G15:C VN Levitt base pair and the G29~U V1 wobble pair are evident. Secondary structures are sufficiently different for LeuRS and SerRS to discriminate tRNALeu from tRNASer.
For all archaea, results are very similar (Figure 4C–F). For tRNALeu (Figure 4C), N = 14, typically. For tRNASer (Figure 4D), N = 16, typically, indicating further V loop expansion through the archaeal domain comparing to the most ancient archaea such as Pyrococcus (Figure 4B). Histograms of V loop lengths for archaea are shown in Figure 4E,F. The G15:C VN reverse Levitt base pair and the G29~U V1 wobble pair are evident (Figure 4C,D). Secondary structures are distinct for tRNALeu and tRNASer V loops, so LeuRS, SerRS and other aaRS enzymes can discriminate tRNALeu and tRNASer. V loop secondary structures for all archaea are very similar to those observed for Pyrococcus tRNAs (Figure 4A,B). As predicted and expected, analysis of archaeal tRNAs with V loop expansions presents a very simple story of evolution that fits to the same model for evolution of type-I tRNAs (Figure 1).
2.3. Evolution of Bacterial tRNAs with Expanded V Loops
Bacteria are expected to be more derived than archaea for tRNA evolution, and bacteria have additional type-II tRNAs that are absent in archaea (Figure 5). In Figure 5A,B, bacterial tRNALeu and tRNASer are compared as typical tRNA diagrams [3]. For bacterial tRNALeu (Figure 5A), N = 15, typically, and for tRNASer (Figure 5B), N = 19, typically, indicating that bacterial tRNAs are more derived from LUCA than archaeal tRNAs. In contrast to archaeal tRNALeu, in bacterial tRNALeu, an atypical A15: U VN Levitt base pair may be indicated. In bacterial tRNASer, a conserved G15:C VN Levitt base pair is typical. G29~U V1 wobble pairs are typical for both tRNALeu and tRNASer in bacteria. V loop secondary structures are distinct for archaeal and bacterial tRNALeu and tRNASer. The bacterial tRNASer V loop is particularly floppy, with fewer stabilizing V loop stem base pairs than are present in archaeal tRNASer (Figure 4B,D). The tRNASer V loop is a major positive determinant for recognition by SerRS [10]. Because archaeal and bacterial V loop expansions are distinct, it is likely that there is archaeal-bacterial speciation that limits tRNA sharing between domains, i.e., via horizontal gene transfer. Different modifications of tRNAs in the two domains must also suppress tRNA exchanges.
Some bacteria and a few species of archaea utilize tRNASec (Sec for selenocysteine) (Figure 5C). To charge tRNASec, serine is first attached to tRNASec, and is then modified to selenocysteine [11]. For tRNASec, N = 22, typically, and V loop base pairing is extensive, in contrast to bacterial tRNASer. The floppiness of the tRNASer V loop, therefore, may, in part, help to discriminate tRNASer and tRNASec to support the tRNASec modification pathway. tRNASec appears to be derived from tRNASer. Because tRNASec can also be a type-I tRNA, it appears that the ability to process the V loop from type-II to type-I may have persisted in evolution [3,5,6]. Human tRNASec (PDB 3A3A) shows a slightly modified cloverleaf fold lacking a Levitt base pair [12].
In bacteria, but not in archaea, tRNATyr is a type-II tRNA (Figure 5D). Bacterial tRNATyr, therefore, may have been reassigned relative to tRNATyr in archaea. From analysis of tRNAomes [4], it appears that bacterial tRNATyr may have arisen from a tRNASer. For bacterial tRNATyr, N = 13, typically. The G15:C VN reverse Watson-Crick Levitt base pair is typically present, along with the G29~U V1 wobble base pair. Bacterial tRNATyr is discriminated from other bacterial type-II tRNAs by the shorter length of its V loop and the distributions of its V loop stem pairs. In Figure 5E, the structure of the tRNATyr 14-nt V loop from Thermus thermophilus is shown along with its connections from a co-crystal with TyrRS (PDB 1H3E) [13]. The G15~C V14 and G29~U V1 connections and expected V loop secondary structures are evident. TyrRS reads V loop bases directly to charge tRNATyr.
2.4. Statistical Analyses
2.4.1. Expanded V Loops are Derived from Acceptor Stems
Our model for evolution of type-II tRNA V loops posits that they arose from a sequence very close to CCGCCGCGCGGCGG, which is a primordial 3′-As ligated to a 5′-As (Figure 1). The prediction is tested by comparing the 3′-As to the first 7-nt of V loops and the 5′-As to the last 7-nt of V loops. This approach works for sequences of N > 13, so V loops with N < 14 were not considered. The first and the last nt were dropped from the comparisons, because the first V loop nt (V1) interacts with position 29 (generally G29~U V1), and the last V loop nt (VN) interacts with position 15 (generally G15~C VN; the Levitt base pair). We are assuming that V loops longer than N = 14 have insertions near the center of the sequence.
We have used a random permutation test [1] to indicate the similarities of two long alignments (n sequences; n is large) of potentially homologous short and aligned sequences. In this comparison, collected alignments of length l (l1 = l2) and number of sequences n (n1, n2) are aligned. The code utilized requires comparison of n1 = n2 for the two aligned sequences, so, in cases in which n2 > n1, 50 random selections of n2 = n1 are selected, and the comparison is repeated. In each comparison, 1000 random permutations of l2, n2 = n1 are compared to l1, n1. P-values of 50 repetitions are averaged. If the P-value for the comparison is small (i.e., <0.05), this indicates similarity and possible homology. If the P-value for the comparison is larger (i.e., >0.05), this indicates that sequences are more dissimilar and possibly not homologous. The results are shown in Table 1.
Table 1.
P-values against Archaeal Acceptor Stems | ||||||
V loop | Arch LEU | Arch SER | Bact LEU | Bact SER | Bact TYR | Bact SEC |
AVERAGE | 0.001 | 0.001 | 0.020 | 0.999 | 1.000 | 0.001 |
P-values against Bacterial Acceptor Stems | ||||||
V loop | Arch LEU | Arch SER | Bact LEU | Bact SER | Bact TYR | Bact SEC |
AVERAGE | 0.001 | 0.013 | 1.000 | 0.277 | 0.020 | 0.860 |
We conclude that expanded type-II V loops are homologous to acceptor stems (3′-As ligated to 5′-As) (Table 1) as predicted by the model for tRNA evolution (Figure 1). Archaeal tRNALeu and tRNASer V loops test as potentially homologous to both archaeal (P-values = 0.001 and 0.001) and bacterial (P-values = 0.001 and 0.013) acceptor stems. Because 1000 random permutations are compared, the lowest P-value that is possible is 0.001. This result appears to demonstrate the model shown in Figure 1 for V loop expansions. The bacterial tRNALeu V loop also tests as potentially homologous to archaeal acceptor stems (P-value = 0.020). Bacterial tRNASec (Sec for selenocysteine) tests as being likely homologous to archaeal acceptor stems (P-value = 0.001). Interestingly, bacterial tRNATyr with an expanded V loop tests as potentially homologous to bacterial acceptor stems (P-value = 0.020) but not to archaeal acceptor stems (P-value = 1). The bacterial tRNATyr P-value = 0.020 result may be attributable to convergent evolution of C-G rich sequences, because bacterial tRNATyr is probably derived from sequences similar to bacterial tRNALeu or bacterial tRNASer. Also, similarly to bacterial tRNATyr, bacterial tRNASer scores as more similar to bacterial acceptor stems (P-value = 0.277) than archaeal acceptor stems (P-value = 0.999). As we have shown in other ways, archaeal tRNAs are significantly less radiated from a primordial tRNA than bacterial tRNAs [1,4], and this observation is documented from comparisons of type-II tRNA V loop expansions.
2.4.2. Kinship of Expanded V Loops
The relatedness of expanded V loops is described in Figure 6 using comparisons of P-values. Once again, a P-value < 0.05 indicates probable homology of compared alignments. Archaeal tRNALeu and tRNASer are closely related in sequence [4], and all bacterial V loop expansions appear to relate closely to archaeal tRNALeu and tRNASer. These results indicate that archaeal tRNAs are closer to LUCA tRNAs than bacterial tRNAs. In bacteria, tRNALeu and tRNASer have diverged from one another, presumably to support aaRS discrimination in tRNA charging [10]. We posit that type-II bacterial tRNATyr was derived from tRNASer through a process of tRNA re-assignment, which we have previously described for another tRNA [4]. Essentially, a type-I tRNATyr (as in archaea) was eliminated and replaced by a type-II tRNATyr derived from a tRNASer.
3. Discussion
The mechanism for type-II tRNA evolution is the same as the mechanism for type-I tRNA evolution, but lacks a processing step (Figure 1). The primordial length of the type-II V loop was 14-nt, as observed for archaeal tRNALeu (Figure 4). We posit that tRNALeu evolved to tRNASer, which has a longer V loop because of insertions near the middle of the loop [4]. Both in length and sequence, archaeal tRNALeu and tRNASer appear closer to LUCA tRNAs than bacterial tRNALeu and tRNASer. Type-II bacterial tRNATyr and tRNASec appear to be derived from tRNASer. Bacterial type-II tRNATyr appears to have evolved by tRNA reassignment: elimination of a type-I tRNATyr and reassignment of a tRNASer to tRNATyr.
The model for evolution of type-I tRNAs was developed by inspection, similarly to the solving of a puzzle [1,2]. From inspection of typical tRNA diagrams, using ancient archaea, homology of the Ac and T stem-loop-stems was evident (Figure 2). This accounts for 34-nt of tRNA, and the type-I tRNA core (lacking 3′-ACCA) is initially 75 nt. Considering acceptor stems, 48 nt of tRNA are described. This is more than half the tRNA. Eventually, the D loop microhelix was solved as a truncated UAGCC repeat of 17 nt (Figure 2). Because three 17-nt microhelix sequences were present (D loop, Ac loop and T loop), and the D loop and T loop were flanked on one side by a 7-nt acceptor stem, this indicated that the molecule was derived from three 31-nt minihelices. Therefore, evolution of type-I tRNAs could be solved by two symmetrical 9-nt deletions, and the two remaining 5-nt sequences were the last 5 nt of the D loop (5′-As*) and the 5-nt V loop (3′-As*) (Figure 1). The length of the primordial cloverleaf tRNA lacking 3′-ACCA was 75 nt (Figure 1 and Figure 2). This model for type-I tRNA evolution was strongly supported using a battery of statistical sequence comparisons [1]. In the current paper, the model for type-I tRNA evolution was extended to describe evolution of type-II tRNAs with V-loop expansions (Figure 1). The model for type-II tRNAs is identical, except there is no processing of the 3′ 14-nt ligated 3′- and 5′-acceptor stems (7-nt + 7-nt), which becomes the expanded V loop. Slight variations of the same model, therefore, completely and cleanly account for evolution of type-I and type-II tRNAs.
Type-II tRNAs have been processed in evolution to type-I tRNAs, consistent with our model for evolution of type-I tRNAs (Figure 1). Notably, tRNASer and tRNASec, which primarily are type-II tRNAs, include type-I tRNAs [3]. The model in Figure 1 indicates that the first cloverleaf tRNAs may have included 93, 84 and 75-nt core sequences. Most tRNAs are of the type-I variety, but tRNALeu, tRNASer, tRNASec and bacterial tRNATyr remained mostly type II. Because of the apparent appearance of type-I tRNAs from type-II tRNAs in modern lineages, it is possible that type-II tRNAs can still be processed under some conditions to type-I tRNAs. Perhaps, these 93, 84, and 75-nt species of tRNA cloverleafs co-existed, until selection of primarily type-I tRNAs and a smaller collection of type-II tRNAs. It appears that V loop lengths and sequences were mostly selected to optimize discrimination by ~20 aaRS enzymes [10].
3.1. Comparison of tRNA Evolution Models
Competing models for tRNA evolution have been advanced [14,15,16,17,18,19]. Some of these models indicate that two minihelices might be ligated to form a primordial tRNA. Our model, by contrast, requires ligation of three 31-nt minihelices representing two different 17-nt microhelix core sequences (1:2; the D loop and the homologous Ac and T stem-loop-stems) (Figure 1). We show clearly that the Ac loop and T loop are homologs (Figure 2) [1]. In a two minihelix model, however, the Ac and the T stem-loop-stem cannot be homologous, because the Ac loop must be bisected to make the comparison, spoiling the alignment. Rather, a two minihelix model predicts that the D loop and the T loop should be similar in sequence, which they clearly are not. As we show here, and as we have shown previously, the D-loop microhelix is based on a UAGCC repeat, which cannot be similar in sequence to a CCGGGUUCAAAUCCCGG T stem-loop-stem [1]. In Figure 4B, we show two perfect UAGCC repeats in the D loop, indicating the UAGCC repeat. Another criticism of the two minihelix models is that they appear to require unlikely sequence and structural convergence of the 7-nt U-turn Ac and T loops. If the homology of the Ac and T stem-loop-stems is accepted (Figure 2), only the three minihelix model makes sense. One proposed two minihelix model is based too heavily on analysis of tRNA introns in the Ac loop of one archaeal species [15]. Introns are found in many sites of archaeal tRNAs, not just in the Ac loop [20]. Our three minihelix model is strongly supported by identification of internal D loop (5′-As*) and V loop (type I: 3′-As*; type II: 3′-As ligated to a 5′-As) homologies to acceptor stems. Our model for type-II tRNAs strongly supports the model we previously proposed for processing a ligated 3′-As and 5′-As (14 nt) by deletion of 9 nt to yield a 3′-As* type-I V loop [1], because we identify the previously predicted intermediate in processing to a type-I tRNA as existing in type-II tRNA. Put more simply, type-II tRNA is the predicted intermediate in processing of type-I tRNA (Figure 1) [1]. Our model makes strong sequence predictions, which are all justified by statistical tests (Table 1) [1]. So far as we can judge, two minihelix models do not make strong sequence predictions that can be justified by any analysis we can apply.
3.2. Evolution of the Genetic Code
Because tRNA evolution is such a simple story, evolution of the genetic code and translation systems becomes simpler to understand [4,21]. Significantly, the tRNA-centric view provides a simplified understanding of genetic code evolution. As viewed from the perspective of mRNA, in which all 64 codons are used, >1084 genetic codes and up to 63 encoded amino acids might be possible [22]. Viewed from the perspective of tRNA, however, the genetic code is half the size: a 32-letter code in tRNA versus a 64 letter code in mRNA [4,21]. The reason the code in tRNA is smaller than it is in mRNA is that ambiguity in reading the wobble position of tRNA limits the size of the code. Essentially, because codon-anticodon contacts are not fully proofread for the wobble position base on the ribosome, there is only purine versus pyrimidine discrimination at the wobble position, not single base (A,G,C,U) recognition. The single exception is tRNAIle (UAU) versus tRNAMet (CAU), which is supported by extensive modifications to tRNAMet (CAU) [23,24,25]. Furthermore, for the most part, tRNAIle (UAU) is only utilized in eukaryotes and not in prokaryotes. The maximum complexity of the genetic code in tRNA, therefore, is 4 × 4 × 2, instead of 4 × 4 × 4 in mRNA. The standard code, therefore, evolved to encode 20 amino acids rather than a larger number. There are additional dimensions to this story described in other work [4,21].
3.3. Evolution of tRNA Sequence Proceeded from Order to Chaos
Archaeal tRNAs are better preserved from LUCA than bacterial tRNAs, and archaeal tRNAs are more highly-ordered in their sequence [1,2]. Ancient archaea such as Pyrococcus, Pyrobaculum and Staphylothermus have tRNAomes (i.e., typical tRNA diagrams) that are more similar to a LUCA tRNAome than more derived species [4]. These tRNAomes are more ordered in sequence, because tRNA evolved from repeating sequences (Figure 1). The ancient world of ~4 billion years ago, therefore, in some cases, evolved biological complexity from ordered sequence, in the form of repeats and inverted repeats. From analysis of tRNA evolution, therefore, the assumption that biological complexity was generated only from random polymer sequences is incorrect. Processes such as replication slippage and abortive initiation generated repeats and/or short RNA fragments that could be attached by ligation. Evolution from ordered repeats to chaos can clearly be seen in tRNA evolution, by inspection of typical tRNA diagrams for ancient archaea compared to more derived bacteria. Furthermore, mechanisms probably existed in the ancient world to measure the lengths of sequences, because repeats were clipped into functional units of 5 nt (Ac and T loop stems), 7 nt (i.e., acceptor stems, Ac and T loops) and 17 nt (i.e., D loop, Ac loop and T loop microhelices). Of course, these length selections may represent selections for evolving biological function.
3.4. The Inanimate to Animate Transition
The central advance in biological intellectual property in evolution of life on earth was cloverleaf tRNA, the adapter that permits biological coding, and around which coding functions evolved [4,21]. From conserved tRNA sequences, the pathway of tRNA evolution has been determined (Figure 1). This is a story of building biological complexity from ordered repeats and snap back stem-loop-stems. So, life on earth was snapped together (ligated) similarly to the children′s game of LEGO (trademark) (Figure 1). The inanimate to animate transition is described as a simple model in Figure 7, tracking the evolution of microhelices→minihelices→cloverleaf tRNA→translation systems→cellular life. Life, therefore, evolved from a primitive inanimate polymer world that includes short sequences (i.e., ACCA; abortive initiation), repeats (i.e., GCG, CGC and UAGCC repeats; replication slippage) and inverted repeats (Ac loop and T loop microhelices; stem-loop-stems, which can attach to form replication primers). Polymers are generated via dehydration reactions, so cycles of hydration and dehydration may be sufficient to describe generation of the first biopolymers [26,27].
From a strange polymer world that includes 17-nt microhelices (i.e., D loop, Ac loop and T loop microhelices), a small collection of ribozymes appears necessary and possibly sufficient to generate cloverleaf tRNA and translation systems. These ribozyme activities have been largely reinvented in vitro, and some of these ribozymes can be quite small, indicating that their evolution via simple non-biotic processes might be possible [28,29,30,31,32,33,34]. Hydration-dehydration cycles drive polymerization reactions and concentrate cofactors such as Mg2+.
As we have previously proposed, microhelices, minihelices, and cloverleaf tRNA may have been initially evolved to synthesize polyglycine, proposed to have been used to stabilize protocells, as in bacterial cell walls [4,21]. We note that polyglycine requires a membrane anchored carbohydrate to form cell wall-like cross links, so polyglycine by itself is not sufficient for stabilization of protocells. One evidence for the polyglycine model is that tRNAPri (the primordial tRNA cloverleaf; Figure 2) is closest in sequence to tRNAGly in ancient archaea [4]. We have described a Darwinian pathway to evolve the 21-letter genetic code (20 amino acids with stops) from a one letter code synthesizing polyglycine (using any mRNA sequence). Once cloverleaf tRNA evolves, therefore, evolution of the genetic code appears to be assured. We conclude that evolution of the tRNA cloverleaf is the major advance in evolution of biological intellectual property that led to evolution of the genetic code, translation systems and cellular life. Once cloverleaf tRNA evolves, Darwinian selection drives evolution of dependent processes (translation, genetic code, aaRS enzymes), so evolution of cloverleaf tRNA is the core advance.
As described previously, the evolution of the ribosome requires initially a scaffold on which to mount and move mRNA (a decoding center) and perhaps a mobile peptidyl transferase center [2]. The peptidyl transferase center can be viewed as a dehydration and molecular crowding chamber to drive the polymerization of polypeptide chains [27]. If the peptidyl transferase center is a “ribozyme”, it is not a good one. Every other function of the ribosome is a refinement or add-on: i.e., translocation, proofreading, initiation, and termination functions. Unlike tRNA (Figure 1), the evolutionary source of rRNA is more obscure. Without breakthrough success, our laboratory has attempted to solve this problem in collaboration with Robert Root-Bernstein (MSU). As a cautionary tale, rRNA sequences appear cloverleaf tRNA-like, and ancient archaeal rRNAs appear more tRNA-like than more derived species, indicating that cloverleaf tRNA was one of the building blocks of rRNA sequences. Back translating rRNA sequences (BlastX; NCBI) gives apparent open reading frames, but none of these can clearly be traced to an independent functional gene or close homolog. Generally, in archaeal and bacterial genomes, these long open reading frames are only found in rRNA sequences. Open reading frames with an apparent annotation, i.e., a “cell wall hydrolase” embedded in the peptidyl transferase center of 23S rRNA as a reverse orientation gene, cannot be confirmed to encode a cell wall hydrolase with a known function that exists anywhere as a stand-alone gene or that has notable homology to other cell wall hydrolases or structures. We concluded that this sequence does not encode a cell wall hydrolase, and was wrongly annotated but propagated through the reported annotations of many genomes leading to potential confusion, including our own. So, although a fairly simple model for ribosome evolution can be proposed, to our knowledge, in contrast to tRNA (Figure 1), the detailed evolution of rRNA sequences and the evolution of the ribosome remain largely unsolved but compelling mysteries.
4. Materials and Methods
Methods and databases have been described previously [1,2,3,4,21]. The statistical permutation test is useful for comparing two short, aligned sequences with many examples of each for possible homology [1]. Most tRNA sequences were taken from gtRNAdb [5,6]. For convenience, typical tRNA diagrams were taken from the older tRNAdb [3]. V loop sequences were added to typical tRNA diagrams by hand. Structures were analyzed using UCSF Chimera [35,36].
5. Conclusions
We conclude that evolution of cloverleaf tRNA, as described here (Figure 1), drove the inanimate to animate transition in evolution of life on earth (Figure 7). The inanimate world is characterized by a strange polymer world with unexpected order that includes sequence repeats (i.e., GCG, CGC and UAGCC repeats), short abundant potentially functional sequences (i.e., ACCA), and snap-back primers (stem-loop-stems, inverted repeats; microhelices, minihelices). Ordered polymers led to evolution of microhelices, minihelices, and cloverleaf tRNA (Figure 1). Once cloverleaf tRNA evolved, evolution of the genetic code, the ribosome, translation systems, and cellular life were assured. Strangely, very few uncertainties remain in this amazing story recorded and told in genetic sequences that evolved about 4 billion years ago.
Abbreviations
aaRS | Aminoacyl-tRNA synthetase (i.e., LeuRS) |
As | Acceptor stems |
As* | Acceptor stem remnants |
Ac loop | Anticodon loop |
LUCA | Last universal common (cellular) ancestor |
T loop | T loop or TΨC loop |
V loop | Variable loop |
Author Contributions
Conceptualization, B.K., K.O. and Z.F.B.; Data curation, Y.K. and Z.F.B.; Formal analysis, Y.K. and Z.F.B.; Investigation, Y.K. and Z.F.B.; Methodology, Y.K. and Z.F.B.; Project administration, B.K., K.O. and Z.F.B.; Software, Y.K. and K.O.; Supervision, K.O.; Validation, Y.K.; Visualization, Z.F.B.; Writing–original draft, Z.F.B.; Writing–review & editing, Z.F.B.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Pak D., Root-Bernstein R., Burton Z.F. tRNA structure and evolution and standardization to the three nucleotide genetic code. Transcription. 2017;8:205–219. doi: 10.1080/21541264.2017.1318811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Root-Bernstein R., Kim Y., Sanjay A., Burton Z.F. tRNA evolution from the proto-tRNA minihelix world. Transcription. 2016;7:153–163. doi: 10.1080/21541264.2016.1235527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Juhling F., Morl M., Hartmann R.K., Sprinzl M., Stadler P.F., Putz J. tRNAdb 2009: Compilation of tRNA sequences and tRNA genes. Nucleic Acids Res. 2009;37:D159–D162. doi: 10.1093/nar/gkn772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Pak D., Du N., Kim Y., Sun Y., Burton Z.F. Rooted tRNAomes and evolution of the genetic code. Transcription. 2018;9:137–151. doi: 10.1080/21541264.2018.1429837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chan P.P., Lowe T.M. GtRNAdb 2.0: An expanded database of transfer RNA genes identified in complete and draft genomes. Nucleic Acids Res. 2016;44:D184–D189. doi: 10.1093/nar/gkv1309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chan P.P., Lowe T.M. GtRNAdb: A database of transfer RNA genes detected in genomic sequence. Nucleic Acids Res. 2009;37:D93–D97. doi: 10.1093/nar/gkn787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chawla M., Abdel-Azeim S., Oliva R., Cavallo L. Higher order structural effects stabilizing the reverse Watson-Crick Guanine-Cytosine base pair in functional RNAs. Nucleic Acids Res. 2014;42:714–726. doi: 10.1093/nar/gkt800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Oliva R., Tramontano A., Cavallo L. Mg2+ binding and archaeosine modification stabilize the G15 C48 Levitt base pair in tRNAs. RNA. 2007;13:1427–1436. doi: 10.1261/rna.574407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Westhof E., Dumas P., Moras D. Restrained refinement of two crystalline forms of yeast aspartic acid and phenylalanine transfer RNA crystals. Acta. Crystallogr. A. 1988;44 Pt 2:112–123. doi: 10.1107/S010876738700446X. [DOI] [PubMed] [Google Scholar]
- 10.Perona J.J., Gruic-Sovulj I. Synthetic and editing mechanisms of aminoacyl-tRNA synthetases. Top Curr. Chem. 2014;344:1–41. doi: 10.1007/128_2013_456. [DOI] [PubMed] [Google Scholar]
- 11.Turanov A.A., Xu X.M., Carlson B.A., Yoo M.H., Gladyshev V.N., Hatfield D.L. Biosynthesis of selenocysteine, the 21st amino acid in the genetic code, and a novel pathway for cysteine biosynthesis. Adv. Nutr. 2011;2:122–128. doi: 10.3945/an.110.000265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Itoh Y., Chiba S., Sekine S., Yokoyama S. Crystal structure of human selenocysteine tRNA. Nucleic Acids Res. 2009;37:6259–6268. doi: 10.1093/nar/gkp648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Yaremchuk A., Kriklivyi I., Tukalo M., Cusack S. Class I tyrosyl-tRNA synthetase has a class II mode of cognate tRNA recognition. EMBO J. 2002;21:3829–3840. doi: 10.1093/emboj/cdf373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bernhardt H.S. Clues to tRNA Evolution from the Distribution of Class II tRNAs and Serine Codons in the Genetic Code. Life (Basel) 2016;6:10. doi: 10.3390/life6010010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Di Giulio M. A comparison among the models proposed to explain the origin of the tRNA molecule: A synthesis. J. Mol. Evol. 2009;69:1–9. doi: 10.1007/s00239-009-9248-z. [DOI] [PubMed] [Google Scholar]
- 16.Widmann J., Di Giulio M., Yarus M., Knight R. tRNA creation by hairpin duplication. J. Mol. Evol. 2005;61:524–530. doi: 10.1007/s00239-004-0315-1. [DOI] [PubMed] [Google Scholar]
- 17.Nagaswamy U., Fox G.E. RNA ligation and the origin of tRNA. Orig. Life Evol. Biosph. 2003;33:199–209. doi: 10.1023/A:1024658727570. [DOI] [PubMed] [Google Scholar]
- 18.Tamura K. Origins and Early Evolution of the tRNA Molecule. Life (Basel) 2015;5:1687–1699. doi: 10.3390/life5041687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Branciamore S., Di Giulio M. The presence in tRNA molecule sequences of the double hairpin, an evolutionary stage through which the origin of this molecule is thought to have passed. J. Mol. Evol. 2011;72:352–363. doi: 10.1007/s00239-011-9440-9. [DOI] [PubMed] [Google Scholar]
- 20.Sugahara J., Fujishima K., Morita K., Tomita M., Kanai A. Disrupted tRNA gene diversity and possible evolutionary scenarios. J. Mol. Evol. 2009;69:497–504. doi: 10.1007/s00239-009-9294-6. [DOI] [PubMed] [Google Scholar]
- 21.Pak D., Burton Z.F. Aminoacyl-tRNA synthetase proofreading, anticodon wobble preference and sectoring of the genetic code via tRNA charging errors. Transcription. 2018;9:205–224. doi: 10.1080/21541264.2018.1467718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jose M.V., Govezensky T., Garcia J.A., Bobadilla J.R. On the evolution of the standard genetic code: Vestiges of critical scale invariance from the RNA world in current prokaryote genomes. PLoS ONE. 2009;4:e4340. doi: 10.1371/journal.pone.0004340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Agris P.F., Eruysal E.R., Narendran A., Vare V.Y.P., Vangaveti S., Ranganathan S.V. Celebrating wobble decoding: Half a century and still much is new. RNA Biol. 2018;15:537–553. doi: 10.1080/15476286.2017.1356562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Vare V.Y., Eruysal E.R., Narendran A., Sarachan K.L., Agris P.F. Chemical and Conformational Diversity of Modified Nucleosides Affects tRNA Structure and Function. Biomolecules. 2017;7:29. doi: 10.3390/biom7010029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Agris P.F., Vendeix F.A., Graham W.D. tRNA′s wobble decoding of the genome: 40 years of modification. J. Mol. Biol. 2007;366:1–13. doi: 10.1016/j.jmb.2006.11.046. [DOI] [PubMed] [Google Scholar]
- 26.Rodriguez-Garcia M., Surman A.J., Cooper G.J., Suarez-Marina I., Hosni Z., Lee M.P., Cronin L. Formation of oligopeptides in high yield under simple programmable conditions. Nat. Commun. 2015;6:8385. doi: 10.1038/ncomms9385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Bowman J.C., Hud N.V., Williams L.D. The ribosome challenge to the RNA world. J. Mol. Evol. 2015;80:143–161. doi: 10.1007/s00239-015-9669-9. [DOI] [PubMed] [Google Scholar]
- 28.Illangasekare M., Yarus M. Small aminoacyl transfer centers at GU within a larger RNA. RNA Biol. 2012;9:59–66. doi: 10.4161/rna.9.1.18039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kennedy R., Lladser M.E., Wu Z., Zhang C., Yarus M., De Sterck H., Knight R. Natural and artificial RNAs occupy the same restricted region of sequence space. RNA. 2010;16:280–289. doi: 10.1261/rna.1923210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Yarus M. Ahead and behind: A small, small RNA world. RNA. 2015;21:769–770. doi: 10.1261/rna.051086.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Yarus M. The meaning of a minuscule ribozyme. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2011;366:2902–2909. doi: 10.1098/rstb.2011.0139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Turk R.M., Chumachenko N.V., Yarus M. Multiple translational products from a five-nucleotide ribozyme. Proc. Natl. Acad. Sci. USA. 2010;107:4585–4589. doi: 10.1073/pnas.0912895107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Turk R.M., Illangasekare M., Yarus M. Catalyzed and spontaneous reactions on ribozyme ribose. J. Am. Chem. Soc. 2011;133:6044–6050. doi: 10.1021/ja200275h. [DOI] [PubMed] [Google Scholar]
- 34.Chumachenko N.V., Novikov Y., Yarus M. Rapid and simple ribozymic aminoacylation using three conserved nucleotides. J. Am. Chem. Soc. 2009;131:5257–5263. doi: 10.1021/ja809419f. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Yang Z., Lasker K., Schneidman-Duhovny D., Webb B., Huang C.C., Pettersen E.F., Goddard T.D., Meng E.C., Sali A., Ferrin T.E. UCSF Chimera, MODELLER, and IMP: An integrated modeling system. J. Struct. Biol. 2012;179:269–278. doi: 10.1016/j.jsb.2011.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Couch G.S., Hendrix D.K., Ferrin T.E. Nucleic acid visualization with UCSF Chimera. Nucleic Acids Res. 2006;34:e29. doi: 10.1093/nar/gnj031. [DOI] [PMC free article] [PubMed] [Google Scholar]