Abstract
Template-directed polymerization of nucleotides is believed to be a pathway for the replication of genetic material in the earliest cells. We assume that activated monomers are produced by prebiotic chemistry. These monomers can undergo spontaneous polymerization, a system that we call “prelife.” Adding template-directed polymerization changes the equilibrium structure of prelife if the rate constants meet certain criteria. In particular, if the basic reproductive ratio of sequences of a certain length exceeds one, then those sequences can attain high abundance. Furthermore, if many sequences replicate, then the longest sequences can reach high abundance even if the basic reproductive ratios of all sequences are less than one. We call this phenomenon “subcritical life.” Subcritical life suggests that sequences long enough to be ribozymes can become abundant even if replication is relatively inefficient. Our work on the evolution of replication has interesting parallels to infection dynamics. Life (replication) can be seen as an infection of prelife.
1 Introduction
Evolution is based on replication, mutation, and selection. Replication proliferates information. Mutation results in errors in replication and thereby produces diversity. Selection occurs when some mutants are more fit than others. In order to understand the beginning of evolution, we must understand the origins of replication. In our earlier work, we studied “prelife” dynamics and the onset of replication (Nowak and Ohtsuki, 2008; Manapat et al., 2009; Ohtsuki and Nowak, 2009). Prelife is a model of a soup of activated monomers that undergo spontaneous polymerization. How might prebiotic replication occur in reality? Template-directed synthesis is believed to be a mechanism for copying nucleic acid sequences before enzymes evolved (Ninio and Orgel, 1978; Lohrmann et al., 1980; Joyce, 1987; Orgel, 2004). In this mechanism, the template sequence accelerates polymerization of a complementary strand through base-pairing interactions with the nucleotide monomers. This effect accelerates polymerization by at least one or two orders of magnitude (Kanavarioti and White, 1987). We model such synthesis as a three-step process: (1) association between primer and template, (2) template-directed elongation of the primer, and (3) dissociation of the double-stranded product. In addition, the grouping together of molecules (e.g., using vesicle membranes) is thought to be important for the evolution of replication enzymes (Szathmáry and Demeter, 1987; Szostak et al., 2001; Chen et al., 2006; Traulsen and Nowak, 2006). A recent experimental study demonstrated template-directed synthesis inside model protocells using activated monomers supplied from the exterior (Mansy et al., 2008). The system we study in this paper is in some ways similar to the system of nucleic acid sequences in a model protocell.
There have been many investigations of the origin of life, both experimental and theoretical (Oparin, 1953; Crick, 1968; Orgel, 1968, 1992; Eigen, 1971; Eigen and Schuster, 1982; Dyson, 1982, 1999; Kuppers, 1983; Stein and Anderson, 1984; Farmer et al., 1986; Fontana and Schuster, 1998; Lifson and Lifson, 1999; de Duve, 2005, 2007). The first step towards life must have been the synthesis of basic organic building blocks—amino acids, carbohydrates, and nucleotides—in primitive-Earth conditions (Miller, 1953; Allen and Ponnamperuma, 1967; Miller and Orgel, 1974; Hargreaves et al., 1977; Rao et al., 1982; Rushdi and Simoneit, 2001; Benner et al., 2002; Ricardo et al., 2004; Benner and Ricardo, 2005; Wachtershauser, 2007; Powner et al., 2009). After those basic components became available, mechanisms for storing information and catalyzing chemical reactions must have arisen. While other genetic polymers may have preceded RNA, it is generally accepted that an “RNA world” evolved at some point. In this world, RNA acted as both information carrier and enzyme (Woese, 1967; Crick, 1968; Orgel, 1986; Joyce, 1989, 2002; Ellington and Szostak, 1990; Cech, 1993; Sievers and von Kiedrowski, 1994; Johnston et al., 2001; Steitz and Moore, 2003; Hughes et al., 2004). Evidence supporting the RNA world theory comes from several sources, including the discovery that RNA forms the catalytic core of the ribosome (Nissen et al., 2000) and the discovery of RNA molecules that catalyze nucleic acid chemistry (Wilson and Szostak, 1999). A particularly noteworthy RNA molecule, evolved from a ligase ribozyme, catalyzes template-directed RNA polymerization, demonstrating a step toward a replication enzyme (Bartel and Szostak, 1993; Johnston et al., 2001). In a recent experiment, Lincoln and Joyce (2009) constructed paired RNA catalysts that can synthesize copies of each other when supplied with the right chemical precursors. This system resembles the “hypercycle” of Eigen and Schuster (1977, 1979), a cooperative network of enzymes that may have been one of the first forms of life. Here, we focus on a relatively early stage of life in which templating polymers (e.g., nucleic acids) existed but enzymes had not necessarily yet evolved.
One difficulty in the origin of life is the problem of combinatorial complexity. Very short RNA sequences generally do not have catalytic activity because they are not long enough to fold into active structures. Short ribozymes, for example, are about 30 bases long (Lee and Suga, 2001; Pan and Uhlenbeck, 1992). Hence, if (1) a molecule must be relatively long to have enzymatic activity and (2) most long molecules do not act as enzymes (Carothers and Szostak, 2006), then the probability that random polymerization will lead to a ribozyme is exceedingly small. One possible resolution to this problem is to somehow bias the formation of RNA sequences toward active sequences. For example, catalysis of RNA polymerization by montmorillonite clay appears to promote some lineages while suppressing others (Miyakawa and Ferris, 2003; Ferris et al., 2004). Such biases can dramatically reduce the effective size of sequence space. We investigated selection by differential rates of production (“selection before replication”) previously (Nowak and Ohtsuki, 2008). However, this mechanism does not necessarily bias the sequence pool toward active replicators. Another way to increase the chances of producing ribozymes is to bias the length distribution toward long sequences. Here we investigate how such a bias might arise.
This paper is organized as follows. In Section 2, we introduce our model. In Section 3, we present simulation results for a special case. In Section 4, we compute the first crucial quantities, the basic reproductive ratio of templates and the discriminant. In Sections 5, we generalize our model to allow for templates of multiple lengths, introducing the notions of subcritical life, the chain amplification ratio, and the cumulative basic reproductive ratio. In Section 6, we draw some analogies to infection dynamics. Finally, in Section 7, we summarize our findings.
2 The model
2.1 Basic setup
We consider “unary” sequences, Ni. For example, N3 = N N N, where N represents an arbitrary monomer (e.g., the ribonucleotide A, C, G, or U). Here we focus on the efficiency of replication and the equilibrium length distribution, rather than mutation and diversity, so for simplicity we do not distinguish between different types of monomers. Sequences grow by incorporating activated monomers, *N, according to the chemical reaction Ni + *N → Ni+1.
Let x denote the abundance of activated monomers and yi the abundance of sequences of length i. The dynamics of these quantities are described by the following differential equations:
(1) |
Activated monomers enter the system at the constant rate λ. They are deactivated (e.g., by hydrolysis) at rate α. When a sequence of length i and an activated monomer meet, the sequence may be extended to form a new one of length i + 1. Extension occurs at rate axyi. All sequences are removed from the system at rate d. This system is what we call prelife (Nowak and Ohtsuki, 2008; Manapat et al., 2009; Ohtsuki and Nowak, 2009).
Now we introduce replication through template-directed synthesis (Figure 1). For simplicity, we first assume that only sequences of a particular length, n, can be templates. (Later, we consider the more realistic scenario in which all sequences with length between a minimum and maximum are templates.) Replication occurs when a primer of length k binds to a template so that their ends are aligned. The primer length, k, might be determined by salt concentrations, temperature, etc. Template-directed elongation occurs until the primer has been extended to length n. The strands then dissociate. The product sequences can then participate in prelife or replication reactions. For a given primer length k and template length n, the dynamics of the system are described by the following system of differential equations:
(2) |
The abundance of activated monomers is x and the abundance of sequences of length i is yi. Abundances of sequences other than primers and templates obey the prelife equations (1). The binding of a primer and template to form a reactive complex occurs at rate B. The variables zi, where k ≤ i ≤ n, denote the abundances of double strands: zk is the abundance of pairs in which one strand has length k and the other length n (formed after a primer binds to a template), zk+1 is the abundance of pairs in which one strand has length k + 1 and the other length n (formed after one elongation step has occurred), and so forth. The parameter β is the template-directed elongation rate. The double-stranded complex dissociates at rate D. Such dissociation could be induced by thermal denaturation, which itself may be driven, for instance, by diurnal cycling. When B = 0, (2) reduces to the prelife system (1).
2.2 Model parameters and model protocells
Although prebiotic parameter values are unknown, several relevant rates have been determined in an experimental model system. Let us consider a model protocell consisting of a vesicle encapsulating nucleic acid sequences that replicate by template-directed synthesis (Mansy et al., 2008). In this case, the rate of input of activated monomers (λ) represents the rate of permeation of activated mononucleotides into the protocell. The equilibration half-time for a commonly used reactive monomer (5′-phosphorimidazolides) across model membranes based on fatty acids is on the order of seconds to hours, depending on the temperature (Mansy and Szostak, 2008; Mansy et al., 2008). A typical experimental concentration of activated monomers is on the order of 5–50 mM (Kanavarioti and White, 1987; Kozlov and Orgel, 2000; Mansy et al., 2008). Thus λ would be on the order of 10−3 to 101 M/hr in this system. We assume that monomers generated by prebiotic chemistry have similar properties so that the resulting sequences have similar binding, dissociation, and polymerization rates. For example, we do not consider the consequence of using monomers of different chirality (D-and L-nucleotides). Although this is a simplification, previous work has demonstrated physical and chemical mechanisms for obtaining high enantiomeric excess from a racemic mixture (Blackmond, 2004; Klussmann et al., 2006; Viedma et al., 2008)
Inactivation of phosphorimidazolide monomers during experiments is due to hydrolysis that yields the nucleotide monophosphate. The half-life of these monomers under template-directed polymerization conditions has been measured to be between 7 and 150 hours, depending on reaction conditions (Kanavarioti and White, 1987; Rajamani and Chen, 2009). Thus α would be roughly 0.005 to 0.1 hr−1.
The presence of a template substantially enhances polymerization rates (Orgel, 2004). This effect was measured for activated ribonucleotides (Kanavarioti and White, 1987). In these experiments, the rate constant of non-templated polymerization was found to be 0.09 M−1 hr−1 when considering 3′-5′ linkages (0.7 M−1 hr−1 when including formation of 2′-5′ linkages). Addition of a template increased this rate constant to 13 M−1 hr−1 (3′-5′ linkages alone) or 14.9 M−1 hr−1 (2′-5′ and 3′-5′ linkages). A similar template effect was found in a study of RNA polymerization catalyzed by Pb2+ and Zn2+, in which the longest products of non-templated polymerization were trimers, but templated polymerization yielded products 30–40 bases long (Sawai and Orgel, 1975). More highly activated chemistries could increase these rate constants. Vogel et al. (2005) observed a rate of 40 M−1 hr−1 for templated polymerization, for example. In terms of the parameters of our model, these experiments suggest that the non-templated polymerization rate a would be on the order of 0.1 to 1 M−1 hr−1, and the template-directed elongation rate β would be on the order of 10 to 100 M−1 hr−1.
Annealing a primer to a template is generally a fast process compared with the other reactions in the system. For example, one measurement of the second order rate constant for annealing 20 bases is around 104 M−1 hr−1 (Gartner et al., 2002). However, this rate depends on the temperature and the length of the annealing segment and can be very sensitive to buffer conditions, particularly ionic strength (Eun, 1996). A typical reaction for template-directed synthesis might allow a few minutes for annealing. Thus, the primer-template binding rate B is likely to be among the highest rate constants in the model, but its value may lie in a relatively large range. On the other hand, strand dissociation may be on the order of once per day (diurnal thermocycling), suggesting a value of D ≈ 0.04 hr−1, although faster cycling could occur in thermal convection cells (Krishnan et al., 2002).
2.3 Model assumptions
For the purposes of simulation, it is useful to transform the infinite system (2) into a finite system. To do this, we introduce a new variable w,
(3) |
representing the total abundance of all sequences of length greater than n. Then
(4) |
The complete system (2) can then be simulated with 2n − k + 3 differential equations. Extensive numerical simulations suggest that the equilibrium of (2) is globally stable.
In our model, the lengths of primers and templates are constrained in two ways. First, for the sake of analytic tractability, we assume that only sequences of a particular length, k, can act as primers. This simplification is justified because (1) a primer has a minimum length required for annealing, which depends on the environmental conditions, and (2) long primers are relatively rare (i.e., their abundance decreases exponentially with length in prelife; see below). We relaxed this assumption in numerical simulations in which sequences of many lengths acted as primers and obtained qualitatively similar results. Second, we assume that only sequences with lengths in a certain range can act as templates. We consider two versions of this assumption. We begin by doing our calculations for the simplified scenario of a single template length. However, our main focus is the realistic scenario in which templates have lengths in some range. The minimum length of a template is determined by the annealing length. For example, the annealing length—and thus the minimum possible primer length—is approximately 3 when the system is at less than 10 degrees C. The maximum length is determined by the relative rates of template-directed polymerization and of thermocycling-induced strand dissociation. If template-directed polymerization proceeds at one base per hour (Kanavarioti and White, 1987; Vogel et al., 2005), and thermocycling results in one dissociation event per day, then a typical template’s length will be somewhere between 20 and 30 bases. For sequences with lengths in the permissible range for templates, replication is generally completed before strands dissociate.
By studying unary sequences, we have ignored the effects of complementary base pairing. However, we can interpret our model in another way. Suppose we have a binary alphabet for our sequences, so a sequence is a string of 0’s and 1’s. We consider a particular polymerization lineage. For example, 0 → 01 → 011 → 0110 → · · ·. We then let xi denote the sum of the abundance of the sequence of length i in this lineage and the abundance of its complement. The zi’s are interpreted analogously. In this way, the results of our analysis below also describe the effect that replication with complementary base pairing has on the length distribution of sequences in a particular lineage.
3 Templates of a single length: numerical simulations
Figure 2(a) shows the prelife dynamics of sequences of lengths 1 to 30. There is no template-directed synthesis. All initial abundances are zero. At t = 0, activated monomers begin to enter the system. Some are hydrolyzed to form sequences of length 1. Longer and longer sequences are built as activated monomers are appended to the ends of existing sequences. At equilibrium, longer sequences are exponentially less common than shorter sequences.
Figure 2(b) shows the corresponding dynamics when sequences of length 30 (red) replicate using length-3 primers (dark blue). As before, all abundances are initially zero. Prelife generates primers and templates. In this example, the parameter values allow only slow replication, corresponding to a scenario in which template-directed polymerization is not much more efficient than non-templated polymerization. Many templates get trapped in the double-stranded state, and at equilibrium sequences of length 30 are less abundant than they are in the absence of replication.
In contrast, for Figure 2(c), the template-directed elongation rate β has been increased by an order of magnitude. Now the relative rate of template-directed to non-templated polymerization is in the range of experimentally observed values for an RNA-based system (Kanavarioti and White, 1987). At t ≈ 10, the abundance of templates begins to increase sharply and the abundance of primers begins to decrease. A chain reaction has started: the more templates there are, the more primers are consumed in replication reactions. At equilibrium, the abundance of length-30 sequences is almost ten times its prelife value.
Figure 3 shows the equilibrium abundances of sequences of lengths 1 to 40, as functions of the primer-template binding rate B, when sequences of length 30 (red) replicate using length-3 primers (dark blue). As B varies, all other parameters are held constant. For small values of B, the equilibrium abundance of length-30 sequences remains close to its prelife value. Since primer-template binding is weak, double strands are slow to form and replication does not lead to a significant increase in the abundance of templates. For large values of B, the equilibrium abundance of templates is almost an order of magnitude greater than it is when B is 0. A high binding rate means that templates and primers associate rapidly to form double strands. Other factors, such as the rate at which double strands dissociate and the availability of primer and template precursors, then become rate-limiting. This is the reason for the saturation we observe at high B. As B increases through intermediate values, there is a dramatic increase in the equilibrium abundance of templates. The abundance is most sensitive to changes in B when B = 20.89.
We observe this transition from prelife to life, which is characterized by a high abundance of long replicating sequences, when other parameters are varied as well. For instance, as Figure 2 suggests, when β increases as all other parameters are kept constant, there is a critical range during which the equilibrium abundance of templates increases most rapidly. The same is true as D is increased. For fixed primer length k and template length n, the model parameter space—λ, α, a, d, B, D, and β take values in [0, ∞)7—consists of two regions. When the model parameters are in one region, the equilibrium structure is almost the same as for prelife. In the other region, template-directed polymerization dominates. The two regions are separated by a critical surface. As the parameters approach and cross this surface, there is a transition from prelife to life.
4 The critical surface
4.1 Calculation of the surface
Consider the following scenario. We start with the system (2), but the primer-template binding rate is zero and so no replication is possible. The system could be in a high temperature environment, for example. Sequence abundances settle to their prelife equilibrium values. Now the system’s environment cools, so binding of primers and templates can occur and replication begins. If the reaction rates cause templates to be more abundant at the new equilibrium than they were before replication was possible, then we are “above” the critical surface. For this to happen, the abundance of templates must increase from its prelife equilibrium value. If this increase is insignificant, then we are “below” the surface. Although replication is possible, the system essentially remains very close to its prelife equilibrium. Hence, we will find the critical surface by examining how the abundance of templates deviates from prelife equilibrium after double-strand formation becomes possible.
We assume that the template-directed elongation rate, β, is large enough to produce at least some complete copies before thermocycling results in strand dissociation. (This requirement motivated our restrictions on possible template length.) In this case, there is a separation of time scales, and we can make the steady-state assumption that żi = 0 for all i. Intuitively, if the elongation reactions are very fast, then the zi adjust very quickly to the slower changes in x and the yj.
At steady-state, we have
(5) |
It follows that
(6) |
Equation (6) describes the relationship among the abundances x, yk, yn, and zn at steady-state. The dynamical change of the template abundance is given by
(7) |
We evaluate (7) when x, yk, yn−1, and yn are at their prelife equilibrium values. Longer sequences are exponentially less common than shorter sequences at prelife equilibrium, which is given by
(8) |
(A * denotes a quantity’s prelife equilibrium value.) Thus, for large n we can make the approximation . After substituting (6), we find that the initial growth of templates at prelife equilibrium is
(9) |
The first two terms in brackets give the removal rate and the third and fourth terms the “production,” or replication, rate. The templates’ abundance will increase significantly when the production terms are larger than the removal terms. Let
(10) |
If Rn > 1, the production terms are larger than the removal terms, and the abundance of templates grows significantly. If Rn < 1, the removal terms are larger than the production terms, and the abundance of templates remains low, close to its prelife value. Thus the critical surface is Rn = 1. Note that we obtain a different surface for different values of the primer length k and the template length n.
Figure 2 shows how the abundance of sequences of length 30 (red) evolves in time for different parameter values. Figure 2(a) shows the progression towards prelife equilibrium. For Figure 2(b), sequences of length 30 replicate. Their equilibrium abundance is less than it is in prelife. This is expected since R30 = 0.50, so the parameters that produce the dynamics lie below the critical surface. On the other hand, for Figure 2(c), R30 = 5.00. The parameters are far above the critical surface, and the equilibrium abundance of templates is almost ten times its prelife value.
Figure 3 shows the equilibrium abundance of length-30 templates, in red, as a function of the primer-template binding rate B. The broken vertical line is the critical boundary R30 = 1. The transition from low abundance to high abundance occurs as this boundary is crossed. The red arrow indicates the point at which the equilibrium abundance of templates is changing fastest as a function of B.
We can interpret Rn as a basic reproductive ratio (Anderson and May, 1979; May and Anderson, 1979; Nowak and May, 2000; Nowak, 2006). At prelife equilibrium, the average lifetime of a sequence is 1/(ax* + d). Once replication begins, the rate at which a template is copied is
(11) |
By multiplying this rate with the average lifetime, we obtain the basic reproductive ratio given by (10). Hence Rn is the expected number of templates that arise as copies of a single template, during its lifetime, at the onset of replication. Rn is an increasing, linear function of B—as more double strands are formed, more copies of the template are produced—and an increasing, saturating function of D (the dissociation rate) and β (the template-directed polymerization rate).
4.2 The discriminant
Let
(12) |
Observe that if Δ < 0, then Rn < 0. In particular, if Δ < 0, then Rn will never be greater than 1, regardless of how large B is. We call Δ the “discriminant.” The sign of the discriminant indicates whether selection of the templates is possible, regardless of the value of B.
We can write the condition Δ < 0 as
(13) |
If we take d to be very small, then we can make the approximation
(14) |
Hence we can rewrite (13) as
(15) |
This says that if the average polymerization time (the time it takes to extend a primer to a full copy of the template) is too long, then the templates can never be selected. This is reminiscent of the “error threshold” of quasispecies theory if we rewrite the condition as
(16) |
where
(17) |
If the average time it takes to add one base (along the template) is greater than the inverse of the template length, then selection for the template is impossible, regardless of how fast double strands are formed. This limit leads naturally to our main scenario, in which the upper limit of template length is determined by the number of monomers that can be added before the next thermocycle.
5 Templates of many lengths
5.1 Reversals
For simplicity, we formulated the system (2) so that only sequences of length n can be templates, but we can extend the model so that all sequences with lengths in a certain range can replicate. We assume that the binding (B), dissociation (D), and elongation (β) rates are independent of a template’s length, and we continue to assume that primers have length k. Rn decreases as n increases because of the [βx*/(βx* + d)]n−k term in (10). This means shorter templates have larger basic reproductive ratios than longer templates. Fewer monomers need to be polymerized to make a copy of a shorter template. Hence, the probability that a double-stranded intermediate is degraded during the replication process is smaller for shorter sequences.
Suppose all sequences with lengths between n1 and n2 can be templates. If n1 ≤ n ≤ n2, then a sequence of length n can arise in one of two ways: (1) it can be created as a direct copy of an already-existing sequence of length n (replication), or (2) it can be formed when a monomer is appended to the end of a sequence of length n − 1 (prelife extension). This coupling of templates by prelife extension can result in “reversals” in the length distribution in the sense that slower replicating, longer sequences become more abundant than faster replicating, shorter sequences (Manapat et al., 2009).
Figure 4 shows that the prelife equilibrium structure remains intact when B is small. In Figure 4(a), n1 = 25 and n2 = 30. In Figure 4(b), n1 = 20 and n2 = 30. In Figure 4(c), n1 = 4 and n2 = 30. Longer sequences are less abundant than shorter ones. This is expected since shorter sequences have larger basic reproductive ratios than longer sequences. As B increases, Ri increases for each i between n1 and n2. But it remains the case that Ri > Rj if i < j. When B is sufficiently large, however, the equilibrium abundance of length-j templates can be higher than that of length-i templates. When all templates are being replicated quickly, longer templates benefit from the fact shorter templates can be extended into longer ones by prelife polymerization. Hence we obtain reversals in equilibrium abundances such as the ones between B = 1 and B = 2 in Figures 4(a) and 4(b) and B = 10 and B = 11 in Figure 4(c).
5.2 Subcritical life
When there are multiple templates linked together by prelife extension, we observe a phenomenon that we call “subcritical life.” Subcritical life is characterized by the existence of templates with Rn < 1 that nevertheless attain equilibrium abundances significantly greater than in prelife. In other words, there are templates for which the derivative of their equilibrium abundance, as a function of the binding rate B, is maximized at a value of B such that Rn < 1. Let ȳi(B) denote the equilibrium abundance of templates of length i as a function of B (all other parameters are held constant).
In Figure 3, only sequences of length 30 replicate, and the derivative ∂ȳ30/∂B is maximized when B = 20.89 and R30 = 1.06. In Figure 4(a), sequences of lengths 25 to 30 replicate. For large enough values of B, sequences of length 30 are most abundant. The derivative ∂ȳ30/∂B is now maximized when B = 6.92 and R30 = 0.35. In Figure 4(b), sequences of length 20 to 30 replicate, and ∂ȳ30/∂B is largest when B = 4.50 and R30 = 0.23.
Subcritical life has two interesting features. First, the value of B necessary to induce the transition from prelife to life decreases as the length of the chain of consecutive templates increases. Hence it is easier to select templates when there is a greater range of template lengths. For example, when there are six templates in the chain, the transition is steepest when B = 6.92 (Figure 4(a)). When there are eleven templates in the chain, the transition is steepest when B = 4.50 (Figure 4(b)). Second, the transition occurs at a value of B such that Rn < 1 not just for the template of maximal length but in fact for templates of all lengths in the chain. Subcritical life occurs because the abundance of the longest template is amplified not only by the replication of those sequences but also by the replication (and subsequent extension) of upstream sequences. Thus prelife can “jump-start” life.
We can now ask for a general condition that predicts the onset of subcritical life. When there are templates of just one length, this condition should reduce to Rn = 1. When sequences of many lengths can be templates, the condition should reflect the effects of prelife extension appropriately. We propose two candidates: the chain amplification ratio and the cumulative basic reproductive ratio.
5.3 The chain amplification ratio
Our first candidate is a quantity that reflects whether long sequences are amplified relative to short sequences. We begin with several definitions. A first-generation sequence of length n is one that is made by a prelife reaction (i.e., not by the copying of another sequence of length n). For example, a sequence that is obtained by extending one of length n − 1 is a first-generation sequence. A second-generation sequence is one that is a direct copy of a first-generation sequence. Consider a replicating sequence of length n. The basic reproductive ratio, Rn, is the expected number of second-generation sequences of length n that are obtained from a single sequence of length n at the onset of replication.
Suppose sequences of length n and n + 1 replicate. What is the expected number of second-generation sequences of length n + 1 that are obtained from a single sequence of length n (at the onset of replication)? We denote this quantity by Rn,n+1 and compute it as follows. Consider the initial sequence of length n. It will undergo one of three processes. (1) It can replicate at rate
(18) |
Each of the two resulting sequences will then yield Rn,n+1 second-generation sequences of length n + 1. (2) It can be lost through removal at rate d, yielding no sequences of length n + 1. Or (3) it can be extended to form a sequence of length n + 1 at rate ax*. From the extended sequence, we eventually obtain Rn+1 second-generation sequences of length n. The probability of each of these events is proportional to the corresponding rate constant. Thus,
(19) |
from which we obtain
(20) |
Since Rn = rn/(d + ax*), we can rewrite this as
(21) |
Suppose now that sequences of length n1 to n2 replicate. We define the chain amplification ratio, which we denote by RA, to be the expected number of second-generation sequences of length n2 obtained from a single sequence of length n1. If RA is greater than one, short sequences lead to more and more copies of long sequences. A generalization of (21) yields
(22) |
Figure 5(a) gives an example of the calculation of RA when sequences of length 3 and 4 replicate. We start with a single template of length 3. This sequence replicates twice. One of the copies replicates again but then is lost (due to removal). There are three remaining sequences of length 3, each of which is extended to form a sequence of length 4. Thus there are three first-generation sequences of length 4. The offspring of these sequences—of which there are five—form the second generation. Hence RA = 5 in this example, if the numbers are interpreted as averages.
5.4 The cumulative basic reproductive ratio
Our second candidate is a quantity that reflects the number of copies made from a particular template throughout its entire lifetime in the system, even as it undergoes non-templated polymerization. We remain in the situation in which sequences of length n1 to n2 replicate. Suppose we tag a particular sequence of length n1 just as replication begins. This is analogous to radioactive labeling of the 5′ end of an RNA sequence. Since extension occurs at the 3′ end, the tag is preserved as the sequence is extended, but copies of the RNA sequence do not have the tag. We can ask how many direct copies are made of the tagged sequence, even as it is extended to form longer and longer sequences. We call this quantity the cumulative basic reproductive ratio and denote it by RC.
Consider first the case in which there are templates of just two lengths, so n2 = n1 + 1. The initial sequence of length n1 will produce Rn1 direct copies before it is lost. When a sequence is lost, it is either removed completely, with probability d/(d + ax*), or extended by non-templated polymerization, with probability ax*/(d + ax*). If it is extended, an additional Rn1 direct copies are made of the longer sequence. Hence,
(23) |
Generalizing to arbitrary n2 > n1, we have
(24) |
Figure 5(b) gives an example of the calculation of RC when sequences of lengths 3 to 4 are templates. We start with a template of length 3. Two copies of this length three sequence are produced. The tagged sequence is then extended, and three copies are made of the extension. In total, the sequence is copied five times, so RC = 5, if the numbers denote averages.
In Figures 4, the broken red line indicates where RA = 1 and the broken magenta line where RC = 1. The red arrow indicates where the abundance of the longest templates is increasing fastest. We see that subcritical life begins close to RA = 1 and RC = 1. Thus, these quantities can be used to predict when the transition to subcritical life occurs. However, an analytical justification of this fact appears to be difficult.
6 The analogy with infection dynamics
The equations of our model suggest that life, characterized by highly abundant long sequences, can be viewed as an infection of prelife. Templates infect primers by turning them into other templates. But a primer can also be extended, by prelife, to form a template. This “spontaneous generation of infection” results in an equilibrium population structure that is much different from the structure when there is no “mutation” from uninfected hosts to infected hosts.
Bonhoeffer and Nowak (1994) study the following scenario. There are n types of infection, linearly ordered so that a host infected with type i can, by mutation of the pathogen, become a host infected by type i + 1. Pathogen i has basic reproductive ratio Ri. Let denote the equilibrium abundance of hosts infected by type i. They find that the equilibrium population is characterized as follows. If Ri < 1 for all i, then for all i. Suppose, on the other hand, that Rj > 1 for at least one j. Then for all i ≥ i0 and for all j < i0, where i0 is such that Ri0 > Rk for all k ≠ i0. In other words, if at least one of the pathogens has a basic reproductive ratio greater than one, then it is precisely the types that are downstream of the one with the highest basic reproductive ratio that survive at equilibrium.
Suppose now that there is spontaneous generation of infection. This means that uninfected hosts can spontaneously acquire an infection by type 1 (without having to come into contact with a host infected by type 1). It no longer makes sense to ask when the all-zeros equilibrium is stable. Since there is mutation from uninfected hosts, for all i. We can, however, ask the following. Suppose the transmissibility of all the pathogens is zero. There is a natural equilibrium in which , for all i, due to mutation. How large do the transmissibilities have to be for the to be much greater than they are in the pre-transmission equilibrium?
We can frame this question in the context of our model (2). The system starts at prelife equilibrium. When template-directed synthesis becomes possible, templates can “infect” primers. What must be true of the parameters for the abundance of templates to increase significantly from the initial prelife values? We find that RA = 1 and RC = 1 are the crucial criteria in this case.
7 Conclusion
We have introduced a model of template-directed synthesis. Activated monomers enter the system and polymerize in a template-independent (slow) or template-dependent (fast) mechanism. For the purpose of analyzing this model, we have greatly simplified the scenario by neglecting the possibility of template-directed ligation (James and Ellington, 1997; Gao and Orgel, 2000), of a primer binding in the middle of a template to produce a shorter sequence, and of the breaking up of sequences to produce shorter sequences. Our assumptions correspond to an experimental scenario in which a defined primer sequence is used, activated polymers are scarce relative to activated monomers, and hydrolysis of the nucleic acid is relatively slow (Mansy et al., 2008). We also neglected the role of sequence diversity in order to focus on the length distribution.
We found that for some values of the kinetic rate constants, the equilibrium abundance of long templates remains low. This corresponds to a system with poor templating ability, where β is relatively low compared to a. For others, the equilibrium shifts rapidly toward long templates. A system with good templating ability, like RNA, would be characterized by a relatively large β. This shift in length distribution could be particularly important during the origin of life since a bias toward long sequences could increase the frequency of ribozymes in a protoell. We computed the critical surface in parameter space separating these two regions. In the case of a single template length, the surface is precisely the locus of points where the basic reproductive ratio equals one.
If there is a chain of templates, each being extended by non-templated polymerization into the next, long sequences can dominate even before the critical surface is reached. Even if each template is unviable on its own (i.e., has an Rn less than one), the equilibrium abundance of the longest templates in the chain can still be much larger than the equilibrium abundance of those sequences in prelife. This is not the result of Darwinian selection—all the sequences have basic reproductive ratios less than one and the differences in replication rate are minimal—but rather an effect of the interplay between chemical polymerization and replication. We have proposed two quantities, the chain amplification ratio and the cumulative basic reproductive ratio. These quantities are near one when then the transition from low to high abundance occurs for the longest templates. Prelife combines replication potentials: it links weak replicators together to jump-start life.
Acknowledgments
We thank Tibor Antal, Yoh Iwasa, David Rand, and Daniel Rosenbloom for enlightening discussions. M. L. M. and M. A. N. received support from the John Templeton Foundation, the NSF/NIH joint program in mathematical biology (NIH grant R01GM078986), the Bill and Melinda Gates Foundation (Grand Challenges grant 37874), and J. Epstein. I. A. C. received support from the FAS Center for Systems Biology at Harvard University and NIH grant P50GM068763.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Allen WV, Ponnamperuma C. A possible prebiotic synthesis of monocarboxylic acids. Curr Mod Biol. 1967;1:24–28. doi: 10.1016/0303-2647(67)90017-2. [DOI] [PubMed] [Google Scholar]
- Anderson RM, May RM. Population biology of infectious diseases: part I. Nature. 1979;280:361–367. doi: 10.1038/280361a0. [DOI] [PubMed] [Google Scholar]
- Bartel DP, Szostak JW. Isolation of new ribozymes from a large pool of random sequences. Science. 1993;261:1411–1418. doi: 10.1126/science.7690155. [DOI] [PubMed] [Google Scholar]
- Benner SA, Caraco MD, Thomson JM, Gaucher EA. Planetary biology—paleontological, geological, and molecular histories of life. Science. 2002;296:864–868. doi: 10.1126/science.1069863. [DOI] [PubMed] [Google Scholar]
- Benner SA, Ricardo A. Planetary systems biology. Mol Cell. 2005;17:471–472. doi: 10.1016/j.molcel.2005.02.005. [DOI] [PubMed] [Google Scholar]
- Blackmond DG. Asymmetric autocatalysis and its implications for the origin of homochirality. Proc Nat Acad Sci USA. 2004;101:5732–5736. doi: 10.1073/pnas.0308363101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonhoeffer S, Nowak MA. Mutation and the evolution of virulence. Proc R Soc Lond B. 1994;258:133–140. [Google Scholar]
- Carothers JM, Szostak JW. In vitro selection of functional oligonucleotides and the origins of biochemical activity. In: Klussmann S, editor. The aptamer handbook: functional oligonucleotides and their applications. Wiley; Weinheim: 2006. pp. 3–28. [Google Scholar]
- Cech TR. The efficiency and versatility of catalytic RNA: implications for an RNA world. Gene. 1993;135:33–36. doi: 10.1016/0378-1119(93)90046-6. [DOI] [PubMed] [Google Scholar]
- Chen IA, Hanczyc MN, Sazani PL, Szostak JW. Protocells: genetic polymers inside membrane vesicles. In: Gesteland RF, Cech TR, Atkins JF, editors. The RNA world. 3. Cold Spring Harbor Press; Woodbury: 2006. pp. 57–88. [Google Scholar]
- Crick FH. The origin of the genetic code. J Mol Biol. 1968;38:367–379. doi: 10.1016/0022-2836(68)90392-6. [DOI] [PubMed] [Google Scholar]
- de Duve C. The onset of selection. Nature. 2005;433:581–582. doi: 10.1038/433581a. [DOI] [PubMed] [Google Scholar]
- de Duve C. Chemistry and selection. Chem Biodivers. 2007;4:574–583. doi: 10.1002/cbdv.200790051. [DOI] [PubMed] [Google Scholar]
- Dyson FJ. A model for the origin of life. J Mol Evol. 1982;18:344–350. doi: 10.1007/BF01733901. [DOI] [PubMed] [Google Scholar]
- Dyson FJ. Origins of life. Cambridge University Press; Cambridge and New York: 1999. [Google Scholar]
- Eigen M. Molecular self-organization and the early stages of evolution. Q Rev Biophys. 1971;4:149–212. [PubMed] [Google Scholar]
- Eigen M, Schuster P. The hypercycle, a principle of natural self-organization. Part A: emergence of the hypercycle. Naturwissenschaften. 1977;64:541–565. doi: 10.1007/BF00450633. [DOI] [PubMed] [Google Scholar]
- Eigen M, Schuster P. The hypercycle, a principle of natural self-organization. Springer-Verlag; Berlin and New York: 1979. [DOI] [PubMed] [Google Scholar]
- Eigen M, Schuster P. Stages of emerging life—five principles of early organization. J Mol Evol. 1982;19:47–61. doi: 10.1007/BF02100223. [DOI] [PubMed] [Google Scholar]
- Ellington AD, Szostak JW. In vitro selection of RNA molecules that bind specific ligands. Nature. 1990;346:818–822. doi: 10.1038/346818a0. [DOI] [PubMed] [Google Scholar]
- Eun H-M. Enzymology primer for recombinant DNA technology. 1. Academic Press; San Diego: 1996. [Google Scholar]
- Farmer JD, Kauffman SA, Packard NH. Autocatalytic replication of polymers. Physica D. 1986;22:50–67. [Google Scholar]
- Ferris JP, Joshi PC, Wang KJ, Miyakawa S, Huang W. Catalysis in prebiotic chemistry: application to the synthesis of RNA oligomers. Adv Space Res. 2004;33:100–105. [Google Scholar]
- Fontana W, Schuster P. Continuity in evolution: on the nature of transitions. Science. 1998;280:1451–1455. doi: 10.1126/science.280.5368.1451. [DOI] [PubMed] [Google Scholar]
- Gao K, Orgel LE. Polyphosphorylation and non-enzymatic template-directed ligation of oligonucleotides. Origins Life Evol B. 2000;30:45–51. doi: 10.1023/a:1006683507864. [DOI] [PubMed] [Google Scholar]
- Gartner ZJ, Kanan MW, Liu DR. Expanding the reaction scope of DNA-templated synthesis. Agnew Chem. 2002;41:1796–1800. doi: 10.1002/1521-3773(20020517)41:10<1796::aid-anie1796>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]
- Hargreaves WR, Mulvihill S, Deamer DW. Synthesis of phospholipids and membranes in prebiotic conditions. Nature. 1977;266:78–80. doi: 10.1038/266078a0. [DOI] [PubMed] [Google Scholar]
- Hughes RA, Robertson MP, Ellington AD, Levy M. The importance of prebiotic chemistry in the RNA world. Curr Opin Chem Biol. 2004;8:629–633. doi: 10.1016/j.cbpa.2004.09.007. [DOI] [PubMed] [Google Scholar]
- James KD, Ellington AD. Surprising fidelity of template-directed chemical ligation of oligonucleotides. Chem Biol. 1997;4:595–605. doi: 10.1016/s1074-5521(97)90245-3. [DOI] [PubMed] [Google Scholar]
- Johnston WK, Unrau PJ, Lawrence MS, Glasner ME, Bartel DP. RNA-catalyzed RNA polymerization: accurate and general RNA-templated primer extension. Science. 2001;292:1319–1325. doi: 10.1126/science.1060786. [DOI] [PubMed] [Google Scholar]
- Joyce GF. Nonenzymatic template-directed synthesis of informational macromolecules. Cold Spring Harb Symp Quant Biol. 1987;52:41–51. doi: 10.1101/sqb.1987.052.01.008. [DOI] [PubMed] [Google Scholar]
- Joyce GF. RNA evolution and the origins of life. Nature. 1989;338:217–224. doi: 10.1038/338217a0. [DOI] [PubMed] [Google Scholar]
- Joyce GF. The antiquity of RNA-based evolution. Nature. 2002;418:214–221. doi: 10.1038/418214a. [DOI] [PubMed] [Google Scholar]
- Kanavarioti A, White DH. Kinetic analysis of the template effect in ribooligoguanylate elongation. Origins Life Evol B. 1987;17:333–349. doi: 10.1007/BF02386472. [DOI] [PubMed] [Google Scholar]
- Klussmann M, Iwamura H, Mathew SP, Wells DH, Pandya U, Armstrong A, Blackmond DG. Thermodynamic control of asymmetric amplification in amino acid catalysis. Nature. 2006;441:621–623. doi: 10.1038/nature04780. [DOI] [PubMed] [Google Scholar]
- Kozlov IA, Orgel LE. Nonenzymatic template-directed synthesis of RNA from monomers. Mol Biol. 2000;34:781–789. [PubMed] [Google Scholar]
- Krishnan M, Ugaz VM, Burns MA. PCR in a rayleigh-bénard convection cell. Science. 2002;298:793. doi: 10.1126/science.298.5594.793. [DOI] [PubMed] [Google Scholar]
- Kuppers B. Molecular theory of evolution: outline of a physico-chemical theory of the origin of life. Springer-Verlag; Berlin and New York: 1983. [Google Scholar]
- Lee N, Suga H. Essential roles of innersphere metal ions for the formation of the glutamine binding site in a bifunctional ribozyme. Biochemistry. 2001;40:13633–13643. doi: 10.1021/bi011103c. [DOI] [PubMed] [Google Scholar]
- Lifson S, Lifson H. A model of prebiotic replication: survival of the fittest versus extinction of the unfittest. J Theor Biol. 1999;199:425–433. doi: 10.1006/jtbi.1999.0969. [DOI] [PubMed] [Google Scholar]
- Lincoln TA, Joyce GF. Self-sustained replication of an RNA enzyme. Science. 2009;323:1229–1232. doi: 10.1126/science.1167856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lohrmann R, Bridson PK, Orgel LE. Efficient metal-ion catalyzed template-directed oligonucleotide synthesis. Science. 1980;208:1464–1465. doi: 10.1126/science.6247762. [DOI] [PubMed] [Google Scholar]
- Manapat M, Ohtsuki H, Bürger R, Nowak MA. Originator dynamics. J Theor Biol. 2009;256:586–595. doi: 10.1016/j.jtbi.2008.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mansy SS, Schrum JP, Krishamurthy M, Tobé S, Treco DA, Szostak JW. Template-directed synthesis of a genetic polymer in a model protocell. Nature. 2008;454:121–125. doi: 10.1038/nature07018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mansy SS, Szostak JW. Thermostability of model protocell membranes. Proc Nat Acad Sci USA. 2008;105:13351–13355. doi: 10.1073/pnas.0805086105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- May RM, Anderson RM. Population biology of infectious diseases: part II. Nature. 1979;280:455–461. doi: 10.1038/280455a0. [DOI] [PubMed] [Google Scholar]
- Miller SL. A production of amino acids under possible primitive earth conditions. Science. 1953;117:528–529. doi: 10.1126/science.117.3046.528. [DOI] [PubMed] [Google Scholar]
- Miller SL, Orgel LE. The origins of life on the earth. Prentice Hall; Englewood Cliffs: 1974. [Google Scholar]
- Miyakawa S, Ferris JP. Sequence- and regioselectivity in the montmorillonite-catalyzed synthesis of RNA. J Am Chem Soc. 2003;125:8202–8208. doi: 10.1021/ja034328e. [DOI] [PubMed] [Google Scholar]
- Ninio J, Orgel LE. Heteropolynucleotides as templates for non-enzymatic polymerizations. J Mol Evol. 1978;12:91–99. doi: 10.1007/BF01733260. [DOI] [PubMed] [Google Scholar]
- Nissen P, Hansen J, Ban N, Moore PB, Steitz TA. The structural basis of ribosome activity in peptide bond synthesis. Science. 2000;289:920–930. doi: 10.1126/science.289.5481.920. [DOI] [PubMed] [Google Scholar]
- Nowak MA. Evolutionary dynamics. Harvard University Press; Cambridge and London: 2006. [Google Scholar]
- Nowak MA, May RM. Virus dynamics. Oxford University Press; Oxford: 2000. [Google Scholar]
- Nowak MA, Ohtsuki H. Prevolutionary dynamics and the origin of evolution. Proc Nat Acad Sci USA. 2008;105:14924–14927. doi: 10.1073/pnas.0806714105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ohtsuki H, Nowak MA. Prelife catalysts and replicators. Proc R Soc B. 2009;276:3783–3790. doi: 10.1098/rspb.2009.1136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oparin AI. The origin of life. Dover; New York: 1953. [Google Scholar]
- Orgel LE. Evolution of the genetic apparatus. J Mol Biol. 1968;38:381–393. doi: 10.1016/0022-2836(68)90393-8. [DOI] [PubMed] [Google Scholar]
- Orgel LE. RNA catalysis and the origins of life. J Theor Biol. 1986;123:127–149. doi: 10.1016/s0022-5193(86)80149-7. [DOI] [PubMed] [Google Scholar]
- Orgel LE. Molecular replication. Nature. 1992;358:203–209. doi: 10.1038/358203a0. [DOI] [PubMed] [Google Scholar]
- Orgel LE. Prebiotic chemistry and the origin of the RNA world. Crit Rev Biochem Mol Biol. 2004;39:99–123. doi: 10.1080/10409230490460765. [DOI] [PubMed] [Google Scholar]
- Pan T, Uhlenbeck OC. In vitro selection of RNAs that undergo autolytic cleavage with lead(2+) Biochemistry. 1992;31:3887–3895. doi: 10.1021/bi00131a001. [DOI] [PubMed] [Google Scholar]
- Powner MW, Gerland B, Sutherland JD. Synthesis of activated pyrimidine ribonucleotides in prebiotically plausible conditions. Nature. 2009;459:239–242. doi: 10.1038/nature08013. [DOI] [PubMed] [Google Scholar]
- Rajamani S, Chen IA. 2009 Unpublished data. [Google Scholar]
- Rao M, Eichberg J, Oro J. Synthesis of phosphatidylcholine under possible primitive Earth conditions. J Mol Evol. 1982;18:196–202. doi: 10.1007/BF01733046. [DOI] [PubMed] [Google Scholar]
- Ricardo A, Carrigan MA, Olcott AN, Benner SA. Borate minerals stabilize ribose. Science. 2004;303:196. doi: 10.1126/science.1092464. [DOI] [PubMed] [Google Scholar]
- Rushdi AI, Simoneit BR. Lipid formation by aqueous Fischer-Tropsch-type synthesis over a temperature range of 100 to 400 degrees C. Origins Life Evol B. 2001;31:103–118. doi: 10.1023/a:1006702503954. [DOI] [PubMed] [Google Scholar]
- Sawai H, Orgel LE. Oligonucleotide synthesis catalyzed by the zinc(2+) ion. J Am Chem Soc. 1975;97:3532–3533. doi: 10.1021/ja00845a050. [DOI] [PubMed] [Google Scholar]
- Sievers D, von Kiedrowski G. Self-replication of complementary nucleotide-based oligomers. Nature. 1994;369:221–224. doi: 10.1038/369221a0. [DOI] [PubMed] [Google Scholar]
- Stein DL, Anderson PW. A model for the origin of biological catalysis. Proc Nat Acad Sci USA. 1984;81:1751–1753. doi: 10.1073/pnas.81.6.1751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steitz TA, Moore PB. RNA, the first macromolecular catalyst: the ribosome is a ribozyme. Trends Biochem Sci. 2003;28:411–4118. doi: 10.1016/S0968-0004(03)00169-5. [DOI] [PubMed] [Google Scholar]
- Szathmáry E, Demeter L. Group selection of early replicators and the origin of life. J Theor Biol. 1987;128:463–486. doi: 10.1016/s0022-5193(87)80191-1. [DOI] [PubMed] [Google Scholar]
- Szostak JW, Bartel DP, Luisi PL. Synthesizing life. Nature. 2001;409:387–390. doi: 10.1038/35053176. [DOI] [PubMed] [Google Scholar]
- Traulsen A, Nowak MA. Evolution of cooperation by multilevel selection. Proc Nat Acad Sci USA. 2006;103:10952–10955. doi: 10.1073/pnas.0602530103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Viedma C, Ortiz JE, de Torres T, Izumi T, Blackmond DG. Evolution of solid phase homochirality for a proteinogenic amino acid. J Am Chem Soc. 2008;130:15274–15275. doi: 10.1021/ja8074506. [DOI] [PubMed] [Google Scholar]
- Vogel SR, Deck C, Richert C. Accelerating chemical replication steps of RNA involving activated ribonucleotides and downstream-binding elements. Chem Commun. 2005:4922–4924. doi: 10.1039/b510775j. [DOI] [PubMed] [Google Scholar]
- Wachtershauser G. On the chemistry and evolution of the pioneer organism. Chem Biodivers. 2007;4:584–602. doi: 10.1002/cbdv.200790052. [DOI] [PubMed] [Google Scholar]
- Wilson DS, Szostak JW. In vitro selection of functional nucleic acids. Annu Rev Biochem. 1999;68:611–647. doi: 10.1146/annurev.biochem.68.1.611. [DOI] [PubMed] [Google Scholar]
- Woese C. The genetic code: the molecular basis for genetic expression. Harper and Row; New York: 1967. [Google Scholar]