Abstract
Previous study of eleven different in vitro-selected RNA aptamers that bind guanosine triphosphate (GTP) with Kds ranging from 8µM to 9nM showed that more information is required to specify the structures of the higher-affinity aptamers. We are interested in understanding how the more complex aptamers achieve higher affinities for the ligand. In vitro selection produces structural solutions to a functional problem that are are as simple as possible in terms of the information content needed to define them. It has long been assumed that the simplest way to improve the affinity of an aptamer is to increase the shape and functional group complementarity of the RNA binding pocket for the ligand. This argument underlies the hypothesis that selection for higher-affinity aptamers automatically leads to structures that bind more specifically to the target molecule. Here, we examined the binding specificities of the eleven GTP aptamers by carrying out competition binding studies with sixteen different chemical analogs of GTP. The aptamers have distinct patterns of specificity, implying that each RNA is a structurally-unique solution to the problem of GTP binding. However, these experiments failed to provide evidence that higher-affinity aptamers bind more specifically to GTP. We suggest that the simplest way to improve aptamer Kds may be to increase the stability of the RNA tertiary structure with additional intramolecular RNA-RNA interactions; increasingly-specific ligand binding may emerge only in response to direct selection for specificity.
Introduction
RNA aptamers are functional oligonucleotides that fold into distinct three dimensional shapes and bind target ligands1–3. In the laboratory, RNA aptamers can be generated by searching large combinatorial pools of random sequences for rare functional molecules using in vitro selection4. In vitro selection can yield a diverse array of distinct binding structures which vary significantly in their affinities for the ligand2,5,6. With respect to their frequency in sequence space (the ensemble of all possible sequences a particular length) the products of in vitro selection from random sequence pools tend to be the simplest, most abundant, structural solutions to a given problem7. For this reason, in vitro-selected aptamers are likely to utilize the least complex structural mechanisms possible to attain the binding affinities that they exhibit. Therefore, by defining the mechanistic and structural factors that underlie differences in affinity for a set of RNA aptamer structures from the same selection, it should be possible to uncover the simplest ways to improve binding.
It is perhaps easiest to imagine that tighter binding to a target molecule, measured by a lower apparent dissociation constant (Kd), emerges through improved shape and functional group complementarity between the ligand and the aptamer binding pocket3,8,9. If the binding pocket is rigid, a high degree of interaction and geometric complementarity with the ligand could also be expected to lead to high-specificity binding3,9, where specificity is defined as preferential binding to a target ligand compared to other molecules9,10. In fact, it has long been assumed that selection for higher-affinity binding automatically leads to aptamers that bind more specifically to a target molecule9. However, this view discounts the potential relevance of contributions to the free energy of binding from intramolecular contacts within the RNA itself, as opposed to intermolecular contacts between the RNA and the ligand10. To date, this issue has not been explicitly addressed through the analysis of specificity as a function of affinity for a set of aptamers selected against the same target.
The set of 11 different GTP-binding aptamers that we previously isolated5,6 provides an ideal testing ground for the affinity-specificity hypothesis (Figure 1, Table 1). The optimized versions of these aptamers vary greatly in sequence and affinity; the Kd for GTP of the weakest binder is 8 µM while the strongest binder in the set has a Kd for GTP of 9 nM. We used in vitro selection to determine the number of conserved nucleotides and base pairs needed for binding in each aptamer by identifying functional sequence variants of the original isolates6. From these data, the information content11–13 necessary to specify the aptamer structures in the conditions of the selection was calculated. We found that tighter-binding aptamers are larger, have more intricate secondary structures and are more informationally-complex than weaker-binding aptamers. On average, approximately 10 bits of additional information content, equivalent to specifying 5 more conserved positions or base pairs within an RNA, are required to define structures capable of 10-fold better binding. While this analysis proved that more nucleotide or base-pair positions must be specified in tighter-binding aptamers, there has not yet been a mechanistic or structural explanation for how the more informationally-complex aptamers accomplish higher-affinity binding6,14.
Figure 1.
GTP aptamer secondary structures. The secondary structure of each GTP aptamer used in this analysis is shown with invariant positions in the recognition loops outlined in red.
Table 1.
Aptamer Properties.
aptamer | Kd (nM)a | IC (bits) | Size (nt) | ΔG2° folding (−kcal mol−1)b |
||
---|---|---|---|---|---|---|
9-4 | 9±1 | 65.0±1.0 | 69 | 26 | 10 | |
Cl V | 17±4 | 55.0±1.0 | 68 | 31 | 9 | |
10-10 | 30±6 | 67.0±1.0 | 60 | 19 | 9 | |
Cl I | 76 ±3 | 45.0±0.5 | 41 | 11 | 11 | |
10–59 | 250 ±20 | 43.0±0.5 | 56 | 22 | 11 | |
9–12 | 300± 50 | 53.0±0.5 | 43 | 10 | 10 | |
10–24 | 300 ±50 | 44.0±1.5 | 55 | 17 | 12 | |
10-6 | 300 ±100 | 67.0±1.0 | 54 | 13 | 9 | |
Cl II | 400 ±200 | 40.0±1.0 | 30 | 6 | 12 | |
Cl IV | 900 ±200 | 33.0±1.0 | 43 | 10 | 9 | |
Cl III | 8000 ±1000 | 42.0±1.0 | 41 | 14 | 8 |
Apparent Kd and information content (IC) are from ref. 6. Other values were measured or calculated as described in the text.
ΔG2° folding: calculated free energy of secondary structure formation (−kcal mol−1).
Here we present a detailed biochemical characterization of the binding specificities of the set of 11 GTP aptamers. We determined the aptamer affinities for a set of 16 different nucleotide and nucleoside analogs of GTP in competition binding assays and compared these to the aptamer affinities for GTP. First, we find that each aptamer exhibits a unique pattern of ligand-binding specificity compared to the others in the set. Taken together with the fact that there are no sequence motifs common to more than one of the GTP aptamers, this study provides further evidence that each aptamer is a distinct structural solution to the problem of binding GTP with moderate to high-affinity6. Furthermore, contrary to expectations9, these data do not demonstrate that binding specificity is closely related to binding affinity; aptamers with higher affinities for GTP do not bind more specifically to GTP than aptamers with lower affinities for GTP. We infer that the more complex, higher-affinity aptamers do not make more intermolecular aptamer-ligand functional group interactions or have higher degrees of steric complementarity with the ligand. Instead, we propose that much of the observed6 informational cost of improving binding reflects the need to increase the number and quality of the contacts within the RNA itself, rather than between the RNA and the ligand.
Experimental Section
Binding Assays
The apparent Kds of the aptamers for the GTP analogs were measured in competition binding experiments using a spin filtration method15 as previously described5 in a buffer of 200 mM KCl, 10 mM potassium phosphate, 5 mM MgCl2, 0.1 mM EDTA, pH 6.2. RNA was prepared at a concentration of 4 times the Kd for GTP (final concentration = 2 × Kd) in 100 µl H2O, heated to 80°C for 5 min and then equilibrated overnight at room temperature. Next, 100 µl of 2× binding buffer containing 0.22 nM α-32P-GTP and varying concentrations of unlabeled analog were added. After equilibrating the samples over night at room temperature, they were transferred to the top chamber of a Microcon YM-30 spin filter. The tubes were centrifuged at 13,000 g for 5 sec to saturate the membrane. The filters were placed into new collection tubes and again spun at 13,000 g for 20 sec, allowing 25 µl of solution to flow through the membrane. This process is similar to equilibrium dialysis in that the solution that remains above the molecular-weight cutoff membrane contains free GTP ([L]f), free analog ([I]f), free RNA ([R]f), RNA-bound GTP ([RL]), and RNA-bound analog ([RI]); the filtrate contains only free GTP ([L]f) and free analog ([I]f). A 20 µl sample was removed from each side of the filter, and the radioactivity from the α 32P-GTP was measured by scintillation counting for 1 min. The radioactive counts were baseline-corrected to account for background retention in the upper chamber of the spin filter16. The fraction of labeled GTP bound to the aptamer, [RL]/[L]t (where the total concentration of GTP, [L]t = [RL]+[L]f), was normalized such that when the concentration of analog competitor was zero, [RL]/[L]t equaled 100%. The Kds were derived from a nonlinear least squares fit of the normalized fraction of labeled GTP bound to the aptamer versus the concentration of analog [I] using R: A Language and Environment for Statistical Computing version 2.0.0 for Linux (R Foundation for Statistical Computing, Vienna, Austria) (see Supporting methods).
Calculating Free Energy of Secondary Structure Formation
The free energy of aptamer secondary structure formation for the 11 optimized sequences was calculated using the program RNAeval from the ViennaRNA Package version 1.6 for Intel Linux17. The optimized sequences were folded at 37°C using the default settings with the secondary structure models shown in Figure 1.
To calculate the free energy of secondary structure formation predicted from sequence length, a set of 1000 random sequences was computer generated for each aptamer. The free energy of secondary structure formation for each sequence in the set was then calculated without specifying the secondary structure using the program RNAfold from the ViennaRNA package version 1.6. Sequences were folded at 37°C using the default settings. In the same manner, 1000 sets of 11 aptamer-length sequences were generated and folded using RNAfold to estimate the likelihood that all of the aptamers in the set would, by chance, have free energies of secondary structure formation more stable than the median values for same-length random sequences.
Correlation coefficients and regressions
The Spearman rank correlation coefficient (rs) was applied as a nonparametric test of association, since it does not require specific assumptions about data distribution and is insensitive to deviations from the normal distribution.18 95% confidence intervals were used for all rankings and significance was determined under the null hypothesis of no correlation with a one-tailed test, α = 0.05. P values were calculated from the test statistic as described19. To quantify the number of aptamer-analog pairs with log10 Kd (analog)/Kd (GTP) values at a given threshold, significance was determined with the Kds and standard deviations obtained from nonlinear least square fits of the analog binding data under the null hypothesis of no difference with a two-tailed test, α = 0.05 (Note: none of the major conclusions would be different if smaller α values had been used). Kendall's robust line-fit method is a nonparametric approach to curve fitting that does not require independent and normally-distributed data19. The slope is given by the median slope of all pairwise combinations of points. The intercept is the median intercept of all points computed with the median slope.
Results
Measuring aptamer specificity for GTP
In order to determine the extent to which the aptamers (Figure 1) bind specifically to GTP, we measured their affinities for two different sets of chemical analogs of GTP (Figure 2). Aptamer affinities for each analog were obtained from competition binding assays with radiolabeled 32P-GTP and unlabeled analog. Details of the assays are described in the Experimental Section and the supporting information.
Figure 2.
GTP functional groups and their importance for aptamer binding. The role of each functional group in binding was tested by comparison of analog and GTP binding affinities. The number of aptamers (out of 11 total) that suffered no more than a 100-fold loss in affinity to the analog (log10 Kd (Analog) / Kd (GTP) <2) is shown in parentheses next to each functional group. Functional groups on the nucleobase play a more significant role in aptamer binding than functional groups on the sugar and triphosphate.
The 10 analogs in the first set (Figure 2A) have bulky substituents, and were chosen to provide a way of screening for close approaches between the aptamer RNA and the GTP ligand. The set consists of six different methylated analogs, 8-amino-guanosine, and three analogs in which a larger sulfur atom replaces an oxygen atom: 6-thio GTP, GTP-α-S and GTP-γ-S. Such bulky analogs could have altered binding affinity compared to GTP for one, or more, of the following reasons. There may be drastic reductions in binding affinity (increases in Kd) if the aptamer cannot sterically accommodate the bulky substituent without radically restructuring the binding pocket. Relative to GTP, 3-methyl-guanosine and 7-methyl-guanosine have potential hydrogen bond acceptors that are blocked; the hydrogen-bonding ability of the sulfur-substituted analogs is also reduced20. Other analogs, 2'O-methyl-GTP, 3'O-methyl-GTP, 1-methyl-guanosine, and N2-methyl-2'-deoxy-GTP, have potential hydrogen bond donors that are eliminated. If the potential hydrogen bond donor or acceptor is utilized as such in the aptamer-ligand complex, its absence should have a negative effect on affinity. Finally, with the exception of the sulfur-containing analogs, the addition of the bulky substituents changes the hydrophobicity of the ligand, where the effect on the binding affinity could be either positive or negative21.
The 6 molecules in the second set of analogs (Figure 2B) are characterized by the removal or alteration of individual functional groups. These modifications include the loss of phosphate groups, sugar hydroxyls, or the exocyclic amine, or change of N7 in the nucleobase to CH. Here, a difference in binding affinity for one of these analogs compared to GTP captures the net energetic effect of changes in solvation, hydrogen bonding, and van der Waals and electrostatic interactions.
The apparent dissociation constant (Kd) for an aptamer-ligand complex describes the ratio of the concentration of the complex to the concentrations of free aptamer and free ligand at equilibrium. Explicitly, Kd=[RNA][Ligand]/[RNA:Ligand], where the equilibrium constant is directly related to the change in standard free energy, ΔG=−RT ln 1/Kd22. Specificity can be measured as the difference in the free energy of binding to one ligand (L) versus another (L')9,10 :
(1) |
Taking
(2) |
quantifies the magnitude by which an aptamer binds specifically to GTP compared to an analog. For example, the 10-10 aptamer Kd for GTP is 30±6 nM (Table 1), while its Kd for 2'-deoxy-GTP (6500±200 nM) is more than two orders of magnitude higher (log10 Kd (2'-deoxy-GTP) / Kd (GTP) = 2.3) (Table 2, Supporting Figure 1C). In other words, the 10-10 aptamer favors binding to GTP more than 200 fold compared to 2'-deoxy-GTP. Equivalently, the 10-10 specificity for GTP compared to 2'-deoxy-GTP could be expressed as ΔΔG = 3.2 kcal mol−1 at 25° C. In contrast, the 9-4 aptamer Kd for 2'-deoxy-GTP (10±3 nM) (Supporting Figure 1A) is the same as the 9-4 aptamer Kd for GTP (9±1 nM) (Table 1). Hence, for the 9-4 aptamer, log10 Kd (2'-deoxy-GTP) / Kd (GTP) and the corresponding ΔΔG are close to zero, indicating that the 9-4 aptamer does not bind specifically to GTP compared to 2'-deoxy-GTP.
Table 2.
Aptamer binding to GTP analogs bearing bulky substituents.
aptamer | GTPγ-S | GTP-α-S | 2'-O-me- GTP |
3'-O-me- GTP |
1-me- gua |
N2-me- 2'-deoxy- gua |
3-me- gua |
6-thio- GTP |
7-me- GTP |
8-amino- gua |
---|---|---|---|---|---|---|---|---|---|---|
9-4 | <1 | <2 | <1 | <2 | >>3 | >>3 | >>3 | >>3 | >>3 | <1 |
Cl V | <1 | <2 | <2 | <2 | >>3 | >>3 | >>3 | >>3 | >>3 | >2 |
10-10 | <1 | <2 | >2 | <2 | >>3 | >>3 | >>3 | <1 | <1 | >2 |
Cl I | <1 | <2 | <2 | <2 | >2 | >>3 | <2 | >>3 | >3 | <1 |
10–59 | <1 | <2 | <1 | <2 | >2 | >>3 | <1 | >2 | >>3 | <1 |
9–12 | <1 | <1 | <2 | <1 | >>3 | >>3 | >>3 | <2 | >>3 | >3 |
10–24 | <1 | <1 | <2 | <1 | >2 | <2 | >3 | >3 | >2 | <1 |
10-6 | <1 | <1 | >>3 | <1 | >>3 | >>3 | >>3 | >>3 | <1 | >>3 |
Cl II | <2 | <2 | <2 | <2 | >>3 | >>3 | >>3 | >2 | <2 | <2 |
Cl IV | <1 | <2 | >>3 | >>3 | >>3 | >>3 | <2 | >>3 | >>3 | <1 |
Cl III | <1 | <1 | <1 | >>2 | >>2 | >>2 | >>2 | >>2 | <2 | >>2 |
The order of magnitude increase in aptamer-analog Kd compared to the aptamer-GTP Kd is presented for each aptamer-analog pair. Analogs for which log10 Kd (analog) / Kd (GTP) >2 are highlighted with bold text. Analogs for which no binding was observed are marked >>3 or >>2, where applicable (see text). Binding curves used to calculate the aptamer-analog Kds are presented in Supporting Figure 2. Supporting Table 1 gives the Kd (analog) / Kd (GTP) values for this set of analogs. me: methyl. gua: guanosine.
The log10 Kd (analog) / Kd (GTP) values for all of the aptamers are plotted in Figure 3 (also see Tables 2 and 3 and supporting Table 1). For 34% of the aptamer-analog pairs, aptamer binding was not very specific for GTP relative to the analog; log10 Kd (Analog) / Kd (GTP) was < 1 and ΔΔG < 1.4 kcal mol−1. For 38% of the aptamer-analog pairs, aptamer binding was moderately specific for GTP relative to the analog; log10 Kd (Analog) / Kd (GTP) was > 1 but < 2 (1.4 kcal mol−1 < ΔΔG < 2.7 kcal mol−1). In the remaining 28% of the aptamer-analog pairs, binding was very specific for GTP, with log10 Kd (Analog) / Kd (GTP) > 2 (ΔΔG > 2.7 kcal mol−1). For many of these cases the 32P-GTP was not detectably displaced in the competition binding assays, i.e. no binding was detected, corresponding to log10 Kd (Analog) / Kd (GTP) >> 3 except in the case of the low affinity Class III aptamer, for which no binding corresponds to log10 Kd (Analog) / Kd (GTP) >> 2.
Figure 3.
Aptamer specificities for GTP analogs. log10 Kd (Analog) / Kd (GTP), which is the magnitude by which binding is specific for GTP compared to an analog, is plotted for each aptamer/analog pair. The aptamers are arranged from left to right in order of affinity for GTP (shown in parentheses). For each aptamer, the number of analogs to which no binding was observed is represented in the bar graph at the top.
Table 3.
Aptamer binding to GTP analogs with removed or altered functional groups.
aptamer | GDP | GMP | guanosine | 2'-deoxy- GTP |
3'-deoxy- GTP |
7-deaza- GTP |
---|---|---|---|---|---|---|
9-4 | <1 | <2 | <1 | <1 | <2 | >>3 |
Cl V | <1 | <2 | <2 | <2 | <1 | >>3 |
10-10 | <1 | <1 | <2 | >2 | <2 | >2 |
Cl I | <1 | <1 | <1 | <2 | <1 | >>3 |
10–59 | <1 | <1 | <1 | <1 | <2 | >>3 |
9–12 | <1 | <1 | <2 | <2 | <1 | >2 |
10–24 | <1 | <2 | <1 | <1 | <2 | <1 |
10-6 | <1 | <1 | <2 | >2 | <1 | <2 |
Cl II | <2 | <2 | <2 | <1 | <2 | <2 |
Cl IV | <1 | <1 | <2 | <2 | <2 | >>3 |
Cl III | <1 | <1 | >>2 | <1 | <1 | >>2 |
This table presents the order of magnitude increase in aptamer-analog Kd compared to the aptamer-GTP Kd for each aptamer-analog pair. Analogs for which log10 Kd (analog) / Kd (GTP) >2 are highlighted with bold text. Analogs for which no binding was observed are marked >>3 or >>2, where applicable (see text). Binding curves used to calculate the aptamer-analog Kds are presented in Supporting Figure 2. Supporting Table 2 gives the Kd (analog) / Kd (GTP) values for this set of analogs.
Overall, the aptamers have very little specificity for the ligand triphosphate region, and only slightly more discrimination for modifications to the ribose sugar. All of the aptamers are clearly most specific for the GTP nucleobase (Figure 2).
Only the Class III aptamer showed greatly reduced binding (log10 Kd (Analog) / Kd (GTP) >2) to analogs where the triphosphate had been modified or deleted. The Class III aptamer does not bind to guanosine even though the Class III aptamer Kds for GDP, GMP, GTP-γ-S and GTP-α-S are nearly the same as the Kd for GTP (Tables 1 and 2). Note that due to limited analog solubility, we could not accurately measure the Class III aptamer affinity for any analog where the Kd was >2 orders of magnitude larger than the Class III aptamer Kd (8 µ M) for GTP. Nonetheless, we can conclude that the Class III aptamer Kd for guanosine is at least two orders of magnitude worse than the Class III Kd for GTP.
Across the entire set, the aptamers display slightly more specificity for the ligand ribose sugar than the triphosphate region. Only two of the aptamers had >100-fold increases in Kd when the 2' OH was removed (Table 2) and none were affected that strongly by loss of the 3' OH. Aptamers 10-10, 10-6 and Class IV bound poorly to 2'-O-methyl-GTP and neither the Class IV nor the Class III aptamers exhibited binding to 3'-O-methyl-GTP.
Modifications to the nucleobase moiety (Figure 2) caused by far the greatest effects on binding. In fact, just 25% (19 of 77) of the aptamer-analog combinations involving analogs with nucleobase changes or deletions exhibited binding within 100-fold that of GTP (log10 Kd (Analog) / Kd (GTP) < 2). When free in solution, guanosine nucleotides and nucleosides typically adopt syn conformations about their glycosidic bonds, in contrast to most other nucleotides and nucleosides that adopt anti conformations23. Inosine triphosphate (ITP) differs from GTP only in the loss of the 2-amino group, one effect of which is that ITP exists in the anti conformation23. The importance of the glycosidic bond conformation and the sensitivity of the selected aptamers to modifications on the ligand nucleobase are illustrated by the fact that none of the aptamers bind detectably to ITP, or to ATP, CTP, TTP or UTP (data not shown).
Distinct patterns of binding specificity
We used the results from the competition binding assays to diagram the pattern of binding specificity for each aptamer (Tables 2 and 3; Figure 4). Ligand positions where modification resulted in substantially-reduced binding are denoted by red shading in Figure 4. Interestingly, each aptamer displays a pattern of binding specificity that is different from the other aptamers in the set. Here, specific binding to GTP compared to a given analog was demarcated by a log10 Kd (Analog) / Kd (GTP) value >2.
Figure 4.
Each aptamer has a unique pattern of binding specificity. For each aptamer, ligand positions where chemical modification resulted in large reductions in aptamer binding affinity relative to GTP (i.e. log10 Kd (Analog) / Kd (GTP) >2) are colored red. Ligand positions where chemical modifications did not reduce binding affinity more than 100 fold relative to GTP are colored green. The large shaded circles represent results for analogs with bulky modifications while the smaller inset circles represent results obtained with modified or deleted functional groups. Only the GMP moiety of the ligand is shown here because none of the modifications to the ligand β or γ phosphates that we tested resulted in substantially-reduced binding affinity relative to GTP. Also see Supporting Figure 2.
Changing the cut-off used to signify specific binding to, for example, log10 Kd (Analog) / Kd (GTP) >1, alters the patterns illustrated in figure 4. However, even with different cut-offs to signify specific binding, the aptamers appear sensitive to distinct subsets of ligand modifications (also see Supporting Figure 3). We interpret this to mean that each aptamer has a distinct ligand-binding surface.
Affinity and Specificity
We sought evidence that aptamers with high affinity for GTP are also more specific for GTP binding than aptamers with low affinity for GTP. We expected that aptamers with high affinity (lower Kd) for GTP would bind well to a smaller number of analogs than aptamers with low affinity (higher Kd) for GTP. For each aptamer we plotted the number of analogs to which the RNA exhibited high-affinity binding relative to GTP against the aptamer Kd for GTP (Figure 5). Initially, we examined analogs with log10 Kd (Analog) / Kd (GTP) values <2 (colored red in Figure 5). All of the aptamers displayed binding in this range to 8 to 12 of the analogs. Surprisingly, we do not see any evidence that the number of analogs to which each aptamer binds in this range is correlated with binding affinity for GTP (rs=0.07, P=0.42).
Figure 5.
Higher affinity aptamers are not more specific. For each aptamer, the number of analogs for which log10 Kd (Analog) / Kd (GTP) <N is plotted against Kd (GTP). Three separate values of N were employed to visualize different degrees of specificity: log10 Kd (Analog) / Kd (GTP) <1 (open black circles), <2 (open red squares), and <3 (solid green circles). The lines were generated by applying Kendall's robust line-fit method19 to the data indicated by the respective colors. Note that several of the plot points are the same for different values of N, resulting in overlap between red squares and solid green circles.
In order to ensure that choosing log10 Kd (Analog) / Kd (GTP) <2 as a cut-off did not obscure more subtle differences in specificity, we also computed the correlation statistic using log10 Kd (Analog) / Kd (GTP) < 1 and <3 as the cut-offs (Figure 5). There was no significant change in the correlation statistic whether the log10 Kd (Analog) / Kd (GTP) cut-off for specific binding was <1, <2 or <3. Thus, we do not find that aptamers that bind with more affinity for GTP are more specific for GTP binding compared to chemically-related analogs.
Free energy of secondary structure formation
Within the set of selected aptamers, the higher-affinity molecules tend to have longer sequences, are more informationally-complex and have more intricate secondary structures6 (Figure 1; Table 1). It is known that selection for more stable RNA tertiary structures can produce RNAs with more stable secondary structures24. We wondered whether selection for tighter-binding yields aptamers with unusually-stable secondary structures.
We calculated the free energy of secondary structure formation using the program RNAeval17 (Table 1). There is a significant correlation (P<0.02) between the apparent Kd for GTP and the free energy of secondary structure formation (ΔG2° (folding)) (Figure 6A). We compared the calculated ΔG2° (folding) value for each aptamer with 1000 computer-generated random sequences of the same length (Figure 6B). All eleven aptamers have ΔG2° (folding) values more stable than the median same-length random sequence values. The likelihood that eleven aptamer-length sequences would have calculated ΔG2° (folding) values larger than the median by chance is very small (P<0.001), suggesting that selection for secondary structural stability may have taken place. However, only the 9-4, Class V, 10–59 and Class III aptamers have calculated free energies of secondary structure formation that are significantly more stable than the same-length random sequences (i.e. ΔG2° (folding) values more stable than 95% of the same-length random sequence values).
Figure 6.
Aptamer secondary structures are not unusually stable. (A) Free energy of secondary structure formation (ΔG2°folding) calculated for each aptamer using the program RNAeval (see Experimental Section) is plotted against Kd (GTP) in nanomolar. The line was generated by applying Kendall's robust line-fit method. (B) Calculated ΔG2°folding for each aptamer is plotted against aptamer sequence length (solid black circles). The red line indicates the median ΔG2°folding values of 1000 random sequences plotted as a function of sequence length; the 33rd, 67th, and 95th percentiles are signified by the green, orange and purple lines, respectively.
It is possible that the calculated ΔG2° (folding) values do not accurately capture the magnitude of the actual secondary structure free energies because they do not consider the potential influence of the ligand or tertiary interactions within the RNA. Still, to a first approximation, these results indicate that selection for higher-affinity binding leads to RNAs with more stable secondary structures, but not necessarily to aptamers with secondary structures that are substantially more stable than predicted on the basis of size.
Discussion
We investigated the relationship between RNA aptamer binding affinity and binding specificity for a target ligand. The test set of evolved aptamers consisted of molecules with Kds for GTP that range from 8 µM to 9 nM. We assessed the affinity of each of the aptamers for a series of sixteen different chemical analogs of GTP, along with ATP, CTP, UTP, TTP and ITP in competition binding assays. Aptamer specificity for GTP was quantified as log10 Kd (analog) / Kd (GTP). It has been argued that selection for higher-affinity binding to a ligand automatically leads to aptamers with more specificity for the ligand9. Surprisingly, we did not find any evidence that the higher-affinity aptamers are more specific for GTP than the lower-affinity aptamers (Table 1; Figures 3 and 5).
We examined a wide range of degrees of specificity in order to maximize our chances of observing an affinity-specificity correlation. We initially set the cut-off for significant differences in specificity as log10 Kd (analog) / Kd (GTP) > 2 because previous studies have suggested that differences of that magnitude could be attributed to readily-observable structural features of the aptamer-ligand complex such as a steric clash or loss of a hydrogen bond15,21,25–27. Solution NMR studies of the Class I GTP aptamer28 demonstrate that analogs for which log10 Kd (Analog) / Kd (GTP) >2 have modifications that introduce discernible steric clashes and/or the loss of hydrogen-bonding interactions with the aptamer. Likewise, biochemical and structural15,27 studies of an RNA aptamer that binds theophylline showed that ligand modifications that result in approximately 100 fold or greater increases in Kd relative to theophylline correspond to particular intermolecular aptamer-ligand contacts. As noted above, we could not measure the Class III aptamer affinity for any analog whose Kdwas > 100 fold higher than its Kd for GTP (8 µM) because of limited analog solubility. Therefore, both empirical observations of aptamer structures and practical considerations indicate that the most conservative treatment of the data was to use log10 Kd (Analog) / Kd (GTP) >2 as the cut-off for meaningful differences in binding specificity. However, using log10 Kd (Analog) / Kd (GTP) cut-offs of >1 or >3 did not lead to either quantitatively or qualitatively different outcomes (Fig. 5).
The GTP analogs that we tested for aptamer binding had bulky substituents and functional group substitutions and deletions in the nucleobase, sugar and triphosphate regions. It is possible that differences in binding specificity are limited to sites for which we could not screen the effect of chemical group modifications, such as the sugar 1' and 4' positions or the nucleobase 4 and 5 positions. There were also many cases where the analogs bound so poorly that the binding affinity could only be described qualitatively as 'no binding detected'. Neither of these caveats alters the overall picture: there is no obvious trend toward substantially more binding specificity for GTP as the aptamer affinity for GTP increases.
It is notable that all of the aptamers in our set have unique patterns of ligand-binding specificity (Figure 4; Supporting Figure 3). NMR structural studies of the Class I GTP aptamer28 illustrate that the RNA-ligand contacts are mediated by platforms of aptamer purines that sandwich the planar portion of the ligand and a base-base interaction between the GTP and an aptamer guanosine. This general mode of recognition was seen previously in RNA aptamers that bind to ATP25,26,29, FMN30,31, and theophylline3,15,27. Further high resolution structural studies will be required to determine whether this motif is present in any of the other GTP aptamers. Although some of the GTP aptamers may employ common modes of ligand binding, the fact that they have unique patterns of binding specificity implies that there are differences in the shapes and compositions of their binding pockets. These results, combined with the observation that the GTP aptamer recognition bulges do not share sequence identity6 (Figure 1), reinforce the notion that large sets of structurally-diverse aptamers can be recovered from sparse searches of sequence space5,6.
The GTP aptamers were isolated from pools of molecules containing long stretches of random sequence5. The aptamers most likely to be recovered are the simplest structures, defined as those lowest in informational complexity, with the binding affinity required to survive the process of selection7,32. Therefore, the physical mechanisms used to attain the required Kds should be the simplest to implement structurally. Earlier efforts to understand the mechanisms by which aptamers achieve low Kds focused exclusively on the energetics of the local intermolecular aptamer-ligand interactions3,9. Those analyses assumed that maximizing the intermolecular geometric complementarity and the quality and quantity of aptamer-ligand functional group interactions within the binding pocket was the easiest way to improve the aptamer Kd for a ligand (e.g. aligning hydrogen-bond donors within the RNA with hydrogen-bond acceptors in the ligand). All else being equal, increased aptamer-ligand steric and functional group complementarity would cause higher-affinity aptamers to bind more specifically to the target ligand than lower-affinity aptamers9. Conversely, since we found that higher-affinity aptamers do not bind more specifically to the target ligand (Table 1, Figure 5) we conclude that their binding pockets are unlikely to have better shape and functional group complementarity with the ligand than the binding pockets of lower-affinity aptamers.
How might aptamers achieve higher-affinity binding without an increased number of intermolecular interactions and better shape complementarity with the ligand? Obviously, the local energetics of the aptamer-ligand interaction are an important aspect of the binding affinity3,8,9. However, the aptamer-ligand Kd is determined by the ΔG between the bound and unbound states of the entire system, not just the local aptamer-ligand interactions10. In fact, structural studies have revealed that ligand binding and RNA aptamer folding are often tightly coupled3,26–28,33, underscoring the potential relevance of tertiary intramolecular RNA-RNA interactions in improving the aptamer-ligand Kd. We hypothesize that differences in the tertiary stabilities of the RNA folds could account for much of the observed variation in aptamer-GTP Kds. If so, the same physical mechanisms that improve the aptamer Kd for the target ligand would also improve the aptamer Kd for chemically-related ligands. In turn, this would explain why we do not see evidence that higher-affinity aptamers bind more specifically to GTP.
There is a positive correlation between aptamer-GTP Kd and the calculated ΔG of secondary structure formation (Figure 6A). This is consistent with a role for the stability of the RNA secondary structure fold in differentiating the higher-affinity from the lower-affinity aptamers. Yet, only four of the eleven aptamers (Figure 6B) have calculated ΔGs of secondary structure formation that are significantly more stable than predicted from their sequence lengths. One possibility is that while the higher-affinity aptamers could have more stable tertiary folds, there was no direct selection pressure for more stable secondary structures34. Because RNA tertiary folding is highly-cooperative and facilitated by non-local interactions10 it is difficult to separate the ligand binding energies from RNA folding energies except where only simple base-pairing is involved35. Here the analysis was restricted to consideration of the calculated free energies of secondary structure formation. Clearly, high resolution structure models will be necessary to fully understand the role of the RNA-RNA interactions, separate from the RNA-ligand interactions, in determining tertiary structural stability and aptamer-ligand affinities.
We know from sequence comparison data that more nucleotides in the higher-affinity GTP aptamers are subject to functional constraints6. The functional constraints are a consequence of the need to make intermolecular contacts with the ligand in the context of a folded RNA secondary and tertiary structure. For this set of aptamers, the informational cost to improve binding affinity 10-fold is approximately 10 bits, enough to specify 5 conserved positions or 5 base pairs in an RNA structure in the conditions of the original selection5. The findings presented here suggest that increasing the RNA shape and functional group complementarity for the ligand is not the simplest way to improve binding affinity. Rather, the informational cost of improving aptamer affinity may be a function of the increased complexity required to form more stable tertiary RNA structures.
One prediction arising from the suggestion that higher-affinity binding may be achieved through the formation of more stable tertiary structures is that selection for stability could inadvertently result in molecules that are more specific for the target ligand. Selection for stability could produce a molecule with more rigidity in the elements surrounding a binding pocket, eliminating the structural flexibility needed to accommodate chemically-related analogs of a given ligand36. Consistent with this expectation, a non-biological ATP binding protein subjected to selection for increased stability in the presence of a chemical denaturant showed more specific binding to ATP when tested with a panel of analogs36.
We have previously suggested6 that RNA aptamers that bind planar, polar ligands, such as GTP, require less sequence information to specify than structures that bind flexible, non-polar targets with the same affinity in similar environments. We further argued that a particular degree of improvement in binding affinity could have the same informational cost, regardless of the chemical nature of the ligand, if increasing the number of interactions within the RNA, rather than between the RNA and the ligand, is generally the simplest way to improve affinity. Determining whether the observations and conclusions made here about GTP can be extended to other non-planar, or non-polar ligands will be an important test of this hypothesis.
Increasing aptamer affinity by stabilizing the tertiary fold of the RNA, or by improving the quality of the interactions with the ligand, do not seem mutually exclusive. Why then is more specific binding, predicted from the latter process, not observed in our set of aptamers? We propose several potentially relevant factors. Across the whole set of aptamers, (Figures 2 and 4) binding is greatly affected by modifications to the ligand nucleobase. It may be difficult for additional RNA-ligand nucleobase interactions to come into play. At the same time, modifications to the ligand ribose or triphosphate frequently have only small negative effects on binding. Since the aptamers were selected from a random sequence pool, we only see the most likely solutions. Improving aptamer Kd by increasing the tertiary stability of the RNA must require structures that are less informationally-complex than those needed to improve aptamer Kd by interacting more strongly with the sugar or triphosphate moieties of the ligand37. Although higher-affinity aptamers that bind more specifically to the target ligand may exist in sequence space, their greater informational complexities mean that they can only be isolated by selecting for both affinity and specificity.
Acknowledgments
We thank G. Cho for help with the competition binding curve analysis and A. Keefe and R. Larralde for many useful suggestions. We thank B. Seelig for comments on the manuscript. This work was supported by a grant from the US National Institutes of Health (GM53936 to J.W.S.). J.M.C. was supported in part by a US National Science Foundation Graduate Fellowship. S.C.O. was supported in part by a PhD scholarship of the Foundation of the German Chemical Industry. J.W.S. is an Investigator of the Howard Hughes Medical Institute.
Footnotes
Supporting Information Available Methods for Kd determination (Supporting methods). Chemical analogs of GTP employed in this study (Supporting Figure 1). Aptamer-analog competition binding curves (Supporting Figure 2). Aptamer patterns of binding specificity (Supporting Figure 3). Kd (analog) / Kd (GTP) values for analogs with bearing bulky substituents (Supporting Table 1). Kd (analog) / Kd (GTP) values for analogs with modified or deleted functional groups (Supporting Table 2).
References
- 1.Ellington AE, Szostak JW. Nature. 1990;346:818–822. doi: 10.1038/346818a0. [DOI] [PubMed] [Google Scholar]
- 2.Wilson DS, Szostak JW. Annu. Rev. Biochem. 1990;68:611–647. doi: 10.1146/annurev.biochem.68.1.611. [DOI] [PubMed] [Google Scholar]
- 3.Hermann T, Patel DJ. Science. 2000;287:820–825. doi: 10.1126/science.287.5454.820. [DOI] [PubMed] [Google Scholar]
- 4.Carothers JM, Szostak JW. In: The aptamer handbook: functional oligonucleotides and their applications. Klussmann S, editor. Berlin: Wiley-VCH; 2006. pp. 3–28. [Google Scholar]
- 5.Davis JD, Szostak JW. Proc. Natl. Acad. Sci. U.S.A. 2002;99:11616–11621. doi: 10.1073/pnas.182095699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Carothers JM, Oestreich SC, Davis JH, Szostak JW. J. Am. Chem. Soc. 2004;126:5130–5135. doi: 10.1021/ja031504a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lorsch JR, Szostak JW. Acc. Chem. Res. 1996;29:103–110. doi: 10.1021/ar9501378. [DOI] [PubMed] [Google Scholar]
- 8.Lancet D, Sadovsky E, Seidemann E. Proc. Natl. Acad. Sci. U.S.A. 1993;90:3715–3719. doi: 10.1073/pnas.90.8.3715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Eaton BE, Gold L, Zichi DA. Chem. & Biol. 1995;2:633–638. doi: 10.1016/1074-5521(95)90023-3. [DOI] [PubMed] [Google Scholar]
- 10.Szwajkajzer D, Carey J. Bioploymers. 1997;44:181–198. doi: 10.1002/(SICI)1097-0282(1997)44:2<181::AID-BIP5>3.0.CO;2-R. [DOI] [PubMed] [Google Scholar]
- 11.Schneider TD, Stormo GD, Gold L, Ehrenfeucht A. J. Mol. Biol. 1986;188:415–431. doi: 10.1016/0022-2836(86)90165-8. [DOI] [PubMed] [Google Scholar]
- 12.Adami C, Cerf NJ. Physica D (Amsterdam) 2000;37:62–69. 2000. [Google Scholar]
- 13.Adami C. Physics of Life Reviews. 2004;1:3–22. [Google Scholar]
- 14.Lau MWL, Cadieux KEC, Unrau PJ. J. Am. Chem. Soc. 2004;126:15686–15693. doi: 10.1021/ja045387a. [DOI] [PubMed] [Google Scholar]
- 15.Jenison RD, Gill SC, Pardi A, Polisky B. Science. 1994;263:1425–1429. doi: 10.1126/science.7510417. [DOI] [PubMed] [Google Scholar]
- 16.Motulsky H, Christopoulos A. Fitting Models to Biological Data using Linear and Nonlinear Regression. San Diego, CA: Graphpad; 2003. [Google Scholar]
- 17.Hofacker IL, Fontana W, Stadler PF, Bonhoeffer LS, Tacker M, Schuster P. Monatshefte f. Chemie. 1994;125:167–188. [Google Scholar]
- 18.Sokol RR, Rohlf J. Biometry: The Principals and Practice of Statistics in Biological Research. 3rd ed. New York, NY: W. H. Freeman & Co.; 1995. [Google Scholar]
- 19.Rosner BA. Fundamentals of Biostatistics. 4th ed. Belmont, CA: Duxbury Press; 1995. [Google Scholar]
- 20.Christian EL, Yarus M. J. Mol. Biol. 1992;228:743–758. doi: 10.1016/0022-2836(92)90861-d. [DOI] [PubMed] [Google Scholar]
- 21.Koizumi M, Breaker RR. Biochemistry. 2000;39:8983–8992. doi: 10.1021/bi000149n. [DOI] [PubMed] [Google Scholar]
- 22.Eisenberg D, Crothers DM. Physical Chemistry with Applications to the Life Sciences. Menlo Park, CA: Benjamin; 1979. [Google Scholar]
- 23.Saenger W. Principles of nucleic acid structure. New York, NY: Springer-Verlag New York; 1984. [Google Scholar]
- 24.Juneau K, Podell E, Harrington DJ, Cech TR. Structure (Camb) 2001;9:221–231. doi: 10.1016/s0969-2126(01)00579-2. [DOI] [PubMed] [Google Scholar]
- 25.Sassanfar M, Szostak JW. Nature. 1993;364:550–553. doi: 10.1038/364550a0. [DOI] [PubMed] [Google Scholar]
- 26.Dieckmann T, Suzuki E, Nakamura GK, Feigon J. RNA. 1996;2:628–640. [PMC free article] [PubMed] [Google Scholar]
- 27.Zimmermann GR, Jenison RD, Wick CL, Simorre JP, Pardi A. Nat. Struct. Biol. 1997;4:644–649. doi: 10.1038/nsb0897-644. [DOI] [PubMed] [Google Scholar]
- 28.Carothers JM, Davis JH, Chou JJ, Szostak JW. RNA. 2006;12:567–579. doi: 10.1261/rna.2251306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jiang F, Kumar RA, Jones RA, Patel DJ. Nature. 1996;382:183–186. doi: 10.1038/382183a0. [DOI] [PubMed] [Google Scholar]
- 30.Burgstaller P, Famulok M. Angew. Chem. Int. Ed. Engl. 1994;33:1084–1087. [Google Scholar]
- 31.Fan P, Suri A, Fiala R, Live D, Patel D. J. Mol. Biol. 1996;258:480–500. doi: 10.1006/jmbi.1996.0263. [DOI] [PubMed] [Google Scholar]
- 32.Salehi-Ashtiani K, Szostak JW. Nature. 2001;414:82–84. doi: 10.1038/35102081. [DOI] [PubMed] [Google Scholar]
- 33.Sussman D, Nix JC, Wilson C. Nat. Struct. Biol. 2000;7:53–57. doi: 10.1038/71253. [DOI] [PubMed] [Google Scholar]
- 34.Guo F, Ceck TR. Nat. Struct. Biol. 2002;9:855–861. doi: 10.1038/nsb850. [DOI] [PubMed] [Google Scholar]
- 35.Hertel KJ, Peracchi A, Uhlenbeck OC, Herschlag D. Proc. Natl. Acad. Sci. U.S.A. 1997;94:8497–8502. doi: 10.1073/pnas.94.16.8497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Chaput JC, Szostak JW. Chem. & Biol. 2004;11:865–874. doi: 10.1016/j.chembiol.2004.04.006. [DOI] [PubMed] [Google Scholar]
- 37.Sazani PL, Larralde R, Szostak JW. J. Am. Chem. Soc. 2004;126:8370–8371. doi: 10.1021/ja049171k. [DOI] [PMC free article] [PubMed] [Google Scholar]