Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2016 Oct 7;113(43):12076–12081. doi: 10.1073/pnas.1605086113

Controlling uncertainty in aptamer selection

Fabian Spill a,b, Zohar B Weinstein c, Atena Irani Shemirani b, Nga Ho b, Darash Desai b,1, Muhammad H Zaman b,d,1
PMCID: PMC5087011  PMID: 27790993

Significance

Oligonucleotide aptamers have increasing applications as a class of molecules that bind with high affinity and specificity to a target. Aptamers are typically selected from a large pool of random candidate nucleic acid libraries through competition for the target. Using a stochastic hybrid model, we are able to study the combined impact of important evolutionary success factors such as competition, randomness, and changes in the environment. Whereas the environment may be tuned with experimental parameters such as target concentration, competition varies with differences in the initial distribution of aptamer–target binding affinities, and random events can eliminate even the ligands with the highest affinity.

Keywords: aptamer, SELEX, evolutionary dynamics, stochastic process, hybrid model

Abstract

The search for high-affinity aptamers for targets such as proteins, small molecules, or cancer cells remains a formidable endeavor. Systematic Evolution of Ligands by EXponential Enrichment (SELEX) offers an iterative process to discover these aptamers through evolutionary selection of high-affinity candidates from a highly diverse random pool. This randomness dictates an unknown population distribution of fitness parameters, encoded by the binding affinities, toward SELEX targets. Adding to this uncertainty, repeating SELEX under identical conditions may lead to variable outcomes. These uncertainties pose a challenge when tuning selection pressures to isolate high-affinity ligands. Here, we present a stochastic hybrid model that describes the evolutionary selection of aptamers to explore the impact of these unknowns. To our surprise, we find that even single copies of high-affinity ligands in a pool of billions can strongly influence population dynamics, yet their survival is highly dependent on chance. We perform Monte Carlo simulations to explore the impact of environmental parameters, such as the target concentration, on selection efficiency in SELEX and identify strategies to control these uncertainties to ultimately improve the outcome and speed of this time- and resource-intensive process.


Understanding and exploiting target–ligand binding are bedrocks of the biomedical sciences and support a host of applications ranging from diagnostics, therapeutics, and drug discovery to biosensing, imaging, and gene regulation. Antibodies and rational design provide a constructive playground to develop these applications, yet there generally remains a paucity of strong and specific binders for the innumerable viral, protein, and small-molecule targets under investigation.

Aptamers offer an alternative to antibodies, yet despite their growth (14), the discovery of high-affinity aptamers remains a challenge, especially for small-molecule targets (5, 6). Systematic Evolution of Ligands by EXponential Enrichment (SELEX) (7, 8) is the premier framework for aptamer development and isolates high-affinity ligands from an initial library similar to how advantageous traits are enriched in a biological population through Darwinian selection. In a cyclic process, ligands are incubated with the target, and those that exhibit preferential binding are amplified and survive to the next round. Target molecules are typically immobilized on a substrate material to facilitate easy separation of target-bound and unbound ligands. Through numerous rounds of selection, an initial library can be reduced to a handful of high-affinity aptamers. Nucleic acids comprise the vast majority of libraries used in SELEX, where sequence regions are randomized to generate tremendous structural diversity. Whereas this diversity underpins the evolutionary nature of SELEX, numerous works suggest that initial library design is a significant contributor to its overall success (9).

Although conceptually simple, the practical application of SELEX is plagued by uncertainty. Despite the impact of library design, the initial affinity distribution for any library toward a specific target remains a priori unknown. Target immobilization further complicates the procedure, particularly for small molecules. In comparison with large molecular weight targets such as proteins (10), viruses (11), and whole cells (12, 13), the immobilization of small molecules eliminates ligand binding sites and is thus impractical. Newer approaches instead bind the library itself to a substrate material using noncovalent equilibrium binding, but this introduces the opportunity for competitive losses of high-affinity ligands that are initially present in extremely low numbers. Wash steps and other experimental procedures may lead to further random losses, whereas nonspecific selection of ligands can counter environmental pressures and stall selection. In short, these uncertainties may quickly compound to apply tremendous risk toward the guarantee of successful selection.

Mathematical modeling therefore has great potential to help understand the uncertainties of aptamer selection and devise strategies to optimize environmental parameters and improve selection outcomes. Previous models have explored SELEX for protein targets, considering parameters such as target concentration (1416), separation efficiency of target-bound and unbound ligand (17), nonspecific binding of DNA to target (18), and negative selection steps (19). These studies predict that, despite its experimental complexity, the evolutionary nature of SELEX guarantees selection of the highest affinity ligand from the initial library. However, these works focus primarily on the use of deterministic equilibrium equations (14), whereas the presence of ligands in low copy numbers and the role of other experimental uncertainties suggest the use of more fundamental stochastic models rather than deterministic approximations. Mathematically, the chemical master equation provides a framework to test this hypothesis and generalize the above-mentioned deterministic models to include intrinsic stochasticity (20). Whereas this approach could be applied toward a purely stochastic model for SELEX, the result cannot currently be solved analytically or simulated by conventional techniques such as the Gillespie algorithm (21), due to the large number of molecules present. These limitations are common for many stochastic multiscale problems in biology, chemistry, and physics; the development of novel analytic approximations or numerical techniques to address this problem is an important ongoing research topic (22).

Using these ideas as our foundation, we introduce a hybrid model for aptamer selection that builds on the chemical master equation to introduce stochastic uncertainty in SELEX modeling. Here, ligands are separated into two categories of high and low copy number. In the former case, the master equation is simplified toward a deterministic equilibrium system, whereas in the latter it can be approximately solved analytically. Unlike previous efforts to incorporate stochasticity into aptamer modeling (23, 24), our framework allows us to simultaneously investigate the impact of low copy number ligands and their competitive binding to target molecules and immobilization substrates among the presence of high copy number ligands. Most importantly, this approach can capture total loss of individual ligands, which can strongly contribute to protocol outcome. Such events have not previously been investigated and cannot be captured by other approximations of the master equation such as the Langevin approximation, which rely on the presence of sufficiently high numbers of molecules and thereby diminish the possibility of extinction events (25).

Using this framework, we investigate unexplored sources of uncertainty in SELEX, beginning with a systematic analysis of the role the initial library affinity distribution plays in selection. We further challenge the assumption that this distribution is continuous at its tails and evaluate the impact of adding noise at these extremes. We find that introducing as few as 20 additional ligands outside the bulk distribution of 1015 molecules can strongly affect the outcome of selection. In light of these results, we revisit the topic of optimizing target concentration as discussed in previous works (1416), and show that the assumed initial KD distribution strongly influences protocol optimizations. We also provide additional insights regarding noncovalent ligand immobilization to support more recent efforts to develop robust protocols for small-molecule SELEX (2628). Integrating these ideas, we show that simultaneously lowering the target concentration and the substrate binding dissociation constant over the SELEX cycles can lead to improved selection outcomes for a wide range of initial conditions.

Computational Model of Selection Dynamics

The original SELEX protocol (7, 8) serves as the basis for our model, with additional modifications to accommodate small-molecule targets as described in ref. 26. Whereas this marks a model that specifically considers small-molecule targets, the main ideas and conclusions derived from this work remain applicable to other targets and selection schemes. The main steps of our approach are summarized in Fig. 1. We begin with a library of Aitot ligands of type i, where i={1,,MA} and MA is the total number of unique ligands. The ligands are then noncovalently immobilized using Stot substrate molecules, where KS is the ligand–substrate dissociation constant. These complexes are then subjected to wash steps to remove unbound ligands, from which AiI ligands of type i survive. Surviving ligands are then incubated with Ttot target molecules, where a ligand of type i binds to the target with a dissociation constant KD,i. Ligands that are bound to a target or have unbound from the substrate are partitioned from those that remain bound to the substrate. Finally, the partitioned ligands are amplified via PCR, modeled as a constant factor increase of αPCR, and used to begin the next cycle. The proceeding sections highlight the notable details of our hybrid approach, whereas a more thorough description and derivation of the model can be found in the SI Appendix. Throughout these sections, quantities that refer to an absolute number of molecules are denoted with a tilde, whereas those without represent concentrations.

Fig. 1.

Fig. 1.

Sample candidate library of ligands Ai is prepared by letting the ligands bind to a substrate S. Then, the target is added, leading to competitive binding between the different aptamers for substrate and target molecules T. The ligands still bound to the substrate are then separated from those which are either bound to a target, or have randomly unbound from the substrate. The latter two are subsequently amplified and taken into the next cycle.

Deterministic Model of Ligand Binding.

Earlier works use equilibrium conditions to characterize ligand–target interactions during selection (1417), focusing on changes in bulk properties, such as the mean dissociation constant, to study the enrichment of a single best candidate. We instead monitor the full ligand affinity distribution in an effort to better understand how parameters such as the initial SD also impact selection dynamics. Because modeling each of the MA1015 unique ligands is computationally intractable, we discretize the initial distribution of MA unique ligands into MB bins, each containing Ai ligands of dissociation constant KD,i, where i={1,,MB}. We choose MB to be large enough that the results do not depend on the binning, and small enough to optimize simulation performance. We further build on this analysis by introducing additional equilibrium conditions for nonspecific ligand–substrate interactions represented by a dissociation constant KS. In ref. 26, substrate–ligand binding is accomplished through DNA base pairing using a fixed sequence, and is thus constant. Altering the length of this fixed sequence is a means to tune KS. Moreover, different immobilization techniques, such as the use of graphene oxide (27, 28), will lead to variations of KS within a given pool, but we do not consider such cases here and instead treat KS to be constant throughout a single cycle of SELEX. Combining ligand–target and ligand–substrate binding, the full system of steady-state equilibrium binding conditions can be described by the set of equations:

[SAi]=1KS(AiI[SAi][TAi])Sfree,i=1,,MB,[TAi]=1KD,i(AiI[SAi][TAi])Tfree,i=1,,MB,Stot=i=1MB[SAi]+Sfree,Ttot=i=1MB[TAi]+Tfree. [1]

Here, [SAi] and [TAi] denote the concentration of ligand–substrate and ligand–target complexes, representing 2MB independent variables that are solved for; the quantities Ttot,Tfree and Stot,Sfree denote the concentrations of total and free target and substrate, respectively. From these results, we determine the concentration of ligands which survive selection, denoted by AiS,D, and are amplified by PCR for the next cycle. The superscripts denote that this number is obtained after selection and using the deterministic model defined by Eq. 1. This concentration is simply the sum of free- and target-bound ligands, and is hence given by

AiS,D=[TAi]+Aifree=AiI[SAi]. [2]

Stochastic Model of Ligand Selection.

Chemical reactions are fundamentally stochastic in nature, with forward and backward reactions occurring constantly. Whereas powerful and simple, Eq. 1 is based on real-valued concentrations which require sufficiently high molecular copy numbers to make discreteness and random fluctuations negligible. This is challenged at the tails of the KD distribution, where appropriate binning results in few ligands per bin. To address this, a hybrid approach is used where additional stochastic analysis is applied when Eq. 1 predicts AiS,D to be below a threshold Θ. To distinguish these quantities for stochastic analysis, we denote them as AψS,D, where ψ represents the subset of indices i that satisfy the condition AiS,D<Θ. Results exploring the choice for Θ are provided in SI Appendix, Fig. S6. We then calculate the probability for selecting AψS,S ligands, p(AψS,S); the superscripts denotes that the number is obtained after selection and using the stochastic model. As described in the SI Appendix, we find that by starting with the chemical master equation, p(AψS,S) is well-approximated by a binomial distribution:

p(AψS,S)=(AψtotAψS,S)pψAψS,S(1pψ)AψtotAψS,S,forAψS,S=0,,Aψtot. [3]

Here, the quantity pψ represents the probability that a single ligand is selected out of Aψtot ligands of type ψ. To provide the most accurate description, we account for stochastic contributions from both the immobilization and incubation steps. The contribution from immobilization is approximately the same for all candidates, and is given by AI/Atot, the fraction of remaining immobilized ligands after wash steps over those present before immobilization, where AI=i=1MBAiI and Atot=i=1MBAitot. The contribution from incubation is calculated as the fraction of predicted ligands, AψS,D, out of an initial number of AψI. Using these contributions, the total probability that a ligand in bin ψ survives is given by

pψ=AIAψS,DAtotAψI. [4]

Finally, Eq. 3 requires Aψtot to be integer-valued, as it denotes a number of molecules. However, the deterministic equations yield real-valued concentrations that must be renormalized to an integer. We separate Aψtot into its integer and fractional parts, Aψtot=Aψ,tot+Aψ,ftot, and then interpret 0Aψ,ftot<1 as the probability to have an extra molecule present. We then draw a uniformly distributed random number 0r1, and set Aψtot=Aψ,tot+1 if r<Aψ,ftot, and Aψtot=Aψ,tot otherwise. Following this renormalization, we finally draw a random variate distributed according to Eq. 3 to simulate the set of ligands AψS,S that remain after both immobilization and selection.

Results and Discussion

Using a hybrid computational approach, our model provides a generalized framework that can be used to analyze both deterministic and stochastic effects in SELEX. We use the model to deconstruct two main forms of uncertainties in aptamer selection. The first is parameter uncertainty, including the unknown initial KD distribution as well as the experimentally tunable quantities KS and Ttot. These are analyzed using a parameter study that observes the impact of these factors on SELEX dynamics. The second is stochastic uncertainty associated with low copy number binding phenomena. As this form of uncertainty is random in nature, we use Monte Carlo simulations to observe the variability in outcomes between repeated SELEX procedures and extract conclusions which are robust with respect to stochastic fluctuations. Unless mentioned otherwise, the parameters from SI Appendix, Table S1 are used in all simulations.

Effect of KD Distribution on Selection Efficiency.

Gaussian distributions describing the initial ligand pool dominate SELEX models in literature (16), yet we are not aware of any prior systematic approach to study the impact of various distributions on the outcome of SELEX. Whereas strong justifications have been made for the assumption of a log-normal Gaussian description (29), we explore various Gaussian as well as non-Gaussian distributions and their impact on selection. Our convention for log-normal KD distributions is such that a Gaussian N(μ,σ) with mean μ and SD σ in log-space translates to a mean of 10μ in KD space; we do not shift the mean by (1/2)σ2 as is customary in Ito calculus. Fig. 2 highlights the dramatic difference observed for just two different assumed distributions, and demonstrates the significant role the initial KD distribution plays in SELEX. This point is further accentuated by the fact that different selection targets may significantly alter the initial KD distribution for any given library. SI Appendix, Fig. S1 confirms that for a variety of other distributions, including non-Gaussians, distribution shape has a dramatic impact on selection dynamics.

Fig. 2.

Fig. 2.

Initial distribution affects SELEX dynamics. We plot the distribution of ligand binding affinities with increasing SELEX cycles for the same experimental parameters and two different assumed Gaussian distributions at cycle 1, N(3,0.4) (blue triangles) and N(5,0.8) (red dots). The dynamics of the two cases are totally different. For N(5,0.8), the distribution shifts to the left and becomes considerably narrower, while for N(3,0.4), the distribution additionally skews to the left, such that after cycle 12 the highest affinity binders have outcompeted the rest of the distribution.

In addition to shape, we also explore the assumption that the KD distribution is continuous everywhere. Whereas this assumption is credible near the distribution mean where the frequency of molecules is sufficiently high, we expect it to fail at the extreme tails where stochastic effects dominate and highly specific sequences can create gaps in the affinity distribution. Indeed, it is well-known that even single base-pair changes in DNA can dramatically impact binding (30). Ligands in this regime are highly prized, but may also be at highest risk to be lost to stochastic effects due to low copy numbers.

We investigate this risk by using an initial N(4,0.4) distribution and adding a fixed noise component that is randomly sampled from a uniform distribution in log-space. Fig. 3 and Movie S1 show a comparison of 2 Monte Carlo simulations where there are only 20 ligands present in the range of KD<107M, i.e., where the continuous Gaussian distribution is effectively zero. We find that random binding effects can lead to total loss of those 20 ligands, resulting in a very different evolution of the KD distribution from cycle 12 onward in comparison with the case where only 2 of those ligands survive. SI Appendix, Fig. S2 shows a distribution of the mean ligand KD at cycle 20 obtained from 250 Monte Carlo simulations, confirming this enormous variability in outcomes, where the mean KD value spans 3 orders of magnitude.

Fig. 3.

Fig. 3.

Noise affects SELEX dynamics. We fix the experimental parameters, the initial Gaussian distribution N(4,0.4) and the added noise of only 20 additional ligands initially present between KD=1010M and 5×108M. Two different Monte Carlo simulations from these identical initial conditions show dynamics of selection under random loss of the 20 strongest binders (blue triangles), versus dynamics when only two of those strong binders with affinities between 1010 and 109M are selected (red dots). In the latter case, these two high-affinity binders completely dominate the distribution from cycle 12 on and outcompete the remaining ligands with low affinities (KD>107M).

These results demonstrate the tremendous sensitivity of selection dynamics to both distribution shape and noise. They illustrate that selection pressures are parameterized not only by extrinsic environmental conditions given by the experimental setup, such as the tunable quantities KS and Ttot, but just as importantly by inherently uncertain intrinsic population parameters that govern relative competition between ligands of varying affinities.

Revisiting Target Concentration.

Optimization of the target concentration, Ttot, has long stood as a critical step in adjusting selection pressure based on experimental parameters (1416). However, the results from the previous section now suggest that in addition to these experimental factors, the intrinsic affinity distribution of the initial ligand pool may have a significant influence on the impact Ttot exerts on the overall selection pressure. In light of this, we revisit the topic to study this impact by varying both Ttot and the initial distribution. Fig. 4 and Movie S2 first show the dramatic impact of target concentration on selection dynamics. The results indicate that Ttot=104M (blue) provides optimal selection out of the three investigated target concentrations that use the initial Gaussian distribution N(4,0.4). To investigate the impact of Ttot more systematically, Fig. 5 shows the mean KD value of ligands selected after 20 cycles as a function of Ttot for 9 different initial distributions. Note that as the mean KD decreases, the average binding strength of the pool increases. Fig. 5 confirms that intermediate values of Ttot yield optimal selection. SI Appendix, Fig. S7 AC further shows that adding noise to the initial distributions introduces additional variability, but provides similar qualitative results. Interestingly, we find that different initial distributions can have very different optimal Ttot, stressing the importance of devising a strategy to mitigate this impact and thereby control the inherent uncertainty associated with the initial KD distribution.

Fig. 4.

Fig. 4.

Impact of target concentration on SELEX dynamics. Evolution of KD distribution for three different values of the target concentrations is shown. Under a high target concentration of Ttot=102M, the distribution shifts to the left and narrows, but does not skew toward high-affinity ligands. Additional skewing is achieved by reducing to Ttot=104M, which increases selection pressure by intensifying ligand competition. However, further reduction to Ttot=108M has the opposite effect and actually halts selection. In this case, the target concentration is so low that nonspecific ligand–substrate equilibria dominate selection dynamics and nullifies the selection pressure.

Fig. 5.

Fig. 5.

Optimal target concentrations strongly depend on assumed initial KD distribution. The plot shows the mean KD as a measure of pool binding strength for the SELEX pool at cycle 20 using different constant target concentration. Depending on the initial distribution of ligands, we find vastly different optimal target concentrations, i.e., concentrations with lower mean KD.

KS Dependence and Nonspecific Selection.

Our hybrid model has allowed us to explore the impact of the unknown initial KD distribution and the target concentration Ttot, which are both present in all SELEX protocols. However, our model additionally introduces a ligand–substrate interaction that has never before been studied and offers a unique opportunity to apply it toward more recent selection schemes aimed at small-molecule aptamer development (2628). We therefore extend our analysis to study uncertainties that govern an optimum KS, and observe how changes in KS impact selection dynamics for different KD distributions.

Fig. 6 and Movie S3 show the evolution of a single initial KD distribution for three different values of KS, showing an optimal outcome for KS=1012M (blue). Noting these dynamics, we next vary KS systematically and observe the mean KD value of ligands present at cycle 20 for 9 different initial KD distributions (Fig. 7 and SI Appendix, S7 DF). Similar to target concentration, we find an optimum in the intermediate ranges of KS and a clear dependence on the initial distribution. However, contrary to target concentration, the mean KD for smaller KS is relatively insensitive. Thus, these results suggest that a lower value of KS=1016M would provide similar results across a multitude of initial distributions.

Fig. 6.

Fig. 6.

Impact of KS on SELEX dynamics. The plot shows the evolution of KD distribution for three different values of KS. Similar to target concentration, we find an optimal outcome in the middle range (KS=1012M, blue), but the outcome for low KS is not as adverse as for low Ttot, because the distribution still shifts toward low KD with increasing cycles.

Fig. 7.

Fig. 7.

Optimal KS depends on initial distribution. Plot of mean KD for the SELEX pool at cycle 20 using different values of KS. Reducing KS from its optimal value does not increase the mean KD as strongly as a reduction of the target concentration from its optimum, as shown in Fig. 5.

As it pertains to small-molecule selection schemes, these results provide useful insights into the impact that substrate binding affinity has on selection efficiency, and may offer some guidance in the appropriate selection of a substrate material. The results also provide general insights into the impact of partitioning efficiency and nonspecific binding on selection across various initial distributions and suggest that a given partitioning efficiency or fraction of nonspecific selection can impact different initial distributions in vastly different ways.

Improving Selection Efficiency.

We have shown that the initial KD distribution has a tremendous impact on selection efficiency and plays a significant role in modulating the impact of experimental parameters such as Ttot and KS. These results highlight that whereas established protocols are expected to perform well for some distributions, they may perform moderately for others. To address this variability in outcomes, we finally explore strategies to mitigate these impacts using only the experimental parameters Ttot and KS. As a metric for our analysis, we introduce the quantity ϕ(c), which describes the fraction of ligands with KD<1010M at cycle c={1,,C}. Using this quantity, we further introduce two measures of efficiency: success probability Φ=ϕ(C) and success speed SC defined as the cycle c at which ϕ(c)=0.5ϕ(C).

We have seen that KS and Ttot play distinct roles in the evolutionary dynamics of the KD distribution. However, both parameters exhibit regimes of optimal selection that depend heavily on the initial distribution mean and width. Figs. 5 and 7 show that high values for Ttot and KS have a similar impact across all distributions, and suggest a conservative approach of beginning at these high values for the initial cycles. This reduces the risk of eliminating high-affinity, low copy number ligands early on. As these high-affinity ligands are amplified in subsequent rounds, Ttot and KS can be lowered to rapidly eliminate the remaining low-affinity ligands (SI Appendix, Figs. S3 and S4). Whereas ideas to lower the target concentrations have been discussed previously (26), our results indicate that other parameters such as KS can be tuned simultaneously to improve outcome across a multitude of initial distributions and stochastic conditions. Fig. 8 shows Φ and SC obtained from 50 Monte Carlo simulations of an improved protocol where both Ttot and KS are decreased over the cycles as described in SI Appendix, Table S2. These results are compared with the original protocol with constant values Ttot=104M and KS=1012M (26); SI Appendix, Fig. S5 shows ϕ(c) including the SDs. Using six different initial Gaussian distributions with noise added similar to Fig. 3, we observe that the improved protocol with decreasing Ttot and KS is faster and leads to a higher fraction of high-affinity binders than the original protocol. As an alternative metric of protocol performance, SI Appendix, Fig. S8 shows the evolution of mean KD across the cycles, and also introduces two alternative protocols where Ttot or KS are decreased faster than in the improved protocol. The results indicate that whereas faster decreases can further improve performance for some distributions, they may also lead to adverse outcome for others.

Fig. 8.

Fig. 8.

Plots comparing the fraction of high-affinity ligands Φ and speed SC of SELEX for six different KD distributions. The values are obtained from averaging 50 Monte Carlo simulations. We observe that decreasing Ttot and KS over the rounds will lead to a higher fraction of strong binders (here with KD<1010M) and will reach this fraction faster than when Ttot and Ks are kept constant.

Conclusions and Outlook

Deterministic models for SELEX have shed tremendous insight on the challenges faced in aptamer selection, but have been unable to capture its inherently uncertain nature. Here, we have presented a hybrid model that captures stochastic binding and furthermore incorporates noncovalent ligand–substrate immobilization. Using this framework, we have investigated previously unexplored questions including the role of the initial library KD distribution, impact of distribution noise, and the effect of these factors on the optimization of experimental parameters such as the total target concentration Ttot and the substrate dissociation constant KS.

The results of our modeling draw striking parallels to outcomes in evolutionary biology, where environmental parameters define a fitness landscape and competition can change this landscape to influence survival and reproduction (31). Within SELEX, ligands compete for target molecules to ensure survival into the next cycle, whereas substrate binding traps the ligands and leads to their removal. Reduction of target concentration can increase competition, but when few target molecules are present, even high-affinity binders are unlikely to find a target. Similar to competition in limited resources scenarios, we find that the chance of survival for even the highest affinity ligand strongly depends on the strengths of the other ligands present in the population. Our surprising finding that a handful of high-affinity ligands can outcompete a pool of 1015 ligands is also seen in evolutionary biology, where highly advantageous traits can quickly spread in a population, given the right conditions. The model enables one to identify the parameters impacting selection, and can thus be used to improve selection efficiency. A further important component of evolution in biological systems is mutations. Mutations in SELEX can also appear during PCR amplification, but usually lead to reduced affinities of the strongest aptamers (30), so we ignored them in our current approach. However, for some SELEX protocols, mutations can be beneficial to expand the experimental sampling space (32), and it may be interesting to extend our model to those protocols.

In summary, our model provides a better understanding of the impact of the uncertainties in SELEX, and how experimental parameters can be tuned to improve outcome and speed of this expensive and time-consuming protocol. We have demonstrated how optimization of the parameters can enhance selection efficiency of one protocol dramatically, and we envisage that simple adaptations of our model can be used to improve the many other established protocols, as well as guide the design of novel protocols, which aim to limit the impact of uncertainties in selection methods.

Supplementary Material

Supplementary File
Supplementary File
Download video file (583.5KB, avi)
Supplementary File
Download video file (798.7KB, avi)
Supplementary File
Download video file (805.5KB, avi)

Acknowledgments

We acknowledge the support of the National Cancer Institute Grant 5U01CA177799, Saving Lives at Birth Consortium, and US Pharmacopeia/Promoting the Quality of Medicines Cooperative Agreement. Z.B.W. is supported by National Institute of General Medical Sciences Training Program in Biomolecular Pharmacology Grant T32GM008541.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission. C.A.F. is a Guest Editor invited by the Editorial Board.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1605086113/-/DCSupplemental.

References

  • 1.Shangguan D, et al. Aptamers evolved from live cells as effective molecular probes for cancer study. Proc Natl Acad Sci USA. 2006;103(32):11838–11843. doi: 10.1073/pnas.0602615103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ferguson BS, et al. Real-time, aptamer-based tracking of circulating therapeutic agents in living animals. Sci Transl Med. 2013;5(213):213ra165. doi: 10.1126/scitranslmed.3007095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Keefe AD, Pai S, Ellington A. Aptamers as therapeutics. Nat Rev Drug Discov. 2010;9(7):537–550. doi: 10.1038/nrd3141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bunka DH, Stockley PG. Aptamers come of age - at last. Nat Rev Microbiol. 2006;4(8):588–596. doi: 10.1038/nrmicro1458. [DOI] [PubMed] [Google Scholar]
  • 5.McKeague M, Derosa MC. Challenges and opportunities for small molecule aptamer development. J Nucleic Acids. 2012;2012:748913. doi: 10.1155/2012/748913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Blind M, Blank M. Aptamer selection technology and recent advances. Mol Ther Nucleic Acids. 2015;4(1):e223. doi: 10.1038/mtna.2014.74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tuerk C, Gold L. Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science. 1990;249(4968):505–510. doi: 10.1126/science.2200121. [DOI] [PubMed] [Google Scholar]
  • 8.Ellington AD, Szostak JW. In vitro selection of RNA molecules that bind specific ligands. Nature. 1990;346(6287):818–822. doi: 10.1038/346818a0. [DOI] [PubMed] [Google Scholar]
  • 9.Luo X, et al. Computational approaches toward the design of pools for the in vitro selection of complex aptamers. RNA. 2010;16(11):2252–2262. doi: 10.1261/rna.2102210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cho M, et al. Quantitative selection and parallel characterization of aptamers. Proc Natl Acad Sci USA. 2013;110(46):18460–18465. doi: 10.1073/pnas.1315866110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Roh C, Kim SE, Jo SK. Label free inhibitor screening of hepatitis C virus (HCV) NS5B viral protein using RNA oligonucleotide. Sensors (Basel) 2011;11(7):6685–6696. doi: 10.3390/s110706685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Farokhzad OC, et al. Targeted nanoparticle-aptamer bioconjugates for cancer chemotherapy in vivo. Proc Natl Acad Sci USA. 2006;103(16):6315–6320. doi: 10.1073/pnas.0601755103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sefah K, et al. In vitro selection with artificial expanded genetic information systems. Proc Natl Acad Sci USA. 2014;111(4):1449–1454. doi: 10.1073/pnas.1311778111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Irvine D, Tuerk C, Gold L. SELEXION. Systematic evolution of ligands by exponential enrichment with integrated optimization by non-linear analysis. J Mol Biol. 1991;222(3):739–761. doi: 10.1016/0022-2836(91)90509-5. [DOI] [PubMed] [Google Scholar]
  • 15.Levine HA, Nilsen-Hamilton M. A mathematical analysis of SELEX. Comput Biol Chem. 2007;31(1):11–35. doi: 10.1016/j.compbiolchem.2006.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wang J, Rudzinski JF, Gong Q, Soh HT, Atzberger PJ. Influence of target concentration and background binding on in vitro selection of affinity reagents. PLoS One. 2012;7(8):e43940. doi: 10.1371/journal.pone.0043940. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chen CK, Kuo TL, Chan PC, Lin LY. Subtractive SELEX against two heterogeneous target samples: Numerical simulations and analysis. Comput Biol Med. 2007;37(6):750–759. doi: 10.1016/j.compbiomed.2006.06.015. [DOI] [PubMed] [Google Scholar]
  • 18.Cherney LT, Obrecht NM, Krylov SN. Theoretical modeling of masking DNA application in aptamer-facilitated biomarker discovery. Anal Chem. 2013;85(8):4157–4164. doi: 10.1021/ac400385v. [DOI] [PubMed] [Google Scholar]
  • 19.Seo YJ, Nilsen-Hamilton M, Levine HA. A computational study of alternate SELEX. Bull Math Biol. 2014;76(7):1455–1521. doi: 10.1007/s11538-014-9954-6. [DOI] [PubMed] [Google Scholar]
  • 20.Van Kampen NG. Stochastic Processes in Physics and Chemistry. Vol 1 North-Holland; Amsterdam: 1992. [Google Scholar]
  • 21.Gillespie DT. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22(4):403–434. [Google Scholar]
  • 22.Gillespie DT, Hellander A, Petzold LR. Perspective: Stochastic algorithms for chemical kinetics. J Chem Phys. 2013;138(17):170901. doi: 10.1063/1.4801941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sun F, Galas D, Waterman MS. A mathematical analysis of in vitro molecular selection-amplification. J Mol Biol. 1996;258(4):650–660. doi: 10.1006/jmbi.1996.0276. [DOI] [PubMed] [Google Scholar]
  • 24.Chen CK. Complex SELEX against target mixture: Stochastic computer model, simulation, and analysis. Comput Methods Programs Biomed. 2007;87(3):189–200. doi: 10.1016/j.cmpb.2007.05.008. [DOI] [PubMed] [Google Scholar]
  • 25.Gillespie DT. The chemical Langevin equation. J Chem Phys. 2000;113(1):297–306. [Google Scholar]
  • 26.Stoltenburg R, Nikolaus N, Strehlitz B. Capture-selex: Selection of DNA aptamers for aminoglycoside antibiotics. J Anal Methods Chem. 2012;2012:415697. doi: 10.1155/2012/415697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Park JW, Tatavarty R, Kim DW, Jung HT, Gu MB. Immobilization-free screening of aptamers assisted by graphene oxide. Chem Commun (Camb) 2012;48(15):2071–2073. doi: 10.1039/c2cc16473f. [DOI] [PubMed] [Google Scholar]
  • 28.Nguyen VT, Kwon YS, Kim JH, Gu MB. Multiple GO-SELEX for efficient screening of flexible aptamers. Chem Commun (Camb) 2014;50(72):10513–10516. doi: 10.1039/c4cc03953j. [DOI] [PubMed] [Google Scholar]
  • 29.Vant-Hull B, Payano-Baez A, Davis RH, Gold L. The mathematics of SELEX against complex targets. J Mol Biol. 1998;278(3):579–597. doi: 10.1006/jmbi.1998.1727. [DOI] [PubMed] [Google Scholar]
  • 30.Katilius E, Flores C, Woodbury NW. Exploring the sequence space of a DNA aptamer using microarrays. Nucleic Acids Res. 2007;35(22):7626–7635. doi: 10.1093/nar/gkm922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Nowak MA, Sigmund K. Evolutionary dynamics of biological games. Science. 2004;303(5659):793–799. doi: 10.1126/science.1093411. [DOI] [PubMed] [Google Scholar]
  • 32.Hoinka J, et al. Large scale analysis of the mutational landscape in HT-SELEX improves aptamer discovery. Nucleic Acids Res. 2015;43(12):5699–5707. doi: 10.1093/nar/gkv308. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File
Supplementary File
Download video file (583.5KB, avi)
Supplementary File
Download video file (798.7KB, avi)
Supplementary File
Download video file (805.5KB, avi)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES