Abstract
SFM4-3, KOD DGLNK, and Therminator polymerase are investigated for their compatibility with SELection with Modified Aptamers (SELMA), an aptamer discovery method that enables incorporation of large nucleobase modifications such as glycans. We demonstrated that with suitable modifications to the primer design and protocol, these enzymes are compatible with SELMA, enabling 2’-fluoro or 2’-methoxy ribose modifications at all positions. In the case of 2’-fluoro modifications, Therminator exhibits cleaner incorporation of an alkyne-modified nucleobase for click chemistry.
Aptamers are genetically encodable polymers, usually oligonucleotides, that have been selected in vitro from a random library to bind to a protein or molecule of interest.1-4 Oligonucleotide aptamers are typically 20-100 bases in length and can achieve high binding affinities for their targets due to their specific folding, shape complementarity and positioning of functional groups.5 Aptamers possess several advantages compared with antibodies, including straightforward synthesis and quality control, low immunogenicity6, 7, and small size.5, 8 Aptamers are typically discovered using SELEX (Systematic Evolution of Ligands by EXponential Enrichment), in which libraries of random DNA or RNA sequences flanked by constant primer regions are incubated with a target of interest and the bound sequences are amplified by PCR. Two aptamer therapeutics have been approved for clinical use. Pegaptanib targets vascular endothelial growth factor (VEGF) for the treatment of age-related macular degeneration.9, 10 Very recently in 2023, avacincaptad pegol, an aptamer targeting complement protein C5, was approved for the treatment of geographic atrophy (GA) secondary to age-related macular degeneration.11, 12
A barrier to the development of oligonucleotide aptamer therapeutics has been the instability of naked RNA or DNA to nucleases in serum.13, 14 To address this, aptamers have been extensively stabilized with modified nucleotides (XNA), such as fluoro- (2’-F-) or methoxy (2’-OMe-) ribose, which confer nuclease resistance.14 Pegaptanib and avacincaptad are illustrative, as they both bear 2’-F- or -OMe at nearly every position. However, these modifications typically must be introduced into natural DNA or RNA aptamers after selection by SELEX, and trial and error is necessary to find substitutions that do not compromise the binding activity of the selected molecule. Incorporation of XNA throughout the selection process would be more desirable and efficient but requires technical innovation, as XNA libraries may not be easily synthesized and/or PCR amplified by polymerases with limited tolerance for modified nucleoside triphosphates (NTPs).15-42
As a solution to this problem, our group has reported SELection of Modified Aptamers (SELMA, Figure 1)43-48, which allows for extensive modifications, not only of ribose, but of the nucleobase (e.g., oligosaccharide attachment) and includes a variant that can use 2’-F pyrimidine NTPs48 throughout the selection. In SELMA, PCR and/or RT-PCR of modified oligonucleotides is circumvented by tethering an unmodified, PCR-amplifiable DNA copy to each XNA sequence in the library.
Figure 1.

The SELection with Modified Aptamers (SELMA) construct contains a PCR amplifiable DNA genotype that is covalently linked to a phenotype strand containing XNA and/or 5-ethynyluridine modified by click chemistry.
However, this technique has not yet been adapted to selections with libraries containing the 2’-F modification on all four nucleotides, which would require an RNA or DNA polymerase capable of incorporating them. Although several polymerases are known to accommodate ribose-modified NTPs, the activity and fidelity of these enzymes remains poorly understood and difficult to assess by traditional sequencing methods. Ultimately, a better understanding of the performance of these enzymes is needed before they can be widely adopted for aptamer selection applications.
To this end, we have investigated three different polymerases capable of incorporating 2’-modified XNA into the SELMA protocol. Therminator DNA polymerase is a commercially available mutant of 9°N DNA polymerase that is able to incorporate a variety of non-canonical nucleic acids including 2’-F-modified NTPs (2’-F-NTPs).49 SFM4-3, which also incorporates 2’-F-NTPs, is a variant of the Stoffel fragment of TAQ DNA polymerase developed by Chen et al. using directed evolution.50 KOD DGLNK is a mutant KOD polymerase developed by Hoshino et al., that accepts both LNA and 2’-OMe-modified nucleotides.51 We purchased commercially available Therminator and expressed SFM4-3 and KOD DGLNK for generation of the ssXNA “phenotype” strand (G, Figure 2). Library construction begins with a short ssDNA strand containing the random library template (A), which is annealed to a phosphorylated overhanging “regeneration” primer (B). Bidirectional DNA polymerase extension creates a longer duplex (C) and lambda exonuclease digestion of the phosphorylated strand results in a single stranded hairpin (D). The 3’ end of D is self-priming and can be extended with XNA using one of the three above enzymes to afford double stranded hairpin (E). The XNA strand is then displaced (F) by treatment with a “hairpin poison primer” (HPP) and its extension with BST 2.0 to generate Form G. Compared with the DNA/DNA duplex of our original SELMA method, this relatively long primer is required for the strand displacement of the stronger 2’-F-RNA-DNA interaction.35, 52, 53 The resulting Form G library contains modified ssXNA that can adopt a sequence-dependent fold and can undergo selection towards a target, while PCR amplification of the tethered dsDNA in the bound library fraction affords a new template in double stranded Form H. The use of the biotinylated primer in PCR allows for strand separation and a return to ssDNA library template (Form A).
Figure 2.

XNA SELMA Library build. Blue = constant primer sites; red = random region; black = constant hairpin region; orange = XNA.
For KOD DGLNK and SFM4-3, some optimization was done to characterize the behavior of each enzyme at various temperatures. We found that, for our applications, increasing the extension temperature from previously reported conditions resulted in more efficient extension and higher yields of Form E (Figures S3, S4).50, 51 After determining the ideal extension conditions, each enzyme was used to form double stranded hairpin and subsequently strand displaced as shown in Figure 3.
Figure 3.

A) Scheme of hairpin extension and strand displacement. B) Denaturing PAGE gel of extended (lanes 1, 3 and 5) and displaced (lanes 2, 4, and 6) hairpin made with KOD DGLNK and MeO-RNA (lanes 1 and 2), SFM4-3 and F-RNA (lanes 3 and 4) and Therminator and F-RNA (lanes 5 and 6). The band below 50 bp in lane 4 is the NEB WarmStart aptamer.
To demonstrate compatibility of each enzyme with the SELMA workflow, we performed mock selections in which a known target sequence of 2’-F- or 2’-OMe-RNA was selected from a random library by hybridization to a biotinylated complementary oligonucleotide (Figure 4A). The target sequence contained an NheI restriction site, which enabled easy detection of the target sequence by the presence of four fragment bands after digestion with NheI (Figure 4B, Group 4). This sequence was spiked at a ratio of 1:1000 into N25 random libraries generated with 2’-F-RNA using Therminator or SFM4-3, or with 2’-OMe-RNA using KOD DGLNK. Prior to selection, the known sequence was not visible by gel analysis of the spiked libraries following NheI digest (e.g., Figure 4B, Group 2). Each library was then subjected to one round of selection. Species that bound to the biotinylated complementary oligonucleotide were recovered using streptavidin magnetic beads. Bound material was then PCR amplified and digested with NheI, resulting in the expected four small fragment bands in all three selections (Figure 4B, Groups 3, 5, 6). Thus, we observed clear enrichment of our target sequence after just one round of selection using libraries generated with each of the enzymes tested here. Control selections containing only random library (Figure 4B, Group 1) or only NheI (Figure 4B, Group 4) sequence were also performed in parallel. The successful mock selection demonstrates the capability of the SELMA method to evolve a fully fluorinated or fully methoxy library and generate enrichment without the need for a reverse transcription step.
Figure 4.

A) NheI selection construct design and enrichment test. B = biotin; SA = streptavidin B) Gel analysis of NheI-digested templates from selections. NheI-treated samples are indicated with a (+). Group 1 = random library post selection, Group 2 =1:1000 NheI template:random library template pre selection, Group 3 = 1:1000 NheI template:random library template post selection, Group 4 = NheI template only post selection. Groups 1-4 were prepared with SFM4-3. Groups 5 and 6 are post-selection libraries analogous to Group 3 but made with KOD DGLNK/MeO-NTPs Therminator polymerase/F-NTPs, respectively. Complete gels corresponding to Groups 5 and 6 can be found in Figure S5.
To investigate chemical functionalization of libraries, we focused on 2’-F-RNA and used either Therminator or SFM4-3 to generate an F-RNA strand containing three 5-ethynyluridine (fEdU)48 bases by primer extension against a 56-mer DNA template containing three adenines (Figure 5). Following CuAAC attachment of oligomannose (Man9) glycan,44 the results were visualized by gel, in which the functionalized F-RNA strand migrates higher due to added glycans. When Therminator was used to generate the F-RNA strand, the corresponding glyco-F-RNA migrates primarily as one band, corresponding to addition of three glycans, with a small amount of a lower band corresponding to incomplete functionalization. However, when SFM4-3 was used to generate the F-RNA, the glyco-F-RNA is a series of bands, corresponding to different numbers of attached glycans in some cases exceeding three. These results suggested substantial misincorporation of fEdU by SFM4-3 and have led us to focus on using Therminator for future libraries.54
Figure 5.

Generation of alkyne-containing F-RNA/DNA duplex, CuAAC modification with Man9-azide glycan, and gel electrophoresis. Lanes 1 and 3 = product of primer extension with Therminator and SFM4-3 respectively, in the presence of f(A,G,C,EdU) triphosphates. Lanes 2 and 4 = Therminator and SFM4-3-derived oligos after click functionalization to attach Man9 glycan.
Conclusions
In conclusion, we have described several modifications to the SELMA methodology that allow for the incorporation and in vitro selection of fully 2’-modified XNA without the need for reverse transcription or post-SELEX medicinal chemistry optimization to incorporate nuclease resistant modifications. Additionally, we have demonstrated that fully 2’-modified XNA SELMA constructs can be successfully enriched towards a known target. This expansion of SELMA capabilities will simplify discovery of aptamers that could be directly used in the presence of nucleases. We have shown that the 2’F-RNA can be chemically modified by CuAAC, and we have found in the process that Therminator is less prone to misincorporation of fEdU than SFM4-3. Selection applications of SELMA with XNA and CuAAC modifications are currently underway with carbohydrate binding targets. Using this approach, we will develop potent modulators of clinically relevant carbohydrate binding proteins that are directly ready for use in vivo.
Supplementary Material
‡Supplementary Information available: [details of any supplementary information available should be included here]. See DOI: 10.1039/x0xx00000x
Acknowledgements
This work was supported by NIH Grants R01GM151492 (I.J.K.), R01GM127920 (L.C.H-W. and I.J.K.), and 5T32GM139798 (K.B.F) as well as the NSF Graduate Research Fellowship No. 2139752 (E.D.Z.). I.J.K acknowledges the NIH Shared Instrumentation grant S10OD034395 for purchase of an NMR spectrometer. We would like to thank Dr. Floyd Romesburg for the plasmid of SFM4-3 and Dr. Satoshi Obika for the plasmid of KOD DGLNK. Dr. Richard Redman and Dr. Lizbeth Hedstrom are gratefully acknowledged for helpful discussions.
Footnotes
Conflicts of interest
The authors declare no competing interests.
Data availability
The derived data supporting these findings can be found in the ESI‡. The raw data generated by Azenta Life Sciences and the python scripts to analyze this data can be found at https://github.com/emilyziperman/Polymerase-Fidelity-Analysis
References
- 1.Ellington AD and Szostak JW, Nature, 1990, 346, 818–822. [DOI] [PubMed] [Google Scholar]
- 2.Gold L, Janjic N, Jarvis T, Schneider D, Walker JJ, Wilcox SK and Zichi D, Cold Spring Harbor Perspectives in Biology, 2012, 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Stoltenburg R, Reinemann C and Strehlitz B, Biomolecular Engineering, 2007, 24, 381–403. [DOI] [PubMed] [Google Scholar]
- 4.Tuerk C and Gold L, Science, 1990, 249, 505–510. [DOI] [PubMed] [Google Scholar]
- 5.Thiviyanathan V and Gorenstein DG, PROTEOMICS – Clinical Applications, 2012, 6, 563–573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ophthalmology, 2003, 110, 979–986. [DOI] [PubMed] [Google Scholar]
- 7.Group ES, Retina, 2002, 22, 143–152.11927845 [Google Scholar]
- 8.Di Mauro V, Lauta FC, Modica J, Appleton SL, De Franciscis V and Catalucci D, JACC Basic Transl Sci, 2024, 9, 260–277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gragoudas ES, Adamis AP, Cunningham ET Jr., Feinsod M and Guyer DR, NEJM, 2004, 351, 2805–2816. [DOI] [PubMed] [Google Scholar]
- 10.Lee JH, Canny MD, De Erkenez A, Krilleke D, Ng YS, Shima DT, Pardi A and Jucker F, Proc. Natl. Acad. Sci. U. S. A, 2005, 102, 18902–18907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jaffe GJ, Westby K, Csaky KG, Monés J, Pearlman JA, Patel SS, Joondeph BC, Randolph J, Masonson H and Rezaei KA, Ophthalmology, 2021, 128, 576–586. [DOI] [PubMed] [Google Scholar]
- 12.Mullard A, Nature Reviews Drug Discovery, 2023, 22, 774. [DOI] [PubMed] [Google Scholar]
- 13.Healy JM, Lewis SD, Kurz M, Boomer RM, Thompson KM, Wilson C and McCauley TG, Pharmaceutical Research, 2004, 21, 2234–2246. [DOI] [PubMed] [Google Scholar]
- 14.Kratschmer C and Levy M, Nucleic Acid Therapeutics, 2017, 27, 335–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lin Y, Qiu Q, Gill SC and Jayasena SD, Nucleic Acids Res, 1994, 22, 5229–5234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jhaveri S, Olwin B and Ellington AD, Bioorganic & Medicinal Chemistry Letters, 1998, 8, 2285–2290. [DOI] [PubMed] [Google Scholar]
- 17.Burmeister PE, Lewis SD, Silva RF, Preiss JR, Horwitz LR, Pendergrast PS, McCauley TG, Kurz JC, Epstein DM, Wilson C and Keefe AD, Chem. Biol, 2005, 12, 25–33. [DOI] [PubMed] [Google Scholar]
- 18.Ichida JK, Zou K, Horhota A, Yu B, McLaughlin LW and Szostak JW, J Am Chem Soc, 2005, 127, 2802–2803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Minakawa N, Sanji M, Kato Y and Matsuda A, Bioorganic and Medicinal Chemistry, 2008, 16, 9450–9456. [DOI] [PubMed] [Google Scholar]
- 20.Pinheiro VB, Taylor AI, Cozens C, Abramov M, Renders M, Zhang S, Chaput JC, Wengel J, Peak-Chew S-Y, McLaughlin SH, Herdewijn P and Holliger P, Science, 2012, 336, 341–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yu H, Zhang S, Dunn MR and Chaput JC, J Am Chem Soc, 2013, 135, 3583–3591. [DOI] [PubMed] [Google Scholar]
- 22.Kuwahara M and Obika S, Artif DNA PNA XNA, 2013, 4, 39–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Taylor AI, Pinheiro VB, Smola MJ, Morgunov AS, Peak-Chew S, Cozens C, Weeks KM, Herdewijn P and Holliger P, Nature, 2015, 518, 427–430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Friedman AD, Kim D and Liu R, Biomaterials, 2015, 36, 110–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Taylor AI and Holliger P, Nature Protocols, 2015, 10, 1625–1642. [DOI] [PubMed] [Google Scholar]
- 26.Elle IC, Karlsen KK, Terp MG, Larsen N, Nielsen R, Derbyshire N, Mandrup S, Ditzel HJ and Wengel J, Molecular BioSystems, 2015, 11, 1260–1270. [DOI] [PubMed] [Google Scholar]
- 27.Alves Ferreira-Bravo I, Cozens C, Holliger P and DeStefano JJ, Nucleic Acids Res, 2015, 43, 9587–9599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Thirunavukarasu D, Chen T, Liu Z, Hongdilokkul N and Romesberg FE, Journal of the American Chemical Society, 2017, 139, 2892–2895. [DOI] [PubMed] [Google Scholar]
- 29.Liu Z, Chen T and Romesberg FE, Chemical Science, 2017, 8, 8179–8182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Taylor AI and Holliger P, Current Protocols in Chemical Biology, 2018, 10, e44. [DOI] [PubMed] [Google Scholar]
- 31.Rangel AE, Chen Z, Ayele TM and Heemstra JM, Nucleic Acids Research, 2018, 46, 8057–8068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Arangundy-Franklin S, Taylor AI, Porebski BT, Genna V, Peak-Chew S, Vaisman A, Woodgate R, Orozco M and Holliger P, Nature Chemistry, 2019, 11, 533–542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Eremeeva E, Fikatas A, Margamuljana L, Abramov M, Schols D, Groaz E and Herdewijn P, Nucleic Acids Res, 2019, 47, 4927–4939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Rose KM, Alves Ferreira-Bravo I, Li M, Craigie R, Ditzler MA, Holliger P and DeStefano JJ, ACS Chemical Biology, 2019, 14, 2166–2175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Dunn MR, McCloskey CM, Buckley P, Rhea K and Chaput JC, Journal of the American Chemical Society, 2020, 142, 7721–7724. [DOI] [PubMed] [Google Scholar]
- 36.Hoshino H, Kasahara Y, Kuwahara M and Obika S, Journal of the American Chemical Society, 2020, 142, 21530–21537. [DOI] [PubMed] [Google Scholar]
- 37.Houlihan G, Arangundy-Franklin S, Porebski BT, Subramanian N, Taylor AI and Holliger P, Nature Chemistry, 2020, 12, 683–690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhang L and Chaput JC, Molecules, 2020, 25, 4194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Shao Q, Chen T, Sheng K, Liu Z, Zhang Z and Romesberg FE, Journal of the American Chemical Society, 2020, 142, 2125–2128. [DOI] [PubMed] [Google Scholar]
- 40.Dey S and Sczepanski JT, Nucleic Acids Research, 2020, 48, 1669–1680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Alves Ferreira-Bravo I and DeStefano JJ, Viruses, 2021, 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wu KB, Skrodzki CJA, Su Q, Lin J and Niu J, Chem. Sci, 2022, 13, 6873–6881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Macpherson IS, Temme JS, Habeshian S, Felczak K, Pankiewicz K, Hedstrom L and Krauss IJ, Angewandte Chemie International Edition, 2011, 50, 11238–11242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Temme JS, Drzyzga MG, Macpherson IS and Krauss IJ, Chemistry – A European Journal, 2013, 19, 17291–17295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Temme JS, Macpherson IS, Decourcey JF and Krauss IJ, Journal of the American Chemical Society, 2014, 136, 1726–1729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Temme JS and Krauss IJ, Current Protocols in Chemical Biology, 2015, 7, 73–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Macpherson IS, Temme JS and Krauss IJ, Chemical Communications, 2017, 53, 2878–2881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Redman RL and Krauss IJ, Journal of the American Chemical Society, 2021, 143, 8565–8571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gardner AF, Jackson KM, Boyle MM, Buss JA, Potapov V, Gehring AM, Zatopek KM, Corrêa IR Jr., Ong JL and Jack WE, Front. Mol. Biosci, 2019, 6, 28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Chen T, Hongdilokkul N, Liu Z, Adhikary R, Tsuen SS and Romesberg FE, Nature Chemistry, 2016, 8, 556–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hoshino H, Kasahara Y, Fujita H, Kuwahara M, Morihiro K, Tsunoda S-I and Obika S, Bioorganic & Medicinal Chemistry Letters, 2016, 26, 530–533. [DOI] [PubMed] [Google Scholar]
- 52.Patra A, Paolillo M, Charisse K, Manoharan M, Rozners E and Egli M, Angewandte Chemie International Edition, 2012, 51, 11863–11866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Martin-Pintado N, Deleavey GF, Portella G, Campos-Olivas R, Orozco M, Damha MJ and González C, Angewandte Chemie International Edition, 2013, 52, 12065–12068. [DOI] [PubMed] [Google Scholar]
- 54.Error rates for F-RNA synthesis by each these two enzymes were investigated. However, due to the large error rate of reverse transcription to produce DNA for sequencing, we were unable to discern the specific F-RNA synthesis error rates.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The derived data supporting these findings can be found in the ESI‡. The raw data generated by Azenta Life Sciences and the python scripts to analyze this data can be found at https://github.com/emilyziperman/Polymerase-Fidelity-Analysis
