Abstract
Synthetic biology seeks to probe fundamental aspects of biological form and function by construction [i.e., (re)synthesis] rather than deconstruction (analysis). In this sense, biological sciences now follow the lead given by the chemical sciences. Synthesis can complement analytic studies but also allows novel approaches to answering fundamental biological questions and opens up vast opportunities for the exploitation of biological processes to provide solutions for global problems. In this review, we explore aspects of this synthesis paradigm as applied to the chemistry and function of nucleic acids in biological systems and beyond, specifically, in genome resynthesis, synthetic genetics (i.e., the expansion of the genetic alphabet, of the genetic code, and of the chemical make-up of genetic systems), and the elaboration of orthogonal biosystems and components.
Keywords: synthetic biology, orthogonality, genome synthesis, synthetic genetics, XNA, unnatural base pairs
1. Introduction
The intricate molecular machinery of living cells and organisms is controlled by a central information system based on the nucleic acids DNA and RNA, as conceptualized in the central dogma of molecular biology. In this review, we seek to provide an overview of current synthetic biology approaches to engineering, controlling, and expanding the central dogma via use of synthetic approaches for the reassignment of the informational units and exploration of alternative chemistries for their ability to support life or provide expanded functionalities.
Arguably the biggest impact of such synthetic approaches has come from the advances enabled by the ever-increasing efficiency and scale of the classic phosphoramidite DNA synthesis of Caruthers (13). Together with more sophisticated DNA assembly methods, these advances have begun to enable the engineering and wholesale rewriting of genetic information at the genome level (Figure 1). We discuss recent advances in genome resynthesis specifically for codon reassignment. Furthermore, we highlight possible routes toward the generation of biosystems whose genomes and/or their readout are informationally or chemically orthogonal, providing a genetic firewall against unwanted release of engineered organisms.
Figure 1.
Synthetic genomes: design, construction, and repurposing. Three main genome resynthesis projects have been applied in Mycoplasma mycoides, Escherichia coli, and Saccharomyces cerevisiae, respectively, with sizes ranging from 1 Mbp to 12 Mbp. Work on Syn1.0 established the hierarchical construction workflow to synthesize the M. mycoides genome, from oligonucleotides to larger fragments that were eventually assembled into the 1 Mbp genome. With the bottom-up method, a genome of half the size—Syn3.0—was constructed to study what genes are essential for maintaining a near-minimal cell. In the synthetic genome for E. coli, two serine codons and a stop codon were removed and replaced by their isoforms, resulting in the Syn61 strain with 61 codons. Further deletion of the corresponding tRNAs enabled resistance to viral infection and provided orthogonal tRNAs with space for heteropolymer synthesis. The synthetic yeast project includes the swap of a stop codon, removal of repetitive elements, and addition of recombinase recognition sites. Various rearrangements and genome diversity can be generated by SCRaMbLE, enabling directed evolution of the synthetic strain for downstream applications. Abbreviations: RF1, release factor 1; tRNA, transfer RNA.
2. Genome Synthesis
In 2010, the J.C. Venter Institute successfully transplanted an entirely chemically synthesized genome of Mycoplasma mycoides to a Mycoplasma capricolum recipient cell, which then developed into M. mycoides progeny; this experiment conclusively established the primacy of genotype in determining phenotype (35). This pioneering work established the feasibility of artificial genome synthesis and introduced several technological advances, such as the hierarchical in vitro and in vivo assembly strategy from which subsequent synthetic genome projects benefited, including the minimal genome JCVI-syn3.0 (531 kbp, truncated from the JCVI-syn1.0 of 1,079 kbp) (47). Importantly, when—during the design-build-test cycle—it was found that not all of the (proposed) nonessential genes could be deleted, the synthesis and assembly platform technology enabled a systematic study and evaluation of the essential genes and encoded functions that appeared to be essential to maintaining cellular life.
Two other powerful applications of whole-genome synthesis (or rewriting) are genetic code expansion (GCE) and reassignment, both of which have the ultimate aim of converting cellular protein translation into a general system for the sequence-defined synthesis of artificial polymers. GCE originally relied on amber suppression, based on the pioneering work of Schultz and colleagues (117). This strategy requires aminoacyl-transfer RNA (tRNA)-synthetase (aaRSx)–tRNA (tRNAx) pairs that are (a) orthogonal (x) to the tRNAs and aaRSs in the target organism (i.e., are not charged by a noncognate aaRS or do not charge noncognate tRNAs), (b) engineered for the orthogonal aaRSx to load a defined unnatural amino acid (uAA) (X) onto the tRNAx, and (c) engineered so that the tRNAx recognizes a stop codon, thus in effect reassigning the stop codon to the uAA. GCE has undergone many improvements and expansions with regard to the types of amino acids that are suitable for incorporation, resulting in widespread applications (16). However, the scope of amber suppression is inherently limited by competition with the natural (stop) function. Engineering of release factor 1 (RF1) (96) or the ribosome for enhanced amber suppression (116) can alleviate some of these issues.
Genome-wide codon reassignment, which relies on the genetic engineering or de novo synthesis of organisms with a compressed genetic code, has been transformative for the field. By systematically replacing target codons with synonymous codons, the degeneracy of the genetic code can be leveraged to generate synthetic organisms with free codons. Initially, multiplex automated genome engineering (MAGE) was used to remove all amber codons from the Escherichia coli genome by iterative mutagenesis and the deletion of release factor 1 (RF1) (48) to free up the amber codon for reassignment. More recently, replicon excision for enhanced genome engineering through programmed recombination (REXER) (115), an advanced new method for genome engineering, enabled the rational identification of specific code compression schemes and the resynthesis of the 4-Mbp genome of E. coli (31) (Figure 1), freeing up three codons for reassignment (89).
Wholesale genome resynthesis is also advancing at pace in a eukaryotic organism. The Sc 2.0 project aims to resynthesize the chromosomes of the yeast Saccharomyces cerevisiae, applying three design principles: TAG stop codons are swapped to TAA, and synonymous mutations are introduced to generate distinguishing watermarks; repetitive elements and introns are removed, while the tRNA genes are relocated to create the plus one neochromosome; and recombinase-recognizable loxPsym sites are added between nonessential genes to enable genome rearrangement and evolution (the SCRaMbLE system) (25). By 2017, 6.5 chromosomes had been synthesized separately, and approximate wild-type phenotypes were observed with these strains (88). The SCRaMbLE system showed its power as a directed evolution platform to evolve yeast strains with improved fitness and productivity (10, 62). The complete synthesis of the 16 yeast chromosomes in a whole synthetic Sc 2.0 strain could also be combined with other intriguing work that fused all chromosomes into one (100) to test the limits of genomic engineering in yeast. All of these advances have spurred the even more ambitious undertaking of a “human genome project-write” for human genome synthesis (11). The effective implementation of genome resynthesis and engineering at this scale will likely require significant advances in high-fidelity DNA synthesis and the handling of even larger DNA fragments for the construction of gigabase-scale genomes. On a smaller scale, the power of synthetic genomics was illustrated in the rapid (1-week) assembly of the complete genome of SARS-CoV-2 (111).
3. Synthesis Of Orthogonal Biosystems
Genome resynthesis provides an opportunity for the wholesale reconfiguration of biological systems, for example, in the pursuit of orthogonal biosystems, i.e., systems that are sufficiently decoupled from natural or host organism biology that they can function and evolve independently (59). In light of the ever-increasing possibilities and extent of genome engineering and modification, possible effects of an unplanned release of engineered lifeforms into the environment and their potential interactions with natural life require serious consideration (112). Thus, the establishment of effective safety measures is of utmost importance. Orthogonal biosystems, as defined above, which rely on divergent genetic information and a dedicated set of engineered components for propagation and/or expression, together with engineered differences in metabolism or the reliance on building blocks not found in nature, have great potential as biosafety systems, as orthogonality is not easily reverted by mutational drift.
Orthogonal genetic elements such as T7 RNA polymerase have been used for decades, often merely for the sake of convenience in the absence of cross-reactivity with the host chassis; the same motivation sparked the application of an orthogonal hypermutation-enabling DNA polymerase in yeast (85). A broader motivation for orthogonality, however, arises from the desire to prevent biological interaction on an ecological scale and thereby achieve pathogen resistance and/or biocontainment of non-natural biological systems (70, 95).
3.1. Semantic Orthogonality
Because virtually all strategies for the prevention of unintentional environmental proliferation (56) can theoretically be escaped by mutational drift and evolution, successful strategies must aim for the minimization of the likelihood of evolutionary escape. While biosafety systems such as kill switches or metabolic auxotrophy may be overcome by only a few mutations or gene transfer events, reconfiguration of the genetic system, e.g., by genetic code reassignment (semantic orthogonality) (Figure 2; see also Figure 4 below) potentially enables a virtually unbreachable informational firewall (86).
Figure 2.
Semantic versus alphabetic orthogonality. In semantic orthogonality, after genome resynthesis with code compression, an orphan codon (TCG, previously coding for Ser) is reassigned to an unnatural amino acid (uAA) (left) (unnatural or engineered components are designated with *). Alphabetic orthogonality requires genetic alphabet expansion by genomic insertion, replication, and transcription of a UBP X:Y (creating the new codon dGXC) and a CYG anticodon transfer RNA (tRNA). This also requires import of (d)XTP [(d)YTP] for replication (transcription), as well as potentially (engineered, marked by *) DNA and RNA polymerases and ribosome (right). Both require an (engineered) orthogonal tRNA synthetase (uaaRS*):tRNA pair assigned to the new codon and uAA import.
Figure 4.
Orthogonality concepts. Genetic containment, i.e., inability to transfer genes to and from natural E. coli cells (center), may be achieved via semantic orthogonality (code reassignment; top left), alphabetic orthogonality (genetic alphabet expansion; bottom left), or (stereo)chemical orthogonality [(stereo)chemical divergence of genetic polymers; right].
A key step toward the achievement of semantic orthogonality was the development of orthogonal ribosomes (o-ribs). These allow translation of dedicated orthogonal messenger RNAs (mRNAs) (not translated by the wild-type ribosome), which in turn frees up the o-rib for engineering without toxicity effects. This strategy has enabled the evolution of o-ribs with enhanced amber suppression (83, 116) or an ability to translate four base (quadruplet) codons, which can potentially free up new codon space for assignment of more than one uAA (77). In eukaryotes, orthogonal translation has even been shown to be implemented as a function of spatial partitioning within engineered membraneless organelles (87).
Resynthesis and codon reassignment of the E. coli genome can aid in both genetic code expansion and the establishment of orthogonality. Going beyond single (amber) codon removal and synthetic auxotrophy by establishing cell dependence on uAAs (69, 92), genome resynthesis with code compression has provided a striking validation of the potential of semantic orthogonality. By (a) replacing two isoacceptor serine codons, TCG and TCA, and the TAG stop codon with the synonymous codons AGC, AGT, and TAA, respectively, and (b) concurrently removing the corresponding serine tRNAs and RF1, three new codons were freed up for genetic code expansion. This gave rise to an organism (Syn61Δ3) with a compressed 61-codon genetic code (31) that is orthogonal to gene transfer (e.g., phage infection) from the 64-codon biosphere (89). However, it could be argued that its orthogonality remains one sided, as gene transfer from Syn61 to the biosphere is in principle still possible. Complete genetic isolation could for instance be achieved via reassignment of the vacant codons to three new uAAs, obtaining a new divergent genetic code.
Another form of such semantic orthogonality (Figure 2) may involve the (partial) rewriting of the code by codon expansion to quadruplet codons, potentially enabling a 256-codon code. While there appear to be no fundamental obstacles to the translation of at least some quadruplet codons within mixed triplet codes or in series (19, 22), the need for the elaboration of large numbers of mutually orthogonal quadruplet tRNA–aaRS pairs, as well as the issue of avoidance of code ambiguity (i.e., triplet or quadruplet register), poses obstacles to the expanded use of quadruplet codons.
In the future, a larger number of reassigned codons localized in essential genes, together with orthogonal translation that is dependent on the supply of uAAs or uAA precursors (93), should provide a near-unbreachable informational firewall between the natural and orthogonal genetic systems and thus a large degree of genetic isolation from the biosphere. Below, we discuss other opportunities for orthogonality based on expanded chemistries providing informational, chemical, or stereochemical divergence.
3.2. Alphabetic Orthogonality: Unnatural Base Pairs
While genome resynthesis with code compression may also be leveraged to reduce ambiguity in quadruplet translation, alternative approaches such as the synthesis and introduction of new orthogonal base pairs [also called unnatural base pairs (UBPs)] constitute a promising alternative (Figure 2). Indeed, genetic alphabet expansion by one or two orthogonal new base pairs potentially yields as many as 216 (1 UBP) or 512 (2 UBPs) new triplet codons (and thousands of potential quadruplet codons). Even considering likely codon degeneracy, such an approach would potentially enable an explosive expansion of the chemical repertoire of translation.
However, major obstacles must be overcome to realize this vision: (a) The unnatural nucleotide bases and their activated triphosphates need to be available in the cell at sufficiently high levels without causing toxicity (i.e., they must be orthogonal at both the nucleoside and nucleotide level); (b) DNA and RNA polymerase(s) must enable their replication and transcription with high efficiency and fidelity; and (c) the ribosome must be able to correctly translate the new UBP-containing codons to incorporate a defined amino acid into the growing polypeptide chain, which also requires the presence of UBP-tRNAs and cognate aaRSs capable of loading UBP-tRNA specifically with the appropriate uAA (Figure 2). The encountered challenges are paradigmatic of the general introduction of expanded chemistries into organisms and are therefore discussed in detail below.
UBP designs fall broadly into three categories, and all three have been efficiently replicated in vitro. One design, pioneered by Benner and colleagues (8, 101, 127), builds on purine- and pyrimidine-isosteric heterocycles, in which the hydrogen bond donors or acceptors have been scrambled to yield four potential new base pairs. Among these UBPs with alternative H-bond patterns, the P:Z pair is the most extensively characterized, closely followed by the B:S pair; these two pairs have been combined with the natural pairs in a new eight-letter hachimoji alphabet (42). An alternative approach from the Hirao group uses base pairs interacting mostly via shape complementarity, with the most promising results obtained with the Ds:Px pair (40, 124). The third type of UBP, developed by Romesberg and colleagues (74), relies on hydrophobic interactions and shape complementarity, with 5SICS:NaM and TPT3:NaM as the best-characterized pairs (57, 66) (Figure 3).
Figure 3.
Unnatural base pair (UBP) chemistries. The UBP design concepts shown are H-bond reassignment (Z:P and S:B; left), shape complementarity (Ds:Px; right), and hydrophobic interactions (NaM:5SICS and NaM:TPT3; bottom).
UBPs from all three base pairing types can be replicated with high fidelity in vitro, as demonstrated in polymerase chain reaction (PCR) amplifications of appropriate templates (51, 68, 126). However, for in vivo applications, only the hydrophobic d5SICS:dNaM and dTPT3:dNaM pairs have been explored to date. Despite their divergent structure from the natural bases, DNA templates containing hydrophobic base pair variants, such as d5SICS:dNaM, are processed by natural polymerases in vitro, with only slightly lower fidelities compared to natural substrates for the best-processed template sequences (68). Higher fidelities of up to 99.9% and reduced sequence dependency can be achieved by employing binary polymerase blends such as OneTaq to optimize the ratio of synthesis and repair via exonuclease activity in the test tube (66). Interestingly, while these hydrophobic bases intercalate within the DNA duplex (67), when replicated, the polymerase active site appears to enforce the natural edge-on structure (9), which may be a prerequisite for replication. Indeed, using a less intercalating UBP such as dNaM paired with the less aromatic dTPT3 further improves both incorporation and extension past the UBP (57). Sequence-dependent replication efficiencies are also observed for the dDs:dPx pair; however, these require at least six standard bases between subsequent Ds for successful replication (51). The reasons why consecutive hydrophobic bases are poorly replicated are not well studied but may include higher structural flexibility (due to an absence of interstrand H-bonding); general distortions of the double helix (72); and potentially a tendency to form looped-out structures by interleaving self-pairs, as has been suggested for the hydrophobic universal base 5-nitroindole (63).
Replicating hydrophobic UBPs in vivo requires an endogenous E. coli polymerase or introduction of specific polymerases, such as the Klenow fragment of E. coli DNA polymerase I, that are capable of replicating the d5SICS:dNaM pair in vitro (98). Although both d5SICS:dNaM and dTPT3:dNaM were successfully replicated in E. coli on a plasmid where the UBP is located in a fragment designed to be processed by DNA polymerase I (65, 131, 132), it was later shown to be DNA polymerase II and DNA polymerase III that were critical for replication in vivo (55). This clearly demonstrates the feasibility of in vivo replication of UBPs integrated into natural DNA. Major challenges to address will include the incorporation of multiple UBPs and measures to prevent UBP loss over time, which may closely correlate with the availability of unnatural triphosphates in vivo (65).
Although they have not been probed for in vivo applications to date, the Benner group’s dZ and dP nucleobases are promising candidates for the replication of multiple UBPs in series, as they are more compatible with the natural DNA architecture due to their nearly isosteric structure and similar H-bonding recognition principles (34). Indeed, they are recognized as substrates by members of both the A and B polymerase families (128). However, the polymerases differ in their efficiency and fidelity when dealing with the unnatural bases, especially when encountering multiple UBPs in series. Notably, Taq and Phusion polymerases are, without additional engineering, capable of replicating DNA containing stretches of up to four copies of these unnatural nucle-obases with error rates as low as 0.2% under optimized conditions (125). The major route for base pair loss is the pH-dependent mispairing of dZ with dG. Importantly, the mutation rate is bidirectional, i.e., dZ can also be gained, yielding an evolvable six-nucleotide system (125). Based on these promising results, it will be very interesting to evaluate the replication of this six-letter alphabet in vivo. The ability of polymerases to copy stretches of up to four unnatural bases should in principle allow for the replication of plasmids with a six-letter alphabet (125).
3.2.1. Availability of unnatural triphosphate nucleobases in vivo
Both replication and transcription of UBPs require the presence of the corresponding nucleoside triphosphates at sufficiently high concentrations in the cell. As a reference, physiological concentrations of the natural deoxyribonucleoside triphosphates (dNTPs) range from 5 μM for dGTP to 37 μM for dTTP (113). In the absence of established metabolic pathways to generate the desired unnatural nucleotides directly in vivo, growth media need to be supplemented with presynthesized unnatural nucleosides or nucleotides, and these require efficient cellular uptake.
The most explored route in E. coli relies on direct administration of the triphosphates to the growth medium and uptake via specialized transporters like the nucleoside triphosphate transporter of the algae Phaeodactylum tricornutum (PtNTT2). This strategy enabled the import of d5SICS and dNaM triphosphates and allowed for the propagation of a single UBP over several generations in vivo, generating the first semisynthetic organism with an expanded genetic alphabet (65). Yet some challenges remain: The rapid dephosphorylation of triphosphates both in the media and in the cytosol resulted in intracellular depletion and UBP loss during replication (65), and transporter expression appeared to be toxic and thus inhibited growth. Strategies for increasing triphosphate availability include (a) supplementation of the medium with potassium phosphate, which acts as a competitive inhibitor for triphosphate-degrading phosphatases (65); (b) chemical modification of the unnatural nucleotides to increase phosphatase resistance but retain polymerase recognition (e.g., via substitution of oxygen with a difluoromethylene unit between the β- and γ-phosphate groups for dTPT3; 29); and (c) transporter engineering to reduce toxicity and increase expression (and thus nucleotide import) via removal of an N-terminal signal peptide and chromosomal integration (131).
A very promising approach for selection and optimization of nucleoside triphosphate transporters was developed recently for dTTP uptake (81). Transporters were chromosomally integrated in place of ompT, an outer membrane protease, and uptake of the triphosphate was coupled with cell survival in strains auxotrophic for dTTP. This design resulted in the discovery that the nucleoside triphosphate transporter from Thalassiosira pseudonana (TpNTT2) imports triphosphates (including some unnatural xNTPs) more efficiently and is less toxic for E. coli than PtNTT2.
Ideally, however, unnatural nucleobases would be administered as nucleosides, which are thought to enter cells via either passive diffusion or active transport by bacterial nucleoside transporters such as NupC or Tsx (17, 129). Once in the cytoplasm, stepwise phosphorylation by kinases of the nucleotide salvage pathway would be required to generate the nucleoside triphosphates. Rate limiting in the phosphorylation cascade is generally the first step in generating the 5′-monophosphate, and this step is catalyzed by deoxynucleoside kinases (2). While most organisms possess highly specific variants of this kinase, Drosophila melanogaster has a variant with broader substrate specificity, which can phosphorylate all four natural and various unnatural nucleosides, including analogs of d5SICS and dNaM (123), as well as dP and its partner dZ upon introduction of a single point mutation (15). The remaining steps to generate the di- and triphosphate, catalyzed by mono- and diphosphate kinases, respectively, have not been thoroughly characterized, but these enzymes are generally thought to be more promiscuous than deoxynucleoside kinases. Indeed, E. coli nucleoside diphosphate kinase accepts dZDP and dPDP as substrates to generate triphosphates that are directly utilized by DNA polymerases (73). Thus, a system may be envisioned whereby endogenous, engineered, or recombinant kinases generate the required triphosphates directly from the unnatural nucleosides. This alternative approach involves considerably more cellular engineering but also represents the first step toward engineered in vivo biosynthesis and processing of unnatural nucleobases and fully autotrophic future organisms with expanded genetic alphabets.
3.2.2. Transcription of unnatural base pairs
T7 RNA polymerase (T7Rp) accepts various unnatural nucleobases as substrates, including 5SICS and NaM (99), as well as Pa and Ds (39). Similarly, templates containing up to three copies of dZ and dP (54) or an eight-letter genetic alphabet employing the Z:P and B:S pairs have been transcribed, the latter requiring an engineered T7Rp variant (42). Despite reduced in vitro transcription efficiency, recombinant expression of T7Rp in E. coli proved sufficient to generate mRNA from a plasmid encoding sfGFP with a single copy of the unnatural nucleobase dNaM for subsequent protein synthesis (see below), provided that the corresponding unnatural ribonucleoside triphosphates were added to the media in the presence of the engineered nucleotide transporter (132). Furthermore, S. cerevisiae RNA polymerase II also accepts NaM and TPT3 as substrates to produce full-length RNA transcripts, with considerably higher selectivity against natural nucleotide misincorporation if dTPT3 is present in the DNA template than if it is absent (79). This result paves the way for transcription of UBPs using endogenous RNA polymerases in both prokaryotes and eukaryotes.
3.2.3. Translation of unnatural base pairs
In the early 1990s, the Benner group first demonstrated successful translation of iso-C and iso-G UBPs in vitro, observing incorporation of the uAA iodotyrosine with yields of 90%, exceeding those resulting from amber suppression (7). Thus, the ribosome is clearly capable of translating UBP-containing codons, provided that a specifically pairing tRNA is present. Building on this result, the Hirao group transcribed and translated the ds:dy UBP in a one-pot reaction in vitro to produce human Ras protein containing the uAA 3-chlorotyrosine (41), demonstrating that the ribosome is also able to process UBPs based on shape complementarity.
Recently, UBP translation of sfGFP was described in vivo in E. coli by Romesberg and colleagues (132) from a plasmid containing a single copy of the dTPT3:dNAM UBP, combining all previously identified requirements—heterologous expression of an optimized nucleoside triphosphate transporter, T7Rp, and addition of the unnatural (d)NaM and (d)TPT3 triphosphates to the growth media—with the addition of a plasmid coding for an engineered tRNA containing TPT3 in the anticodon loop. Moreover, introduction of an orthogonal pair of TPT3-containing tRNA and a corresponding tRNA synthetase supported incorporation of a uAA to truly expand the genetic code with the newly generated A(NaM)C or G(NaM)C codons.
Theoretically, the resulting six-letter genetic alphabet gives rise to 152 new codons. However, an initial screen of possible codons containing a maximum of one UBP identified just nine that resulted in successful protein production, suggesting constraints on UBP usage, such as a requirement for these UBPs to be located at the second codon position and the presence of at least one G:C pair (30). Yet those nine codons were decoded with high efficiency (96%) (30), demonstrating the potential of the UBP approach for genetic code expansion. These results are a promising start toward achieving the ambitious goal of creating semisynthetic organisms with divergent six- or even eight-letter genetic alphabets.
3.3. Synthetic Genetics: Xeno-Nucleic Acids
Beyond such semantic or alphabetic orthogonality, one might ask if other forms of orthogonality might be developed, e.g., based on chemical divergence of the genetic system. One such approach has been pursued by Marlière and colleagues (71), who developed an E. coli strain in which dT was replaced genome-wide by the isostere 5-chlorouracil (5CU) and that, through continuous growth and evolution in the presence of 5CU, became dependent on it for growth. Due to the close similarity of T and 5CU, it is likely that this approach is only a first step to full genetic isolation (incomplete orthogonality; Figure 4), likely requiring a further expansion of the chemical divergence of such genomes.
More radical variations to the canonical chemistry of nucleic acids have been explored in efforts to define and understand the fundamental chemical and physicochemical parameters that underpin their capacity for genetic information storage and propagation (108). Apart from illuminating the profound influence of nucleic acid chemistry on function, these studies have yielded a large number of variations on the canonical (deoxy)ribonucleic acid structure used in nature that are (to varying degrees) capable of specific homo- or heteroduplex formation, genetic information storage, propagation, and evolution. These synthetic modified nucleic acids include an ever-increasing range of modified nucleic acid scaffolds, comprising sugar ring congeners and altered backbones (even including completely uncharged backbone chemistries) (32). For simplicity, we refer to all such synthetic modified nucleic acids as xeno-nucleic acids (XNAs), even if some chemical variations, such as 2′-O-methyl (2’OMe)-RNA or αP-S (phosphorothioate), can occur in nature (20, 118).
In recent decades, there has been growing interest in XNAs for applications in biotechnology, nanotechnology, and medicine, in particular as scaffolds for the emerging field of nucleic acid therapeutics. Several XNA modifications have already been approved for human use by the US Food and Drug Administration and confer beneficial physicochemical, pharmacokinetic, and pharmacodynamic properties, including biostability due to nuclease resistance, increased in vivo half-life, and increased antisense-binding affinity in the case of antisense oligonucleotides (ASOs) (114). However, the scope of XNAs is potentially much wider, as the capacity of nucleic acids (including XNAs) to fold into defined three-dimensional shapes allows the de novo discovery of ligands (XNA aptamers), catalysts (XNAzymes), and defined nanotechnology objects (105) and devices, raising the possibility that these too can be applied clinically.
3.3.1. Xeno-nucleic acid ligands (aptamers)
XNA aptamers have been discovered for a range of XNA chemistries through the same process used for DNA and RNA aptamers, systematic evolution of ligands by exponential enrichment (SELEX) (122). In the case of XNAs, SELEX of XNA aptamers or XNAzymes (106) relies on the synthesis of (XNA) libraries by (engineered) polymerases (45), selection of the desired phenotype through physical separation of successful sequences, reverse transcription by (engineered) reverse transcriptases (RTs), and amplification for subsequent sequencing or entering a new selection cycle. So-called slow off-rate modified aptamers (SOMAmers) bearing (hydrophobic) substituents at the pyrimidine C5 position enabled discovery of aptamer arrays against a wide range of targets that are promising for diagnostic biomarker monitoring (36). However, for fully backbone-modified XNA aptamers with sugar congeners like α-L-threofuranosyl nucleic acid (TNA), 2′-fluoro-arabinonucleic acid (FANA), and 1′ 5′-anhydrohexitol nucleic acid (HNA), engineered polymerases are required to polymerize some xNTPs, i.e., tNTPs (78), faNTPs (82), and hNTPs (82), respectively. The full SELEX cycle also requires RTs to convert XNA back into DNA. While some XNA chemistries, like TNA (23) and FANA (119), can be reverse transcribed by naturally occurring RTs or their engineered forms, such as Moloney murine leukemia virus (M-MuLV) RT (available commercially as Superscript I-IV) or Bst DNA polymerase (available commercially as RTx) (80), most XNA chemistries require bespoke RT engineering to enable efficient reverse transcription (46, 82).
TNA aptamers were selected to bind programmed death-ligand 1 (58) or HIV-1 RT (24, 75), as well as small-molecule targets like ochratoxin A (84) and adenosine triphosphate (130). FANA aptamers binding HIV-1 RT (3), integrase (91), and the receptor-binding domain of the SARS-CoV-2 Spike protein (4) have been developed, showcasing once more the ease of aptamer selection against naturally nucleic acid–binding proteins. In addition to the original anti-HIV TAR and anti-Hen Egg Lysozyme aptamers (82), HNA aptamers with a high affinity for vascular endothelial growth factor (VEGF) have recently been described (27). Finally, even uncharged backbone chemistries like P-alkyl phosphonate nucleic acids (phNAs) have been evolved into aptamers (5).
3.3.2. In vitro applications of xeno-nucleic acids: XNA catalysts (XNAzymes)
Nucleic acids can fold into three-dimensional structures in a way that renders them active catalysts. A classic example is the 10–23 DNAzyme, which cleaves RNA in a Mg2+ -dependent manner (94). Because of its potential promise as a gene-silencing agent, the 10-23 DNAzyme has been intensively studied. However, it has broadly failed to have clinical impact, possibly due to its dependence on unphysiological concentrations of Mg2+ for both folding and catalysis. Indeed, a recent nuclear magnetic resonance analysis of the 10–23 structure confirmed its structural and conformational plasticity and the central role of Mg2+ ions in shaping the transition from the inactive precat-alytic to the catalytic complex (12). Chemical modifications have been explored in an effort to enhance the properties of the 10–23 DNAzyme. For example, the study discussed above identified a 6-thio-dG14 modification that increased the cleavage rate (kobs) approximately sixfold. Similarly, FANA modifications at G2 and U8 in the catalytic core and full FANA-substitution of the RNA-target binding arms, in combination with two additional flanking TNA nucleotides, yielded a modified X10–23 with improved catalytic rate, biostability, and in vivo gene-silencing activity (120). Fully modified XNAzymes have also been obtained, including functionally analogous RNA-endonuclease FANAzymes (109, 119). In both cases, antisense effects likely make significant contributions to the apparent in vivo gene-silencing activity of the chimeric DNA-FANA (X10-23) or of all-FANAzymes (107, 110), as both DNA and FANA (and in particular mixed FANA-DNA oligonucleotides; 103) are able to efficiently recruit RNase H (the cellular enzyme that cleaves DNA-RNA heteroduplexes) (18). These effects may be disentangled by the study of RNA endonuclease XNAzymes elaborated in chemistries that do not recruit RNase H, such as 2’OMe-RNA (2′OMezymes) (33) or TNAzymes (121). With increasing structural and functional understanding of RNA endonuclease XNAzymes, there is hope that their strengths, i.e., their easy programmability, single-nucleotide specificity, and small size (enabling chemical synthesis) can be leveraged for their development as effective gene-silencing agents.
3.3.3. In vivo applications of xeno-nucleic acids
ASOs are emerging therapeutic modalities that have also greatly benefited from expanded nucleic acid chemistries. Natural chemistry-based RNA ASOs like small interfering RNAs (siRNAs), siRNAs with several chemical modifications that confer nuclease resistance, and DNA-based—usually phosphorothioated (αP-S)—ASOs, sometimes with flanking chemical modifications to make biostable gapmers, are part of the established ASO technology toolbox (21, 114). DNA gapmers recruit RNase H to degrade bound RNAs and rely on a central (αP-S)-DNA stretch crucial for RNase H recognition. The only XNA chemistry found to be able to elicit RNase H cleavage on a bound RNA is FANA (see above), but it is not as nuclease resistant as αP-S-DNA (18). Many other, more divergent chemistries, such as 2′-fluoro, 2′OMe, 2′-O-(2-methoxyethyl) (MOE), locked or bridged nucleic acids (LNAs/BNAs), and constrained ethyl nucleic acid (cEt), including fully modified (αP-S)-MOE-RNA and phospho-rodiamidate morpholino oligomer (PMO) ASOs, are used in the development of novel antisense medicines (114).
The repeating negative charges on the nucleic acid backbone inhibit passive transport across cell membranes. Recent endeavors to address this challenge target the phosphodiester backbone directly and introduce uncharged linkages, as in alkyl phosphonate nucleic acids (phNA) and phosphoryl guanidine oligonucleotides (PGOs) (49) or even positively charged guanidine-bridged nucleic acids (GuNA) (102, 104). While these approaches are elegant solutions to a physicochemical problem, they are still limited by larger problems like antisense-binding affinity to RNA targets, cost-efficient availability of starting materials, and ease of synthesis. While many of these chemistries have been explored by solid-phase synthesis, increasing success in polymerase engineering has enabled enzymatic synthesis of several of these chemistries, such as 2’OMe-RNA, phNA, MOE-RNA, and LNA (or blends there of) (5, 33, 38, 44), potentially enabling future rapid iterations of ASO sequence and chemistries to accelerate discovery.
3.3.4. Toward chemical orthogonality
XNAs introduced into cells (e.g., as described above for UBPs) could increase cellular function and information content in several ways. Alternative backbone chemistries, when synthesized inside the cell, could be designed to form specific XNA aptamers, XNAzymes, or ASOs (see above) that affect cellular regulation, e.g., by modifying protein interactions or modulating gene expression. XNA backbones (such as αP-S) can also affect decoding efficiency, especially on the level of translation (50, 52). However, a fundamental issue with most XNA chemistries is that, while they are divergent from the canonical DNA and RNA chemistry, they may be insufficiently orthogonal to avoid interference with the cellular nucleotide metabolism. Indeed, while UBPs appear to be largely nontoxic, the case of genomic rNTP incorporation indicates that even sporadic incorporation of noncognate nucleotides can lead to genetic instabilities (133). One potential approach to isolate XNAs further from the cellular metabolism would be to increase chemical divergence, and thereby orthogonality, by engineering XNA poly-merases for the acceptance of alternative leaving group chemistries (37) or uncharged backbone chemistries (5). Orthogonal in vivo XNAs may also require orthogonal replication, e.g., via the recruitment of a dedicated polymerase to the specific locus.
An interesting strategy for bio-orthogonal genetic isolation exploits stereochemical rather than chemical divergence. Orthogonal genetic mirror-image elements would be exceptionally suited for biocontainment strategies, since there is little crosstalk between proteins or nucleic acids of opposite chirality (6, 76). Progress in this area is based on the development of tools for the templated synthesis, replication, and evolution of L-DNA and -RNA (the mirror-image form of the naturally occurring D-DNA and -RNA) by mirror-image polymerases (elaborated in D-amino acids) (28, 97). A clear advantage of this method is that—provided the mirror-image enzymes can be made synthetically (or by a future mirror-image ribosome)—no further engineering is required. However, at the cellular level, it is unclear how such enantiomeric orthogonality could be established in stages, which would require either the full implementation of a mirror-image cellular metabolism (or at least genetic system) or the engineering of dedicated cellular components, e.g., polymerases, for the polymerization and processing of L-DNA and -RNA.
Other approaches to further chemical orthogonality, for example, based on XNA chemistries like homo-DNA, pRNA, or XyloNA (14, 26, 53) (Figure 5), which are unable to cross-pair stably with the natural nucleic acids, seem possible but would be even more challenging to implement. Steric orthogonality, whereby chemical orthogonality arises from significant steric divergence from the natural nucleic acids, such as in the expanded nucleic acids (xDNA, yDNA) of Kool and colleagues (60, 61, 64) or the fat and skinny DNAs of Benner and colleagues (43), may be another possibility, but these chemistries would likely not be fully orthogonal, as both x(y)DNA and fat (skinny) DNA maintain pairing with the natural bases. In all of these cases, efficient enzymatic synthesis and replication, let alone translation, remain to be established.
Figure 5.
Xeno-nucleic acids (XNAs) with potential for chemical orthogonality. (a,b) Two different approaches to steric orthogonality either by (a) expansion of natural bases or (b) new base pairs between either pyrimidine-like (skinny) or purine-like (fat) DNA bases. (c) Chemical orthogonality.
Thus, in terms of chemical orthogonality, the challenge is paradoxical in the sense that chemistries that can be accepted and processed by the extant cellular machinery (to some extent) are likely toxic, while those that diverge sufficiently to be fully orthogonal require such extensive cellular re-engineering as to be currently impractical. It would therefore seem that chemical, steric, or stereochemical orthogonality, i.e., orthogonal genetic systems based on XNA or mirror chemistries, may find more immediate utility in in vitro applications and in the context of the emerging field of synthetic biomaterials and cells (1, 90). Synthetic cells are based on a much-simplified synthetic cellular machinery, which would be much more amenable to wholesale engineering or replacement. Thus, synthetic cells comprising chemically orthogonal XNA genomes and/or transcripts might provide both a useful test bed for the various synthetic genetic concepts outlined above (including the step-wise development toward in vivo applications), as well as providing an opportunity to implement additional layers of control and added functionalities in synthetic cells and biomaterials.
4. Conclusion
By expanding the central dogma and extending the boundaries of life both at the informational and at the chemical level, the concepts outlined in this review promise not only to augment our abilities for biological engineering, but also to extend them to studying and understanding biological function and evolution at a deeper level by opening up fundamental aspects of the informational organization and chemical makeup of life for investigation.
Acknowledgments
This work was supported by the Medical Research Council (MRC) as part of United Kingdom Research and Innovation (also known as UK Research and Innovation) (MRC program grant MC_U105178804 to P.H.), by a Leverhulme Postdoctoral fellowship (ECF-2021-387 to K.G.), a Ph.D. fellowship from the Boehringer Ingelheim Fonds (to N.F.), a research collaboration between AstraZeneca UK Limited and the Medical Research Council (MRC-AstraZeneca Blue Sky Grant to W.L. and Y.L.), and an EMBO long-term Postdoctoral Fellowship (ALTF 648-202 to M.J.L.J.F.). For the purpose of open access, the MRC Laboratory of Molecular Biology has applied a CC BY public copyright license to any Author Accepted Manuscript version arising. We thank our colleague Daniel Dunkelmann (MRC LMB) for helpful comments on the manuscript.
Footnotes
Disclosure Statement
The authors are not aware of any affiliations, memberships, funding, or financial holdings that might be perceived as affecting the objectivity of this review.
Literature Cited
- 1.Adamala KP, Martin-Alarcon DA, Guthrie-Honea KR, Boyden ES. Engineering genetic circuit interactions within and between synthetic minimal cells. Nat Chem. 2017;9:431–39. doi: 10.1038/nchem.2644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Al-Madhoun SA, Tjarks W, Eriksson S. The role of thymidine kinases in the activation of pyrimidine nucleoside analogues. Mini-Rev Med Chem. 2004;4:341–50. doi: 10.2174/1389557043403963. [DOI] [PubMed] [Google Scholar]
- 3.Alves Ferreira-Bravo I, Cozens C, Holliger P, DeStefano JJ. Selection of 2’-deoxy-2’-fluoroarabinonucleotide (FANA) aptamers that bind HIV-1 reverse transcriptase with picomolar affinity. Nucleic Acids Res. 2015;43:9587–99. doi: 10.1093/nar/gkv1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Alves Ferreira-Bravo I, DeStefano JJ. Xeno-nucleic acid (XNA) 2’-fluoro-arabino nucleic acid (FANA) aptamers to the receptor-binding domain of SARS-CoV-2 S protein block ACE2 binding. Viruses. 2021;13:1983. doi: 10.3390/v13101983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Arangundy-Franklin S, Taylor AI, Porebski BT, Genna V, Peak-Chew S, et al. A synthetic genetic polymer with an uncharged backbone chemistry based on alkyl phosphonate nucleic acids. Nat Chem. 2019;11:533–42. doi: 10.1038/s41557-019-0255-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ashley GW. Modeling, synthesis, and hybridization properties of (L)-ribonucleic acid. J Am Chem Soc. 1992;114:9731–36. [Google Scholar]
- 7.Bain JD, Switzer C, Chamberlin R, Benner SA. Ribosome-mediated incorporation of a non-standard amino acid into a peptide through expansion of the genetic code. Nature. 1992;356:537–39. doi: 10.1038/356537a0. [DOI] [PubMed] [Google Scholar]
- 8.Benner SA, Karalkar NB, Hoshika S, Laos R, Shaw RW, et al. Alternative Watson-Crick synthetic genetic systems. Cold Spring Harb Perspect Biol. 2016;8:a023770. doi: 10.1101/cshperspect.a023770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Betz K, Malyshev DA, Lavergne T, Welte W, Diederichs K, et al. KlenTaq polymerase replicates unnatural base pairs by inducing a Watson-Crick geometry. Nat Chem Biol. 2012;8:612–14. doi: 10.1038/nchembio.966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Blount BA, Gowers GF, Ho JCH, Ledesma-Amaro R, Jovicevic D, et al. Rapid host strain improvement by in vivo rearrangement of a synthetic yeast chromosome. Nat Commun. 2018;9:1932. doi: 10.1038/s41467-018-03143-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Boeke JD, Church G, Hessel A, Kelley NJ, Arkin A, et al. The Genome Project-Write. Science. 2016;353:126–27. doi: 10.1126/science.aaf6850. [DOI] [PubMed] [Google Scholar]
- 12.Borggrafe J, Victor J, Rosenbach H, Viegas A, Gertzen CGW, et al. Time-resolved structural analysis of an RNA-cleaving DNA catalyst. Nature. 2022;601:144–49. doi: 10.1038/s41586-021-04225-4. [DOI] [PubMed] [Google Scholar]
- 13.Caruthers MH. Gene synthesis machines: DNA chemistry and its uses. Science. 1985;230:281–85. doi: 10.1126/science.3863253. [DOI] [PubMed] [Google Scholar]
- 14.Chaput JC, Herdewijn P, Hollenstein M. Orthogonal genetic systems. ChemBioChem. 2020;21:1408–11. doi: 10.1002/cbic.201900725. [DOI] [PubMed] [Google Scholar]
- 15.Chen F, Zhang Y, Daugherty AB, Yang Z, Shaw R, et al. Biological phosphorylation of an unnatural base pair (UBP) using a Drosophila melanogaster deoxynucleoside kinase (DmdNK) mutant. PLOS ONE. 2017;12:e0174163. doi: 10.1371/journal.pone.0174163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chin JW. Expanding and reprogramming the genetic code. Nature. 2017;550:53–60. doi: 10.1038/nature24031. [DOI] [PubMed] [Google Scholar]
- 17.Craig JE, Zhang Y, Gallagher MP. Cloning of the nupC gene of Escherichia coli encoding a nucleoside transport system, and identification of an adjacent insertion element, IS 186. Mol Microbiol. 1994;11:1159–68. doi: 10.1111/j.1365-2958.1994.tb00392.x. [DOI] [PubMed] [Google Scholar]
- 18.Damha MJ, Wilds CJ, Noronha A, Brukner I, Borkow G, et al. Hybrids of RNA and arabinonucleic acids (ANA and 2’F-ANA) are substrates of ribonuclease H. J Am Chem Soc. 1998;120:12976–77. [Google Scholar]
- 19.DeBenedictis EA, Carver GD, Chung CZ, Söll D, Badran AH. Multiplex suppression of four quadruplet codons via tRNA directed evolution. Nat Commun. 2021;12:5706. doi: 10.1038/s41467-021-25948-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dimitrova DG, Teysset L, Carre C. RNA 2’-O-methylation (Nm) modificationinhuman diseases. Genes. 2019;10:117. doi: 10.3390/genes10020117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Duffy K, Arangundy-Franklin S, Holliger P. Modified nucleic acids: replication, evolution, and next-generation therapeutics. BMC Biol. 2020;18:112. doi: 10.1186/s12915-020-00803-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Dunkelmann DL, Oehm SB, Beattie AT, Chin JW. A 68-codon genetic code to incorporate four distinct non-canonical amino acids enabled by automated orthogonal mRNA design. Nat Chem. 2021;13:1110–17. doi: 10.1038/s41557-021-00764-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dunn MR, Chaput JC. Reverse transcription of threose nucleic acid by a naturally occurring DNA polymerase. ChemBioChem. 2016;17:1804–8. doi: 10.1002/cbic.201600338. [DOI] [PubMed] [Google Scholar]
- 24.Dunn MR, McCloskey CM, Buckley P, Rhea K, Chaput JC. Generating biologically stable TNA aptamers that function with high affinity and thermal stability. J Am Chem Soc. 2020;142:7721–24. doi: 10.1021/jacs.0c00641. [DOI] [PubMed] [Google Scholar]
- 25.Dymond JS, Richardson SM, Coombes CE, Babatz T, Muller H, et al. Synthetic chromosome arms function in yeast and generate phenotypic diversity by design. Nature. 2011;477:471–76. doi: 10.1038/nature10403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Egli M, Pallan PS, Pattanayek R, Wilds CJ, Lubini P, et al. Crystal structure of homo-DNA and nature’s choice of pentose over hexose in the genetic system. J Am Chem Soc. 2006;128:10847–56. doi: 10.1021/ja062548x. [DOI] [PubMed] [Google Scholar]
- 27.Eremeeva E, Fikatas A, Margamuljana L, Abramov M, Schols D, et al. Highly stable hexitol based XNA aptamers targeting the vascular endothelial growth factor. Nucleic Acids Res. 2019;47:4927–39. doi: 10.1093/nar/gkz252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Fan CY, Deng Q, Zhu TF. Bioorthogonal information storage in L-DNA with a high-fidelity mirror-image Pfu DNA polymerase. Nat Biotechnol. 2021;39:1548–55. doi: 10.1038/s41587-021-00969-6. [DOI] [PubMed] [Google Scholar]
- 29.Feldman AW, Dien VT, Romesberg FE. Chemical stabilization of unnatural nucleotide triphosphates for the in vivo expansion of the genetic alphabet. J Am Chem Soc. 2017;139:2464–67. doi: 10.1021/jacs.6b12731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fischer EC, Hashimoto K, Zhang Y, Feldman AW, Dien VT, et al. New codons for efficient production of unnatural proteins in a semisynthetic organism. Nat Chem Biol. 2020;16:570–76. doi: 10.1038/s41589-020-0507-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Fredens J, Wang K, de la Torre D, Funke LFH, Robertson WE, et al. Total synthesis of Escherichia coli with a recoded genome. Nature. 2019;569:514–18. doi: 10.1038/s41586-019-1192-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Freund N, Fürst MJLJ, Holliger P. New chemistries and enzymes for synthetic genetics. Curr Opin Biotechnol. 2022;74:129–36. doi: 10.1016/j.copbio.2021.11.004. [DOI] [PubMed] [Google Scholar]
- 33.Freund N, Taylor AI, Arangundy-Franklin S, Subramanian N, Peak-Chew SY, et al. A two-residue nascent-strand steric gate controls synthesis of 2’-O-methyl-and 2’-O-(2-methoxyethyl)-RNA. Nat Chem. 2023;15:91–100. doi: 10.1038/s41557-022-01050-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Georgiadis MM, Singh I, Kellett WF, Hoshika S, Benner SA, Richards NGJ. Structural basis for a six nucleotide genetic alphabet. J Am Chem Soc. 2015;137:6947–55. doi: 10.1021/jacs.5b03482. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gibson DG, Glass JI, Lartigue C, Noskov VN, Chuang RY, et al. Creation of a bacterial cell controlled by a chemically synthesized genome. Science. 2010;329:52–56. doi: 10.1126/science.1190719. [DOI] [PubMed] [Google Scholar]
- 36.Gold L, Ayers D, Bertino J, Bock C, Bock A, et al. Aptamer-based multiplexed proteomic technology for biomarker discovery. PLOS ONE. 2010;5:e15004. doi: 10.1371/journal.pone.0015004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Herdewijn P, Marlière P. Redesigning the leaving group in nucleic acid polymerization. FEBS Lett. 2012;586:2049–56. doi: 10.1016/j.febslet.2012.02.033. [DOI] [PubMed] [Google Scholar]
- 38.Hervey JRD, Freund N, Houlihan G, Dhaliwal G, Holliger P, Taylor AI. Efficient synthesis and replication of diverse sequence libraries composed of biostable nucleic acid analogues. RSC Chem Biol. 2022;3:1209–15. doi: 10.1039/d2cb00035k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hirao I, Kimoto M, Mitsui T, Fujiwara T, Kawai R, et al. An unnatural hydrophobic base pair system: site-specific incorporation of nucleotide analogs into DNA and RNA. Nat Methods. 2006;3:729–35. doi: 10.1038/nmeth915. [DOI] [PubMed] [Google Scholar]
- 40.Hirao I, Kimoto M, Yamashige R. Natural versus artificial creation of base pairs in DNA: origin of nucleobases from the perspectives of unnatural base pair studies. Acc Chem Res. 2012;45:2055–65. doi: 10.1021/ar200257x. [DOI] [PubMed] [Google Scholar]
- 41.Hirao I, Ohtsuki T, Fujiwara T, Mitsui T, Yokogawa T, et al. An unnatural base pair for incorporating amino acid analogs into proteins. Nat Biotechnol. 2002;20:177–82. doi: 10.1038/nbt0202-177. [DOI] [PubMed] [Google Scholar]
- 42.Hoshika S, Leal NA, Kim M-J, Kim M-S, Karalkar NB, et al. Hachimoji DNA and RNA: agenetic system with eight building blocks. Science. 2019;363:884–87. doi: 10.1126/science.aat0971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hoshika S, Singh I, Switzer C, Molt RW, Jr, Leal NA, et al. “Skinny” and “fat” DNA: two new double helices. J Am Chem Soc. 2018;140:11655–60. doi: 10.1021/jacs.8b05042. [DOI] [PubMed] [Google Scholar]
- 44.Hoshino H, Kasahara Y, Kuwahara M, Obika S. DNA polymerase variants with high processivity and accuracy for encoding and decoding locked nucleic acid sequences. J Am Chem Soc. 2020;142:21530–37. doi: 10.1021/jacs.0c10902. [DOI] [PubMed] [Google Scholar]
- 45.Houlihan G, Arangundy-Franklin S, Holliger P. Engineering and application of polymerases for synthetic genetics. Curr Opin Biotechnol. 2017;48:168–79. doi: 10.1016/j.copbio.2017.04.004. [DOI] [PubMed] [Google Scholar]
- 46.Houlihan G, Arangundy-Franklin S, Porebski BT, Subramanian N, Taylor AI, Holliger P. Discovery and evolution of RNA and XNA reverse transcriptase function and fidelity. Nat Chem. 2020;12:683–90. doi: 10.1038/s41557-020-0502-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hutchison CA, 3rd, Chuang RY, Noskov VN, Assad-Garcia N, Deerinck TJ, et al. Design and synthesis of a minimal bacterial genome. Science. 2016;351:aad6253. doi: 10.1126/science.aad6253. [DOI] [PubMed] [Google Scholar]
- 48.Isaacs FJ, Carr PA, Wang HH, Lajoie MJ, Sterling B, et al. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science. 2011;333:348–53. doi: 10.1126/science.1205822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kanarskaya MA, Golyshev VM, Pyshnyi DV, Lomzov AA. Structure and hybridization properties of phosphoryl guanidine oligonucleotides under crowding conditions. Biochem Biophys Res Commun. 2021;577:110–15. doi: 10.1016/j.bbrc.2021.09.001. [DOI] [PubMed] [Google Scholar]
- 50.Kawaguchi D, Kodama A, Abe N, Takebuchi K, Hashiya F, et al. Phosphorothioate modification of mRNA accelerates the rate of translation initiation to provide more efficient protein synthesis. Angew Chem Int Ed. 2020;59:17403–7. doi: 10.1002/anie.202007111. [DOI] [PubMed] [Google Scholar]
- 51.Kimoto M, Kawai R, Mitsui T, Yokoyama S, Hirao I. An unnatural base pair system for efficient PCR amplification and functionalization of DNA molecules. Nucleic Acids Res. 2008;37:e14. doi: 10.1093/nar/gkn956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kore AR, Charles I. Synthesis and evaluation of 2’-O-allyl substituted dinucleotide cap analog for mRNA translation. Bioorg Med Chem. 2010;18:8061–65. doi: 10.1016/j.bmc.2010.09.013. [DOI] [PubMed] [Google Scholar]
- 53.Krishnamurthy R, Pitsch S, Minton M, Miculka C, Windhab N, Eschenmoser A. Pyranosyl-RNA: base pairing between homochiral oligonucleotide strands of opposite sense of chirality. Angew Chem Int Ed. 1996;35:1537–41. [Google Scholar]
- 54.Leal NA, Kim H-J, Hoshika S, Kim M-J, Carrigan MA, Benner SA. Transcription, reverse transcription, and analysis of RNA containing artificial genetic components. ACS Synth Biol. 2015;4:407–13. doi: 10.1021/sb500268n. [DOI] [PubMed] [Google Scholar]
- 55.Ledbetter MP, Karadeema RJ, Romesberg FE. Reprogramming the replisome of a semisynthetic organism for the expansion of the genetic alphabet. J Am Chem Soc. 2018;140:758–65. doi: 10.1021/jacs.7b11488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Lee JW, Chan CTY, Slomovic S, Collins JJ. Next-generation biocontainment systems for engineered organisms. Nat Chem Biol. 2018;14:530–37. doi: 10.1038/s41589-018-0056-x. [DOI] [PubMed] [Google Scholar]
- 57.Li L, Degardin M, Lavergne T, Malyshev DA, Dhami K, et al. Natural-like replication of an unnatural base pair for the expansion of the genetic alphabet and biotechnology applications. J Am Chem Soc. 2014;136:826–29. doi: 10.1021/ja408814g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Li X, Li Z, Yu H. Selection of threose nucleic acid aptamers to block PD-1/PD-L1 interaction for cancer immunotherapy. Chem Commun. 2020;56:14653–56. doi: 10.1039/d0cc06032a. [DOI] [PubMed] [Google Scholar]
- 59.Liu CC, Jewett MC, Chin JW, Voigt CA. Toward an orthogonal central dogma. Nat Chem Biol. 2018;14:103–6. doi: 10.1038/nchembio.2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Liu HB, Gao JM, Lynch SR, Saito YD, Maynard L, Kool ET. A four-base paired genetic helix with expanded size. Science. 2003;302:868–71. doi: 10.1126/science.1088334. [DOI] [PubMed] [Google Scholar]
- 61.Liu HB, Gao JM, Maynard L, Saito YD, Kool ET. Toward a new genetic system with expanded dimensions: size-expanded analogues of deoxyadenosine and thymidine. J Am Chem Soc. 2004;126:1102–9. doi: 10.1021/ja038384r. [DOI] [PubMed] [Google Scholar]
- 62.Liu W, Luo Z, Wang Y, Pham NT, Tuck L, et al. Rapid pathway prototyping and engineering using in vitro and in vivo synthetic genome SCRaMbLE-in methods. Nat Commun. 2018;9:1936. doi: 10.1038/s41467-018-04254-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Loakes D, Hill F, Brown DM, Salisbury SA. Stability and structure of DNA oligonucleotides containing non-specific base analogues. J Mol Biol. 1997;270:426–35. doi: 10.1006/jmbi.1997.1129. [DOI] [PubMed] [Google Scholar]
- 64.Lu HG, He KZ, Kool ET. yDNA: a new geometry for size-expanded base pairs. Angew Chem Int Ed. 2004;43:5834–36. doi: 10.1002/anie.200461036. [DOI] [PubMed] [Google Scholar]
- 65.Malyshev DA, Dhami K, Lavergne T, Chen T, Dai N, et al. A semi-synthetic organism with an expanded genetic alphabet. Nature. 2014;509:385–88. doi: 10.1038/nature13314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Malyshev DA, Dhami K, Quach HT, Lavergne T, Ordoukhanian P, et al. Efficient and sequenceindependent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet. PNAS. 2012;109:12005–10. doi: 10.1073/pnas.1205176109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Malyshev DA, Pfaff DA, Ippoliti SI, Hwang GT, Dwyer TJ, Romesberg FE. Solution structure, mechanism of replication, and optimization of an unnatural base pair. Chemistry. 2010;16:12650–59. doi: 10.1002/chem.201000959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Malyshev DA, Seo YJ, Ordoukhanian P, Romesberg FE. PCR with an expanded genetic alphabet. J Am Chem Soc. 2009;131:14620–21. doi: 10.1021/ja906186f. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Mandell DJ, Lajoie MJ, Mee MT, Takeuchi R, Kuznetsov G, et al. Biocontainment of genetically modified organisms by synthetic protein design. Nature. 2015;518:55–60. doi: 10.1038/nature14121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Marliere P. The farther, the safer: a manifesto for securely navigating synthetic species away from the old living world. Syst Synth Biol. 2009;3:77–84. doi: 10.1007/s11693-009-9040-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Marlière P, Patrouix J, Doring V, Herdewijn P, Tricot S, et al. Chemical evolution of abacterium’s genome. Angew Chem Int Ed. 2011;50:7109–14. doi: 10.1002/anie.201100535. [DOI] [PubMed] [Google Scholar]
- 72.Marx A, Betz K. The structural basis for processing of unnatural base pairs by DNA polymerases. Chemistry. 2020;26:3446–63. doi: 10.1002/chem.201903525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Matsuura MF, Shaw RW, Moses JD, Kim H-J, Kim M-J, et al. Assays to detect the formation of triphosphates of unnatural nucleotides: application to Escherichia coli nucleoside diphosphate kinase. ACS Synth Biol. 2016;5:234–40. doi: 10.1021/acssynbio.5b00172. [DOI] [PubMed] [Google Scholar]
- 74.McMinn DL, Ogawa AK, Wu Y, Liu J, Schultz PG, Romesberg FE. Efforts toward expansion of the genetic alphabet: DNA polymerase recognition of a highly stable, self-pairing hydrophobic base. J Am Chem Soc. 1999;121:11585–86. [Google Scholar]
- 75.Mei H, Liao J-Y, Jimenez RM, Wang Y, Bala S, et al. Synthesis and evolution of a threose nucleic acid aptamer bearing 7-deaza-7-substituted guanosine residues. J Am Chem Soc. 2018;140:5706–13. doi: 10.1021/jacs.7b13031. [DOI] [PubMed] [Google Scholar]
- 76.Milton RCD, Milton SCF, Kent SBH. Total chemical synthesis of a D-enzyme: the enantiomers ofHIV-1 protease show reciprocal chiral substrate specificity. Science. 1992;256:1445–48. doi: 10.1126/science.1604320. [DOI] [PubMed] [Google Scholar]
- 77.Neumann H, Wang K, Davis L, Garcia-Alai M, Chin JW. Encoding multiple unnatural amino acids via evolution of a quadruplet-decoding ribosome. Nature. 2010;464:441–44. doi: 10.1038/nature08817. [DOI] [PubMed] [Google Scholar]
- 78.Nikoomanzar A, Vallejo D, Yik EJ, Chaput JC. Programmed allelic mutagenesis of a DNA polymerase with single amino acid resolution. ACS Synth Biol. 2020;9:1873–81. doi: 10.1021/acssynbio.0c00236. [DOI] [PubMed] [Google Scholar]
- 79.Oh J, Shin J, Unarta IC, Wang W, Feldman AW, et al. Transcriptional processing of an unnatural base pair by eukaryotic RNA polymerase II. Nat Chem Biol. 2021;17:906–14. doi: 10.1038/s41589-021-00817-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Ong J, Evans JTC, Tanner N. Compositions and methods relating to variant DNA polymerases and synthetic DNA polymerases WO Patent 2013/033528 A1. 2013
- 81.Pezo V, Hassan C, Louis D, Sargueil B, Herdewijn P, Marlière P. Metabolic recruitment and directed evolution of nucleoside triphosphate uptake in Escherichia coli. ACS Synth Biol. 2018;7:1565–72. doi: 10.1021/acssynbio.8b00048. [DOI] [PubMed] [Google Scholar]
- 82.Pinheiro VB, Taylor AI, Cozens C, Abramov M, Renders M, et al. Synthetic genetic polymers capable of heredity and evolution. Science. 2012;336:341–44. doi: 10.1126/science.1217622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Rackham O, Chin JW. A network of orthogonal ribosomexm RNApairs. Nat Chem Biol. 2005;1:159–66. doi: 10.1038/nchembio719. [DOI] [PubMed] [Google Scholar]
- 84.Rangel AE, Chen Z, Ayele TM, Heemstra JM. In vitro selection of an XNA aptamer capable of small-molecule recognition. Nucleic Acids Res. 2018;46:8057–68. doi: 10.1093/nar/gky667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Ravikumar A, Arzumanyan GA, Obadi MKA, Javanpour AA, Liu CC. Scalable, continuous evolution of genes at mutation rates above genetic error thresholds. Cell. 2018;175:1946–57.:e13. doi: 10.1016/j.cell.2018.10.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Ravikumar A, Liu CC. Biocontainment through reengineered genetic codes. ChemBioChem. 2015;16:1149–51. doi: 10.1002/cbic.201500157. [DOI] [PubMed] [Google Scholar]
- 87.Reinkemeier CD, Girona GE, Lemke EA. Designer membraneless organelles enable codon reassignment of selected mRNAs in eukaryotes. Science. 2019;363:eaaw2644. doi: 10.1126/science.aaw2644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Richardson SM, Mitchell LA, Stracquadanio G, Yang K, Dymond JS, et al. Design of a synthetic yeast genome. Science. 2017;355:1040–44. doi: 10.1126/science.aaf4557. [DOI] [PubMed] [Google Scholar]
- 89.Robertson WE, Funke LFH, de la Torre D, Fredens J, Elliott TS, et al. Sense codon reassignment enables viral resistance and encoded polymer synthesis. Science. 2021;372:1057–62. doi: 10.1126/science.abg3029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Robinson AO, Venero OM, Adamala KP. Toward synthetic life: biomimetic synthetic cell communication. Curr Opin Chem Biol. 2021;64:165–73. doi: 10.1016/j.cbpa.2021.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Rose KM, Alves Ferreira-Bravo I, Li M, Craigie R, Ditzler MA, et al. Selection of 2’-deoxy-2’-fluoroarabino nucleic acid (FANA) aptamers that bind HIV-1 integrase with picomolar affinity. ACS Chem Biol. 2019;14:2166–75. doi: 10.1021/acschembio.9b00237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Rovner AJ, Haimovich AD, Katz SR, Li Z, Grome MW, et al. Recoded organisms engineered to depend on synthetic amino acids. Nature. 2015;518:89–93. doi: 10.1038/nature14095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Rubini R, Mayer C. Addicting Escherichiacoli to new-to-nature reactions. ACS Chem Biol. 2020;15:3093–98. doi: 10.1021/acschembio.0c00713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Santoro SW, Joyce GF. A general purpose RNA-cleaving DNA enzyme. PNAS. 1997;94:4262–66. doi: 10.1073/pnas.94.9.4262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Schmidt M. A metric space for semantic containment: towards the implementation of genetic firewalls. Biosystems. 2019;185:104015. doi: 10.1016/j.biosystems.2019.104015. [DOI] [PubMed] [Google Scholar]
- 96.Schmied WH, Elsässer SJ, Uttamapinant C, Chin JW. Efficient multisite unnatural amino acid incorporation in mammalian cells via optimized pyrrolysyl tRNA synthetase/tRNA expression and engineered eRF1. J Am Chem Soc. 2014;136:15577–83. doi: 10.1021/ja5069728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Sczepanski JT, Joyce GF. Binding of a structured D-RNA molecule by an L-RNA aptamer. JAm Chem Soc. 2013;135:13290–93. doi: 10.1021/ja406634g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Seo YJ, Hwang GT, Ordoukhanian P, Romesberg FE. Optimization of an unnatural base pair toward natural-like replication. J Am Chem Soc. 2009;131:3246–52. doi: 10.1021/ja807853m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Seo YJ, Matsuda S, Romesberg FE. Transcription of an expanded genetic alphabet. J Am Chem Soc. 2009;131:5046–47. doi: 10.1021/ja9006996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Shao Y, Lu N, Wu Z, Cai C, Wang S, et al. Creating a functional single-chromosome yeast. Nature. 2018;560:331–35. doi: 10.1038/s41586-018-0382-x. [DOI] [PubMed] [Google Scholar]
- 101.Sismour AM, Benner SA. The use of thymidine analogs to improve the replication of an extra DNA base pair: a synthetic biological system. Nucleic Acids Res. 2005;33:5640–46. doi: 10.1093/nar/gki873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Skakuj K, Bujold KE, Mirkin CA. Mercury-free automated synthesis of guanidinium backbone oligonucleotides. J Am Chem Soc. 2019;141:20171–76. doi: 10.1021/jacs.9b09937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Takahashi M, Li HT, Zhou JH, Chomchan P, Aishwarya V, et al. Dual mechanisms of action of self-delivering, anti-HIV-1 FANA oligonucleotides as a potential new approach to HIV therapy. Mol Ther Nucleic Acids. 2019;17:615–25. doi: 10.1016/j.omtn.2019.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Takegawa-Araki T, Kumagai S, Yasukawa K, Kuroda M, Sasaki T, Obika S. Structure-activity relationships of anti-microRNA oligonucleotides containing cationic guanidine-modified nucleic acids. J Med Chem. 2022;65:2139–48. doi: 10.1021/acs.jmedchem.1c01680. [DOI] [PubMed] [Google Scholar]
- 105.Taylor AI, Beuron F, Peak-Chew SY, Morris EP, Herdewijn P, Holliger P. Nanostructures from synthetic genetic polymers. ChemBioChem. 2016;17:1107–10. doi: 10.1002/cbic.201600136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Taylor AI, Holliger P. Directed evolution of artificial enzymes (XNAzymes) from diverse repertoires of synthetic genetic polymers. Nat Protoc. 2015;10:1625–42. doi: 10.1038/nprot.2015.104. [DOI] [PubMed] [Google Scholar]
- 107.Taylor AI, Holliger P. On gene silencing by the X10-23 DNAzyme. Nat Chem. 2022;14:855–58. doi: 10.1038/s41557-022-00990-5. [DOI] [PubMed] [Google Scholar]
- 108.Taylor AI, Houlihan G, Holliger P. Beyond DNA and RNA: the expanding toolbox of synthetic genetics. Cold Spring Harb Perspect Biol. 2019;11:a032490. doi: 10.1101/cshperspect.a032490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Taylor AI, Pinheiro VB, Smola MJ, Morgunov AS, Peak-Chew S, et al. Catalysts from synthetic genetic polymers. Nature. 2015;518:427–30. doi: 10.1038/nature13982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Taylor AI, Wan CJK, Donde MJ, Peak-Chew S-Y, Holliger P. A modular XNAzyme cleaves long, structured RNAs under physiological conditions and enables allele-specific gene silencing. Nat Chem. 2022;14:1295–305. doi: 10.1038/s41557-022-01021-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Thao TTN, Labroussaa F, Ebert N, V’kovski P, Stalder H, et al. Rapid reconstruction of SARS-CoV-2 using a synthetic genomics platform. Nature. 2020;582:561–65. doi: 10.1038/s41586-020-2294-9. [DOI] [PubMed] [Google Scholar]
- 112.Torres L, Kruger A, Csibra E, Gianni E, Pinheiro VB. Synthetic biology approaches to biological containment: pre-emptively tackling potential risks. Essays Biochem. 2016;60:393–410. doi: 10.1042/EBC20160013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Traut TW. Physiological concentrations of purines and pyrimidines. Mol Cell Biochem. 1994;140:1–22. doi: 10.1007/BF00928361. [DOI] [PubMed] [Google Scholar]
- 114.Wan WB, Seth PP. The medicinal chemistry of therapeutic oligonucleotides. J Med Chem. 2016;59:9645–67. doi: 10.1021/acs.jmedchem.6b00551. [DOI] [PubMed] [Google Scholar]
- 115.Wang K, Fredens J, Brunner SF, Kim SH, Chia T, Chin JW. Defining synonymous codon compression schemes by genome recoding. Nature. 2016;539:59–64. doi: 10.1038/nature20124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Wang K, Neumann H, Peak-Chew SY, Chin JW. Evolved orthogonal ribosomes enhance the efficiency of synthetic genetic code expansion. Nat Biotechnol. 2007;25:770–77. doi: 10.1038/nbt1314. [DOI] [PubMed] [Google Scholar]
- 117.Wang L, Brock A, Herberich B, Schultz PG. Expanding the genetic code of Escherichia coli. Science. 2001;292:498–500. doi: 10.1126/science.1060077. [DOI] [PubMed] [Google Scholar]
- 118.Wang L, Jiang S, Deng Z, Dedon PC, Chen S. DNA phosphorothioate modification-a new multi-functional epigenetic system in bacteria. FEMS Microbiol Rev. 2019;43:109–22. doi: 10.1093/femsre/fuy036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Wang Y, Ngor AK, Nikoomanzar A, Chaput JC. Evolution of a general RNA-cleaving FANA enzyme. Nat Commun. 2018;9:5067. doi: 10.1038/s41467-018-07611-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Wang Y, Nguyen K, Spitale RC, Chaput JC. A biologically stable DNAzyme that efficiently silences gene expression in cells. Nat Chem. 2021;13:319–26. doi: 10.1038/s41557-021-00645-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Wang Y, Wang Y, Song D, Sun X, Li Z, et al. An RNA-cleaving threose nucleic acid enzyme capable of single point mutation discrimination. Nat Chem. 2022;14:350–59. doi: 10.1038/s41557-021-00847-3. [DOI] [PubMed] [Google Scholar]
- 122.Wilson DS, Szostak JW. In vitro selection of functional nucleic acids. Annu Rev Biochem. 1999;68:611–47. doi: 10.1146/annurev.biochem.68.1.611. [DOI] [PubMed] [Google Scholar]
- 123.Wu Y, Fa M, Tae EL, Schultz PG, Romesberg FE. Enzymatic phosphorylation of unnatural nucleosides. J Am Chem Soc. 2002;124:14626–30. doi: 10.1021/ja028050m. [DOI] [PubMed] [Google Scholar]
- 124.Yamashige R, Kimoto M, Takezawa Y, Sato A, Mitsui T, et al. Highly specific unnatural base pair systems as a third base pair for PCR amplification. Nucleic Acids Res. 2012;40:2793–806. doi: 10.1093/nar/gkr1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Yang Z, Chen F, Alvarado JB, Benner SA. Amplification, mutation, and sequencing of a six-letter synthetic genetic system. J Am Chem Soc. 2011;133:15105–12. doi: 10.1021/ja204910n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Yang Z, Chen F, Chamberlin SG, Benner SA. Expanded genetic alphabets in the polymerase chain reaction. Angew Chem Int Ed. 2010;49:177–80. doi: 10.1002/anie.200905173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Yang Z, Hutter D, Sheng P, Sismour AM, Benner SA. Artificially expanded genetic information system: a new base pair with an alternative hydrogen bonding pattern. Nucleic Acids Res. 2006;34:6095–101. doi: 10.1093/nar/gkl633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Yang Z, Sismour AM, Sheng P, Puskar NL, Benner SA. Enzymatic incorporation of a third nucleobase pair. Nucleic Acids Res. 2007;35:4238–49. doi: 10.1093/nar/gkm395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Ye J, van den Berg B. Crystal structure of the bacterial nucleoside transporter Tsx. EMBO J. 2004;23:3187–95. doi: 10.1038/sj.emboj.7600330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Zhang L, Chaput JC. In vitro selection of an ATP-binding TNA aptamer. Molecules. 2020;25:4194. doi: 10.3390/molecules25184194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Zhang Y, Lamb BM, Feldman AW, Zhou AX, Lavergne T, et al. A semisynthetic organism engineered for the stable expansion of the genetic alphabet. PNAS. 2017;114:1317–22. doi: 10.1073/pnas.1616443114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Zhang Y, Ptacin JL, Fischer EC, Aerni HR, Caffaro CE, et al. A semi-synthetic organism that stores and retrieves increased genetic information. Nature. 2017;551:644–47. doi: 10.1038/nature24659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Zhou ZX, Williams JS, Lujan SA, Kunkel TA. Ribonucleotide incorporation into DNA during DNA replication and its consequences. Crit Rev Biochem Mol Biol. 2021;56:109–24. doi: 10.1080/10409238.2020.1869175. [DOI] [PMC free article] [PubMed] [Google Scholar]





