Skip to main content
Computational and Structural Biotechnology Journal logoLink to Computational and Structural Biotechnology Journal
. 2021 Apr 25;19:2468–2476. doi: 10.1016/j.csbj.2021.04.057

Mini review: Enzyme-based DNA synthesis and selective retrieval for data storage

Eojin Yoo a,1, Donghui Choe a,1, Jongoh Shin a, Suhyung Cho a,b, Byung-Kwan Cho a,b,
PMCID: PMC8113751  PMID: 34025937

Abstract

The market for using and storing digital data is growing, with DNA synthesis emerging as an efficient way to store massive amounts of data. Storing information in DNA mainly consists of two steps: data writing and reading. The writing step requires encoding data in DNA, building one nucleotide at a time as a form of single-stranded DNA (ssDNA). Once the data needs to be read, the target DNA is selectively retrieved and sequenced, which will also be in the form of an ssDNA. Recently, enzyme-based DNA synthesis is emerging as a new method to be a breakthrough on behalf of decades-old chemical synthesis. A few enzymatic methods have been presented for data memory, including the use of terminal deoxynucleotidyl transferase. Besides, enzyme-based amplification or denaturation of the target strand into ssDNA provides selective access to the desired dataset. In this review, we summarize diverse enzymatic methods for either synthesizing ssDNA or retrieving the data-containing DNA.

Keywords: Enzymatic DNA synthesis, DNA data storage, Single-stranded DNA, Terminal deoxynucleotidyl transferase, Synthetic Biology

1. Introduction

In synthetic biology, the two major topics being highlighted are DNA synthesis and its applications in the data storage systems. We are experiencing a ‘data explosion’, which refers to a state when massive amount of data is rapidly generated and stored in a computer [1]. Thus, we need to develop an efficient method of storing digital data, and DNA is sought to be the most attractive candidate for replacing the current data storage mediums.

The idea of ‘genetic memory’ first came up in the 1960 s by Nobert Wiener. Since then, the growth of storing data in DNA is accelerated as both the storage capacity and synthetic scale have been enlarged by Church and Goldman, storing 659 KB and 739 KB of data in DNA molecules, respectively [2], [3]. The largest size of data reported is presented by Strauss and Ceze, encoding 200 MB of data in a DNA library [4]. With such advances in synthetic biology technology, a number of creative and practical techniques for storing DNA data have been proposed (Table 1) [4], [5], [6], [7], [8].

Table 1.

Comprehensive lists of DNA data storage system.

Total data Code Contents Synthesis method Length (nt) Bits per nucleotide Random access Ref.
650 kB 1 bit to 1 base 53,426 English text 11 JPG images 5.27-megabit computer code (all as HTML encoded draft) Phosphoramidite (silicon-based) 115 0.6 No [2]
739 kB 8 bits to 5–6 bases (ternary code, rotating encoding) Text file of Shakespeare’s sonnets PDF file of scientific paper JPEG file of the European Bioinformatics Institute MP3 file from Martin Luther King’s 1963 ‘I have a dream’ speech a Huffman code Phosphoramidite (microarray-based) 117 0.19 No [3]
83 kB Indexing and Reed–Solomon coding Text from the Swiss Federal Charter from 1291 and the English translation of the Method of Archimedes Phosphoramidite (microarray-based) 158 0.86 No [6]
151 kB 8 bits to 5–6 bases (ternary code, rotating encoding) Three JPG files Phosphoramidite (column-based) 120 0.57 Yes [58]
2 MB DNA fountain encoding Text file of amazon gift card Kolilbri operating system SVG file of pioneer PDF file of shannon's manuscript Video file of the arrival of a train Zipbomb malware Phosphoramidite (silicon-based) 152 1.18 No [105]
22 MB 6 bis to 3 bases, 2 bits to 2 bases MPEG compressed movie sequence Phosphoramidite (microarray-based) 230 0.89 No [106]
200.2 MB Indexing and Reed–Solomon coding 35 distinct files, including high-definition video, images, audio, and text. These included the “Universal Declaration of Human Rights” in over 100 languages Phosphoramidite (silicon-based) 150–200 0.81 Yes [4]
33 kB 150
8.5 MB 2 bits to 1 base Short input message in ASCI code Phosphoramidite (silicon-based) 194 1.94 No [107]
854B One character to DNA codon Text file Phosphoramidite (column-based) 85 1.78 No [108]
3 kB 14 bits to 8 bases Two JPEG images, The Citizen Kane poster and Smiley Face Phosphoramidite (column-based) 880–1,000 (assembly of 100nt) 1.71 Yes [109]
18B 2 bits to 1 base (ternary code,rotating encoding) Text message Enzyme-based (TdT, Apyrase) 150–200 1.5a No [42]
110B 2 bits to 1 base (ternary code,rotating encoding) Sheet music Enzyme-based (TdT, DMNP-EDTA) 50–100 9.2a Yes [5]
a

Bits per transition.

The current DNA data storage process mainly consists of writing, storing, and reading steps (Fig. 1). To begin the process, data files are converted to binary data consisting of 0 s and 1 s and stored in the form of nucleotides. Based on the designed sequence, the ssDNA is synthesized by linking nucleotides in order. This user-defined ssDNA is stored in vitro or used to make dsDNA for long-term storage. In order to extract data from the stored DNA, the target sequence is retrieved by selective amplification through PCR and sequenced using next-generation sequencing (NGS) methods. After error detection, the reads are decoded to the original data code for determining successful recovery.

Fig. 1.

Fig. 1

Overview of DNA data storage system. (A) The schematic mechanism of the whole process. To begin, data files are converted into binary code and sequentially encrypted in DNA sequence, using the coding scheme (e.g. Huffman code). Designed sequences are synthesized into DNA with enzymatic methods. After long-term storage in vitro or in vivo, data-containing DNA is sequenced and decoded for complete recovery of the file. (B) Conversion of digital data into DNA sequence. Binary code can be assigned to represent either two bases (upper) or one base (lower). Ternary code, generated by Huffman coding, is used for rotating encoding.

There have been many reviews that discussed the data storage procedures. However, as recent studies are transitioning to develop novel methods using enzymes, we would like to shed light on enzymatic methods which could be potentially used in data storage. Although most of the methods described here lack precedent usage in data storage, we strongly believe they could be applied to future enzymatic DNA data storage. In this review, under the three steps of the data storage system shown in Fig. 1, we have focused on the data writing and reading steps. With the emphasis on using enzyme-based techniques, we would like to present how data-encoding DNA is synthesized and selectively retrieved in the form of single-stranded DNA (ssDNA).

2. Writing: Encoding data in DNA

DNA synthesis for data storage has long been relying on the phosphoramidite method, which is a four-step cyclic reaction involving the addition of the desired nucleotide to a growing oligonucleotide chain immobilized on solid support [9], [10]. Based on the automation and improvement of the phosphoramidite method, an oligonucleotide synthesizer can synthesize up to 200 nucleotides (nt) with a 99.3% yield per synthesis cycle [11], [12]. However, this method has unresolved limitations for several reasons. First, the longer the oligonucleotide sequence, the lower the final product yield becomes. Theoretically, even 99% of the nucleotide incorporation efficiency per reaction cycle will yield 36.7 and 13.4% of the full-length product after 100 and 200 cycles (0.99100 and 0.99200), respectively [13], [14]. The final yield of oligonucleotide will be even lower than the theoretical yield, depending on the purification methods, such as MOPC, HPLC, and PAGE. Thus, current chemical DNA synthesis is limited to 200 nt as its upper limit, which may hinder the encoding of vast amounts of data in a sequence. Second, chemical synthesis requires the use of anhydrous solvents, which produce toxic wastes [15], [16]. Additionally, the use of either trichloroacetic acid or dichloroacetic acid causes depurination of adenosine. Since depurination forms an abasic site on the sequences, it damages DNA under the deprotection step by cleaving or truncating the product which is critical for generating error-free data [17], [18]. Therefore, as an alternative to chemical synthesis, a bio-based DNA synthesis method is highlighted.

Enzyme-based data storage is still in its infancy, but it is expected to involve lesser costs and time for the synthesis of data-containing DNA, compared to the chemical synthesis of DNA. The concept of enzymatic DNA synthesis arises from the discovery of DNA polymerase [19], [20]. However, to have a user-defined DNA sequence like in the chemical method, we need enzymes capable of extending the 3′ end of the ssDNA in a template-independent manner, such as polynucleotide phosphorylase (PNPase), T4 RNA ligase, and TdT.

2.1. Polynucleotide phosphorylase (PNPase)

PNPase was originally known for its processive RNA degradation activity [21], [22]. However, it has been demonstrated that PNPase is responsible for the polymerization of ribonucleoside diphosphates when Mg2+ ions are present. PNPase uses nucleoside diphosphates (NDPs) to perform template-independent synthesis of RNA, releasing orthophosphate (Pi) as its side product (Fig. 2A) [22]. To synthesize a defined RNA sequence using PNPase, a method was proposed to use modified nucleotides with a blocker attached to it which prevents the addition of additional enzymes in the sequence, allowing only one at a time [23]. The four requirements of the blocking group they proposed are as follows: i) It should be chemically stable under enzymatic reactions. ii) Chemical deblocking must be possible and not affect the structure of the growing polymer. iii) The blocking group must be small enough to be used as a substrate by the enzyme. iv) The size and configuration of the synthesized polymer with a blocking group on its 3′ end should be durable enough to block further extension. Following these four entries, a stepwise synthesis of ssDNA using 2′(3′)-O-(α-methoxyethyl) nucleoside 5′-diphosphates or 2′(3′)-O-isovaleryl derivatives of NDPs was suggested [24], [25], [26], [27], [28]. To enhance the specificity towards dADP and balance the phosphorlysis and polymerization activity, Mn2+ concentration and the ionic concentration were optimized, respectively, which succeeded in adding nine dNDPs to chemically synthesized 4-mer nucleotides [29]. Although recent studies have shown that PNPase has the potential to be used for ssDNA synthesis, PNPase had a critical drawback that it also catalyzes the reverse reaction. The direction of reversible PNPase reaction highly depends on the relative concentration of orthophosphate and dNDPs [30].

Fig. 2.

Fig. 2

Schematic illustration of enzymatic ssDNA synthesis. (A) PNPase; Mechanism of PNPase depends on dNDPs/Pi concentration. (B) T4Rnl; T4Rnl ligases 3′ hydroxyl group on the acceptor and 5′ phosphate of the incoming nucleotide, using ATP. 2′-deoxyribonucleoside 3′,5′-bisphosphate allows sequence-specific addition of one nucleotide per a cycle. (C) TdT; TdT is capable of generating homopolymeric sequence.

2.2. T4 RNA ligase (T4Rnl)

T4Rnl forms a phosphodiester bond between the 5′ phosphate and the 3′ hydroxyl group of the oligonucleotides in an ATP-dependent manner (Fig. 2B) [31], [32]. As it connects the 5′ and 3′ ends of the chains which are present as the reactants in series, DNA synthesis with T4Rnl is limited to the formation of homopolymeric molecules [33]. Therefore, in order to synthesize the desired sequence, a method was developed using 2′-deoxyribonucleoside 3′,5′-bisphosphate to the 3′ hydroxyl group of the oligonucleotide and block any further extension by 3′-phosphate [34]. When synthesizing deoxyribonucleoside bisphosphate to the oligodeoxyribonucleotide, excessive donor concentrations and low ATP levels helped yielding more than 85% per cycle. However, the time required for the reaction was 5 to 21 days, which is inefficient for synthesizing 10 or more nucleotides [35]. The reaction time was shortened through solid-state synthesis, resulting in a reduction in the reaction time to less than 144 h [36]. In addition, the synthesis efficiency varies widely depending on the type of reaction substrates [36], [37], and the presence of a 3′ unblocked donor is likely to cause self-ligation and circulation. Therefore, even if T4Rnl synthesizes ssDNA for a certain length, it needs to be further optimized for fast and efficient long-chain DNA synthesis.

2.3. Terminal deoxynucleotidyl transferase (TdT)

TdT is an enzyme belonging to the DNA polymerase X family. Unlike other polymerases, it has a unique property that enables template-independent DNA synthesis [38], [39]. There are two main reasons why TdT has template-independent properties. From a mechanistic viewpoint, the TdT reaction can occur between the 3′ OH and triphosphate groups of the incoming nucleotide, without the need to specifically bind a complementary base on the template strand. Another reason arises from a notable structural variation of the existence of a ‘lariat-like’ loop [40], [41]. The crystal structure of TdT demonstrates that this lariat-like loop, which consists of 16 amino acids, acts as a physical obstacle, generates steric hindrance, and prevents the accommodation of dsDNA. However, one limitation of using TdT is the formation of a homopolymer tail (Fig. 2C). Since TdT actively adds nucleotides to the terminal sequence, as long as the reaction conditions meets, there is no way to block further extension of nucleotides. Thus, the unique approaches of using TdT have been presented by either controlling homopolymer production or adding a single modified nucleotide.

2.3.1. Competitive synthesis

Based on the enzyme kinetics of wild-type TdT, it rapidly synthesizes DNA with natural dNTPs. A competitive addition of nucleotides was demonstrated to utilize the formation of homopolymers in data storage [5], [42]. Either enzymes or ion-caging molecules was used to block TdT activity at a level where they could control the number of nucleotides added. Instead of having a perfect sequence for representing each bit, the method aimed to record the rotational ternary code between the base transition sites. In this strategy, TdT competes with apyrase, an ecto-nucleoside triphosphate diphosphohydrolases for access to the nucleotides available. As a nucleotide converter, apyrase intensely degrades dNTPs to deoxynucleoside diphosphates (dNDPs) or dNMPs (monophosphate). Since TdT has a higher preference for enzyme activity with dNTPs than dNDPs [43], the presence of apyrase in the reaction prevents TdT from incorporating dNTPs. Based on the nature of TdT, it was attempted to control the activity of TdT by converting dNTPs to dNDPs [42]. By this method, TdT can only use the nucleotides before being degraded by apyrase, and the number of nucleotides added is three to five bases on average [44].

Similar to the apyrase method, 1-(4,5-dimethoxy-2-nitrophenyl)-1,2-diaminoethane-N,N,N',N'-tetraacetic acid (DMNP-EDTA), a chelator, competes intensively with TdT for catalytic ions in the reaction [5]. TdT uses catalytic ions for the nucleophilic attack, as cobalt ions are known to contribute most to enzyme activity by increasing the rate of base incorporation at the catalytic site. Before the reaction starts, the catalytic ions are caged by DMNP-EDTA and cannot be used for TdT, as the affinity towards DMNP-EDTA is greater than that towards TDT. Then, exposure to UV light breaks the structure of the caging molecule and triggers cage release back to the reaction environment. As TdT uses the catalytic ion again to perform another round of nucleotide addition and the reaction is stopped by adding an excessive amount of ion-caging molecule. This form of regulation is not a reversible reaction, and the length of the homopolymer structure depends on the irradiance and time of UV light on the synthetic product.

2.3.2. TdT with terminator-modified nucleotides

The competitive method is improper for synthesizing sequence-specific ssDNA without ultimate optimization because it cannot add nucleotides in order and inevitably generates a homopolymer sequence. Therefore, based on the chemical method, the use of 3′ modified nucleotides was proposed to prevent further extension after the addition of one nucleotide [45], [46]. Inspired by phosphoramidite methods, 3′-modified nucleotides with chemical blockers are known to be used as substrates for various DNA polymerases, which can be cleaved by chemical solutions. Therefore, it is expected that the cyclic addition of 3′ blocked nucleotides with TdT will be developed shortly. In addition to chemically cleavable nucleotides, TdT incorporates 3′‐O‐nitrobenzyl and 4,5‐dimethoxy‐2‐nitrobenzyl groups, which are prone to UV cleavage [47], [48]. It was proposed that TdT uses a nitrobenzyl-bound dNTP (NZ-dATP) to add one correct nucleotide onto the 750 bp blunt-end dsDNA template. The addition of NZ-dATP did not increase the DNA length, and extension occurred only when dNTP was added after exposure to UV light. UV-A light successfully degrades the nitrobenzyl group without causing DNA damages. However, there is a controversy. As the number of oligonucleotides that accept NZ-dATPs as its substrate is so small, that even after blocking, there are some fractions of remaining unblocked nucleotides that extend their length after unmodified nucleotide extension [49] To enhance the incorporation of such reversible nucleotides, enzyme engineering and searching new chemical moiety are required.

2.3.3. TdT reversely coupled to nucleotides with a linker

Using modified nucleotides requires enzyme engineering to incorporate bulky substrates and enlargement of the catalytic site, which may change the enzyme kinetics. This is due to the inherent characteristics of DNA polymerase, which has variable replication fidelity depending on the shape and size of the bases [50]. Although TdT has a wide variety of nucleotide preferences, 3′ OH blocked nucleotides have a poor incorporation rate compared to the naturally-occurring nucleotides with a free 3′ terminal hydroxyl group [51]. Therefore, although the method seems straightforward, finding and developing 3′ reversely blocked nucleotides with rapid enzyme kinetics is difficult. To solve this problem, a strategy was proposed for synthesizing a single, 3′ unblocked nucleotide, propargylamino-dNTPs, after forming a covalent bond with TdT [52]. The method requires modifying the base analog side of the nucleotide and synthesizing TdT using freely available hydroxyl groups. The covalent complex of TdT, tethered to a link with dNTP, inhibits further extension until it is cleaved by light at 365 nm and opens the 3′ end to continue the synthesis cycle. This has the side effect of leaving a propargylamino scar on the nucleobase, but the PCR amplification of the product successfully proceeds without any errors. In particular, this experiment has very little effect on enzyme kinetics because the 3′ end of the nucleotide remains the same as the natural nucleotide. The method succeeds in synthesizing 10mer oligonucleotide, yielding 93–98% per cycle.

3. Storage

The synthesized DNA is stored as dsDNA form for long-term stability. Since the durability of DNA is influenced by the medium they are stored in, the user should consider the appropriate method. Normally, DNA only requires dryness and ambient temperatures to be stored efficiently. Freezing DNA at a temperature between −20 and −80 °C, or in liquid nitrogen is sufficient, and dehydrated DNA can even be stored at room temperatures. But those methods inevitably encounter damages due to environmental factors like hydrolysis and UV irradiation [53]. Instead, DNA can be encapsulated in nanoparticles such as silica layers, iron oxide, and salts for higher stability under harsh conditions [6], [54], [55]. Cells can also be an in vivo storage medium by inserting DNA in bacterial plasmids, using genetic engineering systems [56], [57].

4. Reading: Selective retrieval of DNA for sequencing

During the encoding process, each DNA can be designed to have its own ‘barcode’ block, using error-correction codes. By doing this, a specific sequence can be read when required as in modern random-access memory [58]. When the desired data need to be decoded, the target DNA is selectively retrieved and amplified from the pool, using corresponding primers specific to the barcode block. In this step, selective amplification by polymerase chain reaction (PCR), keeps data from being volatile and needs a DNA template for synthesizing new strands. Then, the amplified DNA is sequenced for decoding. Most of the sequencing methods used are sequencing by synthesis (SBS) or nanopore sequencing. Since both of the methods requires ssDNA to initiate ‘reading’, we would like to introduce some mechanisms which generate amplified ssDNA from dsDNA. Although all the examples are given lack actual application in data reading currently, they are expected to be useful methods for obtaining the target DNA nevertheless (Table 2).

Table 2.

Efficiencies of enzymatic ssDNA amplification.

Production method Enzymes Template Length (nt) Yielda Application Ref
aPCR DNA Polymerase ssDNA plasmid, lambda phage dsDNA 15,000 695 ± 35 ng/kb Aptamer generation [65]
ssDNA plasmid, dsDNA fragments 449–3356 DNA origami scaffold synthesis [64]
RCA Phi 29 polymerase Circular DNA 378 ssDNA synthesis [81]
SDA DNA polymerase, nicking endonuclease ssDNA 500–1000 99.71%/kb Synthesis of codon-usage variants of lacZα and 74 challenging Drosophila protein antigens [110]
Single stranded binding protein, DNA polymerase, nicking endonuclease dsDNA (PCR product) 500 10-fold ssDNA synthesis [86]
1000 10-fold
5000 12-fold
SNAPCAR DNA Polymerase acrydite-labelled dsDNA (PCR product) 1650 50–70% DNA origami scaffold synthesis [96]
MeRPY DNA Polymerase acrydite-labelled dsDNA with uracil (PCR product) 89–3115 greater than70% DNA origami scaffold synthesis [99]
a

Yields of ssDNA are converted from the ng/ul or mg/L from original article.

4.1. Asymmetric PCR

Asymmetric PCR (aPCR) reaction starts with a dsDNA template amplification using a set of primers in asymmetric concentration; one is excessive, while the other is insufficient (Fig. 3A) [59]. If the insufficient primers are completely consumed, the remaining excess primers amplify the annealed target ssDNA [60]. Since DNA polymerase adds complementary nucleotides to the template strand, the aPCR method has the advantage of ssDNA synthesizing directly from the template. In addition, ssDNA can be synthesized as much as possible using primers. However, because of the low processivity of DNA polymerase used in the early stages, aPCR has been mainly used for aptamer generation [61], [62] and gene detection [63], which requires short, 20–100 nt ssDNA. To synthesize longer DNA, Taq DNA polymerase was used to produce 3.3 kb ssDNA [64]. Soon after, they succeeded in producing 15 kb ssDNA using high-processive LongAmp Taq polymerase [65]. The overall synthesis yielded 2 pmoles of the 1000 nt product per 50 µl of the reaction volume.

Fig. 3.

Fig. 3

Schematic illustration of enzymatic ssDNA amplification system. (A) asymmetric PCR; Using unbalanced ratio of forward and reverse primers, an excessive amount of primer allows amplification of its target strand. (B) RCA; A highly processive DNA polymerase adds nucleotides continuously to the template (circular DNA). As the polymerase goes back to origin, the existing ssDNA unwinds itself to allow the polymerase synthesize another round of ssDNA. (C) SDA; DNA nicking site is generated by a set of primer and targeted for endonuclease activity. To the 3′ end of the nicking site, DNA polymerase extends the strand and displaces the former strand. (D) Biotin-streptavidin magnetic bead method; One of the strands is biotinylated and fixed onto magnetic bead. The unbound strand is rinsed off during denaturation process. (E) SNAPCAR/MeRPy-PCR; copolymerization of dsDNA with acrylamide and sodium acrylate (99:1 ratio) forms a DNA-tagged polymer. Using SNAPCAR, the target strand is recovered by alkaline denaturation and methanol precipitation. MeRPy-PCR allows recovery of both of target and primer strand by selective precipitation, using uracil-DNA glycosylase (UDG) and dimethylethylenediamine (DMEDA).

The process of aPCR has several limitations. First, it requires highly selective purification methods because it is highly prone to the generation of dsDNA by-products [66], [67]. However, this can be partially overcome by the use of 3′ phosphate-blocked limiting primers, which decrease the mispriming and polymerization of DNA by-products [68]. Various purification methods, including gel extraction and immobilization on the column, have been suggested for this [61], [69]. Moreover, aPCR requires considerable optimization [70], [71]. The final yield of the desired ssDNA is highly dependent on the annealing temperature between the two primers [72], the number of amplification cycles [61], DMSO concentration [73], primer ratio, and the presence of a PCR enhancer such as L-ectoin [74]. Although it is sufficient to use a primer ratio between 1:50 to 1:100 with 30 cycles of PCR [59], the final efficiency varies over each condition. Thus, in order to achieve preferable conditions for long-length DNA synthesis, the durability of single-stranded DNA should be enhanced.

4.2. Isothermal amplification

PCR-based isothermal amplification aims to obtain large quantities of target sequences. This method is unique because thermal cycling is not required for the amplification step. In contrast to traditional PCR, isothermal amplification is carried out at a constant temperature under conditions such as water baths, cell surfaces, and living cell interiors that PCR cannot perform in [75].

Rolling circle amplification (RCA) is a method of rapidly synthesizing linear target sequences using highly processive polymerases that are using circular DNA as a template (Fig. 3B) [76], [77]. When the primer is annealed to the dsDNA template, DNA polymerase extends the primer by adding dNTPs and generates a linear ssDNA product at a constant temperature, ranging from 30 to 37 °C [77], [78]. As the polymerase returns to the origin of the template, the former strand is replaced by the strand displacement activity, creating a new template for continuous polymerization. The synthetic product is a tandem repeat of a complementary sequence of circular DNA [79]. Thus, the desired sequence can be obtained by including a restriction site in the primer and performing enzyme digestion after sufficient amplification is performed [80]. RCA was used for the synthesis of a 378 nt ssDNA, which is slightly longer than chemical synthesis (~200 nt) [81]. In terms of quantity, approximately 5 × 109 copies of the 96 nt repeats were synthesized within 90 min [82].

Strand-displacement amplification (SDA) amplifies in isothermal conditions using a DNA polymerase activity with the HincII endonuclease (Fig. 3C) [83]. This method consists of two steps. First, a pair of primers amplifies the dsDNA template and generates the recognition site of HincII at the 5′ terminal end of the target sequence. This can be replaced by using a primer with a restriction site to a denatured ssDNA. Strand breaking occurs on the dsDNA containing the restriction site by HincII, and the DNA polymerase, such as a Klenow fragment, extends the primer strand. After the formation of a new strand, polymerization takes place at the restriction site of 3′ end again by nicking, allowing cyclic amplification [84], [85]. Based on this, synthesis of 0.5–5 kb ssDNA has been demonstrated, applying single-stranded binding protein-aided Sequenase 2.0 DNA polymerase and Nt.BspQI endonuclease. The final yield of the product was 68, 55, and 180 ng [86].

4.3. Separation of ssDNA from dsDNA

In addition to amplifying the desired strand from the template, data are obtained by denaturing dsDNA containing a specific strand. This can be done by selectively breaking down the unwanted strands or fixing the desired strands to the anchoring substance. Unlike PCR-based ssDNA synthesizing methods, an additional purification step is not required, because they can easily precipitate the desired strand and ensure high strand recovery yield.

The biotin-streptavidin magnetic bead method is derived from solid-phase DNA sequencing (Fig. 3D) [87]. To extract the desired DNA using this method, the data-encoding ssDNA was 5′-biotinylated and PCR amplified before storage, generating a complementary strand of the modified strand to make dsDNA. When the data needs to be decoded, the dsDNA was immobilized on magnetic beads and the remaining, non-biotinylated ssDNA that contains data can be purified by denaturing at 95 °C [88], [89], [90], [91]. The maximum length that can be synthesized using this method is 4808 nt [92], and the synthesis yield (recovery) decreases in proportion to the length [93]. Moreover, a fraction of the biotinylated strands still elutes, during the denaturation process. NaOH is involved in the hydrogen bond rupture between the biotinylated strand and streptavidin between the strands, causing reannealing between the strands and the loss of the desired product. Thus, the final yield of the reaction is highly dependent on synthesis length, NaOH concentration, and type of streptavidin-coated magnetic bead [94], [95].

Similar to the biotin–streptavidin-based strategy, selective nascent polymer catch-and-release (SNAPCAR) selectively anchors acrydite-modified dsDNA and denatures non-anchored, desired ssDNA (Fig. 3E) [96]. Using 5′-terminal acrylamide modified oligonucleotide as a primer, traditional PCR is performed to generate dsDNA. Then, co-polymerization of the PCR product with a 99:1 ratio of acrylamide and acrylate generates a dsDNA product linked to a poly (acrylamide-co-acrylate) chain [97], [98]. Following precipitation of the chain-bound dsDNA, incubation in alkaline denaturing conditions releases the mobile complementary strand. The immobile primer strand, that covalently attaches to the acrylamide chain, is pulled down thereafter. This method succeeded in purifying 1650, 3315, and 7301 nt ssDNA with 50 to 70% yield, which is the longest sequence length among the methods described in this review. In addition, the estimated cost of SNAPCAR is $0.35 per 1 nmol of ssDNA, which is about 103 times cheaper than the streptavidin method. However, performing strand repair under standard laboratory conditions is inconvenient because the polymer must grow in low-oxygen conditions each time the PCR is run. Thus, methanol-responsive polymer PCR (MeRPy-PCR) was suggested, which compensates for the lack in the streptavidin method [99]. In MeRPy-PCR, deoxyuridine (dU) is site-specifically added to the acrydite-modified primer. After amplification, methanol purification, and denaturation of the mobile strand, the primer strand is recovered by uracil-DNA glycosylase (UDG) or dimethylethylenediamine (DMEDA). This method allows dual-purification of both the primer and target strand. The ssDNA synthesized by the above method showed more than 70% yield for the target strand and 20% for the primer strand, with a negative correlation between the length and the yield. The SNAPCAR and MeRPy methods can be applied to DNA scaffold origami construction. In addition, a DNA targeting strategy was proposed using ssDNA based on MeRPy-PCR [100].

5. Summary and outlook

In this mini-review, we discuss the current use of enzyme-based DNA synthesis and amplification methods for generating a digital data storage medium. Although most of the current mechanisms are based on chemical phosphoramidite synthesis, researchers today use enzymatic synthesis to achieve an efficiency rate similar to or even higher than the current techniques. This can generate a user-specific data-encoding strand by utilizing homopolymer generation or synthesizing designed sequences. Furthermore, enzymatic replication of DNA allows selective retrieval of the desired data. Although most enzymatic DNA synthesis methods still need to be developed with such limitations, TdT is thought to be the most promising method. Enzyme kinetics and innate properties of TdT provide a favorable starting point for applications in practical enzymatic DNA synthesis techniques.

Several challenges remain in reducing the final error rate. It is necessary to prevent ssDNA from forming a secondary structure when it exceeds a certain length. In this context, engineered TdT has been shown to withstand the high temperature of 47 °C, while preserving enzyme activity and inhibiting secondary structure [101]. In addition, the synthesis of high GC content is difficult due to the formation of hydrogen bonds between guanine repeats. This problem can also be partially solved with a lower catalytic ion concentration when synthesized with TdT [101]. Moreover, researchers today aim to apply enzymatic methods for generating larger assemblies, such as genome-scale sequences. Technologies such as Gibson assembly [65], [102], yeast assembly [103], and Golden Gate Assembly [104] have been extensively used in these research areas. As the maximum length of synthesizable DNA increases, we expect technological breakthroughs in the amount of data that could be stored in DNA. Overall, there is no doubt that DNA-based storage systems will become a highly efficient technology for replacing traditional storage mediums, and DNA-synthesizing enzymes will be the major factor for improving the process.

CRediT authorship contribution statement

Eojin Yoo: Writing - original draft. Donghui Choe: Writing - original draft. Jongoh Shin: Writing - original draft. Suhyung Cho: Writing - original draft. Byung-Kwan Cho: Conceptualization, Supervision, Writing - original draft.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

Acknowledgments

This research was supported by the National Research Foundation (NRF) grant (2021R1A2C1012589 to S.C.) and the Bio & Medical Technology Development Program (2018M3A9F3079664 to B.-K.C.) through the National Research Foundation (NRF) funded by the Ministry of Science and ICT.

Author contributions

B.-K.C. conceived and supervised the study. E.Y., D.C., J.S., S.C., and B.-K.C. wrote the manuscript.

References

  • 1.Zhu Y., Zhong N., Xiong Y. Springer; Berlin Heidelberg: 2009. Data explosion, data nature and dataology; pp. 147–158. [Google Scholar]
  • 2.Church G.M., Gao Y., Kosuri S. Next-generation digital information storage in DNA. Science. 2012;337(6102):1628. doi: 10.1126/science.1226355. [DOI] [PubMed] [Google Scholar]
  • 3.Goldman N., Bertone P., Chen S., Dessimoz C., LeProust E.M., Sipos B. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature. 2013;494(7435):77–80. doi: 10.1038/nature11875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Organick L., Ang S.D., Chen Y.-J., Lopez R., Yekhanin S., Makarychev K. Random access in large-scale DNA data storage. Nat Biotechnol. 2018;36(3):242–248. doi: 10.1038/nbt.4079. [DOI] [PubMed] [Google Scholar]
  • 5.Lee H., Wiegand D.J., Griswold K., Punthambaker S., Chun H., Kohman R.E. Photon-directed multiplexed enzymatic DNA synthesis for molecular digital data storage. Nat Commun. 2020;11(1) doi: 10.1038/s41467-020-18681-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Grass R.N., Heckel R., Puddu M., Paunescu D., Stark W.J. Robust chemical preservation of digital information on DNA in silica with error-correcting codes. Angew Chem Int Ed Engl. 2015;54(8):2552–2555. doi: 10.1002/anie.201411378. [DOI] [PubMed] [Google Scholar]
  • 7.Choi Y., Ryu T., Lee A.C., Choi H., Lee H. Addition of Degenerate Bases to DNA-based Data Storage for Increased Information Capacity. Sci Rep. 2018;367052 doi: 10.1038/s41598-019-43105-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Yim S.S., McBee R.M., Song A.M., Huang Y., Sheth R.U., Wang H.H. Robust direct digital-to-biological data storage in living cells. Nat Chem Biol. 2021;17(3):246–253. doi: 10.1038/s41589-020-00711-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Beaucage S.L., Caruthers M.H. Deoxynucleoside phosphoramidite - A new class of key intermediates for deoxypolynucleotide synthesis. Tetrahedron Lett. 1981;22:1859–1862. [Google Scholar]
  • 10.Adams S.P., Kavka K.S., Wykes E.J., Holder S.B., Galluppi G.R. Hindered dialkylamino nucleoside phosphite reagents in the synthesis of two DNA 51-mers. J Am Chem Soc. 1983;105(3):661–663. [Google Scholar]
  • 11.Cheng J.Y., Chen H.H., Kao Y.S., Kao W.C., Peck K. High throughput parallel synthesis of oligonucleotides with 1536 channel synthesizer. Nucleic Acids Res. 2002;30 doi: 10.1093/nar/gnf092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hughes R.A., Ellington A.D. Synthetic DNA synthesis and assembly: putting the synthetic in synthetic biology. Cold Spring Harb Perspect Biol. 2017;9 doi: 10.1101/cshperspect.a023812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kosuri S., Church G.M. Large-scale de novo DNA synthesis: technologies and applications. Nat Methods. 2014;11(5):499–507. doi: 10.1038/nmeth.2918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Shivalingam A., Brown T. Synthesis of chemically modified DNA. Biochem Soc Trans. 2016;44:709–715. doi: 10.1042/BST20160051. [DOI] [PubMed] [Google Scholar]
  • 15.Agency USEP (2020). “Consolidated list of chemicals subject to the emergency planning and community right-to-know act (EPCRA), comprehensive environmental response, compensation and liability act (CERCLA) and section 112(r) of the clean air act.” Available: www.epa.gov/sites/production/files/2015-03/documents/list_of_lists.pdf. Accessed 2020 Nov 20.
  • 16.Tsunoda H., Kudo T., Ohkubo A., Seio K., Sekine M. Synthesis of oligodeoxynucleotides using fully protected deoxynucleoside 3'-phosphoramidite building blocks and base recognition of oligodeoxynucleotides incorporating N3-cyano-ethylthymine. Molecules. 2010;15:7509–7531. doi: 10.3390/molecules15117509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Septak M. Kinetic studies on depurination and detritylation of CPG-bound intermediates during oligonucleotide synthesis. Nucleic Acids Res. 1996;24:3053–3058. doi: 10.1093/nar/24.15.3053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.LeProust E.M., Peck B.J., Spirin K., McCuen H.B., Moore B. Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process. Nucleic Acids Res. 2010;38:2522–2540. doi: 10.1093/nar/gkq163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lenzer J. Arthur Kornberg. BMJ. 2008;336:50. [Google Scholar]
  • 20.Mitra S., Kornberg A. Enzymatic mechanisms of DNA replication. J Gen Physiol. 1966;49:59–79. doi: 10.1085/jgp.49.6.59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Briani F., Carzaniga T., Deho G. Regulation and functions of bacterial PNPase. Wiley Interdiscip Rev RNA. 2016;7:241–258. doi: 10.1002/wrna.1328. [DOI] [PubMed] [Google Scholar]
  • 22.Heppel L.A., Ortiz P.J., Ochoa S. Studies on polynucleotides synthesized by polynucleotide phosphorylase. I. Structure of polynucleotides with one type of nucleotide unit. J Biol Chem. 1957;229:679–694. [PubMed] [Google Scholar]
  • 23.Mackey J.K., Gilham P.T. New approach to the synthesis of polyribonucleotides of defined sequence. Nature. 1971;233:551–553. doi: 10.1038/233551a0. [DOI] [PubMed] [Google Scholar]
  • 24.Kaufmann G., Fridkin M., Zutra A., Littauer U.Z. Monofunctional substrates of polynucleotide phosphorylase. The monoaddition of 2'(3')-O-isovaleryl-nucleoside diphosphate to an initiator oligonucleotide. Eur J Biochem. 1971;24:4–11. doi: 10.1111/j.1432-1033.1971.tb19649.x. [DOI] [PubMed] [Google Scholar]
  • 25.Gillam S., Waterman K., Smith M. Enzymatic synthesis of oligonucleotides of defined sequence. Addition of short blocks of nucleotide residues to oligonucleotide primers. Nucleic Acids Res. 1975;2:613–624. doi: 10.1093/nar/2.5.613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Gilham S., Smith M. Enzymatic synthesis of deoxyribo-oligonucleotides of defined sequence. Nat New Biol. 1972;238:233–234. doi: 10.1038/newbio238233a0. [DOI] [PubMed] [Google Scholar]
  • 27.GILHAM SHIRLEY, SMITH MICHAEL. Enzymatic synthesis of deoxyribo-oligonucleotides of defined sequence. Properties of the enzyme. Nucleic Acids Res. 1972;238(86):233–234. doi: 10.1038/newbio238233a0. [DOI] [PubMed] [Google Scholar]
  • 28.Gillam S., Waterman K., Doel M., Smith M. Enzymatic synthesis of deoxyribo-oligonucleotides of defined sequence. Deoxyribo-oligonucleotide synthesis. Nucleic Acids Res. 1974;1:1649–1664. doi: 10.1093/nar/1.12.1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gillam S., Rottman F., Jahnke P., Smith M. Enzymatic synthesis of oligonucleotides of defined sequence: synthesis of a segment of yeast iso-1-cytochrome c gene. Proc Natl Acad Sci U S A. 1977;74:96–100. doi: 10.1073/pnas.74.1.96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Cardenas P.P., Carzaniga T., Zangrossi S., Briani F., Garcia-Tirado E. Polynucleotide phosphorylase exonuclease and polymerase activities on single-stranded DNA ends are modulated by RecN, SsbA and RecA proteins. Nucleic Acids Res. 2011;39:9250–9261. doi: 10.1093/nar/gkr635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Silber R., Malathi V.G., Hurwitz J. Purification and properties of bacteriophage T4-induced RNA ligase. Proc Natl Acad Sci U S A. 1972;69:3009–3013. doi: 10.1073/pnas.69.10.3009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kaufmann K.J., Dutton P.L., Netzel T.L., Leigh J.S., Rentzepis P.M. Picosecond kinetics of events leading to reaction center bacteriochlorophyll oxidation. Science. 1975;188:1301–1304. doi: 10.1126/science.188.4195.1301. [DOI] [PubMed] [Google Scholar]
  • 33.Mccoy M.I.M., Gumport R.I. T4 ribonucleic-acid ligase joins single-strand oligo(deoxyribonucleotides) Biochemistry. 1980;19:635–642. doi: 10.1021/bi00545a005. [DOI] [PubMed] [Google Scholar]
  • 34.Hinton D.M., Baez J.A., Gumport R.I. T4 RNA Ligase joins 2'-deoxyribonucleoside 3',5'-bisphosphates to oligodeoxyribonucleotides. Biochemistry-Us. 1978;17:5091–5097. doi: 10.1021/bi00617a004. [DOI] [PubMed] [Google Scholar]
  • 35.Hinton D.M., Gumport R.I. The synthesis of oligodeoxyribonucleotides using RNA ligase. Nucleic Acids Res. 1979;7:453–464. doi: 10.1093/nar/7.2.453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Schmitz C., Reetz M.T. Solid-phase enzymatic synthesis of oligonucleotides. Org Lett. 1999;1:1729–1731. doi: 10.1021/ol990240n. [DOI] [PubMed] [Google Scholar]
  • 37.Romaniuk E., McLaughlin L.W., Neilson T., Romaniuk P.J. The effect of acceptor oligoribonucleotide sequence on the T4 RNA ligase reaction. Eur J Biochem. 1982;125:639–643. doi: 10.1111/j.1432-1033.1982.tb06730.x. [DOI] [PubMed] [Google Scholar]
  • 38.Nick McElhinny S.A., Havener J.M., Garcia-Diaz M., Juárez R., Bebenek K. A gradient of template dependence defines distinct biological roles for family X polymerases in nonhomologous end joining. Mol Cell. 2005;19:357–366. doi: 10.1016/j.molcel.2005.06.012. [DOI] [PubMed] [Google Scholar]
  • 39.Bollum F.J. Thermal conversion of nonpriming deoxyribonucleic acid to primer. J Biol Chem. 1959;234:2733–2734. [PubMed] [Google Scholar]
  • 40.Delarue M., Boule J.B., Lescar J., Expert-Bezancon N., Jourdan N. Crystal structures of a template-independent DNA polymerase: murine terminal deoxynucleotidyltransferase. EMBO J. 2002;21:427–439. doi: 10.1093/emboj/21.3.427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Loc'h J., Gerodimos C.A., Rosario S., Tekpinar M., Lieber M.R. Structural evidence for an in trans base selection mechanism involving Loop1 in polymerase mu at an NHEJ double-strand break junction. J Biol Chem. 2019;294:10579–10595. doi: 10.1074/jbc.RA119.008739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lee H.H., Kalhor R., Goela N., Bolot J., Church G.M. Terminator-free template-independent enzymatic DNA synthesis for digital information storage. Nat Commun. 2019;10:2383. doi: 10.1038/s41467-019-10258-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chen Y., Podlevsky J.D., Logeswaran D., Chen J.J. A single nucleotide incorporation step limits human telomerase repeat addition activity. EMBO J. 2018;37 doi: 10.15252/embj.201797953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chen W., Guidotti G. Soluble apyrases release adp during ATP hydrolysis. Biochem Biophys Res Commun. 2001;282:90–95. doi: 10.1006/bbrc.2001.4555. [DOI] [PubMed] [Google Scholar]
  • 45.Wu J., Zhang S., Meng Q., Cao H., Li Z. 3'-O-modified nucleotides as reversible terminators for pyrosequencing. Proc Natl Acad Sci U S A. 2007;104:16462–16467. doi: 10.1073/pnas.0707495104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Efcavitch J.W., Sylvester J.E. Modified template-independent enzymes for polydeoxynucleotide synthesis. US10059929B2. 2015 [Google Scholar]
  • 47.Mathews A.S., Yang H., Montemagno C. Photo-cleavable nucleotides for primer free enzyme mediated DNA synthesis. Org Biomol Chem. 2016;14:8278–8288. doi: 10.1039/c6ob01371f. [DOI] [PubMed] [Google Scholar]
  • 48.Mathews AS, Yang H, Montemagno C (2017) 3'-O-caged 2'-deoxynucleoside triphosphates for light-mediated, enzyme-catalyzed, template-independent DNA synthesis. Curr Protoc Nucleic Acid Chem 71: 13.17.11– 13.17.38. [DOI] [PubMed]
  • 49.Stemple D.L., Mankowska S.A., Harvey S.A. Compositions and methods for template-free enzymatic nucleic acid synthesis. WO2018152323A1. 2018 [Google Scholar]
  • 50.Kool E.T. Active site tightness and substrate fit in DNA replication. Annu Rev Biochem. 2002;71:191–219. doi: 10.1146/annurev.biochem.71.110601.135453. [DOI] [PubMed] [Google Scholar]
  • 51.Gardner A.F., Wang J., Wu W., Karouby J., Li H. Rapid incorporation kinetics and improved fidelity of a novel class of 3'-OH unblocked reversible terminators. Nucleic Acids Res. 2012;40:7404–7415. doi: 10.1093/nar/gks330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Palluk S., Arlow D.H., de Rond T., Barthel S., Kang J.S. De novo DNA synthesis using polymerase-nucleotide conjugates. Nat Biotechnol. 2018;36:645–650. doi: 10.1038/nbt.4173. [DOI] [PubMed] [Google Scholar]
  • 53.Matange K., Tuck J.M., Keung A.J. DNA stability: a central design consideration for DNA data storage systems. Nat Commun. 2021;12:1358. doi: 10.1038/s41467-021-21587-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Chen W.D., Kohll A.X., Nguyen B.H., Koch J., Heckel R. Combining Data Longevity with High Storage Capacity—Layer-by-Layer DNA Encapsulated in Magnetic Nanoparticles. Adv Funct Mater. 2019;29:1901672. [Google Scholar]
  • 55.Kohll A.X., Antkowiak P.L., Chen W.D., Nguyen B.H., Stark W.J. Stabilizing synthetic DNA for long-term data storage with earth alkaline salts. Chem Commun (Camb) 2020;56:3613–3616. doi: 10.1039/d0cc00222d. [DOI] [PubMed] [Google Scholar]
  • 56.Anchordoquy T.J., Molina M.C. Preservation of DNA. Cell Preserv Technol. 2007;5:180–188. [Google Scholar]
  • 57.Cox J.P.L. Long-term data storage in DNA. Trends Biotechnol. 2001;19:247–250. doi: 10.1016/s0167-7799(01)01671-7. [DOI] [PubMed] [Google Scholar]
  • 58.Bornholt J., Lopez R., Carmean D.M., Ceze L., Seelig G. Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems Association for Computing Machinery. 2016. A DNA-based archival storage system; pp. 637–649. [Google Scholar]
  • 59.Gyllensten U.B., Erlich H.A. Generation of single-stranded DNA by the polymerase chain reaction and its application to direct sequencing of the HLA-DQA locus. Proc Natl Acad Sci U S A. 1988;85:7652–7656. doi: 10.1073/pnas.85.20.7652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Wilson R.K., Chen C., Hood L. Optimization of asymmetric polymerase chain reaction for rapid fluorescent DNA sequencing. Biotechniques. 1990;8:184–189. [PubMed] [Google Scholar]
  • 61.Citartan M., Tang T.H., Tan C., Hoe C.H., Saini R. Asymmetric PCR for good quality ssDNA generation towards DNA aptamer production. Songklanakarin Journal of Science and Technology. 2012;34:125–131. [Google Scholar]
  • 62.Heiat M., Ranjbar R., Latifi A.M., Rasaee M.J., Farnoosh G. Essential strategies to optimize asymmetric PCR conditions as a reliable method to generate large amount of ssDNA aptamers. Biotechnol Appl Biochem. 2017;64:541–548. doi: 10.1002/bab.1507. [DOI] [PubMed] [Google Scholar]
  • 63.Wooddell C.I., Burgess R.R. Use of asymmetric PCR to generate long primers and single-stranded DNA for incorporating cross-linking analogs into specific sites in a DNA probe. Genome Res. 1996;6:886–892. doi: 10.1101/gr.6.9.886. [DOI] [PubMed] [Google Scholar]
  • 64.Veneziano R., Ratanalert S., Zhang K., Zhang F., Yan H. Designer nanoscale DNA assemblies programmed from the top down. Science. 2016;352:1534. doi: 10.1126/science.aaf4388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Veneziano R., Shepherd T.R., Ratanalert S., Bellou L., Tao C. In vitro synthesis of gene-length single-stranded DNA. Sci Rep. 2018;8:6548. doi: 10.1038/s41598-018-24677-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Marimuthu C., Tang T.H., Tominaga J., Tan S.C., Gopinath S.C. Single-stranded DNA (ssDNA) production in DNA aptamer generation. Analyst. 2012;137:1307–1315. doi: 10.1039/c2an15905h. [DOI] [PubMed] [Google Scholar]
  • 67.He C.Z., Zhang K.H., Wang T., Wan Q.S., Hu P.P. Single-primer-limited amplification: a method to generate random single-stranded DNA sub-library for aptamer selection. Anal Biochem. 2013;440:63–70. doi: 10.1016/j.ab.2013.05.008. [DOI] [PubMed] [Google Scholar]
  • 68.Tolnai Z., Harkai A., Szeitner Z., Scholz E.N., Percze K. A simple modification increases specificity and efficiency of asymmetric PCR. Anal Chim Acta. 2019;1047:225–230. doi: 10.1016/j.aca.2018.10.017. [DOI] [PubMed] [Google Scholar]
  • 69.Kai E., Sumikura K., Ikebukuro K., Karube I. Purification of single stranded DNA from asymmetric PCR product using the SMART system. Biotechnol Tech. 1998;12:935–939. [Google Scholar]
  • 70.Sanchez J.A., Pierce K.E., Rice J.E., Wangh L.J. Linear-After-The-Exponential (LATE)–PCR: An advanced method of asymmetric PCR and its uses in quantitative real-time analysis. Proc Natl Acad Sci U S A. 2004;101:1933–1938. doi: 10.1073/pnas.0305476101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Tabarzad M., Kazemi B., Vahidi H., Aboofazeli R., Shahhosseini S. Challenges to design and develop of DNA aptamers for protein targets. I. Optimization of asymmetric PCR for generation of a single stranded DNA library. Iran J Pharm Res. 2014;13:133–141. [PMC free article] [PubMed] [Google Scholar]
  • 72.Venkatesan V., Hoti S.L., Kamaraj N., Ghosh S., Rajaram K. Optimisation of an asymmetric polymerase chain reaction assay for the amplification of single-stranded DNA from Wuchereria bancrofti for electrochemical detection. Mem Inst Oswaldo Cruz. 2013;108:804–807. doi: 10.1590/0074-0276108062013020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Kang J., Lee M.S., Gorenstein D.G. The enhancement of PCR amplification of a random sequence DNA library by DMSO and betaine: application to in vitro combinatorial selection of aptamers. J Biochem Biophys Methods. 2005;64:147–151. doi: 10.1016/j.jbbm.2005.06.003. [DOI] [PubMed] [Google Scholar]
  • 74.Schnoor M., Voß P., Cullen P., Böking T., Galla H.-J. Characterization of the synthetic compatible solute homoectoine as a potent PCR enhancer. Biochem Biophys Res Commun. 2004;322:867–872. doi: 10.1016/j.bbrc.2004.07.200. [DOI] [PubMed] [Google Scholar]
  • 75.Wei J., Gong X., Wang Q., Pan M., Liu X. Construction of an autonomously concatenated hybridization chain reaction for signal amplification and intracellular imaging. Chem Sci. 2017;9:52–61. doi: 10.1039/c7sc03939e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Fire A., Xu S.Q. Rolling replication of short DNA circles. Proc Natl Acad Sci U S A. 1995;92:4641–4645. doi: 10.1073/pnas.92.10.4641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Dean F.B., Nelson J.R., Giesler T.L., Lasken R.S. Rapid amplification of plasmid and phage DNA using Phi 29 DNA polymerase and multiply-primed rolling circle amplification. Genome Res. 2001;11:1095–1099. doi: 10.1101/gr.180501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Ali M.M., Li F., Zhang Z., Zhang K., Kang D.K. Rolling circle amplification: a versatile tool for chemical biology, materials science and medicine. Chem Soc Rev. 2014;43:3324–3341. doi: 10.1039/c3cs60439j. [DOI] [PubMed] [Google Scholar]
  • 79.Blanco L., Bernad A., Lazaro J.M., Martin G., Garmendia C. Highly efficient DNA synthesis by the phage phi 29 DNA polymerase. Symmetrical mode of DNA replication. J Biol Chem. 1989;264:8935–8940. [PubMed] [Google Scholar]
  • 80.Nelissen F.H., Goossens E.P., Tessari M., Heus H.A. Enzymatic preparation of multimilligram amounts of pure single-stranded DNA samples for material and analytical sciences. Anal Biochem. 2015;475:68–73. doi: 10.1016/j.ab.2015.01.014. [DOI] [PubMed] [Google Scholar]
  • 81.Ducani C., Kaul C., Moche M., Shih W.M., Hogberg B. Enzymatic production of 'monoclonal stoichiometric' single-stranded DNA oligonucleotides. Nat Methods. 2013;10:647–652. doi: 10.1038/nmeth.2503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Lizardi P.M., Huang X., Zhu Z., Bray-Ward P., Thomas D.C. Mutation detection and single-molecule counting using isothermal rolling-circle amplification. Nat Genet. 1998;19:225–232. doi: 10.1038/898. [DOI] [PubMed] [Google Scholar]
  • 83.Walker G.T., Little M.C., Nadeau J.G., Shank D.D. Isothermal in vitro amplification of DNA by a restriction enzyme/DNA polymerase system. Proc Natl Acad Sci U S A. 1992;89:392–396. doi: 10.1073/pnas.89.1.392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Walker G.T., Fraiser M.S., Schram J.L., Little M.C., Nadeau J.G. Strand displacement amplification–an isothermal, in vitro DNA amplification technique. Nucleic Acids Res. 1992;20:1691–1696. doi: 10.1093/nar/20.7.1691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Shi C., Liu Q., Ma C., Zhong W. Exponential strand-displacement amplification for detection of microRNAs. Anal Chem. 2014;86:336–339. doi: 10.1021/ac4038043. [DOI] [PubMed] [Google Scholar]
  • 86.Joneja A., Huang X. Linear nicking endonuclease-mediated strand-displacement DNA amplification. Anal Biochem. 2011;414:58–69. doi: 10.1016/j.ab.2011.02.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Hultman T., Stahl S., Hornes E., Uhlen M. Direct solid phase sequencing of genomic and plasmid DNA using magnetic beads as solid support. Nucleic Acids Res. 1989;17:4937–4946. doi: 10.1093/nar/17.13.4937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Pagratis N.C. Rapid preparation of single stranded DNA from PCR products by streptavidin induced electrophoretic mobility shift. Nucleic Acids Res. 1996;24:3645–3646. doi: 10.1093/nar/24.18.3645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Civit L., Fragoso A., O'Sullivan C.K. Evaluation of techniques for generation of single-stranded DNA for quantitative detection. Anal Biochem. 2012;431:132–138. doi: 10.1016/j.ab.2012.09.003. [DOI] [PubMed] [Google Scholar]
  • 90.Wilson R. Preparation of single-stranded DNA from PCR products with streptavidin magnetic beads. Nucleic Acid Ther. 2011;21:437–440. doi: 10.1089/nat.2011.0322. [DOI] [PubMed] [Google Scholar]
  • 91.Yuce M., Kurt H., Budak H. Characterization of a dual biotin tag for improved single stranded DNA production. Anal Methods. 2014;6:548–557. [Google Scholar]
  • 92.Pound E., Ashton J.R., Becerril H.A., Woolley A.T. Polymerase chain reaction based scaffold preparation for the production of thin, branched DNA origami nanostructures of arbitrary sizes. Nano Lett. 2009;9:4302–4305. doi: 10.1021/nl902535q. [DOI] [PubMed] [Google Scholar]
  • 93.Kao W.-S., Li H.-W. An Efficient Bead-captured Denaturation Method for Preparing Long Single-stranded DNA. J Chin Chem Soc. 2017;64:1065–1070. [Google Scholar]
  • 94.Kilili G.K., Tilton L., Karbiwnyk C.M. [Letter to the Editor] NaOH concentration and streptavidin bead type are key factors for optimal DNA aptamer strand separation and isolation. Biotechniques. 2016;61:114–116. doi: 10.2144/000114449. [DOI] [PubMed] [Google Scholar]
  • 95.Holmberg A., Blomstergren A., Nord O., Lukacs M., Lundeberg J. The biotin-streptavidin interaction can be reversibly broken using water at elevated temperatures. Electrophoresis. 2005;26:501–510. doi: 10.1002/elps.200410070. [DOI] [PubMed] [Google Scholar]
  • 96.Krieg E., Shih W.M. Selective nascent polymer catch-and-release enables scalable isolation of multi-kilobase single-stranded DNA. Angew Chem Int Ed Engl. 2018;57:714–718. doi: 10.1002/anie.201710469. [DOI] [PubMed] [Google Scholar]
  • 97.Rehman F.N., Audeh M., Abrams E.S., Hammond P.W., Kenney M. Immobilization of acrylamide-modified oligonucleotides by co-polymerization. Nucleic Acids Res. 1999;27:649–655. doi: 10.1093/nar/27.2.649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Damase T.R., Ellington A.D., Allen P.B. Purification of single-stranded DNA by co-polymerization with acrylamide and electrophoresis. Biotechniques. 2017;62:275–282. doi: 10.2144/000114557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Minev D., Guerra R., Kishi J.Y., Smith C., Krieg E. Rapid in vitro production of single-stranded DNA. Nucleic Acids Res. 2019;47:11956–11962. doi: 10.1093/nar/gkz998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Krieg E., Gupta K., Dahl A., Lesche M., Boye S. A smart polymer for sequence-selective binding, pulldown, and release of DNA targets. Commun Biol. 2020;3:369. doi: 10.1038/s42003-020-1082-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Barthel S., Palluk S., Hillson N.J., Keasling J.D., Arlow D.H. Enhancing terminal deoxynucleotidyl transferase activity on substrates with 3' terminal structures for enzymatic de novo DNA synthesis. Genes (Basel) 2020;11:102. doi: 10.3390/genes11010102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Gibson D.G., Young L., Chuang R.Y., Venter J.C., Hutchison C.A., 3rd Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods. 2009;6:343–345. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
  • 103.Gibson D.G., Benders G.A., Axelrod K.C., Zaveri J., Algire M.A. One-step assembly in yeast of 25 overlapping DNA fragments to form a complete synthetic Mycoplasma genitalium genome. Proc Natl Acad Sci U S A. 2008;105:20404–20409. doi: 10.1073/pnas.0811011106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Weber E., Engler C., Gruetzner R., Werner S., Marillonnet S. A modular cloning system for standardized assembly of multigene constructs. PLoS ONE. 2011;6 doi: 10.1371/journal.pone.0016765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Erlich Y., Zielinski D. DNA Fountain enables a robust and efficient storage architecture. Science. 2017;355:950–954. doi: 10.1126/science.aaj2038. [DOI] [PubMed] [Google Scholar]
  • 106.Blawat M., Gaedke K., Hütter I., Chen X.-M., Turczyk B. Forward error correction for DNA data storage. Procedia Comput Sci. 2016;80:1011–1022. [Google Scholar]
  • 107.Anavy L, Vaknin I, Atar O, Amit R, Yakhini Z (Unpublished results) Improved DNA based storage capacity and fidelity using composite DNA letters. bioRxiv doi: 10.1101/433524
  • 108.Choi Y., Ryu T., Lee A.C., Choi H., Lee H. High information capacity DNA-based data storage with augmented encoding characters using degenerate bases. Sci Rep. 2019;9:6582. doi: 10.1038/s41598-019-43105-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Tabatabaei Yazdi S.M.H., Yuan Y., Ma J., Zhao H., Milenkovic O. A rewritable, random-access DNA-based storage system. Sci Rep. 2015;5:14138. doi: 10.1038/srep14138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Quan J., Saaem I., Tang N., Ma S., Negre N. Parallel on-chip gene synthesis and application to optimization of protein expression. Nat Biotechnol. 2011;29:449–452. doi: 10.1038/nbt.1847. [DOI] [PubMed] [Google Scholar]

Articles from Computational and Structural Biotechnology Journal are provided here courtesy of Research Network of Computational and Structural Biotechnology

RESOURCES