Abstract
Programmable biomolecule-mediated computing is a new computing paradigm as compared to contemporary electronic computing. It employs nucleic acids and analogous biomolecular structures as information-storing and -processing substrates to tackle computational problems. It is of great significance to investigate the various issues of programmable biomolecule-mediated processors that are capable of automatically processing, storing, and displaying information. This Perspective provides several conceptual designs of programmable biomolecule-mediated processors and provides some insights into potential future research directions for programmable biomolecule-mediated processors.
Introduction
Whenever the word “computer” is mentioned, our intuition automatically associates it with an image of a monitor and keyboard, or various technical terms such as central processing unit (CPU), random access memory (RAM), and read-only memory (ROM). This is because we have grown accustomed to the concept of emulating computation through the use of devices commonly referred to as digital computers, which comprise an array of functional electronic components assembled on a silicon substrate. Since the introduction of the first digital computer in the early 1970s, improving its computational ability—processing speed, parallelism, minimization, and energy efficiency—has been the issue of most concern. To meet the ever-increasing demand for processing speed and parallelism, the size of individual transistor elements must be reduced. Doing so allows additional processing units to be packaged on the same silicon die; however, increasing the packaging density always brings problems, including increased power consumption and problematic heat dissipation issues. Moreover, the employment of silicon substrates as basic materials in the manufacture of digital computers always has a negative impact on health and the environment.1 Most importantly, the entire semiconductor industry is rapidly approaching the physical constraints predicted by Moore’s law.2 Moreover, current computer technology based on silicon materials and binary algorithms is restricted by further component miniaturization3 and operational speed.4
In reality, any device endowed with the three essential functions, namely, information processing, storing, and displaying, can be regarded as a computer.5 This ratiocination enables scholars from various disciplines to explore other potential alternatives to contemporary electronic digital computers. Among the various intriguing approaches, programmable biomolecule-mediated computing is a feasible method because of biomolecules’ appealing features as perfect nanomaterials, including their minuscule size, short structural repetition, and reasonable stiffness.6 As information storage media, the basic building blocks—deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or proteins—are extremely small in size, capable of encoding a single bit of information in a solution of about 1 nm3.7 Furthermore, reasonable stiffness6 and a quantitative understanding of biomolecular thermodynamics enhance the reliability of DNA as a data storage device.8 A biomolecule-mediated processor, as an information-processing unit, is capable of executing approximately 10 trillion calculations at a time.9 The reason for this efficiency comes from the fact that, unlike conventional silicon-based digital computers, biomolecule-mediated computing executes calculations in parallel.10 Most silicon-based digital computers work in a linear fashion, executing one task at a time and repeating similar operations. In contrast, biomolecule-mediated computation executes multiple types of operations simultaneously and stochastically. It is this adaptive intelligence and sophisticated parallel computing that enable synthetic biomolecule-mediated processors to effectively solve complex problems. By comparison, a contemporary silicon-based digital computer could take hundreds of years to achieve the same solution. Furthermore, biomolecule-mediated computing is more energy-efficient as compared to modern computers.
The interdisciplinary research on biomolecule-mediated computing is closely related to the progress in biomolecular engineering. With a better quantitative understanding of biomolecular thermodynamics, additional sophisticated biomolecular structures can be created to increase information storage stability as well as processing speed. In addition, improvements in the quality and rapidly falling costs of synthesizing nucleic acids, coupled with the development of functional enzymes and available laboratory techniques, have provided an additional impetus to the discipline. Nowadays, biomolecule-mediated computing is gradually shifting from the development of limited-function biomolecular devices toward the creation of conceptual models of general-purpose biomolecule-mediated logic gates, analogous to the evolutionary history of electronic digital computers. At the same time, operations are no longer restricted to in vitro manipulations. On the contrary, the current findings indicate that these experiments can be successfully implemented under sophisticated cellular conditions. In recent years, the entire field has grown rapidly, building an enormous technical barrier for upcoming scholars with different knowledge backgrounds. Therefore, this Perspective attempts to cover the development from early-stage, limited-function biomolecular models to general-purpose, biomolecule-mediated logic circuits. Moreover, the focus of this paper is to clarify the underlying logic behind each stage of development and gradually outline the challenges of interdisciplinary research.
Silicon-Based Computing
Nowadays, computers are immensely powerful and can execute millions of calculations per second. They are small in size and affordable for a great many people. It would be pretty astonishing to track the rate at which computers have evolved since the first generation of computers were manufactured around 1941. They were driven by electromechanical components, with instructions provided via punch cards. The second generation of computers were built between 1941 and 1950 through the use of vacuum tubes and capacitors. Vacuum tubes were used as switching elements that defined the state of a computer program. Capacitors enabled the computer to have a memory compartment where intermediate results were stored and fed back to the computing system. As a result, computers shrank in size from what once occupied an entire room to the space of a large desk.
In the 1950s, vacuum tubes were gradually replaced by transistors, giving way to the third generation of computers. Transistors had many significant advantages over vacuum tubes in computing because they were faster, smaller, less expensive, and more dependable. Transistors, along with other electronic components, were connected together on a semiconductor material, known as an integrated circuit (IC). The computer system on the IC that executed the program was known as the CPU. Previously, each CPU could host only one or a few functions. This meant that people had to manually switch between different ICs to adopt distinct functions. This was an inefficient way of computing. The problem was solved when ICs were manufactured that integrated most or all of their functions. This is the well-known microprocessor, which is now the core of modern fourth-generation computers. Since then, computers have become faster and more compact through the use of tiny transistors with advanced nanotechnology. However, according to Moore’s law, there is a limit to the small size of a transistor, as it approaches the size of a single atom.3
DNA-Mediated Computing
DNA-mediated computing can be very compact because DNA strands are exceedingly small (1 bit per cubic nanometer compared to 1 bit per 1012 nm3 for modern computers), giving it exciting potential. Computations are also amazingly fast due to parallel processing (1014 to 1020 operations per second compared to 108 to 1012 operations per second in modern computers). DNA-mediated computing is more energy-efficient than modern computers. An operation typified by a reaction between two DNA strands uses 5 × 10–20 Joules of energy, compared to 10–9 Joules in a silicon-based computer. It is worth mentioning that the study of DNA-mediated computing may also lead to a better understanding of a more complex computer—the human brain.11−13
DNA-mediated computing has several advantages over silicon-based computing. First, if the ligation of two molecules is considered as a single operation, then 1020 or more operations per second can be executed. In contrast, the fastest supercomputers can execute approximately 1 × 1012 operations per second. Second, according to the Gibbs free energy,14 1 Joule is sufficient for approximately 2 × 1019 ligation operations. For existing supercomputers, 109 operations per Joule are executed. Last, the information density of DNA is 1 bit per nm3. For videotapes, the density of information is approximately 1 bit per 1012 nm. It is worth mentioning that DNA data storage, as the simplest form of DNA-mediated processor, was listed as one of the top 10 emerging technologies in 2019 by the World Economic Forum’s annual list;15 however, the advantage of silicon-based computing lies in the diversity of operations and flexibility in which these operations can be applied. The major advantages and drawbacks of DNA-mediated computing relative to silicon-based computing are compared in Table 1.
Table 1. Silicon-Based Computing versus DNA-Mediated Computing.
Characteristics | Silicon-based | DNA-mediated |
---|---|---|
Information storage | 1 bit per 1012 nm3 | 1 bit per nm3 |
Processing speed | 108 to 1012 operations per second | 1014 to 1020 operations per second (ligation) |
Energy efficiency | 109 operations per Joule | 2 × 1019 operations per Joule |
Computing architecture | Effective for single operation; multiple cores of CPU for multiple operations at one time (up to six operations) | Ineffective for single operation; naturally effective for massive parallel operations |
In silicon-based digital computing, electronic logic gates are the basic components for analysts to execute computational operations. These electronic gates convert electronic signals into binary codes, which are understandable by silicon-based digital computers. Based on this working principle, silicon-based digital computers can execute diverse types of electronic logic gates to implement various tasks.
Compared with the history of evolving electronic computers, the development of DNA-mediated computers is still at an early stage. DNA-mediated computing is a new computing paradigm that utilizes artificially synthesized nucleic acids and/or analogous biomolecular structures as information-storing and -processing substrates to tackle computational problems. The entire field hinges on two important discoveries—the double helix structure of the DNA molecule in 1953,16 and the understanding of fundamental biomolecular mechanics in the late 1970s. An integrated symmetrical table for the genetic codes of life created by Shu17 shows that life formed due to symmetry (template) but evolved due to asymmetry (signature). All of the information encoding life is encapsulated within the tiny nucleus of the cell. This extremely condensed nature of DNA makes it ideal for scaling down computations. The specific Watson–Crick base pairings reflect the unique nature of digital information. DNA-mediated computing can be broadly divided into two categories: limited-function DNA algorithms and logic circuits. For the first approach, computation using sequentially encoded information is demonstrated by solving difficult mathematical problems. This approach follows the traditional dogma that, while DNA stores information about life, the digital information is simply encoded in Watson–Crick base pairings. Afterward, calculations are performed using DNA-manipulating enzymes. The second approach leverages Watson–Crick base pairings to define states rather than to store information. The specificity of the Watson–Crick base pairing, which restricts a single-strand DNA (ssDNA) sequence to hybridize only to its reverse complement, can be used to prime a reaction in the presence of a specific DNA input sequence. This reaction acts like a digital logic gate with two input states, “0” or “1”, corresponding to a DNA logic gate with a “match” or “mismatch” state with respect to the target input strand sequence, respectively.
DNA Algorithms
Ligation-Based System
The computational use of DNA molecules was demonstrated7 to solve the Hamiltonian path problem (Figure 1). Although the solution to the problem demonstrated is trivial and could be easily solved by hand, solving a non-deterministic polynomial–time complete (NP-complete) problem in vitro with DNA molecules is significant. NP-complete problems grow with the size of the problem, and it is challenging for computers to obtain a solution for large problems. An inherent advantage of utilizing DNA molecules for computing is the parallelism of chemical reactions. In this demonstration, approximately 1014 DNA molecules were ligated simultaneously, and the ligation of up to 1020 DNA molecules was possible by increasing the reaction volume. Compared with the supercomputer speed of 1012 operations per second in 1994, DNA offers 2–8-fold higher computation speed.
The technique has been extended to many other mathematical problems: Boolean satisfiability problem,10 addition problem,18 maximal clique problem,19 Chinese postman problem,20 maximum matching problem,21 traveling salesman problem,22 maximum cut problem,23 clustering problem,24 bin-packing problem,25 and assignment problem.26 However, all these implementations have limited capabilities, as they are restricted to solving certain classes of combinatorial optimization problems.
Sequence Editing System
In addition to ligation (joining), DNA strands can also be cleaved with restriction enzymes. This technique, when used in DNA-mediated computing devices, opens up many more possibilities in terms of computational complexity.27 In addition to setting the minimum conditions to be met, boundary conditions can also be specified (Figure 1D). Any DNA strands in solution beyond the boundaries can be simply cleaved and filtered out by employing gel electrophoresis. A programmable and automatic computing machine composed of biomolecules was built around this technique,28 in which encoded input strands were decoded through a series of loops. Within each loop, a portion of the strand was cleaved if it matched a restriction enzyme recognition site. This process was repeated until the input strand was cleaved to the end or no restriction sites were detected, and the decoded output was read using gel electrophoresis. In addition, the same technique was employed to solve the strategic assignment problem.26 Restriction enzymes were used to accelerate the process, as demonstrated by a schema-based DNA computing algorithm applicable for graphics processing units.29 The same principle has been employed in the formation of in vitro molecular machine-learning algorithms, opening the way for solving machine-learning problems.30 Besides restriction enzymes, the use of clustered regularly interspaced short palindromic repeats (CRISPR) allows for more precise cleavage of target sequences.31
DNA Logic Circuits
Limited function DNA algorithms, specified in the first category, maximize the advantages of DNA-mediated computing, namely, massively parallel computing capabilities; however, this application is restricted to certain categories of mathematical problems. The second category emphasizes the development of DNA logic circuits, which are assembled from a series of functional logic gates. Analogous to the case of contemporary digital computers, designing appropriate DNA logic circuits is believed to be a viable starting point for the development of general-purpose DNA-mediated computers. Logic circuits are assembled from a series of individual functional units called logic gates, which are capable of executing simple Boolean logic operations. Many other early-stage designs of DNA-mediated logic gates were summarized.27,32−34 In general, DNA logic circuit designs can be classified into two categories: tiling and DNA strand displacement (DSD).
Tiling System
The tiling system was used to emulate an early form of the Turing machine,35 in which programs were represented on magnetic tape.36 The Turing machine used symbols to provide a readout based on the order in which the holes were punched in the tape. A separate set of symbols was attained by moving the point where the machine started to read. The starting point was called the controller state, which together with the symbol was referred to as the configuration. Therefore, the configuration was changed by changing the controller state.
DNA self-assembled molecules, known as tiles, were used to represent symbols and controller states. A configuration was a row of tiles. To change the configuration, a new row of tiles was stacked on top of the initial row37 in a manner determined by the Wang tiles,38 which were square tiles with colored edges. This arrangement places edges of the same color side by side, creating an aperiodic pattern on a plane. A set of the seven Wang tiles, each with a unique combination of five selected colors, for emulating an exclusive disjunction (XOR) logic is shown in Figure 2. Each DNA tile contains a reporter strand that extends diagonally across the tile and ligates to the reporter strand of other connected tiles. To obtain the assembled outputs, a polymerase chain reaction (PCR) is performed with specific primers for each input (X1, X2, X3, X4) and output (Y1, Y2, Y3, Y4).39 Atomic force microscopy (AFM) can also be used to detect the physical location of tiles with overlapping distinct features.40 Beyond XOR logic, the tiling system was also used to perform other DNA-mediated logic gates39,41 and arithmetic calculations. The latter included counting,42 addition,40 multiplication,43 subtraction,40 and division,44 as well as elementary functions.45
Challenges for tiling systems include determining the minimum type of tiles required to generate a solution, how quickly the tiling can be assembled, and whether a solution can be successfully generated for nondeterministic computations.46 One of the biggest limitations of the tiling system is the writing and reading of information. The DNA strands in a tile are designed to be specific for self-assembly and binding to other tiles. This specificity increases the difficulty of designing strands for multiple tiles and is not suitable for multiple logical operations. After the sequence is designed, the DNA strands must be synthesized for self-assembly. Obtaining results from a tiling system is not immediate, as it requires separate DNA molecular techniques or AFM. These time-consuming constraints are further compounded by the irreversible nature of DNA ligation, limiting the costly synthesized DNA circuits to a single use. Resettable kinetics were demonstrated by applying an excess of ssDNA blocker to outcompete the input of the corresponding binding strand.47 This addresses the issue of wasted resources and enables logic circuits to be designed once and reused; however, the time and resource costs of the tiling system far outweigh the ability to perform complex computations.
DNA Strand Displacement System
Moving from the tiling system, energy-free and enzyme-free reactions are desirable features for DNA computing without a complex design. DSD was demonstrated48 to occur without additional energy input during natural hybridization and was employed49 to implement AND, OR, and NOT gates, signal restoration, amplification, feedback, and cascading circuits (Figure 3 top). Compared to the ligation and restriction enzyme method, DSD allows multi-stage computations; i.e., the ssDNA output calculated in the first step can be used as the input for the next step. Multilevel computation enables more complex functions to be performed. Other advantages of DSD include sensitivity, simplicity, programmability, and ability to use enzyme-free reactions. Since then, DSD has been applied in various fields,50−52 such as nanomachines, origami assembly, medical sensing, and diagnostics.53−55 The physical chemistry and various applications of DSD have been discussed.52 This section focuses on recent applications of DSD on DNA logic gates.
Logic gates are the building blocks for the functional design of a complex computing device. One of the hurdles in DNA computing is that logic circuits have to be designed de novo for an application. Standardized units were designed to mimic gene regulatory networks with feedforward and feedback modules (genelets) for simple kinetics of gene activation, activated by RNA repressors and coactivator strands.56 To create a modular unit with minimal crosstalk and noise, a hairpin clamp (HPC5) was designed at the input to bind only the RNA repressor and coactivator strands. Although the demonstrated logic is between two states (“ON” and “OFF”), with a simple reaction kinetics output, the potential advantage of a standardized module enables a quick and modular complete logic design; however, when compared to silicon circuits, DSD logic gates have ssDNA waste that must be managed through a constant flow of solutions. In a complex system where designing multiple unique toehold sequences is challenging, discarded DNA strands can interact with the toehold region and release the output strand (leakage), thereby reducing the sensitivity and reliability of the circuit (Figure 3C). Lowering the concentration helps reduce leakage but also slows down the reaction rate. Several efforts were done to reduce leakage by introducing error-correction methods;57 however, such approaches tend to introduce design complexity that reduces the use of DSD in complex logic applications, limiting DSD logic gates to simple and small networks compared to silicon digital circuits.
Cascading DSD leakage compensation is a counterbalance to the toehold principle, where the added complexity makes the unique toehold sequence design difficult. If multiple inputs are required, a multi-input DSD logic gate (Figure 4) was demonstrated to eliminate the need for cascading logic,58 performing multi-input AND logic on a single DNA unit. It uses a single DSD “logic gate” unit for multiple inputs rather than cascading multiple DSD units per input (Figure 3). This approach has two advantages: fewer nucleotides are synthesized in the same operation, and fewer waste DNA byproducts are produced, thus reducing leakage. Nonetheless, the fundamental requirement of unique toehold sequences still limits the computational scale of multi-input DSD. Still, one major advantage of DNA computing is parallelism, not computational complexity or depth, speeding up multi-input and massively parallel computing, such as the tic-tac-toe game demonstrated. Massive parallelism makes DNA, and in particular DSD computation, well-suited for emulating the nodes of artificial neural networks (ANNs), where each node has low computational complexity and is not restricted by a toehold (Figure 5). ANN, demonstrated with in silico-designed ssDNA,59 was shown to work with cancer-associated miR-2 and miR-31 microRNAs (miRNAs).60
In addition to parallel computing, DSD can be used in fields that require only simple computations using natural DNA rather than digital data as input. Circulating miRNA is a good biomarker for rapid early detection, prognosis, and prediction of cancer.61 DSD-based cancer diagnosis was demonstrated with addition, subtraction, and multiplication gates.55 The result of computation was expressed as fluorescence intensity or small-molecule release.62 This has some potential applications where the specificity of single or multiple miRNA biomarkers can be detected to release therapeutic molecules in vivo. DSD does not require additional proteins or enzymes for computing. Hence, it can be applied in vivo without introducing exogenous proteins into the subject, which has potential applications in biosensing.63,64 Beyond cancer, DSD has also been developed for viral RNA detection, such as Zika virus.65 Combined with DSD to perform isothermal RNA amplification of target viral RNA for detection, the entire process can be performed on paper, reducing the time, cost, and expertise of nucleic acid detection.
Conclusion
In recent years, the research focus of DNA-mediated computing has gradually been shifting from the development of limited-function DNA algorithms to the demonstration of practical DNA-mediated logic gates. Because the specificity of the Watson–Crick base pairs mirrors the distinct nature of digital information, the use of DNA for computation is attractive. DNA computing can be achieved through two main approaches: computing using sequential information encoding and computing using sequential encoding states. Both approaches have natural limitations.
Hence, any processing or computation is initially done by synthesizing the designed sequence (input) followed by specific DNA enzymes such as ligase, restriction enzyme, or CRISPR (computation). As the information is stored in the sequence itself, gel electrophoresis or sequencing is required to extract the results. Although this approach has the highest storage density and parallel processing potential, the net speed of computation is limited by the synthesis and sequencing throughput and cost. Computing with enzymes also has the drawback that enzymes can affect subsequent logical operators by becoming a “noise signal”. So far, there is no workable way to perfectly remove this “noise signal” from the solution.
As such, the DNA sequence does not store information but is used to match a specific input. The entire toehold region encodes only two states; thus, it is not as information-dense as the first approach. Nevertheless, it avoids the introduction of enzymes (“noise signals”) because its input signals, logical operators, and output signals are all nucleic acids. Therefore, such logic circuits are always referred to as enzyme-free logic circuits. Unfortunately, the potential of DSD-mediated logic circuits for complex computation is extremely limited. This is because the process of toehold DSD is slow. In addition, a three-level logic circuit consisting of eight logic gates contains 130 different oligonucleotides. It is hard to imagine that there might be sufficient nucleic acids on this planet to construct a general-purpose programmable DNA-mediated processor. Additionally, the DSD process generates waste DNA, which must be managed during the computational process. While the two approaches can be different in working principle, both share the same limitations. Some of the limitations of biomolecule-mediated computing devices together with challenges to the discipline are as follows:
Large-Scale Limitations
DNA computing theoretically offers massively parallel computing capabilities with high information density; however, it is unlikely to reach experimental limitations, because the number of nucleic acid templates required to form a complete database grows exponentially with problem size. The example of the traveling salesman problem is used to demonstrate the experimental limitations of algorithms of this nature, with respect to the number of vertices. Suppose the traveling salesman problem contains 62 cities. The goal is to determine the shortest path through which a salesperson can pass through all cities exactly once. There are 62 cities, and the number of paths is 62! ≈ 3.15 × 1085. Even with the latest supercomputers, it takes a long time to render the distance of all the paths and determine the solution to the problem (i.e., shortest path).
On the other hand, based on the proposed biomolecule-mediated computing device,22 to activate the signal, 2015 unique DNA strands must be designed and synthesized; however, designing a large number of long DNA strands, with the same melting temperature and avoiding undesired mismatches, is always a challenge. Another, more critical issue is that it is hardly feasible to render a solution to this problem due to the substantial number of DNA strands that need to be synthesized first. Mathematically speaking, to have 2015 unique DNA strands would require each strand to have a minimum length of 142 bp. Assuming that each vertex (city) and potential path of the traveling salesman problem is represented by a DNA stand with a length of 150 bp, a complete path, passing through all cities exactly once, might be 18,450 bp in length. The length of the resultant DNA strand is close to the upper limit of agarose gel electrophoresis, which simply means that it may take an excessive amount of time to extract it from the mixture. In addition, approximately 6.37 × 1067 kg of DNA is required to ensure that the solution pool contains at least 100 copies of the DNA strands encoding the solution path.
Irreversibility
In the majority of the proposed biomolecule-mediated computing devices, the DNA strands encoding the solution to the problem can be collected at the end of the process; however, the obtained DNA strands cannot be reused for computational purposes. It specifies that the information processing conducted by the proposed conceptual models is irreversible. At this stage, although the DNA template cannot be recycled, the amount of DNA synthesized (after signal activation) can be manually duplicated by using PCR prior to the formation of the database. By doing this, the cost of running experiments can be confined to a reasonable level. As a trade-off, additional time must be invested in the duplication process, and the duration can be really extensive as the scale of the problem increases. The development of reconfigurable computing units47 is necessary for practical computing devices, and even more so for DNA-mediated computers, because synthesis and sequencing are costly and time-consuming to produce.
Human and Experimental Errors
Biomolecule-mediated computing devices are still in the initial stages of development; thus, a majority of the processes implemented by the individual functional units are not yet fully automated. Therefore, the determination of the correct answer to the selected problem is highly dependent on manual operations. Manual intervention is likely to always lead to unavoidable human and experimental errors, including problematic nucleic acid template design, improper laboratory setup (solution temperature and time for each process), and solution contamination, which can lead to highly error-prone biomolecule-mediated computing devices in practice.
Problematic nucleic acid template design means that design motifs utilizing a high proportion of DNA sequences are remarkably similar to each other. This is likely to cause mismatches in the non-complementary DNA strands and lead to incorrect answers to questions. Such problems can be minimized by deliberately selecting the sequence and length of the DNA, as long as a set of criteria is followed.66,67 On the other hand, the implementation of the design protocol may significantly limit the number of DNA templates available, thus limiting the scale of the problem that the proposed device is able to tackle. Alternatively, to prevent mismatches of non-complementary DNA strands, a feedback system was employed.68 Inappropriate laboratory setup and solution contamination can be avoided by fully automating the entire process, from database formation to signal display.
Interface
As with any computing device, the interface plays a significant role in its usability. For DNA to perform any computation task, the problem is transformed from the digital domain into sequences (for the first approach). Although there are software packages that can design questions and convert them into sequences, the biggest limitation, in terms of time and cost, remains the synthesis of DNA. Using photonics such as fluorescence can eliminate the need to sequence simple output computations.69 For multi-dimensional output, the introduction of surface-based techniques into DNA-mediated computing provides additional flexibility in the design of output representations.70,71 Interestingly, for complex computations like the path problem, the interface is limited to digital computers, which are necessary for everything from in silico design and translation of mathematical problems to sequencing to reading results.
However, this limitation is, in turn, an advantage for computing in biological environments, allowing direct biological interfaces. The ability to receive RNA as input has potential in diagnostics and biosensing. Cancer miRNAs can be detected without the use of PCR,55 opening up a new field of testing methods. More crucially, the direct biological interface allows in vivo computation to be combined with the direct release of therapeutic molecules62 or RNA signals in vivo. Simple in vivo DSD logic can be applied to high-risk patients for in vivo sensing of diagnostic or predictive cancer biomarkers72 and trigger-reporting signals or initial stage small-molecule therapy. Higher sensitivity can only be achieved when multiple biomarkers are present in an AND logic and within predefined threshold limits.59 Several studies have demonstrated the feasibility of cellular computation.73,74 The future of biomolecule-mediated computing may not be a replacement for silicon, but the use of the inherent advantages of programmable biomolecule-mediated computing to interface with native DNA or in vivo is ideally suited to unlock new medical technologies.
Acknowledgments
This research is supported by the Ministry of Education, Singapore, under its Academic Research Fund Tier 1 (RG75/20).
The authors declare no competing financial interest.
References
- Chepesiuk R. Where the chips fall: Environmental health in the semiconductor industry,. Environ. Health Perspect. 1999, 107 (9), A452–A457. 10.1289/ehp.99107a452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore G. E. Cramming more components onto integrated circuits. Electronics 1965, 38, 114–117. [Google Scholar]
- Fuechsle M.; Miwa J. A.; Mahapatra S.; Ryu H.; Lee S.; Warschkow O.; Hollenberg L. C. L.; Klimeck G.; Simmons M. Y. A single-atom transistor. Nat. Nanotechnol. 2012, 7 (4), 242–246. 10.1038/nnano.2012.21. [DOI] [PubMed] [Google Scholar]
- Katz E. Biocomputing-Tools, aims, perspectives. Curr. Opin. Biotechnol. 2015, 34, 202–208. 10.1016/j.copbio.2015.02.011. [DOI] [PubMed] [Google Scholar]
- Danchin A. Bacteria as computers making computers. FEMS Microbiol. Rev. 2009, 33 (1), 3–26. 10.1111/j.1574-6976.2008.00137.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seeman N. C. DNA in a material world. Nature 2003, 421 (6921), 427–431. 10.1038/nature01406. [DOI] [PubMed] [Google Scholar]
- Adleman L. M. Molecular computation of solutions to combinatorial problems. Science 1994, 266 (5187), 1021–1024. 10.1126/science.7973651. [DOI] [PubMed] [Google Scholar]
- Church G. M.; Gao Y.; Kosuri S. Next-generation digital information storage in DNA. Science 2012, 337 (6102), 1628–1628. 10.1126/science.1226355. [DOI] [PubMed] [Google Scholar]
- Dong Y. F.; Dong C.; Wan F.; Yang J.; Zhang C. Development of DNA computing and information processing based on DNA-strand displacement. Sci. China-Chem. 2015, 58 (10), 1515–1523. 10.1007/s11426-015-5373-2. [DOI] [Google Scholar]
- Lipton R. J. DNA solution of hard computational problems. Science 1995, 268 (5210), 542–545. 10.1126/science.7725098. [DOI] [PubMed] [Google Scholar]
- Condon A. DNA and the brain. Nature 2011, 475 (7356), 304–305. 10.1038/475304a. [DOI] [PubMed] [Google Scholar]
- Shu J.-J.; Wang Q.-W.; Yong K.-Y.; Shao F.; Lee K. J. Programmable DNA-mediated multitasking processor. J. Phys. Chem. B 2015, 119 (17), 5639–5644. 10.1021/acs.jpcb.5b02165. [DOI] [PubMed] [Google Scholar]
- Wong J. R.; Lee K. J.; Shu J.-J.; Shao F. Magnetic fields facilitate DNA-mediated charge transport. Biochemistry 2015, 54 (21), 3392–3399. 10.1021/acs.biochem.5b00295. [DOI] [PubMed] [Google Scholar]
- Gibbs J. W. A method of geometrical representation of the thermodynamic properties of substances by means of surfaces. Trans. Connecticut Acad. Arts Sci. 1873, 2 (2), 382–404. [Google Scholar]
- DiChristina M.; Meyerson B. S. Top 10 emerging technologies of 2019. Sci. Am. 2019, 321 (6), 26–37. 10.1038/scientificamerican1219-26. [DOI] [PubMed] [Google Scholar]
- Watson J. D.; Crick F. H. C. Molecular structure of nucleic acids-A structure for deoxyribose nucleic acid. Nature 1953, 171 (4356), 737–738. 10.1038/171737a0. [DOI] [PubMed] [Google Scholar]
- Shu J.-J. A new integrated symmetrical table for genetic codes. BioSystems 2017, 151, 21–26. 10.1016/j.biosystems.2016.11.004. [DOI] [PubMed] [Google Scholar]
- Guarnieri F.; Fliss M.; Bancroft C. Making DNA add. Science 1996, 273 (5272), 220–223. 10.1126/science.273.5272.220. [DOI] [PubMed] [Google Scholar]
- Ouyang Q.; Kaplan P. D.; Liu S. M.; Libchaber A. DNA solution of the maximal clique problem. Science 1997, 278 (5337), 446–449. 10.1126/science.278.5337.446. [DOI] [PubMed] [Google Scholar]
- Yin Z. X.; Zhang F. Y.; Xu J. A Chinese postman problem based on DNA computing. J. Chem. Inf. Comput. Sci. 2002, 42 (2), 222–224. 10.1021/ci010046r. [DOI] [PubMed] [Google Scholar]
- Wang S. Y. DNA computing of bipartite graphs for maximum matching. J. Math. Chem. 2002, 31 (3), 271–279. 10.1023/A:1020808521487. [DOI] [Google Scholar]
- Lee J. Y.; Shin S.-Y.; Park T. H.; Zhang B.-T. Solving traveling salesman problems with DNA molecules encoding numerical values. BioSystems 2004, 78 (1–3), 39–47. 10.1016/j.biosystems.2004.06.005. [DOI] [PubMed] [Google Scholar]
- Xiao D. M.; Li W. X.; Zhang Z. Z.; He L. Solving maximum cut problems in the Adleman-Lipton model. BioSystems 2005, 82 (3), 203–207. 10.1016/j.biosystems.2005.06.009. [DOI] [PubMed] [Google Scholar]
- Bakar R. B. A.; Watada J.; Pedrycz W. DNA approach to solve clustering problem based on a mutual order. BioSystems 2008, 91 (1), 1–12. 10.1016/j.biosystems.2007.06.002. [DOI] [PubMed] [Google Scholar]
- Sanches C. A. A.; Soma N. Y. A polynomial-time DNA computing solution for the bin-packing problem. Appl. Math. Comput. 2009, 215 (6), 2055–2062. 10.1016/j.amc.2009.07.051. [DOI] [Google Scholar]
- Shu J.-J.; Wang Q.-W.; Yong K.-Y. DNA-based computing of strategic assignment problems. Phys. Rev. Lett. 2011, 106 (18), 188702. 10.1103/PhysRevLett.106.188702. [DOI] [PubMed] [Google Scholar]
- Miyamoto T.; Razavi S.; DeRose R.; Inoue T. Synthesizing biomolecule-based Boolean logic gates. ACS Synth. Biol. 2013, 2 (2), 72–82. 10.1021/sb3001112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benenson Y.; Paz-Elizur T.; Adar R.; Keinan E.; Livneh Z.; Shapiro E. Programmable and autonomous computing machine made of biomolecules. Nature 2001, 414 (6862), 430–434. 10.1038/35106533. [DOI] [PubMed] [Google Scholar]
- Hwang K.-S.; Chen Y.-J.; Jiang W.-C. A schema-based DNA algorithm applicable on graphics processing units. IEEE Access 2016, 4, 2498–2506. 10.1109/ACCESS.2016.2569590. [DOI] [Google Scholar]
- Lee J.-H.; Lee S. H.; Baek C.; Chun H.; Ryu J.-H.; Kim J.-W.; Deaton R.; Zhang B.-T. In vitro molecular machine learning algorithm via symmetric internal loops of DNA. BioSystems 2017, 158, 1–9. 10.1016/j.biosystems.2017.04.005. [DOI] [PubMed] [Google Scholar]
- Zhang J. Y.; Liu C. C. CRISPR-powered DNA computing and digital display. ACS Synth. Biol. 2021, 10 (11), 3148–3157. 10.1021/acssynbio.1c00431. [DOI] [PubMed] [Google Scholar]
- Stojanovic M. N.; Mitchell T. E.; Stefanovic D. Deoxyribozyme-based logic gates. J. Am. Chem. Soc. 2002, 124 (14), 3555–3561. 10.1021/ja016756v. [DOI] [PubMed] [Google Scholar]
- Balzani V.; Credi A.; Venturi M. Molecular logic circuits. ChemPhysChem 2003, 4 (1), 49–59. 10.1002/cphc.200390007. [DOI] [PubMed] [Google Scholar]
- Hockenberry A. J.; Jewett M. C. Synthetic in vitro circuits. Curr. Opin. Chem. Biol. 2012, 16 (3–4), 253–259. 10.1016/j.cbpa.2012.05.179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Turing A. M. On computable numbers, with an application to the Entscheidungsproblem. Proc. London Math. Soc. 1937, 42, 230–265. 10.1112/plms/s2-42.1.230. [DOI] [Google Scholar]
- Fujibayashi K.; Hariadi R.; Park S. H.; Winfree E.; Murata S. Toward reliable algorithmic self-assembly of DNA tiles: A fixed-width cellular automaton pattern. Nano Lett. 2008, 8 (7), 1791–1797. 10.1021/nl0722830. [DOI] [PubMed] [Google Scholar]
- Winfree E.; Liu F. R.; Wenzler L. A.; Seeman N. C. Design and self-assembly of two-dimensional DNA crystals. Nature 1998, 394 (6693), 539–544. 10.1038/28998. [DOI] [PubMed] [Google Scholar]
- Wang H. Proving theorems by pattern recognition II. Bell System Tech. J. 1961, 40 (1), 1–41. 10.1002/j.1538-7305.1961.tb03975.x. [DOI] [Google Scholar]
- Mao C. D.; LaBean T. H.; Reif J. H.; Seeman N. C. Logical computation using algorithmic self-assembly of DNA triple-crossover molecules. Nature 2000, 407 (6803), 493–496. 10.1038/35035038. [DOI] [PubMed] [Google Scholar]
- Tandon A.; Song Y.; Mitta S. B.; Yoo S.; Park S.; Lee S.; Raza M. T.; Ha T. H.; Park S. H. Demonstration of arithmetic calculations by DNA tile-based algorithmic self-assembly. ACS Nano 2020, 14 (5), 5260–5267. 10.1021/acsnano.0c01387. [DOI] [PubMed] [Google Scholar]
- Carbone A.; Seeman N. C. Circuits and programmable self-assembling DNA structures. Proc. Natl. Acad. Sci. U.S.A. 2002, 99 (20), 12577–12582. 10.1073/pnas.202418299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barish R. D.; Rothemund P. W. K.; Winfree E. Two computational primitives for algorithmic self-assembly: Copying and counting. Nano Lett. 2005, 5 (12), 2586–2592. 10.1021/nl052038l. [DOI] [PubMed] [Google Scholar]
- Brun Y. Arithmetic computation in the tile assembly model: Addition and multiplication. Theor. Comput. Sci. 2007, 378 (1), 17–31. 10.1016/j.tcs.2006.10.025. [DOI] [Google Scholar]
- Zhang X. C.; Wang Y. F.; Chen Z. H.; Xu J.; Cui G. Z. Arithmetic computation using self-assembly of DNA tiles: Subtraction and division. Prog. Nat. Sci.-Mater. Int. 2009, 19 (3), 377–388. 10.1016/j.pnsc.2008.07.013. [DOI] [Google Scholar]
- Raza M. T.; Tandon A.; Park S.; Lee S.; Nguyen T. B. N.; Vu T. H. N.; Jo S.; Nam Y.; Jeon S.; Jeong J.-H.; Park S. H. Demonstration of elementary functions via DNA algorithmic self-assembly. Nanoscale 2021, 13 (46), 19376–19384. 10.1039/D1NR05055A. [DOI] [PubMed] [Google Scholar]
- Brun Y. Solving NP-complete problems in the tile assembly model. Theor. Comput. Sci. 2008, 395 (1), 31–46. 10.1016/j.tcs.2007.07.052. [DOI] [Google Scholar]
- Kang H.; Lin T.; Xu X. J.; Jia Q.-S.; Lakerveld R.; Wei B. DNA dynamics and computation based on toehold-free strand displacement. Nat. Commun. 2021, 12 (1), 4994. 10.1038/s41467-021-25270-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yurke B.; Turberfield A. J.; Mills A. P.; Simmel F. C.; Neumann J. L. A DNA-fuelled molecular machine made of DNA. Nature 2000, 406 (6796), 605–608. 10.1038/35020524. [DOI] [PubMed] [Google Scholar]
- Seelig G.; Soloveichik D.; Zhang D. Y.; Winfree E. Enzyme-free nucleic acid logic circuits. Science 2006, 314 (5805), 1585–1588. 10.1126/science.1132493. [DOI] [PubMed] [Google Scholar]
- Krishnan Y.; Simmel F. C. Nucleic acid based molecular devices. Angew. Chem., Int. Ed. 2011, 50 (14), 3124–3156. 10.1002/anie.200907223. [DOI] [PubMed] [Google Scholar]
- Shu J.-J.; Wang Q.-W.; Yong K.-Y. Programmable DNA-mediated decision maker. Int. J. Bio-Inspired Comput. 2017, 10 (1), 51–55. 10.1504/IJBIC.2017.085334. [DOI] [Google Scholar]
- Simmel F. C.; Yurke B.; Singh H. R. Principles and applications of nucleic acid strand displacement reactions. Chem. Rev. 2019, 119 (10), 6326–6369. 10.1021/acs.chemrev.8b00580. [DOI] [PubMed] [Google Scholar]
- Picuri J. M.; Frezza B. M.; Ghadiri M. R. Universal translators for nucleic acid diagnosis. J. Am. Chem. Soc. 2009, 131 (26), 9368–9377. 10.1021/ja902490x. [DOI] [PubMed] [Google Scholar]
- de Murieta I. S.; Rodríguez-Patón A. DNA biosensors that reason. BioSystems 2012, 109 (2), 91–104. 10.1016/j.biosystems.2012.02.005. [DOI] [PubMed] [Google Scholar]
- Zhang C.; Zhao Y. M.; Xu X. M.; Xu R.; Li H. W.; Teng X. Y.; Du Y. Z.; Miao Y. Y.; Lin H.-C.; Han D. Cancer diagnosis with DNA molecular computation. Nat. Nanotechnol. 2020, 15 (8), 709–715. 10.1038/s41565-020-0699-0. [DOI] [PubMed] [Google Scholar]
- Schaffter S. W.; Chen K.-L.; O’Brien J.; Noble M.; Murugan A.; Schulman R. Standardized excitable elements for scalable engineering of far-from-equilibrium chemical networks. Nat. Chem. 2022, 14 (11), 1224–1232. 10.1038/s41557-022-01001-3. [DOI] [PubMed] [Google Scholar]
- Wang B. Y.; Thachuk C.; Ellington A. D.; Winfree E.; Soloveichik D. Effective design principles for leakless strand displacement systems. Proc. Natl. Acad. Sci. U.S.A. 2018, 115 (52), E12182–E12191. 10.1073/pnas.1806859115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen X.; Liu X. Y.; Wang F.; Li S. R.; Chen C. Z.; Qiang X. L.; Shi X. L. Massively parallel DNA computing based on domino DNA strand displacement logic gates. ACS Synth. Biol. 2022, 11 (7), 2504–2512. 10.1021/acssynbio.2c00270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qian L. L.; Winfree E.; Bruck J. Neural network computation with DNA strand displacement cascades. Nature 2011, 475 (7356), 368–372. 10.1038/nature10262. [DOI] [PubMed] [Google Scholar]
- Okumura S.; Gines G.; Lobato-Dauzier N.; Baccouche A.; Deteix R.; Fujii T.; Rondelez Y.; Genot A. J. Nonlinear decision-making with enzymatic neural networks. Nature 2022, 610 (7932), 496–501. 10.1038/s41586-022-05218-7. [DOI] [PubMed] [Google Scholar]
- Xie Z.; Wroblewska L.; Prochazka L.; Weiss R.; Benenson Y. Multi-input RNAi-based logic circuit for identification of specific cancer cells. Science 2011, 333 (6047), 1307–1311. 10.1126/science.1205527. [DOI] [PubMed] [Google Scholar]
- Morihiro K.; Ankenbruck N.; Lukasak B.; Deiters A. Small molecule release and activation through DNA computing. J. Am. Chem. Soc. 2017, 139 (39), 13909–13915. 10.1021/jacs.7b07831. [DOI] [PubMed] [Google Scholar]
- Benenson Y.; Gil B.; Ben-Dor U.; Adar R.; Shapiro E. An autonomous molecular computer for logical control of gene expression. Nature 2004, 429 (6990), 423–429. 10.1038/nature02551. [DOI] [PubMed] [Google Scholar]
- Benenson Y. Synthetic biology with RNA: Progress report. Curr. Opin. Chem. Biol. 2012, 16 (3–4), 278–284. 10.1016/j.cbpa.2012.05.192. [DOI] [PubMed] [Google Scholar]
- Pardee K.; Green A. A.; Takahashi M. K.; Braff D.; Lambert G.; Lee J. W.; Ferrante T.; Ma D.; Donghia N.; Fan M.; Daringer N. M.; Bosch I.; Dudley D. M.; O’Connor D. H.; Gehrke L.; Collins J. J. Rapid, low-cost detection of Zika virus using programmable biomolecular components. Cell 2016, 165 (5), 1255–1266. 10.1016/j.cell.2016.04.059. [DOI] [PubMed] [Google Scholar]
- Deaton R.; Garzon M.; Murphy R. C.; Rose J. A.; Franceschetti D. R.; Stevens S. E. Reliability and efficiency of a DNA-based computation. Phys. Rev. Lett. 1998, 80 (2), 417–420. 10.1103/PhysRevLett.80.417. [DOI] [Google Scholar]
- Tanaka F.; Kameda A.; Yamamoto M.; Ohuchi A. Design of nucleic acid sequences for DNA computing based on a thermodynamic approach. Nucleic Acids Res. 2005, 33 (3), 903–911. 10.1093/nar/gki235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang C.-N.; Huang K.-S.; Yang C.-B.; Hsu C.-Y. Error-tolerant minimum finding with DNA computing. Int. J. Innovative Computing Information Control 2009, 5 (10A), 3045–3057. [Google Scholar]
- Saghatelian A.; Völcker N. H.; Guckian K. M.; Lin V. S.-Y.; Ghadiri M. R. DNA-based photonic logic gates: AND, NAND, and, INHIBIT. J. Am. Chem. Soc. 2003, 125 (2), 346–347. 10.1021/ja029009m. [DOI] [PubMed] [Google Scholar]
- Smith L. M.; Corn R. M.; Condon A. E.; Lagally M. G.; Frutos A. G.; Liu Q. H.; Thiel A. J. A surface-based approach to DNA computation. J. Comput. Biol. 1998, 5 (2), 255–267. 10.1089/cmb.1998.5.255. [DOI] [PubMed] [Google Scholar]
- Liu Q. H.; Wang L. M.; Frutos A. G.; Condon A. E.; Corn R. M.; Smith L. M. DNA computing on surfaces. Nature 2000, 403 (6766), 175–179. 10.1038/35003155. [DOI] [PubMed] [Google Scholar]
- Madhavan D.; Cuk K.; Burwinkel B.; Yang R. X. Cancer diagnosis and prognosis decoded by blood-based circulating microRNA signatures. Front. Genet. 2013, 4, 116. 10.3389/fgene.2013.00116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Green A. A.; Kim J. M.; Ma D.; Silver P. A.; Collins J. J.; Yin P. Complex cellular logic computation using ribocomputing devices. Nature 2017, 548 (7665), 117–121. 10.1038/nature23271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choi S.; Lee G.; Kim J. Cellular computational logic using toehold switches. Int. J. Mol. Sci. 2022, 23 (8), 4265. 10.3390/ijms23084265. [DOI] [PMC free article] [PubMed] [Google Scholar]