Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2014 Apr 29;11(5):499–507. doi: 10.1038/nmeth.2918

Large-scale de novo DNA synthesis: technologies and applications

Sriram Kosuri 1,, George M Church 2,3
PMCID: PMC7098426  PMID: 24781323

This Review discusses large-scale de novo DNA synthesis via oligos or arrays, describes gene assembly and error correction and considers applications for large-scale DNA synthesis.

Subject terms: Genetic engineering, Synthetic biology

Abstract

For over 60 years, the synthetic production of new DNA sequences has helped researchers understand and engineer biology. Here we summarize methods and caveats for the de novo synthesis of DNA, with particular emphasis on recent technologies that allow for large-scale and low-cost production. In addition, we discuss emerging applications enabled by large-scale de novo DNA constructs, as well as the challenges and opportunities that lie ahead.

Main

DNA is the predominant information carrier for life. The development of synthetic techniques to construct DNA has led to marked improvements in our ability to understand and engineer biology. For example, despite extensive efforts to unravel the genetic code using molecular genetics, modest capabilities to synthesize nucleic acids ultimately led to the code's unraveling1. Today, reconstructions of complete viral and bacterial genomes are testaments of how far our synthetic capabilities have come.

Despite the improvements, our ability to read DNA is better than our ability to write it. Over the last decade, high-throughput sequencing technologies, here referred to as next-generation sequencing (NGS), have revolutionized the discovery and understanding of natural DNA sequence, with current installed capacity estimated at ∼15 petabases per year2. Large-scale data-sharing initiatives, such as GenBank, and continued improvements in bioinformatics software have made computational analyses on these data easier than ever. Such analyses help generate powerful statistical hypotheses for how genome sequence controls cellular functions across organisms and populations. In addition, NGS-based measurement tools allow for the analysis of many genetic and biochemical processes at unprecedented scale and low cost3. However, even though our ability to both generate hypotheses and measure outcomes has increased in scale owing to NGS, our ability to test such hypotheses experimentally still lags and is among the most limiting steps in the study of natural and engineered biology. Specifically, a designed DNA construct is a physical instance of a hypothesis to be tested, whether it be a simple plasmid-based reporter or a whole-genome synthesis of an organism. Progress in large-scale, low-cost construction of desired DNA sequences could rapidly engender progress in both fundamental and applied biological research.

Although rapid modification of natural DNA sequence both in vitro and in vivo is useful for a variety of purposes, methodologies for de novo synthesis of DNA from nucleosides confer a number of unique advantages. First, engineering new functions often requires vastly modified or wholly new genetic sequences that are most easily accessed by de novo synthesis methodologies. Second, synthesized constructs are often superior to natural sequence for the study of genetic mechanisms because they can be designed to specifically test hypotheses for how sequence affects function. Finally, sequences that are targeted to be amplified or modified from natural sequences can be difficult to access (for example, from metagenomic data sets); thus, synthesis is the only practical way to experimentally study them.

Here we review technological innovations and applications for de novo DNA synthesis as distinct from assembly and modification of natural DNA sequence. We cover large-scale single-stranded DNA oligonucleotide (oligo) synthesis, assembly of these oligos into longer double-stranded DNA constructs, and emerging applications (Fig. 1).

Figure 1. Lengths and costs of different oligo and gene synthesis technologies.

Figure 1

Commercial oligo synthesis from traditional vendors (pink) and array-based technologies (brown) are plotted according to commonly available length scales and price points. Costs of gene synthesis from commercial providers for cloned, sequence-verified genes (dark green) and unpurified DNA assemblies (light green) are shown, as are costs of gene synthesis from oligo pools (blue) derived from academic reports39,40.

Oligo synthesis

Oligo synthesis has a long history, beginning in academic labs in the 1950s, followed by automation and commercialization in the 1980s and progressing into high-throughput array-based methods in the 1990s. This history has been extensively reviewed4, and we will mostly cover current approaches here to understand their advantages and trade-offs and their effect on downstream gene synthesis processes.

Column-based oligo synthesis. The first synthetic oligos were reported in the 1950s by Todd, Khorana and their coworkers, who used phosphodiester5, H-phosphonate6 and phosphotriester7 approaches. Today, the dominant chemistry for oligo synthesis occurs in automated instruments employing solid-phase phosphoramidite chemistry first developed by Marvin Caruthers in the 1980s8 (Fig. 2). Phosphoramidite-based oligo synthesis most commonly consists of a four-step cycle that adds bases one at a time to a growing oligo chain attached to a solid support. First, a dimethoxytrityl (DMT)-protected nucleoside phosphoramidite that is attached to a solid support is deprotected by removal of the DMT using trichloroacetic acid. Second, a new DMT-protected phosphoramidite is coupled to the 5′ hydroxyl group of the growing oligo chain to form a phosphite triester. Third, a capping step acetylates any remaining unreacted 5′ hydroxyl groups, making the unreacted oligo chains inert to further nucleoside additions, helping to alleviate deletion errors. Fourth, an iodine oxidation converts the phosphite to a phosphate, producing a cyanoethyl-protected phosphate backbone. The DMT protecting group is removed to allow the cycle to continue. This detritylation step is usually monitored to track coupling efficiencies as individual bases are added. After all nucleosides are added in series from 3′ to 5′, the completed oligo is removed from the solid support, and protecting groups on bases and the phosphate backbone are removed.

Figure 2. Phosphoramidite chemistry.

Figure 2

The four-step synthetic oligo synthesis is the most commonly used chemistry for the production of DNA oligos.

This automated process usually synthesizes 96–384 oligos simultaneously at scales from 10 to 100 nmol. Over the years, improvements in raw materials, automation, processing and purification have enabled routine synthesis of up to ∼100 nt at costs of ∼$0.05–0.15 per nucleotide with error rates of ∼1 in 200 nt or better. The limits on length and error rates of this process are due to a few major reasons. First, the yield for each step in the synthetic cycle must be very high, especially for the production of long oligos. For example, even 99% yield from each turn of the cycle will result in 13% final yield for a 200-nt oligo synthesis. In addition, depurination, particularly of adenosine, can occur during acidic detritylation and becomes particularly problematic in the production of long oligos9,10,11. During the final removal of protecting groups from the bases and phosphate backbone, these abasic sites lead to cleavages that reduce the yield of long-length oligos. Finally, even successfully synthesized oligos contain appreciable errors12,13. The dominant errors in purified oligos are single-base deletions that result from either failure to remove the DMT or combined inefficiencies in the coupling and capping steps. Newer chemistries and improved processes continue to arise and will further augment oligo length and quality4.

Array-based oligo synthesis. Starting in the early 1990s, Affymetrix developed methods for spatially localized polymer synthesis on surfaces using light-activated chemistries, which paved the way for the development of DNA microarrays14,15. They used standard mask-based photolithographic techniques to selectively deprotect photolabile nucleoside phosphoramidites. Today, several technologies coexist to make spatially decoupled DNA microarrays. Maskless procedures (used, for example, by NimbleGen and LC Sciences) greatly simplified photolithographic techniques using programmable micromirror devices—similar to those found in modern-day digital projectors—to direct the light-based chemistries16,17. Ink-jet–based printing of nucleotides on an arrayed surface (as Agilent uses) allowed for oligo synthesis using standard phosphoramidite chemistries18,19,20. In addition, CombiMatrix (now CustomArray) developed semiconductor-based electrochemical acid production to selectively deprotect nucleosides21. Many other promising extensions and variations in microfluidic and microarray syntheses have been reported but are yet to become widely available or commercialized22. The use of microarray-derived oligos, whereby all the oligos synthesized from an array are cleaved and harvested as one 'oligo pool', has become increasingly popular as a cheap source of designed oligos. The scales, lengths and error rates vary greatly between vendors, but, to date, Agilent Technologies and CustomArray have provided oligos used in most recent publications (see “Emerging applications for large de novo DNA synthesis” below). Oligos produced from microarrays are 2–4 orders of magnitude cheaper than column-based oligos, with costs ranging from $0.00001–0.001 per nucleotide, depending on length, scale and platform.

Gene synthesis

Small sets of oligos (usually 5–50 oligos) provide the raw substrate for constructing larger synthetic fragments (usually 200–3,000 bp) via a variety of methods collectively termed gene synthesis ('gene' refers to 'gene length' rather than the classic genetic definition). The first synthetic genes were short (80–200 bp); Gobind Khorana's group used T4 DNA ligase to seal chemically synthesized oligos together23,24. In ligation-based approaches, complementary overlapping strands are enzymatically joined to produce larger fragments. Initially this was done sequentially, but higher-quality oligos, use of thermostable ligases to improve hybridization stringency25, and methods to produce and select for circular DNA26 have allowed for 'one-pot' production of gene-length fragments. Polymerase cycling assembly (PCA)-based techniques use polymerase to extend overlapping oligos into a double-stranded fragment by cycling in a non-exponential process27. Both ligation and PCA approaches usually rely on PCR to isolate and amplify full-length from partially assembled fragments and are often used in combination. More recently, Gibson and colleagues developed both in vivo28,29 and in vitro30,31 one-step protocols for assembling and cloning oligos directly into plasmid backbones. All of these protocols have been iteratively improved and underlie most academic and commercial gene synthesis efforts and have been reviewed elsewhere22,32,33,34. Finally, because oligo synthesis and assembly techniques are prone to errors, gene-length fragments are often cloned and sequence verified, which can substantially add to the final cost.

There are advantages and disadvantages of each approach. High-stringency ligation-based syntheses reduce error rates because sequences with errors are less likely to hybridize and ligate. However, as both top and bottom strands need to be synthesized and oligos require phosphorylated 5′ ends, oligo costs are higher. PCA-based approaches can rely on overlapping regions of only 15–25 nt, thus allowing for fewer oligos per gene synthesized, but suffer from higher error rates owing to lack of hybridization-based error filtration. Also, because PCA-based approaches can contain regions encoded by only a single oligo, targeted diversity can be introduced into these locations. For both methods, sequences containing high GC content and secondary structure can inhibit assembly owing to misannealing and loss in ligation stringency. Also, because PCR is used as a final step to amplify constructs from partial assemblies, the lengths of synthetic genes generated from these methods are usually kept <5 kb for reliability; better assembly techniques exist for larger assemblies (see below).

Array-based gene synthesis. Even though microarray-based oligo pools are cheap, there are several challenges in using them for gene synthesis. First, although the numbers of oligos that can be produced in a pool are large, their individual concentrations are quite low for most existing gene synthesis protocols. Second, the error rates for oligo pools are usually higher than those for column-synthesized oligos. Finally, the sheer number of oligos produced leads to interference between gene assemblies, making it difficult to scale up.

Tian et al.35 were the first to show how these problems could be overcome. They used PCR amplification to increase the concentration of the oligos before assembly, error-corrected them by hybridization to reverse complementary oligos (also constructed on the chip), and designed protein sequences computationally to avoid potential mishybridizations of the sequences. However, this work and contemporaneous reports36,37 used only dozens to hundreds of oligos at once. Scaling these methods proved difficult beyond pool sizes of 1,000 oligos38. At greater pool complexities, where the advantages in cost would come to play, constructing any individual gene became difficult, presumably owing to spurious cross-hybridization during the assembly process. In addition, the methods required sufficient sequence orthogonality between synthesized genes, which limited potential applications. To alleviate these and other issues, two approaches were used that first isolated subpools of oligos required for any single assembly, thereby overcoming the concerns about both pool complexity and sequence orthogonality (Fig. 3). Kosuri et al.39 used predesigned barcodes that allowed PCR amplification of oligos participating in only a particular assembly and then removed the barcodes by digestions, which was followed by standard assembly of the genes. Quan et al.40 used a custom ink-jet synthesizer20 that synthesized subsets of oligos in physically separated microwells, where amplification and assembly were then done in situ. Both methods used much larger oligo pools (>10,000 oligos) and enzymatic error correction, which paved the way for commercialization in recent years (Gen9). Finally, two reports for using one-pot assembly of libraries of genes directly from large pools have been attempted, but these have been limited to joining one or two oligos simultaneously and suffer from large differences in dynamic range and the inability to make sequences that are similar to one another41,42.

Figure 3. Different strategies for dealing with microarray oligo complexities.

Figure 3

Top, Kosuri et al.39 use amplification of barcoded subpools by PCR (thus eliminating background complexity), remove the barcode sequences and then assemble the genes. Bottom, Quan et al.40 use a custom synthesizer to print oligos needed for any assembly into separate micropatterned wells. Leveraging the spatial separation that enables microarray synthesis, they then amplify and assemble these genes within the microwells themselves.

Cloning, error correction and verification. Once synthetic genes have been assembled, they contain a mixture of perfect and imperfect sequences resulting from both oligo synthesis and assembly errors (Fig. 4). Usually, synthesized genes are cloned into plasmids in Escherichia coli or yeast and then sequence verified by Sanger sequencing. These steps are expensive, time consuming and difficult to automate. Thus, reducing the number of clones required to get a perfect sequence is paramount. To help alleviate synthesis errors, early approaches fused synthetic protein-coding sequences in frame with a selectable marker encoding antibiotic resistance or a fluorescence marker43,44. Because single-base deletions will most likely result in a frameshift, and thus loss of activity, it serves as a useful error-correction method but is limited to only protein-coding sequences.

Figure 4. Comparison of reported error rates from error-correction techniques.

Figure 4

The error rates are included along with the indicated oligo source and error-correction methodology. When starting error rates were unreported, we estimated the error rates on the basis of the oligo sources and assembly method. Open circles denote starting error rates; filled circles denote error rates of assembled genes (two filled circles denote error rates before and after error correction). ssDNA, single-stranded DNA; dsDNA, double-stranded DNA; Column, column-synthesized oligos; Array, microarray-based oligo pools; Hyb, oligo hybridization–based error correction; Seq, NGS-based error correction; Lig, high-temperature ligation/hybridization–based error correction; Nuclease, nuclease-based error correction.

More general methods of error correction usually depend upon a number of enzymatic techniques to reduce errors. All of these techniques rely on the fact that at any given position, most molecules possess the correct base. Heating and reannealing can force the formation of heteroduplexes that will contain disruptions to the canonical helical DNA structure. Such disruptions can be recognized and acted upon by several proteins. MutS binds heteroduplexes and can be used to filter errors by reverse purification13. Certain polymerases with exonuclease activity, endonucleases and resolvases can all cut or nick at such heteroduplex sites and, upon reamplification, can help filter errors12,26,45,46,47. Commercial enzymatic cocktails such as ErrASE have been commonly used to help reduce errors in synthetic genes as well30,39. Such error reduction can greatly reduce the cost and time of gene synthesis by bringing error rates low enough that the genes can be directly used for functional assays without in vivo cloning and sequence verification30,48.

More recent approaches have leveraged NGS technologies to screen and then select for perfect sequences at either the oligo or gene level. Matzas et al.49 used 454 sequencing (Roche) combined with a robotic pick-in-place pipette to mechanically pull reads with perfect sequence off the sequencing array and used these oligos for synthetic genes. Kim et al. also used 454 but marked individual molecules with random tags and then amplified constructs with perfect sequence42. Both approaches help reduce error rates, although they still suffer from sequencing errors and require more expensive long-read platforms. Schwartz et al.41 used Illumina sequencing of barcodes to pull out perfect sequences but overcame limitations in length and sequencing errors through a tag-based consensus-sequencing approach. All three approaches substantially cut error rates and will continue to improve with the rapid progress in DNA sequencing technologies. These NGS-based error-correction approaches are also especially exciting for library-based constructions and synthesis methods because they allow correction without having to separate each gene assembly into individual reactions.

Larger DNA assemblies. Methods to produce larger assemblies from combinations of sequence-verified de novo–synthesized or amplified gene-length fragments have seen rapid advances50. Both commercial and academic systems now allow combinations and libraries to be formed at high efficiency, fidelity and reliability for reasonably low reaction costs. Today, seamless assembly methods that do not leave behind scars at assembly junctions—including ligase cycling reaction51, Gibson assembly52, seamless ligation cloning extract53, yeast assembly54,55, circular polymerase extension cloning56, sequence- and ligation-independent cloning57, Golden Gate58 and others—are routinely used and automated both in academic and industrial settings. Most of these methods can also be used for the generation of large, multicomponent libraries with little extra effort. Thus, for de novo synthesis applications, the cost and errors associated with generating gene-sized fragments dominate over those required for constructing larger DNA assemblies.

Emerging applications for large de novo DNA synthesis

Molecular tools. In 2004, one of the first and largest uses of synthetic DNA was in the development of human and mouse short hairpin RNA libraries59. Using Agilent oligo pools, researchers synthesized 447,410 short hairpin RNAs targeting all human and mouse genes, which in total comprised ∼44 Mb of de novo sequence. Improved lengths and NGS-based screening approaches have greatly expanded both the usage and applicability of such libraries. Now oligo pools are also being used routinely for targeted capture and resequencing of exons and other genomic regions of interest60,61,62,63,64 as well as to study genetic regulatory mechanisms such as genome-wide CpG methylation65,66, RNA editing67 and allele-specific expression68. Another interesting use was the creation of a human peptidome phage-display library by Larman et al.69,70 (413,611 peptides using ∼58 Mb of DNA) for identifying autoimmune targets from patient samples. This same group also constructed a rationally designed human antibody library using oligo pools that were optimized for NGS analysis71. Similar phage-display methods were used to profile the interaction of PDZ domains against all known human and viral proteins' C termini72. Warner et al.73 used oligo pools to construct genome-wide barcoded knockout and overexpression libraries in E. coli to facilitate selections for traits of interest. Recently, two groups have leveraged clustered, regularly interspaced, short palindromic repeats (CRISPR)-mediated gene targeting combined with large oligo pools to construct comprehensive, pooled and barcoded knockout libraries in human cell lines74,75. These molecular tools have all been directly enabled by the availability of microarray-based oligo pools, and we can only expect more in the coming years,with improvements in length, quality and the scale of such libraries.

Understanding and engineering regulatory elements. Microarray-based oligo libraries have also been used to help uncover the structure and quantitative effects of regulatory elements that drive expression. In one of the earliest examples, Patwardhan et al.76 used an oligo pool of 18,492 synthetic promoter mutants for bacteriophage and mammalian Pol II core promoters with corresponding barcodes for NGS readout to quantitate expression differences and map important bases for core activity in these promoters. Later, Schlabach et al.77 used 52,429 oligos designed to contain arrays of transcription factor binding sites to screen for synthetic strong promoters that work in a variety of human cell lines. Recent efforts focused on understanding the structural and functional characteristics of thousands of cis-regulatory sequences governing transcriptional, translational and other regulatory processes in mammalian, yeast and bacterial systems78,79,80,81,82,83,84,85,86,87. Over the coming years, NGS-based methodologies that are developed to measure transcription, translation, epigenetics, splicing and other gene regulatory phenomena will also be used to analyze synthetic libraries. The goal is to understand which sequences are responsible for causal changes to these processes and how we can use them to engineer new functionalities.

Protein engineering. Protein engineering has always benefited from improvements in synthetic capabilities such as DNA shuffling, site-directed mutagenesis and low-cost gene synthesis. De novo synthesis, however, provides a more powerful tool to engineer new protein functions by taking advantage of computational design and metagenomic information. For example, Bayer et al.88 synthesized 89 methyl halide transferase enzymes found in metagenomic sequences from diverse organisms and showed large improvements in enzymatic activities. As another example, Kudla et al.89 and Quan et al.40 constructed and characterized libraries of reporter genes (154 and 1,468 genes, respectively) to study codon usage. More recently, our group has used oligo pools combined with multiplexed reporter assays to construct >14,000 reporter constructs and measured their transcriptional and translational rates to understand how N-terminal codon bias affects protein expression78. Finally, the development of deep mutational scanning techniques to measure structure-function relationships in multiplex will enable rapid characterization of large designed synthetic gene libraries as synthesis methods improve90,91,92,93,94,95.

Genetic refactoring. To better understand and engineer particular genetic systems, researchers have begun to redesign and de novo synthesize these systems with orthogonal, well-defined gene sequences and control elements. Through refactoring, researchers hope to define and include known elements in a pathway while simultaneously disrupting any unknown control elements; this may serve as a better starting point for improvement or transplantation of these genetic systems. Early work resynthesized the first ∼11 kb of bacteriophage T7 with a refactored surrogate that separated and defined individual genes and control elements and showed that the resultant phage was viable96. More recent bacteriophage genome refactorings have helped improve biological understanding and usefulness97,98. Temme et al.99 extended these approaches to refactor the Klebsiella oxytoca nitrogen-fixation 20-gene cluster in E. coli. They removed noncoding sequences, eliminated non-essential genes, removed transcription factors, randomized codons and placed all the genes into seven operons with synthetic regulatory elements governing transcription and translation. The refactored system reconstituted functionality, albeit at reduced production levels. Improvements in the design and automated assembly of these refactored segments allowed reconstitution to wild-type production levels. Finally, Lajoie et al.48 synthesized sequence-orthogonal variants of 42 E. coli essential genes using DNA microarrays and selected for function in order to explore the limits of genetic recoding. Again, such studies can powerfully explore regulatory requirements of genetic sequences but require currently expensive de novo synthesis methodologies and would greatly benefit from lower-cost gene synthesis.

Engineered genetic networks and metabolic pathways. Many researchers in synthetic biology are focused on building and optimizing genetic networks to control cellular behavior and metabolic pathways for chemical production100. Although many of these efforts are focused on assembling already existing DNA in myriad combinations, de novo synthesis is still an important mainstay and will become increasingly so as we improve our ability to design and measure the effects of such assembled pathways. For instance, when building large, multicomponent systems, the number of orthogonal components becomes limiting. Large-scale studies of hundreds to tens of thousands of regulatory elements such as promoters, ribosome-binding sites and transcriptional terminators in E. coli usually use de novo synthesis of designs culled from both natural and designed sequences79,101,102,103,104. The Voigt lab has also applied synthetic metagenomics for part mining to find libraries of orthogonal repressors (73 synthetic genes)105 and transcription factors (62 synthetic genes)106. Thus, as the genetic networks and pathways of engineered systems in synthetic biology get larger and studies move to new organisms, there will be increasing reliance on de novo DNA synthesis to generate requisite system components.

Whole-genome syntheses. De novo synthesis of genomes offers the promise of complete control of an organism's genetic code. Owing to the compact size of viruses and their importance in health and biotechnology, there has been tremendous progress in viral genomic reconstructions. Most synthetic reconstructions have been of RNA viruses by chemical synthesis of the required cDNA. In 2002, Eckard Wimmer's group first generated infectious poliovirus from synthetic reconstruction of its full cDNA107. Since then, dozens of RNA viruses have been chemically reconstructed—including the 1918 Spanish influenza108, the likely coronavirus progenitors to severe acute respiratory syndrome109 and many others110,111,112,113,114,115,116,117,118,119,120—for purposes of viral attenuation, historical reconstructions, vaccine development and viral genomic studies. Several DNA-based bacteriophages have been synthesized de novo as well110,121. Beyond viral genomes, over a series of studies, the Venter Institute designed, built, assembled and transplanted a fully synthetic bacterial genome to encode a viable organism50. Such efforts are only increasing. For example, the design, synthesis and viability of synthetically designed yeast chromosomal arms was shown by Dymond et al.122, and work on a fully synthetic yeast genome is ongoing.

DNA nanotechnology. As a chemical polymer, DNA has several unique properties that make it intriguing. First, the compact helical form and simple base-pairing rules of double-stranded DNA allow us to consider DNA as a technology to reliably position atoms in three-dimensional (3D) space at nanometer resolutions. The emergence of DNA origami123 and single-stranded tiles to form complex 2D and 3D shapes124,125,126 has been used by researchers to tackle problems from materials127 to therapeutics128. Base-pairing and strand-invasion properties of DNA have also allowed researchers to explore interesting information processing and computational capabilities using small libraries of oligos129,130,131,132. Finally, direct encoding of digital information into DNA sequence has recently been shown to outpace most other technologies for data density in three dimensions133,134. We are still in the early stages of this field, but harnessing advances in oligo-pool synthesis for such applications20,135 will allow researchers to test orders of magnitude more designs and hypotheses.

Future developments

Given the requisite investments, what is the cost of gene synthesis that we might expect to attain? Today, the cost of gene synthesis is on the same order as the cost of column-synthesized oligos used in their assembly. If gene synthesis transitioned to array-based oligos, there are no prima facie reasons why costs could not fall 3–5 orders of magnitude to be on par with the cost of oligo pools ($1 per 103–105 bp). The benefits would likely be as dramatic as productivity gains due to NGS, because testing genetic hypotheses would become as simple as the design and analyses allow them to be. However, the large private investments that drove massive drops in the costs of integrated circuits and DNA sequencing were largely motivated by the reasonable expectation for their broad-based consumer-level uses: a processor in every pocket and a genome sequence for every person136. Whereas potentially larger markets stand to benefit from cheap gene synthesis, including those of agriculture, chemicals, enzymes, materials and medicine, synthetic DNA serves only as a research tool for the ultimate product (with the possible exception of DNA nanotechnologies).

Can larger-scale synthetic biology efforts help increase demand sufficiently to spur investments? Even in academic research labs, the downstream cost of testing individual biological constructs for function is often far more expensive than the costs of the synthetic constructs themselves. Thus, reduction in gene synthesis costs will not tremendously affect the throughput and scale of current experimental workflows. However, the types of experiments conducted might also significantly change. One data point to consider occurred a decade ago when microarrays were first leveraged for cheap oligo pools. Although initial reports used these pools as plug-in replacements for column-synthesized oligos, researchers quickly adapted to this increased synthetic capacity, using powerful bioinformatics tools to design large libraries of synthetic oligos and NGS-based multiplexed assays to measure their functional consequences simultaneously. This has recently led to many fruitful experiments at scales that only a few years ago would have been unimaginable for an individual investigator. Likewise, cheap gene synthesis will likely change how we use synthetic genes through the development of powerful design tools for libraries of genes, pathways and genomes as well as cheap, multiplexed assays to measure or select for their function. Such new experimental paradigms could engender far greater use of synthetic genes than is currently imagined today. The initial progress described in this Review warrants optimism and hopefully enough demand and investment to bring about large advances in our ability to design, build, test and analyze biological hypotheses and designs.

Competing interests

S.K. and G.M.C. own stock in and are on the Scientific Advisory Board of Gen9, a company that sells synthetic genes. G.M.C. is on the Board of Directors of Sigma-Aldrich and the Scientific Advisory Board of Cambrian Genomics, both companies that sell synthetic genes or oligos.

References

  • 1.Nirenberg MW, Matthaei JH. The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. Proc. Natl. Acad. Sci. USA. 1961;47:1588–1602. doi: 10.1073/pnas.47.10.1588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Schatz MC, Phillippy AM. The rise of a digital immune system. Gigascience. 2012;1:4. doi: 10.1186/2047-217X-1-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shendure J, Lieberman Aiden E. The expanding scope of DNA sequencing. Nat. Biotechnol. 2012;30:1084–1094. doi: 10.1038/nbt.2421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Roy S, Caruthers M. Synthesis of DNA/RNA and their analogs via phosphoramidite and H-phosphonate chemistries. Molecules. 2013;18:14268–14284. doi: 10.3390/molecules181114268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Michelson AM, Todd AR. Nucleotides part XXXII. Synthesis of a dithymidine dinucleotide containing a 3′: 5′-internucleotidic linkage. J. Chem. Soc. 1955;1955:2632–2638. doi: 10.1039/JR9550002632. [DOI] [Google Scholar]
  • 6.Hall RH, Todd A, Webb RF. 644. Nucleotides. Part XLI. Mixed anhydrides as intermediates in the synthesis of dinucleoside phosphates. J. Chem. Soc. 1957;1957:3291–3296. doi: 10.1039/jr9570003291. [DOI] [Google Scholar]
  • 7.Khorana HG, Razzell WE, Gilham PT, Tener GM, Pol EH. Syntheses of dideoxyribonucleotides. J. Am. Chem. Soc. 1957;79:1002–1003. doi: 10.1021/ja01561a065. [DOI] [Google Scholar]
  • 8.Beaucage SL, Caruthers MH. Deoxynucleoside phosphoramidites—a new class of key intermediates for deoxypolynucleotide synthesis. Tetrahedr. Lett. 1981;22:1859–1862. doi: 10.1016/S0040-4039(01)90461-7. [DOI] [Google Scholar]
  • 9.Efcavitch JW, Heiner C. Depurination as a yield decreasing mechanism in oligodeoxynucleotide synthesis. Nucleosides Nucleotides Nucleic Acids. 1985;4:267. doi: 10.1080/07328318508077883. [DOI] [Google Scholar]
  • 10.LeProust EM, et al. Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process. Nucleic Acids Res. 2010;38:2522–2540. doi: 10.1093/nar/gkq163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Septak M. Kinetic studies on depurination and detritylation of CPG-bound intermediates during oligonucleotide synthesis. Nucleic Acids Res. 1996;24:3053–3058. doi: 10.1093/nar/24.15.3053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Binkowski BF, Richmond KE, Kaysen J, Sussman MR, Belshaw PJ. Correcting errors in synthetic DNA through consensus shuffling. Nucleic Acids Res. 2005;33:e55. doi: 10.1093/nar/gni053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Carr PA, et al. Protein-mediated error correction for de novo DNA synthesis. Nucleic Acids Res. 2004;32:e1622. doi: 10.1093/nar/gkh560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fodor SP, et al. Light-directed, spatially addressable parallel chemical synthesis. Science. 1991;251:767–773. doi: 10.1126/science.1990438. [DOI] [PubMed] [Google Scholar]
  • 15.Pease AC, et al. Light-generated oligonucleotide arrays for rapid DNA sequence analysis. Proc. Natl. Acad. Sci. USA. 1994;91:5022–5026. doi: 10.1073/pnas.91.11.5022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Singh-Gasson S, et al. Maskless fabrication of light-directed oligonucleotide microarrays using a digital micromirror array. Nat. Biotechnol. 1999;17:974–978. doi: 10.1038/13664. [DOI] [PubMed] [Google Scholar]
  • 17.Gao X, et al. A flexible light-directed DNA chip synthesis gated by deprotection using solution photogenerated acids. Nucleic Acids Res. 2001;29:4744. doi: 10.1093/nar/29.22.4744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Blanchard AP, Kaiser RJ, Hood LE. High-density oligonucleotide arrays. Biosens. Bioelectron. 1996;11:687–690. doi: 10.1016/0956-5663(96)83302-1. [DOI] [Google Scholar]
  • 19.Hughes TR, et al. Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer. Nat. Biotechnol. 2001;19:342–347. doi: 10.1038/86730. [DOI] [PubMed] [Google Scholar]
  • 20.Saaem I, Ma KS, Marchi AN, LaBean TH, Tian J. In situ synthesis of DNA microarray on functionalized cyclic olefin copolymer substrate. ACS Appl. Mater. Interfaces. 2010;2:491–497. doi: 10.1021/am900884b. [DOI] [PubMed] [Google Scholar]
  • 21.Ghindilis AL, et al. CombiMatrix oligonucleotide arrays: genotyping and gene expression assays employing electrochemical detection. Biosens. Bioelectron. 2007;22:1853–1860. doi: 10.1016/j.bios.2006.06.024. [DOI] [PubMed] [Google Scholar]
  • 22.Tang, N., Ma, S. & Tian, J. in Synthetic Biology (ed. Zhao, H.) Ch. 1, 3–21 (Academic Press, 2013).
  • 23.Agarwal KL, et al. Total synthesis of the gene for an alanine transfer ribonucleic acid from yeast. Nature. 1970;227:27–34. doi: 10.1038/227027a0. [DOI] [PubMed] [Google Scholar]
  • 24.Sekiya T, et al. Total synthesis of a tyrosine suppressor transfer RNA gene. XVI. Enzymatic joinings to form the total 207-base pair-long DNA. J. Biol. Chem. 1979;254:5787–5801. [PubMed] [Google Scholar]
  • 25.Au LC, Yang FY, Yang WJ, Lo SH, Kao CF. Gene synthesis by a LCR-based approach: high-level production of leptin-L54 using synthetic gene in Escherichia coli. Biochem. Biophys. Res. Commun. 1998;248:200–203. doi: 10.1006/bbrc.1998.8929. [DOI] [PubMed] [Google Scholar]
  • 26.Bang D, Church GM. Gene synthesis by circular assembly amplification. Nat. Methods. 2008;5:37–39. doi: 10.1038/nmeth1136. [DOI] [PubMed] [Google Scholar]
  • 27.Stemmer WP, Crameri A, Ha KD, Brennan TM, Heyneker HL. Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides. Gene. 1995;164:49–53. doi: 10.1016/0378-1119(95)00511-4. [DOI] [PubMed] [Google Scholar]
  • 28.Gibson DG. Synthesis of DNA fragments in yeast by one-step assembly of overlapping oligonucleotides. Nucleic Acids Res. 2009;37:6984–6990. doi: 10.1093/nar/gkp687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gibson DG. Oligonucleotide assembly in yeast to produce synthetic DNA fragments. Methods Mol. Biol. 2012;852:11–21. doi: 10.1007/978-1-61779-564-0_2. [DOI] [PubMed] [Google Scholar]
  • 30.Dormitzer PR, et al. Synthetic generation of influenza vaccine viruses for rapid response to pandemics. Sci. Transl. Med. 2013;5:185ra168. doi: 10.1126/scitranslmed.3006368. [DOI] [PubMed] [Google Scholar]
  • 31.Gibson DG, Smith HO, Hutchison CA, III, Venter JC, Merryman C. Chemical synthesis of the mouse mitochondrial genome. Nat. Methods. 2010;7:901–903. doi: 10.1038/nmeth.1515. [DOI] [PubMed] [Google Scholar]
  • 32.Carr PA, Church GM. Genome engineering. Nat. Biotechnol. 2009;27:1151–1162. doi: 10.1038/nbt.1590. [DOI] [PubMed] [Google Scholar]
  • 33.Czar MJ, Anderson JC, Bader JS, Peccoud J. Gene synthesis demystified. Trends Biotechnol. 2009;27:63–72. doi: 10.1016/j.tibtech.2008.10.007. [DOI] [PubMed] [Google Scholar]
  • 34.Xiong AS, et al. Chemical gene synthesis: strategies, softwares, error corrections, and applications. FEMS Microbiol. Rev. 2008;32:522–540. doi: 10.1111/j.1574-6976.2008.00109.x. [DOI] [PubMed] [Google Scholar]
  • 35.Tian J, et al. Accurate multiplex gene synthesis from programmable DNA microchips. Nature. 2004;432:1050–1054. doi: 10.1038/nature03151. [DOI] [PubMed] [Google Scholar]
  • 36.Zhou X, et al. Microfluidic PicoArray synthesis of oligodeoxynucleotides and simultaneous assembling of multiple DNA sequences. Nucleic Acids Res. 2004;32:5409–5417. doi: 10.1093/nar/gkh879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Richmond KE, et al. Amplification and assembly of chip-eluted DNA (AACED): a method for high-throughput gene synthesis. Nucleic Acids Res. 2004;32:5011–5018. doi: 10.1093/nar/gkh793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Borovkov AY, et al. High-quality gene assembly directly from unpurified mixtures of microarray-synthesized oligonucleotides. Nucleic Acids Res. 2010;38:e180. doi: 10.1093/nar/gkq677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kosuri S, et al. Scalable gene synthesis by selective amplification of DNA pools from high-fidelity microchips. Nat. Biotechnol. 2010;28:1295–1299. doi: 10.1038/nbt.1716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Quan J, et al. Parallel on-chip gene synthesis and application to optimization of protein expression. Nat. Biotechnol. 2011;29:449–452. doi: 10.1038/nbt.1847. [DOI] [PubMed] [Google Scholar]
  • 41.Schwartz JJ, Lee C, Shendure J. Accurate gene synthesis with tag-directed retrieval of sequence-verified DNA molecules. Nat. Methods. 2012;9:913–915. doi: 10.1038/nmeth.2137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kim H, et al. 'Shotgun DNA synthesis' for the high-throughput construction of large DNA molecules. Nucleic Acids Res. 2012;40:e140. doi: 10.1093/nar/gks546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kim H, Han H, Shin D, Bang D. A fluorescence selection method for accurate large-gene synthesis. ChemBioChem. 2010;11:2448–2452. doi: 10.1002/cbic.201000368. [DOI] [PubMed] [Google Scholar]
  • 44.Allert M, Cox JC, Hellinga HW. Multifactorial determinants of protein expression in prokaryotic open reading frames. J. Mol. Biol. 2010;402:905–918. doi: 10.1016/j.jmb.2010.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Smith J, Modrich P. Removal of polymerase-produced mutant sequences from PCR products. Proc. Natl. Acad. Sci. USA. 1997;94:6847–6850. doi: 10.1073/pnas.94.13.6847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Young L, Dong Q. Two-step total gene synthesis method. Nucleic Acids Res. 2004;32:e59. doi: 10.1093/nar/gnh058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Fuhrmann M, Oertel W, Berthold P, Hegemann P. Removal of mismatched bases from synthetic genes by enzymatic mismatch cleavage. Nucleic Acids Res. 2005;33:e58. doi: 10.1093/nar/gni058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Lajoie MJ, et al. Probing the limits of genetic recoding in essential genes. Science. 2013;342:361–363. doi: 10.1126/science.1241460. [DOI] [PubMed] [Google Scholar]
  • 49.Matzas M, et al. High-fidelity gene synthesis by retrieval of sequence-verified DNA identified using high-throughput pyrosequencing. Nat. Biotechnol. 2010;28:1291–1294. doi: 10.1038/nbt.1710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gibson DG. Programming biological operating systems: genome design, assembly and activation. Nat. Methods. 2014;11:521–526. doi: 10.1038/nmeth.2894. [DOI] [PubMed] [Google Scholar]
  • 51.de Kok S, et al. Rapid and reliable DNA assembly via ligase cycling reaction. ACS Synth. Biol. 2014;3:97–106. doi: 10.1021/sb4001992. [DOI] [PubMed] [Google Scholar]
  • 52.Gibson DG, et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods. 2009;6:343–345. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
  • 53.Zhang Y, Werling U, Edelmann W. SLiCE: a novel bacterial cell extract-based DNA cloning method. Nucleic Acids Res. 2012;40:e55. doi: 10.1093/nar/gkr1288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Gibson DG, et al. One-step assembly in yeast of 25 overlapping DNA fragments to form a complete synthetic Mycoplasma genitalium genome. Proc. Natl. Acad. Sci. USA. 2008;105:20404–20409. doi: 10.1073/pnas.0811011106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Muller H, et al. Assembling large DNA segments in yeast. Methods Mol. Biol. 2012;852:133–150. doi: 10.1007/978-1-61779-564-0_11. [DOI] [PubMed] [Google Scholar]
  • 56.Quan J, Tian J. Circular polymerase extension cloning of complex gene libraries and pathways. PLoS ONE. 2009;4:e6441. doi: 10.1371/journal.pone.0006441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Li MZ, Elledge SJ. Harnessing homologous recombination in vitro to generate recombinant DNA via SLIC. Nat. Methods. 2007;4:251–256. doi: 10.1038/nmeth1010. [DOI] [PubMed] [Google Scholar]
  • 58.Weber E, Engler C, Gruetzner R, Werner S, Marillonnet S. A modular cloning system for standardized assembly of multigene constructs. PLoS ONE. 2011;6:e16765. doi: 10.1371/journal.pone.0016765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Cleary MA, et al. Production of complex nucleic acid libraries using highly parallel in situ oligonucleotide synthesis. Nat. Methods. 2004;1:241–248. doi: 10.1038/nmeth724. [DOI] [PubMed] [Google Scholar]
  • 60.Tewhey R, et al. Enrichment of sequencing targets from the human genome by solution hybridization. Genome Biol. 2009;10:R116. doi: 10.1186/gb-2009-10-10-r116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Porreca GJ, et al. Multiplex amplification of large sets of human exons. Nat. Methods. 2007;4:931–936. doi: 10.1038/nmeth1110. [DOI] [PubMed] [Google Scholar]
  • 62.Gnirke A, et al. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat. Biotechnol. 2009;27:182–189. doi: 10.1038/nbt.1523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Depledge DP, et al. Specific capture and whole-genome sequencing of viruses from clinical samples. PLoS ONE. 2011;6:e27805. doi: 10.1371/journal.pone.0027805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Geniez S, et al. Targeted genome enrichment for efficient purification of endosymbiont DNA from host DNA. Symbiosis. 2012;58:201–207. doi: 10.1007/s13199-012-0215-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Li JB, et al. Multiplex padlock targeted sequencing reveals human hypermutable CpG variations. Genome Res. 2009;19:1606–1615. doi: 10.1101/gr.092213.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Deng J, et al. Targeted bisulfite sequencing reveals changes in DNA methylation associated with nuclear reprogramming. Nat. Biotechnol. 2009;27:353–360. doi: 10.1038/nbt.1530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Li JB, et al. Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing. Science. 2009;324:1210–1213. doi: 10.1126/science.1170995. [DOI] [PubMed] [Google Scholar]
  • 68.Zhang K, et al. Digital RNA allelotyping reveals tissue-specific and allele-specific gene expression in human. Nat. Methods. 2009;6:613–618. doi: 10.1038/nmeth.1357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Larman HB, et al. Autoantigen discovery with a synthetic human peptidome. Nat. Biotechnol. 2011;29:535–541. doi: 10.1038/nbt.1856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Larman HB, et al. PhIP-Seq characterization of autoantibodies from patients with multiple sclerosis, type 1 diabetes and rheumatoid arthritis. J. Autoimmun. 2013;43:1–9. doi: 10.1016/j.jaut.2013.01.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Larman HB, Xu GJ, Pavlova NN, Elledge SJ. Construction of a rationally designed antibody platform for sequencing-assisted selection. Proc. Natl. Acad. Sci. USA. 2012;109:18523–18528. doi: 10.1073/pnas.1215549109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Ivarsson Y, et al. Large-scale interaction profiling of PDZ domains through proteomic peptide-phage display using human and viral phage peptidomes. Proc. Natl. Acad. Sci. USA. 2014;111:2542–2547. doi: 10.1073/pnas.1312296111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Warner JR, Reeder PJ, Karimpour-Fard A, Woodruff LB, Gill RT. Rapid profiling of a microbial genome using mixtures of barcoded oligonucleotides. Nat. Biotechnol. 2010;28:856–862. doi: 10.1038/nbt.1653. [DOI] [PubMed] [Google Scholar]
  • 74.Shalem O, et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science. 2014;343:84–87. doi: 10.1126/science.1247005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Wang T, Wei JJ, Sabatini DM, Lander ES. Genetic screens in human cells using the CRISPR-Cas9 system. Science. 2014;343:80–84. doi: 10.1126/science.1246981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Patwardhan RP, et al. High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis. Nat. Biotechnol. 2009;27:1173–1175. doi: 10.1038/nbt.1589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Schlabach MR, Hu JK, Li M, Elledge SJ. Synthetic design of strong promoters. Proc. Natl. Acad. Sci. USA. 2010;107:2538–2543. doi: 10.1073/pnas.0914803107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Goodman DB, Church GM, Kosuri S. Causes and effects of N-terminal codon bias in bacterial genes. Science. 2013;342:475–479. doi: 10.1126/science.1241934. [DOI] [PubMed] [Google Scholar]
  • 79.Kosuri S, et al. Composability of regulatory sequences controlling transcription and translation in Escherichia coli. Proc. Natl. Acad. Sci. USA. 2013;110:14024–14029. doi: 10.1073/pnas.1301301110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Melnikov A, et al. Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay. Nat. Biotechnol. 2012;30:271–277. doi: 10.1038/nbt.2137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Kheradpour P, et al. Systematic dissection of regulatory motifs in 2000 predicted human enhancers using a massively parallel reporter assay. Genome Res. 2013;23:800–811. doi: 10.1101/gr.144899.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Kwasnieski JC, Mogno I, Myers CA, Corbo JC, Cohen BA. Complex effects of nucleotide variants in a mammalian cis-regulatory element. Proc. Natl. Acad. Sci. USA. 2012;109:19498–19503. doi: 10.1073/pnas.1210678109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.White MA, Myers CA, Corbo JC, Cohen BA. Massively parallel in vivo enhancer assay reveals that highly local features determine the cis-regulatory function of ChIP-seq peaks. Proc. Natl. Acad. Sci. USA. 2013;110:11952–11957. doi: 10.1073/pnas.1307449110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Mogno I, Kwasnieski JC, Cohen BA. Massively parallel synthetic promoter assays reveal the in vivo effects of binding site variants. Genome Res. 2013;23:1908–1915. doi: 10.1101/gr.157891.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Kaplan N, et al. The DNA-encoded nucleosome organization of a eukaryotic genome. Nature. 2009;458:362–366. doi: 10.1038/nature07667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Sharon E, et al. Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters. Nat. Biotechnol. 2012;30:521–530. doi: 10.1038/nbt.2205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Smith RP, et al. Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model. Nat. Genet. 2013;45:1021–1028. doi: 10.1038/ng.2713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Bayer TS, et al. Synthesis of methyl halides from biomass using engineered microbes. J. Am. Chem. Soc. 2009;131:6508–6515. doi: 10.1021/ja809461u. [DOI] [PubMed] [Google Scholar]
  • 89.Kudla G, Murray AW, Tollervey D, Plotkin JB. Coding-sequence determinants of gene expression in Escherichia coli. Science. 2009;324:255–258. doi: 10.1126/science.1170160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Araya CL, Fowler DM. Deep mutational scanning: assessing protein function on a massive scale. Trends Biotechnol. 2011;29:435–442. doi: 10.1016/j.tibtech.2011.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Melamed D, Young DL, Gamble CE, Miller CR, Fields S. Deep mutational scanning of an RRM domain of the Saccharomyces cerevisiae poly(A)-binding protein. RNA. 2013;19:1537–1551. doi: 10.1261/rna.040709.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Fowler DM, et al. High-resolution mapping of protein sequence-function relationships. Nat. Methods. 2010;7:741–746. doi: 10.1038/nmeth.1492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Kim I, Miller CR, Young DL, Fields S. High-throughput analysis of in vivo protein stability. Mol. Cell. Proteomics. 2013;12:3370–3378. doi: 10.1074/mcp.O113.031708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.McLaughlin RN, Jr., Poelwijk FJ, Raman A, Gosal WS, Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012;491:138–142. doi: 10.1038/nature11500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Reynolds KA, McLaughlin RN, Ranganathan R. Hot spots for allosteric regulation on protein surfaces. Cell. 2011;147:1564–1575. doi: 10.1016/j.cell.2011.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Chan LY, Kosuri S, Endy D. Refactoring bacteriophage T7. Mol. Syst. Biol. 2005;1:2005.0018. doi: 10.1038/msb4100025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Jaschke PR, Lieberman EK, Rodriguez J, Sierra A, Endy D. A fully decompressed synthetic bacteriophage oX174 genome assembled and archived in yeast. Virology. 2012;434:278–284. doi: 10.1016/j.virol.2012.09.020. [DOI] [PubMed] [Google Scholar]
  • 98.Ghosh D, Kohli AG, Moser F, Endy D, Belcher AM. Refactored M13 bacteriophage as a platform for tumor cell imaging and drug delivery. ACS Synth. Biol. 2012;1:576–582. doi: 10.1021/sb300052u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Temme K, Zhao D, Voigt CA. Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca. Proc. Natl. Acad. Sci. USA. 2012;109:7085–7090. doi: 10.1073/pnas.1120788109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Brophy JAN, Voigt CA. Principles of genetic circuit design. Nat. Methods. 2014;11:508–520. doi: 10.1038/nmeth.2926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Cambray G, et al. Measurement and modeling of intrinsic transcription terminators. Nucleic Acids Res. 2013;41:5139–5148. doi: 10.1093/nar/gkt163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Chen YJ, et al. Characterization of 582 natural and synthetic terminators and quantification of their design constraints. Nat. Methods. 2013;10:659–664. doi: 10.1038/nmeth.2515. [DOI] [PubMed] [Google Scholar]
  • 103.Mutalik VK, et al. Precise and reliable gene expression via standard transcription and translation initiation elements. Nat. Methods. 2013;10:354–360. doi: 10.1038/nmeth.2404. [DOI] [PubMed] [Google Scholar]
  • 104.Mutalik VK, et al. Quantitative estimation of activity and quality for collections of functional genetic elements. Nat. Methods. 2013;10:347–353. doi: 10.1038/nmeth.2403. [DOI] [PubMed] [Google Scholar]
  • 105.Stanton BC, et al. Genomic mining of prokaryotic repressors for orthogonal logic gates. Nat. Chem. Biol. 2014;10:99–105. doi: 10.1038/nchembio.1411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Rhodius VA, et al. Design of orthogonal genetic switches based on a crosstalk map of σs, anti-σs, and promoters. Mol. Syst. Biol. 2013;9:702. doi: 10.1038/msb.2013.58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Cello J, Paul AV, Wimmer E. Chemical synthesis of poliovirus cDNA: generation of infectious virus in the absence of natural template. Science. 2002;297:1016–1018. doi: 10.1126/science.1072266. [DOI] [PubMed] [Google Scholar]
  • 108.Tumpey TM, et al. Characterization of the reconstructed 1918 Spanish influenza pandemic virus. Science. 2005;310:77–80. doi: 10.1126/science.1119392. [DOI] [PubMed] [Google Scholar]
  • 109.Becker MM, et al. Synthetic recombinant bat SARS-like coronavirus is infectious in cultured cells and in mice. Proc. Natl. Acad. Sci. USA. 2008;105:19944–19949. doi: 10.1073/pnas.0808116105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Smith HO, Hutchison CA, III, Pfannkoch C, Venter JC. Generating a synthetic genome by whole genome assembly: phiX174 bacteriophage from synthetic oligonucleotides. Proc. Natl. Acad. Sci. USA. 2003;100:15440–15445. doi: 10.1073/pnas.2237126100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Takehisa J, et al. Generation of infectious molecular clones of simian immunodeficiency virus from fecal consensus sequences of wild chimpanzees. J. Virol. 2007;81:7463–7475. doi: 10.1128/JVI.00551-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Burns CC, et al. Genetic inactivation of poliovirus infectivity by increasing the frequencies of CpG and UpA dinucleotides within and across synonymous capsid region codons. J. Virol. 2009;83:9957–9969. doi: 10.1128/JVI.00508-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Dewannieux M, et al. Identification of an infectious progenitor for the multiple-copy HERV-K human endogenous retroelements. Genome Res. 2006;16:1548–1556. doi: 10.1101/gr.5565706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Orlinger KK, et al. An inactivated West Nile virus vaccine derived from a chemically synthesized cDNA system. Vaccine. 2010;28:3318–3324. doi: 10.1016/j.vaccine.2010.02.092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Mueller S, et al. Live attenuated influenza virus vaccines by computer-aided rational design. Nat. Biotechnol. 2010;28:723–726. doi: 10.1038/nbt.1636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Burns CC, et al. Modulation of poliovirus replicative fitness in HeLa cells by deoptimization of synonymous codon usage in the capsid region. J. Virol. 2006;80:3259–3272. doi: 10.1128/JVI.80.7.3259-3272.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Lee YN, Bieniasz PD. Reconstitution of an infectious human endogenous retrovirus. PLoS Pathog. 2007;3:e10. doi: 10.1371/journal.ppat.0030010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Mueller S, Papamichail D, Coleman JR, Skiena S, Wimmer E. Reduction of the rate of poliovirus protein synthesis through large-scale codon deoptimization causes attenuation of viral virulence by lowering specific infectivity. J. Virol. 2006;80:9687–9696. doi: 10.1128/JVI.00738-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Wimmer E, Paul AV. Synthetic poliovirus and other designer viruses: what have we learned from them? Annu. Rev. Microbiol. 2011;65:583–609. doi: 10.1146/annurev-micro-090110-102957. [DOI] [PubMed] [Google Scholar]
  • 120.Coleman JR, et al. Virus attenuation by genome-scale changes in codon pair bias. Science. 2008;320:1784–1787. doi: 10.1126/science.1155761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Liu Y, et al. Whole-genome synthesis and characterization of viable S13-like bacteriophages. PLoS ONE. 2012;7:e41124. doi: 10.1371/journal.pone.0041124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Dymond JS, et al. Synthetic chromosome arms function in yeast and generate phenotypic diversity by design. Nature. 2011;477:471–476. doi: 10.1038/nature10403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Rothemund PW. Folding DNA to create nanoscale shapes and patterns. Nature. 2006;440:297–302. doi: 10.1038/nature04586. [DOI] [PubMed] [Google Scholar]
  • 124.Dietz H, Douglas SM, Shih WM. Folding DNA into twisted and curved nanoscale shapes. Science. 2009;325:725–730. doi: 10.1126/science.1174251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Douglas SM, et al. Self-assembly of DNA into nanoscale three-dimensional shapes. Nature. 2009;459:414–418. doi: 10.1038/nature08016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Ke Y, Ong LL, Shih WM, Yin P. Three-dimensional structures self-assembled from DNA bricks. Science. 2012;338:1177–1183. doi: 10.1126/science.1227268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Maune HT, et al. Self-assembly of carbon nanotubes into two-dimensional geometries using DNA origami templates. Nat. Nanotechnol. 2010;5:61–66. doi: 10.1038/nnano.2009.311. [DOI] [PubMed] [Google Scholar]
  • 128.Douglas SM, Bachelet I, Church GM. A logic-gated nanorobot for targeted transport of molecular payloads. Science. 2012;335:831–834. doi: 10.1126/science.1214081. [DOI] [PubMed] [Google Scholar]
  • 129.Venkataraman S, Dirks RM, Rothemund PW, Winfree E, Pierce NA. An autonomous polymerization motor powered by DNA hybridization. Nat. Nanotechnol. 2007;2:490–494. doi: 10.1038/nnano.2007.225. [DOI] [PubMed] [Google Scholar]
  • 130.Soloveichik D, Seelig G, Winfree E. DNA as a universal substrate for chemical kinetics. Proc. Natl. Acad. Sci. USA. 2010;107:5393–5398. doi: 10.1073/pnas.0909380107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Qian L, Winfree E. Scaling up digital circuit computation with DNA strand displacement cascades. Science. 2011;332:1196–1201. doi: 10.1126/science.1200520. [DOI] [PubMed] [Google Scholar]
  • 132.Qian L, Winfree E, Bruck J. Neural network computation with DNA strand displacement cascades. Nature. 2011;475:368–372. doi: 10.1038/nature10262. [DOI] [PubMed] [Google Scholar]
  • 133.Church GM, Gao Y, Kosuri S. Next-generation digital information storage in DNA. Science. 2012;337:1628. doi: 10.1126/science.1226355. [DOI] [PubMed] [Google Scholar]
  • 134.Goldman N, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature. 2013;494:77–80. doi: 10.1038/nature11875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Marchi AN, Saaem I, Tian J, LaBean TH. One-pot assembly of a hetero-dimeric DNA origami from chip-derived staples and double-stranded scaffold. ACS Nano. 2013;7:903–910. doi: 10.1021/nn302322j. [DOI] [PubMed] [Google Scholar]
  • 136.Kosuri S, Sismour AM. When it rains, it pores. ACS Synth. Biol. 2012;1:109–110. doi: 10.1021/sb300015f. [DOI] [PubMed] [Google Scholar]
  • 137.Saaem I, Ma S, Quan J, Tian J. Error correction of microchip synthesized genes using Surveyor nuclease. Nucleic Acids Res. 2012;40:e23. doi: 10.1093/nar/gkr887. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Nature Methods are provided here courtesy of Nature Publishing Group

RESOURCES