Abstract
The ability to engineer biological systems, whether to introduce novel functionality or improved performance, is a cornerstone of biotechnology and synthetic biology. Typically, this requires the generation of genetic diversity to explore variations in phenotype, a process that can be performed at many levels, from single molecule targets (i.e., in directed evolution of enzymes) to whole organisms (e.g., in chassis engineering). Recent advances in DNA synthesis technology and automation have enhanced our ability to create variant libraries with greater control and throughput. This review highlights the latest developments in approaches to create such a hierarchy of diversity from the enzyme level to entire pathways in vitro, with a focus on the creation of combinatorial libraries that are required to navigate a target's vast design space successfully to uncover significant improvements in function.
Keywords: Mutagenesis, Pathway assembly, Genetic engineering, Directed evolution, Synthetic biology, Metabolic engineering
Abbreviations: PCR, Polymerase Chain Reaction; LCR, Ligase Cycling Reaction; SDM, site-directed mutagenesis; OE-PCR, overlap extension PCR
1. Introduction
Diversity is the foundation upon which the biological world is built, enabling Darwinian evolution through iterative cycles of genetic variation and selection. Through this process natural selection has produced countless biological macromolecules with different functions, including catalysis (e.g. enzymes), molecular recognition (e.g. antibodies), structural (e.g. cytoskeletal proteins) and signalling (e.g. hormones). This dramatic demonstration of the potential for harnessing genetic diversity to unearth specialised performance and novel functionality has inspired the powerful technique of directed evolution (DE), which is now commonly employed to engineer improvements in enzyme performance. Mimicking the process of natural evolution, DE performs iterative cycles of DNA mutagenesis and selection/screening on a specific genetic target in order to isolate new enzyme variants with improved function (Dougherty and Arnold, 2009; Tracewell and Arnold, 2009).
DE has become a vital tool in the development of biocatalysts for industrial processes, engineering improvements such as activity towards non-native substrates, increased turnover numbers, kcat, and improved solvent stability (Cheng et al., 2015; Cherry and Fidantsef, 2003; Dougherty and Arnold, 2009). But, clearly, natural selection operates on a wide hierarchy of different levels, from individual genes to entire organisms. Thus, whilst DE typically targets a single enzyme-coding sequence, its concepts should be equally applicable at higher levels of hierarchy, for example when optimising the overall function of an integrated biological system, such as a biosynthetic pathway within an heterologous host. Fortunately, our ability to engineer larger genetic constructs containing multiple components has advanced dramatically in the past decade through synthetic biology, bringing such an ambitious expansion of the scope of DE within reach of the experimentalist.
Synthetic biology can have diverse and creative applications, ranging from biosynthesis of natural and non-natural products, to biofuels and engineered organisms for healthcare (Breitling and Takano, 2015; Khalil and Collins, 2010; Wu et al., 2019a). Progress has been driven by the increasing throughput and decreasing cost of DNA synthesis, making synthetic DNA available to every laboratory. Coupled with widespread adoption of automation (Chao et al., 2017; Hillson et al., 2019) this allows for the assembly and engineering of whole metabolic pathways (Ellis et al., 2011), and has enabled the emergence of synthetic genomics, where organisms with entirely synthetically-derived genomes are constructed (Foo and Chang, 2018; Gibson et al., 2008; Gibson et al., 2010; Pretorius and Boeke, 2018). The high-throughput capabilities afforded by automation enable many diverse pathway and genomic sequences to be created, which can then be tested to identify an optimally performing chassis by utilising a design–build–test–learn framework (Carbonell et al., 2018). This is required because, as with traditional DE, the component parts may not function efficiently within their new chassis, and engineering these pathways could improve overall performance (Carbonell et al., 2016; Chi et al., 2019; Decoene et al., 2018). However, other additional factors may hinder performance, such as interference with host metabolism, supply of substrates in vivo and dynamic flux control (David et al., 2016; Gerosa and Sauer, 2011; Solomon et al., 2012; Wu et al., 2015). Consequently, optimisation of these chassis also requires genome editing and engineering (for instance, gene knock-outs and knock-ins, gene overexpression, and relaxation of feedback inhibition) to alter fluxes and improve overall performance (e.g., rate, yield and titre of the target compound production). Genetic diversity can be achieved through genome engineering (employing techniques like CRISPR, recombineering, zinc-finger nucleases) in vivo and are often organism-specific. These approaches have recently been reviewed elsewhere (Naseri and Koffas, 2020; Simon et al., 2019), and are not discussed in this review. Here we focus on recent developments in the generation of genetic diversity at multiple levels in vitro, typically introducing designed alterations via synthetically-derived DNA, which are widely used in biotechnology and synthetic biology.
For any bio-engineering project the challenge facing the experimenter is the same: the total number of coding sequences and their combinations that could be created is many orders of magnitude larger than what it is possible to create and test in the laboratory. This concept is described as sequence space (Currin et al., 2015; Hayashi et al., 2006; Povolotskaya and Kondrashov, 2010; Wong et al., 2007), which describes the total number of possible variants of a given sequence of length n. Sequence space is vast and experimentally untestable for even very small protein sequences, for example the sequence space of a small 100 amino acid protein is ~1.3 × 10130 (20100, given the 20 possible amino acids), a number larger than the number of atoms in the observable universe (Kondrashov and Kondrashov, 2015). The combinatorial explosion further accelerates when multiple proteins functionally interact in a complex biological system, and a variety of regulatory elements add a further level of complexity to the challenge. Inevitably, this results in the need to reduce the breadth of the experiments to limit the search to a subset of the total diversity that can be effectively screened in the laboratory (Romero and Arnold, 2009; Romero et al., 2013). Unfortunately, despite the wealth of sequences, characterisation data and computational tools available, the behaviour of biological sequences remains largely unpredictable, with the exact nature of beneficial mutations still needing to be empirically determined, often through screening as many variants as possible. The experimenter must therefore select where and how to mutate their target sequence (or sequences), which can be assisted by prior knowledge, structural and protein characterisation data, and computational tools. This challenge is made dramatically more complicated by the phenomenon of epistasis, which describes the interdependence of amino acids throughout the sequence (even between different sequences within a biological system). Consequently, the beneficial effect of one mutation can be dependent on another mutation elsewhere (Miton and Tokuriki, 2016; Sailer and Harms, 2018; Storz, 2018). This is supported by the frequent observations that single-point mutations are rarely sufficient to elicit significant changes in performance (e.g. activity or productivity) (Reetz et al., 2006), and the control of pathway flux is typically shared widely among the enzymes in a reaction cascade. This review highlights the considerable recent progress in approaches for the creation of combinatorial diversity in DNA sequences. In particular, we discuss the developments in creating DNA diversity in vitro using specifically controlled (targeted rather than random) techniques. Whilst not specifically reviewed here, this area has significant overlap with the topics of DNA synthesis (oligonucleotide and gene synthesis) and assembly (cloning and synthetic genomics). In this review we elucidate that genetic diversity creation in vitro is a vital strategy to engineer biological macromolecules, with diverse applications in biotechnology and synthetic biology.
2. Creation of random genetic diversity
Early pioneering work on DE by Frances Arnold and co-workers utilised random mutagenesis methods to create diversity, prior to screening for improved enzyme function (Arnold, 1993; Chen and Arnold, 1991; Chen and Arnold, 1993). Due to its simplicity of use, by far the most popular random approach is error-prone polymerase chain reaction (epPCR), whereby mutations are introduced by a Taq polymerase during PCR amplification (McCullum et al., 2010; Wong et al., 2006). The method is generic (as no prior information on protein structure or function is needed) and optimised protocols are available that reduce mutation bias and provide tuneable control over mutation frequency (Biles and Connolly, 2004; Cadwell and Joyce, 1992; Copp et al., 2014) and general location (Yang et al., 2017). However, epPCR is limited in many regards: mutation of two consecutive bases is rare (meaning not all possible amino acids can be encoded at a given codon), it typically requires large screening efforts, stop codons can be encoded (3.2% of all possible single-base mutations within codons), and it cannot specifically mutate selected bases. Targeted mutagenesis methods overcome these limitations, providing highly accurate and controllable mutations of any desired sequence; these methods are the focus of this review.
Noteworthy alternative random diversity creation methods include recombination, exemplified by Stemmer's ‘DNA shuffling’ technique (where homologous sequences are fragmented and reassembled (Kikuchi et al., 1999; Stemmer, 1994a; Stemmer, 1994b; Zhao and Arnold, 1997)) and in vitro genome rearrangement using Synthetic Chromosome Rearrangement and Modification by LoxP-mediated Evolution (SCRaMbLE), mediated by a Cre recombinase and resulting in random recombination between loxP sites (Jia et al., 2018; Ma et al., 2019; Wu et al., 2018).
3. Targeted diversity creation
Variant sequence libraries encoding targeted (designed) mutations can be assembled through a number of different approaches in vitro, though universally these methods utilise synthetic variant oligonucleotides to determine the mutations through their incorporation into newly synthesised DNA. In the past decade, advances in oligonucleotide synthesis have dramatically changed the way that targeted mutations are created.
3.1. Variant library design and strategies for sequence diversification
Sequence libraries can be synthesised using a variety of methods to encode specifically designed variants. Given this control, attention must be given to the nature of mutations created and the number of variant sequences this creates. It is often desirable to create many mutations to test for a desired function, however simultaneous mutation at multiple sites within a protein can create a library of variant sequences which is impractically large for screening. For example, simultaneous mutation of just three amino acids (to any of the 20 amino acids) generates 8000 (203) protein variants, requiring significant high-throughput screening to experimentally validate. Mutation of further residues in the same way grows the library size exponentially, with just seven randomised residues creating a library size of 1.28 × 109 (207) variants, beyond even ultra-high throughput screening capabilities (Agresti et al., 2010; Autour and Ryckelynck, 2017). Careful design of combinatorial variant libraries is therefore crucial to rationally reduce the experimental screening burden (Qu et al., 2020) and powerful computational tools are now available for the selection of amino acids for mutation using phylogenetic and molecular simulation approaches (Goldenzweig et al., 2016; Khersonsky et al., 2018; Sumbalova et al., 2018; Weinstein et al., 2020).
As outlined in section 3.2, oligonucleotides from solid-phase synthesis can be designed to remove amino acid bias (over-representation of certain amino acids) in variant libraries. In addition, a number of approaches are also available to reduce codon redundancy (Pines et al., 2015). For example, the ‘22c-trick’ (Kille et al., 2013) reduces redundancy and bias by using three variant codons (NDT, VHG, TGG), incorporated into three separate oligonucleotides, together encoding all 20 amino acids with 22 codons and no stop codons (an improvement compared to 32 codons for NNK randomisation). However, one limitation for this approach is that there can be no overlap between mutagenic oligonucleotides for each target codon, as randomisation of two codons using the same primer sequence would require synthesis of 32 = 9 different oligonucleotides. Mutagenesis of consecutive residues without redundancy or bias has been achieved using the Sloning method (Van den Brulle et al., 2008) and its successor ProxiMAX (Ashraf et al., 2013), employing an alternative ligation and restriction digest-based approach to library synthesis.
Various in silico tools have been created to reduce redundancy (or codon compression) and also to assist in the design of bespoke degenerate codons that encode only those amino acids required, including CodonGenie (Swainston et al., 2017), ANT (Engqvist and Nielsen, 2015), SwiftLib (Jacobs et al., 2015), DYNAMCC (Halweg-Edwards et al., 2016) and DC- or MDC-Analyzer (Tang et al., 2012; Wang et al., 2015). A generic ‘smart’ reduced codon (NDT, whereby N = A, T, G or C; D = A, G or T) was developed by Reetz et al. (Qu et al., 2020; Reetz et al., 2008), encoding 12 possible codons and amino acids (F, L, I, V, S, T, H, N, D, C, R, G) covering a range of physicochemical properties. These approaches seek to reduce library size through the rational design of selected variant sequences, easing the burden of screening duplicate or unwanted variants. The notion of ‘smart’ libraries has also been addressed by the GeneORator method (Currin et al., 2019), an approach based on Boolean logic that mutates multiple codons in different combinations (so called ‘OR-type’ mutations), in order to create rationally reduced combinatorial libraries. (See Table 1.)
Table 1.
Category | Method | Features | Reference |
---|---|---|---|
Protein variant design | PROSS, FuncLib, HotSpot Wizard | Automated algorithms for the design of protein variants. | (Goldenzweig et al., 2016; Khersonsky et al., 2018; Weinstein et al., 2020) |
Variant codon design | CodonGenie, ANT, SwiftLib, DYNAMCC, DC- and MDC-Analyzer | In silico tools for degenerate codon design and analysis. | (Engqvist and Nielsen, 2015; Halweg-Edwards et al., 2016; Jacobs et al., 2015; Swainston et al., 2017; Tang et al., 2012; Wang et al., 2015) |
22c-trick | Reduces redundancy bias by using three variant codons (NDT, VHG, TGG) for randomisation. | (Kille et al., 2013) | |
NDT codon | Generic ‘smart’ reduced codon encoding amino acids F, L, I, V, S, T, H, N, D, C, R and G. | (Qu et al., 2020; Reetz et al., 2008) | |
Library design | GeneORator | Use of Boolean logic to create large combinatorial libraries with rationally reduced complexity. | (Currin et al., 2019) |
RBS library design | RedLibs | A “reduced libraries” approach for the design of RBS libraries for pathway engineering. | (Jeschek et al., 2016) |
Cloning design | Potapov et al. | Golden gate four base pair overhang design tool. | (Potapov et al., 2018) |
3.2. DNA oligonucleotide synthesis
In the past decade advances in DNA oligonucleotide synthesis technology have altered how genetic diversity can be created. Oligonucleotides are short (typically <100 nt in length) sequences of single stranded DNA, synthetically derived using phosphoramidite chemistry (Caruthers et al., 1987; Itakura et al., 1984). Oligonucleotides are employed both as primers for PCR and as overlapping building blocks for assembly of larger synthetic DNA sequences by gene synthesis (Czar et al., 2009; Hughes et al., 2011; Ma et al., 2012), and can be designed to be the source of controlled variant sequences for diversity creation. Established methods for oligonucleotide synthesis utilise controlled pore glass (CPG) supports from which the synthesised strand is elongated (Fig. 1A). Variant library oligonucleotides are created by incubating mixtures of phosphoramidite nucleosides (e.g. an equal mixture of all four nucleosides) for each chosen variant position.
Technological advances to miniaturise phosphoramidite chemistry using solid-phase supported synthesis (LeProust et al., 2001; LeProust et al., 2010; Tian et al., 2004) utilise inkjet ‘printing’ techniques to deliver reagents for polymerisation on a miniscule scale. This technology reduces synthesis volumes by >106 (to sub-femtomole scale (Li et al., 2019)) and thus permits thousands of individual oligonucleotides sequences to be synthesised in parallel at low cost. Consequently, each individual variant sequence of a library can be specifically synthesised in parallel, which when cleaved from the solid support creates a pool of oligonucleotides containing only the desired variant sequences (Fig. 1B). This provides tuneable control over the variant nucleotides at each position within the library and their relative frequency, which has been exploited to reduce codon redundancy (use multiple codons for the same amino acid) and bias (Acevedo-Rocha et al., 2015; Cleary et al., 2004; Hoebenreich et al., 2014; Li et al., 2018; Sayous et al., 2020). For example, randomisation of a codon using a single oligonucleotide is typically done by choosing the NNK codon (N = A, T, G or C; K = G or T), which encodes all 20 amino acids using 32 codons, but with unnecessary redundancy for 8 amino acids (L, V, P, A, G, S, T and R) and a stop codon (Nov, 2012). Furthermore, amino acid bias exists for the NNK codon, with three codons each for R, L and S, and just one codon each for twelve other amino acids. This bias becomes more pronounced as the number of randomised codons increases, leading to dramatic under-representation of some variants in the library (consequently increasing the experimental screening burden and the likelihood that beneficial mutations will be missed). Solid-phase synthesis can synthesise just the desired 20 codons for full randomisation, and several studies have demonstrated the value of reducing bias to improve library quality for directed evolution (Li et al., 2018; Sayous et al., 2020).
Whilst solid-phase oligonucleotide synthesis offers several advantages, some limitations are notable. First, the sub-femtomole synthesis scale generates only limited material compared to the larger CPG scales, although amplification of these samples has been reported (Schmidt et al., 2015) this remains a limitation for some applications. Second, the costs for oligo library pools are substantially higher (roughly 10-fold) than those for pools generated using CPG synthesis (these are currently only available through Twist Bioscience).
3.3. Creating sequence diversity for directed evolution and synthetic biology
Since its inception by Smith and colleagues in 1978 (Hutchison et al., 1978), site-directed mutagenesis (SDM) has become an indispensable tool in biotechnology. The most commonly used mutagenesis methods utilise the polymerase chain reaction (PCR), amplifying template sequences with mutagenic oligonucleotide primers. Conventional methods of SDM, such as QuikChange (Mao et al., 2011; Xia et al., 2015), amplify the entire target sequence (and plasmid backbone) to introduce mutations in a single PCR step. This approach has variable efficiency and yields and, despite improvements to improve robustness (notably the commercial Q5 and Phusion mutagenesis kits), is limited by its inability to create mutations at more than one contiguous region. These approaches are therefore being superseded by more robust and efficient methods capable of creating combinatorial libraries.
As outlined above, approaches to creating genetic diversity have historically been focussed on the synthesis of variant libraries of gene sequences for directed evolution. However, the emergence of synthetic biology has expanded the challenge of engineering biology to a greater scale, to include whole biosynthetic gene pathways (Smanski et al., 2014; Smanski et al., 2016) and genomes (Eisenstein, 2020; Schindler et al., 2018). The genetic components handled in synthetic biology are preferably standardised as ‘parts’ and can include elements with regulatory functions (e.g. promoters, ribosome binding sites and terminators) as well as coding sequencing (e.g. genes). For DNA assembly in vitro, multi-part constructs several kilobases in size are often required, in order to create novel biosynthetic pathways to function within a desired host organism (chassis) (Casini et al., 2015; Ellis et al., 2011). Unfortunately, the a priori design of an optimal metabolic pathway for a particular chassis is currently an unrealistic aim as, alongside the challenges described for traditional DE, there are additional layers of complexity that can require optimisation. For novel pathways encoding multiple heterologous genes these include gene order, transcription and translation control, and construct copy number. Furthermore, imbalance in metabolic fluxes can cause growth inhibition, increased metabolic burden and production of toxic intermediates, leading to suboptimal yields of target compounds (Dueber et al., 2009; Jeschek et al., 2017; Wu et al., 2016b). Consequently, iterative variations on pathway designs should be tested to determine optimised constructs, with efforts focussed on the ability to quickly and efficiently create combinatorial variations of assembled parts.
A number of experimental approaches have been established to conduct the multifactorial optimisation of metabolic pathways for synthetic biology, including multivariate modular metabolic engineering (MMME (Biggs et al., 2014)) and the creation of small high-content libraries (e.g. using promoter libraries or ribosome binding site engineering), as reviewed recently (Jeschek et al., 2017). For example, the RedLibs algorithm was developed to design RBS libraries for pathway optimisation (Jeschek et al., 2016). Particularly noteworthy is the Design of Experiments (DoE) strategy, a generic approach to statistically assess the effect of individual factors in affecting overall performance in order to avoid more exhaustive sampling of sequence space. In a synthetic biology setting this has been successfully applied to optimise sequence and assembly order of coding and regulatory parts as well as general experimental conditions (Carbonell et al., 2018; Xu et al., 2017; Zhou et al., 2015). DoE therefore reduces the exploration space down to a number of assemblies that can realistically be assembled and screened in a high throughput laboratory. In this review we discuss the experimental methodologies that can be employed to conduct these combinatorial optimisation experiments.
Many methods are available for the assembly of large DNA constructs from modular parts. Given that multiple parts are often assembled simultaneously, order and directionality must be controlled, which is typically done using either overlapping homologous sequences (e.g. MODAL (Casini et al., 2013), BASIC (Storch et al., 2015), PaperClip (Trubitsyna et al., 2014; Trubitsyna et al., 2017), USER (Genee et al., 2015; Geu-Flores et al., 2007; Nour-Eldin et al., 2006), Gibson (Gibson et al., 2008; Torella et al., 2014), DNA assembler (Shao and Zhao, 2013; Shao et al., 2009; Yuan et al., 2016), TEDA (Xia et al., 2019) and DATEL (Jin et al., 2016a)), or single strand overhangs created by restriction endonuclease digestion (e.g. Golden Gate (Engler et al., 2008; Gao et al., 2013)). Whilst many of these methods are capable of assembling large DNA constructs with many parts, not all are efficient and flexible enough for creating diverse combinatorial variant pathways.
3.3.1. Restriction enzyme-based methods
For DE, routine cloning-based strategies have also been employed, using restriction enzymes followed by ligation to assemble different DNA fragments. These approaches provide a means to “mix and match” different gene fragments to create diversity (Fig. 2A). Encoding variants within these fragments provides a rapid means to test different combinatorial mutations; however, the limited ligation efficiency limits these approaches to relatively small (<104) libraries (Popova et al., 2015; Quaglia et al., 2017). (see Table 2, Table 3.)
Table 2.
Category | Method | Features | Efficacy | Efficiency | Reference |
---|---|---|---|---|---|
Restriction enzyme-based | Type IIS restriction enzyme (BsaI) modular assembly | Mix-and-match assembly of variant gene fragments via BsaI digestion and DNA ligation, similar to Golden Gate cloning. Limited to small library sizes. | 100% | n.a. | (Popova et al., 2015; Quaglia et al., 2017) |
VersaTile | Mix-and-match assembly of protein modules, based on Golden Gate assembly. | 95% | n.a. | (Gerstmans et al., 2020) | |
Ligation-based | Darwin assembly | Mutations encoded by oligonucleotides are incorporated by DNA polymerase and ligase activities. Method has high efficiency for multiple mutation sites. | 98–100% | 100% | (Cozens and Pinheiro, 2018) |
ProxiMax | Ligation of defined codons to build a library without codon bias. Requires trimer oligonucleotides, robust and controllable but becomes challenging for multiple mutation sites. | NR | 100% | (Ashraf et al., 2013) | |
PCR-based | QuikChange (and variations thereof) | Mutations introduced by a primer pair during inverse PCR. Limited to mutation of one position. | 84% | 55% | (Mao et al., 2011; Xia et al., 2015) |
Overlap extension PCR (OE-PCR) | A gene fragment is amplified by PCR (with primers adding both mutations and homologous termini), then the full-length sequence is assembled by overlap extension PCR. Method is robust and reliable, though cumbersome for multiple mutation sites. | >90% | 100% | (An et al., 2005; Bryksin and Matsumura, 2010; Cheng et al., 2017; Heckman and Pease, 2007; Hussain and Chong, 2016; Wäneskog and Bjerling, 2014; Wei et al., 2012; Williams et al., 2014; Xiao and Pei, 2011) | |
Asymmetric PCR | A single-stranded gene fragment is amplified by asymmetric PCR using mutagenic primers, which is then used as a megaprimer to introduce mutations into full sequence. Method is robust and reliable, though cumbersome for multiple mutation sites. | 91–100% | 100% | (Bi et al., 2012; Sadler et al., 2018) | |
SpeedyGenes | Gene synthesis method, encoding mutations using the overlapping oligonucleotide primers that assemble the gene library de novo. Many combinatorial mutations can be efficiently assembled, though efficiency drops for large genes. | 76–90% | 100% | (Currin et al., 2017, Currin et al., 2014) |
Table 3.
Category | Method | Features | Efficacy | Efficiency | Fidelity/ accuracy | Reference |
---|---|---|---|---|---|---|
Homologous overlap assembly | Gibson assembly | Assembly of fragments with >15 bp homologous overlaps in one-pot isothermal conditions with a T5 exonuclease, DNA polymerase and DNA ligase. Reliable assemblies though efficiency drops as more parts are assembled. Requires creation of 15 bp overlapping termini which increases sample preparation for combinatorial assemblies. | 95% | 87% | 80–95% | (Gibson, 2011, Gibson, 2009; Gibson et al., 2009, Gibson et al., 2008; Torella et al., 2014) |
Modular Overlap-Directed Assembly with Linkers (MODAL) | Similar to Gibson assembly, extended to include modular reusable parts for combinatorial assemblies. | NR | NR | 75–100% | (Casini et al., 2013) | |
PaperClip | Unmodified parts are ‘clipped’ by ligation of specific double stranded oligonucleotides that confer the homologous sequences required to direct subsequent pathway assembly. Not scarless but more flexible for combinatorial assemblies. | NR | 80% | NR | (Trubitsyna et al., 2017, Trubitsyna et al., 2014) | |
Uracil Excision Cloning (USER) | Parts amplified using primers (containing uracil) are then cleaved using uracil DNA glycosylase to create overhangs, which then anneal for construct assembly. Increased sample preparation, like for Gibson method. | NR | 94% | 70–95% | (Cavaleiro et al., 2015; Genee et al., 2015; Geu-Flores et al., 2007; Nour-Eldin et al., 2006, p.) | |
DNA assembler | Assembly of large multipart pathways in vivo using the homologous recombination capacity of Saccharomyces cerevisiae. Long homologous overlaps of parts are required for efficient assembly, increasing preparation and costs. | NR | NR | 71 (44 kb assembly from 50 fragments) – 100% (12 kb assembly from 6 fragments) | (Shao et al., 2009; Shao and Zhao, 2013; Yuan et al., 2016) | |
T5 Exonuclease-Dependent Assembly (TEDA) | Homologous overhangs are created by T5 exonuclease (as in Gibson assembly) allowing self-annealing, then repair and ligation occurs upon transformation into E. coli. Less reagents required compared to Gibson method, with similar efficiency. | NR | NR | >90% | (Xia et al., 2019) | |
DNA Assembly with Thermostable Exonuclease and Ligase (DATEL and sDATEL) | Prepared parts encode 30 bp homologous termini, following denaturation and annealing Taq polymerase removes displaced overhangs and Taq ligase joins the juxtaposed termini. Up to 10 parts can be assembled, though with sample preparation limitations like other homology-based methods. | NR | NR | 75% | (Ding et al., 2017; Jin et al., 2016a) | |
Restriction enzyme-based | Golden Gate assembly (also Scarless stitching) | One-pot assembly of multiple parts using BsaI digestion and DNA ligation. Directionality is controlled by the variable single-strand overhangs. Scarless and reliable but requiring increased sample preparation to repeatedly add different restriction sites/overhangs to parts for combinatorial assembly. | NR | NR | 95–100% | (Engler et al., 2008; Gao et al., 2013; HamediRad et al., 2019; Smanski et al., 2014) |
MoClo, CIDAR MoClo, EcoFlex | Extension of Golden Gate method using multiple type IIS endonucleases, utilised for a hierarchical and modular assembly protocol. EcoFlex extends MoClo to encode pathway variants using multiple pre-assembly levels. Requires more time to prepare preassembled modules prior to assembly. | NR | NR | 95–100% | (Iverson et al., 2016; Moore et al., 2016; Vecchione and Fritz, 2019; Weber et al., 2011; Werner et al., 2012) | |
Biopart Assembly Standard for Idempotent Cloning (BASIC) | Utilises type IIS (BsaI) restriction digestion of parts and then ligation to short oligonucleotide linkers to confer directionality of assembly, enabling more flexible and modular assemblies. Improved flexibility and sample preparation for combinatorial assemblies. | NR | 90% | 90% | (Storch et al., 2017, Storch et al., 2015) | |
MIDAS | Utilises three type IIS endonucleases in three steps: 1) BsmBI digestion and ligation forms source vectors, 2) BsaI digestion and ligation forms shuttle vectors, and 3) AarI and BsmBI digestion and ligation assembles the multigene constructs. Requires more time to prepare preassembled modules prior to assembly. | 96% | 90% | NR | (van Dolleweerd et al., 2018) | |
Start-Stop assembly | Scarless assembly utilising the 3 bp start and stop codons to create expression units, followed by combinatorial assembly using BsaI digestion and ligation. Multistep protocol is longer compared to “one-pot” methods. | 99% | NR | NR | (Taylor et al., 2019) | |
Coussement et al. | Golden Gate method extended to include assembly of variant sequences using single strand assembly. Features similar to Golden Gate. | NR | NR | NR | (Coussement et al., 2017) | |
Ligation-based methods | RECODE | Similar to Darwin assembly for pathway libraries. Mutagenic oligonucleotides are incorporated into a new ssDNA strand (using a thermostable DNA ligase and polymerase during thermocycling), then PCR with terminal anchor primers create the dsDNA library. Efficiently creates combinatorial mutations independent of template pathway size, though does not rearrange part orders. | 58% | 100% | 58% | (Jin et al., 2016b) |
Ligase Cycling Reaction (LCR) | Scarless assembly utilising a thermostable DNA ligase and bridging oligonucleotides, which direct assembly during thermocycling. Parts are modular as they do not require addition of homologous or restriction sites, permitting flexible combinatorial assemblies. Efficiency drops for larger multipart assemblies. | 100% | 80–100% | NR | (de Kok et al., 2014; Wiedmann et al., 1994; Yuan et al., 2016) |
To date, the most popular approach to combinatorial pathway assembly utilises type IIS restriction endonucleases (e.g. BsaI, BbsI). These enzymes digest DNA to create single-stranded ‘sticky ends’ outside of their restriction site, permitting customisable overhangs (which confer order and directionality during the assembly of multiple parts) that can then be joined using T4 DNA ligase, as exemplified by the Golden Gate method (Fig. 2B). This requires the preparation of parts flanked by customised restriction sequences, which can be either encoded by the plasmid or added by PCR. Given that endonuclease digestion creates overhangs outside of the recognition sequence, ligated constructs are ‘scarless’ (devoid of any cloning-specific sequences). Many elaborations on Golden Gate have been developed, improving robustness and flexibility, including Goldenbraid and Scarless stitching (Smanski et al., 2014).
Several studies have demonstrated highly parallel assembly of combinatorial constructs using the Golden Gate approach, often through the standardisation of parts and linker sequences, notably MoClo (Vecchione and Fritz, 2019; Weber et al., 2011; Werner et al., 2012), CIDAR MoClo (Iverson et al., 2016), MIDAS (van Dolleweerd et al., 2018) and Start-Stop assembly (Taylor et al., 2019). Various studies have shown that optimised design of the single-stranded linkers is crucial for the efficient one-pot assembly of multiple parts when attempting Golden Gate assemblies (HamediRad et al., 2019; Potapov et al., 2018). Interestingly, Coussement et al. (Coussement et al., 2017) extended the approach to include single-strand assembly of variant sequences, creating promoter libraries that were then assembled into genetic pathways by Golden Gate to screen for increased pathway yield. Promoter and RBS libraries have also been assembled for screening using the EcoFlex methodology, a Golden Gate approach that uses multiple pre-assembly levels to build pathway variants (Moore et al., 2016). Recently, VersaTile utilised the Golden Gate approach to create random “mix-and-match” protein modules libraries (Gerstmans et al., 2020).
One limitation to Golden Gate approaches is the necessity to encode specific overhang sequences at each terminus to direct the order of the assembled parts. For combinatorial assemblies, where it is desirable to rearrange part order and have variable numbers of parts, unique overhang sequences must be generated for each part in each combination, which dramatically increases the workload for preparation of parts prior to assembly. This limitation is shared by other methods that depend on homologous sequences to confer directionality during DNA assembly, notably the isothermal assembly methods developed by Gibson et al. (Gibson, 2009; Gibson, 2011; Gibson et al., 2009). These methods are popular for the assembly of multiple DNA fragments and there are some examples of their use for combinatorial assemblies (Halleran et al., 2018; Santos-Moreno and Schaerli, 2019; Torella et al., 2014).
3.3.2. Ligation-based methods
DNA concatenation via ligation is an efficient approach for mutation of multiple sites. Rather than using mutagenic oligonucleotides as PCR primers, they are instead hybridised to a template sequence, a polymerase then extends the oligonucleotide to fill in the gaps between hybridised oligonucleotides to create adjacent termini that are joined together by a DNA ligase. This approach provides a more attractive “one-pot” approach compared to PCR-based methods, which is advantageous when looking to increase throughput using automation. The Darwin assembly method (Cozens and Pinheiro, 2018) recently combines the ligation-based method with streptavidin-biotin purification to create a streamlined workflow, with an efficiency suitable to create large (108 variants) and diverse combinatorial libraries (mutating up to 10 separate sites simultaneously, Fig. 3A).
The majority of synthetic biology approaches for the assembly of diverse pathway sequences focus on the modular assembly of multiple parts in different orders, in contrast to DE approaches to create gene libraries of variant sequence. However, Jin et al. (Jin et al., 2016b) described a ligation and PCR-based approach that is compatible for both gene and pathway variant libraries, called ‘rapidly efficient combinatorial oligonucleotides for directed evolution’ (RECODE, Fig. 3B). Through hybridisation of multiple mutagenic oligonucleotides that can bind anywhere within a pathway construct, combinatorial libraries were created whilst simultaneously mutating RBS, promoter and gene sequences for screening of compound yield. RECODE could be considered as a complementary approach to modular assembly methods like Golden Gate as, whilst it is unable to alter part orders, it possesses advantages in creating larger combinatorial variant libraries by mutating multiple sites across a pathway simultaneously.
For combinatorial pathway assemblies, the Ligase Cycling Reaction (LCR) approach does not require any addition of terminal sequences to each part (unlike restriction enzyme-based methods like Golden Gate). The directionality of LCR is controlled using bridging oligonucleotides, which are complementary to the terminal 5′ and 3′ sequences of the two parts to be joined (Fig. 4). Through repeated thermocycling, these oligonucleotides hybridise two target strands, creating adjacent 5′ and 3′ termini that are then ligated using a thermostable ligase (de Kok et al., 2014; Wiedmann et al., 1994). This approach means that the preparation of parts is the same for all assemblies and therefore each part only needs to be prepared once, providing a flexible platform on which to create countless combinatorial assemblies (Robinson et al., 2018). Automation of this approach has enabled high-throughput DoE studies as a means to optimise the biosynthesis of a number of natural products and commodity chemicals in E. coli (Carbonell et al., 2018; Robinson et al., 2020). Whilst LCR provides the most flexible assembly platform, its efficiency reduces for larger assemblies (>12 kb plasmids), a difficulty that Yuan et al. (Yuan et al., 2016) overcame by combining LCR with the ‘DNA assembler’ method in Saccharomyces cerevisiae, which assembled plasmids of 44 kb with an impressive 71% efficiency.
3.3.3. PCR-based methods
Mutagenesis using overlap extension PCR (OE-PCR) is a popular two-step process for DE, whereby truncated gene amplicons are created using mutagenic PCR primers, which are then assembled together in a second step to create the full target sequence library (Bryksin and Matsumura, 2010; Heckman and Pease, 2007; Hussain and Chong, 2016; Williams et al., 2014; Xiao and Pei, 2011) (Fig. 5A). Further developments in this approach have made it amenable to combinatorial mutations (An et al., 2005; Cheng et al., 2017; Wäneskog and Bjerling, 2014; Wei et al., 2012). Similarly, mutagenesis using asymmetric PCR employs two steps, where mutagenic primers are used to create single-stranded products, which are then deployed as megaprimers to introduce these mutations in the second step (Bi et al., 2012; Sadler et al., 2018). Interestingly, pathway assembly by OE-PCR is not widely used. Trial experiments with OE-PCR in our laboratory have shown that the necessary long-range PCR runs have variable efficiency and this may account for the method's apparent unpopularity in large pathway assemblies (unpublished results).
Gene synthesis, whereby gene sequences are assembled from short oligonucleotides de novo (without a template), can also be as achieved using OE-PCR (Fig. 5B). Here, each oligonucleotide overlaps with the next, such that these short fragments can be stitched together during PCR to create an entire gene. As the gene is assembled de novo each time, mutations can be encoded by any oligonucleotide sequence in any position (providing they do not fall where oligonucleotides overlap), permitting large and diverse combinatorial libraries to be synthesised (Currin et al., 2014; Currin et al., 2017). This is an efficient process for DNA assemblies of less than 2 kb; however, due to the inherent error rate (arising from the oligonucleotide synthesis chemistry) the efficiency of larger assemblies drops significantly. Consequently, gene synthesis is typically used as the means to generate the synthetic biology ‘parts’, which are then assembled into larger constructs using the alternative methods outlined above.
4. Discussion
This review highlights recent developments in the creation of genetic diversity in vitro, ranging from large libraries of single gene targets for traditional DE to the assembly of larger combinatorial pathway libraries for systems-level DE by synthetic biology. For all approaches, the challenge of creating sufficient diversity to capture the desired functional improvement is a difficult one, given the potential sequence space and inevitable limitations on experimental throughput. Various methodologies have been outlined that attempt to address this issue, including reducing the necessary experimental effort (i.e. reducing bias and redundancy of gene libraries), creating ‘smart’ libraries of reduced diversity (restricting the search space to statistically designed variants), as well as improving process efficiency through new methods and automation.
A common issue for synthesis of genetic diversity in vitro is the transfer of this material to a screening platform in vivo, often achieved through transformation of plasmid constructs into a microbial host. Despite the availability of high efficiency bacterial hosts (achieving >109 colony forming units per μg of DNA), the efficiency of transforming cloning or assembly reactions can be many orders of magnitude lower, primarily due to the relatively low efficiency of perfect plasmid assembly in vitro. For example, whilst Golden Gate is a reliable tool for the assembly of large constructs (often assembling >20 kb with over 20 parts), transformation of these assembly reactions typically yields 102–103 colonies (Coussement et al., 2017), with efficiency reducing further as the number of assembled parts increases (in contrast, the simple cloning of single gene parts in DE enables efficiencies of >106 transformants). This limits the production of large combinatorial libraries with these methods, so effort is usually concentrated on the assembly of pre-selected construct designs in parallel. This dramatically increases resource requirements, is reliant on automation and reduces the screening scope. Therefore, improvements in assembly and cell transformation efficiency are required for the screening of large, more diverse synthetic pathway libraries. Alternatively, recent advancements in cell-free technologies could negate the need to conduct for transformations, and thereby increase the number of variants available for screening (Lu, 2017; Yehezkel et al., 2016).
For all of the approaches described here, the efficiency to the entire experimental process is greatly enhanced through integration with in silico design and learn strategies. At the design stage, integrating assembly designs with laboratory automation scripts greatly increases throughput and accuracy by reducing manual intervention steps (Carbonell et al., 2016). At the learn stage, machine learning algorithms provide a means to interrogate complex biological networks to unearth optimal sequences based on limited sampling of the design space. Machine learning has only recently been exploited for the DE of fluorescent proteins and enzymes (Fox, 2005; Mazurenko et al., 2020; Saito et al., 2018; Wu et al., 2016a; Wu et al., 2019b), and it has been tentatively explored in synthetic biology applications with small library sizes (Jervis et al., 2019; Opgenorth et al., 2019). If synthetic biology seeks to harness the potential of machine learning it will require the characterisation of large, diverse sequence libraries to provide enough data to generate meaningful predictions. To provide this, synthetic biology approaches must continue to improve assembly speed and accuracy, permitting a larger throughput of more diverse sequences. It is possible that advances in oligonucleotide synthesis technology could help in this regard. Protocols need to be streamlined for end-to-end automation, for example with use of one-pot assemblies or minimising complex liquid handling and DNA purification procedures. Statistical methods, like DoE, require smaller samples of sequence and system diversity and are a powerful means to optimise multifactorial problems to obtain improved function, albeit within more limited parameters than machine learning. It is envisaged that these computational approaches will provide a better understanding as to the key factors affecting the performance of biological molecules and systems. This knowledge could then be utilised to improve the predictability of biological engineering, a goal that can only be achieved through the accurate and efficient utilisation of genetic diversity.
Declaration of Competing Interest
AC is the named inventor for a patent describing a mutagenesis method (GeneORator), which is mentioned in the manuscript.
Acknowledgements
We acknowledge funding from the Biotechnology and Biological Sciences Research Council (BBSRC) under grant BB/M017702/1 “Centre for synthetic biology of fine and speciality chemicals (SYNBIOCHEM)”. The authors thank the SYNBIOCHEM team for their comments and suggestions in preparing this manuscript.
Contributor Information
Andrew Currin, Email: andrew.currin@manchester.ac.uk.
Rainer Breitling, Email: rainer.breitling@manchester.ac.uk.
References
- Acevedo-Rocha C.G., Reetz M.T., Nov Y. Economical analysis of saturation mutagenesis experiments. Sci. Rep. 2015;5:10654. doi: 10.1038/srep10654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Agresti J.J., Antipov E., Abate A.R., Ahn K., Rowat A.C., Baret J.-C., Marquez M., Klibanov A.M., Griffiths A.D., Weitz D.A. Ultrahigh-throughput screening in drop-based microfluidics for directed evolution. PNAS. 2010;107:4004–4009. doi: 10.1073/pnas.0910781107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- An Y., Ji J., Wu W., Lv A., Huang R., Wei Y. A rapid and efficient method for multiple-site mutagenesis with a modified overlap extension PCR. Appl. Microbiol. Biotechnol. 2005;68:774–778. doi: 10.1007/s00253-005-1948-8. [DOI] [PubMed] [Google Scholar]
- Arnold F.H. Protein engineering for unusual environments. Curr. Opin. Biotechnol. 1993;4:450–455. doi: 10.1016/0958-1669(93)90011-K. [DOI] [PubMed] [Google Scholar]
- Ashraf M., Frigotto L., Smith M.E., Patel S., Hughes M.D., Poole A.J., Hebaishi H.R.M., Ullman C.G., Hine A.V. ProxiMAX randomization: a new technology for non-degenerate saturation mutagenesis of contiguous codons. Biochem. Soc. Trans. 2013;41:1189–1194. doi: 10.1042/BST20130123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Autour A., Ryckelynck M. Ultrahigh-throughput improvement and discovery of enzymes using droplet-based microfluidic screening. Micromachines. 2017;8:128. doi: 10.3390/mi8040128. [DOI] [Google Scholar]
- Bi Y., Qiao X., Hua Z., Zhang L., Liu X., Li L., Hua W., Xiao H., Zhou J., Wei Q., Zheng X. An asymmetric PCR-based, reliable and rapid single-tube native DNA engineering strategy. BMC Biotechnol. 2012;12:39. doi: 10.1186/1472-6750-12-39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biggs B.W., De Paepe B., Santos C.N.S., De Mey M., Kumaran Ajikumar P. Multivariate modular metabolic engineering for pathway and strain optimization. Curr. Opin. Biotechnol. Cell Pathway Eng. 2014;29:156–162. doi: 10.1016/j.copbio.2014.05.005. [DOI] [PubMed] [Google Scholar]
- Biles B.D., Connolly B.A. Low-fidelity Pyrococcus furiosus DNA polymerase mutants useful in error-prone PCR. Nucleic Acids Res. 2004;32:e176. doi: 10.1093/nar/gnh174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Breitling R., Takano E. Synthetic biology advances for pharmaceutical production. Current opinion in biotechnology. Chem. Biotechnol. Pharmaceut. Biotechnol. 2015;35:46–51. doi: 10.1016/j.copbio.2015.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bryksin A.V., Matsumura I. Overlap extension PCR cloning: a simple and reliable way to create recombinant plasmids. Biotechniques. 2010;48:463–465. doi: 10.2144/000113418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cadwell R.C., Joyce G.F. Randomization of genes by PCR mutagenesis. Genome Res. 1992;2:28–33. doi: 10.1101/gr.2.1.28. [DOI] [PubMed] [Google Scholar]
- Carbonell P., Currin A., Jervis A.J., Rattray N.J.W., Swainston N., Yan C., Takano E., Breitling R. Bioinformatics for the synthetic biology of natural products: integrating across the design–build–test cycle. Nat. Prod. Rep. 2016;33:925–932. doi: 10.1039/C6NP00018E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carbonell P., Jervis A.J., Robinson C.J., Yan C., Dunstan M., Swainston N., Vinaixa M., Hollywood K.A., Currin A., Rattray N.J.W., Taylor S., Spiess R., Sung R., Williams A.R., Fellows D., Stanford N.J., Mulherin P., Feuvre R.L., Barran P., Goodacre R., Turner N.J., Goble C., Chen G.G., Kell D.B., Micklefield J., Breitling R., Takano E., Faulon J.-L., Scrutton N.S. An automated design-build-test-learn pipeline for enhanced microbial production of fine chemicals. Commun. Biol. 2018;1:66. doi: 10.1038/s42003-018-0076-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caruthers M.H., Barone A.D., Beaucage S.L., Dodds D.R., Fisher E.F., McBride L.J., Matteucci M., Stabinsky Z., Tang J.-Y. [15] chemical synthesis of deoxyoligonucleotides by the phosphoramidite method. In: Ray Wu L.G., editor. Methods in Enzymology, Recombinant DNA Part E. Academic Press; 1987. pp. 287–313. [DOI] [PubMed] [Google Scholar]
- Casini A., MacDonald J.T., Jonghe J.D., Christodoulou G., Freemont P.S., Baldwin G.S., Ellis T. One-pot DNA construction for synthetic biology: the modular overlap-directed assembly with linkers (MODAL) strategy. Nucleic Acids Res. 2013:gkt915. doi: 10.1093/nar/gkt915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casini A., Storch M., Baldwin G.S., Ellis T. Bricks and blueprints: methods and standards for DNA assembly. Nat. Rev. Mol. Cell Biol. 2015;16:568–576. doi: 10.1038/nrm4014. [DOI] [PubMed] [Google Scholar]
- Cavaleiro A.M., Kim S.H., Seppälä S., Nielsen M.T., Nørholm M.H.H. Accurate DNA assembly and genome engineering with optimized uracil excision cloning. ACS Synth. Biol. 2015;4:1042–1046. doi: 10.1021/acssynbio.5b00113. [DOI] [PubMed] [Google Scholar]
- Chao R., Mishra S., Si T., Zhao H. Engineering biological systems using automated biofoundries. Metab. Eng. 2017;42:98–108. doi: 10.1016/j.ymben.2017.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen K.Q., Arnold F.H. Enzyme engineering for nonaqueous solvents: random mutagenesis to enhance activity of subtilisin E in polar organic media. Biotechnology (N.Y.) 1991;9:1073–1077. doi: 10.1038/nbt1191-1073. [DOI] [PubMed] [Google Scholar]
- Chen K., Arnold F.H. Tuning the activity of an enzyme for unusual environments: sequential random mutagenesis of subtilisin E for catalysis in dimethylformamide. Proc. Natl. Acad. Sci. U. S. A. 1993;90:5618–5622. doi: 10.1073/pnas.90.12.5618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng F., Zhu L., Schwaneberg U. Directed evolution 2.0: improving and deciphering enzyme properties. Chem. Commun. 2015;51:9760–9772. doi: 10.1039/C5CC01594D. [DOI] [PubMed] [Google Scholar]
- Cheng F., Xu J.-M., Xiang C., Liu Z.-Q., Zhao L.-Q., Zheng Y.-G. Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis. Biotechnol. Lett. 2017;39:567–575. doi: 10.1007/s10529-016-2278-x. [DOI] [PubMed] [Google Scholar]
- Cherry J.R., Fidantsef A.L. Directed evolution of industrial enzymes: an update. Curr. Opin. Biotechnol. 2003;14:438–443. doi: 10.1016/S0958-1669(03)00099-5. [DOI] [PubMed] [Google Scholar]
- Chi H., Wang X., Shao Y., Qin Y., Deng Z., Wang L., Chen S. Engineering and modification of microbial chassis for systems and synthetic biology. Synthetic Syst. Biotechnol. 2019;4:25–33. doi: 10.1016/j.synbio.2018.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cleary M.A., Kilian K., Wang Y., Bradshaw J., Cavet G., Ge W., Kulkarni A., Paddison P.J., Chang K., Sheth N., Leproust E., Coffey E.M., Burchard J., McCombie W.R., Linsley P., Hannon G.J. Production of complex nucleic acid libraries using highly parallel in situ oligonucleotide synthesis. Nat. Methods. 2004;1:241–248. doi: 10.1038/nmeth724. [DOI] [PubMed] [Google Scholar]
- Copp Janine N., Hanson-Manful P., Ackerley David F., Patrick Wayne M. Error-Prone PCR and effective generation of gene variant libraries for directed evolution. In: Gillam E.M.J., Copp J.N., Ackerley David, editors. Directed Evolution Library Creation, Methods in Molecular Biology. Springer; New York: 2014. pp. 3–22. [DOI] [PubMed] [Google Scholar]
- Coussement P., Bauwens D., Maertens J., De Mey M. Direct combinatorial pathway optimization. ACS Synth. Biol. 2017;6:224–232. doi: 10.1021/acssynbio.6b00122. [DOI] [PubMed] [Google Scholar]
- Cozens C., Pinheiro V.B. Darwin assembly: fast, efficient, multi-site bespoke mutagenesis. Nucleic Acids Res. 2018;46:e51. doi: 10.1093/nar/gky067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Currin A., Swainston N., Day P.J., Kell D.B. SpeedyGenes: an improved gene synthesis method for the efficient production of error-corrected, synthetic protein libraries for directed evolution. Protein Eng. Des. Sel. 2014;27:273–280. doi: 10.1093/protein/gzu029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Currin A., Swainston N., Day P.J., Kell D.B. Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently. Chem. Soc. Rev. 2015;44:1172–1239. doi: 10.1039/C4CS00351A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Currin A., Swainston N., Day P.J., Kell D.B. SpeedyGenes: exploiting an improved gene synthesis method for the efficient production of synthetic protein libraries for directed evolution. Methods Mol. Biol. 2017;1472:63–78. doi: 10.1007/978-1-4939-6343-0_5. [DOI] [PubMed] [Google Scholar]
- Currin A., Kwok J., Sadler J.C., Bell E.L., Swainston N., Ababi M., Day P., Turner N.J., Kell D.B. GeneORator: An effective strategy for navigating protein sequence space more efficiently through Boolean OR-type DNA libraries. ACS Synth. Biol. 2019;8:1371–1378. doi: 10.1021/acssynbio.9b00063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Czar M.J., Anderson J.C., Bader J.S., Peccoud J. Gene synthesis demystified. Trends Biotechnol. 2009;27:63–72. doi: 10.1016/j.tibtech.2008.10.007. [DOI] [PubMed] [Google Scholar]
- David F., Nielsen J., Siewers V. Flux control at the Malonyl-CoA node through hierarchical dynamic pathway regulation in Saccharomyces cerevisiae. ACS Synth. Biol. 2016;5:224–233. doi: 10.1021/acssynbio.5b00161. [DOI] [PubMed] [Google Scholar]
- de Kok S., Stanton L.H., Slaby T., Durot M., Holmes V.F., Patel K.G., Platt D., Shapland E.B., Serber Z., Dean J., Newman J.D., Chandran S.S. Rapid and reliable DNA assembly via ligase cycling reaction. ACS Synth. Biol. 2014;3:97–106. doi: 10.1021/sb4001992. [DOI] [PubMed] [Google Scholar]
- Decoene T., Paepe B.D., Maertens J., Coussement P., Peters G., Maeseneire S.L.D., Mey M.D. Standardization in synthetic biology: an engineering discipline coming of age. Crit. Rev. Biotechnol. 2018;38:647–656. doi: 10.1080/07388551.2017.1380600. [DOI] [PubMed] [Google Scholar]
- Ding W., Weng H., Jin P., Du G., Chen J., Kang Z. Scarless assembly of unphosphorylated DNA fragments with a simplified DA℡ method. Bioengineered. 2017;8:296–301. doi: 10.1080/21655979.2017.1308986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dougherty M.J., Arnold F.H. Directed evolution: new parts and optimized function. Curr. Opin. Biotechnol. 2009;20:486–491. doi: 10.1016/j.copbio.2009.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dueber J.E., Wu G.C., Malmirchegini G.R., Moon T.S., Petzold C.J., Ullal A.V., Prather K.L.J., Keasling J.D. Synthetic protein scaffolds provide modular control over metabolic flux. Nat. Biotechnol. 2009;27:753–759. doi: 10.1038/nbt.1557. [DOI] [PubMed] [Google Scholar]
- Eisenstein M. How to build a genome. Nature. 2020;578:633–635. doi: 10.1038/d41586-020-00511-9. [DOI] [PubMed] [Google Scholar]
- Ellis T., Adie T., Baldwin G.S. DNA assembly for synthetic biology: from parts to pathways and beyond. Integr. Biol. 2011;3:109–118. doi: 10.1039/C0IB00070A. [DOI] [PubMed] [Google Scholar]
- Engler C., Kandzia R., Marillonnet S. A one pot, one step, precision cloning method with high throughput capability. PLoS One. 2008;3 doi: 10.1371/journal.pone.0003647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engqvist M.K.M., Nielsen J. ANT: software for generating and evaluating degenerate codons for natural and expanded genetic codes. ACS Synth. Biol. 2015;4:935–938. doi: 10.1021/acssynbio.5b00018. [DOI] [PubMed] [Google Scholar]
- Foo J.L., Chang M.W. Synthetic yeast genome reveals its versatility. Nature. 2018;557:647–648. doi: 10.1038/d41586-018-05164-3. [DOI] [PubMed] [Google Scholar]
- Fox R. Directed molecular evolution by machine learning and the influence of nonlinear interactions. J. Theor. Biol. 2005;234:187–199. doi: 10.1016/j.jtbi.2004.11.031. [DOI] [PubMed] [Google Scholar]
- Gao X., Yan P., Shen W., Li X., Zhou P., Li Y. Modular construction of plasmids by parallel assembly of linear vector components. Anal. Biochem. 2013;437:172–177. doi: 10.1016/j.ab.2013.02.028. [DOI] [PubMed] [Google Scholar]
- Genee H.J., Bonde M.T., Bagger F.O., Jespersen J.B., Sommer M.O.A., Wernersson R., Olsen L.R. Software-supported USER cloning strategies for site-directed mutagenesis and DNA assembly. ACS Synth. Biol. 2015;4:342–349. doi: 10.1021/sb500194z. [DOI] [PubMed] [Google Scholar]
- Gerosa L., Sauer U. Regulation and control of metabolic fluxes in microbes. Curr. Opin. Biotechnol. Nanobiotechnol. Syst. Biol. 2011;22:566–575. doi: 10.1016/j.copbio.2011.04.016. [DOI] [PubMed] [Google Scholar]
- Gerstmans H., Grimon D., Gutiérrez D., Lood C., Rodríguez A., van Noort V., Lammertyn J., Lavigne R., Briers Y. A VersaTile-driven platform for rapid hit-to-lead development of engineered lysins. Sci. Adv. 2020;6:eaaz1136. doi: 10.1126/sciadv.aaz1136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geu-Flores F., Nour-Eldin H.H., Nielsen M.T., Halkier B.A. USER fusion: a rapid and efficient method for simultaneous fusion and cloning of multiple PCR products. Nucleic Acids Res. 2007;35:e55. doi: 10.1093/nar/gkm106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gibson D.G. Synthesis of DNA fragments in yeast by one-step assembly of overlapping oligonucleotides. Nucleic Acids Res. 2009;37:6984–6990. doi: 10.1093/nar/gkp687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gibson D.G. Chapter fifteen - enzymatic assembly of overlapping DNA fragments. In: Voigt Christopher., editor. Methods in Enzymology. Academic Press; 2011. pp. 349–361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gibson D.G., Benders G.A., Andrews-Pfannkoch C., Denisova E.A., Baden-Tillson H., Zaveri J., Stockwell T.B., Brownley A., Thomas D.W., Algire M.A., Merryman C., Young L., Noskov V.N., Glass J.I., Venter J.C., Hutchison C.A., Smith H.O. Complete chemical synthesis, assembly, and cloning of a mycoplasma genitalium genome. Science. 2008;319:1215–1220. doi: 10.1126/science.1151721. [DOI] [PubMed] [Google Scholar]
- Gibson D.G., Young L., Chuang R.-Y., Venter J.C., Hutchison C.A., Smith H.O. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods. 2009;6:343–345. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
- Gibson D.G., Smith H.O., Iii C.A.H., Venter J.C., Merryman C. Chemical synthesis of the mouse mitochondrial genome. Nat. Methods. 2010;7:901–903. doi: 10.1038/nmeth.1515. [DOI] [PubMed] [Google Scholar]
- Goldenzweig A., Goldsmith M., Hill S.E., Gertman O., Laurino P., Ashani Y., Dym O., Unger T., Albeck S., Prilusky J., Lieberman R.L., Aharoni A., Silman I., Sussman J.L., Tawfik D.S., Fleishman S.J. Automated structure- and sequence-based Design of Proteins for high bacterial expression and stability. Mol. Cell. 2016;63:337–346. doi: 10.1016/j.molcel.2016.06.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Halleran A.D., Swaminathan A., Murray R.M. Single Day construction of multigene circuits with 3G assembly. ACS Synth. Biol. 2018;7:1477–1480. doi: 10.1021/acssynbio.8b00060. [DOI] [PubMed] [Google Scholar]
- Halweg-Edwards A.L., Pines G., Winkler J.D., Pines A., Gill R.T. A web Interface for codon compression. ACS Synth. Biol. 2016;5:1021–1023. doi: 10.1021/acssynbio.6b00026. [DOI] [PubMed] [Google Scholar]
- HamediRad M., Weisberg S., Chao R., Lian J., Zhao H. Highly efficient single-pot Scarless Golden Gate assembly. ACS Synth. Biol. 2019;8:1047–1054. doi: 10.1021/acssynbio.8b00480. [DOI] [PubMed] [Google Scholar]
- Hayashi Y., Aita T., Toyota H., Husimi Y., Urabe I., Yomo T. Experimental rugged fitness landscape in protein sequence space. PLoS One. 2006;1 doi: 10.1371/journal.pone.0000096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heckman K.L., Pease L.R. Gene splicing and mutagenesis by PCR-driven overlap extension. Nat. Protoc. 2007;2:924–932. doi: 10.1038/nprot.2007.132. [DOI] [PubMed] [Google Scholar]
- Hillson N., Caddick M., Cai Y., Carrasco J.A., Chang M.W., Curach N.C., Bell D.J., Le Feuvre R., Friedman D.C., Fu X., Gold N.D., Herrgård M.J., Holowko M.B., Johnson J.R., Johnson R.A., Keasling J.D., Kitney R.I., Kondo A., Liu C., Martin V.J.J., Menolascina F., Ogino C., Patron N.J., Pavan M., Poh C.L., Pretorius I.S., Rosser S.J., Scrutton N.S., Storch M., Tekotte H., Travnik E., Vickers C.E., Yew W.S., Yuan Y., Zhao H., Freemont P.S. Building a global alliance of biofoundries. Nat. Commun. 2019;10:2040. doi: 10.1038/s41467-019-10079-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoebenreich S., Zilly F.E., Acevedo-Rocha C.G., Zilly M., Reetz M.T. Speeding up directed evolution: combining the advantages of solid-phase combinatorial gene synthesis with statistically guided reduction of screening effort. ACS Synth. Biol. 2014 doi: 10.1021/sb5002399. [DOI] [PubMed] [Google Scholar]
- Hughes R.A., Miklos A.E., Ellington A.D. Chapter twelve - gene synthesis: methods and applications. In: Voigt Christopher., editor. Methods in Enzymology. Academic Press; 2011. pp. 277–309. [DOI] [PubMed] [Google Scholar]
- Hussain H., Chong N.F.-M. Combined overlap extension PCR method for improved site directed mutagenesis. Biomed. Res. Int. 2016:2016. doi: 10.1155/2016/8041532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hutchison C.A., Phillips S., Edgell M.H., Gillam S., Jahnke P., Smith M. Mutagenesis at a specific position in a DNA sequence. J. Biol. Chem. 1978;253:6551–6560. [PubMed] [Google Scholar]
- Itakura K., Rossi J.J., Wallace R.B. Synthesis and use of synthetic oligonucleotides. Annu. Rev. Biochem. 1984;53:323–356. doi: 10.1146/annurev.bi.53.070184.001543. [DOI] [PubMed] [Google Scholar]
- Iverson S.V., Haddock T.L., Beal J., Densmore D.M. CIDAR MoClo: improved MoClo assembly standard and new E. coli part library enable rapid combinatorial Design for Synthetic and Traditional Biology. ACS Synth. Biol. 2016;5:99–103. doi: 10.1021/acssynbio.5b00124. [DOI] [PubMed] [Google Scholar]
- Jacobs T.M., Yumerefendi H., Kuhlman B., Leaver-Fay A. SwiftLib: rapid degenerate-codon-library optimization through dynamic programming. Nucleic Acids Res. 2015;43 doi: 10.1093/nar/gku1323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jervis A.J., Carbonell P., Vinaixa M., Dunstan M.S., Hollywood K.A., Robinson C.J., Rattray N.J.W., Yan C., Swainston N., Currin A., Sung R., Toogood H., Taylor S., Faulon J.-L., Breitling R., Takano E., Scrutton N.S. Machine learning of designed translational control allows predictive pathway optimization in Escherichia coli. ACS Synth. Biol. 2019;8:127–136. doi: 10.1021/acssynbio.8b00398. [DOI] [PubMed] [Google Scholar]
- Jeschek M., Gerngross D., Panke S. Rationally reduced libraries for combinatorial pathway optimization minimizing experimental effort. Nat. Commun. 2016;7:11163. doi: 10.1038/ncomms11163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeschek M., Gerngross D., Panke S. Combinatorial pathway optimization for streamlined metabolic engineering. Curr. Opin. Biotechnol. Tissue Cell Pathway Eng. 2017;47:142–151. doi: 10.1016/j.copbio.2017.06.014. [DOI] [PubMed] [Google Scholar]
- Jia B., Wu Y., Li B.-Z., Mitchell L.A., Liu H., Pan S., Wang J., Zhang H.-R., Jia N., Li B., Shen M., Xie Z.-X., Liu D., Cao Y.-X., Li X., Zhou X., Qi H., Boeke J.D., Yuan Y.-J. Precise control of SCRaMbLE in synthetic haploid and diploid yeast. Nat. Commun. 2018;9:1933. doi: 10.1038/s41467-018-03084-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin P., Ding W., Du G., Chen J., Kang Z. DATEL: a Scarless and sequence-independent DNA assembly method using Thermostable exonucleases and ligase. ACS Synth. Biol. 2016;5:1028–1032. doi: 10.1021/acssynbio.6b00078. [DOI] [PubMed] [Google Scholar]
- Jin P., Kang Z., Zhang J., Zhang L., Du G., Chen J. Combinatorial evolution of enzymes and synthetic pathways using one-step PCR. ACS Synth. Biol. 2016;5:259–268. doi: 10.1021/acssynbio.5b00240. [DOI] [PubMed] [Google Scholar]
- Khalil A.S., Collins J.J. Synthetic biology: applications come of age. Nat. Rev. Genet. 2010;11:367–379. doi: 10.1038/nrg2775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khersonsky O., Lipsh R., Avizemer Z., Ashani Y., Goldsmith M., Leader H., Dym O., Rogotner S., Trudeau D.L., Prilusky J., Amengual-Rigo P., Guallar V., Tawfik D.S., Fleishman S.J. Automated design of efficient and functionally diverse enzyme repertoires. Mol. Cell. 2018;72 doi: 10.1016/j.molcel.2018.08.033. 178-186.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kikuchi M., Ohnishi K., Harayama S. Novel family shuffling methods for the in vitro evolution of enzymes. Gene. 1999;236:159–167. doi: 10.1016/S0378-1119(99)00240-1. [DOI] [PubMed] [Google Scholar]
- Kille S., Acevedo-Rocha C.G., Parra L.P., Zhang Z.-G., Opperman D.J., Reetz M.T., Acevedo J.P. Reducing codon redundancy and screening effort of combinatorial protein libraries created by saturation mutagenesis. ACS Synth. Biol. 2013;2:83–92. doi: 10.1021/sb300037w. [DOI] [PubMed] [Google Scholar]
- Kondrashov D.A., Kondrashov F.A. Topological features of rugged fitness landscapes in sequence space. Trends Genet. 2015;31:24–33. doi: 10.1016/j.tig.2014.09.009. [DOI] [PubMed] [Google Scholar]
- LeProust E., Zhang H., Yu P., Zhou X., Gao X. Characterization of oligodeoxyribonucleotide synthesis on glass plates. Nucleic Acids Res. 2001;29:2171–2180. doi: 10.1093/nar/29.10.2171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- LeProust E.M., Peck B.J., Spirin K., McCuen H.B., Moore B., Namsaraev E., Caruthers M.H. Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process. Nucleic Acids Res. 2010;38:2522–2540. doi: 10.1093/nar/gkq163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li A., Acevedo-Rocha C.G., Sun Z., Cox T., Xu J.L., Reetz M.T. Beating Bias in the directed evolution of proteins: combining high-Fidelity on-Chip solid-phase gene synthesis with efficient gene assembly for combinatorial library construction. ChemBioChem. 2018;19:221–228. doi: 10.1002/cbic.201700540. [DOI] [PubMed] [Google Scholar]
- Li H., Huang Y., Wei Z., Wang W., Yang Z., Liang Z., Li Z. An oligonucleotide synthesizer based on a microreactor chip and an inkjet printer. Sci. Rep. 2019;9:5058. doi: 10.1038/s41598-019-41519-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu Y. Vol. 2. 2017. Cell-free synthetic biology: engineering in an open world. Synthetic and Systems Biotechnology, A tribute to Arny Demain, for his lifelong pioneering contributions to biochemical engineering; pp. 23–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ma S., Saaem I., Tian J. Error correction in gene synthesis technology. Trends Biotechnol. 2012;30:147–154. doi: 10.1016/j.tibtech.2011.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ma L., Li Y., Chen X., Ding M., Wu Y., Yuan Y.-J. SCRaMbLE generates evolved yeasts with increased alkali tolerance. Microb. Cell Factories. 2019;18:52. doi: 10.1186/s12934-019-1102-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mao Y., Lin J., Zhou A., Ji K., Downey J.S., Chen R., Han A. Quikgene: a gene synthesis method integrated with ligation-free cloning. Anal. Biochem. 2011;415:21–26. doi: 10.1016/j.ab.2011.04.004. [DOI] [PubMed] [Google Scholar]
- Mazurenko S., Prokop Z., Damborsky J. Machine learning in enzyme engineering. ACS Catal. 2020;10:1210–1223. doi: 10.1021/acscatal.9b04321. [DOI] [Google Scholar]
- McCullum E.O., Williams B.A.R., Zhang J., Chaput J.C. Random mutagenesis by error-prone PCR. In: Braman J., editor. In Vitro Mutagenesis Protocols, Methods in Molecular Biology. Humana Press; 2010. pp. 103–109. [DOI] [PubMed] [Google Scholar]
- Miton C.M., Tokuriki N. How mutational epistasis impairs predictability in protein evolution and design. Protein Sci. 2016;25:1260–1272. doi: 10.1002/pro.2876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore S.J., Lai H.-E., Kelwick R.J.R., Chee S.M., Bell D.J., Polizzi K.M., Freemont P.S. EcoFlex: a multifunctional MoClo kit for E. coli synthetic biology. ACS Synth. Biol. 2016;5:1059–1069. doi: 10.1021/acssynbio.6b00031. [DOI] [PubMed] [Google Scholar]
- Naseri G., Koffas M.A.G. Application of combinatorial optimization strategies in synthetic biology. Nat. Commun. 2020;11:2446. doi: 10.1038/s41467-020-16175-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nour-Eldin H.H., Hansen B.G., Nørholm M.H.H., Jensen J.K., Halkier B.A. Advancing uracil-excision based cloning towards an ideal technique for cloning PCR fragments. Nucleic Acids Res. 2006;34 doi: 10.1093/nar/gkl635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nov Y. When second best is good enough: another probabilistic look at saturation mutagenesis. Appl. Environ. Microbiol. 2012;78:258–262. doi: 10.1128/AEM.06265-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Opgenorth P., Costello Z., Okada T., Goyal G., Chen Y., Gin J., Benites V., de Raad M., Northen T.R., Deng K., Deutsch S., Baidoo E.E.K., Petzold C.J., Hillson N.J., Garcia Martin H., Beller H.R. Lessons from two design–build–test–learn cycles of Dodecanol production in Escherichia coli aided by machine learning. ACS Synth. Biol. 2019;8:1337–1351. doi: 10.1021/acssynbio.9b00020. [DOI] [PubMed] [Google Scholar]
- Pines G., Pines A., Garst A.D., Zeitoun R.I., Lynch S.A., Gill R.T. Codon compression algorithms for saturation mutagenesis. ACS Synth. Biol. 2015;4:604–614. doi: 10.1021/sb500282v. [DOI] [PubMed] [Google Scholar]
- Popova B., Schubert S., Bulla I., Buchwald D., Kramer W. A robust and Versatile method of combinatorial chemical synthesis of gene libraries via hierarchical assembly of partially randomized modules. PLoS One. 2015;10 doi: 10.1371/journal.pone.0136778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Potapov V., Ong J.L., Kucera R.B., Langhorst B.W., Bilotti K., Pryor J.M., Cantor E.J., Canton B., Knight T.F., Evans T.C., Lohman G.J.S. Comprehensive profiling of four base overhang ligation Fidelity by T4 DNA ligase and application to DNA assembly. ACS Synth. Biol. 2018;7:2665–2674. doi: 10.1021/acssynbio.8b00333. [DOI] [PubMed] [Google Scholar]
- Povolotskaya I.S., Kondrashov F.A. Sequence space and the ongoing expansion of the protein universe. Nature. 2010;465:922–926. doi: 10.1038/nature09105. [DOI] [PubMed] [Google Scholar]
- Pretorius I.S., Boeke J.D. Yeast 2.0-connecting the dots in the construction of the world’s first functional synthetic eukaryotic genome. FEMS Yeast Res. 2018;18 doi: 10.1093/femsyr/foy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qu G., Li A., Acevedo-Rocha C.G., Sun Z., Reetz M.T. The crucial role of methodology development in directed evolution of selective enzymes. Angew. Chem. Int. Ed. 2020;59:2–30. doi: 10.1002/anie.201901491. [DOI] [PubMed] [Google Scholar]
- Quaglia D., Ebert M.C.C.J.C., Mugford P.F., Pelletier J.N. Enzyme engineering: a synthetic biology approach for more effective library generation and automated high-throughput screening. PLoS One. 2017;12 doi: 10.1371/journal.pone.0171741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reetz M.T., Wang L.-W., Bocola M. Directed evolution of Enantioselective enzymes: iterative cycles of CASTing for probing protein-sequence space. Angew. Chem. Int. Ed. 2006;45:1236–1241. doi: 10.1002/anie.200502746. [DOI] [PubMed] [Google Scholar]
- Reetz M.T., Kahakeaw D., Lohmer R. Addressing the numbers problem in directed evolution. ChemBioChem. 2008;9:1797–1804. doi: 10.1002/cbic.200800298. [DOI] [PubMed] [Google Scholar]
- Robinson C.J., Dunstan M.S., Swainston N., Titchmarsh J., Takano E., Scrutton N.S., Jervis A.J. Multifragment DNA assembly of biochemical pathways via automated ligase cycling reaction. In: Scrutton N., editor. Methods in Enzymology, Enzymes in Synthetic Biology. Academic Press; 2018. pp. 369–392. [DOI] [PubMed] [Google Scholar]
- Robinson C.J., Carbonell P., Jervis A.J., Yan C., Hollywood K.A., Dunstan M.S., Currin A., Swainston N., Spiess R., Taylor S., Mulherin P., Parker S., Rowe W., Matthews N.E., Malone K.J., Le Feuvre R., Shapira P., Barran P., Turner N.J., Micklefield J., Breitling R., Takano E., Scrutton N.S. Rapid prototyping of microbial production strains for the biomanufacture of potential materials monomers. Metab. Eng. 2020;60:168–182. doi: 10.1016/j.ymben.2020.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Romero P.A., Arnold F.H. Exploring protein fitness landscapes by directed evolution. Nat. Rev. Mol. Cell Biol. 2009;10:866–876. doi: 10.1038/nrm2805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Romero P.A., Krause A., Arnold F.H. Navigating the protein fitness landscape with Gaussian processes. PNAS. 2013;110:E193–E201. doi: 10.1073/pnas.1215251110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sadler J.C., Green L., Swainston N., Kell D.B., Currin A. Methods in Enzymology. Elsevier; 2018. Fast and flexible synthesis of combinatorial libraries for directed evolution; pp. 59–79. [DOI] [PubMed] [Google Scholar]
- Sailer Z.R., Harms M.J. Uninterpretable Interact. Epistasis Uncertainty. 2018:378489. doi: 10.1101/378489. [DOI] [Google Scholar]
- Saito Y., Oikawa M., Nakazawa H., Niide T., Kameda T., Tsuda K., Umetsu M. Machine-learning-guided mutagenesis for directed evolution of fluorescent proteins. ACS Synth. Biol. 2018;7:2014–2022. doi: 10.1021/acssynbio.8b00155. [DOI] [PubMed] [Google Scholar]
- Santos-Moreno J., Schaerli Y. A framework for the modular and combinatorial assembly of synthetic gene circuits. ACS Synth. Biol. 2019;8:1691–1697. doi: 10.1021/acssynbio.9b00174. [DOI] [PubMed] [Google Scholar]
- Sayous V., Lubrano P., Li Y., Acevedo-Rocha C.G. Unbiased libraries in protein directed evolution. Biochimica et Biophysica Acta (BBA) - Proteins and Proteomics. 2020;1868:140321. doi: 10.1016/j.bbapap.2019.140321. [DOI] [PubMed] [Google Scholar]
- Schindler D., Dai J., Cai Y. Synthetic genomics: a new venture to dissect genome fundamentals and engineer new functions. Curr. Opin. Chem. Biol. Synthetic Biol. Synthetic Biomol. 2018;46:56–62. doi: 10.1016/j.cbpa.2018.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmidt T.L., Beliveau B.J., Uca Y.O., Theilmann M., Da Cruz F., Wu C.-T., Shih W.M. Scalable amplification of strand subsets from chip-synthesized oligonucleotide libraries. Nat. Commun. 2015;6:8634. doi: 10.1038/ncomms9634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shao Z., Zhao H. Construction and engineering of large biochemical pathways via DNA assembler. In: Polizzi K.M., Kontoravdi C., editors. Synthetic Biology, Methods in Molecular Biology. Humana Press; Totowa, NJ: 2013. pp. 85–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shao Z., Zhao Hua, Zhao Huimin. DNA assembler, an in vivo genetic method for rapid construction of biochemical pathways. Nucleic Acids Res. 2009;37 doi: 10.1093/nar/gkn991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simon A.J., d’Oelsnitz S., Ellington A.D. Synthetic evolution. Nat. Biotechnol. 2019;37:730–743. doi: 10.1038/s41587-019-0157-4. [DOI] [PubMed] [Google Scholar]
- Smanski M.J., Bhatia S., Zhao D., Park Y., Woodruff L.B.A., Giannoukos G., Ciulla D., Busby M., Calderon J., Nicol R., Gordon D.B., Densmore D., Voigt C.A. Functional optimization of gene clusters by combinatorial design and assembly. Nat. Biotechnol. 2014;32:1241–1249. doi: 10.1038/nbt.3063. [DOI] [PubMed] [Google Scholar]
- Smanski M.J., Zhou H., Claesen J., Shen B., Fischbach M.A., Voigt C.A. Synthetic biology to access and expand nature’s chemical diversity. Nat. Rev. Microbiol. 2016;14:135–149. doi: 10.1038/nrmicro.2015.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Solomon K.V., Sanders T.M., Prather K.L.J. A dynamic metabolite valve for the control of central carbon metabolism. Metab. Eng. 2012;14:661–671. doi: 10.1016/j.ymben.2012.08.006. [DOI] [PubMed] [Google Scholar]
- Stemmer W.P. DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution. PNAS. 1994;91:10747–10751. doi: 10.1073/pnas.91.22.10747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stemmer W.P.C. Rapid evolution of a protein in vitro by DNA shuffling. Nature. 1994;370:389–391. doi: 10.1038/370389a0. [DOI] [PubMed] [Google Scholar]
- Storch M., Casini A., Mackrow B., Fleming T., Trewhitt H., Ellis T., Baldwin G.S. BASIC: a new biopart assembly standard for idempotent cloning provides accurate, Single-Tier DNA Assembly for Synthetic Biology. ACS Synth. Biol. 2015;4:781–787. doi: 10.1021/sb500356d. [DOI] [PubMed] [Google Scholar]
- Storch M., Casini A., Mackrow B., Ellis T., Baldwin G.S. BASIC: A simple and accurate modular DNA assembly method. In: Hughes R.A., editor. Synthetic DNA: Methods and Protocols, Methods in Molecular Biology. Springer; New York, NY: 2017. pp. 79–91. [DOI] [PubMed] [Google Scholar]
- Storz J.F. Current Opinion in Structural Biology, Carbohydrates Sequences and Topology. Vol. 50. 2018. Compensatory mutations and epistasis for protein function; pp. 18–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sumbalova L., Stourac J., Martinek T., Bednar D., Damborsky J. HotSpot wizard 3.0: web server for automated design of mutations and smart libraries based on sequence input information. Nucleic Acids Res. 2018;46:W356–W362. doi: 10.1093/nar/gky417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swainston N., Currin A., Green L., Breitling R., Day P.J., Kell D.B. CodonGenie: optimised ambiguous codon design tools. PeerJ Comput. Sci. 2017;3 doi: 10.7717/peerj-cs.120. [DOI] [Google Scholar]
- Tang L., Gao H., Zhu X., Wang X., Zhou M., Jiang R. Construction of “small-intelligent” focused mutagenesis libraries using well-designed combinatorial degenerate primers. BioTechniques. 2012;52:149–158. doi: 10.2144/000113820. [DOI] [PubMed] [Google Scholar]
- Taylor G.M., Mordaka P.M., Heap J.T. Start-stop assembly: a functionally scarless DNA assembly system optimized for metabolic engineering. Nucleic Acids Res. 2019;47:e17. doi: 10.1093/nar/gky1182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tian J., Gong H., Sheng N., Zhou X., Gulari E., Gao X., Church G. Accurate multiplex gene synthesis from programmable DNA microchips. Nature. 2004;432:1050–1054. doi: 10.1038/nature03151. [DOI] [PubMed] [Google Scholar]
- Torella J.P., Boehm C.R., Lienert F., Chen J.-H., Way J.C., Silver P.A. Rapid construction of insulated genetic circuits via synthetic sequence-guided isothermal assembly. Nucleic Acids Res. 2014;42:681–689. doi: 10.1093/nar/gkt860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tracewell C.A., Arnold F.H. Directed enzyme evolution: climbing fitness peaks one amino acid at a time. Curr. Opin. Chem. Biol. 2009;13:3–9. doi: 10.1016/j.cbpa.2009.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trubitsyna M., Michlewski G., Cai Y., Elfick A., French C.E. PaperClip: rapid multi-part DNA assembly from existing libraries. Nucleic Acids Res. 2014;42:e154. doi: 10.1093/nar/gku829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trubitsyna M., Liu C.-K., Salinas A., Elfick A., French C.E. PaperClip: A simple method for flexible multi-part DNA assembly. In: Hughes R.A., editor. Synthetic DNA: Methods and Protocols, Methods in Molecular Biology. Springer; New York, NY: 2017. pp. 111–128. [DOI] [PubMed] [Google Scholar]
- Van den Brulle J., Fischer M., Langmann T., Horn G., Waldmann T., Arnold S., Fuhrmann M., Schatz O., O’Connell T., O’Connell D., Auckenthaler A., Schwer H. A novel solid phase technology for high-throughput gene synthesis. BioTechniques. 2008;45:340–343. doi: 10.2144/000112953. [DOI] [PubMed] [Google Scholar]
- van Dolleweerd C.J., Kessans S.A., Van de Bittner K.C., Bustamante L.Y., Bundela R., Scott B., Nicholson M.J., Parker E.J. MIDAS: a modular DNA assembly system for synthetic biology. ACS Synth. Biol. 2018;7:1018–1029. doi: 10.1021/acssynbio.7b00363. [DOI] [PubMed] [Google Scholar]
- Vecchione S., Fritz G. CRIMoClo plasmids for modular assembly and orthogonal chromosomal integration of synthetic circuits in Escherichia coli. J. Biol. Eng. 2019;13:92. doi: 10.1186/s13036-019-0218-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wäneskog M., Bjerling P. Multi-fragment site-directed mutagenic overlap extension polymerase chain reaction as a competitive alternative to the enzymatic assembly method. Anal. Biochem. 2014;444:32–37. doi: 10.1016/j.ab.2013.09.021. [DOI] [PubMed] [Google Scholar]
- Wang X., Lin H., Zheng Y., Feng J., Yang Z., Tang L. MDC-analyzer-facilitated combinatorial strategy for improving the activity and stability of halohydrin dehalogenase from agrobacterium radiobacter AD1. J. Biotechnol. 2015;206:1–7. doi: 10.1016/j.jbiotec.2015.04.002. [DOI] [PubMed] [Google Scholar]
- Weber E., Engler C., Gruetzner R., Werner S., Marillonnet S. A modular cloning system for standardized assembly of multigene constructs. PLoS One. 2011;6 doi: 10.1371/journal.pone.0016765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei H., Hu J., Wang L., Xu F., Wang S. Rapid gene splicing and multi-sited mutagenesis by one-step overlap extension polymerase chain reaction. Anal. Biochem. 2012;429:76–78. doi: 10.1016/j.ab.2012.06.027. [DOI] [PubMed] [Google Scholar]
- Weinstein J., Khersonsky O., Fleishman S.J. Practically useful protein-design methods combining phylogenetic and atomistic calculations. Curr. Opin. Struct. Biol. 2020;63:58–64. doi: 10.1016/j.sbi.2020.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werner S., Engler C., Weber E., Gruetzner R., Marillonnet S. Fast track assembly of multigene constructs using Golden Gate cloning and the MoClo system. Bioengineered. 2012;3:38–43. doi: 10.4161/bbug.3.1.18223. [DOI] [PubMed] [Google Scholar]
- Wiedmann M., Wilson W.J., Czajka J., Luo J., Barany F., Batt C.A. Ligase chain reaction (LCR)-overview and applications. Genome Res. 1994;3:S51–S64. doi: 10.1101/gr.3.4.s51. [DOI] [PubMed] [Google Scholar]
- Williams E.M., Copp J.N., Ackerley D.F. Site-saturation mutagenesis by overlap extension PCR. In: Gillam E.M.J., Copp J.N., Ackerley D., editors. Directed Evolution Library Creation: Methods and Protocols, Methods in Molecular Biology. Springer; New York, NY: 2014. pp. 83–101. [DOI] [PubMed] [Google Scholar]
- Wong T.S., Roccatano D., Zacharias M., Schwaneberg U. A statistical analysis of random mutagenesis methods used for directed protein evolution. J. Mol. Biol. 2006;355:858–871. doi: 10.1016/j.jmb.2005.10.082. [DOI] [PubMed] [Google Scholar]
- Wong T.S., Wong T.S., Roccatano D., Schwaneberg U. Challenges of the genetic code for exploring sequence space in directed protein evolution. Biocatal Biotransformation. 2007;25:229–241. doi: 10.1080/10242420701444280. [DOI] [Google Scholar]
- Wu J., Du G., Chen J., Zhou J. Enhancing flavonoid production by systematically tuning the central metabolic pathways based on a CRISPR interference system in Escherichia coli. Sci. Rep. 2015;5:13477. doi: 10.1038/srep13477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu S.G., Shimizu K., Tang J.K.-H., Tang Y.J. Facilitate collaborations among synthetic biology, metabolic engineering and machine learning. ChemBioEng Rev. 2016;3:45–54. doi: 10.1002/cben.201500024. [DOI] [Google Scholar]
- Wu G., Yan Q., Jones J.A., Tang Y.J., Fong S.S., Koffas M.A.G. Metabolic burden: cornerstones in synthetic biology and metabolic engineering applications. Trends Biotechnol. 2016;34:652–664. doi: 10.1016/j.tibtech.2016.02.010. [DOI] [PubMed] [Google Scholar]
- Wu Y., Zhu R.-Y., Mitchell L.A., Ma L., Liu R., Zhao M., Jia B., Xu H., Li Y.-X., Yang Z.-M., Ma Y., Li X., Liu H., Liu D., Xiao W.-H., Zhou X., Li B.-Z., Yuan Y.-J., Boeke J.D. In vitro DNA SCRaMbLE. Nat. Commun. 2018;9:1935. doi: 10.1038/s41467-018-03743-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu M.-R., Jusiak B., Lu T.K. Engineering advanced cancer therapies with synthetic biology. Nat. Rev. Cancer. 2019;19:187–195. doi: 10.1038/s41568-019-0121-0. [DOI] [PubMed] [Google Scholar]
- Wu Z., Kan S.B.J., Lewis R.D., Wittmann B.J., Arnold F.H. Machine learning-assisted directed protein evolution with combinatorial libraries. PNAS. 2019:201901979. doi: 10.1073/pnas.1901979116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xia Y., Chu W., Qi Q., Xun L. New insights into the QuikChangeTM process guide the use of Phusion DNA polymerase for site-directed mutagenesis. Nucleic Acids Res. 2015;43:e12. doi: 10.1093/nar/gku1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xia Y., Li K., Li J., Wang T., Gu L., Xun L. T5 exonuclease-dependent assembly offers a low-cost method for efficient cloning and site-directed mutagenesis. Nucleic Acids Res. 2019;47:e15. doi: 10.1093/nar/gky1169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiao Y.-H., Pei Y. Asymmetric overlap extension PCR method for site-directed mutagenesis. In: Park D.J., editor. PCR Protocols, Methods in Molecular Biology. Humana Press; 2011. pp. 277–282. [DOI] [PubMed] [Google Scholar]
- Xu P., Rizzoni E.A., Sul S.-Y., Stephanopoulos G. Improving metabolic pathway efficiency by statistical model-based multivariate regulatory metabolic engineering. ACS Synth. Biol. 2017;6:148–158. doi: 10.1021/acssynbio.6b00187. [DOI] [PubMed] [Google Scholar]
- Yang J., Ruff A.J., Arlt M., Schwaneberg U. Casting epPCR (cepPCR): a simple random mutagenesis method to generate high quality mutant libraries. Biotechnol. Bioeng. 2017;114:1921–1927. doi: 10.1002/bit.26327. [DOI] [PubMed] [Google Scholar]
- Yehezkel T.B., Rival A., Raz O., Cohen R., Marx Z., Camara M., Dubern J.-F., Koch B., Heeb S., Krasnogor N., Delattre C., Shapiro E. Synthesis and cell-free cloning of DNA libraries using programmable microfluidics. Nucleic Acids Res. 2016;44:e35. doi: 10.1093/nar/gkv1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan Y., Andersen E., Zhao H. Flexible and Versatile strategy for the construction of large biochemical pathways. ACS Synth. Biol. 2016;5:46–52. doi: 10.1021/acssynbio.5b00117. [DOI] [PubMed] [Google Scholar]
- Zhao H., Arnold F.H. Optimization of DNA shuffling for high Fidelity recombination. Nucleic Acids Res. 1997;25:1307–1308. doi: 10.1093/nar/25.6.1307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou H., Vonk B., Roubos J.A., Bovenberg R.A.L., Voigt C.A. Algorithmic co-optimization of genetic constructs and growth conditions: application to 6-ACA, a potential nylon-6 precursor. Nucleic Acids Res. 2015;43:10560–10570. doi: 10.1093/nar/gkv1071. [DOI] [PMC free article] [PubMed] [Google Scholar]