Abstract
Precision genome editing accelerates the discovery of the genetic determinants of phenotype and the engineering of novel behaviors in organisms. Advances in DNA synthesis and recombineering have enabled high-throughput engineering of genetic circuits and biosynthetic pathways via directed mutagenesis of bacterial chromosomes. However, the highest recombination efficiencies have to date been reported in persistent mutator strains, which suffer from reduced genomic fidelity. The absence of inducible transcriptional regulators in these strains also prevents concurrent control of genome engineering tools and engineered functions. Here, we introduce a new recombineering platform strain, BioDesignER, which incorporates (i) a refactored λ-Red recombination system that reduces toxicity and accelerates multi-cycle recombination, (ii) genetic modifications that boost recombination efficiency, and (iii) four independent inducible regulators to control engineered functions. These modifications resulted in single-cycle recombineering efficiencies of up to 25% with a 7-fold increase in recombineering fidelity compared to the widely used recombineering strain EcNR2. To facilitate genome engineering in BioDesignER, we have curated eight context-neutral genomic loci, termed Safe Sites, for stable gene expression and consistent recombination efficiency. BioDesignER is a platform to develop and optimize engineered cellular functions and can serve as a model to implement comparable recombination and regulatory systems in other bacteria.
INTRODUCTION
The design-build-test (DBT) cycle is a common paradigm used in engineering disciplines. Within the context of synthetic biology, it is employed to engineer user-defined cellular functions for applications such as metabolic engineering, biosensing, and therapeutics (1,2). The rapid prototyping of engineered functions has been facilitated by advances in in vitro DNA assembly, and plasmids have traditionally been used to implement designs in vivo given their ease-of-assembly and portability. However, for deployment in contexts beyond the laboratory such as large-scale industrial bioprocesses or among complex microbial communities, plasmid-based circuits suffer from multiple limitations: high intercellular variation in gene expression, genetic instability from random partitioning of plasmids during cell division, and plasmid loss in environments for which antibiotic use could disrupt native microbial communities or is economically infeasible (3,4). These shortcomings can be ameliorated once a design is transferred from a plasmid to the host genome, which offers improved genetic stability and lower expression variation (5) along with reduced metabolic load (6). However, behaviors optimized for plasmid contexts often do not map predictably to the genome. As such, building and testing designs directly on the genome can reduce the DBT cycle time and facilitate engineering cellular programs for complex environments.
Expanding synthetic biology efforts to genome-scale engineering has historically been limited by factors such as low endogenous rates of recombination, lack of optimized workflows for recombination, and uncertainty due to locus-dependent expression variability (7,8). The advent of recombination-based genetic engineering (recombineering), which relies on homologous recombination proteins - often exo, bet, and gam from bacteriophage λ - in conjunction with linear donor DNA containing target homology and the desired mutations, has enabled genomic deletions, insertions, and point mutations at user-defined loci (9–13). Recombineering has enabled generation of genomic discovery resources such as the Escherichia coli K-12 in-frame, single-gene deletion collection of non-essential genes (Keio collection) (14) and technologies such as trackable multiplex recombineering (TRMR), which enables genome-scale mapping of genetic modifications to traits of interest (15,16). In addition, pooled library recombineering approaches such as CRISPR-enabled trackable genome engineering (CREATE) have combined CRISPR-Cas9 gene editing schemes with barcode tracking to enable high-throughput mutational profiling at single-nucleotide resolution on a genome-wide scale (17).
Meanwhile, techniques such as multiplex automated genome engineering (MAGE) have been developed to generate complex mutagenesis libraries by extending recombineering to simultaneously modify multiple genetic loci through iterative cycles of single-strand DNA (ssDNA) oligonucleotide recombination (18). MAGE has enabled several genome-scale recombineering efforts such as the recoding of all 321 occurrences of TAG stop codons with synonymous TAA codons in a single E. coli strain (19,20), the removal of all instances of 13 rare codons from 42 highly expressed essential genes to study genome design constraints (21), the insertion of multiple T7 promoters across 12 genomic operons to optimize metabolite production (22), and the His-tagging of 38 essential genes that encode the entire translation machinery over 110 MAGE cycles for subsequent in vitro enzyme studies (23). In addition, methods such as tracking combinatorial engineered libraries (TRACE) have been developed to facilitate the rapid, high-throughput mapping of multiplex engineered modifications from such genomic explorations to phenotypes of interest (24,25).
To achieve the high levels of recombination necessary to carry out large-scale, multiplexed genome editing, many of these studies required the use of mutagenic strains. Specifically, the endogenous methyl-directed mismatch repair (MMR) system, which acts to revert newly made recombineering modifications when active, was removed to more effectively retain targeted modifications in the standard MAGE strain EcNR2. While deactivation of MMR dramatically enhances recombination efficiency, it also increases the rate of background mutagenesis by 100–1000 fold (26,27). Indeed, in converting all 321 occurrences of TAG stop codons to TAA stop codons, Lajoie et al. noted the addition of 355 unintended (i.e. off-target) mutations after the final strain construction (20).
Several approaches have been proposed to circumvent the use of MMR-deficient strains and thus avoid their high basal rates of off-target mutagenesis. Designs utilizing mismatches that are poorly repaired or that introduce silent mismatches near the desired mutation can be used to evade MMR, which only recognizes short mismatches (28). Furthermore, oligos containing chemically modified bases can be used to evade MMR correction and increase allelic-replacement efficiency (29). While these approaches boost recombination rates without increasing basal mutagenesis rates, they either limit the range of mutations that can be implemented or significantly increase oligonucleotide costs.
More recent efforts have focused on approaches to create a transient mutagenesis state. Specifically, cells are cycled between phases of elevated mutation rates, during which editing can take place efficiently, and phases of wild type-like mutation rates, during which cells can be propagated without incurring a significant number of background mutations. Nyerges et al. reported the use of a temperature-controlled mismatch repair deficient strain (E. coli tMMR) in which the MMR machinery can be transiently inactivated by shifting cells to a non-permissive temperature (36°C) during oligonucleotide incorporation and cell recovery and then reactivated by returning cells to the permissive temperature (32°C) for propagation (30). This approach reduced the number of off-target mutations by 85%. In another work, Nyerges et al. developed pORTMAGE, a genome editing workflow that uses a dominant-negative mutL allele cloned on a broad-host range vector to transiently inactivate host MMR response using a similar temperature-dependent protocol (31). The pORTMAGE system demonstrated high on-target mutation rates coupled with no reported off-target mutations. The plasmid-based nature of the system enabled transfer to other bacteria such as Salmonella enterica and Citrobacter freundii. While both of these approaches reduce the off-target mutation rate, they restrict cell growth to 30–32°C and hence increase the time between recombineering cycles. In contrast, during preparation of this manuscript Bubnov et al. reported the construction of a plasmid system with a dominant-negative mutS allele and a novel conditionally replicating origin that enabled plasmid propagation and genome editing at higher temperatures (e.g. 37°C) (32). Finally, Lennen et al. developed a plasmid-based MAGE system, Transient Mutator Multiplex Automated Genome Engineering (TM-MAGE). In TM-MAGE, E. coli Dam methylase is inducibly overexpressed to transiently limit MMR and thus enable high allelic replacement efficiencies with a 12–33 fold lower off-target mutation rate than strains with fully disabled MMR (33).
To date, recombineering has advanced genome engineering in what can be categorized as two thematic research areas: developing genetic tools for model microbial hosts and multi-stage genome editing for metabolic engineering and synthetic biology. These research areas have divergent requirements. In the former case, the portability of the recombineering vector is key to easily generate mutations across bacterial hosts and allow removal of the recombineering cassette following mutagenesis. Recombineering plasmids function as effective and convenient vectors for a portable genome engineering solution. In the latter case, a chromosomally integrated recombineering cassette has proven effective for multi-cycle recombineering up to dozens of rounds to simultaneously target many genomic loci related to an engineered function. Unfortunately, there is no chromosomally integrated recombineering system reported to date that achieves high-efficiency ssDNA recombination without the side-effect of a significantly elevated global mutagenesis rate. Hence, researchers still face a trade-off between genome editing efficiency and genome stability.
Here we present a rational genome engineering approach to develop a high-fidelity recombineering platform strain, called BioDesignER, with enhanced recombineering efficiency, low off-target mutagenesis rates, and short editing cycle times. We refactored the λ-Red machinery in E. coli K-12 MG1655-derived EcNR1 to decrease cycle time and reduce toxicity, stacked genetic modifications shown to increase recombination rates, and characterized gene expression across the chromosome at curated integration loci, herein referred to as Safe Sites. We also introduced genomic modifications to independently control four transcriptional regulators of gene expression and characterized the induction regime for each regulator. We profiled the growth and ssDNA recombination rates of BioDesignER with a dual-fluorescent reporter cassette integrated at each Safe Site and also demonstrated the retention of double-stranded DNA (dsDNA) recombination capabilities in the strain. We performed a comparative study of background mutagenesis rates of our strain and alternative platform strains using a fluorescent reporter-based fluctuation assay and found that BioDesignER exhibited a 4.2-fold lower mutagenesis rate compared to the widely used recombineering strain EcNR2. Finally, we compared the multi-cycle accumulation of targeted mutations for BioDesignER and other high-efficiency recombineering strains and found that BioDesignER exhibited similar multiplex editing efficiencies to EcNR2.nuc5-, a persistent mutator strain with the highest reported ssDNA recombination efficiency. BioDesignER is a high-fidelity genome engineering strain that uniquely enables high-efficiency recombineering while retaining a low basal mutagenesis rate.
MATERIALS AND METHODS
Chemicals, reagents and media
LB Lennox Medium (10 g/l Tryptone, 5 g/l Yeast Extract, 5 g/l NaCl; Sigma-Aldrich, USA) was used to culture strains for experiments, to prepare electrocompetent cells for recombineering, and as recovery broth following electroporation. Antibiotics concentrations used were 34 μg/ml for chloramphenicol, 100 μg/ml for carbenicillin, and 50 μg/ml for kanamycin. Anhydrotetracycline (aTc) (CAS 13803-65-1; Sigma-Aldrich, USA) was used at 100 ng/ml to induce the λ-Red genes for recombineering. For thyA-mediated recombineering steps, M9 minimal media supplemented with 0.4% glucose, 0.2% casamino acids, thymine (100 μg/ml), and trimethoprim (50 μg/ml) was used. M9 minimal media with valine (20 μg/ml) was used to select for the ilvG+ genotype. All M9 minimal media was supplemented with biotin at 10 μg/ml to account for the biotin auxotrophy common to all EcNR1-derivative strains.
Oligonucleotides
Oligos were ordered from Integrated DNA Technologies, resuspended in 1× TE buffer at either 500 uM (recombineering oligos) or 100 uM (standard amplification oligos), and stored at −20°C. For recombineering workflows, oligos were designed to target the lagging strand of DNA replication and contain at least 35 bp of homology to the target locus. Oligos for testing recombination efficiency were ordered with 5’ phosphorothioate base modifications. Oligo sequences for individual BioDesignER lineage construction steps are available in linked Benchling files from Supplementary Table S1.
Strains
A key set of strains used in this work is listed in Table 1. Supplementary Table S2 provides relevant genotypes of all intermediate strains in the lineage from EcNR1 to BioDesignER and lists the identification numbers for strains used to quantify recombineering and mutagenesis rates. The BioDesignER lineage table includes descriptions of genetic modifications, associated recombineering, selection and enrichment methods, and a web link to sequence-level detail for each modification. Supplementary Table S3 provides an abbreviated summary of strain identification numbers and genotypes for the BioDesignER lineage.
Table 1.
Strain | Lab ID | Parent strain | Genetic modifications | Reference |
---|---|---|---|---|
EcNR1 | RE002 | MG1655 | λ-Red(ampR)::bioA/bioB | Wang et al. 2009 |
EcNR2 | ET046 | EcNR1 | cmR::mutS | Wang et al. 2009 |
EcNR2.nuc5- | ET003 | EcNR2 | dnaG.Q576A ΔrecJ ΔxonA ΔxseA ΔexoX Δred-α | Mosberg et al. 2012 |
pTet-λ | RE574 | MG1655 | pTet2-gam-bet-exo/tetR/ampR::bioA/B ilvG+ | This study |
damOE | RE824 | MG1655 | pTet2-gam-bet-exo-dam/tetR/ampR::bioA/B ilvG+ | This study |
dnaG.Q | RE626 | pTet-λ | dnaG.Q576A | This study |
exo1 | RE628 | pTet-λ | dnaG.Q576A ΔrecJ | This study |
exo2 | HR146 | pTet-λ | dnaG.Q576A ΔrecJ ΔxonA | This study |
BioDesignER | RE630 | damOE | dnaG.Q576A lacIQ1 Pcp8-araE ΔaraBAD pConst-araC ΔrecJ ΔxonA | This study |
Growth rate measurements
Two clones of each strain were cultured overnight in LB Lennox (LB) medium with chloramphenicol. The following morning each strain culture was back-diluted 1:100 into two media types: (i) LB with aTc (LB+aTc) (ii) LB. The resulting inocula were divided into four technical replicates and then grown for up to 18 h in a Biotek Synergy 2 microplate reader. The growth rate at early exponential phase was calculated from the resulting optical density data using custom analysis scripts in Python.
Competent cell preparation and recombineering
Strains were grown overnight in LB Lennox medium (LB) with antibiotics as appropriate at 37°C. The following morning each strain culture was back-diluted 1:100 into 25 ml LB+aTc and grown at 37°C until they reached OD600 0.3–0.4. The resulting mid-log cultures were chilled in a 4°C ice-water bath. Cultures were centrifuged (Beckman-Coulter Allegra 25R) at 8000 × g and subjected to two washes: (i) 25 ml chilled water (ii) 15 ml chilled 10% glycerol. The cell pellets after the final glycerol wash were resuspended in 10% glycerol, yielding ∼500 ul of competent cells given the residual cell mass from the wash.
Due to their different induction and growth requirements, EcNR2 and EcNR2.nuc5- strains were grown overnight at 30°C, back-diluted 1:100 into 25 ml LB+chlor media, and cultured at 30°C until they reached OD600 0.3–0.6. The λ-Red machinery was induced by incubating the cultures in a 42°C water bath for 15 min after which the strains were chilled in a 4°C ice-water bath for at least 10 min. The remainder of the preparation for EcNR2 and EcNR2.nuc5- follows the same aforementioned wash steps.
A total of 40 ul of competent cells were used for each recombineering reaction. Oligos were diluted to 50 μM concentration in 10% glycerol and 10 μl of the diluted oligo was added to the competent cell mixture. For water control reactions, 10 μl of water was added. For multiplexed reactions, 10 μl of a cocktail with a total oligo concentration of 50 μM was used. The resulting cell-oligo mix was transferred to a chilled cuvette (1 mm gap, VWR) and electroporated using a BTX™-Harvard Apparatus ECM™ 630 Exponential Decay Wave Electroporator with the following parameters: voltage (1800 V), resistance (200 Ω), and capacitance (25 μF).
Fluorescence-coupled scar-free selection/counter-selection
Working from the ΔthyA strain RE095 and derivative strains (Supplementary Table S2), a dsDNA thyA cassette with or without a fluorescence gene (Supplementary Figure S1) was amplified with 35–50 bp flanking homology to a target genomic locus and integrated via standard recombineering as described above, with the exception that cells were made competent by growing in LB supplemented with thymine (100 μg/ml) and trimethoprim (50 μg/ml). Integrants of thyA were selected for on LB media. Colonies with fluorescence-coupled thyA cassettes were screened visually for fluorescent phenotypes on a blue-light transilluminator. Proper insertion of the cassette was confirmed by locus-specific colony polymerase chain reaction (PCR). Replacement of the thyA cassette was performed through recombineering with a ssDNA or dsDNA cassette as described in Supplementary Table S2 and selected for on M9 agar plates supplemented with thymine (100 μg/ml), trimethoprim (50 μg/ml), and casamino acids (0.2%). Removal of fluorescence-coupled thyA cassettes was screened visually for non-fluorescent colonies via blue-light transillumination and sequences were validated via colony PCR and Sanger sequencing.
Recombineering efficiency and Safe Site expression measurements
Competent cells were transformed with water (control) or oligos to turn off sfGFP, mKate2, or both reporters. Following electroporation, cells were resuspended in 3 ml LB+carb. These cultures were mixed and 30 μl was transferred into an additional 3 ml LB+carb for overnight growth at 37°C (or 30°C for EcNR2 and EcNR2.nuc5-). The following morning, saturated cultures of each transformation were diluted 1:200 into phosphate-buffered saline (PBS) solution and run on a Sony SH800 cell sorter for single-cell flow cytometry analysis. At least 50,000 events were recorded for each reaction, and the fractional abundance of each reporter phenotype (GFP+ RFP+, GFP- RFP+, GFP+ RFP-, GFP- RFP-) in the population was measured. The threshold for each reporter phenotype was determined via a prior calibration in which gates for each fluorescent reporter were measured. For measurement of gene expression across Safe Sites, overnight outgrowths of control reactions from the Safe Site recombineering efficiency transformations were processed on the flow cytometer.
Response curves of inducible regulators
BioDesignER was transformed with plasmids containing a GFP gene regulated by each transcription factor - AraC (pBAD), CymR (pCym), or LacI (pLac) - and controlled by one of two replication origins, p15A or pSC101. Plasmids with the p15A origin contain a kanR marker and plasmids with the pSC101 origin use a cmR marker. Plasmid sequences are available via Benchling (https://benchling.com/organizations/arkinlab). Individual colonies were inoculated in LB with an appropriate antibiotic to maintain the plasmid and grown overnight. Saturated cultures were diluted 200-fold into a microtiter plate (Corning 3904) and grown at 37°C with shaking in a Biotek Synergy 2 plate reader. Kinetic growth and fluorescence measurements were taken every 5 or 10 min for 12 h. Absorbance was measured at 600 nm. GFP fluorescence was measured using 485/20 and 520/15 nm filter cubes for excitation and emission, respectively. mKate fluorescence was measured using 560/20 and 615/30 nm filter cubes for excitation and emission, respectively. Fluorescence values measured nearest OD 0.5 were used to estimate absorbance-normalized fluorescence in each channel.
Flow cytometry analysis of inducible regulators
Saturated cultures from the kinetic growth assays used to measure regulator inducer responses were diluted 400-fold into PBS and analyzed in a BD LSR Fortessa flow cytometer (488 nm excitation / 525/50 nm emission for GFP; 561 nm excitation / 670/30 nm emission for mKate) using an autosampler. Raw .fcs files were imported for pre-processing and subsequent analysis with custom Python scripts (see Supplementary Data D1 for example) using the FlowCytometryTools software package (https://github.com/eyurtsev/FlowCytometryTools). For each sample, 50,000 events were captured and outliers in forward scatter and side scatter were removed using a filter with cut-offs for events outside the second and third quartile.
Safe Site expression analysis
Data for expression levels at each Safe Site were gathered from the flow cytometry data used to measure recombineering efficiency at each Safe Site. Data files were extracted for four recombineering conditions (no oligo control, GFP-off, mKate-off, and dual-off) and two biological replicates. The geometric mean for each fluorescence channel was calculated from filtered data. Specifically, events outside the second and third quartiles for forward and side scatter channels were removed from analysis for each .fcs data file. The dual-fluorescent subpopulation for each measurement was extracted by gating at a value that excluded non-fluorescent subpopulations but did not truncate the distribution of the dual-fluorescent subpopulation.
Fluctuation assay
Fluctuation tests were performed on an inactivated cmR-mNeon translational fusion cassette integrated at Safe Site 1. The cassette was inserted using selection on chloramphenicol and subsequently inactivated for the strains listed in Supplementary Table S2. The cmR-mNeon cassette was first integrated as dsDNA into the respective strains via recombineering and selected for by plating on LB agar supplemented with 34 μg/ml chloramphenicol. A premature stop codon (AAA to TAA) was generated on the cassette via ssDNA recombineering with an oligo containing the stop codon mutation. The non-fluorescent population was enriched using cell sorting (Sony SH800) and non-fluorescent colonies were isolated on LB agar plates.
Prior to fluctuation tests, individual non-fluorescent colonies were grown at 30°C in LB+carb and stored at −80°C as glycerol stocks normalized to OD600 of 0.5. For the fluctuation tests, cultures were diluted 1000-fold and grown for 16 h in permissive conditions of LB+carb at 30°C (N = 24). For pTet-λ ΔmutS, EcNR2 and EcNR2.nuc5-, 20 μl of culture was spotted onto LB agar plates supplemented with chloramphenicol and carbenicillin. For all other strains, 100 μl volume spots were used. Viability counts were estimated for all strains by serial dilutions of six independent cultures on LB agar plates supplemented with carbenicillin. Chloramphenicol-resistant mutants were counted and mutation rates were inferred by the MSS-MLE method (34,35).
Iterative recombineering cycling
Strains were prepared for transformation using the competent cell protocol described above using 25 ml of culture with a target OD600 of 0.3. Each culture was resuspended in ∼500 μl of 10% glycerol after washes. Each transformation consisted of 40 μl competent cells mixed with 10 μl of 50 μM oligo mix. After transformation, cells were recovered in 3 ml LB supplemented with carbenicillin. The recovery culture was grown to saturation before beginning the next round of competent cell prep and recombination. In parallel, the recovery culture was diluted 1:60 into an additional 3 ml of LB supplemented with carbenicillin and grown to saturation prior to measurements using flow cytometry (Sony SH800).
RESULTS
Rational strain design
We introduced multiple targeted modifications to an MG1655-derivative strain to decrease recombination cycle time, reduce toxicity of the recombination machinery, and introduce a transient hypermutation phenotype via hypermethylation (Figure 1A and B; Table 1). Using EcNR1 (18) as the host, we refactored the λ-Red recombination machinery, which consists of the genes exo, bet, and gam, and serves as the basis for mediating homology-directed recombination of ssDNA and dsDNA products. To reduce recombineering cycle times, we replaced the temperature-inducible regulation of the λ-Red locus with a TetR-regulated design (Figure 1C). This allowed us to propagate cells at 37°C instead of 30–32°C during all phases of a recombineering workflow: competent cell prep, λ induction, cell recovery, and selection. We also minimized the λ prophage by deleting the λ-kil gene, which has been reported to be responsible for the cell death phenotype observed under λ-Red expression (36), and other dispensable phage genes. Finally, we introduced DNA adenine methyltransferase (dam) to the λ-Red operon of our strain. Co-induction of dam with the λ-Red recombination genes results in transient hypermutation via hypermethylation, which has been reported to enable incorporated mutations to evade MMR (33).
To remove a valine-sensitive growth defect present in E. coli K-12, we restored expression of ilvG. K-12 contains three acetohydroxy acid synthases (ilvB, ilvG, and ilvH) that are involved in branch-chained amino acid biosynthesis. K-12 does not express ilvG due to a natural frameshift mutation and thus exhibits a growth defect in the presence of exogenous valine and the absence of isoleucine (37,38). This valine-sensitive growth phenotype is alleviated by restoration of ilvG (39). Using oligo-mediated recombination (Materials and Methods section), we removed the frameshift mutation in the endogenous ilvG gene, which has been reported to enable faster growth in minimal media. We called this strain pTet-λ.
We next incorporated genomic modifications shown to improve recombination efficiency. Using a scar-free genome engineering workflow that utilizes a novel thyA selection/counter-selection cassette containing a fluorescent marker (Supplementary Figure S1, Materials and Methods section), we iteratively generated multiple beneficial mutations. For example, genetic variants of DNA primase (dnaG) enhance recombination efficiency by increasing the length of Okazaki fragments, thus exposing longer stretches of the lagging strand of the replication fork to ssDNA recombination (40). We incorporated into our strain the dnaG.Q576A variant, which was shown to boost recombination efficiency more than other dnaG mutants in EcNR2.
Endogenous nucleases can degrade exogenous DNA used in recombineering workflows. The removal of a set of five nuclease genes (endA, exoX, recJ, xonA, and xseA) has been shown to improve ssDNA recombination efficiency (41). However, while this exonuclease knockout strain, EcNR2.nuc5-, exhibited increased recombination efficiency, it also resulted in a lower post-electroporation growth rate compared to EcNR2. This suggested that deletion of the entire set of nucleases introduces to the strain an undesirable physiological defect. To avoid such growth defects, which are compounded for workflows requiring multiple recombineering cycles, we looked to systematically combine exonuclease knockouts that distinctly improve recombination rates. We constructed individual knockouts of each of the five exonucleases in the pTet-λ dnaG.Q576A strain and measured the recombineering efficiency of the resulting strains. We assayed recombination efficiency for each exonuclease knockout using oligo mediated recombination at a genomically-encoded sfGFP reporter. In this assay, recombination of an oligo designed to introduce a premature stop codon into sfGFP results in a loss of fluorescence that can be quantified using flow cytometry. Deletions of xonA (4.5 ± 0.3%) (mean ± 1 standard deviation) and recJ (2.7 ± 0.1%) showed the greatest efficiencies, while the remaining exonuclease deletions yielded nominal efficiencies (<2%) (Supplementary Table S4). Based on these results, we deleted only two of the five exonucleases (recJ, xonA) in the next step of strain construction. While deletion of the λ-Red exonuclease (exo) can also promote stability of exogenous ssDNA (41), we opted to retain it due to its role in dsDNA recombination. The culmination of these genetic modifications in addition to the inducible regulator modifications described below resulted in the BioDesignER strain.
To assess the effect of BioDesignER modifications on strain fitness, we measured the growth rates of key strains in the modification lineage in LB rich media (Figure 2A). We noted that, in general, doubling times decreased as additional modifications were made. Additionally, in contrast to cell death reported for extended co-expression of λ-kil with the recombination machinery (36), we observed only a slight increase in doubling time when expressing the refactored λ-Red cassette.
BioDesignER recombineering enhancements
ssDNA recombination enhancements
To quantify recombineering enhancements of key BioDesignER modifications, we measured ssDNA recombination rates for several strain intermediates. We integrated a dual fluorescent reporter cassette expressing both sfGFP and mKate2 at a common genomic locus for each strain of the lineage and quantified ssDNA recombination efficiency. For each strain we transformed an oligo to inactivate sfGFP via incorporation of a premature stop codon. We also performed a control reaction in each case using water in place of oligo. After recovery and outgrowth, we measured the fluorescence profiles of each strain using flow cytometry (Figure 2B). We observed increases in recombination efficiency at each modification stage with single cycle conversion rates improving from 1.6 ± 0.1% in pTet-λ to 25.4 ± 1.0% in BioDesignER.
To investigate the efficacy of mismatch repair evasion on recombination efficiency, we compared BioDesignER against pTet-λ derivative strains containing mismatch repair modifications and against two standard ΔmutS recombineering variants, EcNR2 and EcNR2.nuc5-. BioDesignER (25.4 ± 1.0%) exhibits much higher recombination efficiency than pTet-λ with dam over-expression (damOE, 6.91 ± 0.19%) or ΔmutS (12.9 ± 1.7%) as hypermutagenesis strategies (Figure 2C, left panel). Performing the same recombineering experiments at 30°C and comparing to EcNR2 and EcNR2.nuc5-, which are constrained to growth at 30°C, we found that BioDesignER (13.6 ± 1.2%) exhibited recombination rates comparable to EcNR2 (14.5 ± 2.1%), yet approximately 3-fold lower than EcNR2.nuc5- (37.7 ± 3.8%) (Figure 2C, right panel). We were surprised to find that the recombineering efficiency of BioDesignER decreased by nearly 2-fold when grown at a lower temperature.
dsDNA recombination enhancements
Knocking out endogenous exonucleases has been reported to significantly reduce or abolish dsDNA recombination efficiency (41). We measured the efficiency of dsDNA recombination in pTet-λ and BioDesignER and found no significant reduction in recombination efficiency (Figure 2D). This result suggests that λ-Exo is sufficient to process dsDNA recombination templates in the absence of RecJ and ExoI ssDNA exonucleases. A previous study reported that dsDNA recombination is at least an order of magnitude less efficient in a four-nuclease deficient genotype (ΔexoX, ΔrecJ, ΔxseA, and ΔxonA) with abolished dsDNA recombination activity in a three-nuclease (ΔrecJ, ΔxseA and ΔxonA) knockout (41). We note here that we were successful in generating dsDNA recombinants in EcNR2.nuc5- at a similar efficiency to EcNR2 with no alteration to the recombineering protocol, suggesting that another nuclease is aiding dsDNA recombination in E. coli or that recombination can occur through an exonuclease-independent mechanism.
Control of multiple independent regulators
BiodesignER expresses transcriptional regulators that utilize four independent small-molecule inducers to allow multi-input control of synthetic circuits, biosynthetic pathways, or gene editing tools. The strain produces the repressors TetR, LacI, and CymR as well as the activator AraC. TetR is expressed from the λ prophage element native to EcNR1. We incorporated the transcriptional over-expression allele lacIQ1 to boost LacI production, which allows efficient regulation of multi-copy plasmids (42). We also introduced the tight and titratable regulator CymR (43), which is inactivated by the small molecule cumate. To improve gene regulation by arabinose, we replaced the arabinose-sensitive promoter of the araE transporter gene with a constitutive promoter to eliminate all-or-none expression and allow titratable induction (44). In conjunction with this modification, we introduced a constitutive promoter to drive expression of AraC and deleted the araBAD operon to eliminate arabinose degradation via catabolism.
To characterize the induction profiles of each regulator, we quantified the fluorescence levels and growth rates of cells transformed with multi-copy plasmids. We constructed a set of GFP expression plasmids with promoters responsive to each regulator (Figure 2E, see Supplementary Figure S2 for sequence-level details) and transformed each plasmid into BioDesignER. Gene expression profiles were characterized by measuring single-cell fluorescence and bulk growth and fluorescence.
Fold-change induction for each regulator increased with plasmid copy number while no leaky expression was observed for low-copy and medium-copy plasmids. For plasmids with the low-copy replication origin pSC101, we observed mean fold-change induction levels of 107, 68, and 20 for arabinose, cumate, and isopropyl β-D-1-thiogalactopyranoside (IPTG), respectively. For plasmids with the medium-copy replication origin p15A, we observed mean fold-change induction levels of 146, 184, and 30 for arabinose, cumate, and IPTG, respectively. In both copy-number contexts, GFP expression with no inducer was indistinguishable from a control plasmid lacking gfp. We found that repressor levels were insufficient to fully repress GFP expression on plasmids with the ColE1 replication origin. We note that AraC-regulated GFP expression saturates near 33 μM (5 μg/ml, 0.0005%) arabinose, a much lower saturation point than common plasmid-based systems (0.1% arabinose).
Single-cell fluorescence distributions observed through flow cytometry revealed unimodal distributions of GFP expression for nearly all induction conditions (Figure 2E). GFP expression from both cumate- and IPTG-responsive promoters produced monotonic, decreasing coefficient of variation noise profiles for increasing inducer levels (Supplementary Figure S3). For arabinose induction, despite introducing modifications consistent with Khlebnikov et al., we observed significant cell–cell variability at two intermediate arabinose levels. Specifically, we observed a maximum coefficient of variation at 3.3 μM (Supplementary Figure S3), seen in Figure 2E as a broad, weakly bimodal fluorescence distribution.
Characterization of genomic integration Safe Sites
Genome integration Safe Sites
To aid identifying genomic loci that provide reliable gene expression and recombination efficiency for future engineering efforts, we characterized a curated list of integration loci across the E. coli K-12 genome. The resulting eight genomic loci, termed Safe Sites, were chosen based on several criteria to minimize disruption to local chromosomal context upon integration of synthetic DNA constructs (Figure 3A and Supplementary Figure S4). Specifically, the integration Safe Sites are intergenic regions located between two convergently transcribed, non-essential genes that do not exhibit any phenotypes or growth defects across the majority of biochemical conditions screened in previous high-throughput studies (14,45) (Supplementary Table S5), and contain no annotated features (small RNAs, promoters, transcription factor-binding sites) according to RegulonDB (46) (Supplementary Table S6).
To characterize gene expression variation across the chromosome, we measured the expression of dual-fluorescent reporters (sfGFP and mKate2) integrated into BioDesignER at each Safe Site. We observed a linear decrease in expression for both sfGFP (pearson rsfGFP,arm1 = −0.91, pearson rsfGFP,arm2 = −0.65, psfGFP < 0.05, permutation test) and mKate2 (pearson rmKate,arm1 = −0.85, pearsonrmKate,arm2 = −0.51, pmKate < 0.05, permutation test) reporters with respect to distance from the chromosomal origin (Figure 3B). This result was consistent with expected variations in local chromosomal copy number due to bi-directional replication dynamics during growth (47–49). Interestingly, we observed a much stronger correlation of expression to distance from replication origin for chromosome Arm 1, though mKate2 expression at Safe Site 8 was a low outlier. We also assessed the effect of integration at each Safe Site on cellular fitness by measuring growth rates for each integration strain. We observed that, in general, genomic integration and expression from each Safe Site did not reduce growth rate, though Safe Site 8 displayed a nominal decrease when grown under aTc induction (Supplementary Figure S5). The two unexpected results at Safe Site 8 suggest that it may not be a reliable locus for integration and expression.
Recombination rates across Safe Sites
Changes in local chromosomal structure may lead to unexpected fluctuations in recombination efficiency at various locations across the genome. To characterize recombination efficiency as a function of chromosomal locus for BioDesignER, we performed three independent ssDNA oligo-mediated recombination reactions for the panel of eight Safe Site strains. For each strain, we independently transformed (i) an oligo to inactivate sfGFP, (ii) an oligo to inactivate mKate2, or (iii) an oligo cocktail to inactivate both reporters. We also performed a control reaction in each case using water in place of oligo. For Safe Sites that lie on opposite sides of the replication fork, we designed appropriate oligos to ensure recombination targeting the lagging strand. We found that recombination rates were consistently high across the chromosome with Safe Sites displaying, on average, single cycle, single site conversion rates of 17.0 ± 6.7% and 19.7 ± 5.7% for sfGFP and mKate2, respectively (Figure 3C). We also report single cycle, multiplex conversion rates (averaged across Safe Sites) of 7.5 ± 4.4% for the sfGFP, 7.9 ± 2.9% for the mKate2, and 6.3 ± 2.3% for both reporters when transformed with the dual oligo cocktail.
Analysis of transient hypermethylation effects on mutagenesis
To investigate the effect of BioDesignER modifications on global mutation rate, we developed a mutagenesis detection assay that focuses on a single codon. The mutagenesis cassette utilizes chloramphenicol acetyltransferase (cat) gene translationally fused to green fluorescence gene mNeon (Figure 4A). This strategy should allow estimation of mutation rate without mutant fitness biases and second-site suppressor mutations observed in traditional fluctuation analyses such as rifampicin resistance (50). Following integration of the mutagenesis cassette at Safe Site 1, we introduced a TAA stop codon (ochre) at Lys19 of cat via a single nucleotide mutation. In principle, only mutations or suppression of the ochre codon generates chloramphenicol resistant, green fluorescent colonies. Across several fluctuation tests, we sequenced 54 chloramphenicol-resistant clones from distinct mutational events. We observed eight unique genotypes at the ochre codon arising from spontaneous mutations (Supplementary Figure S6). The absence of ochre codons suggests a minimal contribution of ochre suppressor mutants in the fluctuation test.
Using this assay, we benchmarked mutation rates of BioDesignER against (i) strains in the BioDesignER construction lineage, (ii) EcNR2 (reference), and (iii) MMR-deficient (control) strains pTet-λ ΔmutS and damOE. All assayed strains utilized the inactivated cat-mNeon cassette at Safe Site 1 as shown in Figure 4A. To allow EcNR2 to be compatible with the cat-mNeon fluctuation assay, we replaced the cmR selection cassette native to EcNR2 with kanR. Under comparable growth conditions, we estimated mutation rates of 3.36 × 10−9 (95% confidence interval (CI): 2.22–4.66 × 10−9), 4.55 × 10−9 (CI: 3.10–6.19 × 10−9), and 6.54 × 10−9 (CI: 4.77–8.51 × 10−9) mutations/cell/generation for pTet-λ, damOE, and BioDesignER, respectively. By comparison, we observed mutation rates of 2.98 × 10−8 (CI: 2.13–3.93 × 10−8) for the control pTet-λ ΔmutS, which was similar to the rate of 2.73 × 10−8 (CI: 1.79–3.81 × 10−8) observed for EcNR2. For all strains assayed, all chloramphenicol-resistant colonies were also fluorescent.
To investigate the effect of λ-Red induction on global mutation rates and compare the mutagenic effect of dam over-expression to deletion of mutS, we tested the mutation rates for pTet-λ, damOE, and pTet-λ ΔmutS both with and without aTc induction (Supplementary Figure S6). We found no effect on global mutation rates due to aTc induction (i.e. expression of the λ-Red machinery) in pTet-λ and pTet-λ ΔmutS. Consistent with prior work (33), we observed an increase in mutation rate for damOE under aTc induction - specifically, 2.4-fold in this work. Finally, we noted that even with aTc induction damOE was still less mutagenic than pTet-λ ΔmutS, suggesting that BioDesignER uniquely strikes a balance between on-target and off-target mutagenesis rates.
To quantify this balance, we compared the recombination and mutagenesis rates for a selection of control strains and BioDesignER (Figure 4B). The resulting trade-off space can be divided into two regimes where strains falling in the shaded region exhibit a favorable trade-off between recombination rate and mutation rate. BioDesignER falls in the favorable subspace, while MMR-deficient strains such as EcNR2 and pTet-λ ΔmutS fall in the unfavorable regime above the tradeoff line. To summarize this result, we introduce the metric recombineering fidelity, which we define as the product of fold-increase in recombination rate and fold-decrease in mutagenesis rate, each relative to EcNR2. Using this metric, we calculate that BioDesignER exhibits 7.3-fold greater recombineering fidelity than EcNR2 (1.75-fold improvement in recombination rate and 4.17-fold decrease in mutagenesis rate) (Table 2).
Table 2.
Strain | Recombination efficiency (%) | Mutation rate (mutations/cell/generation) | Recombineering fidelity | Temperature (°C) |
---|---|---|---|---|
pTet-λ | 1.6 ± 0.1 | 3.36 × 10−9 | 0.9 | 37 |
EcNR2 | 14.5 ± 2.1 | 2.73 × 10−8 | 1.0 | 30 |
BioDesignER | 25.4 ± 1.0 | 6.54 × 10−9 | 7.3 | 37 |
Multi-cycle recombineering rate enhancements
High single-cycle editing efficiency enables the rapid generation of genotypically diverse populations using multiplexed, cyclical recombineering workflows. To assess how well BioDesignER could generate a population with multiplex edits, we transformed a starting population with an oligo cocktail targeting multiple sites and tracked phenotypic diversity as a function of recombineering cycle for multiple strains. Specifically, we transformed BioDesignER harboring the sfGFP-mKate2 fluorescence cassette with oligos to inactivate both reporters over four sequential recombineering cycles. In parallel, we compared BioDesignER to pTet-λ (Figure 5A), EcNR2, and EcNR2.nuc5- (Supplementary Figure S7) transformed with the same cocktail. BioDesignER exhibited high multiplex editing efficiency with nearly 60% of the population incorporating both edits (58.8 ± 3.5%) by the fourth recombineering cycle (Figure 5A), thus outperforming EcNR2 (15.9 ± 3.0%) and showing similar efficiency to EcNR2.nuc5- (54.3 ± 5.6%) (Figure 5B).
Given the higher single-cycle conversion rate of EcNR2.nuc5- compared to BioDesignER (Figure 2C, right panel), we were surprised by the comparable performance of the two strains over multiple recombineering cycles. We partly attribute this parity to uncharacteristically low and sporadic single-cycle efficiencies that we repeatedly observed for EcNR2.nuc5- replicates (Supplementary Figure S7). Regardless, while both BioDesignER and EcNR2.nuc5- exhibited similar multiplex editing efficiencies, EcNR2.nuc5- requires culturing at 30–32°C and is a persistent mutator, which increases recombineering cycle time and basal mutation rate, respectively - thus limiting its overall utility as a reliable strain for multiplex genome editing.
DISCUSSION
High-efficiency genome engineering in bacteria enables breadth (24) and depth (17) explorations of genotypic diversity to enhance engineered behaviors. However, to date, no platform strain exists that incorporates a suite of core functions to provide efficient recombineering and regulate both genome engineering functions and cellular programs. BioDesignER is a high fidelity recombineering strain constructed to rapidly explore and optimize engineered functions. It incorporates many genomic modifications that increase recombination efficiency and reduce cycle time for recombineering workflows while minimizing off-target mutations. BioDesignER includes four independent inducible regulators to control recombineering and accommodate additional user designs. We have quantified the gene expression and targeted mutagenesis characteristics of eight Safe Site integration loci distributed across the genome and found seven Safe Sites suitable for engineering purposes.
BioDesignER enables rapid selection-based recombineering workflows with no requirements for plasmid transformation or curing. Reliable engineering of sequential genome integrations with established recombineering approaches, such as the use of plasmids pSIM5 (12) or pKD20 (10), require transformation and curing procedures of plasmid-encoded recombineering functions for each integration stage. These requirements increase the time required for individual genome editing steps by multiple days. Anecdotally, we have found plasmid-based recombineering systems unreliable for conducting multiple editing cycles from a single transformation of the recombineering plasmid. We speculate that the failure to achieve multi-cycle genome editing from plasmid-based recombineering solutions may be related to the accumulation of mutations spurred by the maintenance or leaky expression of λ-Red genes over many generations. In contrast, we have completed all of the scar-free DNA recombineering workflows reported here with no restoration or replacement of the minimized pTet λ-Red cassette.
We have increased recombineering fidelity in BioDesignER by striking a balance between recombination efficiency and mutagenesis rates. A high recombineering fidelity platform such as this may provide new avenues to multiplex genome remodeling using CRISPR-Cas9 techniques. CRISPR-Cas9 genome editing approaches in bacteria are limited by recombination efficiency to rescue double-stranded breaks. Linking CRISPR-Cas9 counterselection of native sequences with high-efficiency, multi-site recombineering may allow concurrent selection of many modifications from a large bacterial population with little off-target activity, thereby enabling researchers to explore unprecedented genetic diversity.
While BioDesignER exhibits robust functionalities with respect to recombineering fidelity, comparing the recombination efficiency of the BioDesignER lineage to EcNR2-derived strains reveals inconsistent results related to culture temperatures. Specifically, we found a nearly 2-fold reduction in recombination efficiency for BioDesignER at 30°C compared to 37°C, resulting in recombineering efficiencies similar to EcNR2 (Figure 2C). This reduction suggests some uncharacterized dependence of recombination efficiency on temperature and could reflect reduced ssDNA access to the replication fork, lower ssDNA half-life at reduced temperatures, or perhaps temperature-dependent expression of the λ-Red machinery from pTet.
While constructing BioDesignER, we developed multiple selection/counter-selection strategies that may be of general use for bacterial genome engineering. These strategies combine selection/counter-selection and fluorescence screening components to accelerate scar-free genome engineering. Specifically, the genetic cassettes utilize selection/counter-selection of thyA, building on work from FRUIT (51). This approach requires two recombineering transformations: a dsDNA integration of the fluorescence-coupled thyA cassette at the target genomic locus followed by removal of the cassette using ssDNA or dsDNA. The genetic modification of interest can be incorporated at either integration stage. In comparison, CRISPR-based genome editing workflows, which are gaining popularity, require multiple steps including guide plasmid construction, co-transformation with Cas9, and subsequent curing. Thus, the selection/counter selection methodologies developed here allow a simple and effective approach to genome engineering. Finally, as a resource to the bioengineering community we have generated a variant of BioDesignER (RE1000) with no antibiotic resistance genes (ΔampR).
Development of BioDesignER points to genome design strategies for next-generation biotechnology hosts. As synthetic biology matures, the application space is expanding beyond prototypical genetic circuits and metabolic pathways in laboratory environments to robust engineered functions in ecologies with high biotic and abiotic complexity, including soil, wastewater, and the human gut (52). Efficient and sustained activity of engineered functions in these environments will require programmed behaviors to be optimized in phylogenetically diverse microbes. We anticipate the integrative approach used to develop and characterize BioDesignER can be a template to develop high-efficiency recombineering platforms for new bacterial hosts.
Supplementary Material
ACKNOWLEDGEMENTS
The authors would like to thank Vivek Mutalik for providing the cymR expression construct and the core promoter sequence used to design the Tet-based λ-Red expression system. We would also like to thank the Li Ka Shing Flow Cytometry Facility at UC Berkeley.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Department of Energy Genome Science program - Office of Biological and Environmental Research [DE-SC008812, Funding Opportunity Announcement DE-FOA-0000640]; National Science Foundation (NSF) Graduate Research Fellowship (to H.S.R.); National Institutes of Health (NIH) Genomics and Computational Biology Training Program [5T32HG000047-18] (to H.S.R.). Funding for open access charge: DOE BER [DE-SC008812].
Conflict of interest statement. None declared.
REFERENCES
- 1. Khalil A.S., Collins J.J.. Synthetic biology: applications come of age. Nat. Rev. Genet. 2010; 11:367–379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Weber W., Fussenegger M.. Emerging biomedical applications of synthetic biology. Nat. Rev. Genet. 2011; 13:21–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Tyo K.E.J., Ajikumar P.K., Stephanopoulos G.. Stabilized gene duplication enables long-term selection-free heterologous pathway expression. Nat. Biotechnol. 2009; 27:760–765. [DOI] [PubMed] [Google Scholar]
- 4. Friehs K. Plasmid copy number and plasmid stability. Adv. Biochem. Eng. Biotechnol. 2004; 86:47–82. [DOI] [PubMed] [Google Scholar]
- 5. Bassalo M.C., Garst A.D., Halweg-Edwards A.L., Grau W.C., Domaille D.W., Mutalik V.K., Arkin A.P., Gill R.T.. Rapid and efficient one-step metabolic pathway integration in E. coli. ACS Synth. Biol. 2016; 5:561–568. [DOI] [PubMed] [Google Scholar]
- 6. Lee J.W., Gyorgy A., Cameron D.E., Pyenson N., Choi K.R., Way J.C., Silver P.A., Del Vecchio D., Collins J.J.. Creating single-copy genetic circuits. Mol. Cell. 2016; 63:329–336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Esvelt K.M., Wang H.H.. Genome-scale engineering for systems and synthetic biology. Mol. Syst. Biol. 2013; 9:1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Bryant J.A., Sellars L.E., Busby S.J.W., Lee D.J.. Chromosome position effects on gene expression in Escherichia coli K-12. Nucleic Acids Res. 2014; 42:11383–11392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Murphy K.C. Use of bacteriophage lambda recombination functions to promote gene replacement in Escherichia coli. J. Bacteriol. 1998; 180:2063–2071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Datsenko K.A., Wanner B.L.. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc. Natl. Acad. Sci. U.S.A. 2000; 97:6640–6645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Ellis H.M., Yu D., DiTizio T., Court D.L.. High efficiency mutagenesis, repair, and engineering of chromosomal DNA using single-stranded oligonucleotides. Proc. Natl. Acad. Sci. U.S.A. 2001; 98:6742–6746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Datta S., Costantino N., Court D.L.. A set of recombineering plasmids for gram-negative bacteria. Gene. 2006; 379:109–115. [DOI] [PubMed] [Google Scholar]
- 13. Sharan S.K., Thomason L.C., Kuznetsov S.G., Court D.L.. Recombineering: a homologous recombination-based method of genetic engineering. Nat. Protoc. 2009; 4:206–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Baba T., Ara T., Hasegawa M., Takai Y., Okumura Y., Baba M., Datsenko K.A., Tomita M., Wanner B.L., Mori H.. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2006; 2:1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Warner J.R., Reeder P.J., Karimpour-Fard A., Woodruff L.B.A., Gill R.T.. Rapid profiling of a microbial genome using mixtures of barcoded oligonucleotides. Nat. Biotechnol. 2010; 28:856–862. [DOI] [PubMed] [Google Scholar]
- 16. Freed E.F., Winkler J.D., Weiss S.J., Garst A.D., Mutalik V.K., Arkin A.P., Knight R., Gill R.T.. Genome-Wide tuning of protein expression levels to rapidly engineer microbial traits. ACS Synth. Biol. 2015; 4:1244–1253. [DOI] [PubMed] [Google Scholar]
- 17. Garst A.D., Bassalo M.C., Pines G., Lynch S.A., Halweg-Edwards A.L., Liu R., Liang L., Wang Z., Zeitoun R., Alexander W.G. et al.. Genome-wide mapping of mutations at single-nucleotide resolution for protein, metabolic and genome engineering. Nat. Biotechnol. 2016; 35:48–55. [DOI] [PubMed] [Google Scholar]
- 18. Wang H.H., Isaacs F.J., Carr P.A., Sun Z.Z., Xu G., Forest C.R., Church G.M.. Programming cells by multiplex genome engineering and accelerated evolution. Nature. 2009; 460:894–898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Isaacs F.J., Carr P.A., Wang H.H., Lajoie M.J., Sterling B., Kraal L., Tolonen A.C., Gianoulis T.A., Goodman D.B., Reppas N.B. et al.. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science. 2011; 333:348–353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Lajoie M.J., Rovner A.J., Goodman D.B., Aerni H.-R., Haimovich A.D., Kuznetsov G., Mercer J.A., Wang H.H., Carr P.A., Mosberg J.A. et al.. Genomically recoded organisms expand biological functions. Science. 2013; 342:357–360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Lajoie M.J., Kosuri S., Mosberg J.A., Gregg C.J., Zhang D., Church G.M.. Probing the limits of genetic recoding in essential genes. Science. 2013; 342:361–363. [DOI] [PubMed] [Google Scholar]
- 22. Wang H.H., Kim H., Cong L., Jeong J., Bang D., Church G.M.. Genome-scale promoter engineering by coselection MAGE. Nature Methods. 2012; 9:591–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Wang H.H., Huang P.-Y., Xu G., Haas W., Marblestone A., Li J., Gygi S.P., Forster A.C., Jewett M.C., Church G.M.. Multiplexed in vivo His-tagging of enzyme pathways for in vitro single-pot multienzyme catalysis. ACS Synth. Biol. 2012; 1:43–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Zeitoun R.I., Garst A.D., Degen G.D., Pines G., Mansell T.J., Glebes T.Y., Boyle N.R., Gill R.T.. Multiplexed tracking of combinatorial genomic mutations in engineered cell populations. Nat. Biotechnol. 2015; 33:631–637. [DOI] [PubMed] [Google Scholar]
- 25. Zeitoun R.I., Pines G., Grau W.C., Gill R.T.. Quantitative tracking of combinatorially engineered populations with multiplexed binary assemblies. ACS Synth. Biol. 2017; 6:619–627. [DOI] [PubMed] [Google Scholar]
- 26. Glickman B.W., Radman M.. Escherichia coli mutator mutants deficient in methylation-instructed DNA mismatch correction. Proc. Natl. Acad. Sci. U.S.A. 1980; 77:1063–1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Schaaper R.M., Dunn R.L.. Spectra of spontaneous mutations in Escherichia coli strains defective in mismatch correction: the nature of in vivo DNA replication errors. Proc. Natl. Acad. Sci. U.S.A. 1987; 84:6220–6224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Sawitzke J.A., Thomason L.C., Costantino N., Bubunenko M., Datta S., Court D.L.. Recombineering: in vivo genetic engineering in E. coli, S. enterica, and beyond. Meth. Enzymol. 2007; 421:171–199. [DOI] [PubMed] [Google Scholar]
- 29. Wang H.H., Xu G., Vonner A.J., Church G.. Modified bases enable high-efficiency oligonucleotide-mediated allelic replacement via mismatch repair evasion. Nucleic Acids Res. 2011; 39:7336–7347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Nyerges Á., Csörgő B., Nagy I., Latinovics D., Szamecz B., Pósfai G., Pál C.. Conditional DNA repair mutants enable highly precise genome engineering. Nucleic Acids Res. 2014; 42:e62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Nyerges Á., Csörgő B., Nagy I., Bálint B., Bihari P., Lázár V., Apjok G., Umenhoffer K., Bogos B., Pósfai G. et al.. A highly precise and portable genome engineering method allows comparison of mutational effects across bacterial species. Proc. Natl. Acad. Sci. U.S.A. 2016; 113:2502–2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Bubnov D.M., Yuzbashev T.V., Vybornaya T.V., Netrusov A.I., Sineoky S.P.. Development of new versatile plasmid-based systems for λRed-mediated Escherichia coli genome engineering. J. Microbiol. Methods. 2018; 151:48–56. [DOI] [PubMed] [Google Scholar]
- 33. Lennen R.M., Nilsson Wallin A.I., Pedersen M., Bonde M., Luo H., Herrgård M.J., Sommer M.O.A.. Transient overexpression of DNA adenine methylase enables efficient and mobile genome engineering with reduced off-target effects. Nucleic Acids Res. 2016; 44:e36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Sarkar S., Ma W.T., Sandri G.H.. On fluctuation analysis: a new, simple and efficient method for computing the expected number of mutants. Genetica. 1992; 85:173–179. [DOI] [PubMed] [Google Scholar]
- 35. Ma W.T., Sandri G.H., Sarkar S.. Analysis of the Luria–Delbrück distribution using discrete convolution powers. J. Appl. Probab. 1992; 29:255–267. [Google Scholar]
- 36. Sergueev K., Yu D., Austin S., Court D.. Cell toxicity caused by products of the p(L) operon of bacteriophage lambda. Gene. 2001; 272:227–235. [DOI] [PubMed] [Google Scholar]
- 37. Lawther R.P., Calhoun D.H., Adams C.W., Hauser C.A., Gray J., Hatfield G.W.. Molecular basis of valine resistance in Escherichia coli K-12. Proc. Natl. Acad. Sci. U.S.A. 1981; 78:922–925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Lawther R.P., Calhoun D.H., Gray J., Adams C.W., Hauser C.A., Hatfield G.W.. DNA sequence fine-structure analysis of ilvG (IlvG+) mutations of Escherichia coli K-12. J. Bacteriol. 1982; 149:294–298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Tedin K., Norel F.. Comparison of ΔrelA strains of Escherichia coli and Salmonella enterica serovar Typhimurium suggests a role for ppGpp in attenuation regulation of branched-chain amino acid biosynthesis. J. Bacteriol. 2001; 183:6184–6196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Lajoie M.J., Gregg C.J., Mosberg J.A., Washington G.C., Church G.M.. Manipulating replisome dynamics to enhance lambda Red-mediated multiplex genome engineering. Nucleic Acids Res. 2012; 40:e170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Mosberg J.A., Gregg C.J., Lajoie M.J., Wang H.H., Church G.M.. Improving lambda red genome engineering in Escherichia coli via rational removal of endogenous nucleases. PLoS One. 2012; 7:e44638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Glascock C.B., Weickert M.J.. Using chromosomal lacIQ1 to control expression of genes on high-copy-number plasmids in Escherichia coli. Gene. 1998; 223:221–231. [DOI] [PubMed] [Google Scholar]
- 43. Choi Y.J., Morel L., Le François T., Bourque D., Bourget L., Groleau D., Massie B., Míguez C.B.. Novel, versatile, and tightly regulated expression system for Escherichia coli strains. Appl. Environ. Microbiol. 2010; 76:5058–5066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Khlebnikov A., Skaug T., Keasling J.D.. Modulation of gene expression from the arabinose-inducible araBAD promoter. J. Ind. Microbiol. Biotechnol. 2002; 29:34–37. [DOI] [PubMed] [Google Scholar]
- 45. Price M.N., Wetmore K.M., Waters R.J., Callaghan M., Ray J., Liu H., Kuehl J.V., Melnyk R.A., Lamson J.S., Suh Y. et al.. Mutant phenotypes for thousands of bacterial genes of unknown function. Nature. 2018; 557:503–509. [DOI] [PubMed] [Google Scholar]
- 46. Gama-Castro S., Salgado H., Santos-Zavaleta A., Ledezma-Tejeida D., Muñiz-Rascado L., García-Sotelo J.S., Alquicira-Hernández K., Martínez-Flores I., Pannier L., Castro-Mondragón J.A. et al.. RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond. Nucleic Acids Res. 2016; 44:D133–D143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Bipatnath M., Dennis P.P., Bremer H.. Initiation and velocity of chromosome replication in Escherichia coli B/r and K-12. J. Bacteriol. 1998; 180:265–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Reynolds T.S., Gill R.T.. Quantifying impact of chromosome copy number on recombination in Escherichia coli. ACS Synth. Biol. 2015; 4:776–780. [DOI] [PubMed] [Google Scholar]
- 49. Sauer C., Syvertsson S., Bohorquez L.C., Cruz R., Harwood C.R., van Rij T., Hamoen L.W.. Effect of genome position on heterologous gene expression in bacillus subtilis: an unbiased analysis. ACS Synth. Biol. 2016; 5:942–947. [DOI] [PubMed] [Google Scholar]
- 50. Lee H., Popodi E., Tang H., Foster P.L.. Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc. Natl. Acad. Sci. U.S.A. 2012; 109:E2774–E2783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Stringer A.M., Singh N., Yermakova A., Petrone B.L., Amarasinghe J.J., Reyes-Diaz L., Mantis N.J., Wade J.T.. FRUIT, a scar-free system for targeted chromosomal mutagenesis, epitope tagging, and promoter replacement in Escherichia coli and Salmonella enterica. PLoS One. 2012; 7:e44841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Venturelli O.S., Egbert R.G., Arkin A.P.. Towards engineering biological systems in a broader context. J. Mol. Biol. 2016; 428:928–944. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.