Abstract
Synthetic auxotrophy in which bacterial viability depends on the presence of a synthetic amino acid provides a robust strategy for the containment of genetically modified organisms and the development of safe, live vaccines. However, a simple, general strategy to evolve essential proteins to be dependent on synthetic amino acids is lacking. Using a temperature sensitive selection system, we evolved an Escherichia coli (E. coli) sliding clamp variant with an orthogonal protein-protein interface, which contains a Leu273 to p-benzoylphenyl alanine (pBzF) mutation. The E. coli strain with this variant DNA clamp has a very low escape frequency (< 10−10), and its growth is strictly dependent on the presence of pBzF. This selection strategy can be generally applied to create ncAA dependence of other organisms with DNA clamp homologues.
Graphical Abstract

The use of auxotrophic bacterial strains for the containment of genetically modified organisms or as conditional, live vaccines suffers from the potential availability of the missing nutrient or metabolite from altered metabolism or the host1. To overcome this challenge, recent efforts have focused on the generation of auxotrophs that are dependent on non-naturally occurring compounds for growth, specifically a non-canonical amino acid (ncAA) that is added exogenously to the growth medium2–6. This approach is enabled by the fact that ncAAs with unique physicochemical properties can be readily added to the genetic codes of prokaryotic and eukaryotic organisms7–11. All that is required is an orthogonal translation system consisting of an exogenous tRNA and aminoacyl-tRNA synthetase (aaRS) pair that does not cross react with tRNAs or aaRSs in the host, and that recognize a unique codon, usually a nonsense codon7–8.
A number of strategies have been reported to construct ncAA dependent auxotrophs with low escape frequencies. For example, structurally distinct ncAAs were substituted at permissive sites in multiple essential enzymes to generate strains dependent on the simultaneous presence of the ncAAs5. In another example, the ncAA L-4,4′-biphenylalanine (bipA) was used to generate auxotrophic bacteria in which the core of an essential enzyme was redesigned to pack around bipA6. Other approaches that do not require multiple ncAA substitution or computational design include the substitution of an essential Lys residue by N-ε-acetyl-L-lysine with subsequent host enzyme deacetylation to generate Lys3; engineering a metal ion binding site in which His is substituted with the structurally similar analogue, 3-methyl-L-histidine2; and engineering of a dimer interface that depends on the presence of an ncAA4. Here we describe a general strategy to develop ncAA auxotrophs by engineering the protein-protein interface of the E. coli sliding clamp to be strictly dependent on the presence of an ncAA for bacterial survival. Using this approach, we developed an E. coli auxotroph that requires pBzF for survival with an escape frequency below 10-10.
The β-subunit (sliding clamp, SC) of the DNA polymerase III holoenzyme, encoded by the dnaN gene, is a dimeric protein that completely encircles the DNA as nucleotides are added. SC plays a critical role in the processivity of DNA replication and is essential for bacterial survival12–14. The SC forms a head-to-tail homodimer, with each monomer having identical chain topology15. The conserved structure of SCs across gram positive and gram negative bacteria make it a readily generalizable strategy for generating other ncAA dependent auxotrophs16–17.
To construct an orthogonal β-subunit interface, we first disrupted the native E. coli SC (SCEco) interface by incorporating the large and structurally distinct ncAA, pBzF, at the dimer interface, which resulted in a loss of cell viability. We then performed a selection with a library of residues around the pBzF side chain to restore its SC function. To this end, we conditionally knocked out the dnaN gene of E. coli in the presence of the plasmid, pKD-ESC which expresses SCEco and λ-red recombinase. pKD-ESC also has a temperature sensitive replicon (repA101ts) that limits the growth temperature of the knockout strain, DH10B ΔdnaN::tetR/pKD-ESC (termed EV1) to below 37 °C18. The EV1 strain was then transformed with the plasmid pUltra-BzF4 which contains the tRNACUA/aaRS pair that encodes pBzF to afford strain EV1.BzF (Figure 1). To disrupt the dimer interface19–20, we substituted Leu273 in SCEco (encoded by plasmid pBESC-AX) with the larger ncAA pBzF, expecting unfavorable van der Waals interactions with the surrounding hydrophobic residues (Figure S1b, c). Of note, an additional mutation V70A was introduced into this variant (termed SCEco-beta) to avoid potential steric hindrance between the benzophenone ring of pBzF and Val70 at the engineered interface (Figure S1). The mutant strain EV1.BzF/pBESC-AX could not survive at 42 °C, whereas the wild type analog, EV1.BzF/pBESC grew well under the same condition (Figure S2).
Figure 1. Strategy used to engineer ncAA dependent bacteria.

(a) Directed evolution of the E. coli sliding clamp dimer interface with a library that contains randomly mutated residues (cyan) surrounding the 273 residue (magenta). The engineered dimer interface has unique interactions of pBzF273 with surrounding residues (lower panel). (b) A temperature sensitive replicon in plasmid pKD-ESC (blue circle) was used to conditionally knock out the essential gene, dnaN. The selection was conducted at 42 °C with SCEco-beta library that is encoded in pBK-SC-Lib (green circle). The orthogonal translational machinery (encoded in pUltra-BzF, gray circle) was used to site specifically incorporate pBzF at the dimer interface of mutant sliding clamp. The evolved SCEco-beta variant that is encoded in pBK-SC-Hit (gold circle) enables pBzF dependent growth of EV2.BzF.hit (highlighted in red).
To restore the variant SCEco-beta subunit interface and, as a result, SC function in DNA replication, we carried out saturation mutagenesis on the residues surrounding the pBzF273 side chain of the mutant SCEco-beta in order to repack the largely hydrophobic dimer interface. Plasmid pBK-SC-Lib encoded a dnaN gene library with the following mutations: Leu273TAG, Val70Ala, and NNK (N = A, C, G or T; K = G or T) for the surrounding residues, Phe′75, Ile′78, Met′97, Phe′106, Leu′108, and Ile272 (where the prime residues correspond to the other subunit). The transformant EV1.BzF/pBK-SC-Lib was plated on permissive LB-agar media containing 1 mM pBzF and isopropyl 1-thio-β-D-galactopyranoside (IPTG, 1 mM) at 42 °C and grown overnight. Surviving colonies were amplified in the presence of pBzF and regrown by replica spotting on permissive (LB-agar, 1 mM pBzF) and non-permissive (LB-agar, no ncAA) media at 42 °C.
Using this selection strategy, we identified ten SCEco-beta variants which showed a pBzF dependent growth phenotype when introduced into the EV1.BzF strain, and after curing of the complementing pKD-ESC plasmid at 42 °C (Figure 1b and Table S1). One variant, h2, had a second TAG mutation at residue 108 in addition to the Leu273TAG mutation (Table S1). Because we hypothesized that two TAG codons would need to be mutated for this variant to escape pBzF growth dependence (the single base pair mutation frequency is ~10−10 per generation)21, which should result in a very low escape frequency, we first characterized this variant. However, the engineered strain, EV2.BzF/pBK-SC-h2 (EV2.BzF.h2) had a higher than expected escape frequency (4 × 10−10 escapees per colony forming unit (c.f.u.)) (Table S5). All the escape mutants had a mutation in the tyrT anticodon GTA to CTA to give supF amber suppressor, suggesting that tyrosine incorporation at residues 108 and 273 of h2 is the escape mechanism of EV2.BzF.h2. A complementation experiment with a plasmid library containing saturation mutations (NNKs) at residues 108 and 273 of h2 surprisingly showed that multiple combinations of hydrophobic amino acids at these residues resulted in the EV2.BzF.h2 growth in non-permissive media (Figure S3).
Next, we investigated clone h1 in which the mutations are mostly composed of small hydrophobic amino acids (Figure 2a). The engineered E. coli, EV2.BzF.h1 showed pBzF dependent growth in permissive media, whereas no growth was observed in non-permissive media over 2 weeks (Figure S4a). Unfortunately, EV2.BzF.h1 exhibited a comparable escape frequency (1.3 × 10−10 escapees per c.f.u) to EV2.BzF.h2 (Table S5). All isolated escapees had a TAG to TGG mutation at position 273, which encodes tryptophan. In a complementation experiment, we found that residue 273 in h1 can tolerate mutation to Phe, Trp, Met, or Val with 84%, 8%, 5%, and 3% frequency among all the growing colonies, respectively (Figure S4b, c).
Figure 2. Structure of h1 and h5 dimer interface.

Models of h1 and h5 were generated using the Rosetta protein design software. pBzF273 (magenta) is located at the interface with surrounding residues [(a) Ala′70, Leu′75, Gly′78, Leu′97, Met′106, Cys′108, and Val272 in h1; (b) Ala′70, Met′75, Phe′78, Leu′97, Thr′106, Ala′108, and Val272 in h5; cyan]. H-bond is shown with a red dashed line. Hydrogens are masked for clarity.
We then investigated all other hits from the selection to identify the SCEco-beta variant that gave the lowest mutational escape frequency with a canonical amino acid substitution at residue 273. Because EV2.BzF.h1 had a reasonably low escape frequency, we used this strain as the host for complementation by the other hits (h3-h10) in which an NNK mutation was introduced at site 273. In the case of clone 5, the only canonical amino acid capable of functional replacement of pBzF273 and conferring cell growth was phenylalanine (Figure 3a, b). Because the amber codon (UAG) is two mutations away from either of the phenylalanine codons (UUU and UUC), EV2.BzF.h5 should have lower escape frequency than EV2.BzF.h1 or EV2.BzF.h2. Indeed, EV2.BzF.h5 exhibited a pBzF dependent growth profile (Figure 3c) with non-detectable escapees (no escapees from the culture of 2 × 1010 in non-permissive solid media for 2 weeks at 37 °C) (Table S5).
Figure 3. Characterization of EV2.BzF.h5 dependence on pBzF for survival.

(a) EV2.BzF.h1 strain was transformed with a focused library of h5 in pEvol-h5-NNK plasmid for complementation. The growth was monitored both in permissive and non-permissive media that contains 0.02% L-arabinose. The wild type and blank plasmid were used for positive and negative controls, respectively. (b) Frequency of amino acid mutation at residue 273 of h5 in pEvol-h5-NNK plasmids isolated from survived EV2.BzF.h1 clones in non-permissive media. (c) Growth curve of EV2.BzF.wt (square) or EV2.BzF.h5 (triangle) cells in permissive (LB, 1 mM pBzF, opaque) and non-permissive (LB, no ncAA, hollow) media. Error bars represent the standard deviation of three technical replicates.
The doubling times of EV2.BzF.h1, EV2.BzF.h2, and EV2.BzF.h5 are slightly longer than the EV2.BzF.wt that expresses wild type SCEco (Table S5), perhaps due to lower levels of expression of the SCEco-beta variants compared to the wild type SCEco in the E. coli cytosol. Indeed, smaller amounts of the h5 protein were observed in the soluble fraction compared to wild type when each was overexpressed in E. coli (Figure S5). We were also able to confirm the dimeric nature of variant h5 by exploiting the photoreactivity of the benzophenone side chain of pBzF22. The soluble fractions of the lysate from E. coli cells that expressed c-terminal his-tagged SCEco and corresponding variant proteins were irradiated with long wavelength UV light, and crude samples were analyzed by denaturing SDS gel electrophoresis and western blotting. Clone h5 showed clear dimer bands upon irradiation, whereas wild type and non-irradiated h5 did not form dimer bands (Figure S5).
To understand the nature of the structural modifications, we modeled the engineered interfaces of h1 and h5 using the Rosetta software23 (Figure 2, see Supporting Information for a detailed description). Modeling revealed that the mutations at residues 70, 75, and 97 observed in the h1 and h5 variants allow the bulky “second ring” of pBzF (i.e., the phenyl ring farthest from the protein backbone) to adopt an orientation that would be difficult to achieve in the wild type protein. The models also suggest that mutations at position 106 facilitated an energetically favorable rotamer of the pBzF. Notably, Thr′106 in h5 also appears to introduce a hydrogen bonding interaction to the carbonyl of the pBzF273 residue.
In conclusion, by evolving the sliding clamp to be dependent on pBzF for function, we generated a mutant E. coli auxotrophic for pBzF with an undetectable escape rate for a single engineered essential protein, it is noteworthy that the strategy described here should be generally useful to engineer ncAA auxotrophs by creating ncAA dependent essential proteins. Given that the SC dimer interface is well conserved across gram negative pathogens including Pseudomonas aeruginosa (P. aeruginosa), Acinetobacter baumannii (A. baumannii), and Enterobacter cloacae (E. cloacae)17, the hit variants evolved in this study should be readily transferred to SCs of those strains for the development of conditional live vaccines (Figure S6). As such, we are now expanding this strategy to develop conditional live vaccines for P. aeruginosa and A. baumannii infections which are a significant risk for hospitalized patients, and those with impaired lung or immune system function. We are also applying a similar strategy to identify a second engineered ncAA dependent auxotroph.
Supplementary Material
ACKNOWLEDGMENT
This work was supported by NIH Grant R01 GM062159.
Footnotes
ASSOCIATED CONTENT
Supporting Information
The Supporting Information is available free of charge on the ACS Publications website.
Materials and Methods, Figures S1–S6, Tables S1–S5 (PDF)
The authors declare no competing financial interests.
REFERENCES
- (1).Lee JW; Chan CTY; Slomovic S; Collins JJ, Next-generation biocontainment systems for engineered organisms. Nat. Chem. Biol 2018, 14 (6), 530–537. [DOI] [PubMed] [Google Scholar]
- (2).Gan F; Liu R; Wang F; Schultz PG, Functional Replacement of Histidine in Proteins To Generate Noncanonical Amino Acid Dependent Organisms. J. Am. Chem. Soc 2018, 140 (11), 3829–3832. [DOI] [PubMed] [Google Scholar]
- (3).Xuan W; Schultz PG, A Strategy for Creating Organisms Dependent on Noncanonical Amino Acids. Angew. Chem. Int. Ed. Engl 2017, 56 (31), 9170–9173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (4).Koh M; Nasertorabi F; Han GW; Stevens RC; Schultz PG, Generation of an Orthogonal Protein-Protein Interface with a Noncanonical Amino Acid. J. Am. Chem. Soc 2017, 139 (16), 5728–5731. [DOI] [PubMed] [Google Scholar]
- (5).Rovner AJ; Haimovich AD; Katz SR; Li Z; Grome MW; Gassaway BM; Amiram M; Patel JR; Gallagher RR; Rinehart J; Isaacs FJ, Recoded organisms engineered to depend on synthetic amino acids. Nature 2015, 518 (7537), 89–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (6).Mandell DJ; Lajoie MJ; Mee MT; Takeuchi R; Kuznetsov G; Norville JE; Gregg CJ; Stoddard BL; Church GM, Biocontainment of genetically modified organisms by synthetic protein design. Nature 2015, 518 (7537), 55–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Wang L; Brock A; Herberich B; Schultz PG, Expanding the genetic code of Escherichia coli. Science 2001, 292 (5516), 498–500. [DOI] [PubMed] [Google Scholar]
- (8).Chin JW; Cropp TA; Anderson JC; Mukherji M; Zhang Z; Schultz PG, An expanded eukaryotic genetic code. Science 2003, 301 (5635), 964–7. [DOI] [PubMed] [Google Scholar]
- (9).Liu CC; Schultz PG, Adding new chemistries to the genetic code. Annu. Rev. Biochem 2010, 79, 413–44. [DOI] [PubMed] [Google Scholar]
- (10).Wan W; Tharp JM; Liu WR, Pyrrolysyl-tRNA synthetase: an ordinary enzyme but an outstanding genetic code expansion tool. Biochim. Biophys. Acta 2014, 1844 (6), 1059–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Young DD; Schultz PG, Playing with the Molecules of Life. ACS chemical biology 2018, 13 (4), 854–870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Kelman Z; O’Donnell M, DNA polymerase III holoenzyme: structure and function of a chromosomal replicating machine. Annu. Rev. Biochem 1995, 64, 171–200. [DOI] [PubMed] [Google Scholar]
- (13).Heltzel JM; Scouten Ponticelli SK; Sanders LH; Duzen JM; Cody V; Pace J; Snell EH; Sutton MD, Sliding clamp-DNA interactions are required for viability and contribute to DNA polymerase management in Escherichia coli. J. Mol. Biol 2009, 387 (1), 74–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (14).Fijalkowska IJ; Schaaper RM; Jonczyk P, DNA replication fidelity in Escherichia coli: a multi-DNA polymerase affair. FEMS Microbiol. Rev 2012, 36 (6), 1105–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (15).Kong XP; Onrust R; O’Donnell M; Kuriyan J, Three-dimensional structure of the beta subunit of E. coli DNA polymerase III holoenzyme: a sliding DNA clamp. Cell 1992, 69 (3), 425–37. [DOI] [PubMed] [Google Scholar]
- (16).Yin Z; Whittell LR; Wang Y; Jergic S; Ma C; Lewis PJ; Dixon NE; Beck JL; Kelso MJ; Oakley AJ, Bacterial Sliding Clamp Inhibitors that Mimic the Sequential Binding Mechanism of Endogenous Linear Motifs. J. Med. Chem 2015, 58 (11), 4693–702. [DOI] [PubMed] [Google Scholar]
- (17).McGrath AE; Martyn AP; Whittell LR; Dawes FE; Beck JL; Dixon NE; Kelso MJ; Oakley AJ, Crystal structures and biochemical characterization of DNA sliding clamps from three Gram-negative bacterial pathogens. Journal of structural biology 2018, 204 (3), 396–405. [DOI] [PubMed] [Google Scholar]
- (18).Hashimotogotoh T; Franklin FCH; Nordheim A; Timmis KN, Specific-Purpose Plasmid Cloning Vectors .1. Low Copy Number, Temperature-Sensitive, Mobilization-Defective Psc101-Derived Containment Vectors. Gene 1981, 16 (1–3), 227–235. [DOI] [PubMed] [Google Scholar]
- (19).Stewart J; Hingorani MM; Kelman Z; O’Donnell M, Mechanism of beta clamp opening by the delta subunit of Escherichia coli DNA polymerase III holoenzyme. J. Biol. Chem 2001, 276 (22), 19182–9. [DOI] [PubMed] [Google Scholar]
- (20).Jeruzalmi D; Yurieva O; Zhao Y; Young M; Stewart J; Hingorani M; O’Donnell M; Kuriyan J, Mechanism of processivity clamp opening by the delta subunit wrench of the clamp loader complex of E. coli DNA polymerase III. Cell 2001, 106 (4), 417–28. [PubMed] [Google Scholar]
- (21).Barrick JE; Yu DS; Yoon SH; Jeong H; Oh TK; Schneider D; Lenski RE; Kim JF, Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature 2009, 461 (7268), 1243–7. [DOI] [PubMed] [Google Scholar]
- (22).Chin JW; Martin AB; King DS; Wang L; Schultz PG, Addition of a photocrosslinking amino acid to the genetic code of Escherichiacoli. Proc Natl Acad Sci U S A 2002, 99 (17), 11020–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (23).Leaver-Fay A; Tyka M; Lewis SM; Lange OF; Thompson J; Jacak R; Kaufman K; Renfrew PD; Smith CA; Sheffler W; Davis IW; Cooper S; Treuille A; Mandell DJ; Richter F; Ban YE; Fleishman SJ; Corn JE; Kim DE; Lyskov S; Berrondo M; Mentzer S; Popovic Z; Havranek JJ; Karanicolas J; Das R; Meiler J; Kortemme T; Gray JJ; Kuhlman B; Baker D; Bradley P, ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 2011, 487, 545–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
