Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2015 Mar 23;43(13):e83. doi: 10.1093/nar/gkv248

A set of powerful negative selection systems for unmodified Enterobacteriaceae

Varnica Khetrapal 1, Kurosh Mehershahi 1, Shazmina Rafee 1, Siyi Chen 1, Chiew Ling Lim 1, Swaine L Chen 1,2,*
PMCID: PMC4513841  PMID: 25800749

Abstract

Creation of defined genetic mutations is a powerful method for dissecting mechanisms of bacterial disease; however, many genetic tools are only developed for laboratory strains. We have designed a modular and general negative selection strategy based on inducible toxins that provides high selection stringency in clinical Escherichia coli and Salmonella isolates. No strain- or species-specific optimization is needed, yet this system achieves better selection stringency than all previously reported negative selection systems usable in unmodified E. coli strains. The high stringency enables use of negative instead of positive selection in phage-mediated generalized transduction and also allows transfer of alleles between arbitrary strains of E. coli without requiring phage. The modular design should also allow further extension to other bacteria. This negative selection system thus overcomes disadvantages of existing systems, enabling definitive genetic experiments in both lab and clinical isolates of E. coli and other Enterobacteriaceae.

INTRODUCTION

Elucidation of the molecular basis for a given phenotype in bacteria (such as the ability to cause an infection) is heavily reliant on the ability to create defined genetic mutations. Numerous systems exist for manipulation of bacterial chromosomes; in general, the most powerful of these use selection to isolate the desired mutant. Positive selection markers based on antibiotic resistance genes are thus a mainstay of the genetic toolbox in nearly every genetically tractable bacterium.

However, cloning strategies relying solely on positive selection markers (which enable isolation of bacteria carrying the marker) result in ‘marked’ strains, where the marker itself or a residual scar remains in the genome, potentially causing unanticipated effects. Creation of definitive genetic constructs (e.g. a single point mutation in a gene of interest) is therefore greatly facilitated by negative selection markers (which allow selection of bacteria without the marker). Having negative selection enables a two-step strategy of (i) positive selection-mediated deletion/replacement of a gene of interest by a selection cassette followed by (ii) negative selection-mediated, seamless replacement of the same cassette by a mutated allele (Figure 1A). Comparison with another strain in which the selection cassette is similarly replaced with a wild-type allele thus allows rigorous assignment of phenotypic differences to the engineered point mutation.

Figure 1.

Figure 1.

Design and characterization of a negative selection module. (A) Schematic of a two-step cloning strategy for allelic replacement using positive and negative selection. Step 1. A PCR product (depicted with a thick black baseline) containing a positive–negative selection cassette (kan-PrhaB-relE here) flanked by sequence (shaded gray) homologous to the targeted gene is subjected to double crossover recombination (indicated by crossing lines) to replace the gene. Selection for this replacement is done using the positive selection marker (kanamycin here). Step 2. Another PCR product (thick black baseline) containing the same or a different allele (indicated by *) of the target gene, also flanked by sequence homologous to the targeted locus is subjected to double crossover recombination to reintegrate the original locus. The resulting genome carries no residual markers or DNA scar. Selection for this reintegration is done using the negative selection marker (plating on rhamnose in this case). (B) Schematic diagram of template plasmid pSLC-217. The negative selection module (PrhaB driving relE) is shaded in black. The positive selection module (neo, mediating kanamycin resistance) is shaded gray. Transcriptional terminators are indicated by open boxes. Universal priming sites P1 and P2 are shown as arrows. Other features (FRT sites, AmpR gene) are indicated by open box arrows.

Negative selection markers are in general less efficient than positive selection markers, even for well-studied bacteria such as Escherichia coli (18). Particularly for disease-causing clinical isolates, negative selection systems that require host genotype modification (e.g. ccdB, tolC, thyA, rpsl and thymidine kinases (48)) are impractical, as the required host modifications often involve conserved metabolic functions that may impact virulence (9), thereby confounding the analysis or requiring an additional step to restore the original genotype. Among negative selection systems that are usable in unmodified host strains (and are therefore candidates for direct use in clinical isolates), additional drawbacks include low selection stringency (as for sacB (2)) and the need for strain-specific optimization of selection conditions (as for the tetA and tetA–sacB systems (1,3)).

To overcome these disadvantages in unmodified hosts, we designed a general and modular negative selection system based on inducible toxins. We optimized selection conditions in one clinical strain of E. coli, UTI89, then verified that this design was usable without further modification or optimization in lab and other clinical isolates of E. coli as well as in Salmonella enterica, enabling convenient two-step creation of markerless and scarless chromosomal point mutations. The negative selection module achieved high selection stringency in all tested strains, exceeding all other reported negative selection systems usable in unmodified lab isolates of E. coli by up to 60-fold, and approaching theoretical predictions of maximum stringency based on mutation rates. The high negative selection stringency enabled its use in traditional phage-mediated generalized transduction, in which an additional benefit is the generation of unmarked transductants using a wild-type donor. Furthermore, we were able to use the negative selection system to perform a modified phage-free ‘transduction’ (which we term generalized allelic exchange) using transformed whole genomic DNA between two uropathogenic isolates of E. coli, an otherwise difficult problem due to the lack of transducing phages for clinical strains (10). Therefore, this negative selection system is a convenient and powerful addition to the genetic toolbox for all E. coli strains that, in combination with traditional antibiotic markers, enables the creation of arbitrary unmarked mutations in all tested strains of E. coli and other Enterobacteriaceae and allows for definitive genetic experiments even in clinical strains. It further enables new applications for generalized allelic exchange between different strains.

MATERIALS AND METHODS

Media and culture conditions

LB and M9 agar were used as complex and minimal media, respectively. For plasmid maintenance and selection of positive selection antibiotic markers, media was supplemented with antibiotics (Sigma, Singapore) at the following concentrations: ampicillin (100 μg/ml), kanamycin (50 μg/ml), chloramphenicol (20 μg/ml) and tetracycline (10 μg/ml). M9 agar was supplemented with 0.2% glucose to repress toxin gene expression or with 5% sucrose (for cat-sacB selection), 0.2% rhamnose (for all PrhaB-driven toxins) or 0.2% arabinose (for ParaB-driven toxins). Maintenance of plasmids carrying the toxin genes was carried out in LB supplemented with 2% glucose.

Bacterial strains and plasmids

Relevant characteristics of bacterial strains and plasmids used in this study are listed in Supplementary Table S1. The template plasmids pKD3 and pKD4 were obtained from AddGene and were modified in accordance with the AddGene Material Transfer Agreement for non-commercial purposes.

tetA negative selection

Overnight cultures of UTI89 carrying tetA on plasmids or integrated into the chromosome were diluted 1:100 into fresh media and grown to an optical density at 600nm (OD600) of 1. Serial 10-fold dilutions were plated to quantify the number of surviving bacteria (CFU) on selective and non-selective (LB) plates. Selective plates were supplemented with tetracycline (for positive selection) or 2–4 mM NiCl2 (for negative selection).

Plasmid construction

All plasmids used in this study are listed in Supplementary Table S1. To facilitate cloning, we first inserted several restriction sites (AatII, SmaI and SacII) into the AfeI site of pKD4, generating pSLC-243. Toxic genes were amplified from E.coli MG1655 (relE, chpBK, mqsR, higB, yafQ and yhaV), Pseudomonas aeruginose (tse2 (11)) or E. coli UTI89 (phage λ holin (12)) genomic DNA by PCR then cloned using NdeI and BamHI sites (in pAH120) or NdeI and NheI (in pAH150 (13)) to place them under the control of the rhaB or araB promoter, respectively. Each promoter-toxin module, including the flanking tL3 and rgnB transcription terminators from pAH120 or pAH150, was cloned by blunt ligation into the SmaI site of pSLC-243 to generate template plasmids (pSLC-17 containing kan-PrhaB-relE is shown in Figure 1B). All plasmids were propagated in E. coli strain BW23473 (pir+, low copy number) or BW23474 (pir116, high copy number). The sequences of all negative selection cassettes are available upon request.

Recombineering

All genomic manipulations were carried out using previously described Red recombinase recombineering protocols optimized for lab or clinical isolates of E. coli (14), with minor modifications. The positive–negative selection cassette was amplified by PCR using the P1 and P2 universal primers for pKD4. Primers used for cloning are listed in Supplementary Table S8. Primers used for fimH constructs were as previously described (15). Each primer was synthesized with an extra 50 bp of sequence at the 5′ end that was homologous to the targeted genomic locus. Red recombinase was expressed from the pKM208 (in E. coli) or pKD46 (in S. enterica Serovar Typhimurium 14028S) helper plasmids. Overnight cultures (LB-ampicillin, 30°C with agitation) of the target strain were diluted 1:100 into fresh media, grown at 30°C with agitation to OD600 = 0.2–0.3, and induced with 1 mM Isopropyl β-D-1-thiogalactopyranoside (IPTG) (pKM208) or 0.2% arabinose (pKD46) for 30–45 min. The induced cells were then heat shocked for 15 min at 42°C, followed by swirling in an ice water bath for 15 min. Cells were harvested by centrifugation (5000 rpm for 10 min), washed two times with cold (4°C) water, re-suspended in 1/100 of the original culture volume of cold 10% glycerol, and frozen in 50 μl aliquots as competent cells. To perform double crossover homologous recombination, 1 μg of a PCR product was transformed by electroporation into thawed competent cells using 1mm electroporation cuvettes in a GenePulser XCEL set to an output voltage of 1500 V with 25 μF capacitance and 400 Ω resistance (Bio-Rad, Singapore). Cells were then recovered in LB at 37°C with shaking for 2 h followed by static incubation (no shaking) for 2 h, plated on selective plates, and incubated overnight (for kanamycin selection) or for 24–48 h (for rhamnose selection) at 37°C.

Diagnostic PCR and Sanger sequencing

Colony PCR reactions were used to check for insertion/replacement of the selection cassette. The first primer pair was used to specifically detect the locus of desired homologous recombination and the second primer pair was to detect the presence or absence of the specific toxin gene at the desired locus. Primers used are listed in Supplementary Table S8. Primers for fimH constructs were uti8 + 4913333 and uti8 − 4915222 as described previously (15). The strains were confirmed using Sanger sequencing of the diagnostic PCR products with the same primers used for amplification (1st Base, Singapore).

Immunoblot

Bacteria from 1 ml of culture at OD600 = 1.0 was harvested by centrifugation (14 000 rpm, 1 min). The bacterial pellet was resuspended in 300 μl of 4× sodium dodecyl sulfate (SDS) loading buffer (50% glycerol, 0.5% SDS, 0.25 M Tris, mercaptoethanol and bromophenol blue). Ten microliters of this sample was analyzed by sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE) (15% polyacrylamide gel). Proteins were transferred to a nitrocellulose membrane using a Trans-Blot® SD Semi-Dry transfer cell (Bio-Rad, Singapore). Blots were probed with anti-GroEL antibody (Sigma, Singapore) at a 1:6000 dilution in Tris-Borate-EDTA (TBE) with 5% skimmed milk to assess loading quantities. Blots were probed with custom anti-OmpC monoclonal antibodies (Abmart Inc., Shanghai, China). Antibody 4H15 (1:1000 dilution) recognizes both the wild type and mutant OmpC protein (mutated in loop 5 residues) while 4K3 (1:1000) was raised against the OmpC loop 5 residues and thus does not detect the mutant OmpC protein. Secondary antibodies used were ECL™ anti-rabbit IgG Horseradish peroxidase (HRP)-linked whole antibody (1:10 000) for anti-GroEL antibody and ECL™ anti-mouse IgG HRP-linked whole antibody (1:10 000). Blots were visualized using Amersham HRP substrate in a Chemidoc machine (Bio-Rad, Singapore).

Whole genome sequencing

Genomic DNA from strain SLC-561 (UTI89 fimH::FRT; mlaA::mlaAUTI89) was prepared from 1mL of an OD600 of 1 culture using the Wizard® Genomic DNA Extraction Kit (Promega) and prepared for Illumina sequencing using the TruSeq DNA library preparation kit (Illumina) according to the manufacturer's recommendations. This library was sequenced on a HiSeq 2000 (2 × 76 bp reads), yielding 5 680 917 read pairs (∼170× coverage). After quality filtering, fastq files were analyzed with cortex_var (version 1.0.5.20) from the Cortex package (16) using a joint analysis with UTI89 (NCBI accession number NC_007946) as the reference genome; the resulting VCF files indicated the known deletion at the fimH locus but no other variants (insertions, deletions, or single nucleotide polymorphisms) that passed quality filters. The sample was further tested for large (>100 bp) structural variations relative to the UTI89 reference genome using the SVRE program (Sukumaran and Chen, unpublished data) ; no large structural variations besides the fimH deletion were found that differed from wt UTI89.

Hemagglutination titers

These were performed as previously described (15). Briefly, 1 ml of an OD600 = 1.0 suspension of bacterial cells in PBS was gently pelleted (4000 × g, 2 min) and resuspended in 100 μl phosphate buffered saline (PBS). Twenty-five microliters of this was serially diluted in a row of a 96-well V-bottom plate (Corning #3897) where each well contained 25 μl of PBS (with or without mannose) (dilution range 1:2 to 1:4096). Twenty-five microliters of guinea pig red blood cells in PBS were added to each well. The plate was gently agitated and incubated overnight at 4°C. The HA titer reported was the greatest dilution of cells that resulted in visible clumping of erythrocytes.

SDS–EDTA sensitivity

Overnight cultures were diluted 1:100 into fresh media and grown to OD600 = 1. Serial 10-fold dilutions were plated on different media to quantify the number of surviving bacteria (CFU). Selective plates were supplemented with 0.5% SDS and between 0.75–2.25 mM ethylenediaminetetraacetic acid (EDTA) and non-selective plates were supplemented with 0.5% SDS, with or without kanamycin.

Selection stringency

Overnight grown cultures were diluted 1:100 into fresh media and grown to OD600 = 1. Serial 10-fold dilutions were carried out and plated on non-restrictive (plain LB for all strains; also including antibiotic plates for antibiotic resistant strains) media to quantify the number of surviving bacteria (CFU) in the original suspension. 1010 cells (10 ml culture of OD600 = 1, centrifuged and re-suspended with 200 μl PBS) were spread on appropriate restrictive plates to check for negative selection (on M9-rhamnose or M9-arabinose for strains carrying the negative selection cassette) and positive selection (LB-kanamycin for strains not carrying the kanamycin positive selection marker) stringency. Selection stringency was determined by dividing the calculated CFU of the original inoculum on restrictive plates by the average number of CFU that grew on non-restrictive plates.

Luria–Delbruck fluctuation test

Strains were grown overnight on a non-restrictive plates (LB for MG1655 and LB-kanamycin for SLC-568) and one single colony was resuspended in 50 μl of PBS. Twenty microliters of this suspension was diluted 200× and aliquoted into the central 60 wells of each of 2× 96-well plates per strain. Each well received 20 μl of the diluted suspension (approximately 102 or 104 CFU/well) and 180 μl of non-restrictive media (LB for MG1655 and M9 with 0.2% glucose for SLC-568). The plates were then incubated at 37°C with shaking (220 rpm) for 6 h (MG1655, starter cultures with 104 CFU/well) or 18 h (SLC-568, starter cultures with 102 CFU/well). (As a control, we also carried out 8-h incubations with SLC-568; the resulting data was similar to that from experiments using an18-h incubation). Ten wells from each plate were used to calculate total CFU per well by serial dilutions. The remaining 50 wells from each of the two plates were pelleted (100 total) and resuspended in 200 μl of selective medium (LB-kanamycin for MG1655 and M9 with 0.2% rhamnose for SLC-568). The plates were then incubated overnight at 37°C with shaking (220rpm) after which the cells were again pelleted and resuspended in 10 μl of restrictive medium and spotted on restrictive agar plates. The agar plates were incubated at 37°C for 24–48 h. The number of wells which gave rise to colonies growing on restrictive agar plates was counted and the mutation rate of the strains was determined using the following formula (17);

graphic file with name M1.gif

Generalized transduction

Transduction using P1vir was performed in MG1655 using standard protocols (18). A transducing lysate generated from wild type MG1655 was used to infect SLC-568 (MG1655 hsdS::kan-PrhaB-tse2) followed by plating on M9-rhamnose as described above in the ‘Recombineering’ section.

Generalized allelic exchange

Isolated genomic DNA from E. coli CFT073 was sheared to an average length of 10 kb fragments by sonication (Branson Digital Sonifier Disruptor 450) (2 × 5 s pulses at 10% power on ice). Ten micrograms of sheared DNA was transformed into competent SLC-557 (UTI89 ompC::kan-PrhaB-relE) expressing red recombinase. Cells were recovered in LB at 37°C with shaking for 2 h followed by static incubation (no shaking) for 2 h then plated on M9-rhamnose agar plates, and screened via PCR and sequencing for allelic exchange at ompC locus using P21 and P22 primer set.

BLAST analysis of type II toxins

Type II toxin genes present in MG1655 were taken from (19) and from the NCBI RefSeq annotation of the MG1655 genome (NC_000913). Finished E. coli genome sequences were downloaded from NCBI RefSeq (Supplementary Figure S2). Each non-K12 genome was used as the database to search for each toxin gene using the blastn (using the toxin gene sequences) and tblastn (using the toxin protein sequences) programs (BLAST 2.2.28+) with default parameters. Examination of the blastn and tblastn results showed that a cutoff of a blastn alignment over 80% of the toxin gene length captured all genes with simultaneously >88% DNA identity and >84% protein identity over at least 80% of the toxin gene (or protein) sequence. This 80% blastn alignment cutoff also included four additional gene/genome combinations which had potential frameshift mutations (yafQ and yafO in SMS_3_5; cbtA in O55_H7 RM12579 and P12b) but for which the entire gene sequence was otherwise present. We thus used 80% alignment over the toxin gene length by blastn as the cutoff to call toxins as present or absent in a given genome.

RESULTS

Existing negative selection systems are not practical for a clinical isolate of E. coli

We first tested those existing negative selection systems that do not require host genotype modification for their ability to function in single copy in the chromosome of an unmodified clinical uropathogenic isolate of E. coli, UTI89. Only two systems, tetA (1) and sacB (20), could potentially be used in UTI89. Selection against Tet resistance by fusaric acid is known to require optimization in different strains of E. coli and may not be usable in others (21); we were unable to demonstrate negative selection mediated by the tetA gene found on pBR322 in UTI89 after optimization of fusaric acid and salt concentrations (data not shown). It has also been reported that Ni2+ salts can be used to select against TetR strains and that the selectivity can in some cases exceed 1 in 106 cells (1), but the selection power and optimal Ni2+ concentration varied between different E. coli strains, typically requiring above 6mM NiCl2. We found that, indeed, growth of UTI89/pBR322 (TetR) relative to UTI89 was inhibited by 4 orders of magnitude at an optimal concentration of 3 mM NiCl2. However, pBR322 has an estimated copy number of 20/cell. In order to use this system for chromosomal manipulation, it must work at a copy number of 1/cell. We found that the sensitivity of TetR UTI89 to NiCl2 decreased with copy number, to the point that no NiCl2 sensitivity was found with UTI89::tet (containing a single copy chromosomal tetA gene) relative to UTI89 (Supplementary Figure S1), as was also seen with DH10B (a lab adapted strain of E. coli) (1).

Gram negative bacteria carrying the B. subtilis sacB gene accumulate toxic metabolites when grown in the presence of sucrose. The sacB gene is widely used for negative selection (20,22,23), but it suffers from relatively high false positive rates for chromosomal manipulations due to spontaneous suppressor mutations that inactivate sacB (24). This has the overall effect of reducing its stringency to <2.3 × 10−5 (20). However, in UTI89, a single chromosomal copy of cat-sacB mediated negative selection with a relatively weak selection stringency of 3.40 × 10−2. Recently, a system combining both the tetA and sacB genes has been reported with improved stringency (3) but is very sensitive to selection conditions and therefore requires strain-specific optimization. Therefore, a negative selection system immediately usable in clinical strains of E. coli is lacking.

Design of a modular negative selection cassette based on inducible toxins

We sought to design a new negative selection system that would both provide high stringency and not require strain-specific optimization. Negative selection using the mazF toxin gene has been demonstrated in single strains of Bacillus subtilis (25) and Clostridium acetobutylicum (26), though no data on stringency was reported. These reports suggested that isolated toxins from toxin–antitoxin (TA) systems could be generally useful as negative selection markers, even when used in heterologous species (mazF is an E. coli gene, yet seems to function in Gram positives). We therefore asked whether a general system providing tightly controlled induction of a toxin gene could be used in E. coli and other Enterobacteriaceae. The E. coli MG1655 (a lab-adapted fecal isolate) chromosome is known to carry numerous TA systems (19); we focused on Type II TA systems (in which both toxin and antitoxin are proteins) because they are more easily identified and therefore likely to be more completely annotated. To develop a system for use in clinical isolates (in particular, UTI89), we reasoned that the TA system we used should be absent from the UTI89 genome. Using BLAST analysis, we identified 12 candidate toxins present in MG1655 but not in the UTI89 genome (Supplementary Table S2). Further analysis using 55 non-K12 full genome sequences available at NCBI RefSeq highlighted 7 MG1655 toxins (rnlA, ypjF, ykfI, ydaS, yjhX, relE and mqsR) that were not found in more than half of non-K12 genomes (Supplementary Figure S2 and Supplementary Table S2).

We created a negative selection cassette consisting of the relE toxin (chosen for its small size, to minimize the PCR length in subsequent steps and its potential for inactivation by mutation) driven by the rhaB promoter (chosen for its ‘tight’ control; it is induced by growth on rhamnose and repressed by growth on glucose). To further ensure deliberate control of expression, we included the tL3 and rgnB terminators flanking the PrhaB-relE to guard against read-through transcription from other promoters; and we retained the native rhaB ribosome binding site. For use in making unmarked chromosomal mutations, we combined this negative selection cassette with a kanamycin resistance gene, a traditional positively selectable antibiotic marker. We built the construct as a derivative of the well-designed cloning system described by Datsenko and Wanner (14) based on Lambda Red recombinase and antibiotic selection using the pKD4 template plasmid. We refer to the derived template plasmid as pSLC-217 and to the combined positive–negative selection cassette as kan-PrhaB-relE (Figure 1B).

Efficient, scarless and markerless manipulation of the UTI89 chromosome

To validate our system, we first used the kan-PrhaB-relE cassette to perform a two-step allelic replacement of the ompC gene at its native locus in UTI89. We used a Red recombinase protocol optimized for clinical isolates of E. coli (22) to replace the native ompC gene by homologous recombination with a linear PCR product (containing kan-PrhaB-relE flanked by homology arms targeting the ompC locus), using positive selection on kanamycin (Figure 1A, step 1). We then used PCR products for the wild type UTI89 ompC and a mutated ompC (carrying mutations in extracellular loop L5) allele in a subsequent Red recombinase-mediated double-crossover replacement of the integrated positive–negative selection cassette, using negative selection on rhamnose (Figure 1A, step 2). Clones were screened using PCR for the entire ompC locus as well as for specific insertion of the relE toxin gene at the ompC locus (Supplementary Figure S3A). Positive clones identified by PCR were verified by testing for loss of kanamycin resistance, restored ability to grow on rhamnose (Supplementary Figure S3B), and Sanger sequencing of the entire ompC locus. The efficiency for isolating the correct recombinant during the counterselection step was 87–100% as calculated by testing 30 colonies from each counterselection step by PCR, growth on rhamnose, and loss of the positive selection marker. To further confirm that the restored ompC locus was functional, we tested for OmpC protein expression using immunoblotting. We saw OmpC protein expression in whole cell extracts for the parental wild type strain as well as both recombined strains (carrying a wt and L5 mutant ompC, respectively), but no OmpC band in the intermediate knockout strain. Furthermore, using an antibody specific to the L5 loop of OmpC, we saw detection only in the parental wild type (wt) and the strain with the restored wt ompC allele but not in the knockout or the strain carrying the L5 mutant allele (Supplementary Figure S3C). Therefore, the combination of our negative selection cassette with a traditional positive selectable marker has enabled us to create a functional, scarless mutation directly in the ompC locus of UTI89.

Creation of markerless mutations at arbitrary loci in UTI89

We verified that the kan-PrhaB-relE positive–negative selection cassette could be used to manipulate arbitrary genes in UTI89 by performing gene deletion and complementation at three other loci (Supplementary Table S3). We used the two-step integration/knockout followed by replacement of the integrated dual selection cassette, using negative selection on rhamnose to create isogenic mutant strains or unmarked mutations with defined deletion junctions (Supplementary Tables S1 and S3). We successfully deleted and reintegrated the mlaA and hsdS genes, verifying each by both diagnostic PCR and Sanger sequencing of the locus. Functional confirmation of mlaA knockout and reintegrated constructs was also done using an SDS–EDTA sensitivity assay (Supplementary Table S4). We also made unmarked versions of previously reported strains carrying point mutations in the fimH gene (15), eliminating the downstream linked kanamycin gene previously required for the second step of cloning. All clones were confirmed to contain the predicted mutations by diagnostic PCR and sequencing of the new junctions created by recombination. Furthermore, all of the unmarked fimH mutants were verified to have similar type 1 pilus function to their marked counterparts based on hemagglutination of guinea pig red blood cells (Supplementary Table S5).

No off-target mutations are created

Expression of Red recombinase can induce a 10-fold increase in spontaneous mutations (22). As the goal is to perform well-defined chromosomal manipulations, the two rounds of Red recombinase mediated recombination are a potentially large problem for off-target, unintended mutations. To mitigate this, we followed the recommendations of Campellone and Murphy to minimize Red recombinase induction to 30–45 min prior to generation of competent cells in both recombination steps (22). To further address whether off-target mutations were being generated, however, we performed whole genome sequencing on a strain that had undergone a chromosomal knockout then reintegration of the mlaA gene. Using Cortex (which detects SNPs and small indels) (16), we found no mutations compared with the parental strain (prior to the initial knockout of mlaA). We also used a new sensitive structural variation caller (which detects larger inversions, duplications, insertions, deletions and other more complex rearrangements; manuscript in preparation) and again found no differences in genomic rearrangements compared with the parental strain. While we only examined one strain, the complete absence of any detectable off-target mutations in over 5 Mbp of sequence in this strain indicates that unintended mutations are not generally being created at a high rate during these engineering steps.

Generalization of negative selection for other host strains by module replacement

The positive–negative selection cassette we created was built from several well known genetic elements (promoters, terminators, RBS sequences and antibiotic markers). In the interest of generality, potentially enabling use in other E. coli strains as well as non-E. coli bacteria, we first verified that we could simply replace the promoter and antibiotic marker without having to perform any optimization of sequences or selection conditions. We created ParaB-relE, where the negative selection would be carried out with growth on arabinose. We also switched the parental template plasmid to pKD3, encoding a chloramphenicol positive selection cassette (Supplementary Table S3). Indeed, switching between PrhaB and ParaB or chloramphenicol and kanamycin had little effect on the efficiency of the negative selection step (based on limited testing of 7–8 colonies by PCR and loss of the positive selection marker) when used in a similar two-step allelic replacement at the ompC locus of UTI89 using the previously defined selection conditions we used for UTI89 (although the antibiotic and sugar were altered as necessary).

Given that several toxins encoded by MG1655 besides relE were also potential candidates for use in UTI89, but that may be variably present in other E. coli strains, we next tested whether we could also simply replace the toxin gene in a modular fashion. We tested several other toxins (mqsR, higB, yhaV, yafQ and chpBK) with PrhaB, using the same cloning sites we used for relE to make identical fusions of the start codon of each toxin gene with the promoter and ribosome binding site of our template plasmid. We further reasoned that alternative toxins, such as those from phage or antibacterial secretion systems, might function effectively both in E. coli (thus enabling use in MG1655 and other lab strains) as well as in other bacteria. We therefore also included the holin gene (324 bp) from phage λ (12) in UTI89 and the tse2 type 6 secretion system toxin (477 bp) from Pseudomonas aeruginosa (11). These positive–negative selection cassette variants were again tested using the same two-step knockout/replacement protocol without modification of transformation or growth conditions. We found that, except for variants containing higB and yafQ (see below), all variants were usable for creating markerless mutations in UTI89.

Finally, we tested our system in different bacterial strains. We replaced the ompC gene in CFT073 (another uropathogenic E. coli clinical isolate (27)), EDL933 (an enterohemorrhagic E. coli isolate (28)), TOP2515 (a cystitis isolate that has never been reported to be manipulated on its chromosome (29)), and S. enterica serovar Typhimurium 14028S (30) with the ompC allele from UTI89 (Supplementary Table S3). In addition, we replaced the hsdS gene in E. coli MG1655 with the hsdS gene from UTI89 (Supplementary Table S3) using the P. aeruginosa tse2 gene as the negative selection system toxin. All replacements were successful as verified by PCR, growth phenotypes, and Sanger sequencing. Furthermore, for all variant cassettes and all strains tested, we used the same amplification primers, transformation conditions, and negative selection conditions (M9 with 0.2% rhamnose) that we had originally used for UTI89. Thus, our positive–negative selection cassette is indeed general and needs no optimization when used in different clinical isolates of E. coli and other Enteric bacteria.

Inducible toxins mediate high negative selection stringency

We noted in these manipulations that, in general, very few background/false positive colonies grew on rhamnose plates (Supplementary Figure S3B). Furthermore, two toxins, higB and phage λ holin, could suppress growth on rhamnose when present on a high copy plasmid but not when present in single copy on the chromosome (Supplementary Figure S4). These data suggested that different toxins vary in selection stringency, which might be useful for creating negative selection cassettes in organisms lacking tightly regulated promoters. We first tested selection stringency by calculating the fraction of viable colony forming units (CFUs) that were able to grow under restrictive conditions (smaller numbers indicate higher stringency). This test measures the frequency of colonies growing under restrictive conditions (as opposed to the actual mutation rate – see below), is naturally related to the background colonies one would observe during a typical cloning procedure, and has been used previously in rapid screens of mutability (3133). We therefore refer to these values as ‘stringency frequencies’. We plated bacteria carrying a positive–negative selection cassette in single copy in the chromosome onto each of LB, LB-antibiotic and M9-rhamnose or M9-arabinose plates. On non-restrictive plates, we quantified bacteria by plating serial dilutions of the original culture, while on restrictive plates, we concentrated 1010 CFU from 10 ml of OD600 = 1 culture and plated them all. For reference, the parental strains and the strains with the replaced alleles (both sensitive to kanamycin or chloramphenicol) were also plated to the same plates to test the stringency of positive selection on antibiotics (Figure 2A). We found that positive selection markers mediated very strong selection (stringency frequencies of 10−9–10−10) in UTI89 (Figure 2A). Among negative selection markers, different toxins, when driven by the PrhaB promoter, did indeed vary in their strength of selection, but the toxins that were usable for negative selection in single copy on the UTI89 chromosome had stringency frequencies of 10−7–10−8 (Figure 2A and Supplementary Table S6). Because the insertion site of a negative selection cassette may affect its relative copy number due to the timing of DNA replication, and copy number influences expression levels and thereby potentially stringency, we repeated these tests with the kan-PrhaB-relE selection cassette integrated into different loci in UTI89. In general, stringency did not vary much across different loci within UTI89 (Figure 2B, Supplementary Figure S5A), across different E. coli strains, or in Salmonella (Figure 2C, Supplementary Figure S5B). Notably, the tse2 toxin (11) from the Pseudomonas Type VI secretion system had the highest stringency frequency (1.96 × 10−8) at the hsdS locus in MG1655 (Figure 2C). In comparison, the highest stringency for any negative selection system reported to date for unmodified E. coli strains (MG1655 or its close relatives, all lab-adapted) is 6 × 10−7 (presumably also a stringency frequency) using a combined tetAsacB cassette (3).

Figure 2.

Figure 2.

Selection stringency tests. (A) Selection stringency tests for different toxins. Data shows the selection stringency, which is calculated as: CFU/ml on restrictive agar divided by CFU/ml on non-restrictive agar. First set of bars represents the positive selection stringency for wild type (UTI89) strains tested in each experiment. The middle set of bars represents the negative selection stringency for each strain carrying the dual selection cassette integrated into the ompC locus in UTI89. The last set of bars represents positive selection stringency for the strains in which the selection cassette has been replaced with the wild type ompC allele in a subsequent negative selection step. (B) Stringency does not vary with chromosomal locus. Data shows the positive selection stringency for wild-type strains (on the left) followed by the negative selection stringency for strains carrying the kan-PrhaB-relE cassette at different loci in UTI89 (on the right). (C) The positive–negative selection cassette is usable in different strains of E. coli and in Salmonella. Toxins and strains are as indicated. For graphs in (A–C), data represents the average of the log-transformed values of selection stringencies with error bars indicating standard deviation of the log-transformed stringency values calculated from three independent biological replicates.

We verified that CFU quantification on rhamnose plates was accurate by comparing CFU calculated by serial dilutions plated to both plain LB and M9-rhamnose; no differences were seen for any of the wild-type strains (Supplementary Table S7). Furthermore, we also tested a control in which the kanamycin gene and PrhaB were integrated into the chromosome without an accompanying toxin gene (SLC-658); this also showed no toxicity from growth on rhamnose compared with plain LB (Supplementary Table S7).

To further validate the high stringency of our system, we explored several additional lines of reasoning for the strain carrying the tse2 toxin in MG1655 (SLC-568). First, we repeated our initial stringency frequency test by plating a higher number (1011 CFU; total culture volume 100 ml, OD600 = 1) of cells onto M9-rhamnose plates; we found a similar stringency frequency (1.11 × 10−8; Supplementary Figure S6), representing growth of at least several hundred colonies in each of three replicates of this experiment.

Second, we estimated the theoretical maximum possible stringency (i.e. lowest mutation rate) based on chromosomal replication error rates for MG1655, assuming that only mutations within the negative selection cassette itself could confer breakthrough growth. The range of mutation rates for MG1655 has been reported to be between 1 × 10−3 and 3.3 × 10−3 /genome/generation (34,35); this translates into 2.2 × 10−10 to 7.1 × 10−10 mutations/nucleotide/generation. The PrhaB-tse2 cassette is 811 bp; therefore the expected mutation rate is 1.8 × 10−7 to 5.8 × 10−7 mutations/cassette/generation. Not all mutations would inactivate the toxin or disrupt its expression; this fraction is difficult to quantify, but we take as limits the expected fraction of coding mutations that would be expected to be nonsense (5%) (36) and the fraction of lacI mutations that are dominant (40%) (37) (this range encompasses the estimated fraction (20%) of missense mutations (∼70% of all de novo mutations) that are strongly deleterious (overall 14% of all de novo mutations)) (36). This gives an expected inactivation rate between 0.9 × 10−8 and 2.3×10−7/cassette/generation. In addition the toxin could be inactivated by a spontaneous large deletion, which occurs at a rate of between 1.79 × 10−9 and 2.5 × 10−7 per generation in E. coli (3840). These calculated mutation rate values are now rates at which colonies growing on rhamnose could be generated, and should be considered distinct from the stringency frequencies measured and discussed above.

Measurement of experimental rates of mutation (as opposed to stringency frequencies) is more involved; the relevant issues have been previously elucidated by Luria and Delbruck (17). We carried out a Luria–Delbruck fluctuation test to directly measure experimental rates of mutation to breakthrough growth on rhamnose. Using two different starting small culture sizes (see ‘Materials and Methods’ section), we calculated a mutation rate toward growth on rhamnose of 1.42 × 10−8, within the theoretical range based on per-nucleotide mutation rates calculated above. As a control, we also performed a Luria–Delbruck fluctuation test for mutations conferring resistance to kanamycin. The fluctuation test gave a mutation rate to resistance (1.11 × 10−10) that was lower than our rate for mutation of our negative selection cassette, in keeping with the better stringency frequency of kanamycin (2.30 × 10−10) as measured above.

As a final verification of the reproducibility of the high stringency of our negative selection cassettes, we performed the same cloning to create a separate but isogenic strain to SLC-568 (designated SLC-657) and assayed its frequency of breakthrough growth on rhamnose (by plating 1010 CFU), again arriving at 2.38 × 10−8 (Supplementary Figure S6). We therefore conclude that the negative selection stringency (mutation frequencies) of the tse2 system in MG1655 is indeed better than previously published negative selection systems and that its mutation rate is very close to the theoretical minimum mutation rate expected for a system of this size.

Negative selection by inducible toxins enables unmarked allelic exchange

We then asked whether high negative selection stringency could be leveraged to enable ‘difficult’ applications such as generalized transduction, where positive selection markers are traditionally required (though recently P1 transduction has been performed using a tetA-sacB negative selection cassette (3)). We performed generalized transduction with a phage P1 lysate (generated from unmodified wild-type MG1655) to complement an hsdS knockout (replaced with kan-PrhaB-tse2; host strain MG1655 (SLC-568)); selection on rhamnose resulted in growth of >2000 transductants (out of ∼109 plated CFU), of which 16/16 (100%) were positive for restoration of the wild-type hsdS locus by PCR. In contrast, no colonies were observed when the P1 phage lysate itself was plated, and 200 colonies were observed on rhamnose when only SLC-568 (the recipient strain) cells were plated. The estimated efficiency of >90% compares favorably with the 56–63% efficiency reported using tetA-sacB (3), consistent with the higher stringency of our system. Transduction using negative selection improves on traditional generalized transduction because the resulting transductants contain no residual antibiotic marker or scar and because unmarked, unmodified strains can be used as the donor.

However, clinical strains are often difficult to manipulate using generalized transduction, either due to phage specificity (10) or low efficiency. We were unable to perform a restoration of the ompC locus in UTI89 using P1-mediated generalized transduction from unmodified MG1655, and we were unable to create a P1 transducing lysate from either UTI89 or CFT073. We speculated that we could circumvent these difficulties by mimicking transduction using transformation of sheared genomic DNA (Figure 3A). Such a strategy would remove limitations imposed by the need to create a transducing lysate from the donor strain that could successfully infect the recipient strain. We refer to this phage-free ‘transduction’ as generalized allelic exchange. Reimplementation of generalized transduction in this way would not only expand its potential use to strains without transducing phage, but could also potentially lead to a novel application for strain hybridization we term ‘mass allelic exchange’ (see 'Discussion' section). We directly transformed sheared genomic DNA from CFT073 by electroporation into SLC-557 (UTI89 ompC::kan-PrhaB-relE) expressing Red recombinase. Selection on rhamnose followed by PCR screening resulted in identification of the desired recombinant, where CFT073 genomic DNA had replaced the ompC locus in UTI89, in 2/55 colonies (3.6%). We sequenced the DNA flanking ompC in these two clones and found that the precise recombination breakpoints were different (Figure 3B), as expected from the random shearing of the CFT073 genomic DNA. Of note, selection stringency is paramount in this application, as only 0.2% of the input DNA (the CFT073 genome is ∼5 Mbp, while DNA was sheared to 10 kb) is ‘on-target’ for replacement of the negative selection module at the ompC locus. Therefore, our negative selection system enables phage-free generalized allelic exchange between different strains of E. coli, substituting for generalized transduction.

Figure 3.

Figure 3.

Generalized allelic exchange. (A) Schematic of the generalized allelic exchange strategy. A negative selection marker (open rectangle) is inserted into the recipient strain (UTI89 here) at a defined locus (thick black arc, ompC in this case). Sheared whole genomic DNA from another strain (CFT073 here, red lines) is transformed into UTI89, and growth on restrictive conditions allows selection for replacement of the targeted locus (thick red arc) with no residual marker or scar. (B) Recombination breakpoints for ompC allelic exchange from CFT073 to UTI89. A schematic of the gene organization of 6kb surrounding the ompC gene in UTI89 is shown at the top. The rectangles below depict recombination breakpoints of two clones (SLC-578 and SLC-579) as determined by Sanger sequencing of the ompC gene and flanking sequence. In each, a central region of CFT073 sequence encompassing the ompC gene is indicated in gray, while the surrounding UTI89 sequence is indicated by black. (C) Schematic for strain hybridization by mass generalized allelic exchange. A library of UTI89 with random insertions of the negative selection system (thick black arcs) throughout the chromosome is subjected to the process in (A) en masse. Growth on restrictive conditions would result in a library of hybrids in which each locus originally carrying the negative selection system insertion has been replaced by the homologous locus from CFT073 (thick red arcs).

DISCUSSION

There is a surprising lack of stringent and general negative selection tools, even for cloning strains of E. coli. Indeed, several recent reports (3,4,41) have highlighted the need for better negative selection systems in E. coli. In general, previous systems have suffered from low stringency, a requirement for modified hosts, or a need to optimize selection conditions separately for each host strain. In the context of understanding the genetic basis of pathogenesis in primary clinical isolates, these drawbacks have made negative selection largely impractical. Particularly for loss of function studies, high frequency site-specific recombinase systems such as Cre-lox (42) and Flp-FRT (14) can be leveraged to manipulate chromosomal loci with only a small scar left behind, largely alleviating the need for negative selection. However, allelic replacement experiments in clinical isolates to date have been relatively rare compared with loss of function experiments; when done, they are limited either due to laborious construction methods or due to use of a linked positive selection marker.

General and stringent negative selection systems could remove these limitations on direct genetic studies of unmodified clinical pathogenic isolates. Recent reports of new negative selection systems have continued to focus on cloning strains of E. coli (3,4,41). These reported systems may be applicable to clinical strains, though the requirement for or complexity of strain-specific optimization is still unclear. Our system, due to its modular design, can accommodate essentially any toxin gene, whether from a toxin–antitoxin system, phage system, secretion system or potentially other as yet unknown genetic modules. We have demonstrated that the majority of tested toxins can be converted without optimization into effective negative selection systems that vary little in their efficiency and stringency. Furthermore, we have shown that no optimization is needed to use these systems in multiple clinical strains of E. coli, including a previously uncharacterized clinical isolate and a well-studied lab strain of E. coli, as well as in S. enterica Serovar Typhimurium. Therefore, we anticipate that little or no optimization will be needed for the vast majority of E. coli strains.

The stringency and generality of our design certainly depends on the availability of tightly regulated promoters in E. coli and other enteric bacteria. We have shown that both the rhamnose and arabinose promoters are sufficiently well controlled in single copy to enable bacterial growth when not specifically induced and are very stringent in preventing growth when induced. Based on similar promoter dynamics when present in single copy on the chromosome, we also expect the widely used lac promoter to be a viable promoter for driving toxin genes (43,44). Additional promoters native to E. coli and other Enterobacteria that respond to pH (45), temperature (46) or oxygenation conditions (47) could also be used. This intentional modularity of transcriptional control, enabling use of different promoters, is particularly valuable for generalization to other bacterial systems.

We noted that, while four of six tested TA system genes could be immediately used in our strategy, higB was only viable for negative selection when present on a plasmid in multiple copies, while yafQ could not be used for negative selection at all. Interestingly, this implies that the induced expression level from a single copy is insufficient to prohibit growth. In all cases, the 5′ untranslated sequence and ribosome binding sequence were identical among our constructs. Therefore, we suspect that the observed variation in toxicity at the same levels of promoter induction could be due to additional transcriptional (such as pause or termination sites), translational (such as rare codons or mRNA stability), or post-translational effects (such as protein folding or intrinsic toxicity thresholds) unique to individual toxins. Regardless of the mechanism, however, our results imply that toxins differ in their ‘effective toxicity’ when we controlled copy number and promoter induction; this concept of a threshold for toxicity has been introduced previously in relation to levels of the antitoxin within a cell (48). Because the toxins themselves are known to function in a broad range of bacteria (25,26) as well as eukaryotes (yeast) (49), toxin genes that are permissive for cell growth at low transcription levels may enable the use of less optimal (less tight) inducible promoters. Furthermore, due to the large number of toxin-antitoxin systems (and other bacterial toxins from phages and secretion systems) present in public databases from genome sequencing efforts (cataloged, for example, in TADB (50), RASTA (51) and the PanDaTox (52) databases), it seems likely that effective toxicity will vary greatly, especially if codon usage or transcriptional controls within the coding sequences are not removed. Indeed, one could imagine collecting a database of toxin genes that are characterized based on the threshold transcriptional or induction level required for toxicity; this could then be queried to identify reasonable candidates for use with existing inducible promoters to generate additional negative selection systems depending on the available genetic tools in a given system.

We have performed numerous chromosomal manipulations in UTI89 and other lab and clinical E. coli strains using all variants of these negative selection systems. The construction of definitive strains for genetic tests in clinical isolates is no longer less convenient than in lab strains. However, we have relied on the widely used Red recombinase system from phage λ. Expression of this system in E. coli is known to be mutagenic, and we use it in each of two steps to manipulate the chromosome. While following published recommendations for limiting the length of recombinase expression (22), the possibility still remains that unselected mutations may be induced that could confound interpretation of downstream experiments. To definitively address this possibility, we used full genome sequencing of one of our allelic replacement strains and found no unexpected mutations of any class, including SNPs, small indels, or large scale rearrangements. Furthermore, phenotypic testing of a panel of FimH mutants, all constructed by allelic replacement, showed perfect concordance with previously published results (in which the strains carried a linked positive selection marker).

Our system exceeds the reported stringency of selection (measured using stringency frequencies) of the next best reported system (tetA–sacB in MG1655, W3110 and/or DH10B; 6 × 10−7 (3)) by up to 60× in lab strains of E. coli (kan-PrhaB-tse2 in MG1655; 1.11×10−8 as measured by plating 1011 CFU); this begins to approach (within 5×) the selection stringency frequency of positively selectable antibiotic markers in some strains we have tested (kanamycin in E. coli EDL933; 2.40 × 10−9). The stringency frequency in clinical strains (kan-PrhaB-relE in UTI89; 3.31 × 10−8) is 17-fold better than the tetA–sacB system as measured in lab strains, and it remains high without the need for strain- or species-specific optimization of transformation or growth conditions. We have further measured mutation rates for growth on restrictive conditions and find that these also correspond well with expected minimum theoretical rates for inactivating mutations as calculated by previously reported genomic mutation rates in E. coli (34,35). We have used the more easily measured stringency frequencies to compare the different systems. This is similar to previous screens of mutation rates in E. coli (3133), though perhaps not fully rigorous. We note, however, that our stringency frequencies indicate that kanamycin is more stringent than our negative selection system (namely, using PrhaB-tse2), and we obtain the same result when making a more careful measurement of mutation rate. Furthermore, measurement of mutation frequencies is able to detect mutator strains as those with higher mutation frequencies (32,33). Thus our and previous data both argue that measurement of stringency (mutation) frequencies is reasonable for a relative comparison of mutation rates, a notion further supported by additional theoretical analysis of fluctuation tests for mutation rates (53).

Although mutation rates and frequencies are not directly comparable, we note here two further observations inspired by a discussion with one of the reviewers. First, our expected minimum mutation rate (0.9 × 10−8) and calculated mutation rate (2.45 × 10−8) are both similar to the mutation rates towards resistance to phage measured by Luria and Delbruck themselves (1.1 × 10−8–4.1 × 10−8) (17). In looking closely at their original data, the culture sizes (0.2–10 ml with 1 × 108–4×1010 CFU in each) are also similar to ours. The ‘average’ number of resistant bacteria measured in each of their cultures, due to the jackpot effect, is higher than the number of resistant bacteria measured in the majority of their cultures (50–95% of cultures in each experiment have fewer resistant bacteria than the average), which forms part of the motivation for developing the fluctuation test instead of relying on mutation frequencies. Furthermore, using a ‘simple expectation’ calculation of (# of bacteria) × (mutation rate) as an arbitrary cutoff, Luria and Delbruck observe an average frequency of mutant CFU above this number, while we see frequencies of background growth near or below this number. This again highlights the distinction between mutation frequencies and rates, arguing that direct measurements of rates are more reliable. This observation may also indicate that we have not sampled enough independent cultures to capture the high variance in the number of mutations present per culture; unfortunately, due to the way we performed the fluctuation tests, we did not collect data on the number of resistant CFU per culture.

However, as a second point, we do note another interesting set of data that provides an informative comparison with our work. Recently, Luria–Delbruck fluctuation tests were performed on E. coli for resistance to rifampicin, in the context of examining the relationship of cell density with mutation rate (54); the measured range of mutation rates to rifampicin resistance spanned ∼2 × 10−9 to 1 × 10−8. Several earlier studies have examined mutation to rifampicin resistance in E. coli using simple frequency tests; two large studies of E. coli isolates obtained mutation frequencies of 2.5 × 10−9 to 1 × 10−8 (31) and 5 × 10−9 to 5 × 10−8 (32), and a third study of E. coli and Salmonella obtained <1 × 10−8 to 5 × 10−8 (33). The mutation rate and frequency studies differ in the strains and media used, but overall the results match our data in that the (more simply measured) mutation frequencies are quantitatively similar to the mutation rates measured using the fluctuation tests. Furthermore, we also note a quantitative similarity between the stringency frequency (2.30 × 10−10) and mutation rate (1.11 × 10−10) that we measured for kanamycin resistance. Again, we note only that the quantitative similarity between stringency (mutation) frequencies and mutation rates that we observe in our data is mirrored in published data and present in multiple experiments with different selection cassettes; we therefore suggest that this likely indicates our measurements are accurate. However, the conceptual separation of rates and frequencies must still be maintained. For most other negative selection cassettes, however, only stringency frequencies are available, and the following discussion of stringencies should be considered in the context of the foregoing discussion.

Unlike several commonly used systems including sacB or tetA, our negative selection cassettes appear to come very close to the theoretical maximal stringency (based on mutation rate) for genes of their size, which may account for their superiority over these systems. Although the theoretical maximal stringency for sacB is between 1.67 × 10−8 and 4.25 × 10−7, the reported stringencies (usually using stringency frequency) measured for this system are much lower (E. coli HS996; 2.3 × 10−5 (20)). Similarly, although the theoretical maximal stringency for tetA is between 1.33 × 10−8 and 3.40 × 10−7, the reported stringencies in different host strains and at different nickel concentrations vary between 10−4 and 10−6 (1). These systems fall short of expectations in their behavior by 100- to 1000-fold (assuming the correspondence between mutation frequencies and rates should also hold for these systems). Why is our system more robust to selection conditions and changing host strains than previous systems? We speculate that several aspects of our design contribute to higher stringency. First, we are using toxins that have presumably evolved for the purpose of efficiently stopping cell growth; in contrast, tetracycline resistance and the sacB gene were evolved for other purposes, and their use for negative selection is based on leveraging other properties they confer on cells. Second, we have included several features that enable us to have ‘tight’ control over toxin induction, including transcriptional terminators and the well characterized rhamnose and arabinose promoters to reduce basal uninduced expression; and the use of the native rhaB RBS sequence and minimal media with a single carbon source to ensure good ‘induced’ expression. The combination of an inducible toxin, therefore, for arresting cell growth becomes reliably stringent in different strains and species so long as we can maintain low or no basal expression. A further potential improvement may also be gained by using genomic templates for amplification of the positive–negative selection cassette, thereby eliminating the possibility of the template plasmids conferring resistance during recombination, despite their reliance on conditional replication origins. In contrast, the variable selection mediated by tetA or sacB may be due to their overall expression levels in and membrane properties of different strains. We finally note that generality and high stringency is common for negative selection systems that require special host genotypes (such as tolC (41), thyA (6) and neo-rpsl (7)); we have now provided this benefit to unmodified lab and clinical isolates of E. coli and Salmonella and potentially other Enterobacteriaceae.

In practice, stringency does seem to vary for our negative selection system in different experimental settings, but no more than it does for positive selection markers. For example, in the reimplementation of P1 transduction using negative selection with MG1655 carrying the tse2 toxin, we saw a higher number of background colonies (200 out of 109 CFU, giving a background frequency of 5 × 10−6). However, we have observed this result with positive selection markers as well. Kanamycin stringency frequency is 2.3 × 10−10, but use of the Datsenko and Wanner protocol with Red recombinase leads to background colonies (not the desired recombinants and not able to regrow on kanamycin upon passage) growing on kanamycin plates at a frequency higher than this. This discrepancy in kanamycin stringency frequency may be due to the mutagenic effect of Red recombinase expression, although we and others (22) see no evidence for higher mutation when Red recombinase expression is limited in time. Similarly, we have no good explanation for the higher background rate when performing P1 transduction compared to simple titering on rhamnose plates. However, the variation in stringency seems to apply to both negative and positive selection markers.

The generally high stringency of our negative selection system is sufficient to enable previously inaccessible large scale genomic manipulations. We have carried out an initial proof of concept experiment which shows that the system is powerful enough to select for clones of UTI89 that have been transformed with the correct 10 kb fragment of the CFT073 genome (0.2% of the total genome) and then recombined it into the correct ompC locus. Potentially, additional development of this technology to reduce background colonies and improve genomic DNA transfer length and efficiency, such as by using Hfr strains for conjugation or transducing lysates, would enable the transfer of entire swaths of genomic DNA from unrelated strains, subject to flanking homology. Combined with transposon engineering, a pool of strains containing randomly inserted negative selectable markers could then be used to create libraries of wholesale allelic exchange mutants in a procedure we term ‘mass allelic exchange’ (Figure 3C). This would finally enable a practical genetic technology for assigning phenotypic consequences to intra-species polymorphisms as well as close inter-species differences. This enabling of sexual genetics in non-naturally competent bacteria could be especially powerful for dissecting the differences between strains of E. coli (or other bacteria) that cause different diseases in humans, a longstanding and perplexing problem.

One drawback of relE and several of the other toxins used is that they tend to be bacteriostatic instead of bacteriocidal. This necessitates an extra step of colony purification during standard cloning procedures and must be accounted for in large-scale allelic exchange experiments. However, switching to a bacteriocidal toxin should alleviate this drawback and increase the efficiency and convenience of this system, while possibly simultaneously further broadening the host range.

In summary, we have designed a modular and general negative selection system that is both extremely powerful and highly versatile, potentially enabling generalization to other bacteria. The high selection stringency enables novel and improved applications including generalized allelic exchange between arbitrary E. coli strains (or even between different species). This negative selection module should therefore be particularly useful in the dissection of bacterial virulence as well as regulation, physiology, speciation, and evolution. With further optimization, libraries of strain hybrids could be created that would surmount drawbacks inherent to positive selection based hybrids (55), enabling sexual genetic strategies to systematically evaluate the thousands of DNA polymorphisms that typically separate unrelated clinical isolates of E. coli and other bacteria that are not naturally competent.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

SUPPLEMENTARY DATA

Acknowledgments

We thank Linda Kenney, Bill Burkholder, Kimberly Kline and members of the Chen lab for helpful discussions and suggestions on the method and the manuscript. We are also indebted to an anonymous reviewer for the insights and discussions regarding the measurement and interpretation of selection stringency. Salmonella enterica Serovar Typhimurium 14028S was kindly provided by Linda Kenney. P1vir phage lysate was kindly provided by Shu Sin Chng.

FUNDING

National Research Foundation, Prime Minister's Office, Singapore under its NRF Research Fellowship Scheme [NRF-RF2010-10 to S.L.C.]; Genome Institute of Singapore (GIS)/Agency for Science, Technology and Research (A*STAR). Funding for open access charge: National Research Foundation, Prime Minister's Office, Singapore under its NRF Research Fellowship Scheme [NRF-RF2010-10 to S.L.C.].

Conflict of interest statement. None declared.

REFERENCES

  • 1.Podolsky T., Fong S.T., Lee B.T. Direct selection of tetracycline-sensitive Escherichia coli cells using nickel salts. Plasmid. 1996;36:112–115. doi: 10.1006/plas.1996.0038. [DOI] [PubMed] [Google Scholar]
  • 2.Gay P., Le Coq D., Steinmetz M., Ferrari E., Hoch J.A. Cloning structural gene sacB, which codes for exoenzyme levansucrase of Bacillus subtilis: expression of the gene in Escherichia coli. J. Bacteriol. 1983;153:1424–1431. doi: 10.1128/jb.153.3.1424-1431.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li X.T., Thomason L.C., Sawitzke J.A., Costantino N., Court D.L. Positive and negative selection using the tetA-sacB cassette: recombineering and P1 transduction in Escherichia coli. Nucleic Acids Res. 2013;41:e204. doi: 10.1093/nar/gkt1075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wang H., Bian X., Xia L., Ding X., Muller R., Zhang Y., Fu J., Stewart A.F. Improved seamless mutagenesis by recombineering using ccdB for counterselection. Nucleic Acids Res. 2014;42:e37. doi: 10.1093/nar/gkt1339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.DeVito J.A. Recombineering with tolC as a selectable/counter-selectable marker: remodeling the rRNA operons of Escherichia coli. Nucleic Acids Res. 2008;36:e4. doi: 10.1093/nar/gkm1084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wong Q.N., Ng V.C., Lin M.C., Kung H.F., Chan D., Huang J.D. Efficient and seamless DNA recombineering using a thymidylate synthase A selection system in Escherichia coli. Nucleic Acids Res. 2005;33:e59. doi: 10.1093/nar/gni059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Heermann R., Zeppenfeld T., Jung K. Simple generation of site-directed point mutations in the Escherichia coli chromosome using Red(R)/ET(R) Recombination. Microb. Cell Factor. 2008;7:14. doi: 10.1186/1475-2859-7-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Tashiro Y., Fukutomi H., Terakubo K., Saito K., Umeno D. A nucleoside kinase as a dual selector for genetic switches and circuits. Nucleic Acids Res. 2011;39:e12. doi: 10.1093/nar/gkq1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Alteri C.J., Mobley H.L. Escherichia coli physiology and metabolism dictates adaptation to diverse host microenvironments. Curr. Opin. Microb. 2012;15:3–9. doi: 10.1016/j.mib.2011.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Battaglioli E.J., Baisa G.A., Weeks A.E., Schroll R.A., Hryckowian A.J., Welch R.A. Isolation of generalized transducing bacteriophages for uropathogenic strains of Escherichia coli. Appl. Environ. Microbiol. 2011;77:6630–6635. doi: 10.1128/AEM.05307-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hood R.D., Singh P., Hsu F., Guvener T., Carl M.A., Trinidad R.R., Silverman J.M., Ohlson B.B., Hicks K.G., Plemel R.L., Li M., et al. A type VI secretion system of Pseudomonas aeruginosa targets a toxin to bacteria. Cell Host Microbe. 2010;7:25–37. doi: 10.1016/j.chom.2009.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Grundling A., Manson M.D., Young R. Holins kill without warning. Proc. Natl. Acad. Sci. U.S.A. 2001;98:9348–9352. doi: 10.1073/pnas.151247598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Haldimann A., Wanner B.L. Conditional-replication, integration, excision, and retrieval plasmid-host systems for gene structure-function studies of bacteria. J. Bacteriol. 2001;183:6384–6393. doi: 10.1128/JB.183.21.6384-6393.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Datsenko K.A., Wanner B.L. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc. Natl. Acad. Sci. U.S.A. 2000;97:6640–6645. doi: 10.1073/pnas.120163297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Chen S.L., Hung C.S., Pinkner J.S., Walker J.N., Cusumano C.K., Li Z., Bouckaert J., Gordon J.I., Hultgren S.J. Positive selection identifies an in vivo role for FimH during urinary tract infection in addition to mannose binding. Proc. Natl. Acad. Sci. U.S.A. 2009;106:22439–22444. doi: 10.1073/pnas.0902179106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Iqbal Z., Caccamo M., Turner I., Flicek P., McVean G. De novo assembly and genotyping of variants using colored de Bruijn graphs. Nat. Genet. 2012;44:226–232. doi: 10.1038/ng.1028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Luria S.L.a.D., M. Mutations of bacteria from virus sensitivity to virus resistance. Genetics. 1943;28:491–511. doi: 10.1093/genetics/28.6.491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Thomason L.C., N.C., Court D.L. Current Protocols in Molecular Biology. NY: John Wiley & Sons, Inc; 2007. [Google Scholar]
  • 19.Yamaguchi Y., Inouye M. Regulation of growth and death in Escherichia coli by toxin-antitoxin systems. Nat. Rev. Microbiol. 2011;9:779–790. doi: 10.1038/nrmicro2651. [DOI] [PubMed] [Google Scholar]
  • 20.Stavropoulos T.A., Strathdee C.A. Synergy between tetA and rpsL provides high-stringency positive and negative selection in bacterial artificial chromosome vectors. Genomics. 2001;72:99–104. doi: 10.1006/geno.2000.6481. [DOI] [PubMed] [Google Scholar]
  • 21.Bochner B.R., Huang H.C., Schieven G.L., Ames B.N. Positive selection for loss of tetracycline resistance. J. Bacteriol. 1980;143:926–933. doi: 10.1128/jb.143.2.926-933.1980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Murphy K.C., Campellone K.G. Lambda Red-mediated recombinogenic engineering of enterohemorrhagic and enteropathogenic E. coli. BMC Mol. Biol. 2003;4:11. doi: 10.1186/1471-2199-4-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Liang R., Liu J. Scarless and sequential gene modification in Pseudomonas using PCR product flanked by short homology regions. BMC Microbiol. 2010;10:209. doi: 10.1186/1471-2180-10-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zhang Y., Buchholz F., Muyrers J.P., Stewart A.F. A new logic for DNA engineering using recombination in Escherichia coli. Nat. Genet. 1998;20:123–128. doi: 10.1038/2417. [DOI] [PubMed] [Google Scholar]
  • 25.Zhang X.Z., Yan X., Cui Z.L., Hong Q., Li S.P. mazF, a novel counter-selectable marker for unmarked chromosomal manipulation in Bacillus subtilis. Nucleic Acids Res. 2006;34:e71. doi: 10.1093/nar/gkl358. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Al-Hinai M.A., Fast A.G., Papoutsakis E.T. Novel system for efficient isolation of Clostridium double-crossover allelic exchange mutants enabling markerless chromosomal gene deletions and DNA integration. Appl. Environ. Microbiol. 2012;78:8112–8121. doi: 10.1128/AEM.02214-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Welch R.A., Burland V., Plunkett G. 3rd, Redford P., Roesch P., Rasko D., Buckles E.L., Liou S.R., Boutin A., Hackett J., et al. Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 2002;99:17020–17024. doi: 10.1073/pnas.252529799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Riley L.W., Remis R.S., Helgerson S.D., McGee H.B., Wells J.G., Davis B.R., Hebert R.J., Olcott E.S., Johnson L.M., Hargrett N.T., et al. Hemorrhagic colitis associated with a rare Escherichia coli serotype. N. Engl. J. Med. 1983;308:681–685. doi: 10.1056/NEJM198303243081203. [DOI] [PubMed] [Google Scholar]
  • 29.Chen S.L., Wu M., Henderson J.P., Hooton T.M., Hibbing M.E., Hultgren S.J., Gordon J.I. Genomic diversity and fitness of E. coli strains recovered from the intestinal and urinary tracts of women with recurrent urinary tract infection. Sci. Transl. Med. 2013;5:13. doi: 10.1126/scitranslmed.3005497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Jarvik T., Smillie C., Groisman E.A., Ochman H. Short-term signatures of evolutionary change in the Salmonella enterica serovar typhimurium 14028 genome. J. Bacteriol. 2010;192:560–567. doi: 10.1128/JB.01233-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Denamur E., Bonacorsi S., Giraud A., Duriez P., Hilali F., Amorin C., Bingen E., Andremont A., Picard B., Taddei F., et al. High frequency of mutator strains among human uropathogenic Escherichia coli isolates. J. Bacteriol. 2002;184:605–609. doi: 10.1128/JB.184.2.605-609.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Matic I., Radman M., Taddei F., Picard B., Doit C., Bingen E., Denamur E., Elion J. Highly variable mutation rates in commensal and pathogenic Escherichia coli. Science. 1997;277:1833–1834. doi: 10.1126/science.277.5333.1833. [DOI] [PubMed] [Google Scholar]
  • 33.LeClerc J.E., Li B., Payne W.L., Cebula T.A. High mutation frequencies among Escherichia coli and Salmonella pathogens. Science. 1996;274:1208–1211. doi: 10.1126/science.274.5290.1208. [DOI] [PubMed] [Google Scholar]
  • 34.Drake J.W. A constant rate of spontaneous mutation in DNA-based microbes. Proc. Natl. Acad. Sci. U.S.A. 1991;88:7160–7164. doi: 10.1073/pnas.88.16.7160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Lee H., Popodi E., Tang H., Foster P.L. Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc. Natl. Acad. Sci. U.S.A. 2012;109 doi: 10.1073/pnas.1210309109. doi:10.1073/pnas.1210309109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kryukov G.V., Pennacchio L.A., Sunyaev S.R. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am. J. Hum. Genet. 2007;80:727–739. doi: 10.1086/513473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Schaaper R.M., Dunn R.L. Spontaneous mutation in the Escherichia coli lacI gene. Genetics. 1991;129:317–326. doi: 10.1093/genetics/129.2.317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Uematsu N., Matsuoka C., Agemizu Y., Nagoshi E., Yamamoto K. Asymmetric crossing over in the spontaneous formation of large deletions in the tonB-trp region of the Escherichia coli K-12 chromosome. Mol. Gen. Genet. 1999;261:523–529. doi: 10.1007/s004380050996. [DOI] [PubMed] [Google Scholar]
  • 39.Albertini A.M., Hofer M., Calos M.P., Miller J.H. On the formation of spontaneous deletions: the importance of short sequence homologies in the generation of large deletions. Cell. 1982;29:319–328. doi: 10.1016/0092-8674(82)90148-9. [DOI] [PubMed] [Google Scholar]
  • 40.Schaaper R.M., Danforth B.N, Glickman B.W. Mechanisms of spontaneous mutagenesis: an analysis of the spectrum of spontaneous mutation in the Escherichia coli lacI gene. J. Mol. Biol. 1986;189:273–284. doi: 10.1016/0022-2836(86)90509-7. [DOI] [PubMed] [Google Scholar]
  • 41.Gregg C.J., Lajoie M.J., Napolitano M.G., Mosberg J.A., Goodman D.B., Aach J., Isaacs F.J., Church G.M. Rational optimization of tolC as a powerful dual selectable marker for genome engineering. Nucleic Acids Res. 2014;42:4779–4790. doi: 10.1093/nar/gkt1374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Tuntufye H.N., Goddeeris B.M. Use of lambda Red-mediated recombineering and Cre/lox for generation of markerless chromosomal deletions in avian pathogenic Escherichia coli. FEMS Microbiol. Lett. 2011;325:140–147. doi: 10.1111/j.1574-6968.2011.02421.x. [DOI] [PubMed] [Google Scholar]
  • 43.Novick A., Weiner M. Enzyme induction as an all-or-none phenomenon. Proc. Natl. Acad. Sci. U.S.A. 1957;43:553–566. doi: 10.1073/pnas.43.7.553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Siegele D., Hu J.C. Gene expression from plasmids containing the araBAD promoter at subsaturating inducer concentrations represents mixed populations. Proc. Natl. Acad. Sci. U.S.A. 1997;94:8168–8172. doi: 10.1073/pnas.94.15.8168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Chou C.H., Aristidpu A.A., Meng S.Y., Bennett G.N., San K.Y. Characterization of a pH-inducible promoter system for high-level expression of recombinant proteins in Escherichia coli. Biotechnol. Bioeng. 1995;47:186–192. doi: 10.1002/bit.260470210. [DOI] [PubMed] [Google Scholar]
  • 46.Valdez-Cruz N.A., Caspeta L., Perez N.O., Ramirez O.T., Trujillo-Roldan M.A. Production of recombinant proteins in E. coli by the heat inducible expression system based on the phage lambda pL and/or pR promoters. Microb. Cell Factor. 2010;9:18. doi: 10.1186/1475-2859-9-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Khosla C., Curtis J.E., Bydalek P., Swartz J.R., Bailey J.E. Expression of recombinant proteins in Escherichia coli using an oxygen-responsive promoter. Nat. Biotechnol. 1990;8:554–558. doi: 10.1038/nbt0690-554. [DOI] [PubMed] [Google Scholar]
  • 48.Rotem E., Loinger A., Ronin I., Levin-Reisman I., Gabay C., Shoresh N., Biham O., Balaban N.Q. Regulation of phenotypic variability by a threshold-based mechanism underlies bacterial persistence. Proc. Natl. Acad. Sci. U.S.A. 2010;107:12541–12546. doi: 10.1073/pnas.1004333107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Yang J., Jiang W., Yang S. mazF as a counter-selectable marker for unmarked genetic modification of Pichia pastoris. FEMS Yeast Res. 2009;9:600–609. doi: 10.1111/j.1567-1364.2009.00503.x. [DOI] [PubMed] [Google Scholar]
  • 50.Shao Y., Harrison E.M., Bi D., Tai C., He X., Ou H.Y., Rajakumar K., Deng Z. TADB: a web-based resource for Type 2 toxin-antitoxin loci in bacteria and archaea. Nucleic Acids Res. 2011;39:D606–D611. doi: 10.1093/nar/gkq908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Sevin E.W., Barloy-Hubler F. RASTA-Bacteria: a web-based tool for identifying toxin-antitoxin loci in prokaryotes. Genome Biol. 2007;8:R155. doi: 10.1186/gb-2007-8-8-r155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Amitai G., Sorek R. PanDaTox: a tool for accelerated metabolic engineering. Bioengineered. 2012;3:218–221. doi: 10.4161/bioe.20431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Stewart F.M. Fluctuation tests: how reliable are the estimates of mutation rates. Genetics. 1994;137:1139–1146. doi: 10.1093/genetics/137.4.1139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Krasovec R., Belavkin R.V., Aston J.A., Channon A., Aston E., Rash B.M., Kadirvel M., Forbes S., Knight C.G. Mutation rate plasticity in rifampicin resistance depends on Escherichia coli cell-cell interactions. Nat. Commun. 2014;5:3742. doi: 10.1038/ncomms4742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Freddolino P.L., Goodarzi H., Tavazoie S. Revealing the genetic basis of natural bacterial phenotypic divergence. J. Bacteriol. 2014;196:825–839. doi: 10.1128/jb.01039-13. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SUPPLEMENTARY DATA

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES