Altered expression of a quality control protease in E. coli reshapes the in vivo mutational landscape of a model enzyme

Samuel Thompson; Yang Zhang; Christine Ingle; Kimberly A Reynolds; Tanja Kortemme

doi:10.7554/eLife.53476

. 2020 Jul 23;9:e53476. doi: 10.7554/eLife.53476

Altered expression of a quality control protease in E. coli reshapes the in vivo mutational landscape of a model enzyme

Samuel Thompson ^1,^✉, Yang Zhang ², Christine Ingle ³, Kimberly A Reynolds ^3,⁴, Tanja Kortemme ^1,^2,^5,^✉

Editors: Sarel Jacob Fleishman⁶, Patricia J Wittkopp⁷

PMCID: PMC7377907 PMID: 32701056

Abstract

Protein mutational landscapes are shaped by the cellular environment, but key factors and their quantitative effects are often unknown. Here we show that Lon, a quality control protease naturally absent in common E. coli expression strains, drastically reshapes the mutational landscape of the metabolic enzyme dihydrofolate reductase (DHFR). Selection under conditions that resolve highly active mutants reveals that 23.3% of all single point mutations in DHFR are advantageous in the absence of Lon, but advantageous mutations are largely suppressed when Lon is reintroduced. Protein stability measurements demonstrate extensive activity-stability tradeoffs for the advantageous mutants and provide a mechanistic explanation for Lon’s widespread impact. Our findings suggest possibilities for tuning mutational landscapes by modulating the cellular environment, with implications for protein design and combatting antibiotic resistance.

Research organism: E. coli

Introduction

Natural protein sequences are constrained by pressures to maintain required structures and functions within a complex cellular environment. However, key cellular factors shaping protein sequences (such as interactions with cellular binding partners or with the proteostasis machinery) are often unknown. To characterize functional constraints, it has been useful to determine mutational landscapes of proteins, which we define here as the effects on growth of every possible single amino acid mutation in the protein, via deep mutational scanning (Boucher et al., 2016; Fowler and Fields, 2014). Deep mutational scanning studies have provided insights into evolution of new protein functions (McLaughlin et al., 2012; Stiffler et al., 2015; Wrenbeck et al., 2017), protein design (Tinberg et al., 2013; Whitehead et al., 2012), functional trade-offs (Klesmith et al., 2017; Steinberg and Ostermeier, 2016), and adaptation to altered environments (Hietpas et al., 2013). With a few exceptions (Bandaru et al., 2017; Hietpas et al., 2013; Jiang et al., 2013; Stiffler et al., 2015), however, these studies find a general tolerance to mutation for residues outside of active sites and binding interfaces (Araya et al., 2012; Boucher et al., 2016; Klesmith et al., 2017; Roscoe et al., 2013; Wrenbeck et al., 2017) that is often explained by the absence of key environmental constraints under the selection conditions (Bandaru et al., 2017; Jiang et al., 2013; Stiffler et al., 2015).

To study the impact of multiple constraints on mutational tolerance during selection, we chose E. coli dihydrofolate reductase (DHFR) as a model system. DHFR is an essential enzyme within folate metabolism that reduces dihydrofolate to tetrahydrofolate and is necessary for thymidine production. Using this activity as the basis for an in vivo selection assay (Reynolds et al., 2011), we aimed first to measure a mutational landscape for DHFR and then to determine how a change to the cellular environment might affect the landscape. Because DHFR is known to progress through multiple conformational states during catalysis (Boehr et al., 2006; Sawaya and Kraut, 1997; Figure 1—figure supplement 1), we expected the mutational landscape of DHFR to be constrained by the requirement to adopt these different conformations. Moreover, prior work had suggested DHFR is impacted by cellular constraints such as protein quality control (Bershtein et al., 2013) and the build-up of a toxic metabolic intermediate (Schober et al., 2019). We hence expected deep mutational scanning to reveal a highly constrained mutational landscape for DHFR that would contrast with the mutational tolerance observed in other systems.

Results

As the basis for our studies, we first sought to establish highly sensitive selection conditions for DHFR function that would be calibrated to DHFR enzymatic velocity (rate of DHF conversion per molecule of DHFR) and capable of resolving mutants with velocities near-to or faster-than wild-type. We anticipated that we would need to control DHFR protein expression (intracellular abundance) levels because two prior studies that modified the chromosomal DHFR gene had reported an overall high mutational tolerance under permissive selection conditions (Garst et al., 2017) and that DHFR abundance can be reduced to ~30% without a growth impact (Bershtein et al., 2013). We used an E. coli strain derived from ER2566 with the genes for DHFR and a downstream enzyme, thymidylate synthase, deleted in the genome and complemented on a pACYC-DUET plasmid with a weak ribosome binding site (see Materials and methods) that results in DHFR abundance at approximately 10% of the endogenous protein level (Figure 1—figure supplement 2, Figure 1—source data 1). To tightly control growth conditions, we performed selections in a turbidostat to maintain the culture in early Log phase growth (Figure 1A, Figure 1—figure supplement 3A). To quantify the effects of DHFR mutations on growth, we calculated selection coefficients (Rubin et al., 2017) from the change in allele frequency over time by deep sequencing of timepoint samples determined in biological triplicate (Figure 1B). For a panel of 14 DHFR mutants, we confirmed that the selection coefficients obtained from deep mutational scanning correlated linearly with growth rates measured separately for the individual variants in a plate reader (Figure 1—figure supplement 3B, Figure 1—source data 2), as expected. Furthermore, under our controlled selection conditions, we observed a linear relationship between selection coefficient and in vitro velocity (Figure 1C) at cytosolic substrate concentrations (Bennett et al., 2009; Kwon et al., 2008) for these DHFR mutants (Figure 1—source data 3). These results confirm that selection coefficients between −1.5 and 1.0 in our experiment are correlated with DHFR enzymatic velocity over approximately 3 orders of magnitude, and that selection can resolve mutants with higher velocities than wild-type level velocity.

Figure 1. — (A) Turbidostat schematic. Reoccurring dilutions with fresh medium keep the culture optical density (OD600) below 0.075. (B) The selection coefficient for each mutant is the slope of the linear regression of allele frequency over time. The wild-type (squares) value is normalized to zero. Advantageous (red) mutations increase and disadvantageous (blue) mutations decrease in frequency. (C) Selection coefficients from deep mutational scanning as a function of enzymatic velocity for purified DHFR point mutants measured in vitro. Velocities at 20 µM DHF were calculated from Michalis-Menten parameters. Error bars reflect the standard deviation from three biological replicates. (D) Histogram of selection coefficients. The wild-type value is indicated with a vertical black line. The median standard deviation over all mutations is the cut-off for WT-like behavior (Materials and methods, Figure 1—figure supplement 3, Figure 1—figure supplement 4) and is indicated with dashed lines. Mutation are colored as advantageous (red), disadvantageous (blue), WT-like (white), or null (grey). (E) Structural model of DHFR (PDB ID: 3QL3) with cross-section slices (**a–e**) indicated. The DHF substrate (green) and the NADPH cofactor (purple) are represented by spheres (yellow carbons and heteroatom coloring). An arrow indicates the perspective for each slice. (**a–e**) five cross-section slices. Color scale indicates numbers of advantageous mutations at each position. Crosshatching indicates residues with >20% solvent accessible surface area.

Figure 1—source data 1. Soluble DHFR expression levels in molecules per cell measured from lysate activity assays as described in Materials and methods.
The location of the DHFR gene is listed in parenthesis in the first column. Expression values corresponds to the cell strain in the column heading.

elife-53476-fig1-data1.xlsx^{(9.8KB, xlsx)}

Figure 1—source data 2. Selection coefficients for –Lon selection (Figure 1—source data 1) compared to monoculture growth rates measured in a plate reader in *ER2566 ∆folA/∆thyA (–Lon)* as described in Materials and methods.
For values listed as ND, no detectable change in OD was measured during a 30 hr growth period.

elife-53476-fig1-data2.xlsx^{(9.9KB, xlsx)}

Figure 1—source data 3. Michaelis-Menten kinetics for the set of DHFR mutants (Fierke and Benkovic, 1989; Huang et al., 1994; Reynolds et al., 2011) used to calibrate the selection are reported together with the reference from which the values were taken.

elife-53476-fig1-data3.xlsx^{(9.6KB, xlsx)}

Figure 1—figure supplement 1. — (A) Turbidostat schematic. Reoccurring dilutions with fresh medium keep the culture optical density (OD600) below 0.075. (B) The selection coefficient for each mutant is the slope of the linear regression of allele frequency over time. The wild-type (squares) value is normalized to zero. Advantageous (red) mutations increase and disadvantageous (blue) mutations decrease in frequency. (C) Selection coefficients from deep mutational scanning as a function of enzymatic velocity for purified DHFR point mutants measured in vitro. Velocities at 20 µM DHF were calculated from Michalis-Menten parameters. Error bars reflect the standard deviation from three biological replicates. (D) Histogram of selection coefficients. The wild-type value is indicated with a vertical black line. The median standard deviation over all mutations is the cut-off for WT-like behavior (Materials and methods, Figure 1—figure supplement 3, Figure 1—figure supplement 4) and is indicated with dashed lines. Mutation are colored as advantageous (red), disadvantageous (blue), WT-like (white), or null (grey). (E) Structural model of DHFR (PDB ID: 3QL3) with cross-section slices (**a–e**) indicated. The DHF substrate (green) and the NADPH cofactor (purple) are represented by spheres (yellow carbons and heteroatom coloring). An arrow indicates the perspective for each slice. (**a–e**) five cross-section slices. Color scale indicates numbers of advantageous mutations at each position. Crosshatching indicates residues with >20% solvent accessible surface area.

Figure 1—source data 1. Soluble DHFR expression levels in molecules per cell measured from lysate activity assays as described in Materials and methods.
The location of the DHFR gene is listed in parenthesis in the first column. Expression values corresponds to the cell strain in the column heading.

elife-53476-fig1-data1.xlsx^{(9.8KB, xlsx)}

Figure 1—source data 2. Selection coefficients for –Lon selection (Figure 1—source data 1) compared to monoculture growth rates measured in a plate reader in *ER2566 ∆folA/∆thyA (–Lon)* as described in Materials and methods.
For values listed as ND, no detectable change in OD was measured during a 30 hr growth period.

elife-53476-fig1-data2.xlsx^{(9.9KB, xlsx)}

Figure 1—source data 3. Michaelis-Menten kinetics for the set of DHFR mutants (Fierke and Benkovic, 1989; Huang et al., 1994; Reynolds et al., 2011) used to calibrate the selection are reported together with the reference from which the values were taken.

elife-53476-fig1-data3.xlsx^{(9.6KB, xlsx)}

We next analyzed the deep mutational scanning data for all possible DHFR single point mutants under the calibrated selection conditions (Figure 1D, Supplementary file 1). All pairwise replicates were related with a Pearson correlation R² value of 0.70 and the median standard deviation between replicates for all selection coefficients was 0.2 (Figure 1—figure supplement 3C–E). Using this value, we defined the selection coefficient interval of 0 ±0.2 as WT-like behavior. Within this interval, the standard deviation of the selection coefficients between replicates was not correlated with changes in selection coefficient (Figure 1—figure supplement 4A). Moreover, our WT-like threshold of 0.2 was greater than the value of 0.12 for the standard deviation for wild-type synonymous codons (Figure 1—figure supplement 4B). Based on these considerations, we defined DHFR mutations with selection coefficients of <−0.2 and >0.2 as disadvantageous and advantageous, respectively. Mutations that were depleted during overnight growth (under less stringent conditions using a supplemented growth medium, see Materials and methods) were assigned a null phenotype. As expected, mutations at DHFR positions that are known to be functionally important (M20, W22, D27, L28, F31, T35, M42, L54, R57, T113, G121, D122, and S148) were generally disadvantageous or null mutations (Figure 1—figure supplement 5). These results indicate that our selection assay is a sensitive reporter of functionally important residues and that our results are consistent with previous biochemical characterization of DHFR.

In previous deep mutational scanning experiments, stringent selection typically revealed many disadvantageous mutations (Garst et al., 2017; Jiang et al., 2013; Mavor et al., 2016; Mavor et al., 2018; Stiffler et al., 2015). In contrast, the most striking observation under our conditions is the large fraction of advantageous mutations (red, Figure 1D): 736 of 3161 possible variants were advantageous (23.3%), and wild-type DHFR only ranked 1203^rd (although 467 of the 1202 higher-ranking variants fall into the WT-like interval). In direct measurements of individual growth rates under our selection conditions, the top two DHFR variants (W47L and L24V) led to increases in growth rate of 40% and 76%, respectively, when compared to wild-type DHFR (Figure 1—figure supplement 6). Advantageous mutations were widely distributed over 127 of the 159 positions of DHFR (Figure 1E). Furthermore, when we examined the DHFR structure, many of the advantageous mutations appeared to disrupt key side-chain interactions, for example by disrupting atomic packing interactions or surface salt-bridges (Figure 1—figure supplement 7).

To understand the origins of this counter-intuitive prevalence of advantageous mutations, we looked for cellular factors potentially affecting our mutational landscape. Our selection strain (Anton et al., 2016), like most standard expression strains of E. coli, is naturally deficient in Lon protease (Gur and Sauer, 2008) due to an insertion of IS186 in the lon promoter region (saiSree et al., 2001). Lon is a major component of protein quality control in E. coli (Powers et al., 2012; Sauer and Baker, 2011) responsible for degrading poorly folded proteins. Moreover, Lon had previously been implicated in degrading DHFR unstable variants in E. coli (Bershtein et al., 2013; Cho et al., 2015), and deleting Lon in an MG1655 strain of E. coli masked the deleterious impact of 2 destabilizing mutations out of a panel of 21 mutants tested in growth experiments at 30 °C (Bershtein et al., 2013). Although these 21 mutants were selected for minimal impacts on Michaelis-Menten kinetic parameters, we reasoned that the absence of Lon could be responsible for the large fraction of advantageous but potentially destabilizing mutations observed in our selection.

To test this prediction, we reintroduced chromosomal Lon expression under the control of a constitutive promoter in our selection strain, and repeated deep mutational scanning in biological triplicate (Supplementary file 2). We refer to the two regimes as +Lon and –Lon selection. The quality of +Lon selection was comparable to that of –Lon selection (Figure 2—figure supplement 1, Figure 2—figure supplement 2). Consistent with our hypothesis, the distribution of selection coefficients shifted towards more negative values in the +Lon selection, depleting positive selection coefficients and enriching for negative or null coefficients (Figure 2A). The number of advantageous mutations after reintroducing Lon decreased from 737 in –Lon selection to 384 in +Lon selection (Figure 2B), the mean selection coefficient for advantageous mutations decreased from 0.47 to 0.37, and the rank of the wild-type sequence increased by 341 to 864^th (where 479 of the 863 higher-ranked variants are in the WT-like interval) (Figure 2—figure supplement 3). The median rank of the wild-type residue over all positions decreased from eight in –Lon selection to five in +Lon selection (Figure 2—figure supplement 4).

Figure 2. — (A) Histogram of selection coefficients for mutations (top) in –Lon (grey) and +Lon selection (green). The difference of the histograms (bottom) is shown with grey indicating more mutants for –Lon selection and green indicating more mutants for +Lon selection. The threshold for classification for advantageous and disadvantageous mutations is as in Figure 1 and indicated with dashed lines. (B) Distribution of mutations classified by selection coefficients: 0.2 ≤ advantageous (adv.), 0.2 > WT like > –0.2, –0.2 ≥ disadvantageous (disadv.), null, and no data (a mutant was not detected in the library after transformation into the selection strain). Grey bars: –Lon selection; green bars: +Lon selection. (C) Distribution of sequence positions into the five mutational response categories: Beneficial, Tolerant, Mixed, Deleterious, Intolerant. Grey bars: –Lon selection; green bars: +Lon selection. (D) Heatmap of DHFR selection coefficients in the –Lon and +Lon strains, showing details of the distributions shown in C) (dotted border). Positions (rows) are grouped by their mutational response category for –Lon and +Lon as in C) and sorted by the wild-type amino acid. Amino acid residues (columns) are organized by physiochemical similarity and indicated by their one-letter amino acid code. An asterisk indicates a stop codon. Advantageous mutations are shown in shades of red, disadvantageous mutations in shades of blue, Null mutations in grey and ‘No data’ as defined in A) in black. Wild-type amino acid residues are outlined in black.

Figure 2—figure supplement 1. — (A) Histogram of selection coefficients for mutations (top) in –Lon (grey) and +Lon selection (green). The difference of the histograms (bottom) is shown with grey indicating more mutants for –Lon selection and green indicating more mutants for +Lon selection. The threshold for classification for advantageous and disadvantageous mutations is as in Figure 1 and indicated with dashed lines. (B) Distribution of mutations classified by selection coefficients: 0.2 ≤ advantageous (adv.), 0.2 > WT like > –0.2, –0.2 ≥ disadvantageous (disadv.), null, and no data (a mutant was not detected in the library after transformation into the selection strain). Grey bars: –Lon selection; green bars: +Lon selection. (C) Distribution of sequence positions into the five mutational response categories: Beneficial, Tolerant, Mixed, Deleterious, Intolerant. Grey bars: –Lon selection; green bars: +Lon selection. (D) Heatmap of DHFR selection coefficients in the –Lon and +Lon strains, showing details of the distributions shown in C) (dotted border). Positions (rows) are grouped by their mutational response category for –Lon and +Lon as in C) and sorted by the wild-type amino acid. Amino acid residues (columns) are organized by physiochemical similarity and indicated by their one-letter amino acid code. An asterisk indicates a stop codon. Advantageous mutations are shown in shades of red, disadvantageous mutations in shades of blue, Null mutations in grey and ‘No data’ as defined in A) in black. Wild-type amino acid residues are outlined in black.

To examine in more detail how the mutational response of individual residues changes between selection ±Lon, we used a K-means clustering algorithm (see Materials and methods) to group all DHFR sequence positions into five categories: positions where mutations were generally advantageous (Beneficial), generally WT-like (Tolerant), variably advantageous and disadvantageous (Mixed), generally disadvantageous (Restricted), and generally null (Intolerant). Grouping was performed separately for –Lon and +Lon selection (Figure 3—source data 1). Comparing the distributions of DHFR positions in –Lon and +Lon conditions illustrates the extensive reshaping of the mutational landscape by Lon (Figure 2C,D). For –Lon selection, 28 positions (17.6%) were classified as Beneficial, where nearly every mutation was preferred over the wild-type residue. In comparison, the number of Beneficial positions decreased to 10 in +Lon selection, with only three surface-exposed positions (E48, T68, D127) common between the two Beneficial sets. Simultaneously, the number of Restricted positions increased from 42 to 67 with the reintroduction of Lon into the selection strain (Figure 2C). These results support the conclusion that Lon activity broadly penalizes mutations, including a large subset of the advantageous mutations. Overall, the changes upon modulating Lon activity lead to a model in which upregulating Lon increases constraints on DHFR, and the mutational landscape changes from being permissive when Lon is absent to being more restricted when Lon is present (Figure 2D).

To analyze the constraints imposed by Lon on the DHFR mutational landscape in structural detail, we defined a ∆selection coefficient for each amino acid residue at each position as the difference between the +Lon and –Lon selections (Figure 3A). The ∆selection coefficient values were most negative at positions in the Beneficial category and at positions with a native VILMWF or Y amino acid residue (Figure 3B, excludes Intolerant positions from –Lon selection); overall, mutations at positions with native hydrophobic residues are enriched for negative ∆selection coefficients (Figure 3—figure supplement 1A). Strikingly, the mean ∆selection coefficient was –0.71 for the 65 buried positions with <20% side-chain solvent accessible surface area, compared to –0.27 for the 79 exposed positions (Figure 3C, Figure 3—figure supplement 1B, Figure 3—source data 1). These results show that Lon has a broad impact on the mutational landscape throughout the DHFR structure but imposes particularly strong constraints in the DHFR core.

Figure 3. — (A) Conceptual diagram of ∆selection coefficients, calculated as the +Lon selection coefficient minus the –Lon selection coefficient (see Materials and methods). (B) Heatmap of ∆selection coefficient values for all positions not classified as Intolerant. ∆selection coefficients values between –0.2 and 0.2 are shown in white; ∆selection coefficients >0.2 are in shades of red and ∆selection coefficients <–0.2 in shades of blue. Amino acid residues (columns) are organized by physiochemical similarity and indicated by their one-letter amino acid code. The mean ∆selection coefficient (avg) at each position is shown as a separate column and outlined with a light blue box. Positions (rows) are sorted by the wild-type amino acid and grouped by their mutational response category from the –Lon selection in Figure 2C,D. Positions with a native VILMWF or Y amino acid are indicated with an orange bar to the left. (C) Per-position mean ∆selection coefficient displayed on the structural model of DHFR. The five cross-section slices of the DHFR structure are displayed as in Figure 1E, and the color scale is as in B).

Figure 3—source data 1. Burial classification for DHFR positions from the Getarea server (Fraczkiewicz and Braun, 1998) as described in Materials and methods.

elife-53476-fig3-data1.xlsx^{(12.9KB, xlsx)}

Figure 3—figure supplement 1. — (A) Conceptual diagram of ∆selection coefficients, calculated as the +Lon selection coefficient minus the –Lon selection coefficient (see Materials and methods). (B) Heatmap of ∆selection coefficient values for all positions not classified as Intolerant. ∆selection coefficients values between –0.2 and 0.2 are shown in white; ∆selection coefficients >0.2 are in shades of red and ∆selection coefficients <–0.2 in shades of blue. Amino acid residues (columns) are organized by physiochemical similarity and indicated by their one-letter amino acid code. The mean ∆selection coefficient (avg) at each position is shown as a separate column and outlined with a light blue box. Positions (rows) are sorted by the wild-type amino acid and grouped by their mutational response category from the –Lon selection in Figure 2C,D. Positions with a native VILMWF or Y amino acid are indicated with an orange bar to the left. (C) Per-position mean ∆selection coefficient displayed on the structural model of DHFR. The five cross-section slices of the DHFR structure are displayed as in Figure 1E, and the color scale is as in B).

Figure 3—source data 1. Burial classification for DHFR positions from the Getarea server (Fraczkiewicz and Braun, 1998) as described in Materials and methods.

elife-53476-fig3-data1.xlsx^{(12.9KB, xlsx)}

To determine why mutations in DHFR were advantageous in the absence of Lon but less so in its presence, we selected a subset of mutations for more detailed characterization in individual experiments. We considered all positions with more than one mutation in the top 100 most advantageous mutations for the –Lon condition. We describe these positions by their location in one of four structural regions that appear to be hot-spots for the top advantageous mutations (Figure 4A,B, Figure 4—figure supplement 1): 1) exchanges between hydrophobic residues at core positions, 2) disruptions of surface residues on the beta-sheet near the active site, 3) disruptions of polar interactions with the adenine ring of NADPH, or 4) mutations to the active site or M20 loop that controls access to the active site. At these positions, we selected strongly advantageous mutations. Where possible, we selected two mutations at the same position but with significantly differing Lon sensitivities such that the set had a range of ∆selection coefficients from −0.07 to −1.46, with the exception of L24V that had a positive ∆selection coefficient. We first confirmed that the selected advantageous mutations indeed had higher cytosolic DHFR activity (the total rate of conversion of DHF to THF) in ER2566 ∆folA/∆thyA (–Lon) lysates relative to the activity for WT DHFR (Figure 4—figure supplement 2), consistent with the deep mutational scanning results.

Figure 4. — (A) DHFR structure with mutational hot-spots. For positions with two or more top 100 advantageous mutations in the absence of Lon, the beta carbon is depicted as a sphere scaled according to the number of top mutations. For mutants selected for in vitro characterization, the beta carbon is colored according to its location in the DHFR structure: core (purple), surface beta-sheet (gold), proximal to the adenine ring on NADPH (blue), or proximal to the active site and M20 loop (red). Positions for advantageous mutants from the calibration set are depicted in dark grey. (B) The structure from A) rotated 90° clockwise. (C) In vitro velocities of purified DHFR wild-type and point mutants measured at 20 µM DHF. Bars are colored in reference to the hot-spots in A). Error bars represent ±1 standard deviation from three independent experiments (Materials and methods). The dashed line represents the velocity of WT DHFR. (D) DHFR cellular abundance calculated from the lysate DHFR activity in Figure 4—figure supplement 2 and in vitro kinetics with purified enzyme (see Materials and methods). Error bars represent the cumulative percent error (standard deviation) from three independent experiments for velocity and three biological replicates for lysate activity. Data are shown in both the -Lon (light grey) and +Lon (green) conditions. The dashed line represents the WT expression level of DHFR in the –Lon background. Mutants are in the same order as in C) (see Figure 4—source data 2; four mutants were not measured). (E) Cellular abundance of DHFR vs. in vitro velocities of purified DHFR wild-type and point mutants measured at 20 µM DHF. Points are colored as in A). Error bars represent ±1 standard deviation from three independent experiments (Materials and methods). The dashed line represents WT-level DHFR activity, i.e. DHFR abundance/velocity pairs whose product is equivalent to [DHFR]_WT • velocity_WT. (F) Correlation between in vitro T_m values and in vivo ∆selection coefficients for DHFR wild-type and characterized mutants. Points are colored as in A). (G) ∆T_m values and ∆∆selection coefficient for mutations at the same position. Points representing comparison between mutants are numbered as follows: 1) D116I-M, 2) M42Y-F, 3) W30M-F, 4) I91G-A, 5) Q102W-L, 6) L62A-V, 7) I41A-V, 8) W47V-L.

Figure 4—source data 1. In vitro velocity for selected advantageous measured as described in Materials and methods at multiple concentrations of DHF are reported with the standard deviation over three independent experiments.

elife-53476-fig4-data1.xlsx^{(11KB, xlsx)}

Figure 4—source data 2. Soluble DHFR abundance levels in molecules per cell measured from lysate activity assays as described in Materials and methods.
All values are for the SMT205 plasmid transformed into the cell strain in the column heading. NM, not measured.

elife-53476-fig4-data2.xlsx^{(10.6KB, xlsx)}

Figure 4—source data 3. Apparent T_m values from thermal denaturation experiments monitored by CD signal at 225 nm are reported along with the ∆selection coefficient (Lon impact) value depicted in Figure 4D.

elife-53476-fig4-data3.xlsx^{(9.7KB, xlsx)}

Figure 4—figure supplement 1. — (A) DHFR structure with mutational hot-spots. For positions with two or more top 100 advantageous mutations in the absence of Lon, the beta carbon is depicted as a sphere scaled according to the number of top mutations. For mutants selected for in vitro characterization, the beta carbon is colored according to its location in the DHFR structure: core (purple), surface beta-sheet (gold), proximal to the adenine ring on NADPH (blue), or proximal to the active site and M20 loop (red). Positions for advantageous mutants from the calibration set are depicted in dark grey. (B) The structure from A) rotated 90° clockwise. (C) In vitro velocities of purified DHFR wild-type and point mutants measured at 20 µM DHF. Bars are colored in reference to the hot-spots in A). Error bars represent ±1 standard deviation from three independent experiments (Materials and methods). The dashed line represents the velocity of WT DHFR. (D) DHFR cellular abundance calculated from the lysate DHFR activity in Figure 4—figure supplement 2 and in vitro kinetics with purified enzyme (see Materials and methods). Error bars represent the cumulative percent error (standard deviation) from three independent experiments for velocity and three biological replicates for lysate activity. Data are shown in both the -Lon (light grey) and +Lon (green) conditions. The dashed line represents the WT expression level of DHFR in the –Lon background. Mutants are in the same order as in C) (see Figure 4—source data 2; four mutants were not measured). (E) Cellular abundance of DHFR vs. in vitro velocities of purified DHFR wild-type and point mutants measured at 20 µM DHF. Points are colored as in A). Error bars represent ±1 standard deviation from three independent experiments (Materials and methods). The dashed line represents WT-level DHFR activity, i.e. DHFR abundance/velocity pairs whose product is equivalent to [DHFR]_WT • velocity_WT. (F) Correlation between in vitro T_m values and in vivo ∆selection coefficients for DHFR wild-type and characterized mutants. Points are colored as in A). (G) ∆T_m values and ∆∆selection coefficient for mutations at the same position. Points representing comparison between mutants are numbered as follows: 1) D116I-M, 2) M42Y-F, 3) W30M-F, 4) I91G-A, 5) Q102W-L, 6) L62A-V, 7) I41A-V, 8) W47V-L.

Figure 4—source data 1. In vitro velocity for selected advantageous measured as described in Materials and methods at multiple concentrations of DHF are reported with the standard deviation over three independent experiments.

elife-53476-fig4-data1.xlsx^{(11KB, xlsx)}

Figure 4—source data 2. Soluble DHFR abundance levels in molecules per cell measured from lysate activity assays as described in Materials and methods.
All values are for the SMT205 plasmid transformed into the cell strain in the column heading. NM, not measured.

elife-53476-fig4-data2.xlsx^{(10.6KB, xlsx)}

Figure 4—source data 3. Apparent T_m values from thermal denaturation experiments monitored by CD signal at 225 nm are reported along with the ∆selection coefficient (Lon impact) value depicted in Figure 4D.

elife-53476-fig4-data3.xlsx^{(9.7KB, xlsx)}

The lysate activity assay reports on both the enzymatic activity of a DHFR variant and its intracellular abundance, [DHFR] (Bershtein et al., 2015b; Dykhuizen et al., 1987). To separate the two contributions, we purified each of the DHFR variants and determined their enzymatic velocity in vitro using concentrations of DHF that are consistent with estimates of cytosolic DHF concentration based on mass spectrometry measurements (Kwon et al., 2008). At 20 µM DHF, 16 the mutants had velocities equal and up to three-fold higher than that of WT (Figure 4C, Figure 4—figure supplement 3, Figure 4—source data 1). In contrast, the other eight mutants had velocities as much as two-fold lower than that of WT at the same DHF concentration. These results show that the higher cytosolic DHFR activity of the advantageous mutations can only partially be explained by changes in the kinetic parameters for these mutants.

We therefore examined the soluble intracellular abundance of these mutants. In the absence of Lon, we observed that mutant abundance levels varied from close-to-wild-type levels to a 20-fold increase over wild-type (Figure 4D, Figure 4—figure supplement 4, Figure 4—source data 2). Importantly, abundance decreased for most mutants in the presence of Lon (Figure 4—figure supplement 4), as expected, and these abundance decreases correspond to decreased selection coefficients (negative values in the ∆selection coefficients from Figure 3 that report on the Lon impact on selection (Figure 4—figure supplement 5)). Moreover, when considering both velocity and abundance the expected total cellular DHFR activity ([DHFR] • velocity) is increased compared to wild-type for the majority of advantageous mutants (Figure 4E, Figure 4—figure supplement 6, positions above the dotted line indicate expected cellular activity greater than wild-type). However, the expected total cellular DHFR activity is not a strong quantitative predictor of the advantageous mutants in –Lon selection (Figure 4—figure supplement 7, Figure 4—figure supplement 8). We attribute discrepancies at least in part to the difficulty of accurately quantifying rather small differences in activity and abundance, in addition to other potential complicating factors such as differential activity of cellular chaperones for different DHFR variants (Cho et al., 2015), and feedback regulation that could affect cellular concentrations of the substrate DHF (Bershtein et al., 2015a; Kwon et al., 2008). Nevertheless, our velocity and abundance measurement are in qualitative agreement with the in vivo selection. Taken together, these results suggest that increased selection coefficients arise from an interplay of effects of the mutations on cellular abundance and catalytic activity (Dykhuizen et al., 1987), and that each parameter alone is insufficient to explain the majority of the advantageous mutations. Moreover, Lon suppresses advantageous mutations at least in part by reducing their cellular abundance.

To test more directly whether advantageous mutations in DHFR destabilize the protein and whether this destabilization could explain the sensitivity to Lon expression, we measured apparent melting temperature (T_m) values from non-reversible thermal denaturation monitored by circular dichroism spectroscopy. We found that many of the advantageous mutations considerably destabilized the protein (Figure 4F, Figure 4—figure supplement 9, Figure 4—source data 3). Moreover and as expected, the ∆selection coefficients between +Lon and –Lon selection (Figure 3) are correlated with T_m (Figure 4F), except for mutations near the active site. Strikingly, when we compare different mutations at the same position, the change in ∆selection coefficients (i.e. Lon sensitivity) correlates with the change in T_m values (Figure 4G). These results indicate that the many of the selected advantageous mutations are destabilizing, and that destabilization is correlated with Lon sensitivity. One possible explanation for the selection advantage of the subset of destabilizing mutations with increased k_cat (e.g. L24V, W30F/M, M42F/Y, H114V, D116I/M, E154V) is that these mutations promote breathing motions that accelerate product release, which is rate limiting for wild-type DHFR at neutral pH (Oyen et al., 2017) and for a hyperactive DHFR mutant with a 7-fold increase in k_cat(Iwakura et al., 2006).

Taken together, our data indicate that the observed widespread changes in the mutational landscape of DHFR can be explained by a penalty for destabilizing mutations from Lon expression, leading to extensive activity – stability tradeoffs for advantageous mutations. The effect of these two selection pressures is directly observable in the structural arrangement of the mutational response categories (Figure 5, Figure 5—figure supplement 1). In –Lon conditions, mutational responses are arranged in shells around the hydride transfer site (Liu et al., 2013; Figure 5A, top), where the proportion of advantageous mutations increases with increasing distance (Figure 5B). This same spatial pattern also holds for +Lon selection (Figure 5A, bottom), but it is now superimposed with the additional pressure against destabilizing mutations such that there are no Beneficial positions in the core (Figure 5C, Figure 5—figure supplement 2). In contrast, the mutational responses as a function of distance to other DHFR sites (e.g. C5 of the NADPH adenine ring) do not show as strong of a relationship (Figure 5—figure supplement 3). These findings illustrate how the contributions from two constraints – one structural (distance from hydride transfer) and one dependent on cellular context (Lon) – can be distinguished from structural patterns in the mutational landscape.

Figure 5. — (A) Mutational response categories from –Lon selection (top, categories in Figure 2C,D) and +Lon selection (bottom, categories as in Figure 2C,D) colored onto residues and displayed on slices as in Figure 1E. (B) Relationship between mutational response and distance from hydride transfer for –Lon selection. The percent of positions from each mutational response category are plotted as a function of distance from the site of hydride transfer. Each category colored as in A), top). (C) Relationship between mutational response and distance from hydride transfer for +Lon selection. Each category colored as in A), bottom).

Figure 5—figure supplement 1. — (A) Mutational response categories from –Lon selection (top, categories in Figure 2C,D) and +Lon selection (bottom, categories as in Figure 2C,D) colored onto residues and displayed on slices as in Figure 1E. (B) Relationship between mutational response and distance from hydride transfer for –Lon selection. The percent of positions from each mutational response category are plotted as a function of distance from the site of hydride transfer. Each category colored as in A), top). (C) Relationship between mutational response and distance from hydride transfer for +Lon selection. Each category colored as in A), bottom).

Discussion

The naturally occurring insertion in the Lon promoter in our original selection strain, in combination with our stringent selection conditions, allowed the serendipitous discovery that advantageous mutations are remarkably prevalent throughout the DHFR structure but are also highly sensitive to Lon. The large fraction of advantageous mutations to DHFR appears to conflict with the fixation of the wild-type DHFR sequence during evolution. While Lon expression in our selection increases both the relative rank of the WT DHFR sequence (Figure 2—figure supplement 4) and the similarity between amino acid preferences from selection and from bacterial DHFR orthologues (Figure 2—figure supplement 5), there are still considerable differences: There are still 384 advantageous mutants that rank substantially better than the WT sequence even in the presence of Lon, and the amino preferences in the two selection experiments (±Lon) are more similar to each other than either is to the preferences from bacterial DHFR orthologues.

Considering these differences, we note several caveats in comparing our selection results to selection in evolution: First and most generally, screening DHFR variants under calibrated selection conditions (such as defined temperature, medium, and growth kept in early log phase) for a few generations is not expected to recapitulate the natural selection pressures on E. coli DHFR on evolutionary timescales. Second and more specifically, our selection conditions were intentionally engineered to be highly sensitive to mutations by dampening DHFR abundance to approximately 10% of the endogenous level (Figure 1—figure supplement 2). In contrast, endogenous DHFR is expected to be buffered from mutational impacts. Increasing DHFR activity or abundance in E. coli several-fold above that in wildtype strains does not increase fitness, and, conversely, reducing DHFR abundance in E. coli does not have an impact on growth until abundance is below 30% of the endogenous level (Bershtein et al., 2013; Bhattacharyya et al., 2016). Indeed, selection on mutations to the chromosomal DHFR gene did not reveal strong mutational impacts in the absence of the anti-folate drug trimethoprim (Garst et al., 2017). Third, chromosomal DHFR expression is modulated through feedback mechanism (Bershtein et al., 2015a), and it would be an interesting question how the distribution of fitness effects of DHFR mutations will be shaped by the presence of such a regulatory expression element that is absent in our selection system. Taken together, these mutational buffering effects likely explain why mutations that are advantageous in our selection are not prevalent in evolutionary DHFR sequences, and likely also explain why DHFR sequences do not vary between naturally occurring –Lon and +Lon strains of E. coli.

Nevertheless, our engineered selection conditions yielded considerable insights into constraints on mutational landscapes that are typically hidden from observation precisely because of buffering effects in natural contexts. The increase in the number of advantageous mutations in the absence of Lon shows that decreasing cellular constraints can substantially modulate the tolerance to mutation in a deep mutational scanning experiment. Because all B type E. coli strains (e.g. BL21) have the same natural Lon deficiency as our selection strain, our results could have implications for selection experiments performed in these strains over much longer time-scales such as the E. coli Long-Term Evolution Experiment (Tenaillon et al., 2016), or directed evolution strategies that often lead to mutations at positions distal to the active site.

Beyond experiments in B-type E. coli, we expect the fundamental principle of tuning trade-offs to play a role in other experimental systems. Prior work has illuminated the impact of chaperones on the effect of mutations, such as for GroEL in bacteria (Tokuriki and Tawfik, 2009) and for Hsp90 in eukaryotic cells that has been shown to buffer the phenotypic impacts of deleterious mutations (Queitsch et al., 2002). Our results highlight an opposite key role for the protein quality control machinery to tune in vivo mutational responses and lead to a model where protease activities add constraints to the mutational landscape and chaperones relieve them.

The ability to tune multiple constraints could provide a general way of controlling landscapes to drive genes into regions of sequence space that are highly responsive to external pressures. A concrete example of how this principle could be applied is in combinatorial antibiotics. Lon inactivation has been shown to increase resistance to antibiotics (Nicoloff and Andersson, 2013). Switching between compounds capable of inhibiting or activating Lon in combination with DHFR-targeting folate inhibitors such as trimethoprim could serve to variably promote destabilized resistance mutants when Lon is inhibited and then penalize those mutations when Lon is reactivated.

While the power in engineering individual gene sequences is well-recognized, we are only just beginning to explore the potential in engineering the general behavior of local sequence space. We anticipate that further study of tunable constraints will yield a new toolkit for fine control of the landscapes that guide movements through sequence space and enable unexplored engineering applications.

Materials and methods

All plasmid and primer sequences are listed in The Appendix. Key plasmids were deposited in the Addgene plasmid repository (accession codes are listed in The Appendix). All code and python scripts are available at https://github.com/keleayon/2019_DHFR_Lon.git with key input files and example command lines (Thompson, 2020; copy archived at https://github.com/elifesciences-publications/2019_DHFR_Lon).

Generation of plasmids for in vivo selection assay

The vector bearing DHFR and TYMS for in vivo selection (SMT205) was derived from the pACYC-Duet vector described by Reynolds et al., 2011. The lac operon upstream of the TYMS gene was replaced with a Tet-inducible promoter. A Tet promoter fragment had been generated with overlap extension PCR and cloned into the pACYC vector (SMT101) at unique AflII/BglII sites to produce SMT201. Selection conditions that resolved increased-fitness mutations were obtained with the SMT205 plasmid where the DHFR ‘AAGGAG’ ribosome binding site (RBS) was replaced with ‘AATGAG’ based on prediction from the RBS calculator (Salis et al., 2009) using inverse PCR. Briefly, PCR reactions were set up using 2x Q5 mastermix (NEB, cat# M0492), 10 ng of plasmid template, and 500 nM forward and reverse primers. PCR was performed in the following steps: 1) 98 °C for 30 s, 2) 98 °C for 10 s, 3) 57–63 °C for 30 s, 4) 72 °C for 2 min, 5) return to step 2 for 22 cycles, 6) 72 °C for 5 min. As needed, the annealing temperature (step 3) was optimized in the range of 57–63 °C. 25 µL of PCR reaction was mixed with 1 µL of DpnI (NEB, cat# R0176), 1 µL of T4 PNK (NEB, cat# M0201), 1 µL of T4 ligase (NEB, cat# M0202), and 3.1 µL of T4 ligase buffer (NEB, cat# B0202) at 37 °C for 2–4 hr. The reactions were then transformed into chemically competent Top10 cells and plated on LB agar plates with 35 µg/mL chloramphenicol (Fisher BioReagents, BP904, CAS: 56-76-7, 35 mg/mL in ethanol). The plates were incubated overnight at 37 °C. Single colonies were picked and used to inoculate 5 mL of LB medium (10 g Bacto-tryptone (Fisher BioReagent, cat# BP1415, CAS: 73049-73-7), 5 g Bacto-yeast extract (BD Difco, cat# 212720, CAS: 8013-01-2), 10 g NaCl (Fisher BioReagents, cat# BP358, CAS 7647-14-5), 0.186 g KCl (Sigma, cat# P9541, CAS: 7447-40-7), volume brought to 1 L with MilliQ water, autoclaved) + 35 mg/mL chloramphenicol. Cultures were incubated overnight in 14 mL plastic culture tubes (Falcon, cat# 352059) at 37 °C under 225 rpm shaking. Pellets were collected by centrifugation at 3500 rpm for 10 min at 4 °C in a swinging-bucket centrifuge (Beckman Coulter, Allegra X-12R) and miniprepped (Qiagen, cat# 27104). Constructs were confirmed by Sanger sequencing (Quintara Biosciences) by alignment to the template sequence in ClustalOmega.

Generation of plasmid libraries

Four sublibraries were generated to cover the entire mutational space of E. coli DHFR: positions 1–40 (sublibrary1, SL1), positions 41–80 (sublibrary2, SL2), positions 81–120 (sublibrary3, SL3), and positions 121–159 (sublibrary4, SL4). The single point mutant library was performed by multiple parallel inverse PCR reactions to substitute an NNS degenerate codon at every codon in DHFR. PCR primers (The Appendix) were phosphorylated in a 20 µL reaction with 1 µL T4 polynucleotide kinase and 1x T4 ligase buffer. Inverse PCR reactions were performed as described above, followed by PCR clean-up (Qiagen, cat# 28104). The cleaned PCR reactions were incubated for 4 hr with 1 µL DpnI, 1 µL of T4 ligase, and 3 µL of T4 ligase buffer. PCR reactions were analyzed by gel electrophoresis using a 1% agarose gel in TAE buffer (20 mM acetic acid (Sigma Aldrich, cat#, 695092, CAS: 64-19-7), 2 mM EDTA (ACROS Organics, cat# AC118432500, CAS: 60-00-4), 40 mM Tris, pH 8.5) with 0.01% v/v GelRed (Biotium, cat# 41003), and the product amount was quantified using gel densitometry in the FIJI image processing software package (Schindelin et al., 2012). Samples were pooled stoichiometrically, cleaned once with a gel extraction kit (Qiagen, cat# 28115), and again with a PCR clean-up kit. The pooled and cleaned ligation products were transformed into E. coli Top10 cells by electroporation (BioRad GenePulser Xcell, 1 mm path length cuvette (cat# 165–2089), 1.8 kV, time constant ~5 ms) using ~5 µL to obtain a minimum of 10⁷ transformants as measured by dilution plating on LB-agar plates with 35 µg/mL chloramphenicol. The transformed cells were rescued in SOB medium (20 g Bacto-tryptone, 5 g Bacto-yeast extract, 0.584 g NaCl, 0.186 g KCl, 800 mL MilliQ water, pH 7.0, volume brought to 1 L with MilliQ water, autoclaved) without antibiotics for 45 min at 37 °C before culturing overnight in 10 mL SOB medium with 35 µg/mL chloramphenicol. In the morning, glycerol stocks were made by mixing 500 µL of saturated culture with 500 µL of sterile filtered 50% (v/v) glycerol. 5 mL of the culture was used to miniprep the transformed library with a Qiagen miniprep kit.

Generation of individual point mutant plasmids

Point mutants in all DHFR-containing plasmids were generated via inverse PCR as described above for the generation of SMT205 except that the appropriate antibiotic was matched with the plasmid (The Appendix). Library primer sequences (The Appendix) were used except that the ‘NNS’ sequence on the forward primer was replaced with the desired codon.

Generation of ER2566 ∆folA ∆thyA –Lon and ER2566 ∆folA ∆thyA +Lon

The ER2566 ∆folA ∆thyA –Lon strain was generated as previously described (Reynolds et al., 2011) and a gift from Prof. Stephen Benkovic. The ER2566 ∆folA ∆thyA +Lon strain was generated from ER2566 ∆folA ∆thyA –Lon by lambda red recombination using Support Protocol I from Thomason et al., 2014. The pSim6 plasmid bearing the Lamda red genes linked to a temperature sensitive promoter and the pIB279 plasmid bearing the Kan-SacB positive-negative selection marker (Blomfield et al., 1991) were gifts from Carol Gross. The Kan-SacB cassette was amplified with 2 rounds of PCR using primers with 5’ homology arms for the region upstream of the Lon gene (The Appendix). The insertion fragment containing the Anderson consensus promoter (iGEM, 2006) with homology arms for the region upstream of Lon in the ER2566 genome was amplified from primers using overlap extension PCR.

Plate reader assay for E. coli growth

Growth rates for the selection strains bearing individual DHFR mutants were measured in 96-well plate growth assays as described for one individual mutant. The SMT205 plasmid was transformed via heat shock into chemically competent ER2566 ∆folA ∆thyA ±Lon cells and plated on an LB-agar plate with 30 µg/mL chloramphenicol plus 50 µg/mL thymidine and incubated overnight at 37 °C. On the second day, 2 mL M9 medium (1x M9 salts (BD Difco, cat# 248510), 0.4% glucose w/v (Fisher Chemical, cat# D16, CAS: 50-99-7), 2 mM MgSO4 (Sigma Aldrich, cat# 63138, CAS:10034-99-8)) with supplements for deficient folate metabolism (50 µg/mL thymidine (Sigma Aldrich, cat# T1895, CAS: 50-89-5), 22 µg/mL adenosine (Sigma Aldrich, cat# A9251, CAS: 56-61-7), 1 µg/mL calcium pantothenate (TCI, cat# P0012, CAS: 137-08-6), 38 µg/mL glycine (Fisher BioReagents, cat# BP381, CAS: 56-40-6), and 37.25 µg/mL methionine (Fisher BioReagents, cat# BP388, CAS 63-68-3)) and 30 µg/mL chloramphenicol in a 14 ml culture tube was inoculated with 5–10 colonies scraped from the plate and incubated at 37 °C at 225 rpm shaking for 12–14 hr. Biological replicates were obtained from separate inoculations at this step and run on the same plate. All assays were run from fresh transformations. Then, 20–50 µL of the previous culture was used to inoculate 5 mL of M9 medium (no supplements) with 30 µg/mL chloramphenicol in a 14 ml culture tube. This fresh culture was incubated for 6 hr at 30 °C at 225 rpm shaking. Meanwhile 2 mL of M9 medium with 30 µg/mL chloramphenicol and a transparent 96-well plate were pre-warmed at 30 °C. After the 6 hr incubation, the optical density at 600 nm (OD600) of the culture was measured on a Cary 50 spectrophotometer over a path of 1 cm. This early log-phase culture was diluted to an OD600 = 0.005 in the 2 mL aliquot of warmed M9. 200 µL of the dilute culture was pipetted into a well in the 96-well plate. Technical replicates were obtained by dispensing the same dilute culture into multiple wells. Wells were covered with 50 µL of mineral oil (Sigma Aldrich, cat# M5904, CAS: 8042-47-5) using the reverse pipetting technique. The plate was then incubated for 20–48 hr at 30 °C in a Victor X3 multimode plate reader (Perkin Elmer). Every 10 min, the plate was shaken for 30 s with an orbital diameter of 1.8 mm under the ‘normal’ speed setting. Then, the absorbance at 600 nm (ABS600) was measured for each well. Growth rates were calculated from the slope of Log2(ABS600 – ABS600_t=0) for ∆ABS600 in the range of 0.015–0.04 using an in-house python script.

Deep mutational scanning experiments

Competitive growth under selection for cellular DHFR activity was performed in a continuous culture turbidostat (gift of Rama Ranganathan) as described below for a single sublibrary. Sublibraries of DHFR single point mutants were transformed via electroporation as described above into electrocompetent ER2566 ∆folA ∆thyA ±Lon cells using approximately 50 ng of plasmid DNA and 80 µL of competent cells with a transformation efficiency of 10⁸ cfu/ng (based on testing with 10 ng of pACYC plasmid DNA). Immediately after electroporation, the cells were rescued with 2 mL of SOB medium with 50 µg/mL thymidine warmed to 37 °C. The rescue culture was incubated at 37 °C for 45 min at 225 rpm shaking. After the rescue step, 4 µL of the rescue medium (1/500 of the rescue volume) was serially diluted in 10-fold increments. Half the volume of each dilution (1/1000 – 1/10⁷ of the rescue volume) was plated on an LB-agar plate with 30 µg/mL chloramphenicol plus 50 µg/mL thymidine and incubated overnight at 37 °C. The colonies were counted the following morning to check for a minimum of 1000x oversampling of the theoretical diversity in the library (~10⁶ transformants for each sublibrary). Meanwhile, the larger portion of the rescue medium was mixed with 4 mL of SOB medium with 45 µg/mL chloramphenicol (1.5x) plus 50 µg/mL thymidine warmed to 37 °C. This 6 mL culture was incubated for 5–6 hr at 37 °C at 225 rpm shaking in a 14 mL culture tube. After incubation, the culture was pelleted by centrifuging for 5 min at 3000 rpm at room temperature in a swinging bucket centrifuge. The cells were resuspended in 50 mL of supplemented M9 medium + 30 µg/mL chloramphenicol and incubated for 12–14 hr at 37 °C at 225 rpm shaking in a 250 mL flask. In the morning, 150 mL of supplemented M9 medium + 30 µg/mL chloramphenicol in a 1 L flask was inoculated with 15 mL of the overnight culture. This pre-culture was incubated at 30 °C for 4 hr at 225 rpm shaking. After 4 hr, the pre-culture was centrifuged at 3000 rpm for 5 min at room temperature in a swinging bucket centrifuge, and the OD600 was measured to ensure that the culture did not grow beyond early-mid log phase (OD600 ~0.3). The supernatant was decanted, and the pellet was resuspended in 30 mL of M9 medium. Pelleting and resuspension were repeated for a total of 3 washes to remove the supplemented medium. After three washes, the OD600 was measured for the resuspended pellet using a 10-fold dilution to stay in the linear range of the spectrophotometer.

The washed pellet was then transferred to the growth chamber of the turbidostat (a 250 mL pyrex bottle) containing 150 mL of M9 medium with 50 µg/mL chloramphenicol. Selection experiments were performed with 2 of the four sublibraries at a time (two repeats of SL1-SL2 and SL3-SL4, and one repeat of SL1+SL3 and SL2+SL4 for a net of biological triplicates for every codon in the gene), and the resuspended pellet from each library was diluted in the initial culture to an OD600 = 0.035. Mixing and oxygenation was provided by sterile filtered air from an aquarium pump. Every 60 s, the aquarium pump was stopped, and the optical density of the culture was read by an infrared emitter-receiver pair. The ADC (analog-to-digital converter) of the voltage over the receiver was calibrated against a spectrophotometer to convert the signal into an approximate OD600. The cells were grown at 30 °C with an OD600 threshold of 0.075. When the OD600 of the selection culture exceeded the threshold, the selection culture was diluted to OD600 ~0.065 with 25 mL of M9 medium with 50 µg/mL chloramphenicol, and the additional culture volume was driven through a waste line by the positive pressure of the aquarium pump. At timepoints of t = 0, 2, 4, 6, 8, 12, 16, and 18 hr, 6 mL of the selection culture in 2 mL centrifuge tubes was pelleted at 5000 rpm for 5 min at 4 °C in a microcentrifuge (Eppendorf, 5242R). The supernatant was removed except for the last ~200 µL, and the tubes were again pelleted at 5000 rpm for 5 min at 4 °C in a microcentrifuge, and all the supernatant was carefully removed from the pellet. The pellets were stored at −20 °C until sequencing.

Amplicon generation

Amplicons were generated by two rounds of PCR. The first round of PCR amplifies a portion of the DHFR gene from the pACYC plasmid containing 2–3 sublibraries. For quality control templates were 1 ng/µL plasmid solutions and the amplicons covered SL1-SL2 or SL3-SL4. Round 1 PCR reactions were set up using 1 µL of template, 1% v/v Q5 hotstart polymerase (NEB, cat# M0493), 1x Q5 Reaction Buffer, 1x Q5 High GC Enhancer, 200 µM dNTPs, and 500 nM forward and reverse primers. PCR was performed in the following steps: 1) 98 °C for 30 s, 2) 98 °C for 10 s, 3) 57 °C for 30 s, 4) 72 °C for 12 s, 5) return to step 2 for 16 cycles, 6) 72 °C for 2 min.

The Round 2 PCR uses primers that attach the Illumina adapters and the i5 (reverse) and i7 (forward) barcodes for sample identification and demultiplexing. Round 2 PCR reactions were set up and run identically to Round one reactions except that the template was 1 µL of Round 1 PCR. Round two reactions were analyzed by gel electrophoresis using a 1% TAE-agarose gel in TAE buffer with 0.01% v/v GelRed, and the product amount was quantified using gel densitometry in FIJI. Samples were pooled stoichiometrically and cleaned with a gel extraction kit (Qiagen). Because of the risk of contamination from small primer dimers, gel extraction was performed with very dilute samples. Only 20 µL of sample was loaded onto a 50 mL TAE-agarose gel (OWL EasyCast, B1A) with 8 of the 10 wells combined into a single well. The pooled amplicons were then cleaned again with a PCR clean-up kit (Zymogen, cat# D4013) to allow for small volume elution. The final amplicon concentration was measured with a NanoDrop One UV spectrophotometer and by Picogreen assay (Thermo Scientific, cat# P11496).

Sequencing for deep mutational scanning experiments

Templates for amplicon PCR were prepared from the frozen pellets. The pellets were resuspended in 20 µL of autoclaved MilliQ water and incubated on ice for 10 min. The samples were then centrifuged at 15,000 rpm for 10 min at 4 °C in a benchtop microcentrifuge. 1 µL of the supernatant was used as template in the amplicon generation protocol for sublibraries described above. The amplicons were sequenced on an Illumina NextSeq using a 300-cycle 500/550 high-output kit. Because of the limitations in the number of sequencing cycles on the Illumina NextSeq, the full amplicon was not sequenced for amplicons containing non-adjacent sublibraries (SL1+SL3, and SL2+SL4). Reads were demultiplexed into their respective selection experiment and timepoint using their TruSeq barcodes. Paired end reads were joined using FLASH (Magoč and Salzberg, 2011). For amplicons with adjacent sublibraries (SL1-SL2 and SL3-SL4), the joined reads were kept. For amplicons with distal sublibraries (SL1+SL3 and SL2+SL4), the unjoined reads were kept. Reads from all lanes of the Illumina chip were concatenated and raw counts of DHFR mutants were obtained from these reads.

Reads on the Illumina NextSeq (two-color chemistry, LED optics) generally have lower quality scores than reads from the Illumina MiSeq (four-color chemistry, laser optics). This lower quality leads to a background signal. This background was estimated from a WT sample. The median + one standard deviation value of background count was subtracted from every allele and the alleles were translated into the amino acid sequence, combining synonymous sequences. Counts at each timepoint were only reported for an allele if its frequency was above 2.0 × 10⁻⁵. Raw counts are reported in Supplementary file 3–5.

Analysis of deep mutational scanning data

Mutant counts were used to generate selection coefficients on our background-subtracted count files with Enrich2 using unweighted linear regression (Rubin et al., 2017). The raw Enrich2 values for each unique selection experiment were combined with a post-processing script. Enrich2 does not calculate selection coefficients for mutants that have no counts at a timepoint, so some selection coefficients were recalculated using only the timepoints before the counts for that allele fell below the cutoff frequency of 2.0 × 10⁻⁵. Individual selection coefficients were evaluated based on two criteria: noise and number of timepoints. Individual selection coefficients were discarded 1) if the standard error from regression was greater than 0.5 + 0.5 • |selection coefficient| or 2) if there were fewer than four timepoints reporting on the mutant. The regression for the fitness value of the mutants from replicate selection experiments to the average values across all experiments was calculated and the fitness values in each replicate were scaled to correct for linear differences in the selection values between replicates. These normalized values were then averaged for the final fitness value. Averaged selection coefficients values were evaluated based on two criteria: the standard deviation of the averaged selection coefficients and the number of replicates. Averaged selection coefficients were discarded 1) if the standard deviation over the normalized replicates was greater than 0.5 + 0.25 • |selection coefficient| or 2) if there were fewer than two replicates. In Supplementary file 1 and 2 the fitness is reported as the mean normalized fitness, the standard error is reported as the combined Enrich2 standard error (from linear regression of timepoints), and the standard deviation is reported as the standard deviation of the biological replicates. The correlation and R-values of normalized replicate experiments and the distribution of standard deviations and standard errors for each mutant are reported in Figure 1—figure supplement 3, Figure 1—figure supplement 4, Figure 2—figure supplement 1, Figure 2—figure supplement 2.

Selection was evaluated by comparing selection coefficients to DHFR velocity from reported Michaelis-Menten kinetics at cytosolic concentrations of DHF (Bennett et al., 2009; Kwon et al., 2008). Kinetic values are listed in Figure 1—source data 3. Based on this calibration, differences between selection coefficients below ~−2.5 were not considered interpretable, and a floor value of −2.5 was applied to all selection coefficients for the purpose of analysis.

For subtraction to calculate ∆selection coefficients, null selection coefficients in +Lon selection were substituted with the lowest measured selection coefficient. Mutations with a null selection coefficient in –Lon selection were assigned a ∆selection coefficient of ‘No data’ (colored black). Mutations with ‘No data’ value in either selection condition were also assigned a ∆selection coefficient of ‘No data’ here.

For clustering of positions, an in-house Python script was used for K-means clustering of positions into categories based on general mutational response at a position (i.e discarding the amino acid identities of the mutants). Spatial clustering was performed based on selection coefficients with the distance between two positions calculated in the following steps: 1) sorting the vectors of selection coefficients for each position, 2) trimming the vectors to match vector lengths after discarding ‘no data’ values, 3) calculating a ∆ vector by subtracting the two sorted and trimmed vectors, and finally calculating the distance as the mean of the absolute value of the ∆ vector. For the first round, categories were seeded with virtual positions that have prototypical mutational profiles for the five categories (Beneficial, Tolerant, Mixed, Restricted, and Intolerant). From this first round, all positions in DHFR were categorized into initial clusters. In subsequent rounds, the virtual positions were removed and candidate positions were compared to the non-self positions populating each cluster. The distance between a candidate position and a cluster of positions is calculated as the average of the distance between the candidate position and the three closest non-self positions in the cluster. Clustering was performed over 10 rounds following the initial seeded round, and convergence was confirmed by observing that five repetitions gave identical clusters.

Purification of his₆-tagged DHFR

DHFR variants were expressed from pHis8 plasmids (KR101/SMT301) for nickel affinity purification as described for one DHFR variant. The plasmid bearing the his-tagged DHFR mutant was transformed via heat shock into chemically competent ER2566 ∆folA ∆thyA –Lon cells, then the cells were plated on LB-agar plates containing 50 µg/mL kanamycin (AMRESCO, cat# 0408, CAS: 25389-94-0, 50 mg/mL in ethanol) and 50 mg/mL thymidine. The plates were incubated overnight at 37 °C. The next day 2 mL of LB medium with 50 µg/mL kanamycin was inoculated with a single colony. This culture was incubated overnight at 37 °C at 225 rpm shaking. The next day, 25 mL of TB medium (12 g Bacto-tryptone, 24 g Bacto-yeast extract, 0.4% glycerol v/v (Sigma Aldrich, cat# G7893, CAS: 56-81-5), brought to 900 mL with MilliQ water, autoclaved, cooled, mixed with 100 mL sterile filtered buffered phosphate (0.17 M KH₂PO₄ (Sigma Aldrich, cat# P0662, CAS: 7778-77-0), 0.72 M K₂HPO₄ (Sigma Aldrich, cat# P550, CAS: 16786-57-1))) with 50 µg/mL kanamycin in a 50 mL conical tube was inoculated with 100 µL of the overnight culture. The culture was grown at 37 °C until the OD600 reached 0.5–0.6. Then, the culture was induced with 0.25 mM IPTG (Gold Biotechnology, cat# I2481C100, CAS: 367-93-1, 1M in autoclaved water, sterile filtered) and incubated for 18 hr at 18 °C at 225 rpm shaking. The cultures were pelleted by centrifugation at 3000 rpm for 5 min at 4 °C in a swinging-bucket centrifuge, the supernatant was discarded, and the pellet was resuspended by pipetting in 4 mL/g-pellet of B-PER (ThermoScientific, cat# 78266) with 1 mM PMSF (Millipore Sigma, cat# 7110, CAS: 329-98-6, 100 mM in ethanol), 10 µg/mL leupeptin (VWR Chemicals, cat# J583, CAS: 26305-03-3, 5 mg/mL in water), and 2 µg/mL pepstatin (VWR Chemicals, cat# J580, CAS: 103476-89-7, 2 mg/mL in water). The lysates were incubated at room temperature for 30 min on a rocker and clarified by centrifugation at 3000 rpm for 5 min at 4 °C in a swinging-bucket centrifuge. The lysate supernatant was then transferred to a fresh 50 mL conical tube and incubated for 30 min with 20 µL of NiNTA resin pre-equilibrated in Nickel Binding Buffer (50 mM Tris base (Fisher BioReagents, cat# BP152, CAS: 77-86-1) pH 8.0, 500 mM NaCl, 10 mM imidazole (Fisher Chemical, cat# 03196, CAS: 288-32-4), and then supernatant was removed by pipetting. The resin was washed 3 times for 5 min with 1 mL of Nickel Binding Buffer. Then the protein was eluted into 200 µL of Nickel Elution Buffer (100 mM Tris pH 8.0, 1 M NaCl, 400 mM imidazole) and dialyzed against DHFR Storage Buffer (50 mM Tris pH 8.0, 300 mM NaCl, 1% glycerol v/v) in 3000 Da MW cut-off Slidalyzer dialysis cups (Thermo Scientific, cat# 88401) at 4 °C. After 4 changes of dialysis buffer over 24 hr, the protein was aliquoted, flash frozen in liquid nitrogen, and stored at −80 °C. Proteins were purified to ~90–95% purity as judged from PAGE gel analysis.

In vitro assay for DHFR velocity and Michaelis-Menten kinetics

In vitro measurements of DHFR velocity were carried out by monitoring the change in UV absorbance. For each mutant screened, a purified enzyme aliquot was thawed and centrifuged at 15,000 rpm for 5 min at 4 °C in a benchtop microcentrifuge. The soluble enzyme was then transferred to a fresh tube, and the concentration was measured by UV absorption on a Nanodrop. Molar concentration of DHFR was calculated using an extinction coefficient of 33585 M⁻¹ cm⁻¹ at 280 nm for all variants with the following exceptions: 28085 (W30F/M, W47L/M), 35075 (M42Y, R98Y, L165Y), or 39085 (Q102W) M⁻¹ cm⁻¹. The enzyme was diluted to 555 nM in DHFR storage buffer. A pre-reaction mixture was prepared in MTEN buffer (5 mM MES (Sigma Aldrich, cat# 69889, CAS: 145224-94-8), 25 mM ethanolamine (Sigma Aldrich, cat# E6133, CAS: 2002-24-6), 100 mM NaCl, 25 mM Tris base, pH to 7.0) with 55.5 nM enzyme, 111 µM NADPH (Sigma Aldrich, cat# N7505, CAS: 2646-71-1) and 5 mM DTT (GoldBio, cat# DTT25, CAS: 27565-41-9, 1M in water, sterile filtered). The pre-reaction mixture and a micro quartz cuvette (Fisher Scientific, cat# 14-958-103, 10 mm path length, 2 mm window width) were pre-incubated at 30 °C. The reaction was started by adding 20 µl of 500 µM DHF (Sigma Aldrich, cat# D7006, CAS: 4033-27-6) in MTEN with 5 mM DTT to 180 µL of pre-reaction mixture. The substrate solution was made fresh from a sealed ampule on the day of the experiment. The reaction was briefly mixed by pipetting and then the reaction was monitored by reading the absorbance at 340 nm with an interval of 0.1 s in a Cary 50 spectrophotometer with the Peltier temperature set to 30 °C. The reactions were allowed to run to completion to establish the baseline, which was subtracted from the absorbance values. The real-time concentration of DHF was calculated by dividing the normalized absorbance values by the decrease in absorbance at 340 nm for the reaction, 0.0132 µM⁻¹cm⁻¹, the velocity of the reaction was calculated as the slope of linear regression to a 30 s window with a mean DHF concentration equal to 5, 10, 20, or 30 µM. Final velocities were normalized to enzyme concentration.

Michaelis-Menten kinetics were performed as described above using 1–5 µM DHFR for concentrations of DHFR from 0.5 to 100 µM. Initial velocities were estimated from linear regression to the absorbance divided by the decrease in absorbance at 340 nm for the reaction, and then they were fit to the Michaelis-Menten equation using the non-linear least squares method in R.

Determining DHFR activity and abundance in cell lysates

The cellular activity of DHFR was measured in cell lysates, and then used to calculate DHFR cellular abundance using a method adapted from Guerrero et al., 2019; Rodrigues et al., 2016. For each characterized DHFR variant, a plasmid (WT DHFR in plasmids SMT102, SMT201, SMT202 and SMT205 with modified promoters and RBSs or DHFR single point mutants in the final selection plasmid SMT205, see The Appendix) was transformed via heat shock into chemically competent ER2566 ∆folA ∆thyA ±Lon cells, which were plated on an LB-agar plate with 30 µg/mL chloramphenicol plus 50 µg/mL thymidine and incubated overnight at 37 °C. On the second day, 2 mL M9 medium with supplements for deficient folate metabolism (50 µg/mL thymidine, 22 µg/mL adenosine, 1 µg/mL calcium pantothenate, 38 µg/mL glycine, and 37.25 µg/mL methionine) and 30 µg/mL chloramphenicol in a 14 ml culture tube was inoculated with a single colony scraped from the plate and incubated at 37 °C at 225 rpm shaking for 12–14 hr. Three biological replicates were obtained from separate single colonies at this step, and all biological replicates were processed in parallel for subsequent steps. All assays were run from fresh transformations. 20–50 µL of the previous culture were used to inoculate 20 mL of M9 medium (no supplements) with 30 µg/mL chloramphenicol in a 50 ml conical tube. This fresh culture was incubated for 12–18 hr at 30 °C at 225 rpm shaking until the OD600 value was between 0.3 and 0.5 on a Cary 50 spectrophotometer over a path of 1 cm. The cultures were pelleted by centrifugation at 3000 rpm for 5 min at 4 °C in a swinging-bucket centrifuge, the supernatant was discarded, and the pellet was thoroughly resuspended in 1.1 mL of M9 medium. 1 mL of the resuspension was transferred to a 1.5 mL Eppendorf tube, and the sample was pelleted at 5000 rpm for 5 min at 4 °C in a microcentrifuge. The supernatant was carefully removed from the pellet, and the pellet was stored at −80 °C until the next step. The remained 100 µL of resuspended pellet was mixed with 900 µL and the OD600 value was measured for each pellet to determine the number of cells in the pellet, with a conversion factor of 8 × 10⁸ cells/mL at OD600 = 1.0. Pellets for positive (ER2566) and negative (ER2566 ∆folA ∆thyA ±Lon) control samples were collected in a similar fashion, except that antibiotics were not used and initial plates were streaked from glycerol stocks. Additionally, the M9 medium for ER2566 ∆folA ∆thyA ±Lon contained folate supplements in every step.

Cell pellets were lysed in B-PER with 1 mM PMSF, 10 µg/mL leupeptin, and 2 µg/mL pepstatin. Volumes for lysis were calculated to have consistent lysate concentration according to the formula: lysis volume = (volume of culture for resuspended pellet)・(OD600 of culture)・(30 µL BPER lysis buffer/1 mL culture). Pellets were resuspended by pipetting in the calculated volume, and the lysates were incubated at room temperature for 30 min on a rocker. The lysates were then clarified by centrifuged at 15,000 rpm for 5 min at 4 °C in a benchtop microcentrifuge. Lysates were kept on ice while the reactions were prepared.

Measurements of DHFR activity in lysates were carried out by monitoring the change in UV absorbance in a BioTek Synergy H1 multimode plate reader. A 180 µL pre-reaction mixture was prepared with MTEN buffer (5 mM MES, 25 mM ethanolamine, 100 mM NaCl, 25 mM Tris base, pH to 7.0), 111 µM NADPH, 5 mM DTT, and containing 20 µL lysate. The pre-reaction mixtures in a UV transparent 96-well plate (Grenier Bio-One, cat# 655809) were pre-incubated at 30 °C for 10 min. The substrate solution of 500 µM DHF in MTEN with 5 mM DTT was made freshly from a sealed ampule of DHF on the day of the experiment. The reaction was started by automatic injection of 20 µl of 500 µM DHF in MTEN with 5 mM DTT into each well with pre-reaction mixture. The plate was then orbital shaken for 1 min at 365 rpm with a 2 mm amplitude. The reaction was briefly mixed by pipetting and then the reaction was monitored by reading the absorbance at 340 nm with an interval 1 min for 2 hr while incubating at 30 °C. To establish a baseline for accurate calculation of DHF concentration in each well, 50 µL of 1 µL WT DHFR in DHFR storage buffer was injected into each well, the plate was then orbital shaken for 1 min at 365 rpm with a 2 mm amplitude, and the reactions were allowed to run to completion over 10 min, before a final reading of absorbance at 340 nm was taken. In processing, this baseline value was subtracted from the absorbance values for each well. The real-time concentration of DHF was calculated by dividing the normalized absorbance values by the decrease in absorbance at 340 nm for the reaction, 0.0132 µM⁻¹cm⁻¹, times a correction factor of 1.5 for calibration between the plate reader and the absorbance at 340 nm using a Cary 50 spectrophotometer with a 1 cm pathlength quartz cuvette. The velocity of the reaction was calculated as the slope of linear regression for DHF concentration as a function of time over a window of DHF concentration from 20 to 30 µM. The mean slope of the negative control wells (untransformed ER2566 ∆folA ∆thyA ±Lon) was subtracted from all wells as a baseline. The linear regression of in vitro DHFR reactions using purified enzyme over the same window of DHF concentration from 20 to 30 µM was calculated from measurements described above (section ‘In vitro assay for DHFR velocity’, Supplementary file 6), and the DHFR abundance in each well was calculated from the ratio of activity_lysate/velocity_{purified enzyme}. The number of DHFR molecules per cell was then calculated by dividing the total number of DHFR molecules in each 200 µL of reaction by the number of cells in 20 µL of lysate based on the OD600 measurements.

CD spectroscopy

Samples for circular dichroism (CD) spectroscopy were prepared at a concentration of 10 µM in a buffer of 150 mM NaCl and 50 mM Tris, pH 8.0. CD spectra acquisition and thermal denaturation was carried out in a Jasco J-715 CD spectrometer using a cuvette with a 2 mm pathlength (Starna Cell Inc, cat# 21-Q-2). For each DHFR variant, a pre-denaturation spectra was recorded between 207 nm and 280 nm where the high tension voltage was below 600 V. Thermal denaturation data were collected at 225 nm with a bandwidth of 2 nm, a response time of 8 s, and a resolution of 0.1 °C during heating at a rate of 1 °C/min. When the curve flattened, the sample was removed from the CD spectrometer and the system was returned to 30 °C. The sample was returned to the chamber and allowed to equilibrate for 10 min. A post-denaturation spectrum was recorded after equilibration. Between samples, the cuvette was cleaned with sonication in Hellmanex III (Hellma, cat# 2805939) followed by washing with 50% concentrated nitric acid. Thermal denaturation was found to be only partially reversible based on comparisons of spectra recorded before and after denaturation. Thermal denaturation curves were fit to a sigmoidal model for the calculation of an approximate apparent T_m for all mutants as previously reported (Smith et al., 2013).

Structural representation of DHFR

All images of the DHFR structure were prepared with UCSF Chimera, and volumetric representations were prepared using the MSMS package (Sanner et al., 1996). Solvent accessible surface accessible surface area (SASA) was calculated using the Getarea server (Fraczkiewicz and Braun, 1998) for four crystal structures of DHFR (1RX1, 3QL3, 1RX4, and 1RX5) representing different states in DHFR’s catalytic cycle. All models were downloaded from PDB_REDO (Joosten et al., 2014). For all positions in DHFR, if the residue had <20% SASA in any structure, the residue was classified as buried. All other residues were classified as exposed. Burial classification is reported in Figure 3—source data 1.

The distance between the positions within each mutational response category and sites within the DHFR structure (hydride transfer site, M20 loop, core of the globular domain, and the beta-sheet surface beneath the active site) were determined using a model of the transition state provided by Phil Hanoian (Liu et al., 2013). The representative atom for the hydride transfer site is the hydride atom in the transition state model. The representative atom for the adenine ring is C5 (C18 in the pdb). The representative atom for the core of the globular domain is the alpha carbon of I41. The representative atom for the beta sheet region is the alpha carbon of D114. For all cases, the distance is defined as the distance between the representative atom and the alpha carbon of the target position.

Mean atom neighbors for each residue on a structure were calculated using an in-house python script. The number of non-hydrogen atoms within an 8 Å shell of each non-hydrogen atom in the structure were counted and averaged for all non-hydrogen atoms at each side chain. These values we calculated for four crystal structures of DHFR (PDB IDs: 1RX1, 3QL3, 1RX4, 1RX5) and averaged over the set.

Profile similarity analysis

We downloaded the DHFR alignment from OpenSeq.org (Ovchinnikov et al., 2014), selected all bacterial DHFR sequences, and aligned the E. coli DHFR sequence to the MSA using MUSCLE (Edgar, 2004). This multiple sequence alignment (MSA) is provided in Supplementary file 7. Frequencies for each amino acid at each sequence position in the MSA were calculated from counts in each column, with absent amino acids given an arbitrarily low frequency of 0.0001. To compare the amino acid frequencies from the MSA to the selection coefficients, we first divided the selection coefficients by ln(2) • 18 hr to convert from the Enrich selection coefficients to a ∆doubling rate. We then multiplied the ∆doubling rate by −1 and back-calculated frequencies using Boltzmann weighting using a temperature (0.44 kT for –Lon selection, and 0.47 kT for +Lon selection) that resulted in the mean sequence entropy to be within ±0.01 of that of the MSA (0.50). Then, profile similarity at each sequence position was calculated as 1 – the Jensen-Shannon Divergence of the amino acid frequencies. Profile similarity was determined over columns corresponding to positions 2–158 because the DHFR library begins at position two and the DHFR MSA cuts off after position 158.

Acknowledgements

The authors would like to thank Carol Gross, Melanie Silvis, and Byoung Mo Koo for discussion and for providing Lambda red plasmids; Rama Ranganathan and Victor Salinas for supplying parts and expertise for the construction of the turbidostat; Sharon Hammes-Shiffer and Phil Hanoian for providing QM/MM models of the hydride transfer step; Natasha Carli and Jim McGuire at the Gladstone Institute Genomics Core for performing NextSeq 500 sequencing runs with support from the James B Pendleton Charitable Trust; and Norma Neff, Anna Sellas, and Rene Sit for performing and aiding Miseq and NextSeq 500 sequencing runs at the Chan Zuckerberg Biohub.

Appendix 1

Appendix 1—key resources table.

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Strain, strain background (Escherichia coli)	ER2566	New England Biolabs	Cat# C2566I	Chemically competent cells
Strain, strain background (Escherichia coli)	ER2566 ∆folA/∆thyA (–Lon)	Reynolds et al. Cell 2011		Chemically competent and electrocompetent cells
Strain, strain background (Escherichia coli)	ER2566 ∆folA/∆thyA (+Lon)	This work		Chemically competent and electrocompetent cells
Recombinant DNA reagent	SMT101 (plasmid)	This work		Dual expression of DHFR and TYMS, in vivo assays, chloramphenicol (35 µg/mL final concentration)
Recombinant DNA reagent	SMT201 (plasmid)	This work		SMT101 with TET promter for TYMS, in vivo assays, Chloramphenicol (35 µg/mL final concentration)
Recombinant DNA reagent	SMT205 (plasmid)	This work		SMT201 with mutated RBS for DHFR, in vivo assays, Chloramphenicol (35 µg/mL final concentration)
Recombinant DNA reagent	SMT215 (plasmid)	This work		SMT205 with DHFR-FLAG-tag, western blot, Chloramphenicol (35 µg/mL final concentration)
Recombinant DNA reagent	KR101/SMT301 (plasmid)	Reynolds et al. Cell 2011		His8-tag, Heterologous expression, NiNTA purfication, kanamycin (50 µg/mL final concentration)
Recombinant DNA reagent	pSIM6 (plasmid)	Blomfield et al., 1991		Lambda Red recombinase expression, temperature-sensitive promoter, ampicillin/carbenicilin (100 µg/mL final concentration)
Recombinant DNA reagent	pIB279 (plasmid)	Blomfield et al., 1991		KAN-SacB cassette for positive/negative selection, ampicillin/carbenicilin (100 µg/mL final concentration)
Sequence-based reagent	TetDuet1_sense	This work	Mutagenic PCR primer	ccgCTTAAGtcgaacagaaagtaatcgtattgtacatccctatc
Sequence-based reagent	TetDuet2_anti	This work	Mutagenic PCR primer	gatagggatgtcaatctctatcactgatagggatgtacaatacg
Sequence-based reagent	TetDuet3_sense	This work	Mutagenic PCR primer	agagattgacatccctatcagtgatagagatactgagcacatcag
Sequence-based reagent	TetDuet4_anti	This work	Mutagenic PCR primer	ctttaatgaattcggtcagtgcgtcctgctgatgtgctcagtatctc
Sequence-based reagent	TetDuet5_sense	This work	Mutagenic PCR primer	cactgaccgaattcattaaagaggagaaaggtaccatatggc
Sequence-based reagent	TetDuet_5flanking	This work	Mutagenic PCR primer	ccgcttaagtcgaacagaaag
Sequence-based reagent	TetDuet_3flanking	This work	Mutagenic PCR primer	cggagatctgccatatggtacc
Sequence-based reagent	WT_DHFR_pos2_fwd	This work	Mutagenic PCR primer	NNSAGTCTGATTGCGGCGTTAG
Sequence-based reagent	WT_DHFR_pos2_fwd2	This work	Mutagenic PCR primer	NNSAGTCTGATTGCGGCGTTAG
Sequence-based reagent	WT_DHFR_pos3_fwd	This work	Mutagenic PCR primer	NNSCTGATTGCGGCGTTAGCG
Sequence-based reagent	WT_DHFR_pos4_fwd	This work	Mutagenic PCR primer	NNSATTGCGGCGTTAGCGGTA
Sequence-based reagent	WT_DHFR_pos5_fwd	This work	Mutagenic PCR primer	NNSGCGGCGTTAGCGGTAGAT
Sequence-based reagent	WT_DHFR_pos6_fwd	This work	Mutagenic PCR primer	NNSGCGTTAGCGGTAGATCGC
Sequence-based reagent	WT_DHFR_pos7_fwd	This work	Mutagenic PCR primer	NNSTTAGCGGTAGATCGCGTTATC
Sequence-based reagent	WT_DHFR_pos8_fwd	This work	Mutagenic PCR primer	NNSGCGGTAGATCGCGTTATCG
Sequence-based reagent	WT_DHFR_pos8_fwd2	This work	Mutagenic PCR primer	NNSGCGGTAGATCGCGTTATCG
Sequence-based reagent	WT_DHFR_pos9_fwd	This work	Mutagenic PCR primer	NNSGTAGATCGCGTTATCGGCATG
Sequence-based reagent	WT_DHFR_pos10_fwd	This work	Mutagenic PCR primer	NNSGATCGCGTTATCGGCATGG
Sequence-based reagent	WT_DHFR_pos11_fwd	This work	Mutagenic PCR primer	NNSCGCGTTATCGGCATGGAAAA
Sequence-based reagent	WT_DHFR_pos12_fwd	This work	Mutagenic PCR primer	NNSGTTATCGGCATGGAAAACGC
Sequence-based reagent	WT_DHFR_pos13_fwd	This work	Mutagenic PCR primer	NNSATCGGCATGGAAAACGCC
Sequence-based reagent	WT_DHFR_pos14_fwd	This work	Mutagenic PCR primer	NNSGGCATGGAAAACGCCATG
Sequence-based reagent	WT_DHFR_pos15_fwd	This work	Mutagenic PCR primer	NNSATGGAAAACGCCATGCCG
Sequence-based reagent	WT_DHFR_pos16_fwd	This work	Mutagenic PCR primer	NNSGAAAACGCCATGCCGTGG
Sequence-based reagent	WT_DHFR_pos17_fwd	This work	Mutagenic PCR primer	NNSAACGCCATGCCGTGGAAC
Sequence-based reagent	WT_DHFR_pos18_fwd	This work	Mutagenic PCR primer	NNSGCCATGCCGTGGAACCTG
Sequence-based reagent	WT_DHFR_pos19_fwd	This work	Mutagenic PCR primer	NNSATGCCGTGGAACCTGCCT
Sequence-based reagent	WT_DHFR_pos20_fwd	This work	Mutagenic PCR primer	NNSCCGTGGAACCTGCCTGCC
Sequence-based reagent	WT_DHFR_pos21_fwd	This work	Mutagenic PCR primer	NNSTGGAACCTGCCTGCCGAT
Sequence-based reagent	WT_DHFR_pos22_fwd	This work	Mutagenic PCR primer	NNSAACCTGCCTGCCGATCTC
Sequence-based reagent	WT_DHFR_pos22_fwd2	This work	Mutagenic PCR primer	NNSAACCTGCCTGCCGATCTC
Sequence-based reagent	WT_DHFR_pos23_fwd	This work	Mutagenic PCR primer	NNSCTGCCTGCCGATCTCGCC
Sequence-based reagent	WT_DHFR_pos24_fwd	This work	Mutagenic PCR primer	NNSCCTGCCGATCTCGCCTGG
Sequence-based reagent	WT_DHFR_pos25_fwd	This work	Mutagenic PCR primer	NNSGCCGATCTCGCCTGGTTT
Sequence-based reagent	WT_DHFR_pos26_fwd	This work	Mutagenic PCR primer	NNSGATCTCGCCTGGTTTAAACGC
Sequence-based reagent	WT_DHFR_pos27_fwd	This work	Mutagenic PCR primer	NNSCTCGCCTGGTTTAAACGCAACA
Sequence-based reagent	WT_DHFR_pos28_fwd	This work	Mutagenic PCR primer	NNSGCCTGGTTTAAACGCAACAC
Sequence-based reagent	WT_DHFR_pos29_fwd	This work	Mutagenic PCR primer	NNSTGGTTTAAACGCAACACCTTAAATAAAC
Sequence-based reagent	WT_DHFR_pos30_fwd	This work	Mutagenic PCR primer	NNSTTTAAACGCAACACCTTAAATAAACCCG
Sequence-based reagent	WT_DHFR_pos31_fwd	This work	Mutagenic PCR primer	NNSAAACGCAACACCTTAAATAAACCCGTG
Sequence-based reagent	WT_DHFR_pos32_fwd	This work	Mutagenic PCR primer	NNSCGCAACACCTTAAATAAACCCGT
Sequence-based reagent	WT_DHFR_pos33_fwd	This work	Mutagenic PCR primer	NNSAACACCTTAAATAAACCCGTGATTATGG
Sequence-based reagent	WT_DHFR_pos34_fwd	This work	Mutagenic PCR primer	NNSACCTTAAATAAACCCGTGATTATGGG
Sequence-based reagent	WT_DHFR_pos35_fwd	This work	Mutagenic PCR primer	NNSTTAAATAAACCCGTGATTATGGGCC
Sequence-based reagent	WT_DHFR_pos36_fwd	This work	Mutagenic PCR primer	NNSAATAAACCCGTGATTATGGGCC
Sequence-based reagent	WT_DHFR_pos37_fwd	This work	Mutagenic PCR primer	NNSAAACCCGTGATTATGGGCC
Sequence-based reagent	WT_DHFR_pos38_fwd	This work	Mutagenic PCR primer	NNSCCCGTGATTATGGGCCGC
Sequence-based reagent	WT_DHFR_pos39_fwd	This work	Mutagenic PCR primer	NNSGTGATTATGGGCCGCCATAC
Sequence-based reagent	WT_DHFR_pos40_fwd	This work	Mutagenic PCR primer	NNSATTATGGGCCGCCATACCT
Sequence-based reagent	WT_DHFR_pos41_fwd	This work	Mutagenic PCR primer	NNSATGGGCCGCCATACCTGG
Sequence-based reagent	WT_DHFR_pos42_fwd	This work	Mutagenic PCR primer	NNSGGCCGCCATACCTGGGAA
Sequence-based reagent	WT_DHFR_pos42_fwd2	This work	Mutagenic PCR primer	NNSGGCCGCCATACCTGGGAATC
Sequence-based reagent	WT_DHFR_pos43_fwd	This work	Mutagenic PCR primer	NNSCGCCATACCTGGGAATCAATC
Sequence-based reagent	WT_DHFR_pos43_fwd2	This work	Mutagenic PCR primer	NNSCGCCATACCTGGGAATCAATC
Sequence-based reagent	WT_DHFR_pos44_fwd	This work	Mutagenic PCR primer	NNSCATACCTGGGAATCAATCGGTC
Sequence-based reagent	WT_DHFR_pos45_fwd	This work	Mutagenic PCR primer	NNSACCTGGGAATCAATCGGTC
Sequence-based reagent	WT_DHFR_pos46_fwd	This work	Mutagenic PCR primer	NNSTGGGAATCAATCGGTCGTC
Sequence-based reagent	WT_DHFR_pos47_fwd	This work	Mutagenic PCR primer	NNSGAATCAATCGGTCGTCCGTTG
Sequence-based reagent	WT_DHFR_pos48_fwd	This work	Mutagenic PCR primer	NNSTCAATCGGTCGTCCGTTGC
Sequence-based reagent	WT_DHFR_pos49_fwd	This work	Mutagenic PCR primer	NNSATCGGTCGTCCGTTGCCA
Sequence-based reagent	WT_DHFR_pos50_fwd	This work	Mutagenic PCR primer	NNSGGTCGTCCGTTGCCAGGAC
Sequence-based reagent	WT_DHFR_pos51_fwd	This work	Mutagenic PCR primer	NNSCGTCCGTTGCCAGGACGC
Sequence-based reagent	WT_DHFR_pos52_fwd	This work	Mutagenic PCR primer	NNSCCGTTGCCAGGACGCAAA
Sequence-based reagent	WT_DHFR_pos53_fwd	This work	Mutagenic PCR primer	NNSTTGCCAGGACGCAAAAATATTATCC
Sequence-based reagent	WT_DHFR_pos54_fwd	This work	Mutagenic PCR primer	NNSCCAGGACGCAAAAATATTATCCTCAG
Sequence-based reagent	WT_DHFR_pos55_fwd	This work	Mutagenic PCR primer	NNSGGACGCAAAAATATTATCCTCAGCAG
Sequence-based reagent	WT_DHFR_pos56_fwd	This work	Mutagenic PCR primer	NNSCGCAAAAATATTATCCTCAGCAGTCAA
Sequence-based reagent	WT_DHFR_pos57_fwd	This work	Mutagenic PCR primer	NNSAAAAATATTATCCTCAGCAGTCAACCGG
Sequence-based reagent	WT_DHFR_pos58_fwd	This work	Mutagenic PCR primer	NNSAATATTATCCTCAGCAGTCAACCGGGTA
Sequence-based reagent	WT_DHFR_pos59_fwd	This work	Mutagenic PCR primer	NNSATTATCCTCAGCAGTCAACCG
Sequence-based reagent	WT_DHFR_pos60_fwd	This work	Mutagenic PCR primer	NNSATCCTCAGCAGTCAACCG
Sequence-based reagent	WT_DHFR_pos61_fwd	This work	Mutagenic PCR primer	NNSCTCAGCAGTCAACCGGGT
Sequence-based reagent	WT_DHFR_pos62_fwd	This work	Mutagenic PCR primer	NNSAGCAGTCAACCGGGTACG
Sequence-based reagent	WT_DHFR_pos63_fwd	This work	Mutagenic PCR primer	NNSAGTCAACCGGGTACGGAC
Sequence-based reagent	WT_DHFR_pos64_fwd	This work	Mutagenic PCR primer	NNSCAACCGGGTACGGACGAT
Sequence-based reagent	WT_DHFR_pos65_fwd	This work	Mutagenic PCR primer	NNSCCGGGTACGGACGATCGC
Sequence-based reagent	WT_DHFR_pos66_fwd	This work	Mutagenic PCR primer	NNSGGTACGGACGATCGCGTA
Sequence-based reagent	WT_DHFR_pos66_fwd2	This work	Mutagenic PCR primer	NNSGGTACGGACGATCGCGTAAC
Sequence-based reagent	WT_DHFR_pos67_fwd	This work	Mutagenic PCR primer	NNSACGGACGATCGCGTAACG
Sequence-based reagent	WT_DHFR_pos67_fwd2	This work	Mutagenic PCR primer	NNSACGGACGATCGCGTAACG
Sequence-based reagent	WT_DHFR_pos68_fwd	This work	Mutagenic PCR primer	NNSGACGATCGCGTAACGTGG
Sequence-based reagent	WT_DHFR_pos69_fwd	This work	Mutagenic PCR primer	NNSGATCGCGTAACGTGGGTG
Sequence-based reagent	WT_DHFR_pos70_fwd	This work	Mutagenic PCR primer	NNSCGCGTAACGTGGGTGAAG
Sequence-based reagent	WT_DHFR_pos71_fwd	This work	Mutagenic PCR primer	NNSGTAACGTGGGTGAAGTCGG
Sequence-based reagent	WT_DHFR_pos72_fwd	This work	Mutagenic PCR primer	NNSACGTGGGTGAAGTCGGTG
Sequence-based reagent	WT_DHFR_pos73_fwd	This work	Mutagenic PCR primer	NNSTGGGTGAAGTCGGTGGAT
Sequence-based reagent	WT_DHFR_pos73_fwd2	This work	Mutagenic PCR primer	NNSTGGGTGAAGTCGGTGGATG
Sequence-based reagent	WT_DHFR_pos74_fwd	This work	Mutagenic PCR primer	NNSGTGAAGTCGGTGGATGAAGC
Sequence-based reagent	WT_DHFR_pos74_fwd2	This work	Mutagenic PCR primer	NNSGTGAAGTCGGTGGATGAAGC
Sequence-based reagent	WT_DHFR_pos75_fwd	This work	Mutagenic PCR primer	NNSAAGTCGGTGGATGAAGCCAT
Sequence-based reagent	WT_DHFR_pos76_fwd	This work	Mutagenic PCR primer	NNSTCGGTGGATGAAGCCATC
Sequence-based reagent	WT_DHFR_pos77_fwd	This work	Mutagenic PCR primer	NNSGTGGATGAAGCCATCGCG
Sequence-based reagent	WT_DHFR_pos78_fwd	This work	Mutagenic PCR primer	NNSGATGAAGCCATCGCGGCG
Sequence-based reagent	WT_DHFR_pos79_fwd	This work	Mutagenic PCR primer	NNSGAAGCCATCGCGGCGTGT
Sequence-based reagent	WT_DHFR_pos80_fwd	This work	Mutagenic PCR primer	NNSGCCATCGCGGCGTGTGGT
Sequence-based reagent	WT_DHFR_pos80_fwd2	This work	Mutagenic PCR primer	NNSGCCATCGCGGCGTGTGG
Sequence-based reagent	WT_DHFR_pos81_fwd	This work	Mutagenic PCR primer	NNSATCGCGGCGTGTGGTGAC
Sequence-based reagent	WT_DHFR_pos82_fwd	This work	Mutagenic PCR primer	NNSGCGGCGTGTGGTGACGTA
Sequence-based reagent	WT_DHFR_pos82_fwd2	This work	Mutagenic PCR primer	NNSGCGGCGTGTGGTGACGTACCAGAAATC
Sequence-based reagent	WT_DHFR_pos83_fwd	This work	Mutagenic PCR primer	NNSGCGTGTGGTGACGTACCA
Sequence-based reagent	WT_DHFR_pos84_fwd	This work	Mutagenic PCR primer	NNSTGTGGTGACGTACCAGAAATCAT
Sequence-based reagent	WT_DHFR_pos84_fwd2	This work	Mutagenic PCR primer	NNSTGTGGTGACGTACCAGAAATCATG
Sequence-based reagent	WT_DHFR_pos85_fwd	This work	Mutagenic PCR primer	NNSGGTGACGTACCAGAAATCATGG
Sequence-based reagent	WT_DHFR_pos86_fwd	This work	Mutagenic PCR primer	NNSGACGTACCAGAAATCATGGTGATTGG
Sequence-based reagent	WT_DHFR_pos87_fwd	This work	Mutagenic PCR primer	NNSGTACCAGAAATCATGGTGATTGGCGG
Sequence-based reagent	WT_DHFR_pos88_fwd	This work	Mutagenic PCR primer	NNSCCAGAAATCATGGTGATTGGCGG
Sequence-based reagent	WT_DHFR_pos89_fwd	This work	Mutagenic PCR primer	NNSGAAATCATGGTGATTGGCGGCG
Sequence-based reagent	WT_DHFR_pos89_fwd2	This work	Mutagenic PCR primer	NNSGAAATCATGGTGATTGGCGGC
Sequence-based reagent	WT_DHFR_pos90_fwd	This work	Mutagenic PCR primer	NNSATCATGGTGATTGGCGGC
Sequence-based reagent	WT_DHFR_pos91_fwd	This work	Mutagenic PCR primer	NNSATGGTGATTGGCGGCGGTC
Sequence-based reagent	WT_DHFR_pos92_fwd	This work	Mutagenic PCR primer	NNSGTGATTGGCGGCGGTCGC
Sequence-based reagent	WT_DHFR_pos93_fwd	This work	Mutagenic PCR primer	NNSATTGGCGGCGGTCGCGTTTA
Sequence-based reagent	WT_DHFR_pos94_fwd	This work	Mutagenic PCR primer	NNSGGCGGCGGTCGCGTTTAT
Sequence-based reagent	WT_DHFR_pos95_fwd	This work	Mutagenic PCR primer	NNSGGCGGTCGCGTTTATGAA
Sequence-based reagent	WT_DHFR_pos95_fwd2	This work	Mutagenic PCR primer	NNSGGCGGTCGCGTTTATGAAC
Sequence-based reagent	WT_DHFR_pos96_fwd	This work	Mutagenic PCR primer	NNSGGTCGCGTTTATGAACAGTTCTT
Sequence-based reagent	WT_DHFR_pos97_fwd	This work	Mutagenic PCR primer	NNSCGCGTTTATGAACAGTTCTTGC
Sequence-based reagent	WT_DHFR_pos98_fwd	This work	Mutagenic PCR primer	NNSGTTTATGAACAGTTCTTGCCAAAAGCGC
Sequence-based reagent	WT_DHFR_pos99_fwd	This work	Mutagenic PCR primer	NNSTATGAACAGTTCTTGCCAAAAGCGC
Sequence-based reagent	WT_DHFR_pos100_fwd	This work	Mutagenic PCR primer	NNSGAACAGTTCTTGCCAAAAGCGCAAAAAC
Sequence-based reagent	WT_DHFR_pos101_fwd	This work	Mutagenic PCR primer	NNSCAGTTCTTGCCAAAAGCGCAAAAAC
Sequence-based reagent	WT_DHFR_pos102_fwd	This work	Mutagenic PCR primer	NNSTTCTTGCCAAAAGCGCAAAAAC
Sequence-based reagent	WT_DHFR_pos103_fwd	This work	Mutagenic PCR primer	NNSTTGCCAAAAGCGCAAAAACTGTAT
Sequence-based reagent	WT_DHFR_pos104_fwd	This work	Mutagenic PCR primer	NNSCCAAAAGCGCAAAAACTGTATCTGA
Sequence-based reagent	WT_DHFR_pos104_fwd2	This work	Mutagenic PCR primer	NNSCCAAAAGCGCAAAAACTGTATCTG
Sequence-based reagent	WT_DHFR_pos105_fwd	This work	Mutagenic PCR primer	NNSAAAGCGCAAAAACTGTATCTGACG
Sequence-based reagent	WT_DHFR_pos106_fwd	This work	Mutagenic PCR primer	NNSGCGCAAAAACTGTATCTGACG
Sequence-based reagent	WT_DHFR_pos107_fwd	This work	Mutagenic PCR primer	NNSCAAAAACTGTATCTGACGCATATCGAC
Sequence-based reagent	WT_DHFR_pos107_fwd2	This work	Mutagenic PCR primer	NNSCAAAAACTGTATCTGACGCATATCG
Sequence-based reagent	WT_DHFR_pos108_fwd	This work	Mutagenic PCR primer	NNSAAACTGTATCTGACGCATATCGAC
Sequence-based reagent	WT_DHFR_pos109_fwd	This work	Mutagenic PCR primer	NNSCTGTATCTGACGCATATCGACG
Sequence-based reagent	WT_DHFR_pos110_fwd	This work	Mutagenic PCR primer	NNSTATCTGACGCATATCGACGCA
Sequence-based reagent	WT_DHFR_pos111_fwd	This work	Mutagenic PCR primer	NNSCTGACGCATATCGACGCAG
Sequence-based reagent	WT_DHFR_pos112_fwd	This work	Mutagenic PCR primer	NNSACGCATATCGACGCAGAAGT
Sequence-based reagent	WT_DHFR_pos113_fwd	This work	Mutagenic PCR primer	NNSCATATCGACGCAGAAGTGGAAG
Sequence-based reagent	WT_DHFR_pos114_fwd	This work	Mutagenic PCR primer	NNSATCGACGCAGAAGTGGAAG
Sequence-based reagent	WT_DHFR_pos115_fwd	This work	Mutagenic PCR primer	NNSGACGCAGAAGTGGAAGGC
Sequence-based reagent	WT_DHFR_pos116_fwd	This work	Mutagenic PCR primer	NNSGCAGAAGTGGAAGGCGAC
Sequence-based reagent	WT_DHFR_pos117_fwd	This work	Mutagenic PCR primer	NNSGAAGTGGAAGGCGACACC
Sequence-based reagent	WT_DHFR_pos118_fwd	This work	Mutagenic PCR primer	NNSGTGGAAGGCGACACCCAT
Sequence-based reagent	WT_DHFR_pos118_fwd2	This work	Mutagenic PCR primer	NNSGTGGAAGGCGACACCCATTTC
Sequence-based reagent	WT_DHFR_pos119_fwd	This work	Mutagenic PCR primer	NNSGAAGGCGACACCCATTTCC
Sequence-based reagent	WT_DHFR_pos120_fwd	This work	Mutagenic PCR primer	NNSGGCGACACCCATTTCCCG
Sequence-based reagent	WT_DHFR_pos121_fwd	This work	Mutagenic PCR primer	NNSGACACCCATTTCCCGGATTAC
Sequence-based reagent	WT_DHFR_pos122_fwd	This work	Mutagenic PCR primer	NNSACCCATTTCCCGGATTACGA
Sequence-based reagent	WT_DHFR_pos123_fwd	This work	Mutagenic PCR primer	NNSCATTTCCCGGATTACGAGCC
Sequence-based reagent	WT_DHFR_pos124_fwd	This work	Mutagenic PCR primer	NNSTTCCCGGATTACGAGCCG
Sequence-based reagent	WT_DHFR_pos125_fwd	This work	Mutagenic PCR primer	NNSCCGGATTACGAGCCGGAT
Sequence-based reagent	WT_DHFR_pos126_fwd	This work	Mutagenic PCR primer	NNSGATTACGAGCCGGATGACTG
Sequence-based reagent	WT_DHFR_pos127_fwd	This work	Mutagenic PCR primer	NNSTACGAGCCGGATGACTGG
Sequence-based reagent	WT_DHFR_pos128_fwd	This work	Mutagenic PCR primer	NNSGAGCCGGATGACTGGGAA
Sequence-based reagent	WT_DHFR_pos129_fwd	This work	Mutagenic PCR primer	NNSCCGGATGACTGGGAATCG
Sequence-based reagent	WT_DHFR_pos130_fwd	This work	Mutagenic PCR primer	NNSGATGACTGGGAATCGGTATTCAG
Sequence-based reagent	WT_DHFR_pos131_fwd	This work	Mutagenic PCR primer	NNSGACTGGGAATCGGTATTCAGC
Sequence-based reagent	WT_DHFR_pos131_fwd2	This work	Mutagenic PCR primer	NNSGACTGGGAATCGGTATTCAGC
Sequence-based reagent	WT_DHFR_pos132_fwd	This work	Mutagenic PCR primer	NNSTGGGAATCGGTATTCAGCGAATT
Sequence-based reagent	WT_DHFR_pos133_fwd	This work	Mutagenic PCR primer	NNSGAATCGGTATTCAGCGAATTCCAC
Sequence-based reagent	WT_DHFR_pos134_fwd	This work	Mutagenic PCR primer	NNSTCGGTATTCAGCGAATTCCAC
Sequence-based reagent	WT_DHFR_pos135_fwd	This work	Mutagenic PCR primer	NNSGTATTCAGCGAATTCCACGATG
Sequence-based reagent	WT_DHFR_pos135_fwd2	This work	Mutagenic PCR primer	NNSGTATTCAGCGAATTCCACGATGC
Sequence-based reagent	WT_DHFR_pos136_fwd	This work	Mutagenic PCR primer	NNSTTCAGCGAATTCCACGATGC
Sequence-based reagent	WT_DHFR_pos136_fwd2	This work	Mutagenic PCR primer	NNSTTCAGCGAATTCCACGATGC
Sequence-based reagent	WT_DHFR_pos137_fwd	This work	Mutagenic PCR primer	NNSAGCGAATTCCACGATGCTG
Sequence-based reagent	WT_DHFR_pos138_fwd	This work	Mutagenic PCR primer	NNSGAATTCCACGATGCTGATGC
Sequence-based reagent	WT_DHFR_pos139_fwd	This work	Mutagenic PCR primer	NNSTTCCACGATGCTGATGCG
Sequence-based reagent	WT_DHFR_pos140_fwd	This work	Mutagenic PCR primer	NNSCACGATGCTGATGCGCAG
Sequence-based reagent	WT_DHFR_pos140_fwd2	This work	Mutagenic PCR primer	NNSCACGATGCTGATGCGCAG
Sequence-based reagent	WT_DHFR_pos141_fwd	This work	Mutagenic PCR primer	NNSGATGCTGATGCGCAGAACT
Sequence-based reagent	WT_DHFR_pos142_fwd	This work	Mutagenic PCR primer	NNSGCTGATGCGCAGAACTCTC
Sequence-based reagent	WT_DHFR_pos143_fwd	This work	Mutagenic PCR primer	NNSGATGCGCAGAACTCTCACAG
Sequence-based reagent	WT_DHFR_pos144_fwd	This work	Mutagenic PCR primer	NNSGCGCAGAACTCTCACAGC
Sequence-based reagent	WT_DHFR_pos145_fwd	This work	Mutagenic PCR primer	NNSCAGAACTCTCACAGCTATTGCTTTG
Sequence-based reagent	WT_DHFR_pos146_fwd	This work	Mutagenic PCR primer	NNSAACTCTCACAGCTATTGCTTTGAGATT
Sequence-based reagent	WT_DHFR_pos147_fwd	This work	Mutagenic PCR primer	NNSTCTCACAGCTATTGCTTTGAGATTCT
Sequence-based reagent	WT_DHFR_pos148_fwd	This work	Mutagenic PCR primer	NNSCACAGCTATTGCTTTGAGATTCTGG
Sequence-based reagent	WT_DHFR_pos149_fwd	This work	Mutagenic PCR primer	NNSAGCTATTGCTTTGAGATTCTGGAG
Sequence-based reagent	WT_DHFR_pos150_fwd	This work	Mutagenic PCR primer	NNSTATTGCTTTGAGATTCTGGAGCG
Sequence-based reagent	WT_DHFR_pos151_fwd	This work	Mutagenic PCR primer	NNSTGCTTTGAGATTCTGGAGCG
Sequence-based reagent	WT_DHFR_pos152_fwd	This work	Mutagenic PCR primer	NNSTTTGAGATTCTGGAGCGGC
Sequence-based reagent	WT_DHFR_pos153_fwd	This work	Mutagenic PCR primer	NNSGAGATTCTGGAGCGGCGG
Sequence-based reagent	WT_DHFR_pos154_fwd	This work	Mutagenic PCR primer	NNSATTCTGGAGCGGCGGTAA
Sequence-based reagent	WT_DHFR_pos155_fwd	This work	Mutagenic PCR primer	NNSCTGGAGCGGCGGTAACAG
Sequence-based reagent	WT_DHFR_pos156_fwd	This work	Mutagenic PCR primer	NNSGAGCGGCGGTAACAGGCG
Sequence-based reagent	WT_DHFR_pos157_fwd	This work	Mutagenic PCR primer	NNSCGGCGGTAACAGGCGTCG
Sequence-based reagent	WT_DHFR_pos158_fwd	This work	Mutagenic PCR primer	NNSCGGTAACAGGCGTCGACA
Sequence-based reagent	WT_DHFR_pos159_fwd	This work	Mutagenic PCR primer	NNSTAACAGGCGTCGACAAGCT
Sequence-based reagent	WT_DHFR_pos2_rev	This work	Mutagenic PCR primer	CATGGTATATCTCCTTATTAAAGTTAAA
Sequence-based reagent	WT_DHFR_pos2_rev2	This work	Mutagenic PCR primer	CATGGTATATCTCATTATTAAAGTTAAAC
Sequence-based reagent	WT_DHFR_pos3_rev	This work	Mutagenic PCR primer	GATCATGGTATATCTCCTTATTAAAGTT
Sequence-based reagent	WT_DHFR_pos4_rev	This work	Mutagenic PCR primer	ACTGATCATGGTATATCTCCTTATTAAA
Sequence-based reagent	WT_DHFR_pos5_rev	This work	Mutagenic PCR primer	CAGACTGATCATGGTATATCTCCTTATT
Sequence-based reagent	WT_DHFR_pos6_rev	This work	Mutagenic PCR primer	AATCAGACTGATCATGGTATATCTCCTT
Sequence-based reagent	WT_DHFR_pos7_rev	This work	Mutagenic PCR primer	CGCAATCAGACTGATCATGGTATATCT
Sequence-based reagent	WT_DHFR_pos8_rev	This work	Mutagenic PCR primer	CGCCGCAATCAGACTGATC
Sequence-based reagent	WT_DHFR_pos8_rev2	This work	Mutagenic PCR primer	CGCCGCAATCAGACTGATC
Sequence-based reagent	WT_DHFR_pos9_rev	This work	Mutagenic PCR primer	TAACGCCGCAATCAGACTGA
Sequence-based reagent	WT_DHFR_pos10_rev	This work	Mutagenic PCR primer	CGCTAACGCCGCAATCAG
Sequence-based reagent	WT_DHFR_pos11_rev	This work	Mutagenic PCR primer	TACCGCTAACGCCGCAAT
Sequence-based reagent	WT_DHFR_pos12_rev	This work	Mutagenic PCR primer	ATCTACCGCTAACGCCGC
Sequence-based reagent	WT_DHFR_pos13_rev	This work	Mutagenic PCR primer	GCGATCTACCGCTAACGC
Sequence-based reagent	WT_DHFR_pos14_rev	This work	Mutagenic PCR primer	AACGCGATCTACCGCTAAC
Sequence-based reagent	WT_DHFR_pos15_rev	This work	Mutagenic PCR primer	GATAACGCGATCTACCGCTAAC
Sequence-based reagent	WT_DHFR_pos16_rev	This work	Mutagenic PCR primer	GCCGATAACGCGATCTACC
Sequence-based reagent	WT_DHFR_pos17_rev	This work	Mutagenic PCR primer	CATGCCGATAACGCGATCTAC
Sequence-based reagent	WT_DHFR_pos18_rev	This work	Mutagenic PCR primer	TTCCATGCCGATAACGCG
Sequence-based reagent	WT_DHFR_pos19_rev	This work	Mutagenic PCR primer	GTTTTCCATGCCGATAACGC
Sequence-based reagent	WT_DHFR_pos20_rev	This work	Mutagenic PCR primer	GGCGTTTTCCATGCCGATAACG
Sequence-based reagent	WT_DHFR_pos21_rev	This work	Mutagenic PCR primer	CATGGCGTTTTCCATGCC
Sequence-based reagent	WT_DHFR_pos22_rev	This work	Mutagenic PCR primer	CGGCATGGCGTTTTCCAT
Sequence-based reagent	WT_DHFR_pos22_rev2	This work	Mutagenic PCR primer	CGGCATGGCGTTTTCCATG
Sequence-based reagent	WT_DHFR_pos23_rev	This work	Mutagenic PCR primer	CCACGGCATGGCGTTTTC
Sequence-based reagent	WT_DHFR_pos24_rev	This work	Mutagenic PCR primer	GTTCCACGGCATGGCGTT
Sequence-based reagent	WT_DHFR_pos25_rev	This work	Mutagenic PCR primer	CAGGTTCCACGGCATGGC
Sequence-based reagent	WT_DHFR_pos26_rev	This work	Mutagenic PCR primer	AGGCAGGTTCCACGGCAT
Sequence-based reagent	WT_DHFR_pos27_rev	This work	Mutagenic PCR primer	GGCAGGCAGGTTCCACGG
Sequence-based reagent	WT_DHFR_pos28_rev	This work	Mutagenic PCR primer	ATCGGCAGGCAGGTTCCA
Sequence-based reagent	WT_DHFR_pos29_rev	This work	Mutagenic PCR primer	GAGATCGGCAGGCAGGTT
Sequence-based reagent	WT_DHFR_pos30_rev	This work	Mutagenic PCR primer	GGCGAGATCGGCAGGCAG
Sequence-based reagent	WT_DHFR_pos31_rev	This work	Mutagenic PCR primer	CCAGGCGAGATCGGCAGG
Sequence-based reagent	WT_DHFR_pos32_rev	This work	Mutagenic PCR primer	AAACCAGGCGAGATCGGC
Sequence-based reagent	WT_DHFR_pos33_rev	This work	Mutagenic PCR primer	TTTAAACCAGGCGAGATCGG
Sequence-based reagent	WT_DHFR_pos34_rev	This work	Mutagenic PCR primer	GCGTTTAAACCAGGCGAGAT
Sequence-based reagent	WT_DHFR_pos35_rev	This work	Mutagenic PCR primer	GTTGCGTTTAAACCAGGCGA
Sequence-based reagent	WT_DHFR_pos36_rev	This work	Mutagenic PCR primer	GGTGTTGCGTTTAAACCAGG
Sequence-based reagent	WT_DHFR_pos37_rev	This work	Mutagenic PCR primer	TAAGGTGTTGCGTTTAAACCAGG
Sequence-based reagent	WT_DHFR_pos38_rev	This work	Mutagenic PCR primer	ATTTAAGGTGTTGCGTTTAAACCAGG
Sequence-based reagent	WT_DHFR_pos39_rev	This work	Mutagenic PCR primer	TTTATTTAAGGTGTTGCGTTTAAACCAG
Sequence-based reagent	WT_DHFR_pos40_rev	This work	Mutagenic PCR primer	GGGTTTATTTAAGGTGTTGCGTTTAAAC
Sequence-based reagent	WT_DHFR_pos41_rev	This work	Mutagenic PCR primer	CACGGGTTTATTTAAGGTGTTGCGT
Sequence-based reagent	WT_DHFR_pos42_rev	This work	Mutagenic PCR primer	AATCACGGGTTTATTTAAGGTGTTGC
Sequence-based reagent	WT_DHFR_pos42_rev2	This work	Mutagenic PCR primer	AATCACGGGTTTATTTAAGGTGTTGC
Sequence-based reagent	WT_DHFR_pos43_rev	This work	Mutagenic PCR primer	CATAATCACGGGTTTATTTAAGGTGTTG
Sequence-based reagent	WT_DHFR_pos43_rev2	This work	Mutagenic PCR primer	CATAATCACGGGTTTATTTAAGGTGTTG
Sequence-based reagent	WT_DHFR_pos44_rev	This work	Mutagenic PCR primer	GCCCATAATCACGGGTTTATTTAAGG
Sequence-based reagent	WT_DHFR_pos45_rev	This work	Mutagenic PCR primer	GCGGCCCATAATCACGGG
Sequence-based reagent	WT_DHFR_pos46_rev	This work	Mutagenic PCR primer	ATGGCGGCCCATAATCAC
Sequence-based reagent	WT_DHFR_pos47_rev	This work	Mutagenic PCR primer	GGTATGGCGGCCCATAATC
Sequence-based reagent	WT_DHFR_pos48_rev	This work	Mutagenic PCR primer	CCAGGTATGGCGGCCCATA
Sequence-based reagent	WT_DHFR_pos49_rev	This work	Mutagenic PCR primer	TTCCCAGGTATGGCGGCC
Sequence-based reagent	WT_DHFR_pos50_rev	This work	Mutagenic PCR primer	TGATTCCCAGGTATGGCGGC
Sequence-based reagent	WT_DHFR_pos51_rev	This work	Mutagenic PCR primer	GATTGATTCCCAGGTATGGCGG
Sequence-based reagent	WT_DHFR_pos52_rev	This work	Mutagenic PCR primer	ACCGATTGATTCCCAGGTATG
Sequence-based reagent	WT_DHFR_pos53_rev	This work	Mutagenic PCR primer	ACGACCGATTGATTCCCAG
Sequence-based reagent	WT_DHFR_pos54_rev	This work	Mutagenic PCR primer	CGGACGACCGATTGATTCC
Sequence-based reagent	WT_DHFR_pos55_rev	This work	Mutagenic PCR primer	CAACGGACGACCGATTGATTC
Sequence-based reagent	WT_DHFR_pos56_rev	This work	Mutagenic PCR primer	TGGCAACGGACGACCGAT
Sequence-based reagent	WT_DHFR_pos57_rev	This work	Mutagenic PCR primer	TCCTGGCAACGGACGACC
Sequence-based reagent	WT_DHFR_pos58_rev	This work	Mutagenic PCR primer	GCGTCCTGGCAACGGACG
Sequence-based reagent	WT_DHFR_pos59_rev	This work	Mutagenic PCR primer	TTTGCGTCCTGGCAACGG
Sequence-based reagent	WT_DHFR_pos60_rev	This work	Mutagenic PCR primer	ATTTTTGCGTCCTGGCAAC
Sequence-based reagent	WT_DHFR_pos61_rev	This work	Mutagenic PCR primer	AATATTTTTGCGTCCTGGCAAC
Sequence-based reagent	WT_DHFR_pos62_rev	This work	Mutagenic PCR primer	GATAATATTTTTGCGTCCTGGCAAC
Sequence-based reagent	WT_DHFR_pos63_rev	This work	Mutagenic PCR primer	GAGGATAATATTTTTGCGTCCTGGC
Sequence-based reagent	WT_DHFR_pos64_rev	This work	Mutagenic PCR primer	GCTGAGGATAATATTTTTGCGTCCTG
Sequence-based reagent	WT_DHFR_pos65_rev	This work	Mutagenic PCR primer	ACTGCTGAGGATAATATTTTTGCGTCCT
Sequence-based reagent	WT_DHFR_pos66_rev	This work	Mutagenic PCR primer	TTGACTGCTGAGGATAATATTTTTGCG
Sequence-based reagent	WT_DHFR_pos66_rev2	This work	Mutagenic PCR primer	TTGACTGCTGAGGATAATATTTTTGC
Sequence-based reagent	WT_DHFR_pos67_rev	This work	Mutagenic PCR primer	CGGTTGACTGCTGAGGATAATATTTTTG
Sequence-based reagent	WT_DHFR_pos67_rev2	This work	Mutagenic PCR primer	CGGTTGACTGCTGAGGATAATATTTTTG
Sequence-based reagent	WT_DHFR_pos68_rev	This work	Mutagenic PCR primer	ACCCGGTTGACTGCTGAG
Sequence-based reagent	WT_DHFR_pos69_rev	This work	Mutagenic PCR primer	CGTACCCGGTTGACTGCT
Sequence-based reagent	WT_DHFR_pos70_rev	This work	Mutagenic PCR primer	GTCCGTACCCGGTTGACT
Sequence-based reagent	WT_DHFR_pos71_rev	This work	Mutagenic PCR primer	ATCGTCCGTACCCGGTTG
Sequence-based reagent	WT_DHFR_pos72_rev	This work	Mutagenic PCR primer	GCGATCGTCCGTACCCGG
Sequence-based reagent	WT_DHFR_pos73_rev	This work	Mutagenic PCR primer	TACGCGATCGTCCGTACC
Sequence-based reagent	WT_DHFR_pos73_rev2	This work	Mutagenic PCR primer	TACGCGATCGTCCGTACC
Sequence-based reagent	WT_DHFR_pos74_rev	This work	Mutagenic PCR primer	CGTTACGCGATCGTCCGT
Sequence-based reagent	WT_DHFR_pos74_rev2	This work	Mutagenic PCR primer	CGTTACGCGATCGTCCGTAC
Sequence-based reagent	WT_DHFR_pos75_rev	This work	Mutagenic PCR primer	CCACGTTACGCGATCGTC
Sequence-based reagent	WT_DHFR_pos76_rev	This work	Mutagenic PCR primer	CACCCACGTTACGCGATC
Sequence-based reagent	WT_DHFR_pos77_rev	This work	Mutagenic PCR primer	CTTCACCCACGTTACGCG
Sequence-based reagent	WT_DHFR_pos78_rev	This work	Mutagenic PCR primer	CGACTTCACCCACGTTACG
Sequence-based reagent	WT_DHFR_pos79_rev	This work	Mutagenic PCR primer	CACCGACTTCACCCACGTTAC
Sequence-based reagent	WT_DHFR_pos80_rev	This work	Mutagenic PCR primer	ATCCACCGACTTCACCCACGTTAC
Sequence-based reagent	WT_DHFR_pos80_rev2	This work	Mutagenic PCR primer	ATCCACCGACTTCACCCAC
Sequence-based reagent	WT_DHFR_pos81_rev	This work	Mutagenic PCR primer	TTCATCCACCGACTTCACCCA
Sequence-based reagent	WT_DHFR_pos82_rev	This work	Mutagenic PCR primer	GGCTTCATCCACCGACTTCAC
Sequence-based reagent	WT_DHFR_pos82_rev2	This work	Mutagenic PCR primer	GGCTTCATCCACCGACTTCAC
Sequence-based reagent	WT_DHFR_pos83_rev	This work	Mutagenic PCR primer	GATGGCTTCATCCACCGAC
Sequence-based reagent	WT_DHFR_pos84_rev	This work	Mutagenic PCR primer	CGCGATGGCTTCATCCAC
Sequence-based reagent	WT_DHFR_pos84_rev2	This work	Mutagenic PCR primer	CGCGATGGCTTCATCCAC
Sequence-based reagent	WT_DHFR_pos85_rev	This work	Mutagenic PCR primer	CGCCGCGATGGCTTCATC
Sequence-based reagent	WT_DHFR_pos86_rev	This work	Mutagenic PCR primer	ACACGCCGCGATGGCTTC
Sequence-based reagent	WT_DHFR_pos87_rev	This work	Mutagenic PCR primer	ACCACACGCCGCGATGGC
Sequence-based reagent	WT_DHFR_pos88_rev	This work	Mutagenic PCR primer	GTCACCACACGCCGCGAT
Sequence-based reagent	WT_DHFR_pos89_rev	This work	Mutagenic PCR primer	TACGTCACCACACGCCGC
Sequence-based reagent	WT_DHFR_pos89_rev2	This work	Mutagenic PCR primer	TACGTCACCACACGCCG
Sequence-based reagent	WT_DHFR_pos90_rev	This work	Mutagenic PCR primer	TGGTACGTCACCACACGC
Sequence-based reagent	WT_DHFR_pos91_rev	This work	Mutagenic PCR primer	TTCTGGTACGTCACCACACGC
Sequence-based reagent	WT_DHFR_pos92_rev	This work	Mutagenic PCR primer	GATTTCTGGTACGTCACCACACG
Sequence-based reagent	WT_DHFR_pos93_rev	This work	Mutagenic PCR primer	CATGATTTCTGGTACGTCACCACAC
Sequence-based reagent	WT_DHFR_pos94_rev	This work	Mutagenic PCR primer	CACCATGATTTCTGGTACGTCACC
Sequence-based reagent	WT_DHFR_pos95_rev	This work	Mutagenic PCR primer	AATCACCATGATTTCTGGTACGTCA
Sequence-based reagent	WT_DHFR_pos95_rev2	This work	Mutagenic PCR primer	AATCACCATGATTTCTGGTACGTC
Sequence-based reagent	WT_DHFR_pos96_rev	This work	Mutagenic PCR primer	GCCAATCACCATGATTTCTGGTAC
Sequence-based reagent	WT_DHFR_pos97_rev	This work	Mutagenic PCR primer	GCCGCCAATCACCATGATTT
Sequence-based reagent	WT_DHFR_pos98_rev	This work	Mutagenic PCR primer	ACCGCCGCCAATCACCATGATTTC
Sequence-based reagent	WT_DHFR_pos99_rev	This work	Mutagenic PCR primer	GCGACCGCCGCCAATCAC
Sequence-based reagent	WT_DHFR_pos100_rev	This work	Mutagenic PCR primer	AACGCGACCGCCGCCAAT
Sequence-based reagent	WT_DHFR_pos101_rev	This work	Mutagenic PCR primer	ATAAACGCGACCGCCGCC
Sequence-based reagent	WT_DHFR_pos102_rev	This work	Mutagenic PCR primer	TTCATAAACGCGACCGCC
Sequence-based reagent	WT_DHFR_pos103_rev	This work	Mutagenic PCR primer	CTGTTCATAAACGCGACCG
Sequence-based reagent	WT_DHFR_pos104_rev	This work	Mutagenic PCR primer	GAACTGTTCATAAACGCGACC
Sequence-based reagent	WT_DHFR_pos104_rev2	This work	Mutagenic PCR primer	GAACTGTTCATAAACGCGACCG
Sequence-based reagent	WT_DHFR_pos105_rev	This work	Mutagenic PCR primer	CAAGAACTGTTCATAAACGCGAC
Sequence-based reagent	WT_DHFR_pos106_rev	This work	Mutagenic PCR primer	TGGCAAGAACTGTTCATAAACGC
Sequence-based reagent	WT_DHFR_pos107_rev	This work	Mutagenic PCR primer	TTTTGGCAAGAACTGTTCATAAACG
Sequence-based reagent	WT_DHFR_pos107_rev2	This work	Mutagenic PCR primer	TTTTGGCAAGAACTGTTCATAAACG
Sequence-based reagent	WT_DHFR_pos108_rev	This work	Mutagenic PCR primer	CGCTTTTGGCAAGAACTGTTCATAAA
Sequence-based reagent	WT_DHFR_pos109_rev	This work	Mutagenic PCR primer	TTGCGCTTTTGGCAAGAACT
Sequence-based reagent	WT_DHFR_pos110_rev	This work	Mutagenic PCR primer	TTTTTGCGCTTTTGGCAAGAAC
Sequence-based reagent	WT_DHFR_pos111_rev	This work	Mutagenic PCR primer	CAGTTTTTGCGCTTTTGGCAAG
Sequence-based reagent	WT_DHFR_pos112_rev	This work	Mutagenic PCR primer	ATACAGTTTTTGCGCTTTTGGCAA
Sequence-based reagent	WT_DHFR_pos113_rev	This work	Mutagenic PCR primer	CAGATACAGTTTTTGCGCTTTTGG
Sequence-based reagent	WT_DHFR_pos114_rev	This work	Mutagenic PCR primer	CGTCAGATACAGTTTTTGCGCTTTT
Sequence-based reagent	WT_DHFR_pos115_rev	This work	Mutagenic PCR primer	ATGCGTCAGATACAGTTTTTGCG
Sequence-based reagent	WT_DHFR_pos116_rev	This work	Mutagenic PCR primer	GATATGCGTCAGATACAGTTTTTGCG
Sequence-based reagent	WT_DHFR_pos117_rev	This work	Mutagenic PCR primer	GTCGATATGCGTCAGATACAGTTTTTG
Sequence-based reagent	WT_DHFR_pos118_rev	This work	Mutagenic PCR primer	TGCGTCGATATGCGTCAGATA
Sequence-based reagent	WT_DHFR_pos118_rev2	This work	Mutagenic PCR primer	TGCGTCGATATGCGTCAGATAC
Sequence-based reagent	WT_DHFR_pos119_rev	This work	Mutagenic PCR primer	TTCTGCGTCGATATGCGTCA
Sequence-based reagent	WT_DHFR_pos120_rev	This work	Mutagenic PCR primer	CACTTCTGCGTCGATATGCG
Sequence-based reagent	WT_DHFR_pos121_rev	This work	Mutagenic PCR primer	TTCCACTTCTGCGTCGATATG
Sequence-based reagent	WT_DHFR_pos122_rev	This work	Mutagenic PCR primer	GCCTTCCACTTCTGCGTC
Sequence-based reagent	WT_DHFR_pos123_rev	This work	Mutagenic PCR primer	GTCGCCTTCCACTTCTGC
Sequence-based reagent	WT_DHFR_pos124_rev	This work	Mutagenic PCR primer	GGTGTCGCCTTCCACTTC
Sequence-based reagent	WT_DHFR_pos125_rev	This work	Mutagenic PCR primer	ATGGGTGTCGCCTTCCAC
Sequence-based reagent	WT_DHFR_pos126_rev	This work	Mutagenic PCR primer	GAAATGGGTGTCGCCTTCC
Sequence-based reagent	WT_DHFR_pos127_rev	This work	Mutagenic PCR primer	CGGGAAATGGGTGTCGCC
Sequence-based reagent	WT_DHFR_pos128_rev	This work	Mutagenic PCR primer	ATCCGGGAAATGGGTGTC
Sequence-based reagent	WT_DHFR_pos129_rev	This work	Mutagenic PCR primer	GTAATCCGGGAAATGGGTGTC
Sequence-based reagent	WT_DHFR_pos130_rev	This work	Mutagenic PCR primer	CTCGTAATCCGGGAAATGGG
Sequence-based reagent	WT_DHFR_pos131_rev	This work	Mutagenic PCR primer	CGGCTCGTAATCCGGGAA
Sequence-based reagent	WT_DHFR_pos131_rev2	This work	Mutagenic PCR primer	CGGCTCGTAATCCGGGAAATG
Sequence-based reagent	WT_DHFR_pos132_rev	This work	Mutagenic PCR primer	ATCCGGCTCGTAATCCGG
Sequence-based reagent	WT_DHFR_pos133_rev	This work	Mutagenic PCR primer	GTCATCCGGCTCGTAATCC
Sequence-based reagent	WT_DHFR_pos134_rev	This work	Mutagenic PCR primer	CCAGTCATCCGGCTCGTA
Sequence-based reagent	WT_DHFR_pos135_rev	This work	Mutagenic PCR primer	TTCCCAGTCATCCGGCTC
Sequence-based reagent	WT_DHFR_pos135_rev2	This work	Mutagenic PCR primer	TTCCCAGTCATCCGGCTC
Sequence-based reagent	WT_DHFR_pos136_rev	This work	Mutagenic PCR primer	CGATTCCCAGTCATCCGG
Sequence-based reagent	WT_DHFR_pos136_rev2	This work	Mutagenic PCR primer	CGATTCCCAGTCATCCGGC
Sequence-based reagent	WT_DHFR_pos137_rev	This work	Mutagenic PCR primer	TACCGATTCCCAGTCATCCG
Sequence-based reagent	WT_DHFR_pos138_rev	This work	Mutagenic PCR primer	GAATACCGATTCCCAGTCATCC
Sequence-based reagent	WT_DHFR_pos139_rev	This work	Mutagenic PCR primer	GCTGAATACCGATTCCCAGTC
Sequence-based reagent	WT_DHFR_pos140_rev	This work	Mutagenic PCR primer	TTCGCTGAATACCGATTCCCA
Sequence-based reagent	WT_DHFR_pos140_rev2	This work	Mutagenic PCR primer	TTCGCTGAATACCGATTCCCAG
Sequence-based reagent	WT_DHFR_pos141_rev	This work	Mutagenic PCR primer	GAATTCGCTGAATACCGATTCCC
Sequence-based reagent	WT_DHFR_pos142_rev	This work	Mutagenic PCR primer	GTGGAATTCGCTGAATACCGATTC
Sequence-based reagent	WT_DHFR_pos143_rev	This work	Mutagenic PCR primer	ATCGTGGAATTCGCTGAATACC
Sequence-based reagent	WT_DHFR_pos144_rev	This work	Mutagenic PCR primer	AGCATCGTGGAATTCGCTG
Sequence-based reagent	WT_DHFR_pos145_rev	This work	Mutagenic PCR primer	ATCAGCATCGTGGAATTCGC
Sequence-based reagent	WT_DHFR_pos146_rev	This work	Mutagenic PCR primer	CGCATCAGCATCGTGGAATT
Sequence-based reagent	WT_DHFR_pos147_rev	This work	Mutagenic PCR primer	CTGCGCATCAGCATCGTG
Sequence-based reagent	WT_DHFR_pos148_rev	This work	Mutagenic PCR primer	GTTCTGCGCATCAGCATC
Sequence-based reagent	WT_DHFR_pos149_rev	This work	Mutagenic PCR primer	AGAGTTCTGCGCATCAGC
Sequence-based reagent	WT_DHFR_pos150_rev	This work	Mutagenic PCR primer	GTGAGAGTTCTGCGCATCAG
Sequence-based reagent	WT_DHFR_pos151_rev	This work	Mutagenic PCR primer	GCTGTGAGAGTTCTGCGC
Sequence-based reagent	WT_DHFR_pos152_rev	This work	Mutagenic PCR primer	ATAGCTGTGAGAGTTCTGCG
Sequence-based reagent	WT_DHFR_pos153_rev	This work	Mutagenic PCR primer	GCAATAGCTGTGAGAGTTCTGC
Sequence-based reagent	WT_DHFR_pos154_rev	This work	Mutagenic PCR primer	AAAGCAATAGCTGTGAGAGTTCTG
Sequence-based reagent	WT_DHFR_pos155_rev	This work	Mutagenic PCR primer	CTCAAAGCAATAGCTGTGAGAGTTC
Sequence-based reagent	WT_DHFR_pos156_rev	This work	Mutagenic PCR primer	AATCTCAAAGCAATAGCTGTGAGAGTTC
Sequence-based reagent	WT_DHFR_pos157_rev	This work	Mutagenic PCR primer	CAGAATCTCAAAGCAATAGCTGTGAGAG
Sequence-based reagent	WT_DHFR_pos158_rev	This work	Mutagenic PCR primer	CTCCAGAATCTCAAAGCAATAGCTG
Sequence-based reagent	WT_DHFR_pos159_rev	This work	Mutagenic PCR primer	CCGCTCCAGAATCTCAAAGC
Sequence-based reagent	SL1_FWD	This work	Round one amplicon PCR primer	CACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNACTTTAATAACGAGATATACCATG
Sequence-based reagent	SL1_REV	This work	Round one amplicon PCR primer	TGACTGGAGTTCAGACGTGTGCTCTTCCGATCTNNNNGTATGGCGGCCCATAAT
Sequence-based reagent	SL2_FWD	This work	Round one amplicon PCR primer	CACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNACACCTTAAATAAACCCGTG
Sequence-based reagent	SL2_REV	This work	Round one amplicon PCR primer	TGACTGGAGTTCAGACGTGTGCTCTTCCGATCTNNNNCACGCCGCGATGGC
Sequence-based reagent	SL3_FWD	This work	Round one amplicon PCR primer	CACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNTGAAGTCGGTGGATGAA
Sequence-based reagent	SL3_REV	This work	Round one amplicon PCR primer	TGACTGGAGTTCAGACGTGTGCTCTTCCGATCTNNNNGAAATGGGTGTCGCC
Sequence-based reagent	SL4_FWD	This work	Round one amplicon PCR primer	CACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNCGACGCAGAAGTGGAA
Sequence-based reagent	SL4_REV	This work	Round one amplicon PCR primer	TGACTGGAGTTCAGACGTGTGCTCTTCCGATCTNNNNGCTTGTCGACGCCTG
Sequence-based reagent	D501	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACTATAGCCTACACTCTTTCCCTACACGAC
Sequence-based reagent	D502	Illumina/Reynolds et al. Cell 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACATAGAGGCACACTCTTTCCCTACACGAC
Sequence-based reagent	D503	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACCCTATCCTACACTCTTT CCCTACACGAC
Sequence-based reagent	D504	Illumina/Reynolds et al. Cell 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACGGCTCTGAACACTCTTTCCCTACACGAC
Sequence-based reagent	D505	Illumina/Reynolds et al. Cell 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACAGGCGAAGACACTCTTTCCCTACACGAC
Sequence-based reagent	D506	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACTAATCTTAACACTCTTTCCCTACACGAC
Sequence-based reagent	D507	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACCAGGACGTACACTCTTTCCCTACACGAC
Sequence-based reagent	D508	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	AATGATACGGCGACCACCGAGATCTACACGTACTGACACACTCTTTCCCTACACGAC
Sequence-based reagent	D701	Illumina/Reynolds et al. Cell 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATCGAGTAATGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D702	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATTCTCCGGAGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D703	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATAATGAGCGGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D704	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATGGAATCTCGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D705	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATTTCTGAATGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D706	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATACGAATTCGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D707	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATAGCTTCAGGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D708	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAG ATGCGCATTAGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D709	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATCATAGCCGGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D710	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATTTCGCGGAGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D711	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATGCGCGAGAGTGACTGGAGTTCAGACGTG
Sequence-based reagent	D712	Illumina/Reynolds et al., 2011	Round two amplicon PCR primer	CAAGCAGAAGACGGCATACGAGATCTATCGCTGTGACTGGAGTTCAGACGTG
Sequence-based reagent	KanSacB_round1_fwd	This work	PCR primer	caggcatctggtgaataaTCCTTTTATGATTTTCTATCAAACAAAAGAGG
Sequence-based reagent	KanSacB_round1_rev	This work	PCR primer	tcaatgcgttcagaacgctcaggattcatGCTTGGTCGGTCATTTCGAAC
Sequence-based reagent	KanSacB_round2_fwd/Anderson_promoter_outer_fwd	This work	PCR primer	gtcaaagcaaaccgttgctgatttatggcaagccggaagcgcaacaggcatctggtgaataa
Sequence-based reagent	KanSacB_round2_rev/Anderson_promoter_outer_rev	This work	PCR primer	ccaccacatcgcgcagcggcaatacggggatttcaatgcgttcagaacgctcaggattcat
Sequence-based reagent	Anderson_promoter_outer_fwd/KanSacB_round2_fwd	This work	PCR primer	same as KanSacB_round2_fwd/Anderson_promoter_outer_fwd
Sequence-based reagent	Anderson_promoter_inner_fwd	This work	PCR primer	CCTAGGACTGAGCTAGCTGTCAAcgtcagtatatggggatgtttcccc
Sequence-based reagent	Anderson_promoter_inner_rev	This work	PCR primer	GCTAGCTCAGTCCTAGGTATAATGCTAGCAGGAtacctggcggaaattaaactaagagag
Sequence-based reagent	Anderson_promoter_outer_rev/KanSacB_round2_rev	This work	PCR primer	same as KanSacB_round2_rev/Anderson_promoter_outer_rev

Open in a new tab

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Samuel Thompson, Email: Thompson.SamuelM@gmail.com.

Tanja Kortemme, Email: kortemme@cgl.ucsf.edu.

Sarel Jacob Fleishman, Weizmann Institute of Science, Israel.

Patricia J Wittkopp, University of Michigan, United States.

Funding Information

This paper was supported by the following grants:

National Science Foundation MCB 1615990 to Tanja Kortemme.
Gordon and Betty Moore Foundation GBMF4557 to Kimberly A Reynolds.
National Science Foundation Graduate Student Fellowship to Samuel Thompson.
UCSF Chuan Lyu Chancellor's Fellowship Graduate Student Fellowship to Samuel Thompson.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Software, Formal analysis, Funding acquisition, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Formal analysis, Investigation, Writing - review and editing.

Investigation, Methodology, Writing - review and editing.

Resources, Formal analysis, Supervision, Funding acquisition, Project administration, Writing - review and editing.

Conceptualization, Resources, Formal analysis, Supervision, Funding acquisition, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Supplementary file 1. Selection coefficients for –Lon selection measured as described in Materials and methods are reported with the standard deviation between biological replicates and the standard error from linear regression (as calculated by Enrich2; Rubin et al., 2017).

Values are reported as calculated, but based on the selection calibration, differences between selection coefficients with values below ~–2.5 are not interpretable.

elife-53476-supp1.csv^{(91.4KB, csv)}

Supplementary file 2. Selection coefficients for +Lon selection measured as described in Materials and methods are reported with the standard deviation between biological replicates and the standard error from linear regression.

Values are reported as calculated, but based on the selection calibration, differences between selection coefficients with values below ~–2.5 are not interpretable.

elife-53476-supp2.csv^{(89.9KB, csv)}

Supplementary file 3. Raw deep sequencing counts for the calibration set of mutants –Lon selection.

Counts are recorded for all turbidostat timepoints over three repeats.

elife-53476-supp3.csv^{(2.2KB, csv)}

Supplementary file 4. Raw deep sequencing counts for single point mutants in –Lon selection.

Counts are recorded for a sample collected from the transformation rescue medium, a sample from overnight outgrowth in supplemented M9, and for all turbidostat timepoints over six experiments. In each experiment, 2 of 4 sublibraries were screened as described in Materials and methods, for a total of 3 repeats over the full library.

elife-53476-supp4.csv^{(641.8KB, csv)}

Supplementary file 5. Raw deep sequencing counts for single point mutants in +Lon selection.

Counts are recorded as in Supplementary file 4.

elife-53476-supp5.csv^{(664KB, csv)}

Supplementary file 6. DHFR reaction velocities as a function of DHF concentration used for the measurement of soluble DHFR abundance from lysate activities as described in Materials and methods.

For each mutant (column 1), three technical repeats are included (column 2). There are two lines for each repeat reaction, one for DHF concentration and one for the reaction velocity at that concentration (column 3). All data in columns 4+ are the experimental values.

elife-53476-supp6.csv^{(72.7KB, csv)}

Supplementary file 7. Multiple sequence alignment of bacterial DHFR sequences generated as described in Materials and methods and used for bioinformatics analyses.

elife-53476-supp7.csv^{(248.7KB, csv)}

Transparent reporting form

elife-53476-transrepform.docx^{(249.1KB, docx)}

Data availability

Source data have been provided for Figure 1 (Figure 1—source data 1–3, Supplementary file 1), Figure 2 (Supplementary files 1 and 2), Figure 3 (Figure 3—source data 1, Supplementary files 1 and 2), Figure 4 (Figure 4—source data 1–3, Supplementary file 7) and Figure 5 (Figure 3—source data 1). Code for analysis is available in our GitHub repository for this project (https://github.com/keleayon/2019_DHFR_Lon.git; copy archived at https://github.com/elifesciences-publications/2019_DHFR_Lon) along with key input files and example command lines. Raw deep sequencing data was deposited to the Sequence Read Archive in entry PRJNA590072 (BioSamples: SAMN13316587, SAMN13316662). Allele counts used to generate the selection coefficients (all figures) are reported in Supplementary files 4–6. Key plasmids (Appendix 1) will be available from Addgene.

The following dataset was generated:

Thompson S. 2019. Mapping the mutational landscape of DHFR single point mutants with perturbations to the cellular environment. NCBI BioProject. PRJNA590072

References

Anton BP, Fomenkov A, Raleigh EA, Berkmen M. Complete Genome Sequence of the Engineered Escherichia coli SHuffle Strains and Their Wild-Type Parents. Genome Announcements. 2016;4:16. doi: 10.1128/genomeA.00230-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
Araya CL, Fowler DM, Chen W, Muniez I, Kelly JW, Fields S. A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function. PNAS. 2012;109:16858–16863. doi: 10.1073/pnas.1209751109. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bandaru P, Shah NH, Bhattacharyya M, Barton JP, Kondo Y, Cofsky JC, Gee CL, Chakraborty AK, Kortemme T, Ranganathan R, Kuriyan J. Deconstruction of the ras switching cycle through saturation mutagenesis. eLife. 2017;6:e27810. doi: 10.7554/eLife.27810. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bennett BD, Kimball EH, Gao M, Osterhout R, Van Dien SJ, Rabinowitz JD. Absolute metabolite concentrations and implied enzyme active site occupancy in Escherichia coli. Nature Chemical Biology. 2009;5:593–599. doi: 10.1038/nchembio.186. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bershtein S, Mu W, Serohijos AW, Zhou J, Shakhnovich EI. Protein quality control acts on folding intermediates to shape the effects of mutations on organismal fitness. Molecular Cell. 2013;49:133–144. doi: 10.1016/j.molcel.2012.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bershtein S, Choi JM, Bhattacharyya S, Budnik B, Shakhnovich E. Systems-level response to point mutations in a core metabolic enzyme modulates genotype-phenotype relationship. Cell Reports. 2015a;11:645–656. doi: 10.1016/j.celrep.2015.03.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bershtein S, Serohijos AW, Bhattacharyya S, Manhart M, Choi JM, Mu W, Zhou J, Shakhnovich EI. Protein homeostasis imposes a barrier on functional integration of horizontally transferred genes in Bacteria. PLOS Genetics. 2015b;11:e1005612. doi: 10.1371/journal.pgen.1005612. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bhattacharyya S, Bershtein S, Yan J, Argun T, Gilson AI, Trauger SA, Shakhnovich EI. Transient protein-protein interactions perturb E. coli metabolome and cause gene dosage toxicity. eLife. 2016;5:e20309. doi: 10.7554/eLife.20309. [DOI] [PMC free article] [PubMed] [Google Scholar]
Blomfield IC, Vaughn V, Rest RF, Eisenstein BI. Allelic exchange in Escherichia coli using the Bacillus subtilis sacB gene and a temperature-sensitive pSC101 replicon. Molecular Microbiology. 1991;5:1447–1457. doi: 10.1111/j.1365-2958.1991.tb00791.x. [DOI] [PubMed] [Google Scholar]
Boehr DD, McElheny D, Dyson HJ, Wright PE. The dynamic energy landscape of dihydrofolate reductase catalysis. Science. 2006;313:1638–1642. doi: 10.1126/science.1130258. [DOI] [PubMed] [Google Scholar]
Boucher JI, Bolon DN, Tawfik DS. Quantifying and understanding the fitness effects of protein mutations: laboratory versus nature. Protein Science. 2016;25:1219–1226. doi: 10.1002/pro.2928. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cho Y, Zhang X, Pobre KF, Liu Y, Powers DL, Kelly JW, Gierasch LM, Powers ET. Individual and collective contributions of chaperoning and degradation to protein homeostasis in E. coli. Cell Reports. 2015;11:321–333. doi: 10.1016/j.celrep.2015.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dykhuizen DE, Dean AM, Hartl DL. Metabolic flux and fitness. Genetics. 1987;115:25–31. doi: 10.1093/genetics/115.1.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fierke CA, Benkovic SJ. Probing the functional role of threonine-113 of Escherichia coli dihydrofolate reductase for its effect on turnover efficiency, catalysis, and binding. Biochemistry. 1989;28:478–486. doi: 10.1021/bi00428a011. [DOI] [PubMed] [Google Scholar]
Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nature Methods. 2014;11:801–807. doi: 10.1038/nmeth.3027. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fraczkiewicz R, Braun W. Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules. Journal of Computational Chemistry. 1998;19:319–333. doi: 10.1002/(SICI)1096-987X(199802)19:3<319::AID-JCC6>3.0.CO;2-W. [DOI] [Google Scholar]
Garst AD, Bassalo MC, Pines G, Lynch SA, Halweg-Edwards AL, Liu R, Liang L, Wang Z, Zeitoun R, Alexander WG, Gill RT. Genome-wide mapping of mutations at single-nucleotide resolution for protein, metabolic and genome engineering. Nature Biotechnology. 2017;35:48–55. doi: 10.1038/nbt.3718. [DOI] [PubMed] [Google Scholar]
Guerrero RF, Scarpino SV, Rodrigues JV, Hartl DL, Ogbunugafor CB. Proteostasis Environment Shapes Higher-Order Epistasis Operating on Antibiotic Resistance. Genetics. 2019;212:565–575. doi: 10.1534/genetics.119.302138. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gur E, Sauer RT. Recognition of misfolded proteins by Lon, a AAA+ protease. Genes & Development. 2008;22:2267–2277. doi: 10.1101/gad.1670908. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hietpas RT, Bank C, Jensen JD, Bolon DNA. Shifting fitness landscapes in response to altered environments. Evolution. 2013;67:3512–3522. doi: 10.1111/evo.12207. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huang Z, Wagner CR, Benkovic SJ. Nonadditivity of mutational effects at the folate binding site of Escherichia coli dihydrofolate reductase. Biochemistry. 1994;33:11576–11585. doi: 10.1021/bi00204a020. [DOI] [PubMed] [Google Scholar]
iGEM Registry of standard biological parts. [November 19, 2018];2006 http://parts.igem.org/Promoters/Catalog/Anderson
Iwakura M, Maki K, Takahashi H, Takenawa T, Yokota A, Katayanagi K, Kamiyama T, Gekko K. Evolutional design of a hyperactive cysteine- and methionine-free mutant of Escherichia coli dihydrofolate reductase. Journal of Biological Chemistry. 2006;281:13234–13246. doi: 10.1074/jbc.M508823200. [DOI] [PubMed] [Google Scholar]
Jiang L, Mishra P, Hietpas RT, Zeldovich KB, Bolon DN. Latent effects of Hsp90 mutants revealed at reduced expression levels. PLOS Genetics. 2013;9:e1003600. doi: 10.1371/journal.pgen.1003600. [DOI] [PMC free article] [PubMed] [Google Scholar]
Joosten RP, Long F, Murshudov GN, Perrakis A. The PDB_REDO server for macromolecular structure model optimization. IUCrJ. 2014;1:213–220. doi: 10.1107/S2052252514009324. [DOI] [PMC free article] [PubMed] [Google Scholar]
Klesmith JR, Bacik JP, Wrenbeck EE, Michalczyk R, Whitehead TA. Trade-offs between enzyme fitness and solubility illuminated by deep mutational scanning. PNAS. 2017;114:2265–2270. doi: 10.1073/pnas.1614437114. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kwon YK, Lu W, Melamud E, Khanam N, Bognar A, Rabinowitz JD. A domino effect in antifolate drug action in Escherichia coli. Nature Chemical Biology. 2008;4:602–608. doi: 10.1038/nchembio.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu CT, Hanoian P, French JB, Pringle TH, Hammes-Schiffer S, Benkovic SJ. Functional significance of evolving protein sequence in dihydrofolate reductase from Bacteria to humans. PNAS. 2013;110:10159–10164. doi: 10.1073/pnas.1307130110. [DOI] [PMC free article] [PubMed] [Google Scholar]
Magoč T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27:2957–2963. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mavor D, Barlow K, Thompson S, Barad BA, Bonny AR, Cario CL, Gaskins G, Liu Z, Deming L, Axen SD, Caceres E, Chen W, Cuesta A, Gate RE, Green EM, Hulce KR, Ji W, Kenner LR, Mensa B, Morinishi LS, Moss SM, Mravic M, Muir RK, Niekamp S, Nnadi CI, Palovcak E, Poss EM, Ross TD, Salcedo EC, See SK, Subramaniam M, Wong AW, Li J, Thorn KS, Conchúir SÓ, Roscoe BP, Chow ED, DeRisi JL, Kortemme T, Bolon DN, Fraser JS. Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting. eLife. 2016;5:e18502. doi: 10.7554/eLife.15802. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mavor D, Barlow KA, Asarnow D, Birman Y, Britain D, Chen W, Green EM, Kenner LR, Mensa B, Morinishi LS, Nelson CA, Poss EM, Suresh P, Tian R, Arhar T, Ary BE, Bauer DP, Bergman ID, Brunetti RM, Chio CM, Dai SA, Dickinson MS, Elledge SK, Helsell CVM, Hendel NL, Kang E, Kern N, Khoroshkin MS, Kirkemo LL, Lewis GR, Lou K, Marin WM, Maxwell AM, McTigue PF, Myers-Turnbull D, Nagy TL, Natale AM, Oltion K, Pourmal S, Reder GK, Rettko NJ, Rohweder PJ, Schwarz DMC, Tan SK, Thomas PV, Tibble RW, Town JP, Tsai MK, Ugur FS, Wassarman DR, Wolff AM, Wu TS, Bogdanoff D, Li J, Thorn KS, O'Conchúir S, Swaney DL, Chow ED, Madhani HD, Redding S, Bolon DN, Kortemme T, DeRisi JL, Kampmann M, Fraser JS. Extending chemical perturbations of the ubiquitin fitness landscape in a classroom setting reveals new constraints on sequence tolerance. Biology Open. 2018;7:bio036103. doi: 10.1242/bio.036103. [DOI] [PMC free article] [PubMed] [Google Scholar]
McLaughlin RN, Poelwijk FJ, Raman A, Gosal WS, Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012;491:138–142. doi: 10.1038/nature11500. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miller GP, Wahnon DC, Benkovic SJ. Interloop contacts modulate ligand cycling during catalysis by Escherichia coli dihydrofolate reductase. Biochemistry. 2001;40:867–875. doi: 10.1021/bi001608n. [DOI] [PubMed] [Google Scholar]
Miller GP, Benkovic SJ. Strength of an interloop hydrogen bond determines the kinetic pathway in catalysis by Escherichia coli dihydrofolate reductase. Biochemistry. 1998;37:6336–6342. doi: 10.1021/bi973065w. [DOI] [PubMed] [Google Scholar]
Nicoloff H, Andersson DI. Lon protease inactivation, or translocation of the lon gene, potentiate bacterial evolution to antibiotic resistance. Molecular Microbiology. 2013;90:1233–1248. doi: 10.1111/mmi.12429. [DOI] [PubMed] [Google Scholar]
Ovchinnikov S, Kamisetty H, Baker D. Robust and accurate prediction of residue-residue interactions across protein interfaces using evolutionary information. eLife. 2014;3:e02030. doi: 10.7554/eLife.02030. [DOI] [PMC free article] [PubMed] [Google Scholar]
Oyen D, Fenwick RB, Aoto PC, Stanfield RL, Wilson IA, Dyson HJ, Wright PE. Defining the structural basis for allosteric product release from E. coli Dihydrofolate Reductase Using NMR Relaxation Dispersion. Journal of the American Chemical Society. 2017;139:11233–11240. doi: 10.1021/jacs.7b05958. [DOI] [PMC free article] [PubMed] [Google Scholar]
Powers ET, Powers DL, Gierasch LM. FoldEco: a model for proteostasis in E. coli. Cell Reports. 2012;1:265–276. doi: 10.1016/j.celrep.2012.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
Queitsch C, Sangster TA, Lindquist S. Hsp90 as a capacitor of phenotypic variation. Nature. 2002;417:618–624. doi: 10.1038/nature749. [DOI] [PubMed] [Google Scholar]
Reynolds KA, McLaughlin RN, Ranganathan R. Hot spots for allosteric regulation on protein surfaces. Cell. 2011;147:1564–1575. doi: 10.1016/j.cell.2011.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rodrigues JV, Bershtein S, Li A, Lozovsky ER, Hartl DL, Shakhnovich EI. Biophysical principles predict fitness landscapes of drug resistance. PNAS. 2016;113:E1470–E1478. doi: 10.1073/pnas.1601441113. [DOI] [PMC free article] [PubMed] [Google Scholar]
Roscoe BP, Thayer KM, Zeldovich KB, Fushman D, Bolon DN. Analyses of the effects of all ubiquitin point mutants on yeast growth rate. Journal of Molecular Biology. 2013;425:1363–1377. doi: 10.1016/j.jmb.2013.01.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rubin AF, Gelman H, Lucas N, Bajjalieh SM, Papenfuss AT, Speed TP, Fowler DM. A statistical framework for analyzing deep mutational scanning data. Genome Biology. 2017;18:150. doi: 10.1186/s13059-017-1272-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
saiSree L, Reddy M, Gowrishankar J. IS186 insertion at a hot spot in the lon promoter as a basis for lon protease deficiency of Escherichia coli B: identification of a consensus target sequence for IS186 transposition. Journal of Bacteriology. 2001;183:6943–6946. doi: 10.1128/JB.183.23.6943-6946.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
Salis HM, Mirsky EA, Voigt CA. Automated design of synthetic ribosome binding sites to control protein expression. Nature Biotechnology. 2009;27:946–950. doi: 10.1038/nbt.1568. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sanner MF, Olson AJ, Spehner JC. Reduced surface: an efficient way to compute molecular surfaces. Biopolymers. 1996;38:305–320. doi: 10.1002/(SICI)1097-0282(199603)38:3<305::AID-BIP4>3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]
Sauer RT, Baker TA. AAA+ proteases: atp-fueled machines of protein destruction. Annual Review of Biochemistry. 2011;80:587–612. doi: 10.1146/annurev-biochem-060408-172623. [DOI] [PubMed] [Google Scholar]
Sawaya MR, Kraut J. Loop and subdomain movements in the mechanism of Escherichia coli dihydrofolate reductase: crystallographic evidence. Biochemistry. 1997;36:586–603. doi: 10.1021/bi962337c. [DOI] [PubMed] [Google Scholar]
Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schober AF, Mathis AD, Ingle C, Park JO, Chen L, Rabinowitz JD, Junier I, Rivoire O, Reynolds KA. A Two-Enzyme adaptive unit within bacterial folate metabolism. Cell Reports. 2019;27:3359–3370. doi: 10.1016/j.celrep.2019.05.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
Smith CA, Shi CA, Chroust MK, Bliska TE, Kelly MJS, Jacobson MP, Kortemme T. Design of a phosphorylatable PDZ domain with peptide-specific affinity changes. Structure. 2013;21:54–64. doi: 10.1016/j.str.2012.10.007. [DOI] [PubMed] [Google Scholar]
Steinberg B, Ostermeier M. Shifting fitness and epistatic landscapes reflect Trade-offs along an evolutionary pathway. Journal of Molecular Biology. 2016;428:2730–2743. doi: 10.1016/j.jmb.2016.04.033. [DOI] [PubMed] [Google Scholar]
Stiffler MA, Hekstra DR, Ranganathan R. Evolvability as a function of purifying selection in TEM-1 β-lactamase. Cell. 2015;160:882–892. doi: 10.1016/j.cell.2015.01.035. [DOI] [PubMed] [Google Scholar]
Tenaillon O, Barrick JE, Ribeck N, Deatherage DE, Blanchard JL, Dasgupta A, Wu GC, Wielgoss S, Cruveiller S, Médigue C, Schneider D, Lenski RE. Tempo and mode of genome evolution in a 50,000-generation experiment. Nature. 2016;536:165–170. doi: 10.1038/nature18959. [DOI] [PMC free article] [PubMed] [Google Scholar]
Thomason LC, Sawitzke JA, Li X, Costantino N, Court DL. Recombineering: genetic engineering in Bacteria using homologous recombination. Current Protocols in Molecular Biology. 2014;106:1.16–1-39. doi: 10.1002/0471142727.mb0116s106. [DOI] [PubMed] [Google Scholar]
Thompson S. 2019_DHFR_Lon. c3e2201GitHub. 2020 https://github.com/keleayon/2019_DHFR_Lon
Tinberg CE, Khare SD, Dou J, Doyle L, Nelson JW, Schena A, Jankowski W, Kalodimos CG, Johnsson K, Stoddard BL, Baker D. Computational design of ligand-binding proteins with high affinity and selectivity. Nature. 2013;501:212–216. doi: 10.1038/nature12443. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tokuriki N, Tawfik DS. Chaperonin overexpression promotes genetic variation and enzyme evolution. Nature. 2009;459:668–673. doi: 10.1038/nature08009. [DOI] [PubMed] [Google Scholar]
Whitehead TA, Chevalier A, Song Y, Dreyfus C, Fleishman SJ, De Mattos C, Myers CA, Kamisetty H, Blair P, Wilson IA, Baker D. Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing. Nature Biotechnology. 2012;30:543–548. doi: 10.1038/nbt.2214. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wrenbeck EE, Azouz LR, Whitehead TA. Single-mutation fitness landscapes for an enzyme on multiple substrates reveal specificity is globally encoded. Nature Communications. 2017;8:15695. doi: 10.1038/ncomms15695. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife. doi: 10.7554/eLife.53476.sa1

Decision letter

Editor: Sarel Jacob Fleishman¹

Reviewed by: Sarel Jacob Fleishman²

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The authors combined mutational scanning with structural and biochemical analysis of DHFR against different genetic backgrounds and show how these backgrounds can change the tolerance to mutations. The work provides several important mechanistic insights on the relationship between cellular proteostasis, protein structure and evolution.

Decision letter after peer review:

Thank you for submitting your article "Modulating the cellular context broadly reshapes the mutational landscape of a model enzyme" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Sarel Jacob Fleishman as the Reviewing Editor, and the evaluation has been overseen by Patricia Wittkopp as the Senior Editor.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

Thompson et al. used deep mutational scanning of E. coli DHFR to evaluate how the constraints imposed by the cellular environment modulate the mutational tolerance of the enzyme. To this end, selection coefficients of every possible DHFR amino acid substitution were determined in the absence and presence of Lon protease. The authors demonstrate that Lon dramatically transforms the mutational landscape of DHFR. A particularly interesting finding is that Lon largely suppresses the advantageous mutations that, in the absence of Lon, constitute over 23% of all single point mutations. It is suggested that the observed phenomenon can be explained by extensive activity-stability trade-offs, whereby advantageous mutations increase the DHFR activity, but this improvement in activity comes at the expense of reduced thermodynamic stability that renders the mutants sensitive to Lon degradation. The manuscript is clearly written and interesting and nicely adds to our understanding of the relationship between cellular proteostasis and evolution.

Essential revisions:

1) Bacterial fitness depends on the product of the catalytic proficiency (k_cat/K_M) and intracellular abundance of an essential enzyme (Dykhuzien, Dean and Hartl, 1987). This dependence was also specifically demonstrated for DHFR in E. coli (Bershtein et al., 2015) but isn't discussed in this paper. In the manuscript, the activity of the DHFR mutants is measured as initial velocity at a particular concentration of DHF. However, the comparison between DHFR mutants using this type of measurement is meaningless for mutants that vary substantially in their Km(DHF) values. For example, the reported Km value of L54F is 0.7 μm and that of F31Y is 168 μm – 240 fold higher (Figure 1—source data 3). This means that when the initial velocity for both mutants is measured at 20 μm DHF, the L54F variant operates at a rate close to Vmax, whereas the rate of F31Y is measured way below its Km and, therefore, is far away from its Vmax value. Since the changes in k_cat and K_M amongst DHFR mutants are not necessarily correlated (e.g., the k_cat of variant F31V is close to that of wt but its K_M is 2 orders of magnitude higher, Figure 1—source data 3), the differences in the initial velocities at a given concentration of DHF will be sometimes driven by k_cat and sometimes by K_M. The interpretation of these measurements with respect to bacterial fitness is further muddled by the fact that 1) the intracellular concentrations of the mutants are not known, and 2) the intracellular amounts of DHF can rise as a result of low DHFR abundance and/or activity (Kwon et al., 2008), thus affecting the relative importance of Km. Indeed, roughly half of analyzed adaptive mutations appear to have initial velocities lower than that of wt (Figure 4C and Figure 4—figure supplement 3), although the authors claim that the initial velocities are expected to be correlated with selection coefficients (as shown in Figure 1C for a small subset of mutants). Thus, the way the activity of DHFR mutants is measured does not adequately explain the observed distribution of selection coefficients.

For proper interpretation of the selection coefficients, it is therefore important to measure the intracellular abundance of a selection of DHFR mutants on Lon+/- backgrounds and to measure k_cat and K_M parameters for a subset of advantageous DHFR mutants.

2) Related to point (1) above, the mechanism invoked by the authors to explain why destabilization may increase activity through increased dynamics at the active site is interesting but other mechanisms related to cellular abundance have not been taken into consideration. In particular, DHFR destabilization is known to turn DHFR into a chaperonin client and this interaction may increase cellular levels. As argued in point (1) above, more detailed measurements of cellular abundance and k_cat,K_M determination are needed to produce a consistent interpretation of the results.

3) Results – the authors show that their DMS results are nicely reproducible. However, I don't think that they correlate the DMS results with individually measured selection coefficients (it's not totally clear whether the data shown in Figure 1C is from individual measurements or the DMS). They should do this to establish that the DMS accurately recapitulates individual measurements both in E. coli and for purified protein.

4) The selection system is beautifully designed to allow highly sensitive selection conditions, including the identification of better-than-wt DHFR mutants. The experimental conditions in the paper, however, are likely to be different from those that a wild type strain would face. First, the endogenous promoter of folA regulates the DHFR expression via a negative feedback loop: A drop in DHFR activity/abundance results in the upregulation of its expression (Bershtein, et al., 2015). An interesting question is how the distribution of fitness effects of DHFR mutations will be shaped by the presence of such a regulatory expression element. Second, it was demonstrated that the endogenous DHFR levels in E. coli strain carrying the chromosomal folA gene are very close to the optimal level, as the increase in activity or abundance of DHFR does not increase fitness (Bhattacharyya et al., 2016). The fact that over 23% of single point DHFR mutations increase bacterial fitness suggests that the intracellular DHFR levels in the selection system are far away from the optimum. Third, there is no difference in the DHFR sequence between naturally occurring E. coli B and K-12 strains, even though according to the authors' conclusions, the lack of Lon protease in B strains should have driven the adaptive evolution of DHFR in this strain. It would be helpful if the authors discussed these caveats in the manuscript.

eLife. 2020 Jul 23;9:e53476. doi: 10.7554/eLife.53476.sa2

Author response

Essential revisions:

1) Bacterial fitness depends on the product of the catalytic proficiency (k_cat/K_M) and intracellular abundance of an essential enzyme (Dykhuzien, Dean and Hartl, 1987). This dependence was also specifically demonstrated for DHFR in E. coli (Bershtein et al., 2015) but isn't discussed in this paper. In the manuscript, the activity of the DHFR mutants is measured as initial velocity at a particular concentration of DHF. However, the comparison between DHFR mutants using this type of measurement is meaningless for mutants that vary substantially in their Km(DHF) values. For example, the reported Km value of L54F is 0.7 μm and that of F31Y is 168 μm – 240 fold higher (Figure 1—source data 3). This means that when the initial velocity for both mutants is measured at 20 μm DHF, the L54F variant operates at a rate close to Vmax, whereas the rate of F31Y is measured way below its Km and, therefore, is far away from its Vmax value. Since the changes in k_cat and K_M amongst DHFR mutants are not necessarily correlated (e.g., the k_cat of variant F31V is close to that of wt but its K_M is 2 orders of magnitude higher, Figure 1—source data 3), the differences in the initial velocities at a given concentration of DHF will be sometimes driven by k_cat and sometimes by K_M. The interpretation of these measurements with respect to bacterial fitness is further muddled by the fact that 1) the intracellular concentrations of the mutants are not known, and 2) the intracellular amounts of DHF can rise as a result of low DHFR abundance and/or activity (Kwon et al., 2008), thus affecting the relative importance of Km. Indeed, roughly half of analyzed adaptive mutations appear to have initial velocities lower than that of wt (Figure 4C and Figure 4—figure supplement 3), although the authors claim that the initial velocities are expected to be correlated with selection coefficients (as shown in Figure 1C for a small subset of mutants). Thus, the way the activity of DHFR mutants is measured does not adequately explain the observed distribution of selection coefficients.

For proper interpretation of the selection coefficients, it is therefore important to measure the intracellular abundance of a selection of DHFR mutants on Lon+/- backgrounds and to measure k_cat and K_M parameters for a subset of advantageous DHFR mutants.

We thank the reviewers for this important comment concerning the key factors that affect the selection pressure in our assay. The phrasing in original submission suggested a straight-forward, single explanation (in vitro catalytic activity) for the selection pressure. While there is a correlation between selection coefficients and the in vitro velocity of the mutants used in the calibration of our selection conditions, this relationship is clearly too simplistic. We agree with the reviewers that differences in selection coefficients arise from an interplay of several factors including DHFR catalytic activity, intracellular DHFR abundance, and substrate concentration (in addition to potential other factors such as changes in feedback regulation and chaperone activity, as also pointed out by the reviewers and discussed further below).

As mentioned by the reviewers, (Bershtein et al., 2015) relate growth rate to abundance • k_cat/K_M, but their data (Author response image 1) also illustrate the difficulty with these measurements. The DHFR variants studied by Bershtein exhibit WT-like growth rates that differ by less than a factor of two, and a considerable fraction of the variants appear to be in a plateau region where growth rate is largely independent of changes in abundance • k_cat/K_M, as predicted by a metabolic flux model. However, in the region before the plateau (right inset) where one would expect a dependence of growth rate on DHFR abundance • k_cat/K_M, interpretation becomes difficult because of limits in the accuracy with which the small differences in the relevant parameters can be quantified (abundance measurement errors were not given).

In our revised manuscript we now relate our selection coefficients to both DHFR abundance and catalytic activity. In new experiments, we estimated the intracellular DHFR abundance [DHFR] of 21 DHFR variants using a method adapted from previously used assays (Guerrero et al., 2019; Rodrigues et al., 2016). These new data are presented in Figure 4D and Figure 4—figure supplement 4.To quantify activity and compare relative activities of different DHFR mutants, we previously measured the enzyme velocity_[DHF] at several DHF concentrations for each DHFR mutant. We would like to note that velocity can be a more informative metric for comparing DHFR activity than k_cat/K_M, provided that the in vivo DHFR concentration can be estimated. k_cat/K_M can be misleading as the sole metric for comparing mutants of the same enzyme acting on the same substrate under all conditions because the relative velocities of the mutant enzymes are additionally dependent on the substrate concentration (see for example Figure 2A (Eisenthal, Danson and Hough, 2007): The ratio of relative velocities for two enzyme variants with the same k_cat/K_M ratio can invert with changing substrate concentration). Directly comparing velocities resolves this problem but requires assuming a relevant intracellular concentration of DHF, which we can estimate from existing data. Results from the Rabinowitz lab show that the concentration of reduced folates (DHF and its polyglutamylated derivatives) is in the low tens of µM for exponentially growing E. coli in glucose-rich M9 media (Kwon et al., 2008). While the concentration of reduced folates can rise as a result of low DHFR activity (e.g. reaching ~100 µM after the addition of trimethoprim), the reduced folate concentration returns to approximately the starting concentration within the span of an hour. Because this adaptation to reduced DHFR activity is shorter than the time between our first two timepoints from selection (0 hrs and 2 hrs), we estimate the DHF concentration to be in the tens of µM. (We acknowledge that fully resolving this question would require the measurement of [DHF] for each of the mutants characterized here under the turbidostat growth conditions, ideally at multiple times during an 18-hour growth period to determine if folate levels fluctuate over time. These measurements are highly specialized because of the sensitivity of reduced folates and are beyond our capacity without collaborating with experts. In the absence of such mutant-specific data on in vivo DHF concentrations, k_cat and K_M values would not provide additional information on in vivo DHFR mutant activity.)

With the new data on DHFR intracellular abundance, we can now estimate cellular DHFR activity as [DHFR] • velocity_[DHF]. This new analysis resolves the discrepancies for the subset of advantageous DHFR mutants that previously could not be explained simply by the velocity data alone since the mutants did not show increased velocities compared to wild-type. However, total cellular DHFR activity increases as the interplay of velocity and DHFR abundance: In general, advantageous mutants with lower velocities than wild-type have increased DHFR abundance (left part of Figure 4D), qualitatively explaining increased fitness over wild-type (Figure 4E and Figure 4—figure supplement 8).

In the revision, we have added the following to the Results section:

“We first confirmed that the selected advantageous mutations indeed had higher cytosolic DHFR activity (the total rate of conversion of DHF to THF) in ER2566 ∆folA/∆thyA (–Lon) lysates relative to the activity for WT DHFR (Figure 4—figure supplement 2), consistent with the deep mutational scanning results. […] Moreover, when considering both velocity and abundance the expected total cellular DHFR activity ([DHFR] • velocity) is increased compared to wild-type for the majority of advantageous mutants (Figure 4E, Figure 4—figure supplement 6, positions above the dotted line indicate expected cellular activity greater than wild-type).”

2) Related to point (1) above, the mechanism invoked by the authors to explain why destabilization may increase activity through increased dynamics at the active site is interesting but other mechanisms related to cellular abundance have not been taken into consideration. In particular, DHFR destabilization is known to turn DHFR into a chaperonin client and this interaction may increase cellular levels. As argued in point (1) above, more detailed measurements of cellular abundance and k_cat,K_M determination are needed to produce a consistent interpretation of the results.

Please see our response to the related point (1) above, where we now detail a consistent model for the changed selection coefficients based on both cellular abundance and velocity.

We have also added a brief section on other mechanisms, including that DHFR destabilization is known to turn DHFR into a chaperonin client:

“However, the expected total cellular DHFR activity is not a strong quantitative predictor of the advantageous mutants in –Lon selection (Figure 4—figure supplement 7, Figure 4—figure supplement 8). […] Moreover, Lon suppresses advantageous mutations at least in part by reducing their cellular abundance.”

3) Results – The authors show that their DMS results are nicely reproducible. However, I don't think that they correlate the DMS results with individually measured selection coefficients (it's not totally clear whether the data shown in Figure 1C is from individual measurements or the DMS). They should do this to establish that the DMS accurately recapitulates individual measurements both in E. coli and for purified protein.

The data from Figure 1C were obtained from DMS measurements and we have now clarified this point. As suggested by the reviewer, we have added new data on individually measured selection coefficients, which correlate well with the DMS data (Figure 1—figure supplement 3B). We changed the manuscript as follows:

Legend for Figure 1:

“C) Selection coefficients from deep mutational scanning as a function of enzymatic velocity for purified DHFR point mutants measured in vitro.”

Main text:

“For a panel of 14 DHFR mutants, we confirmed that the selection coefficients obtained from deep mutational scanning correlated linearly with growth rates measured separately for the individual variants in a plate reader (Figure 1—figure supplement 3B, Figure 1—source data 2), as expected. Furthermore, under our controlled selection conditions we observed a linear relationship between selection coefficient and in vitro velocity (Figure 1C) at cytosolic substrate concentrations(Bennett et al., 2009; Kwon et al., 2008) for these DHFR mutants (Figure 1—source data 3).”

4) The selection system is beautifully designed to allow highly sensitive selection conditions, including the identification of better-than-wt DHFR mutants. The experimental conditions in the paper, however, are likely to be different from those that a wild type strain would face. First, the endogenous promoter of folA regulates the DHFR expression via a negative feedback loop: A drop in DHFR activity/abundance results in the upregulation of its expression (Bershtein, et al., 2015). An interesting question is how the distribution of fitness effects of DHFR mutations will be shaped by the presence of such a regulatory expression element. Second, it was demonstrated that the endogenous DHFR levels in E. coli strain carrying the chromosomal folA gene are very close to the optimal level, as the increase in activity or abundance of DHFR does not increase fitness (Bhattacharyya et al., 2016). The fact that over 23% of single point DHFR mutations increase bacterial fitness suggests that the intracellular DHFR levels in the selection system are far away from the optimum. Third, there is no difference in the DHFR sequence between naturally occurring E. coli B and K-12 strains, even though according to the authors' conclusions, the lack of Lon protease in B strains should have driven the adaptive evolution of DHFR in this strain. It would be helpful if the authors discussed these caveats in the manuscript.

We agree that our selection conditions are different from those wild-type DHFR would face in naturally occurring E. coli strains. To discuss these caveats, we have made several changes to the manuscript as described below.

To discuss differences in DHFR abundance, we added new data (Figure 1—figure supplement 2) quantifying DHFR abundance of WT DHFR in our selection strain compared to the endogenous DHFR levels in the parent strain:

“As the basis for our studies, we first sought to establish highly sensitive selection conditions for DHFR function that would be calibrated to DHFR enzymatic velocity (rate of DHF conversion per molecule of DHFR) and capable of resolving mutants with velocities near-to or faster-than wild-type. […] We used an E. coli strain derived from ER2566 with the genes for DHFR and a downstream enzyme, thymidylate synthase, deleted in the genome and complemented on a pACYC-DUET plasmid with a weak ribosome binding site (see Materials and methods) that results in DHFR abundance at approximately 10% of the endogenous protein level (Figure 1—figure supplement 2, Figure 1—source data 1)”.

We also added a section in the Discussion that addresses differences to selection on naturally occurring DHFR sequences and mentions feedback mechanisms:

“The large fraction of advantageous mutations to DHFR appears to conflict with the fixation of the wild-type DHFR sequence during evolution. […] Nevertheless, our engineered selection conditions yielded considerable insights into constraints on mutational landscapes that are typically hidden from observation precisely because of buffering effects in natural contexts.”

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Thompson S. 2019. Mapping the mutational landscape of DHFR single point mutants with perturbations to the cellular environment. NCBI BioProject. PRJNA590072

Supplementary Materials

Figure 1—source data 1. Soluble DHFR expression levels in molecules per cell measured from lysate activity assays as described in Materials and methods.

The location of the DHFR gene is listed in parenthesis in the first column. Expression values corresponds to the cell strain in the column heading.

elife-53476-fig1-data1.xlsx^{(9.8KB, xlsx)}

Figure 1—source data 2. Selection coefficients for –Lon selection (Figure 1—source data 1) compared to monoculture growth rates measured in a plate reader in ER2566 ∆folA/∆thyA (–Lon) as described in Materials and methods.

For values listed as ND, no detectable change in OD was measured during a 30 hr growth period.

elife-53476-fig1-data2.xlsx^{(9.9KB, xlsx)}

Figure 1—source data 3. Michaelis-Menten kinetics for the set of DHFR mutants (Fierke and Benkovic, 1989; Huang et al., 1994; Reynolds et al., 2011) used to calibrate the selection are reported together with the reference from which the values were taken.

elife-53476-fig1-data3.xlsx^{(9.6KB, xlsx)}

Figure 3—source data 1. Burial classification for DHFR positions from the Getarea server (Fraczkiewicz and Braun, 1998) as described in Materials and methods.

elife-53476-fig3-data1.xlsx^{(12.9KB, xlsx)}

Figure 4—source data 1. In vitro velocity for selected advantageous measured as described in Materials and methods at multiple concentrations of DHF are reported with the standard deviation over three independent experiments.

elife-53476-fig4-data1.xlsx^{(11KB, xlsx)}

Figure 4—source data 2. Soluble DHFR abundance levels in molecules per cell measured from lysate activity assays as described in Materials and methods.

All values are for the SMT205 plasmid transformed into the cell strain in the column heading. NM, not measured.

elife-53476-fig4-data2.xlsx^{(10.6KB, xlsx)}

Figure 4—source data 3. Apparent T_m values from thermal denaturation experiments monitored by CD signal at 225 nm are reported along with the ∆selection coefficient (Lon impact) value depicted in Figure 4D.

elife-53476-fig4-data3.xlsx^{(9.7KB, xlsx)}

Values are reported as calculated, but based on the selection calibration, differences between selection coefficients with values below ~–2.5 are not interpretable.

elife-53476-supp1.csv^{(91.4KB, csv)}

Values are reported as calculated, but based on the selection calibration, differences between selection coefficients with values below ~–2.5 are not interpretable.

elife-53476-supp2.csv^{(89.9KB, csv)}

Supplementary file 3. Raw deep sequencing counts for the calibration set of mutants –Lon selection.

Counts are recorded for all turbidostat timepoints over three repeats.

elife-53476-supp3.csv^{(2.2KB, csv)}

Supplementary file 4. Raw deep sequencing counts for single point mutants in –Lon selection.

elife-53476-supp4.csv^{(641.8KB, csv)}

Supplementary file 5. Raw deep sequencing counts for single point mutants in +Lon selection.

Counts are recorded as in Supplementary file 4.

elife-53476-supp5.csv^{(664KB, csv)}

Supplementary file 6. DHFR reaction velocities as a function of DHF concentration used for the measurement of soluble DHFR abundance from lysate activities as described in Materials and methods.

elife-53476-supp6.csv^{(72.7KB, csv)}

Supplementary file 7. Multiple sequence alignment of bacterial DHFR sequences generated as described in Materials and methods and used for bioinformatics analyses.

elife-53476-supp7.csv^{(248.7KB, csv)}

Transparent reporting form

elife-53476-transrepform.docx^{(249.1KB, docx)}

Data Availability Statement

The following dataset was generated:

Thompson S. 2019. Mapping the mutational landscape of DHFR single point mutants with perturbations to the cellular environment. NCBI BioProject. PRJNA590072

[bib1] Anton BP, Fomenkov A, Raleigh EA, Berkmen M. Complete Genome Sequence of the Engineered Escherichia coli SHuffle Strains and Their Wild-Type Parents. Genome Announcements. 2016;4:16. doi: 10.1128/genomeA.00230-16. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Araya CL, Fowler DM, Chen W, Muniez I, Kelly JW, Fields S. A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function. PNAS. 2012;109:16858–16863. doi: 10.1073/pnas.1209751109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] Bandaru P, Shah NH, Bhattacharyya M, Barton JP, Kondo Y, Cofsky JC, Gee CL, Chakraborty AK, Kortemme T, Ranganathan R, Kuriyan J. Deconstruction of the ras switching cycle through saturation mutagenesis. eLife. 2017;6:e27810. doi: 10.7554/eLife.27810. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Bennett BD, Kimball EH, Gao M, Osterhout R, Van Dien SJ, Rabinowitz JD. Absolute metabolite concentrations and implied enzyme active site occupancy in Escherichia coli. Nature Chemical Biology. 2009;5:593–599. doi: 10.1038/nchembio.186. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] Bershtein S, Mu W, Serohijos AW, Zhou J, Shakhnovich EI. Protein quality control acts on folding intermediates to shape the effects of mutations on organismal fitness. Molecular Cell. 2013;49:133–144. doi: 10.1016/j.molcel.2012.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Bershtein S, Choi JM, Bhattacharyya S, Budnik B, Shakhnovich E. Systems-level response to point mutations in a core metabolic enzyme modulates genotype-phenotype relationship. Cell Reports. 2015a;11:645–656. doi: 10.1016/j.celrep.2015.03.051. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Bershtein S, Serohijos AW, Bhattacharyya S, Manhart M, Choi JM, Mu W, Zhou J, Shakhnovich EI. Protein homeostasis imposes a barrier on functional integration of horizontally transferred genes in Bacteria. PLOS Genetics. 2015b;11:e1005612. doi: 10.1371/journal.pgen.1005612. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Bhattacharyya S, Bershtein S, Yan J, Argun T, Gilson AI, Trauger SA, Shakhnovich EI. Transient protein-protein interactions perturb E. coli metabolome and cause gene dosage toxicity. eLife. 2016;5:e20309. doi: 10.7554/eLife.20309. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Blomfield IC, Vaughn V, Rest RF, Eisenstein BI. Allelic exchange in Escherichia coli using the Bacillus subtilis sacB gene and a temperature-sensitive pSC101 replicon. Molecular Microbiology. 1991;5:1447–1457. doi: 10.1111/j.1365-2958.1991.tb00791.x. [DOI] [PubMed] [Google Scholar]

[bib10] Boehr DD, McElheny D, Dyson HJ, Wright PE. The dynamic energy landscape of dihydrofolate reductase catalysis. Science. 2006;313:1638–1642. doi: 10.1126/science.1130258. [DOI] [PubMed] [Google Scholar]

[bib11] Boucher JI, Bolon DN, Tawfik DS. Quantifying and understanding the fitness effects of protein mutations: laboratory versus nature. Protein Science. 2016;25:1219–1226. doi: 10.1002/pro.2928. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Cho Y, Zhang X, Pobre KF, Liu Y, Powers DL, Kelly JW, Gierasch LM, Powers ET. Individual and collective contributions of chaperoning and degradation to protein homeostasis in E. coli. Cell Reports. 2015;11:321–333. doi: 10.1016/j.celrep.2015.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Dykhuizen DE, Dean AM, Hartl DL. Metabolic flux and fitness. Genetics. 1987;115:25–31. doi: 10.1093/genetics/115.1.25. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Fierke CA, Benkovic SJ. Probing the functional role of threonine-113 of Escherichia coli dihydrofolate reductase for its effect on turnover efficiency, catalysis, and binding. Biochemistry. 1989;28:478–486. doi: 10.1021/bi00428a011. [DOI] [PubMed] [Google Scholar]

[bib16] Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nature Methods. 2014;11:801–807. doi: 10.1038/nmeth.3027. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] Fraczkiewicz R, Braun W. Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules. Journal of Computational Chemistry. 1998;19:319–333. doi: 10.1002/(SICI)1096-987X(199802)19:3<319::AID-JCC6>3.0.CO;2-W. [DOI] [Google Scholar]

[bib18] Garst AD, Bassalo MC, Pines G, Lynch SA, Halweg-Edwards AL, Liu R, Liang L, Wang Z, Zeitoun R, Alexander WG, Gill RT. Genome-wide mapping of mutations at single-nucleotide resolution for protein, metabolic and genome engineering. Nature Biotechnology. 2017;35:48–55. doi: 10.1038/nbt.3718. [DOI] [PubMed] [Google Scholar]

[bib19] Guerrero RF, Scarpino SV, Rodrigues JV, Hartl DL, Ogbunugafor CB. Proteostasis Environment Shapes Higher-Order Epistasis Operating on Antibiotic Resistance. Genetics. 2019;212:565–575. doi: 10.1534/genetics.119.302138. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] Gur E, Sauer RT. Recognition of misfolded proteins by Lon, a AAA+ protease. Genes & Development. 2008;22:2267–2277. doi: 10.1101/gad.1670908. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Hietpas RT, Bank C, Jensen JD, Bolon DNA. Shifting fitness landscapes in response to altered environments. Evolution. 2013;67:3512–3522. doi: 10.1111/evo.12207. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] Huang Z, Wagner CR, Benkovic SJ. Nonadditivity of mutational effects at the folate binding site of Escherichia coli dihydrofolate reductase. Biochemistry. 1994;33:11576–11585. doi: 10.1021/bi00204a020. [DOI] [PubMed] [Google Scholar]

[bib23] iGEM Registry of standard biological parts. [November 19, 2018];2006 http://parts.igem.org/Promoters/Catalog/Anderson

[bib24] Iwakura M, Maki K, Takahashi H, Takenawa T, Yokota A, Katayanagi K, Kamiyama T, Gekko K. Evolutional design of a hyperactive cysteine- and methionine-free mutant of Escherichia coli dihydrofolate reductase. Journal of Biological Chemistry. 2006;281:13234–13246. doi: 10.1074/jbc.M508823200. [DOI] [PubMed] [Google Scholar]

[bib25] Jiang L, Mishra P, Hietpas RT, Zeldovich KB, Bolon DN. Latent effects of Hsp90 mutants revealed at reduced expression levels. PLOS Genetics. 2013;9:e1003600. doi: 10.1371/journal.pgen.1003600. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Joosten RP, Long F, Murshudov GN, Perrakis A. The PDB_REDO server for macromolecular structure model optimization. IUCrJ. 2014;1:213–220. doi: 10.1107/S2052252514009324. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] Klesmith JR, Bacik JP, Wrenbeck EE, Michalczyk R, Whitehead TA. Trade-offs between enzyme fitness and solubility illuminated by deep mutational scanning. PNAS. 2017;114:2265–2270. doi: 10.1073/pnas.1614437114. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Kwon YK, Lu W, Melamud E, Khanam N, Bognar A, Rabinowitz JD. A domino effect in antifolate drug action in Escherichia coli. Nature Chemical Biology. 2008;4:602–608. doi: 10.1038/nchembio.108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Liu CT, Hanoian P, French JB, Pringle TH, Hammes-Schiffer S, Benkovic SJ. Functional significance of evolving protein sequence in dihydrofolate reductase from Bacteria to humans. PNAS. 2013;110:10159–10164. doi: 10.1073/pnas.1307130110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] Magoč T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27:2957–2963. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib31] Mavor D, Barlow K, Thompson S, Barad BA, Bonny AR, Cario CL, Gaskins G, Liu Z, Deming L, Axen SD, Caceres E, Chen W, Cuesta A, Gate RE, Green EM, Hulce KR, Ji W, Kenner LR, Mensa B, Morinishi LS, Moss SM, Mravic M, Muir RK, Niekamp S, Nnadi CI, Palovcak E, Poss EM, Ross TD, Salcedo EC, See SK, Subramaniam M, Wong AW, Li J, Thorn KS, Conchúir SÓ, Roscoe BP, Chow ED, DeRisi JL, Kortemme T, Bolon DN, Fraser JS. Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting. eLife. 2016;5:e18502. doi: 10.7554/eLife.15802. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] Mavor D, Barlow KA, Asarnow D, Birman Y, Britain D, Chen W, Green EM, Kenner LR, Mensa B, Morinishi LS, Nelson CA, Poss EM, Suresh P, Tian R, Arhar T, Ary BE, Bauer DP, Bergman ID, Brunetti RM, Chio CM, Dai SA, Dickinson MS, Elledge SK, Helsell CVM, Hendel NL, Kang E, Kern N, Khoroshkin MS, Kirkemo LL, Lewis GR, Lou K, Marin WM, Maxwell AM, McTigue PF, Myers-Turnbull D, Nagy TL, Natale AM, Oltion K, Pourmal S, Reder GK, Rettko NJ, Rohweder PJ, Schwarz DMC, Tan SK, Thomas PV, Tibble RW, Town JP, Tsai MK, Ugur FS, Wassarman DR, Wolff AM, Wu TS, Bogdanoff D, Li J, Thorn KS, O'Conchúir S, Swaney DL, Chow ED, Madhani HD, Redding S, Bolon DN, Kortemme T, DeRisi JL, Kampmann M, Fraser JS. Extending chemical perturbations of the ubiquitin fitness landscape in a classroom setting reveals new constraints on sequence tolerance. Biology Open. 2018;7:bio036103. doi: 10.1242/bio.036103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] McLaughlin RN, Poelwijk FJ, Raman A, Gosal WS, Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012;491:138–142. doi: 10.1038/nature11500. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] Miller GP, Wahnon DC, Benkovic SJ. Interloop contacts modulate ligand cycling during catalysis by Escherichia coli dihydrofolate reductase. Biochemistry. 2001;40:867–875. doi: 10.1021/bi001608n. [DOI] [PubMed] [Google Scholar]

[bib35] Miller GP, Benkovic SJ. Strength of an interloop hydrogen bond determines the kinetic pathway in catalysis by Escherichia coli dihydrofolate reductase. Biochemistry. 1998;37:6336–6342. doi: 10.1021/bi973065w. [DOI] [PubMed] [Google Scholar]

[bib36] Nicoloff H, Andersson DI. Lon protease inactivation, or translocation of the lon gene, potentiate bacterial evolution to antibiotic resistance. Molecular Microbiology. 2013;90:1233–1248. doi: 10.1111/mmi.12429. [DOI] [PubMed] [Google Scholar]

[bib37] Ovchinnikov S, Kamisetty H, Baker D. Robust and accurate prediction of residue-residue interactions across protein interfaces using evolutionary information. eLife. 2014;3:e02030. doi: 10.7554/eLife.02030. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib38] Oyen D, Fenwick RB, Aoto PC, Stanfield RL, Wilson IA, Dyson HJ, Wright PE. Defining the structural basis for allosteric product release from E. coli Dihydrofolate Reductase Using NMR Relaxation Dispersion. Journal of the American Chemical Society. 2017;139:11233–11240. doi: 10.1021/jacs.7b05958. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Powers ET, Powers DL, Gierasch LM. FoldEco: a model for proteostasis in E. coli. Cell Reports. 2012;1:265–276. doi: 10.1016/j.celrep.2012.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] Queitsch C, Sangster TA, Lindquist S. Hsp90 as a capacitor of phenotypic variation. Nature. 2002;417:618–624. doi: 10.1038/nature749. [DOI] [PubMed] [Google Scholar]

[bib41] Reynolds KA, McLaughlin RN, Ranganathan R. Hot spots for allosteric regulation on protein surfaces. Cell. 2011;147:1564–1575. doi: 10.1016/j.cell.2011.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] Rodrigues JV, Bershtein S, Li A, Lozovsky ER, Hartl DL, Shakhnovich EI. Biophysical principles predict fitness landscapes of drug resistance. PNAS. 2016;113:E1470–E1478. doi: 10.1073/pnas.1601441113. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] Roscoe BP, Thayer KM, Zeldovich KB, Fushman D, Bolon DN. Analyses of the effects of all ubiquitin point mutants on yeast growth rate. Journal of Molecular Biology. 2013;425:1363–1377. doi: 10.1016/j.jmb.2013.01.032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] Rubin AF, Gelman H, Lucas N, Bajjalieh SM, Papenfuss AT, Speed TP, Fowler DM. A statistical framework for analyzing deep mutational scanning data. Genome Biology. 2017;18:150. doi: 10.1186/s13059-017-1272-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] saiSree L, Reddy M, Gowrishankar J. IS186 insertion at a hot spot in the lon promoter as a basis for lon protease deficiency of Escherichia coli B: identification of a consensus target sequence for IS186 transposition. Journal of Bacteriology. 2001;183:6943–6946. doi: 10.1128/JB.183.23.6943-6946.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] Salis HM, Mirsky EA, Voigt CA. Automated design of synthetic ribosome binding sites to control protein expression. Nature Biotechnology. 2009;27:946–950. doi: 10.1038/nbt.1568. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib47] Sanner MF, Olson AJ, Spehner JC. Reduced surface: an efficient way to compute molecular surfaces. Biopolymers. 1996;38:305–320. doi: 10.1002/(SICI)1097-0282(199603)38:3<305::AID-BIP4>3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]

[bib48] Sauer RT, Baker TA. AAA+ proteases: atp-fueled machines of protein destruction. Annual Review of Biochemistry. 2011;80:587–612. doi: 10.1146/annurev-biochem-060408-172623. [DOI] [PubMed] [Google Scholar]

[bib49] Sawaya MR, Kraut J. Loop and subdomain movements in the mechanism of Escherichia coli dihydrofolate reductase: crystallographic evidence. Biochemistry. 1997;36:586–603. doi: 10.1021/bi962337c. [DOI] [PubMed] [Google Scholar]

[bib50] Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib51] Schober AF, Mathis AD, Ingle C, Park JO, Chen L, Rabinowitz JD, Junier I, Rivoire O, Reynolds KA. A Two-Enzyme adaptive unit within bacterial folate metabolism. Cell Reports. 2019;27:3359–3370. doi: 10.1016/j.celrep.2019.05.030. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib52] Smith CA, Shi CA, Chroust MK, Bliska TE, Kelly MJS, Jacobson MP, Kortemme T. Design of a phosphorylatable PDZ domain with peptide-specific affinity changes. Structure. 2013;21:54–64. doi: 10.1016/j.str.2012.10.007. [DOI] [PubMed] [Google Scholar]

[bib53] Steinberg B, Ostermeier M. Shifting fitness and epistatic landscapes reflect Trade-offs along an evolutionary pathway. Journal of Molecular Biology. 2016;428:2730–2743. doi: 10.1016/j.jmb.2016.04.033. [DOI] [PubMed] [Google Scholar]

[bib54] Stiffler MA, Hekstra DR, Ranganathan R. Evolvability as a function of purifying selection in TEM-1 β-lactamase. Cell. 2015;160:882–892. doi: 10.1016/j.cell.2015.01.035. [DOI] [PubMed] [Google Scholar]

[bib55] Tenaillon O, Barrick JE, Ribeck N, Deatherage DE, Blanchard JL, Dasgupta A, Wu GC, Wielgoss S, Cruveiller S, Médigue C, Schneider D, Lenski RE. Tempo and mode of genome evolution in a 50,000-generation experiment. Nature. 2016;536:165–170. doi: 10.1038/nature18959. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] Thomason LC, Sawitzke JA, Li X, Costantino N, Court DL. Recombineering: genetic engineering in Bacteria using homologous recombination. Current Protocols in Molecular Biology. 2014;106:1.16–1-39. doi: 10.1002/0471142727.mb0116s106. [DOI] [PubMed] [Google Scholar]

[bib57] Thompson S. 2019_DHFR_Lon. c3e2201GitHub. 2020 https://github.com/keleayon/2019_DHFR_Lon

[bib58] Tinberg CE, Khare SD, Dou J, Doyle L, Nelson JW, Schena A, Jankowski W, Kalodimos CG, Johnsson K, Stoddard BL, Baker D. Computational design of ligand-binding proteins with high affinity and selectivity. Nature. 2013;501:212–216. doi: 10.1038/nature12443. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib59] Tokuriki N, Tawfik DS. Chaperonin overexpression promotes genetic variation and enzyme evolution. Nature. 2009;459:668–673. doi: 10.1038/nature08009. [DOI] [PubMed] [Google Scholar]

[bib60] Whitehead TA, Chevalier A, Song Y, Dreyfus C, Fleishman SJ, De Mattos C, Myers CA, Kamisetty H, Blair P, Wilson IA, Baker D. Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing. Nature Biotechnology. 2012;30:543–548. doi: 10.1038/nbt.2214. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib61] Wrenbeck EE, Azouz LR, Whitehead TA. Single-mutation fitness landscapes for an enzyme on multiple substrates reveal specificity is globally encoded. Nature Communications. 2017;8:15695. doi: 10.1038/ncomms15695. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Altered expression of a quality control protease in E. coli reshapes the in vivo mutational landscape of a model enzyme

Samuel Thompson

Yang Zhang

Christine Ingle

Kimberly A Reynolds

Tanja Kortemme

Roles

Abstract

Introduction

Results

Figure 1. E. coli DHFR deep mutational scanning uncovers many advantageous mutations.

Figure 1—figure supplement 1. Conformations adopted during the DHFR catalytic cycle: 1RX1, 3QL3, 1RX4, and 1RX5) and a QMMM model of the hydride transfer step (Liu et al., 2013) represent the conformational states adopted by DHFR over the catalytic cycle.

Figure 1—figure supplement 2. Soluble WT DHFR cellular abundance for endogenous (chromosomal) DHFR in the parental strain and DHFR expressed from plasmids in the selection system.

Figure 1—figure supplement 3. Determination of selection coefficients for DHFR.

Figure 1—figure supplement 4. Variation in selection coefficients for –Lon selection.

Figure 1—figure supplement 5. Residues previously known to have a functional role shown on the DHFR structure.

Figure 1—figure supplement 6. Growth curves for top advantageous mutations.

Figure 1—figure supplement 7. Example positions with multiple advantageous mutations hypothesized to be destabilizing, shown on the DHFR structure.

Figure 2. Lon protease expression reshapes the mutational landscape.

Figure 2—figure supplement 1. Quality of the selection under +Lon conditions.

Figure 2—figure supplement 2. Relationship between error and selection coefficient for +Lon selection.

Figure 2—figure supplement 3. Comparison of selection coefficients ±Lon Scatterplot comparing selection coefficients in –Lon and +Lon selection, showing that mutations are generally repressed by Lon activity.

Figure 2—figure supplement 4. Ranks of the wild-type amino acid residues in ±Lon selections.

Figure 2—figure supplement 5. Comparison of DHFR per-position sequence preferences.

Figure 3. Delta selection coefficients show Lon impact.

Figure 3—figure supplement 1. ∆selection coefficients.

Figure 4. Advantageous mutations arise from an interplay of increased enzymatic velocity and increased abundance in the absence of Lon.

Figure 4—figure supplement 1. Structural context for hotspot residues from Figure 4.

Figure 4—figure supplement 2. Lysate activity for DHFR wild-type and point mutants on the selection plasmid.

Figure 4—figure supplement 3. In vitro velocities of purified DHFR wild-type and point mutants.

Figure 4—figure supplement 4. Soluble cellular abundance for DHFR wild-type and point mutants on the selection plasmid.

Figure 4—figure supplement 5. Lon impact as ∆selection coefficient versus change in DHFR abundance ±Lon.

Figure 4—figure supplement 6. Cellular abundance versus in vitro velocity for DHFR wild-type and point mutants.

Figure 4—figure supplement 7. Selection coefficient compared to predictions of DHFR wild-type and point mutant activity from cellular abundance and in vitro velocity measurements.

Figure 4—figure supplement 8. Zoom in for Selection coefficient compared to predictions of DHFR wild-type and point mutant activity from cellular abundance and in vitro velocity measurements.

Figure 4—figure supplement 9. Thermal denaturation curves monitored by CD signal at 225 m for selected hotspot mutants.

Figure 5. Structural characterization of multiple constraints on the DHFR mutational landscape.

Figure 5—figure supplement 1. Selection coefficients under the two Lon expression regimes mapped on the DHFR structure.

Figure 5—figure supplement 2. Burial of residues within each mutation response category reported as the mean number of atomic neighbors.

Figure 5—figure supplement 3. Residues in mutational response categories in the –Lon selection as a function of distance from several sites in the DHFR structure.

Discussion

Materials and methods

Generation of plasmids for in vivo selection assay

Generation of plasmid libraries

Generation of individual point mutant plasmids

Generation of ER2566 ∆folA ∆thyA –Lon and ER2566 ∆folA ∆thyA +Lon

Plate reader assay for E. coli growth

Deep mutational scanning experiments

Amplicon generation

Sequencing for deep mutational scanning experiments

Analysis of deep mutational scanning data

Purification of his6-tagged DHFR

In vitro assay for DHFR velocity and Michaelis-Menten kinetics

Determining DHFR activity and abundance in cell lysates

CD spectroscopy

Structural representation of DHFR

Profile similarity analysis

Acknowledgements

Appendix 1

Appendix 1—key resources table.

Funding Statement

Contributor Information

Funding Information

Additional information

Competing interests

Author contributions

Additional files

Data availability

References

Decision letter

Roles

Author response

Author response image 1.

Associated Data

Data Citations

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

Purification of his₆-tagged DHFR