Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2019 Aug 9;47(18):9609–9618. doi: 10.1093/nar/gkz698

Enhancement of LacI binding in vivo

Manyu Du 1,2, Seth Kodner 3, Lu Bai 1,2,4,
PMCID: PMC6765135  PMID: 31396617

Abstract

Transcription factors (TFs) bind to specific sequences in DNA to regulate transcription. Despite extensive measurements of TFs’ dissociation constant (Kd) in vitro, their apparent Kdin vivo are usually unknown. LacI, a bacterial TF, is often used to artificially recruit proteins onto eukaryotic genomes. As LacI binds tightly to its recognition site (LacO) in vitro with a Kd about 10 picomolar (pM), it is often assumed that LacI also has high affinity to LacO in vivo. In this work, we measured LacI binding in living yeast cells using a fluorescent repressor operator system and found an apparent Kd of ∼0.6 μM, four orders of magnitude higher than that in vitro. By genetically altering (i) GFP-LacI structure, (ii) GFP-LacI stability, (iii) chromosome accessibility and (iv) LacO sequence, we reduced the apparent Kd to <10 nM. It turns out that the GFP tagging location and the fusion protein stability have a large effect on LacI binding, but surprisingly, chromosome accessibility only plays a mild role. These findings contribute to our quantitative understanding of the features that affect the apparent Kd of TF in cells. They also provide guidance for future design of more specific chromosomal recruitment through high-affinity TFs.

INTRODUCTION

Transcription initiation starts with the binding of transcription factors (TFs) to their target sites in promoters and enhancers. The stability and specificity of TF binding are related to their binding affinities, which can be modulated to achieve desirable transcription activities (1–3). TF binding affinity is also an important consideration for synthetic systems where a protein of interest is fused with an exogenous TF to be recruited to its recognition motifs engineered into the genome. In these cases, TFs with high binding affinities are often used to ensure strong recruitment to the target sites while minimizing the off-target effect.

TF binding affinities, or the dissociation constants, Kd, have been extensively measured in vitro with purified TFs and DNA containing their recognition sequences (4). In vivo, the apparent Kd (defined as the concentration of a TF at which 50% of the binding sites are occupied), may be further affected by chromatin accessibility, protein stability, interference from other factors and non-specific DNA, etc. The apparent Kd of most TFs in the cells are unknown, as the in vivo measurement is technically challenging. A recent study showed that the apparent Kd of glucocorticoid receptor (GR) is three orders of magnitude higher in vivo than in vitro (5), illustrating that Kd measured in vitro may not correctly predict the binding behavior of TFs in cells.

The dimeric Escherichia coli lac repressor (LacI) used in this study binds to its recognition site (LacO) with a Kd of ∼10 pM in vitro (6). Because of such high affinity, LacI is widely used in eukaryotic cells for targeted recruitment. The fluorescent repressor operator system (FROS), for example, uses the GFP-LacI fusion to label a chromosome locus containing an array of LacO repeats, and such labeled locus can be tracked as a diffraction-limited fluorescent dot (7,8). The apparent Kd of GFP-LacI to LacO is an important parameter in this system because it determines the signal-to-noise ratio, i.e. lower apparent Kd means stronger dot signal with lower background. However, to our knowledge, the apparent Kd of GFP-LacI has not been measured in eukaryotic cells.

Here, we used FROS to measure the apparent Kd of LacI binding by varying the concentration of GFP-LacI and quantifying the fraction of bound LacO through imaging. The LacI in the initial system we used shows an apparent Kd of ∼0.6 μM, four orders of magnitude higher than the in vitro value. To reduce the apparent Kd, we genetically altered (i) GFP-LacI structure, (ii) GFP-LacI stability, (iii) chromosome accessibility and (iv) LacO sequence, and evaluated how each of these factors affect LacI binding in vivo. By combining the genetic traits that favor binding, we reduced the apparent Kd of LacI by 40-fold to <10 nM. These results contribute to our understanding of factors that can influence the TF binding in vivo. It turns out that the location of GFP tagging and the stability of the fusion protein have a large effect on LacI binding. To our knowledge, this is the first time that the relation between the degradation rate and apparent Kd is measured experimentally. In contrast, the presence of nucleosome only mildly reduces LacI binding, which is surprising given that nucleosome can change the Kd of bacterial TFs by ∼5 orders of magnitude in vitro (9). Practically, we also provided an improved recruitment system with increased specificity. More specifically, our new FROS setup will allow the use of shorter LacO arrays (less perturbation to the genome) and/or shorter exposure to excitation during fluorescent imaging (less photo damage to the cells) without compromising the signal-to-noise ratio.

MATERIALS AND METHODS

Plasmid and strain construction

Standard methods were used to construct the strains and plasmids. Plasmids used in the study were derived from pRS yeast shuttle vectors. Plasmid pSR13 which contains the 256× LacO repeats was provided by Dr Gasser. The LacO templates used in Chromatin immunoprecipitation (ChIP) assays were synthesized by Integrated DNA Technology. Plasmids containing the new 64× or 32× LacO array was cloned through a method developed by Dr Belmont (10). More specifically, we started with a plasmid containing two LacO sites cloned into pRS403 in between SalI and XhoI. We then prepared two digestion reactions on this plasmid: one with BamHI and SalI; the other with BamHI and XhoI. Due to the compatible sticky ends generated by SalI and XhoI, they generated a ligation product that could not be recut with either enzyme. This way, we doubled the number of LacO sites by repeating the digestion and ligation procedures. With increasing copy number of LacO sites, the plasmids became increasingly unstable due to recombination. For the arrays in Figures 4F and 6, the longest we could obtain contain 64× LacO and 32× Osys sites. These arrays were integrated into yeast through a method developed by Rohner et al. (11). Yeast strains used in the study were derived from W303. The unstable GFP-LacI was generated by fusing with a CLN2 PEST degron, which leads to rapid degradation through the SCFGrr1–mediated ubiquitination (12). See Supplementary Table SI and II for detailed information of strains and plasmids.

Figure 4.

Figure 4.

The apparent Kd of LacI is mildly enhanced by increasing chromatin accessibility. (A) LacO templates for testing the nucleosome effect on LacI binding. This template contains two LacO sites flanked by two consensus or mutated (*) Abf1 motifs. (B) Methylation protection assay over the two templates in A in strains lacking LacI. For every ‘C’ in the ‘GC’ context, its conversion level to ‘T’ is plotted. Only the unmethylated (protected) ‘C’s will be converted to ‘T’. The conversion level thus corresponds to the protection level against methylation. The three ‘GC’s over the LacO sites are more protected on the template with Abf1* (hatched bar) than with wt Abf1 (solid bar), indicating that it has higher nucleosome coverage. (C) ChIP measurement of Histone H3 over the two templates in A. Consistent with B, H3 is more enriched on the template with Abf1*. (D) ChIP measurements of LacI over the two templates. LacI shows higher enrichment on the LacO flanked by the wt Abf1 sites. Error bars shown here represent standard error of the mean (SEM) among three biological repeats. (E) Imaging data of HIS3pr-GFP-LacI-degron binding on Abf1-LacO array (64× LacO) versus original LacO array (256×). Abf1-LacO array is generated by duplicating the template with the wt Abf1 in A with 32 repeats. We observed brighter dots over the Abf1-LacO array despite the reduced number of LacO sites. (F) Kd measurement of the GFP-LacI binding on the two arrays above (N = 165 for the Abf1-LacO array, and 268 for the original array). The Abf1-LacO array mildly reduces the apparent Kd (P- value < 0.0001).

Figure 6.

Figure 6.

The apparent Kd of LacI is reduced by over 60-fold after combining all the genetic traits that favor LacI binding. (A) Scheme of a new FROS, including the REV1pr driving the expression of a stable LacI-GFP and an array of 32 copies of Osys flanked by Abf1 sites. (B) Fluorescent images of the new FROS at different GFP-LacI concentrations. (C) Apparent Kd measurement of the new FROS. The normalized dot intensity is plotted against the background GFP-LacI concentration in individual cells (N = 112) with a bin size of 100 nM. The LacO array is fully occupied even at the lowest LacI concentration, indicating that the apparent Kd of the system is <10 nM. A binding curve of Kd = 5 nM is shown for comparison.

Fluorescence microscopy

We used the instrumentation and data acquisition platform as described in a previous study (13). Briefly, cells were grown in synthetic complete (SCD) liquid media at 30°C to an OD660 ∼0.2, washed, diluted to an OD660 ∼0.1, and then transferred onto a coverslip with a SCD agarose pad. After further growth on the pad for 2–3 h, the sample was put under the fluorescent microscope for imaging. Nine z-stacks were taken with an exposure time of 0.2 s and light intensity of 30% at the excitation wavelength of 488 nm. For all our experiments, unlabeled cells were imaged side-by-side with labeled sample to quantify the level of auto-fluorescence.

Image analysis

We developed Matlab programs to analyze the z-stack fluorescent images (see Supplementary Figure S1 for example) (14). After using the phase image to annotate the cell boundaries, we first generated a composite image where each pixel within the cell boundaries takes on the maximum intensity of this pixel among the z-stack. We analyzed this image to find the location of the chromosome dots. We then went back to the initial z-stack images, generated their intensity scan (minus the auto-fluorescence level of unlabeled cells) and quantified the dot/background intensity at each z position (Supplementary Figure S1). There was a small fraction of cells with extra-bright dots (intensities above 1.8-fold of the average level), presumably due to co-localization of the two LacO arrays after DNA replication, and these dots were discarded in the following analysis.

We next used the highest dot intensity among the z-stacks and the corresponding background intensity to calculate the free protein concentration. For cells with saturated LacI-GFP binding,

graphic file with name M1.gif

Here, Nbound is the number of GFP molecules associated with the LacO array under saturation, which we assumed to be twice the amount of LacO sites (e.g. 512 for the 256× array). The dot intensity is the calculated as the volume underneath the peak in the intensity scan (Supplementary Figure S1B), representing all the photons generated by the bound GFP. For the free LacI-GFP, its concentration is not completely homogenous in the cell due to organelles like vacuole. To best represent the free proteins in the nucleus, we quantified the ‘total background intensity’ as the integrated GFP intensity in a ring enclosing the dot (Supplementary Figure S1C). ‘Area’ in the formula above is the area of the ring; DOF is the ‘depth of field’, which we estimated to be 1.6 μm (Supplementary Figure S2). For cells with unsaturated LacI-GFP binding, the Nbound and Nfree can simply be computed by comparing the dot/background intensity with the saturated case.

Validation of our measurement of concentration and apparent Kd

To show our apparent Kd quantification is internally consistent, we measured the saturated dot intensity on five LacO arrays with different repeat number (32, 72, 144, 256 and 288) as well as the apparent Kd on two of these arrays (72 and 144). To compare our estimated GFP-LacI concentration to previously published concentration of endogenous proteins, we collected eight datasets of protein copy number per cell measured in synthetic media (15–22), and converted them to concentrations using the average haploid yeast volume 42 μm3 (23). Five of these databases were collected using fluorescence measurement of GFP labeled endogenous proteins, and the other three used quantitative mass spectrometry. We imaged two GFP-labeled endogenous proteins, Ser2 and Rnr4, and compared their fluorescence intensity and reported concentration to those of GFP-LacI.

ChIP assay

The ChIP protocol was modified from a previously described method (13). Cells were grown in 50 ml SCD to reach OD660 ∼0.4 and then crosslinked by formaldehyde (final concentration 1%) for 20 min. After quenched with 2.7 ml of 2.5 M glycine, these cells were harvested by centrifugation at 2000 rpm for 10 min at 4°C. The cells were then resuspended in 250 μl of FA-lysis buffer containing proteinase inhibitors and disrupted by 250 μl of glass beads for 30 min. We punched a hole at the bottom of the tube and centrifuge the sample to get rid of the glass beads, and the whole cell extract was sonicated to fragment chromatin to lengths typically in between 200 and 800 bp. We centrifuged the sonicated samples at 14 000 rpm for 20 min to get rid of the cell debris and transferred the supernatant to a new tube, where it was incubated with anti-GFP antibody (Abcam 250) for 10–12 h at 4°C and then with Protein A/G PLUS-Agarose beads for 4 h (Santa Cruz Biotechnology, sc-2003). An aliquot of the supernatant was saved for input control. We extracted DNA from the input and immunoprecipitated samples and quantified them by quantitative polymerase chain reaction (qPCR) analysis. All ChIP experiments were repeated three times.

Measurement of nucleosome occupancy using DNA methylation

Protocols were modified from a previously described method (24). Strains containing the LacO binding sites and M. cviPI methyltransferase were grown in YPD media overnight to saturation. They were washed twice with water on the next morning and then diluted to an OD660 of 0.1 in 40 ml SCD media. After 2–3 h when the OD660 was reached ∼0.2, we added 40 ml fresh SCD and split each culture into two. We added 4 l of 1 mM β-estradiol to one culture for induction of the DNA methylation and 4 μl of 100% ethanol to the other culture to serve as the control. After 2 h, we purified the genomic DNA through standard phenol extraction. The genomic DNA was then treated with the reagents in the EZ DNA methylation-lightning kit (ZYMO RESEARCH) so that the unmethylated cytosine is converted into uracil. We then performed PCR on the converted genomic DNA template and sequenced the PCR products of regions of interest through Sanger sequencing. The nucleosome occupancy is indicated by the phred value of thymine. The higher conversion rate from cytosine (C) to thymine (T) represents higher nucleosome occupancy.

RESULTS

Large apparent Kd of LacI in a commonly used FROS

We defined the ‘apparent Kd’ as the concentration of LacI at which 50% of the LacO sites are occupied. To measure the apparent Kdin vivo using FROS, we constructed strains containing a 256× LacO array and GFP-LacI at different concentrations (Figure 1A), The LacI protein used in the FROS lacks the tetramerization domain, and under physiological salt condition, its affinity to the wt LacO site vitro has a Kd of ∼10 pM (6). Using the original construct with GFP-LacI driven by the HIS3 promoter, the LacO array can be visualized as bright ‘chromosome dots’ with high GFP background (Figure 1B). To lower the steady-state level of GFP-LacI, we generated a destabilized version of GFP-LacI (GFP-LacI-degron) (25,26), which reduces the background GFP intensity by 10-fold (Figure 1A and B). We also constructed a strain with intermediate GFP-LacI concentration by integrating four copies of the GFP-LacI-degron into the genome. Besides these genetic manipulations, we took advantage of the natural cell-to-cell variability in the GFP-LacI concentration for the apparent Kd measurement.

Figure 1.

Figure 1.

LacI shows large apparent Kd in a commonly used FROS. (A) Scheme of FROS where GFP labeled LacI is recruited to arrays of LacO with 256 repeats (256×; same notation is used below). GFP-LacI is driven by HIS3 promoter ± the destabilization sequence (deg). These constructs were integrated into the yeast genome with different copy numbers to generate variable concentrations of the fusion protein. (B) Fluorescent images of the FROS at different GFP-LacI concentrations. All images are shown with the same brightness and contrast (same as below). (C) Measurement of the amount of bound versus free GFP-LacI. The right panel shows the intensity profile of the highlighted cell in the left panel, where the peak reflects the fluorescence generated by the bound GFP-LacI, and the background by the free ones. See Supplementary Figure S1 and ‘Materials and Methods’ section for detailed quantification method. (D) Measurement of the dissociation constant, Kd. The box plot shows the normalized dot intensity versus background GFP-LacI concentration measured in individual cells (total number of cells: N = 268). The data were binned into 100 nM windows and fitted by a Hill function with Kd = 592 ±16 nM and hill coefficient n = 2.93 ± 0.42. Error bars shown here represent 5th to 95th percentile (same as below).

We quantified the intensity of the chromosome dot (representing the amount of bound GFP-LacI) and the background intensity (representing the amount of free GFP-LacI in the vicinity of the dot) for each single cell (Figure 1C; Supplementary Figures S1 and 2). In yeast cells containing the stable GFP-LacI, the dot intensity is not correlated with the background intensity (Figure 1D), indicating that LacI binding reaches saturation. Assuming the saturation condition represents full occupancy of LacO, there should be 512 bound GFP-LacI (each LacO binds a LacI dimer), we can calculate the copy number of the free GFP-LacI by comparing the background intensities with the saturated dot intensity (‘Materials and Methods’ section). In the low-background strain containing 1× GFP-LacI-degron, the GFP-LacI concentration was estimated to be ∼100 nM. Despite the fact that this concentration is much higher than the reported Kdin vitro, we could barely see chromosome dots in this strain (Figure 1B). By fitting the dot intensity vs GFP-LacI concentration with a Hill function, the apparent Kd turns out to be ∼600 nM, four orders of magnitude higher than the Kdin vitro (Figure 1D).

We carried out a number of tests to validate the Kd measurement above. First, for five LacO arrays with different number of LacO repeats, we expressed high level of GFP-LacI and quantified the saturated dot intensities. As expected, the intensities scale linearly with the LacO repeat number (Pearson correlation: R = 0.99) (Supplementary Figure S3A). Second, when we expressed GFP-LacI at different levels and measured the apparent Kd using two of these arrays, we found that the apparent Kd is not affected by the array size (Supplementary Figure S3B and C), showing that our measurements are internally consistent. To get an independent assessment of the accuracy of the measured concentration, we imaged the GFP-labeled endogenous Ser2 and Rnr4 proteins together with HIS3pr-GFP-LacI, and compared their fluorescence levels and concentrations (‘Materials and Methods’ section). Despite the fact that the published Ser2 and Rnr4 concentrations were measured with methodologies different from ours, the fluorescence intensities from the three strains are largely proportional to the concentration of the three proteins (Supplementary Figure S3D). These data verify our method of quantifying concentration and apparent Kd.

To understand which factors compromised LacI binding in vivo, we next genetically modified the structure and the stability of the fusion protein, chromatin accessibility, and the intrinsic binding affinity of LacO to quantitatively examine how they affect the apparent Kd of LacI.

The apparent Kd of LacI is reduced by fusing GFP to the C-terminus of LacI

The crystal structure of the LacI protein shows that its DNA binding domain is located at the N-terminus (27). In the original construct, the GFP was fused to the N-terminus with a largely unstructured linker region, indicating that GFP in some orientations may sterically block LacI binding (Supplementary Figure S4). We therefore moved the GFP to the C-terminus and imaged the chromosome dots on the same 256× LacO array (Figure 2A). At high concentration, GFP-LacI and LacI-GFP show essentially the same dot intensity, consistent with saturated binding (Figure 2B and C). However, at low concentration, LacI-GFP shows higher binding with lower free protein background (Figure 2C and D). These results clearly show that the binding of LacI is enhanced by labeling GFP at its C-terminus.

Figure 2.

Figure 2.

The apparent Kd of LacI is enhanced by fusing GFP to the C-terminal of LacI. (A) Construct of the fusion proteins with GFP linked to either the N- or C-terminal of LacI. (B) Fluorescent images of the FROS with different fusion proteins in A. (C and D) Normalized dot intensity (C) and free protein concentration (D) in the four strains containing GFP fused to the N- or C-terminal of LacI with (left panel) or without (right panel) the degron sequence (N = 137, 29, 71 and 48 for the four strains, respectively). The dot intensity increases by 5.3-fold (P-value < 0.0001) after moving the GFP to the C terminus, while the background concentration is reduced by 2.5-fold (P-value < 0.0001).

The apparent Kd of LacI is reduced by using stabilized version of LacI

Stability of a protein may also affect its binding kinetics. Theoretical analysis of a simple binding reaction with or without active degradation shows that a high degradation rate can increase the apparent Kd (Figure 3A). Conjugating the CLN2 degron to a heterologous protein results in a protein half-life of 12 min (kdeg = 9.6 × 10−4s−1) (28). At physiological salt concentration, the association rate of dimeric LacI is diffusion-limited (107–108 M−1s−1) (29,30), and the koff should be in the range of 10−4 – 10−3 s−1. Assuming the degradation rate is comparable in the nucleus and in the cytoplasm, these values suggest that the high degradation rate can increase the apparent Kd by up to 10-fold.

Figure 3.

Figure 3.

The apparent Kd of LacI is enhanced by using a stabilized version of LacI. (A) Analysis of a simple binding reaction with or without active degradation shows that a high degradation rate can significantly increase the effective Kd. kon: binding rate, koff: dissociation rate, kdeg: degradation rate. (B) Generating low concentration of stable GFP-LacI using a promoter of low activity (REV1pr). (C) Fluorescent images of the FROS on the original 256× LacO array with the constructs in B. (D) Quantification of the normalized dot intensity (left panel) and the free protein concentration (right panel) in these two strains (N = 137 and 57, respectively). The dot intensity increases by 15-fold (P-value < 0.0001) with the stable GFP-LacI, while the background concentration is about the same.

To eliminate active degradation while maintaining low steady-state protein level, we used a weaker constitutive promoter, REV1 promoter, to drive the expression of the stable GFP-LacI (Figure 3B) (31). In comparison to GFP-LacI-degron driven by the HIS3 promoter, this construct increases the GFP dot intensity by 15-fold while maintaining similar background level (Figure 3C and D). Such increase in the LacI binding is consistent with our expectation above.

The apparent Kd of LacI is mildly reduced by increasing the accessibility of chromosome

Chromosome accessibility can be another crucial factor that affects TF binding. Many TFs have difficulty accessing their binding sites embedded in the nucleosomes (32–34). The Kd of two TFs, LexA and Gal4, is increased by four to five of magnitude in vitro when the same binding motif is covered by a nucleosome (9). We therefore investigated if nucleosomes have the same effect on LacI in vivo by perturbing the nucleosome occupancy on LacO and probing its effect on LacI binding. Abf1 is a strong nucleosome-displacing factor in yeast that antagonizes nucleosome formation near its binding site (35–37). On a new LacO template, we added an Abf1 binding motif to each side of two LacO sites (Figure 4A). The distance between the two Abf1 motifs is 107 bp, shorter than the length of a single nucleosome, and therefore, we expect the LacO sites in-between to be nucleosome-free. For comparison, we generated another template containing 2-bp mutation in the Abf1 motifs (Figure 4A).

We measured the nucleosome occupancy on the two templates above in strains lacking the LacI protein. The assay commonly used for nucleosome mapping, micrococcal nuclease digestion followed by stacking PCR (38,39), is not suitable here because of the ambiguity introduced by the repetitive sequences. Instead, we used a methylation assay where nucleosome or other chromosome-bound proteins are detected based on protection against DNA methylation (‘Materials and Methods’ section) (24). LacO sites with the consensus Abf1 motifs are more susceptible to methylation (less protected) than the ones with the mutated Abf1 motifs (Figure 4B). The level of protection in these two cases, 0.2 and 0.6, are consistent with regions known to be exposed or embedded under the nucleosome (Supplementary Figure S5). To confirm the difference in methylation susceptibility is indeed due to nucleosomes, we also conducted histone H3 ChIP over the LacO sites with consensus or mutated Abf1 motifs. Higher histone enrichment was found in the latter case (Figure 4C). Together, these data suggest that we have generated two templates with different nucleosome coverage over the same LacO sites.

We then introduced the LacI protein into these strains and performed ChIP assay to measure the level of LacI binding on the Abf1/Abf1*-LacO templates (‘Materials and Methods’ section). There is a mild but significant improvement of LacI binding on the template containing the consensus Abf1 sites (P-value = 0.021) (Figure 4D). To confirm the ChIP results, we amplified the templates into an array with 64× LacO sites and performed imaging to quantify the bound GFP-LacI-degron. Despite the fact that the new template contains four times less LacO sites than the original 256× array, we detected comparable or even brighter chromosome dots with GFP-LacI-degron driven by the HIS3 promoter (Figure 4E). The dot intensity versus free protein curve yields an apparent Kd of 372 ± 33 nM, smaller than that on the original LacO template (apparent Kd = 592 ± 16 nM; P-value < 10−4) (Figure 4F). We did the same set of experiments on templates with different linker sequences between LacO and Abf1 sites, and reached the same conclusions (Supplementary Figure S6). These results suggest that nucleosomes have a mild inhibitory effect on LacI binding in the cells.

The apparent Kd of LacI is reduced by using symmetric LacO sites

Previous studies have identified a symmetric Lac operator (Osys) that binds LacI with higher affinity than the wild-type version in vitro (Kd is reduced by 5-fold) (Figure 5A) (40–42). To examine if LacI also shows stronger binding to Osys in vivo, we swapped the two wild-type LacO sites flanked by Abf1 with Osys (Figure 5B) and evaluated LacI binding with ChIP. We observed a 6-fold increase in the LacI enrichment on Osys than the wild-type LacO (Figure 5C), which is comparable to what was measured in vitro. We also conducted the same ChIP experiments on templates containing different linker sequences, and LacI consistently binds Osys better than the wild-type LacO (Supplementary Figure S7). We conclude that Osys leads to robust increase in LacI binding in vivo.

Figure 5.

Figure 5.

The apparent Kd of LacI is enhanced by using a symmetric Lac operator sequence. (A) DNA sequences of the wildtype Lac operator (wt LacO) and the symmetric Lac operator (Osys). (B) Templates used for the LacI binding measurements. The templates contain two wt or symmetric LacO. (C) LacI enrichment on the two templates measured by ChIP assay. Osys increases the LacI binding by 6-fold. Error bars shown here represent SEM of three repeats.

A combined system reduced the apparent Kd of LacI to <10 nM

We next combined all the genetic manipulations above to generate a new FROS with much higher occupancy of LacI at low concentration. This system includes stable LacI-GFP driven by the REV1 promoter, as well as a LacO template containing 32× Osys flanked by Abf1 (Figure 6A). We again generated three strains with different LacI-GFP level (low and medium: 1× and 3× REV1pr-LacI-GFP, and high: HIS3pr-LacI-GFP) and measured the bound vs free GFP intensity in single cells. LacI shows similar binding in all three strains regardless of the LacI-GFP concentrations (Figure 6B and C). We further reduced the LacI-GFP level by 30% by putting a single copy of REV1pr-LacI-GFP into a diploid yeast strain. Even in this condition, the binding of LacI is near-saturated (Supplementary Figure S8A). Given that we did not see a significant drop in the dot intensity with LacI-GFP concentration as low as 30 nM, the apparent Kd should be <10 nM (Figure 6C and Supplementary Figure S8B), more than 60-fold lower than what we started with in Figure 1. The improved signal-to-noise ratio here allows chromosome dots to be visualized with lower excitation (and thus, lower photo-damage to the cells) and/or less LacO repeats (less perturbation to the genome). The way we improved FROS will provide guidance for future design of more specific chromosomal recruitment through high-affinity TFs.

DISCUSSION

The apparent Kd of TFs in vivo is a parameter of high functional relevance. The DNA binding domain of a TF is usually conjugated to a functional domain (e.g. activation domain), and how specific such functional domain acts on the target sites vs the rest of the genome is determined by the ratio between the bound and free TFs. TFs with low apparent Kd can occupy target sites at low concentration, leading to high specificity. The LacO/LacI system is a popular choice for artificial recruitment in eukaryotic species because of its high affinity in vitro. However, to our knowledge, the apparent Kd of LacI in vivo has not been carefully evaluated in these species. Based on our results, we suspect that in some studies, the LacI fusion proteins were expressed to high levels that masked the weak binding of LacI, but such solution may also increase the background noise and reduce the sensitivity of the measurement.

In a few past studies, in vivo binding of TFs were measured using two types of methods. The first one is an ‘equilibrium’ method, which measures the occupancy of the binding site at different TF concentrations using footprinting assay (43,44). To get robust signal in this assay, a large amount of TF binding sites needs to be introduced into the cells on plasmids. The chromosome configurations on these DNAs may be distinct from the native genome and thus affect the TF binding. Alternatively, it can be measured by the ‘kinetic’ method, where sparsely labeled TFs are tracked in real time to reveal kon and koff (45–47). This method has been used to measure LacI dynamics in live E. coli cells (47–50). Besides being technically challenging, this method usually cannot pinpoint a single genomic locus, and the Kd measured this way often reflects the binding averaged over different genomic sites. In our study, taking advantage of the FROS setup, we developed a new strategy to measure the apparent Kd by simultaneously visualizing the proteins that are either bound on specific chromosomal sites or free in the nucleus. This method in principle can be applied to other TFs.

Starting with a commonly used FROS construct, we found that LacI binding to LacO is very weak in the yeast nucleus (apparent Kd ∼0.6 μM). We need to point out that there are a few factors that may confound this estimation. First, we assumed that LacO sites are 100% occupied by LacI-GFP upon saturation. If only a fraction of LacO is occupied instead, our assumption above would lead to an overestimation of the background LacI-GFP concentration, and therefore an overestimation of apparent Kd. Given that a LacO array as short as 32× can yield bright chromosome dots (Figure 6), the bound fraction is unlikely to be much smaller than 100%. Second, a fraction of GFP molecules may be fluorescently invisible because they are immature or photo-bleached. These proteins should be present in the bound and free population with the same proportion, and therefore will not affect the estimated free protein concentration and Kd. Third, the expression of GFP-LacI may generate free GFPs unlinked to LacI, which will cause an overestimation of Kd. This is an unlikely scenario as LacI is translated efficiently in yeast. Indeed, a number of tests have validated our quantification of LacI concentration and apparent Kd (Supplementary Figure S3).

We demonstrated that multiple genetic features can be manipulated to decrease the apparent Kd, including fusion protein structure, stability, chromatin accessibility and binding site sequence. Out of these measures, only the last one directly impacts the non-covalent interactions at the protein–DNA interface. The other ones affect the accessibility of the DNA (due to nucleosome coverage) or the protein (due to degradation or tag interference). Some of these effects, e.g. those caused by protein stability and Osys, are consistent with expectations. However, we were surprised to observe that nucleosome only has a mild inhibitory effect on LacI binding. Nucleosomal DNA was shown to be highly inaccessible for most TFs except the ones with pioneer activities (51). The Kd of LexA and Gal4 on the nucleosomal site are five orders of magnitude higher than that on naked DNA in vitro (9). LacO is 20-bp in length, which covers two DNA helical turns and should always be partially occluded on the nucleosome surface regardless of its orientation. Also, given that it is a bacterial TF, LacI is unlikely to have ‘pioneer-factor’ properties that allow it to bind to nucleosome with high-affinity. A potential mechanism for LacI to get over the nucleosome barrier is by taking advantage of the nucleosome dynamics in vivo. Events such as DNA replication, histone turnover or nucleosome remodeling can transiently expose DNA, allowing LacI to bind; once it binds, LacI can dwell on the DNA for a long time and prevent the reassembly of nucleosomes (47,52). Consistent with this idea, another bacterial TF, TetR, depletes nucleosome in yeast over its otherwise embedded binding sites (35). Maybe this is the reason why eukaryotic TFs tend to contact less DNA bases and have lower binding energy: nucleosome presents a more formidable barrier for these TFs, so that their binding can be regulated by the chromosome accessibility. Besides nucleosome, other endogenous DNA-binding proteins can also associate with the LacO array non-specifically and interfere with LacI binding. Given the mild effect of nucleosome on LacI binding, we do not expect these highly dynamic and non-specific binding to dramatically increase the apparent Kd of LacI.

At last, the apparent Kdin vivo is likely to be affected by the presence of overwhelming amount of non-specific DNA (53,54). These DNA compete with LacO for LacI:

graphic file with name M2.gif

Here ‘gDNA’ represents genomic DNA, and Kd,s and Kd,n represent dissociation constant on specific and non-specific DNA:

graphic file with name M3.gif

It should be noted that the ‘free’ LacI-GFP we measured in this study includes the diffusive part ([LacI]) and the part that is transiently associated with non-specific DNA [LacI·gDNA].

graphic file with name M4.gif

Based on how we defined the apparent Kd, we can convert the Kd to Kd,s:

graphic file with name M5.gif

The ratio between Kd,s and Kd,n for LacI was shown to be 2.5 × 10−7 (53). Depending on how strongly nucleosomes inhibit non-specific LacI binding, 20–100% of the yeast genome can function as competitive DNA (nucleosome coverage of the yeast genome is ∼80%). If we assume this fraction to be 50%, [gDNA] should be about 3.3 mM, which leads to Inline graphic. This calculation shows that the overall apparent Kd has to be above 0.8 nM due to the presence of non-specific genomic DNA. Our final system with the apparent Kd <10 nM is approaching this theoretical limit. Interestingly, in E. coli cells expressing low-level of LacI dimer (three to five copies per cell), the ton and toff for an individual Osys were found to be 28 s and 9.3 min, respectively (48). Such binding (∼0.0036 s−1nM−1) and dissociation rate (0.0018 s−1) yield an apparent Kd of ∼0.5 nM, which is slightly lower than the apparent Kd we could achieve in yeast (between 0.8 and 10 nM). This is as expected due to more exposed LacO (not chromatinized) and less competitive DNA in the E. coli genome.

Supplementary Material

gkz698_Supplemental_Files

ACKNOWLEDGEMENTS

The authors are grateful to Dr Susan Gasser, Dr Pankaj Chaturvedi, Dr Andrew Belmont, Dr Huihun Jung, Dr Benjamin Allen, Dr Michael Kladde, Dr Lai Shi and Dr Yanming Wang for providing strains, plasmids and technical advice. We acknowledge all members in Bai lab for insightful comments on the manuscript. We also want to thank the members of the Center of Eukaryotic Gene Regulation at PSU for discussions and technical support.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

National Institutes of Health (NIH) [R01 GM118682]. Funding for open access charge: NIH [R01 GM118682].

Conflict of interest statement. None declared.

REFERENCES

  • 1. Pan Y., Tsai C.J., Ma B., Nussinov R.. Mechanisms of transcription factor selectivity. Trends Genet. 2010; 26:75–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Schleif R.F. Modulation of DNA binding by gene-specific transcription factors. Biochemistry. 2013; 52:6755–6765. [DOI] [PubMed] [Google Scholar]
  • 3. Slattery M., Zhou T., Yang L., Dantas Machado A.C., Gordan R., Rohs R.. Absence of a simple code: how transcription factors read the genome. Trends Biochem. Sci. 2014; 39:381–399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Barrera L.A., Vedenko A., Kurland J.V., Rogers J.M., Gisselbrecht S.S., Rossin E.J., Woodard J., Mariani L., Kock K.H., Inukai S. et al.. Survey of variation in human transcription factors reveals prevalent DNA binding changes. Science. 2016; 351:1450–1454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Belikov S., Berg O.G., Wrange O.. Quantification of transcription factor-DNA binding affinity in a living cell. Nucleic Acids Res. 2016; 44:3045–3058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Chakerian A.E., Matthews K.S.. Characterization of mutations in oligomerization domain of Lac repressor protein. J. Biol. Chem. 1991; 266:22206–22214. [PubMed] [Google Scholar]
  • 7. Belmont A.S., Straight A.F.. In vivo visualization of chromosomes using lac operator-repressor binding. Trends Cell Biol. 1998; 8:121–124. [DOI] [PubMed] [Google Scholar]
  • 8. Belmont A.S., Li G., Sudlow G., Robinett C.. Visualization of large-scale chromatin structure and dynamics using the lac operator/lac repressor reporter system. Methods Cell Biol. 1999; 58:203–222. [DOI] [PubMed] [Google Scholar]
  • 9. Luo Y., North J.A., Rose S.D., Poirier M.G.. Nucleosomes accelerate transcription factor dissociation. Nucleic Acids Res. 2014; 42:3017–3027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Robinett C.C., Straight A., Li G., Willhelm C., Sudlow G., Murray A., Belmont A.S.. In vivo localization of DNA sequences and visualization of large-scale chromatin organization using lac operator/repressor recognition. J. Cell Biol. 1996; 135:1685–1700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Rohner S., Gasser S.M., Meister P.. Modules for cloning-free chromatin tagging in Saccharomyces cerevisae. Yeast. 2008; 25:235–239. [DOI] [PubMed] [Google Scholar]
  • 12. Quilis I., Igual J.C.. A comparative study of the degradation of yeast cyclins Cln1 and Cln2. FEBS Open Bio. 2017; 7:74–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Du M., Zhang Q., Bai L.. Three distinct mechanisms of long-distance modulation of gene expression in yeast. PLos Genet. 2017; 13:e1006736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Yan C., Wu S., Pocetti C., Bai L.. Regulation of cell-to-cell variability in divergent gene expression. Nat. Commun. 2016; 7:11099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. de Godoy L.M., Olsen J.V., Cox J., Nielsen M.L., Hubner N.C., Frohlich F., Walther T.C., Mann M.. Comprehensive mass-spectrometry-based proteome quantification of haploid versus diploid yeast. Nature. 2008; 455:1251–1254. [DOI] [PubMed] [Google Scholar]
  • 16. Thakur S.S., Geiger T., Chatterjee B., Bandilla P., Frohlich F., Cox J., Mann M.. Deep and highly sensitive proteome coverage by LC-MS/MS without prefractionation. Mol. Cell Proteomics. 2011; 10:M110 003699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Tkach J.M., Yimit A., Lee A.Y., Riffle M., Costanzo M., Jaschob D., Hendry J.A., Ou J., Moffat J., Boone C. et al.. Dissecting DNA damage response pathways by analysing protein localization and abundance changes during DNA replication stress. Nat. Cell Biol. 2012; 14:966–976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Peng M., Taouatas N., Cappadona S., van Breukelen B., Mohammed S., Scholten A., Heck A.J.. Protease bias in absolute protein quantitation. Nat. Methods. 2012; 9:524–525. [DOI] [PubMed] [Google Scholar]
  • 19. Breker M., Gymrek M., Schuldiner M.. A novel single-cell screening platform reveals proteome plasticity during yeast stress responses. J. Cell Biol. 2013; 200:839–850. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Denervaud N., Becker J., Delgado-Gonzalo R., Damay P., Rajkumar A.S., Unser M., Shore D., Naef F., Maerkl S.J.. A chemostat array enables the spatio-temporal analysis of the yeast proteome. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:15842–15847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Mazumder A., Tummler K., Bathe M., Samson L.D.. Single-cell analysis of ribonucleotide reductase transcriptional and translational response to DNA damage. Mol. Cell. Biol. 2013; 33:635–642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Chong Y.T., Koh J.L., Friesen H., Duffy S.K., Cox M.J., Moses A., Moffat J., Boone C., Andrews B.J.. Yeast proteome dynamics from single cell imaging and automated analysis. Cell. 2015; 161:1413–1424. [DOI] [PubMed] [Google Scholar]
  • 23. Jorgensen P., Edgington N.P., Schneider B.L., Rupes I., Tyers M., Futcher B.. The size of the nucleus increases as yeast cells grow. Mol. Biol. Cell. 2007; 18:3523–3532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Jessen W.J., Dhasarathy A., Hoose S.A., Carvin C.D., Risinger A.L., Kladde M.P.. Mapping chromatin structure in vivo using DNA methyltransferases. Methods. 2004; 33:68–80. [DOI] [PubMed] [Google Scholar]
  • 25. Lanker S., Valdivieso M.H., Wittenberg C.. Rapid degradation of the G1 cyclin Cln2 induced by CDK-dependent phosphorylation. Science. 1996; 271:1597–1601. [DOI] [PubMed] [Google Scholar]
  • 26. Houser J.R., Ford E., Chatterjea S.M., Maleri S., Elston T.C., Errede B.. An improved short-lived fluorescent protein transcriptional reporter for Saccharomyces cerevisiae. Yeast. 2012; 29:519–530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Lewis M., Chang G., Horton N.C., Kercher M.A., Pace H.C., Schumacher M.A., Brennan R.G., Lu P.. Crystal structure of the lactose operon repressor and its complexes with DNA and inducer. Science. 1996; 271:1247–1254. [DOI] [PubMed] [Google Scholar]
  • 28. Salama S.R., Hendricks K.B., Thorner J.. G1 cyclin degradation: the PEST motif of yeast Cln2 is necessary, but not sufficient, for rapid protein turnover. Mol. Cell Biol. 1994; 14:7953–7966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Hsieh M., Brenowitz M.. Comparison of the DNA association kinetics of the Lac repressor tetramer, its dimeric mutant LacIadi, and the native dimeric Gal repressor. J. Biol. Chem. 1997; 272:22092–22096. [DOI] [PubMed] [Google Scholar]
  • 30. Riggs A.D., Bourgeois S., Cohn M.. The lac repressor-operator interaction. 3. Kinetic studies. J. Mol. Biol. 1970; 53:401–417. [DOI] [PubMed] [Google Scholar]
  • 31. Lee M.E., Aswani A., Han A.S., Tomlin C.J., Dueber J.E.. Expression-level optimization of a multi-enzyme pathway in the absence of a high-throughput assay. Nucleic Acids Res. 2013; 41:10668–10678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Adams C.C., Workman J.L.. Binding of disparate transcriptional activators to nucleosomal DNA is inherently cooperative. Mol. Cell Biol. 1995; 15:1405–1421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Bai L., Charvin G., Siggia E.D., Cross F.R.. Nucleosome-depleted regions in cell-cycle-regulated promoters ensure reliable gene expression in every cell cycle. Dev. Cell. 2010; 18:544–555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Takahata S., Yu Y., Stillman D.J.. Repressive chromatin affects factor binding at yeast HO (homothallic switching) promoter. J. Biol. Chem. 2011; 286:34809–34819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Yan C., Chen H., Bai L.. Systematic study of Nucleosome-Displacing factors in budding yeast. Mol. Cell. 2018; 71:294–305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Venditti P., Costanzo G., Negri R., Camilloni G.. ABFI contributes to the chromatin organization of Saccharomyces cerevisiae ARS1 B-domain. Biochim. Biophys. Acta. 1994; 1219:677–689. [DOI] [PubMed] [Google Scholar]
  • 37. Ganapathi M., Palumbo M.J., Ansari S.A., He Q., Tsui K., Nislow C., Morse R.H.. Extensive role of the general regulatory factors, Abf1 and Rap1, in determining genome-wide chromatin structure in budding yeast. Nucleic Acids Res. 2011; 39:2032–2044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Sekinger E.A., Moqtaderi Z., Struhl K.. Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. Mol. Cell. 2005; 18:735–748. [DOI] [PubMed] [Google Scholar]
  • 39. Kent N.A., Mellor J.. Chromatin structure snap-shots: rapid nuclease digestion of chromatin in yeast. Nucleic Acids Res. 1995; 23:3786–3787. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Falcon C.M., Matthews K.S.. Operator DNA sequence variation enhances high affinity binding by hinge helix mutants of lactose repressor protein. Biochemistry. 2000; 39:11074–11083. [DOI] [PubMed] [Google Scholar]
  • 41. Sadler J.R., Sasmor H., Betz J.L.. A perfectly symmetric lac operator binds the lac repressor very tightly. Proc. Natl. Acad. Sci. U.S.A. 1983; 80:6785–6789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Simons A., Tils D., von Wilcken-Bergmann B., Muller-Hill B.. Possible ideal lac operator: Escherichia coli lac operator-like sequences from eukaryotic genomes lack the central G X C pair. Proc. Natl. Acad. Sci. U.S.A. 1984; 81:1624–1628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Connaghan-Jones K.D., Moody A.D., Bain D.L.. Quantitative DNase footprint titration: a tool for analyzing the energetics of protein-DNA interactions. Nat. Protoc. 2008; 3:900–914. [DOI] [PubMed] [Google Scholar]
  • 44. Yang S.W., Nash H.A.. Comparison of protein binding to DNA in vivo and in vitro: defining an effective intracellular target. EMBO J. 1995; 14:6292–6300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Liu H., Dong P., Ioannou M.S., Li L., Shea J., Pasolli H.A., Grimm J.B., Rivlin P.K., Lavis L.D., Koyama M. et al.. Visualizing long-term single-molecule dynamics in vivo by stochastic protein labeling. Proc. Natl. Acad. Sci. U.S.A. 2018; 115:343–348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Chen J., Zhang Z., Li L., Chen B.C., Revyakin A., Hajj B., Legant W., Dahan M., Lionnet T., Betzig E. et al.. Single-molecule dynamics of enhanceosome assembly in embryonic stem cells. Cell. 2014; 156:1274–1285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Elf J., Li G.W., Xie X.S.. Probing transcription factor dynamics at the single-molecule level in a living cell. Science. 2007; 316:1191–1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Hammar P., Wallden M., Fange D., Persson F., Baltekin O., Ullman G., Leroy P., Elf J.. Direct measurement of transcription factor dissociation excludes a simple operator occupancy model for gene regulation. Nat. Genet. 2014; 46:405–408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Hammar P., Leroy P., Mahmutovic A., Marklund E.G., Berg O.G., Elf J.. The lac repressor displays facilitated diffusion in living cells. Science. 2012; 336:1595–1598. [DOI] [PubMed] [Google Scholar]
  • 50. Garza de Leon F., Sellars L., Stracy M., Busby S.J.W., Kapanidis A.N.. Tracking low-copy transcription factors in living bacteria: the case of the lac repressor. Biophys. J. 2017; 112:1316–1327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Soufi A., Garcia M.F., Jaroszewicz A., Osman N., Pellegrini M., Zaret K.S.. Pioneer transcription factors target partial DNA motifs on nucleosomes to initiate reprogramming. Cell. 2015; 161:555–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Stracy M., Uphoff S., Garza de Leon F., Kapanidis A.N.. In vivo single-molecule imaging of bacterial DNA replication, transcription, and repair. FEBS Lett. 2014; 588:3585–3594. [DOI] [PubMed] [Google Scholar]
  • 53. Lin S., Riggs A.D.. The general affinity of lac repressor for E. coli DNA: implications for gene regulation in procaryotes and eucaryotes. Cell. 1975; 4:107–111. [DOI] [PubMed] [Google Scholar]
  • 54. von Hippel P.H., Revzin A., Gross C.A., Wang A.C.. Non-specific DNA binding of genome regulating proteins as a biological control mechanism: I. The lac operon: equilibrium aspects. Proc. Natl. Acad. Sci. U.S.A. 1974; 71:4808–4812. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkz698_Supplemental_Files

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES