Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jul 19.
Published in final edited form as: Phys Rev Lett. 2013 Jan 2;110(1):018101. doi: 10.1103/PhysRevLett.110.018101

Theoretical and Experimental Dissection of DNA Loop-Mediated Repression

James Q Boedicker 1, Hernan G Garcia 2, Rob Phillips 1,*
PMCID: PMC3716456  NIHMSID: NIHMS492741  PMID: 23383841

Abstract

Transcriptional networks across all domains of life feature a wide range of regulatory architectures. Theoretical models now make clear predictions about how key parameters describing those architectures modulate gene expression, and the ability to construct genetic circuits with tunable parameters enables precise tests of such models. We dissect gene regulation through DNA looping by tuning network parameters such as repressor copy number, DNA binding strengths, and loop length in both thermodynamic models and experiments. Our results help clarify the short-length mechanical properties of DNA.


The processes of the central dogma serve as the macromolecular pipeline linking the critical genetic information hidden within DNA sequences to the active proteins that drive much of cellular life. Vast arrays of interlinked regulatory networks impose when and where different batteries of genes are turned on. But how is the output of these networks controlled by the parameters that govern a given regulatory architecture and how effective are such parameters as a substrate for evolutionary change?

To explore these questions, we require quantitative knowledge of the transcriptional decisions made by the individual elements of these networks. Here we make a systematic theoretical and experimental study of key regulatory parameters in a common regulatory motif containing a single transcription factor. Systematic studies like this serve in several very useful capacities for understanding the evolution and engineering of genetic networks. First, from the perspective of the evolution of transcriptional networks, it is critical to know the systematic dependence of the expression on all of the parameters that can be tuned over evolutionary time [1,2], several of which are indicated in Fig. 1(a). Second, an objective of synthetic biology is to use various “parts” from the regulatory palette to assemble novel genetic networks. Analogous to the input-output functions of electronic circuits, work like ours serves as the development of understanding the “I–V curves” for these kinds of biological networks [35]. In this Letter, we explore one of the key conceptual building blocks of regulatory networks featuring “action at a distance”, in which DNA mechanically deforms to facilitate the activity of transcription factors bound to nonadjacent sites of a promoter region [6,7].

FIG. 1.

FIG. 1

(color). Loop-mediated gene regulation is tuned by parameters incorporated into a thermodynamic model. (a) Lac repressor reduces gene expression by binding its operators, including binding to both operators and looping the intervening DNA. (b) A thermodynamic model of gene regulation contains the states of the two operator constructs, their associated weights, and the rates of transcription from each state. (c) The model predicts the influence of each parameter on gene expression, as captured in the experimentally measurable quantity repression defined in Eq. (1). (d) 3D plot of repression as a function of number of repressors per cell and the main operator binding energy for ΔFloop(L)=9kBT and r2 = r3. YFP, yellow fluorescent protein; RNAP, RNA polymerase.

To compute the input-output relation for such motifs we implement thermodynamic models. These models are widely used as a quantitative framework for transcriptional regulation [810], including in the analysis of gene regulation with DNA looping [1115] (see Fig. S1 in the Supplemental Material [16] for comparisons to previous studies). Thermodynamic models are based upon the assumption that the probability of finding the system in a given regulatory state is a function of the free energy associated with each state of the system, which is basically in quasiequilibrium. Our objective is to test the limits of such models. Indeed, recent reports indicate that some regulatory processes require description beyond the realm of thermodynamic models [1720], and noise in gene expression plays a critical role in some regulatory contexts [21]. We see these thermodynamic models as a starting point for quantitative descriptions of regulatory input-output functions. Such models predict experimental quantities such as the repression,

repression=gene expression(R=0)gene expression(R0) (1)

where R is the intracellular number of repressor molecules. In a model of repression by DNA looping, we enumerate the different states of the system and assign the corresponding weights and relative rates of transcription to each state, as shown in Fig. 1(b). The level of expression is given as

gene expression=(r2PNNSeβΔεpd+r3PNNS2RNNSeβ(Δεpd+Δεrad))/Z, (2)

in which r2 and r3 are the rate constants for transcription from states 2 and 3 (for now r2 = r3, we address this assumption later in the text), P is the number of RNA polymerase molecules per cell, Δεrad is the binding energy corresponding to the auxiliary operator, Δεpd is the binding energy of RNA polymerase to the promoter, NNS is the number of nonspecific binding sites on the genome (see Supplemental Material [16] for discussion), β is the inverse of the Boltzmann constant times the temperature, and Z is the partition function which is the sum of all the weights listed in Fig. 1(b). After safely making the approximation that 1+PNNSeβΔεpd1, the weak promoter approximation (see Supplemental Material [16] for details), this definition leads to the equation shown in Fig. 1(c), where ΔFloop(L) is the free energy needed to form a loop of length L, and Δεrmd is the binding energy of repressor to the main operator. The only free parameters remaining in the equation in Fig. 1(c) are those related to the binding energy of the Lac repressor to DNA, the DNA looping free energy and the number of repressors inside the cell. Previous experiments performed on simpler architectures have given us quantitative knowledge of the binding energies and the number of repressors [22]. The only real unknown parameter in the equation shown in Fig. 1(c) is the looping free energy.

To explore the consequences of the repression equation as a quantitative and predictive description of loop-mediated repression, we systematically measured the functional form for repression as a function of the key tunable parameters in that equation. The repression equation predicts how gene expression will scale with the number of repressors, the binding energies of the operators, and the looping free energy, as shown in Fig. 1(d). The values of the parameters used in this and subsequent calculations can be found in Table S5 of the Supplemental Material [16]. The approach of tuning these parameters in both predictive models and quantitative experiments has been implemented previously to establish the input-output function of repression from a single operator [22] and to examine the wild-type lac operon [23]. Such models have also been applied to previous measurements of Lac repressor looping [12,13,24]. Here we use the model to specify a series of experimental measurements that more rigorously explore the parameter space that influences loop formation.

In the spirit of conducting precision measurements to test the quantitative predictions of physical models, we experimentally tuned the theoretical “knobs” of the system by constructing an extensive library of E. coli strains containing fluorescent reporter constructs based on the lac operon. Repression was measured by comparing the ratio of gene expression in cells with and without the repressor, as in Eq. (1).

Repression in looping constructs has a nontrivial dependence on the number of repressors. However, such tests are not commonplace due to the difficulty of creating bacterial strains with known absolute numbers of repressors. We used a set of strains which varied the number of repressors per cell between 10 and 1000 [22,25] to explore the dependence of repression on the number of repressors as shown in Fig. 2. First, we titrated the number of repressors for specific looping constructs, with the data from one such construct shown in Fig. 2(a), in order to test that the functional form predicted by the equation in Fig. 1(c) held. A complementary way of probing the dependence on repressor number is shown in Fig. 2(b) where we examined repression over a full helical period of double stranded DNA for a given number of repressors per cell and used this information to predict the outcome of an experiment where the same DNA constructs are measured in the presence of a different number of repressors.

FIG. 2.

FIG. 2

(color). Titrating the number of repressors per cell resulted in repression levels similar to predicted values. (a) To predict how repression scales with number of repressors, we first measured repression for a strain with the wild type number of repressors per cell (11 ± 2) and used the equation from Fig. 1(c) to calculate the looping energy. The prediction, as depicted by the dashed black line with the shaded region defining the 95% confidence interval, was compared to other experimental measurements. Global fits of all data points in the figure to the main operator binding energy and the looping energy are shown with the blue solid line and green dotted line respectively. (b) Repression as a function of loop length for two different values of the repressor number per cell. The black dashed line and shaded region show the parameter-free prediction for 130 repressors per cell using the best fit main operator binding energy calculated in (a). The blue solid line shows the global fit to the main operator binding energy for the 130 repressors per cell data. Error bars are standard error.

As seen in Fig. 2, the relation between the predicted and measured repression is in reasonable agreement. The predicted repression levels, as shown by the black dashed line and shaded regions in Figs. 2(a) and 2(b), were calculated using the looping energy extracted from the data at 11 repressors per cell. Some experimental measurements differed from predictions. In order to better compare the predictions with the experimental measurements, we performed global fits to all the experimental measurements to look for systematic disagreements between the data and parameter values. The best fit main operator binding energies for these data sets, shown by the blue solid lines, were −14:3kBT for Fig. 2(a) and −15:1kBT for Fig. 2(b). Both values indicate stronger binding than the previously reported value of −13:9 ± 0:2kBT [22], yet are consistent with other fits to similar data as shown in Figs. S2 and S3 and Table S6 of the Supplemental Material [16]. This discrepancy suggests that this disagreement between model predictions and experimental measurements could be largely due to uncertainty in model parameters, especially the main operator binding energy which strongly influences repression as shown in Fig. S4 in the Supplemental Material [16]. However, deviations from model predictions such as at 100.5 and 101.5 bp in Fig. 2(b) suggest a possible systematic deviation which may warrant further investigation.

A second way of systematically tuning the parameters governing repression is by tuning the strength of the repressor binding to its operators. Figure 3(a) shows the result of taking looping constructs with different loop lengths and measuring the resulting repression for both a weak and stronger operator. Here too, the repression equation makes specific predictions about repression when the operator strengths are varied and these predictions are largely borne out. Another way to view the dependence on operator strength is by noting that the looping free energy that emerges from our repression measurements should be indifferent to which operators are present. Figure 3(b) shows that the looping free energies that emerge from different operator choices are the same to within error.

FIG. 3.

FIG. 3

(color). The looping energy is independent of operator binding energies. (a) Repression measurements for constructs in which the main operator was O2 and the auxiliary operator was Oid (black circles) were used to predict how repression scales with the main operator binding energy. The family of curves illustrates how repression responds to 0:5kBT shifts in the binding energy of the main operator. The red line shows the prediction using the previously reported binding energy to operator O1 [22], in close agreement with measurements of Oid-O1 constructs (blue diamonds). (b) The looping energies extracted from both operator combinations are within error. Error bars represent standard error.

Next we closely examine the variation of looping energy with loop length, and address the anomalously high repression that has been observed for short loops [7,26,27]. Previous measurements of transcription factor-mediated loop formation in vivo suggested that some of the shortest DNA loops are energetically favorable in a way that appeared inconsistent with our in vitro understanding of DNA mechanics from the wormlike chain model [28]. Theoretical works have shown that the flexibility and geometry of transcription factors can help accommodate the formation of short DNA loops [29,30]. However, the precise details of the value and phasing of the predicted looping free energies can vary dramatically with the assumptions made about the mechanical properties and geometry of both the DNA and the protein [31], making it difficult to draw unequivocal conclusions about the flexibility of DNA at short distances. Additionally, a recent report suggests that a looping independent role for the upstream operator in gene regulation skews our interpretation of loop formation at short distances [27]. Adhya et al. first demonstrated that a Gal or Lac repressor in the upstream position interacts with RNA polymerase on the promoter [3234], and this effect has been characterized using thermodynamic models in the context of an upstream operator binding Lac repressor [18]. Here we quantify this loop independent contribution to repression and examine how accounting for direct upstream repression corrects looping energies at short distances.

We quantified the amount of loop independent repression from the upstream operator by measuring gene expression in constructs lacking a main operator, as shown in Fig. 4(a). So far we have assumed that a Lac repressor bound to the upstream operator does not interact with the transcriptional machinery by setting r2 = r3. Now we calculate the relative rates of transcription from states 2 and 3 (see Fig. S5 and Supplemental Material [16] for derivation). As shown in Fig. 4(b), the ratio of transcription rates from states 2 and 3 varies with distance between 0.05 and 1.4, greatly reducing transcription at some lengths and acting as an activator at others, consistent with previous observations [18,32,33]. The upstream repressor interferes with promoter escape, and this interaction has been shown to be dependent upon the position and sequence of the upstream operator [18]. However, we still lack a quantitative understanding of how the sequence of the operator region influences this interaction and dictates the functional form of the curve in Fig. 4(b). By taking this effect into account, the corrected looping energy can be calculated, as shown in Fig. 4(c). At some lengths the looping energy changes several kBT, resulting in large changes in promoter occupancy as shown in Fig. S6 in the Supplemental Material [16]. These results indicate the importance of the upstream operator in gene regulation at short distances. Furthermore, this correction reveals that short loops are not of a lower energy than longer loops, and restores the approximately 12 bp periodicity of repression with loop length at short distances as shown in Fig. S5(D) in the Supplemental Material [16]. Although the scaling of the local minima of looping energies with loop length is not a clear function of operator distance over the loop lengths measured, the corrected results shown in Fig. 4(c) are more consistent with the wormlike chain model in that the local minima of the looping energies do not decrease with length for loops shorter than the ≈150 bp persistence length of DNA [35].

FIG. 4.

FIG. 4

(color). Correcting for direct upstream repression when calculating looping energies. (a) Regulatory interactions occur between RNA polymerase and the Lac repressor bound to a nearby upstream operator, skewing in vivo measurements of looping based on gene expression. (b) To account for direct auxiliary repression in looping constructs, we measure r3=r2, at each operator distance. (c) Correcting the two operator model from Fig. 1(b) to account for direct auxiliary repression reveals increased looping energies for the shortest loops and restores a uniform periodicity of repression with length at short operator distances. Error bars represent standard error.

Transcription is a critical regulatory decision point in the central dogma. Here we provide a quantitative characterization of the way that critical regulatory parameters modulate the output of transcriptional circuits involving DNA looping. Our thermodynamic model produces falsifiable predictions that are confirmed by our measurements. Additionally, by extending the model to allow for interaction between RNA polymerase and upstream repressors, we show the dependence of repression on loop length is complicated at short distances by the presence of nonlooping induced phasing. Accounting for this interaction resolves a long-standing puzzle about the possible short length flexibility of DNA in vivo.

Supplementary Material

SI

Acknowledgments

This work was supported by the National Institutes of Health Pioneer Award No. DP1 OD000217 (H. G. G., R. P.) and Grants No. R01 GM085286 and No. R01 GM085286-01S (J. Q. B., H. G. G., R. P.). We would like to acknowledge Jane Kondev, Jeff Gelles, Jim Maher, Jason Kahn, John Marko, Sunney Xie, Eran Segal, Sankar Adhya, Paul Wiggins, Thomas Kuhlman, David Bensimon, Jon Widom, and Stephanie Johnson for many thoughtful discussions.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SI

RESOURCES