Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2012 May 16;102(10):2321–2330. doi: 10.1016/j.bpj.2012.04.008

Generalized Facilitated Diffusion Model for DNA-Binding Proteins with Search and Recognition States

Maximilian Bauer , Ralf Metzler ‡,§,
PMCID: PMC3353100  PMID: 22677385

Abstract

Transcription factors (TFs) such as the lac repressor find their target sequence on DNA at remarkably high rates. In the established Berg-von Hippel model for this search process, the TF alternates between three-dimensional diffusion in the bulk solution and one-dimensional sliding along the DNA chain. To overcome the so-called speed-stability paradox, in similar models the TF was considered as being present in two conformations (search state and recognition state) between which it switches stochastically. Combining both the facilitated diffusion model and alternating states, we obtain a generalized model. We explicitly treat bulk excursions for rodlike chains arranged in parallel and consider a simplified model for coiled DNA. Compared to previously considered facilitated diffusion models, corresponding to limiting cases of our generalized model, we surprisingly find a reduced target search rate. Moreover, at optimal conditions there is no longer an equipartition between the time spent by the protein on and off the DNA chain.

Introduction

The survival of cells relies crucially on their ability to efficiently control their metabolism. For example, the bacterium Escherichia coli is able to digest both glucose and lactose. However, the use of lactose as an energy source is disadvantageous if both substances are available. Hence, in this situation the production of lactose-digesting enzymes is inhibited. This task is accomplished by a transcription factor (TF) named the lac repressor that binds specifically to the respective operator region on the DNA.

More than four decades ago, Riggs et al. (1) observed in vitro that the association rate ka of the repressor to the operator in this system was ka = 7 × 109 M−1 s−1. The occurrence of this high rate came as a surprise, because an application of the classical Smoluchowski formula for diffusion-controlled reactions (2) yields a rate of ∼108 M−1 s−1, i.e., a result that is nearly two orders of magnitude smaller.

Riggs et al. attempted to explain this discrepancy and the observed dependence on ionic strength with the “electrostatic attraction between a positively charged site on the repressor and the negatively charged phosphate groups in the operator” (1). They rejected the idea that the repressor could bind to DNA and hop along it to reach the operator.

Four years later, Richter and Eigen (3) interpreted the results in the opposite way: they claimed that the electrostatic forces cannot explain the high association rate. In their model, the repressor first binds unspecifically to the DNA and then diffuses along the chain to the operator region that serves as a target for this process. This was the starting point for the formulation of the classical facilitated-diffusion model by Berg, von Hippel, and co-workers (4–8).

In their seminal work, they explain the unusually high rates as the result of the combination of different search mechanisms: because the target is part of the DNA chain, a TF combines three-dimensional diffusion in the bulk solution with one-dimensional sliding along the contour of the DNA. While bound to the chain it can dissociate from the chain and reassociate at another point on the DNA after a round of three-dimensional diffusion. Depending on the distances traveled in this manner, these processes are called hops (short distances) or jumps (long distances). This theoretical model successfully explains the experimentally observed nonmonotonic dependence of the association rate on the salt concentration (7). In some versions of the Berg-von Hippel model, another propagation mode for bound TFs exists called intersegmental transfer: if the DNA chain forms a loop, the protein can use it as a shortcut to reach another point on the DNA far away measured along the contour without dissociating. Another possibility are intersegmental jumps that are also possible for TFs with only one binding site. The interplay of these different search mechanisms was studied in Lomholt et al. (9) showing that intersegmental transfer can (partially) replace volume excursions as a mixing mechanism.

In subsequent years the target search problem for transcription factors was reformulated as a simplified version of the Berg-von Hippel model by Halford and Marko (10) and was studied starting from first principles by Klenin et al. (11). In addition to these kinetic studies, it was investigated by means of a stochastic approach by Coppey et al. (12) and by Meroz et al. (13). In a similar context, intermittent search processes were studied by Eliazar et al. (14) and by Oshanin et al. (15).

In recent years, the topic of search processes of TFs has obtained renewed interest. This has been fueled by the possibility of analyzing DNA-binding proteins on the single-molecule level (16–19), in some cases even in living cells (20).

Due to the increase in computational power over the past decades, it has also become possible to study this target search problem with computational methods, as done by Florescu and Joyeux (21,22) and by Koslover et al. (23). The main focus of the latter work was to elucidate the role of different DNA conformations following the experimental study by van den Broek et al. (24) and the related analytical approach by Lomholt et al. (25).

Even though the general form of the facilitated diffusion model is widely accepted, a conceptual problem exists as the so-called speed-stability paradox (26,27), which can be understood in the following way: the sliding along the DNA can be interpreted as a one-dimensional diffusion process through a random potential landscape. The effective diffusion constant in such a potential is proportional to exp(−σ2), where σ denotes the root-mean-squared roughness of the potential (28). As pointed out by Mirny and co-workers, the TF can only slide rapidly to the target if σ < kBT; concurrently the protein has to stay bound to the target sequence long enough for the actual gene expression to take place, which requires σ > 5 kBT (26,27).

The problem of these seemingly contradictory constraints can be solved by introducing two conformations of the TF: a search state and a recognition state (26,27), an idea already anticipated by Winter et al. (7). This is an assumption that is to some extent affirmed by multidimensional nuclear magnetic resonance observations (29). Similar two-state models for the target search of transcription factors have been studied by Hu et al. (30), Zhou (31), and Reingruber and Holcman (32). In a related context, Reingruber and Holcman (33) showed, analytically, for one-dimensional motion that the escape time from an interval can be reduced by introducing a second (faster-moving) state even if the particle is not allowed to leave the interval in this state.

However, a large majority of modern reinterpretations of the classical facilitated diffusion model assumes that start- and end-points of intermediate rounds of three-dimensional diffusion are completely uncorrelated, a fact already criticized by Hu et al. (34) and Kolomeisky (35). In this article, we consider the full facilitated diffusion model for rodlike DNA chains, where the correlations between dissociation and reassociation points are treated explicitly. In addition, we generalize it by including the two-state model to overcome the speed-stability paradox. The relaxed, coiled conformation is effectively included in our model. Consequently, many previous theoretical models are formed from the limiting cases of our approach. The two most important findings of this study are that: 1), the overall association rate in our generalized model is reduced in comparison to these limiting cases; and 2), at optimum, the time spent on the DNA by the TF exceeds the time spent in the bulk.

Methods

Model

Following the approach by Berg and Blomberg (4), we consider the search of a single TF on a DNA chain of contour length 2L, applying their closed-cell approach, which will be further clarified in the next section. The corresponding process of a single repressor on a rodlike DNA is schematically drawn in Fig. 1: the TF is illustrated as a U-shaped red particle and the target region on the DNA is depicted in black. The situation of a TF searching for a target on a DNA in the coiled conformation is described at the end of the following section. The TF starts its search for the target site somewhere in the bulk solution where it diffuses three-dimensionally (with diffusion constant D3). Some exemplary three-dimensional diffusive trajectories are drawn in gray color. Even though we consider the TF as being present in two possible conformations, while it diffuses in the bulk solution, we suppose that both conformations are physically indistinguishable (31).

Figure 1.

Figure 1

Schematic of the target search process by a transcription factor (TF, U-shape) on a rodlike DNA. The TF diffuses in the bulk and may slide one-dimensionally along the DNA while nonspecifically bound. Eventually it reaches its target, the specific binding site placed at z = 0.

On an encounter with some part of the DNA the protein can bind to it (with rate kon) and subsequently it will be able to slide along the DNA contour one-dimensionally (with diffusion constant Ds). After such an association, the protein is at first in the search mode, but it can change its conformation to the recognition mode (26) (with rate ksr). At the same time, there is the possibility to leave the search mode by dissociation to the bulk solution (with rate koff). The corresponding one-dimensional diffusive trajectories are depicted in blue in Fig. 1.

After leaving the chain, the TF will start a round of three-dimensional diffusion and will eventually return to some point on the DNA chain. Before such a dissociation event, however, the TF could change to the recognition-mode. It may still diffuse along the DNA contour, but now with the much smaller diffusion constant Dr. With rate krs it switches back to the search state.

The reason for the reduced diffusivity Dr in the recognition mode is that the protein-DNA binding energy is not constant along the chain. Thus, effectively, the motion can be described as diffusion in a rugged potential landscape, such that

Diexp(σi2). (1)

Here σi denotes the variance of the TF-DNA binding potential where i stands for either recognition or search mode (26,28). Whereas in the search mode the protein is only loosely bound to the DNA, in the recognition mode it interacts strongly with the DNA, such that σrσs and thus the difference in the diffusivities, DsDr. It is assumed that the operator can only recognize its specific binding site and bind tightly in the recognition mode.

The dynamical quantities that we study in the following are the densities per length ni(z,t) of the transcription factor on the DNA, where again i denotes either recognition or search mode and z stands for the distance along the DNA contour. These two densities obey the two coupled diffusion equations

nr(z,t)t=Dr2nr(z,t)z2krsnr(z,t)+ksrns(z,t), (2)

and

ns(z,t)t=Ds2ns(z,t)z2+krsnr(z,t)ksrns(z,t)+G(z,t)koffns(z,t)+koff0Ldz0tdτF(z,z,tτ)ns(z,τ). (3)

For symmetry reasons, it suffices to consider only half of the DNA with positive values of z: 0zL. Equation 2 describes the change of the density nr(z,t) of proteins in the recognition mode at position z and time t. The first term on the right-hand side corresponds to the one-dimensional (sliding) motion. The two other terms represent variations in the distribution resulting from the conformational change to and from the search mode.

In Eq. 3 for the distribution ns(z,t) of proteins in the search mode, the first three terms on the right-hand side are equivalent to the ones in Eq. 2. The term G denotes the virgin flux, i.e., the influx of TFs that have not been bound to the chain previously. The term proportional to koff describes particles dissociating from the chain into the bulk. The last term represents proteins that have dissociated at position z′ at an earlier time and now return to the DNA at z.

As already stated, we assume that in the beginning the repressor molecule is delocalized in the bulk that corresponds to vanishing initial concentrations ni on the chain,

nr(z,t=0)=ns(z,t=0)=0. (4)

We assume that for the TF in the recognition mode any encounter with the operator region at z = 0 leads to a specific association that is equivalent to the boundary condition

nr(z=0,t)=0. (5)

We assume that the extremity of the DNA at z = L acts as a reflecting boundary for the protein in the recognition mode. Reflection off the end of the DNA was indeed observed experimentally for the endonuclease EcoRV (36). Because the TF cannot see the operator while being in the search mode, for the corresponding density ns(z,t) both boundaries are reflecting

nsz|z=0=nsz|z=L=nrz|z=L=0. (6)

To convert the partial differential equations, Eq. 2 and Eq. 3 to ordinary differential equations, we apply a Laplace transformation, yielding

(u+krsDr2z2)n˜r(z,u)=ksrn˜s(z,u) (7)

and

(u+ksr+koffDs2z2)n˜s(z,u)=krsn˜r(z,u)+koff0LdzF˜(z,z,u)n˜s(z,u)+G˜(z,u). (8)

Here the tilde denotes a Laplace transform with respect to time,

n˜i(z,u)=0ni(z,t)exp(ut)dt,

where u is the corresponding variable with the physical dimension of a frequency. The mean specific association time τ of the TF can be then derived from the Laplace transform of the flux to the target,

j(t)=Drnr(z,t)z|z=0, (9)

in the following way:

τ=j˜(u)u|u=0. (10)

From this mean first-passage time, the association rate ka becomes (4)

ka=1τn0, (11)

where n0 denotes the density of operators placed on the DNA. To proceed, we determine the kernel F˜ and the virgin flux G˜ from first principles.

(Re)association process and closed-cell approach

The two functions G˜(u,z) and F˜(u,z) describe the association of previously unbound proteins and the reassociation of particles that dissociated earlier. To obtain these functions, it is necessary to study the diffusive motion of the transcription factors in the bulk.

We first consider the case of a rodlike chain to mimic chains that are arranged in parallel. If they are distributed homogeneously, it is possible to apply the closed-cell approach as described by Berg and Blomberg (4). We enclose every DNA with a parallel cylinder (of length 2L) with a radius of R2 that is equal to half the distance to the nearest DNA chains. Thus, this radius is an effective measure of the density of DNA chains and determines at which distance the sector of a neighboring chain begins. Because we assume an initially homogeneous distribution of chains, for TFs diffusing in the bulk, the point at which it attempts to enter its neighbor's sector is equivalent to the point at which it attempts to leave its own sector. Consequently, it is in order to consider a single particle in a single sector that is reflected at the boundaries of the sector instead of analyzing multiple particles that are allowed to leave their sectors.

For F˜ as well as for G˜, we solve the cylindrical diffusion equation

P(r,z,t)t=D3(2z2+1rrrr)P(r,z,t) (12)

for the probability density P(r,z,t) to find the particle at position (r, z) at time t. Because of the closed-cell approach, we consider the boundary conditions,

P(r,z,t)z|z=0,L=P(r,z,t)r|r=R2=0, (13)

where, again, due to symmetry, we only consider positive values of z.

In the case of F˜, initially there is no TF in the space between the two cylinders and the protein that has just dissociated from the DNA chain of radius R1. This is introduced in terms of a δ-function in the reactive boundary condition at r = R1,

P(r,z,t=0)=0andD3P(r,z,t)r|r=R1=δ(t)δ(zz)2πR1+konP|r=R1. (14)

Here and in the following formulas, 2πR1 denotes the effective circumference of the DNA chain and has to be included as rotational symmetry is assumed throughout. After solving for P(r,z,t) and calculating its Laplace transform, F˜(z,z,u) is obtained via

F˜(z,z,u)=2πR1konP˜(r=R1,z,u), (15)

where kon denotes a phenomenological binding rate to the DNA.

In the case of G˜, initially the starting position of the TF is uniformly distributed outside the DNA and we have a reactive boundary condition at r = R1,

P(r,z,t=0)=1πL(R22R12)andD3P(r,z,t)r|r=R1=konP|r=R1. (16)

The denominator includes the cross-section π(R22R12) between inner and outer cylinders. G˜(z,u) is calculated in the form

G˜(z,u)=2πR1konP˜(r=R1,z,u). (17)

The results for the cylindrical geometry are identical to the ones for Berg and Blomberg (4) (note that their definition of the binding rate k slightly differs from the one used here for kon). We obtain

F˜(z,z,u)=m=0gm(u)cos(mπzL)cos(mπzL), (18)

where

gm(u)=2δm,0LkonΔ01(amR1)konΔ01(amR1)D3amΔ11(amR1)

with

am=uD3+m2π2L2,

and where we introduced the auxiliary functions

Δ01(amr)=I0(amr)K1(amR2)+I1(amR2)K0(amr), (19)
Δ11(amr)=I1(amr)K1(amR2)I1(amR2)K1(amr). (20)

Here Ii and Ki represent modified Bessel functions of order i. The occurrence of the term F˜ in fact leads to a short-time superdiffusive motion along the cylinder that was studied for infinitely long cylinders by Chechkin et al. (37,38).

In an analogous way, we obtain

G˜(u)=2R1L(R22R12)konuD3a0Δ11(a0R1)D3a0Δ11(a0R1)konΔ01(a0R1). (21)

Note that because we assume that the protein initially is homogeneously distributed outside the DNA chain, the function G˜(u) does not depend on z.

For DNA in the relaxed, coiled conformation, the situation is slightly different (see Fig. 2). Here segments of the same DNA chain can be very close to each other. For instance, for the TF in Fig. 2, which has just dissociated from the lower part of the chain, the probability to be captured by the upper part (close in real-space, but distant measured along the contour of the chain) is very high. In a rather simplified model for such coiled DNA, it is common to assume that, after dissociating from the chain, the TF immediately loses track of its z coordinate and that it reassociates with equal probability to any position on the chain. In terms of our model, this assumption corresponds to putting gm(u) = 0 for all m ≥ 1 in Eq. 18.

Figure 2.

Figure 2

Schematic of the search process on coiled DNA.

Results and Discussion

Mean association time

We first concentrate on the case of rodlike DNA: once F˜(z,z,u) and G˜(u) are known, it is possible to determine the mean association time, τ, by solving the coupled Eqs. 7 and 8. The corresponding calculations are presented in the Appendix. We obtain

τ=τ1+N(τ2+τ3)τ4. (22)

The result is in its form similar to the classical result of Berg and Blomberg (4) for only one bound state, and the two terms τ1 and τ2 are, in fact, identical in both models. The various entries of Eq. 22 can be interpreted in the following way.

The value τ1 is the mean time for the first encounter with the DNA chain that results in successful binding of the TF to the DNA,

τ1=LG˜˙(0)=R222D3{1R12/R22konR1/D3+ln(R2/R1)1R12/R22}R228D3{3R12/R22}, (23)

where we abbreviated G˜˙(u)=dG˜(u)/du. The value τ2 is the mean time the TF spends in the bulk after a dissociation from the chain,

τ2=Lg˙0(0)=R22R122konR1. (24)

The value τ3 denotes the mean time that the protein spends bound to the chain both in the recognition and in the search mode,

τ3=1ϑ(1krs+1ksr). (25)

Here, ϑ = koff/ksr is a measure for proteins in the search state describing how likely the occurrence of a dissociation event is in comparison to a prior conformational change to the recognition state.

The term N denotes the number of rounds of three-dimensional excursions followed by reassociation events and the subsequent sliding along the DNA needed to reach the target. All the information about the spatial organization of the chain is stored in this term, or, more precisely, in the functions gm(u). We find

N=ϑxrcoth(1xr)+2ϑm=1[pr(m)(pr(m)(ps(m)+ϑ(1Lgm(0)2))1)]1 (26)

where pi(m) = 1 + m2π2xi2 and i stands for either recognition or search mode. The terms

xr=Dr(krsL2)andxs=Ds(ksrL2)

are dimensionless quantities measuring the fraction of the whole contour length that a protein in the corresponding conformation can slide along before changing to the other state (neglecting dissociation events).

Finally, τ4 = 1/krs is a correction term accounting for the fact that, in contrast to the first N−1 rounds of consecutive three-dimensional and one-dimensional diffusion, in the final round the last conformational change to the search state is not performed and the TF therefore reaches the target in the recognition mode.

Connection to previous models

Before we consider the result in more detail, it is in order to relate it to previous results. We first consider the case when the conformational changes between search and recognition mode occur very rapidly, that is, before a considerable fraction of the DNA was scanned. Mathematically this corresponds to krsDr/L2 and ksrDs/L2 or xr/s ≈ 0.

In this limit, N can be written as

Nm=12ϑϑ(1Lgm(0)/2)+m2π2(xr2+xs2) (27)

and τ4 ≈ 0 can be neglected. If we further assume that Dr = Ds = D1 and krs = ksr such that both states are indistinguishable apart from the fact that the search state is insensitive to the target, we get

τ3=2koff (28)

and

N2koffkoff(1Lgm(0)/2)+2m2π2D12/L2. (29)

If we identify koff/2 with the λ in the work of Berg and Blomberg (4), both results for the mean association time coincide. The factor of 2 between the dissociation rates stems from the fact that the protein is only able to dissociate while being in the search state that in the fast-switching limit with krs = ksr amounts to half of the total time bound to the chain. Thus, to obtain matching results, the rate at which the TF dissociates must be doubled with respect to the classical model. The result of Lomholt et al. (25) for the straight-rod configuration corresponds to the limit of an infinitely long DNA chain, and different boundary conditions. Namely, instead of our finite system, Lomholt et al. (25) consider an infinite system (both longitudinally and radially) with a finite protein concentration in the vicinal bulk. Technically, the infinite chain limit replaces the Fourier series 18 with a continuous Fourier transform; compare Eqs. 4 and 12 in Lomholt et al. (25).

As a second limiting case, we consider the simplified model for coiled DNA as described in the last section; that is, we use gm(u) = 0 for all m ≥ 1. Then it is possible to evaluate the sum in Eq. 26 explicitly, yielding

N=ϑ(1+X+(xs2xr2(1+ϑ))2xr2Xf(y+)+X(xs2xr2(1+ϑ))2xr2Xf(y)), (30)

where we use the notation of Reingruber and Holcman (33) for f(x)=coth(x)/x1/x. In Eq. 30, we introduced the quantities

X=xs42xr2xs2(ϑ1)+xr4(1+ϑ)2 (31)

and

y±=xr2(1+ϑ)+xs2±X2xr2xs2. (32)

Changing our initial condition to nr(z,t = 0) = 1/L and G(z,t) = 0, i.e., the particle starts uniformly distributed in the recognition state and there are initially no unbound particles, we obtain

τ=(τ2+τ3)(Nϑ). (33)

This coincides with the result of Reingruber and Holcman (32). Note the typo in the definition of ξ2 in Reingruber and Holcman (32); ξ2 should read

12(l23l21)l12+(l21+l23)2l122.

Finally, their previous result (33) can be obtained by setting koff = 0 (that is, the TF will not dissociate from the DNA at all). Then we have

τ=(1krs+1ksr)xr2+3xs2f(1/xr2+1/xs2)3xr2(xr2+xs2). (34)

Performing the same approximation for coiled DNA in the original one-state model of Berg and Blomberg, we obtain for TFs starting homogeneously distributed in the bulk solution,

τ=τ1+N(τ2+τ3), (35)

where τ1 and τ2 are the same as before. But in this case we have τ3=koff1 and

N=koffL2D1coth(koffL2D1)1=koffL2D1f(koffL2D1), (36)

in accordance with the result obtained by Coppey et al. (12) and similarly by Slutsky and Mirny (26). For the latter search mechanism, it is known that the mean association time is minimized when the times spent on and off the chain are equal (12,26).

Numerical results

In the general case, it is not feasible to obtain a result in closed form for the mean association time τ, due to the complex infinite sum in Eq. 26. However, it is possible to examine τ numerically. If not stated otherwise, we consider the following parameter values that are suitable for the lac repressor in E. coli.

Following Riggs et al. (1), we consider a concentration of operators of n0 = 1 pM. Using n0 = (πR222L)−1 and 2L = 16 μm, this corresponds to R2 ≈ 5.75 μm (4). Furthermore, we take for the effective radius of the DNA cylinder, R1 = 6 nm, a three-dimensional diffusion constant of D3 = 50 μm2/s and a reaction rate of kon = 5.56 × 104 μm/s (4,5). Note that the physical dimension of length/time is a matter of choice when we define the coupling constant kon in a cylindrical geometry; see Eq. 14 and the discussion in Chechkin et al. (38).

As described in Zhou (31), the transition rate from the recognition to the search state should be much larger than the opposite one, to guarantee that the TF spends most of the time in the faster diffusing state. Consequently we use ksr = krs/10 = 104/s, in the range of rates known for repressor isomerizations (39). The experimentally accessible apparent diffusion constant (32)

Da=Ds/ksr+Dr/krs1/ksr+1/krs (37)

in such a fast switching case approximates as

DaDs1+ksr/krs. (38)

For the one-dimensional diffusion coefficient in the search state, we employ Ds = 0.05 μm2/s (18,20). Now, according to Eq. 1, the diffusion constant in the recognition state, Dr, can be written as Dr = Ds × exp(−χ), where χ denotes the activation (32).

We consider the contributions of the characteristic terms in Eq. 22 to the mean association time for χ = 8. The corresponding values are plotted in Fig. 3 as a function of the dissociation rate koff. Experimentally, this corresponds to different salt concentrations because increasing the salt concentration leads to an increasing dissociation frequency (6,7). The solid line represents the total mean association time, the dotted line the contribution of τ1, and the dashed and dot-dashed lines the contributions of 2 and 3. The latter two represent the total times spent off and on the chain after the first nonspecific association with the chain. Apparently, the time it takes for this first association, τ1, is independent of the dissociation rate and is negligible for the parameters considered here. The total time the particle spends off the chain increases with koff, whereas the time spent on the DNA decreases. At koff/ksr ≈ 10−3, the mean association time has a minimum. For values much smaller and larger than this optimal value, one of the two terms is dominant and the other one negligible. For very small koff, a plateau is reached (out of the range of Fig. 3).

Figure 3.

Figure 3

Contributions to the mean association time (solid line) as a function of the dissociation rate koff in units of ksr: first association time τ1 (dotted line), total time spent off the chain, 2 (dashed line), and total time spent on the chain, 3 (dot-dashed line). For other parameters, see text.

However, unlike in the simplified one-state model for coiled DNA at the optimal value of the dissociation rate, the TF does not spend equal amounts of time bound and unbound. In fact, for the parameters used in Fig. 3, the optimal partition is that the particle spends ≈73% of the time on the DNA chain. This observation is reminiscent of the experimental findings that, in vivo, TFs seem to spend nearly 90% of the time nonspecifically bound to the DNA chain (7,20,40). The value of the optimal fraction of time spent nonspecifically bound depends obviously on the choice of the parameter values. The characteristic measure for the influence of the two-state scenario on this optimal fraction is the typical time spent in the recognition mode as quantified by 1/krs, or the typical sliding length in the recognition mode, r=Dr/krs (32). Increase of ℓr will further increase the value of the optimal fraction, and vice versa.

Next we investigate the influence of χ, measuring the difference of the diffusivities in both states, on the association rate ka = 1/(τn0), because this is the rate that is usually studied experimentally. The results are depicted in Fig. 4, again as a function of the dissociation rate koff, but this time over a larger range of values. We consider the case where χ = 4 (dashed line), implying Ds/Dr ≈ 55, in more detail, because the nonmonotonic shape of the corresponding curve here is most definite. We note that, experimentally, this ratio has, to our best knowledge, not been measured, and only the apparent diffusion constant Da from Eq. 37 is available in literature (18,20). To begin, we observe that the maximal rate ka,max ≈ 9.2 × 109 M−1 s−1 is close to the experimentally obtained value (1). Winter et al. (7) measured that this maximum occurred at 0.1 M KCl concentration. Moreover, the shape of the curves is similar to earlier results (7,23) typical for facilitated diffusion models where three different regimes can be distinguished.

Figure 4.

Figure 4

Association rate ka as a function of the dissociation rate, koff in units of ksr, for three values of χ: χ = 4 (dashed line), χ = 8 (solid line), and χ = 12 (dot-dashed line). For other parameters, see text.

For koff/ksr ≲ 10−6, the association rate is nearly constant. In this regime, ksr is several orders of magnitude larger than koff; consequently, after binding to the DNA the protein will not even once leave the chain for a round of three-dimensional diffusion. Thus, the time to reach the target is determined by the time of first association, τ1, and the time it takes to slide to the target without dissociating, but undergoing numerous conformational changes. The corresponding association rate ka can be calculated from Eq. 34 to which τ1 and ksr1 have to be added to take into account the different initial conditions. However, such an essentially one-dimensional search strategy is highly redundant and thus, disadvantageous, because already scanned regions are visited repeatedly—an effect known as oversampling.

In the opposite regime, koff/ksr ≳ 10−1, there is a substantial probability that the particle will dissociate before it could explore the neighboring region extensively in both the recognition and the search state. The larger the koff is, the more unfavorable this effect becomes, which is revealed by the behavior kakoff1 in this regime. In a way, with these parameters, the protein spends too much time diffusing in three dimensions, which is equally disadvantageous for the total search process.

As already seen in the discussion of the contributions to the mean association time in the intermediate regime koff/ksr ≈ 10−3, a maximum of the association rate can be found. Here the partition between the times spent on and off the chain is optimal. On the one hand, the TF leaves the chain soon enough before the redundant one-dimensional search becomes disadvantageous; but, on the other hand, it is attached long enough to the chain to efficiently search for the target.

For even lower diffusion constants in the recognition mode, χ = 8, corresponding to Ds/Dr ≈ 3000 (solid line) and χ = 12 (equivalent to Ds/Dr ≈ 160,000 (dot-dashed line)), the nonmonotonic shape of the curve is still present although less pronounced. Thus, the acceleration due to facilitated diffusion becomes less and less important, the slower the one-dimensional diffusion in the recognition mode is. We also observe that, for TFs with a lowered diffusivity, the value of koff at which the maximal association rate is reached is shifted to lower values. This is obvious, because slower diffusing TFs should stay bound to the chain for longer times to obtain an effective search.

In Fig. 5 we compare our generalized model with the same parameters as in Fig. 3 to three of its special cases that correspond to previously studied models. The solid line corresponds to the full two-state model (with χ = 8) and is thus identical to the solid line in Fig. 4. The dotted line displays the behavior in the simplified two-state model for coiled DNA (32) in which the reassociation time τ2 was chosen to be the same as in our model. The dashed line corresponds to the classical one-state model of Berg and Blomberg (4) and the dot-dashed line to the limiting case for coiled DNA (12). We use D1 = 0.0455 μm2/s such that Da = D1 for the parameters stated in the preceding section.

Figure 5.

Figure 5

Association rate, ka, as a function of the dissociation rate koff in units of ksr for four different models: the two-state model presented in this article (solid line), two-state model for coiled DNA (dotted line), original one-state model (4) (dashed line), and one-state model for coiled DNA (12) (dot-dashed line). For other parameters, see text.

In all four cases, we observe the three regimes that we mentioned before. For small values of koff, the two-state model for coiled DNA approaches the same value as the corresponding full model. In the same regime, both one-state models approach the rate: ka = 1/(n0(τ1 + τ3)), where τ3 is the well-known first-passage time in an interval τ3 = L2/(3D1) (41). Also, for very high dissociation rates the association rate obtained with the two models for coiled DNA approach their corresponding values for straight DNA. However, in the one-state models, the scaling in this regime reads kakoff1/2. Thus, for the one-state model, a further increase of the dissociation rate in this regime is less fatal than for the two-state model, where kakoff1.

We observe that, in both cases, the search rate for coiled DNA is larger than for rodlike DNA, and that the value at which the maximum occurs is shifted. The first point can be attributed to the fact that τ2 that was chosen identical for both conformations is just the mean time the TF spends for a bulk excursion. In our approximation for coiled DNA it means that, on this timescale, the TF jumps to any other position on the chain, whereas for straight DNA most of the jumps lead to a nearby position, and the rare long-range jumps take more time. Thus, in some sense, the distribution of the TF equilibrates much faster and the redundant, effectively one-dimensional search on straight DNA is avoided.

Another observation that is, at first sight, disappointing is that the two-state model is always slower in target location than the corresponding one-state model. This might seem to contradict the findings of Reingruber and Holcman (33). However, they showed that the introduction of a second faster diffusing state decreases the mean first-passage time from an interval, whereas, in the context of this article, the new state is much slower than the one in the one-state model (such that Da = D1) and its motivation is to overcome the speed-stability paradox.

Conclusions

The generalized model for facilitated diffusion studied herein describes the specific association of a TF with its operator region on DNA, as studied in typical in vitro experiments. We generalized the classical Berg-von Hippel model of facilitated diffusion to the case when the TF is assumed to alternate between two conformational states. The protein starts the search process somewhere in the bulk solution, and binds nonspecifically to the DNA chain. Bound to the chain, it explores the DNA one-dimensionally in the search state characterized by its relatively high diffusivity. Stochastically, it switches to the recognition state in which it has a significantly reduced diffusivity but is able to recognize, and bind, to the target binding site. A key result is that, for typical parameters, the association rate to the target is reduced compared to previous models, which correspond to limiting cases of our generalized model. Although the generalized model still has a maximum of the association rate as function of the ambient salt concentration, this maximum is more pronounced when the diffusion constant of the recognition state is close to the one of the search state. However, even for nearly immobile TFs in the recognition mode, the maximum is present.

The classical Berg-von Hippel model of facilitated diffusion with only one bound state, as well as related studies that interpret the target search process as a narrow-escape problem, are presented as limiting cases of our model. Unlike many recent reformulations of the problem in which the TF spends equal amounts of time on and off the DNA, we obtain that already in the dilute in vitro situation it is advantageous for the TF to spend more time bound to the chain than in the bulk solution. The optimal partition depends on the actual values of the system parameters.

When taking typical experimentally obtained one-dimensional diffusion constants for the effective diffusion constant in the fast-switching limit of our generalized model, we obtain association rates that are actually lower than the ones obtained with the corresponding classical Berg-von Hippel model. However, our model with two conformational states reconciles tight binding to the target with fast diffusion along the chain and treats the microscopic excursions explicitly.

An important question for future studies is whether it is possible to extend the model to the in vivo situation, where molecular crowding and the resulting slow or even anomalous diffusion might play an important role. Recent simulation studies indicate that, particularly, magnitudes of the binding constants become modified (42).

Appendix A: Determination of the Mean Association Time

To obtain the mean association time, τ, we use an analogous ansatz to Berg and Blomberg (4) that satisfies the boundary conditions, Eq. 5 and Eq. 6:

n˜r(u,z)=n=0fn(u)sin(2n+12Lπz), (39)
n˜s(u,z)=m=0hm(u)cos(mπzL). (40)

According to Eq. 9, we have

j˜(u)=Drπ2Ln=0(2n+1)fn(u). (41)

Inserting the ansatz in Eq. 7 and Eq. 8, we obtain two equations that can be complemented by multiplying Eq. 7 with cos(mπz/L) and a subsequent integration over z. Omitting the argument u for a better readability, we obtain

2LπγmYm+j˜=ksrLmhm, (42)
hmLm(αmkoffLmgm)=GLδm,0+2LπkrsYm, (43)
j˜=2DrLm,n=0ksrβnhm(2n+1)2(2n+1)24m2, (44)

where all the functions denoted by Greek letters have the physical dimension of a frequency:

αm(u)=u+koff+ksr+Dsm2π2L2,
βn(u)=u+krs+Dr(2n+1)2π2(4L2),
γm(u)=u+krs+Drm2π2L2,
Lm=L(1+δm,0)2,

and

Ym(u)=n=0fn(u)2n+1(2n+1)24m2.

With the first two equations, Ym(u) can be eliminated such that

hmLm[γm(αmkoffLmgm)krsksr]=GLγmδm,0krsj˜. (45)

Together with Eq. 44, we obtain, omitting the argument u,

j˜=LG/{krsγ0+ϵ0(γ0[α0koffLg0]krsksr)γ0×[L22Drksr+m=12krs/ϵmγm(αmkoffL2gm)krsksr]}, (46)

where

ϵm(u)=(n=01βn(2n+1)2(2n+1)24m2)1.

Starting from this result, we can calculate the mean association time Eq. 22 via Eq. 10.

Acknowledgments

Support from the Academy of Finland within the FiDiPro scheme is gratefully acknowledged.

References

  • 1.Riggs A.D., Bourgeois S., Cohn M. The lac repressor-operator interaction. 3. Kinetic studies. J. Mol. Biol. 1970;53:401–417. doi: 10.1016/0022-2836(70)90074-4. [DOI] [PubMed] [Google Scholar]
  • 2.von Smoluchowski M. Three presentations on diffusion, molecular movement according to Brown and coagulation of colloid particles. Physikal. Zeitschr. 1916;17:557–571. [Google Scholar]
  • 3.Richter P.H., Eigen M. Diffusion controlled reaction rates in spheroidal geometry. Application to repressor-operator association and membrane bound enzymes. Biophys. Chem. 1974;2:255–263. doi: 10.1016/0301-4622(74)80050-5. [DOI] [PubMed] [Google Scholar]
  • 4.Berg O.G., Blomberg C. Association kinetics with coupled diffusional flows. Special application to the lac repressor-operator system. Biophys. Chem. 1976;4:367–381. doi: 10.1016/0301-4622(76)80017-8. [DOI] [PubMed] [Google Scholar]
  • 5.Berg O.G., Blomberg C. Association kinetics with coupled diffusion. III. Ionic-strength dependence of the lac repressor-operator association. Biophys. Chem. 1978;8:271–280. doi: 10.1016/0301-4622(78)80010-6. [DOI] [PubMed] [Google Scholar]
  • 6.Berg O.G., Winter R.B., von Hippel P.H. Diffusion-driven mechanisms of protein translocation on nucleic acids. 1. Models and theory. Biochemistry. 1981;20:6929–6948. doi: 10.1021/bi00527a028. [DOI] [PubMed] [Google Scholar]
  • 7.Winter R.B., Berg O.G., von Hippel P.H. Diffusion-driven mechanisms of protein translocation on nucleic acids. 3. The Escherichia coli lac repressor-operator interaction: kinetic measurements and conclusions. Biochemistry. 1981;20:6961–6977. doi: 10.1021/bi00527a030. [DOI] [PubMed] [Google Scholar]
  • 8.von Hippel P.H., Berg O.G. Facilitated target location in biological systems. J. Biol. Chem. 1989;264:675–678. [PubMed] [Google Scholar]
  • 9.Lomholt M.A., Ambjörnsson T., Metzler R. Optimal target search on a fast-folding polymer chain with volume exchange. Phys. Rev. Lett. 2005;95:260603. doi: 10.1103/PhysRevLett.95.260603. [DOI] [PubMed] [Google Scholar]
  • 10.Halford S.E., Marko J.F. How do site-specific DNA-binding proteins find their targets? Nucleic Acids Res. 2004;32:3040–3052. doi: 10.1093/nar/gkh624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Klenin K.V., Merlitz H., Wu C.X. Facilitated diffusion of DNA-binding proteins. Phys. Rev. Lett. 2006;96:018104. doi: 10.1103/PhysRevLett.96.018104. [DOI] [PubMed] [Google Scholar]
  • 12.Coppey M., Bénichou O., Moreau M. Kinetics of target site localization of a protein on DNA: a stochastic approach. Biophys. J. 2004;87:1640–1649. doi: 10.1529/biophysj.104.045773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Meroz Y., Eliazar I., Klafter J. Facilitated diffusion in a crowded environment: from kinetics to stochastics. J. Phys. A. 2009;42:434012. [Google Scholar]
  • 14.Eliazar I., Koren T., Klafter J. Searching circular DNA strands. J. Phys. Condens. Matter. 2007;19:065140. [Google Scholar]
  • 15.Oshanin G., Wio H.S., Burlatsky S.F. Intermittent random walks for an optimal search strategy: one-dimensional case. J. Phys. Condens. Matter. 2007;19:065142. [Google Scholar]
  • 16.Pant K., Karpel R.L., Williams M.C. Kinetic regulation of single DNA molecule denaturation by T4 gene 32 protein structural domains. J. Mol. Biol. 2003;327:571–578. doi: 10.1016/s0022-2836(03)00153-0. [DOI] [PubMed] [Google Scholar]
  • 17.Sokolov I.M., Metzler R., Williams M.C. Target search of N sliding proteins on a DNA. Biophys. J. 2005;89:895–902. doi: 10.1529/biophysj.104.057612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wang Y.M., Austin R.H., Cox E.C. Single molecule measurements of repressor protein 1D diffusion on DNA. Phys. Rev. Lett. 2006;97:048302. doi: 10.1103/PhysRevLett.97.048302. [DOI] [PubMed] [Google Scholar]
  • 19.Bonnet I., Biebricher A., Desbiolles P. Sliding and jumping of single EcoRV restriction enzymes on non-cognate DNA. Nucleic Acids Res. 2008;36:4118–4127. doi: 10.1093/nar/gkn376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Elf J., Li G.-W., Xie X.S. Probing transcription factor dynamics at the single-molecule level in a living cell. Science. 2007;316:1191–1194. doi: 10.1126/science.1141967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Florescu A.-M., Joyeux M. Description of nonspecific DNA-protein interaction and facilitated diffusion with a dynamical model. J. Chem. Phys. 2009;130:015103. doi: 10.1063/1.3050097. [DOI] [PubMed] [Google Scholar]
  • 22.Florescu A.-M., Joyeux M. Dynamical model of DNA-protein interaction: effect of protein charge distribution and mechanical properties. J. Chem. Phys. 2009;131:105102. [Google Scholar]
  • 23.Koslover E.F., Díaz de la Rosa M.A., Spakowitz A.J. Theoretical and computational modeling of target-site search kinetics in vitro and in vivo. Biophys. J. 2011;101:856–865. doi: 10.1016/j.bpj.2011.06.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.van den Broek B., Lomholt M.A., Wuite G.J. How DNA coiling enhances target localization by proteins. Proc. Natl. Acad. Sci. USA. 2008;105:15738–15742. doi: 10.1073/pnas.0804248105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lomholt M.A., van den Broek B., Metzler R. Facilitated diffusion with DNA coiling. Proc. Natl. Acad. Sci. USA. 2009;106:8204–8208. doi: 10.1073/pnas.0903293106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Slutsky M., Mirny L.A. Kinetics of protein-DNA interaction: facilitated target location in sequence-dependent potential. Biophys. J. 2004;87:4021–4035. doi: 10.1529/biophysj.104.050765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Mirny L., Slutsky M., Kosmrlj A. How a protein searches for its site on DNA: the mechanism of facilitated diffusion. J. Phys. A. 2009;42:434013. [Google Scholar]
  • 28.Zwanzig R. Diffusion in a rough potential. Proc. Natl. Acad. Sci. USA. 1988;85:2029–2030. doi: 10.1073/pnas.85.7.2029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kalodimos C.G., Biris N., Kaptein R. Structure and flexibility adaptation in nonspecific and specific protein-DNA complexes. Science. 2004;305:386–389. doi: 10.1126/science.1097064. [DOI] [PubMed] [Google Scholar]
  • 30.Hu L., Grosberg A.Y., Bruinsma R. Are DNA transcription factor proteins Maxwellian demons? Biophys. J. 2008;95:1151–1156. doi: 10.1529/biophysj.108.129825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zhou H.-X. Rapid search for specific sites on DNA through conformational switch of nonspecifically bound proteins. Proc. Natl. Acad. Sci. USA. 2011;108:8651–8656. doi: 10.1073/pnas.1101555108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Reingruber J., Holcman D. Transcription factor search for a DNA promoter in a three-state model. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2011;84:020901. doi: 10.1103/PhysRevE.84.020901. [DOI] [PubMed] [Google Scholar]
  • 33.Reingruber J., Holcman D. Narrow escape for a stochastically gated Brownian ligand. J. Phys. Condens. Matter. 2010;22:065103. doi: 10.1088/0953-8984/22/6/065103. [DOI] [PubMed] [Google Scholar]
  • 34.Hu T., Grosberg A.Y., Shklovskii B.I. How proteins search for their specific sites on DNA: the role of DNA conformation. Biophys. J. 2006;90:2731–2744. doi: 10.1529/biophysj.105.078162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kolomeisky A.B. Physics of protein-DNA interactions: mechanisms of facilitated target search. Phys. Chem. Chem. Phys. 2011;13:2088–2095. doi: 10.1039/c0cp01966f. [DOI] [PubMed] [Google Scholar]
  • 36.Jeltsch A., Pingoud A. Kinetic characterization of linear diffusion of the restriction endonuclease EcoRV on DNA. Biochemistry. 1998;37:2160–2169. doi: 10.1021/bi9719206. [DOI] [PubMed] [Google Scholar]
  • 37.Chechkin A.V., Zaid I.M., Metzler R. Bulk-mediated surface diffusion along a cylinder: propagators and crossovers. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2009;79:040105. doi: 10.1103/PhysRevE.79.040105. [DOI] [PubMed] [Google Scholar]
  • 38.Chechkin A.V., Zaid I.M., Metzler R. Effective surface motion on a reactive cylinder of particles that perform intermittent bulk diffusion. J. Chem. Phys. 2011;134:204116. doi: 10.1063/1.3593198. [DOI] [PubMed] [Google Scholar]
  • 39.Wu F.Y.-H., Bandyopadhyay P., Wu C.-W. Conformational transitions of the lac repressor from Escherichia coli. J. Mol. Biol. 1976;100:459–472. doi: 10.1016/s0022-2836(76)80040-x. [DOI] [PubMed] [Google Scholar]
  • 40.Bakk A., Metzler R. In vivo non-specific binding of λ CI and Cro repressors is significant. FEBS Lett. 2004;563:66–68. doi: 10.1016/S0014-5793(04)00249-2. [DOI] [PubMed] [Google Scholar]
  • 41.Szabo A., Schulten K., Schulten Z. First passage time approach to diffusion controlled reactions. J. Chem. Phys. 1980;72:4350–4357. [Google Scholar]
  • 42.Morelli M.J., Allen R.J., Wolde P.R. Effects of macromolecular crowding on genetic networks. Biophys. J. 2011;101:2882–2891. doi: 10.1016/j.bpj.2011.10.053. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES