Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2008 Sep 22.
Published in final edited form as: Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Aug 24;76(2 Pt 1):021121. doi: 10.1103/PhysRevE.76.021121

Molecular Spiders with Memory

Tibor Antal 1, P L Krapivsky 2
PMCID: PMC2546873  NIHMSID: NIHMS49257  PMID: 17930020

Abstract

Synthetic bio-molecular spiders with “legs” made of single-stranded segments of DNA can move on a surface which is also covered by single-stranded segments of DNA complementary to the leg DNA. In experimental realizations, when a leg detaches from a segment of the surface for the first time it alters that segment, and legs subsequently bind to these altered segments more weakly. Inspired by these experiments we investigate spiders moving along a one-dimensional substrate, whose legs leave newly visited sites at a slower rate than revisited sites. For a random walk (one-leg spider) the slowdown does not effect the long time behavior. For a bipedal spider, however, the slowdown generates an effective bias towards unvisited sites, and the spider behaves similarly to the excited walk. Surprisingly, the slowing down of the spider at new sites increases the diffusion coefficient and accelerates the growth of the number of visited sites.

I. INTRODUCTION

Chemists have recently constructed various synthetic molecular systems (see e.g. [13] and a review [4]) which can move on surfaces and tracks. One class of such objects aptly named spiders [5] consists of relatively small molecules with legs which are short single-stranded segments of DNA. These spiders can move on a surface covered with single-stranded DNA segments, called substrates. The substrate DNA is complementary to the leg DNA. The motion proceeds as legs bind to the surface DNA through the Watson-Crick mechanism, then dissociate, then rebind again, etc. More precisely, a bond on the substrate with an attached leg is first cleaved [5], and the leg then dissociates from the affected substrate (which is called product [5]). After that the leg can rebind to the new substrate or to the product.

The rate of attachment of a leg of a spider to the substrate and the rate of detachment from the substrate are different from the corresponding rates involving the product. Therefore for the proper description of the motion of a single spider one must keep track of its entire trajectory. This memory requirement makes the problem non- Markovian [6, 7]. Such problems tend to be intractable, though in one dimension non-Markovian problems are occasionally solvable, see e.g. [812].

In this paper we continue the investigation of molecular spiders that move along one-dimensional tracks [13]. The goal is to study the interplay between the spiders and their environment which has been ignored in Ref. [13]. We shall assume that each leg of a spider steps independently and symmetrically to neighboring empty sites at rates 1 from revisited sites and at rates r from sites visited for the first time. For molecular spiders r < 1 since legs are bound more firmly to new sites. Two types of spiders were introduced in [13] which differ in the type of constraint that keeps the legs close to each other. For local spiders or centipedes, the distance between adjacent legs is ≤ s; for global spiders, the distance between any two legs is ≤ S. Without memory (r = 1), spiders with local constraint can be mapped onto exclusion processes with open boundaries, while spiders with global constraint map to the simple exclusion process with periodic boundary conditions [13]. These mappings simplify the computations of the diffusion coefficient and, e.g., they allow to determine the diffusion coefficient for general global spiders with an arbitrary number of legs L and an arbitrary maximal distance S, and for local L-leg spiders with s = 2.

The non-Markovian character does not make the problem hopeless since the visited area is very simple, namely the visited sites (product) form an island in the sea of unvisited sites. (This happens only in one dimension, and the two-dimensional problem appears intractable.) Graphically, we display a leg of the spider by •, an empty sites as o, and we put a hat at sites which have never been visited earlier, i.e. where the DNA substrate is still uncleaved. A typical configuration for a bipedal spider then looks like graphic file with name nihms-49257-f0004.jpg

The memory solely acts through the slowdown of legs on newly visited sites. For a one-leg spider, which is merely a random walk, memory does not affect the asymptotic behavior of the number of visited sites and other quantities. On the other hand, the slowdown generates an effective bias of multi-pedal spiders. This bias always points outward. Remarkably, the slowdown effectively speeds the spider up, that is, it leads to the increase of the diffusion coefficient and the number of visited sites.

To mimic the effective bias of the spiders, we investigate a random walk which hops asymmetrically from newly visited sites. This model is similar to the so-called exited walk which has been actively investigated in recent years in the mathematics literature, see e.g. [810] and references therein.

The outline of this paper is as follows. In Sec. II, we investigate a random walk with memory. First we analyze the symmetric random walk that slows down on the new sites, and then we obtain exact results for the exited walk which additionally takes into account a bias on the new sites. In Sec. III, we discuss the behavior of the bipedal spiders with maximal distance between the legs s = 2. Some of the properties of these spiders are exactly computed, while others are established via the analogy with the excited random walk. In Sec. IV, we generalize these results to global spiders with L = S. We draw conclusions in Sec. V. Finally, in an appendix, some of the details required for calculations of Sec. II are presented.

II. EXCITED RANDOM WALK

In this section we consider the case of a one-leg spider. We shall begin with a nearest-neighbor symmetric random walk on the one-dimensional lattice. By definition, if the walker occupies a site for the first time, the hopping rate to each of its nearest neighbors is r; if the walker has already visited this site in the past, it hops at rates 1 to the left and to the right.

A. Average time to cover N sites

A quantity which is most readily computable is the average time TN for the random walk to cover N sites. The total number of distinct sites visited by the random walk grows with time, and let TN be the time when the number of distinct sites first reaches N + 1. This particular definition of TN makes all formulas simpler, and one can think of TN as the time when the walker first visits N distinct sites not counting the one with the random walker on it. Graphically, graphic file with name nihms-49257-f0005.jpg The walk starts at some site and hence T0 = 0; all the following times 0 < T1 < T2 < … are random quantities. We shall actually compute the first two moments of TN. The first moment (the average) is

TN=N(N1)4+N2r 1

To derive (1) we first recall how to compute the mean exit time from the interval (0, M) for the random walker hopping at unit rates to the right and to the left. By definition, the exit time T(x) is the time for the random walker to reach any of the two boundaries, 0 or M, given that the walker starts at site x. The procedure (explained e.g. in [14]) is to analyze changes in an infinitesimal time interval dt. One writes

T(x)=dt+{T(x) prob 12dtT(x+1)prob dtT(x1)prob dt

and making the averaging one arrives at

T(x)=dt+(12dt)T(x)+dt[T(x+1)+T(x1)] 2

for the mean exit time, which simplifies to

T(x+1)2T(x)+T(x1)=1 3

Solving this equation subject to the boundary conditions T(0) = T(M) = 0 we obtain

T(x)=x(Mx)2 4

Returning to our problem, let us compute the time interval τN during which the number of visited sites jumps from N + 1 to N + 2. Without loss of generality we can assume that at TN the visited sites are 1, …, N +1, and the random walk is at site N + 1. Similarly to (2) we write an equation for the mean duration

τN=(12rdt)τN+rdtN+dt 5

The factor N in the middle term on the right-hand side was derived by taking into account that if the random walker hops to the left to site N, the number of distinct sites will increase when the random walk reaches either site 0 or N + 2. This latter problem is equivalent to the above exit problem after identifying M = N + 2 and x = N, and (4) shows that the mean exit time is N. From Eq. (5) we obtain

τN=12r+N2 6

Using

TN=j=0N1τj 7

in conjunction with (6) we arrive at (1).

If the hopping rates were equal to unity everywhere (no difference between the substrate and the product), we would have

TNuniform=N(N+1)4 8

instead of (1). Comparing these two results we see that only the sub-leading corrections (linear in N) are different. Hence the non-Markovian nature has an asymptotically negligible influence on TN for the random walk.

Other calculations have led to the same conclusion. For instance, we have computed TN2 and we found

TN2=112N4+(14r112)N3+(1614r+14r2)N2+(14r216)N 9

For r = 1 (no memory) equation (9) becomes

TN2uniform=112N4+N3+N26+112N 10

i.e. it has the same leading behavior.

In the r1 limit, memory is very important in the earlier stage; the crossover time tc after which memory becomes irrelevant is estimated by equating the linear and quadratic (in N) terms in (1). This gives

tcr1 11

Thus the memory does not affect the asymptotic behavior of TN for the nearest-neighbor random walk.

This conclusion may be affected by the gait of the walk. Unfortunately, even for the random walk it is hard to probe complicated gates when we take memory into account. For instance, if the random walk makes ±1, ±2 hops, say all at the same rates, the problem seems intractable. The above approach allows to compute TN only when the random walk hops to nearest-neighbor sites or gets back to its original site. For such gait we found

TN=N24+(13r112)N 12

Without memory (r = 1) we recover (8). By comparing (12) to (8) we see that the influence of memory on TN for this random walk with modified gait is again asymptotically irrelevant.

B. Growth of the average number of visited sites

We now turn to a more general excited walk; we shall use these results for the bipedal spider in Sec. IIIC. An excited walk is defined as follows: When the walker leaves a particular site for the first time it hops at rate f forward (to unvisited sites) and at rate b backward (to revisit sites). When the walk hops from a site it had already visited earlier, it hops at rates one in both directions. For concreteness we define its first ever hop from the origin to the right at rate f and to the left at rate b. This special rule, however, is irrelevant in the long time limit which is our main focus. Note also that the above definition slightly differs from the definition of the excited random walk studied in Refs. [810].

On the already visited sites, the walker performs a simple symmetric random walk. In the following analysis, we will need certain results for the simple random walk. Particularly, for the walker on the interval [0, k] we will need the exit probability density gk(t|m) that the walker which starts at site m inside the interval, 0 < m < k, exits at any of the two end sites of the interval during the time interval (t, t + dt). An elementary derivation of this probability density (more precisely, of its Laplace transform) is given in Appendix A.

One can try to repeat the calculation of TN in this more general situation. Instead, we consider a dual quantity, namely we shall compute how the average number of visited sites increases with time. Suppose that exited walk had already visited k − 2 sites and it has just hopped to a new site. Let the already visited sites be 1, …, k − 2, and let the random walker be currently at x = k − 1. (The latter assumption is acceptable since the rules are symmetric. This is not true for the excited random walk of Refs. [810].) The probability density that the walker will visit the next new site (either site 0 or site k) exactly t times later is

Qk(t)=fe(b+f)t+b0e(b+f)tgk(tt|2)dt 13

The first term in this expression accounts for a direct hop forward from site k − 1 to k, since the walk stays at its original position k − 1 for time t with probability exp[−(b + f)t]. The second term describes the walk stepping backwards to site k − 2 after time t′, doing a symmetric walk on sites 1, …, k −1, and exiting this interval at either ends at time t with probability density gk(tt′|2). The Laplace transform of (13) is

Qk(s)=f+bgk(s|2)s+b+f 14

[The Laplace transform of the general exit probability density gk(s|m) is given by (A5).]

The above argument works also for the first ever step of the walk with our definition. If we prefer the assumption of hopping at rate f in both directions for the first time, we have to use Q2(s) = 2f/(s + 2f). This, however, does not affect the long time behavior.

The probability density that the walk visits the nth new site at time t is

Fn(t)=k=2nQk(tk)dtk 15

where t2 + t3 + … + tn = t. The Laplace transform of the convolution (15) is

Fn(s)=k=2nQk(s) 16

One anticipates that the main contribution in the long time limit comes from terms with kt. This suggests to take the s → 0 limit while keeping sk finite. In this limit gk(s|m) is given by (A7) and hence (14) becomes

Qk(s)=1as tanhsk2 17

where we have used the shorthand notation

a=21+f/b 18

Note that a and consequently the whole distribution Fn(s) only depends on the ratio f/b of the hopping rates. The no bias case corresponds to a = 1, which coincides with any symmetric hopping f = g. This has an interesting consequence — if the walk is only slowed down at new sites (but not biased), then it has no effect on its long time behavior, in agreement with the conclusions of the previous subsection II A. Note also that 0 ≤ a ≤ 2.

Now we return to the calculation of Fn(s). The product in (16) can be re-written as

Fn(s)=exp(k=2nln Qk(s)) 19

In the s → 0 limit we use (17) to obtain

Fn(s)=exp(ask=2ntanh sk2) 20

In the same limit the sum becomes an integral

0ntanhsk2dk=2sln cosh sn2 21

and hence

Fn(s)=(coshsn2)2a 22

At this point we recognize that this generating function is identical (up to a change in notation) to the one obtained in [9] for a discrete time excited walk on the half infinite line. This allows us to use the results of Ref. [9].

Note that Fn(s=0)=0Fn(t)dt=1 since the walk will cover any number of sites with probability one. The cumulants [7] can be directly obtained from the generating function Fn(s) to yield

κ=aCn2 23

with

C=81/3(122)(1)!π2ζ(2) 24

For example, C1 = 1/4, C2 = 1/48, C3 = 1/240. The average time to cover n sites, and its fluctuation, are

κ1=t=an24,κ2=t2t2=an448 25

Note that these exact expressions have been already derived and given by equations (1) and (9) in the symmetric a = 1 case.

The distribution Fn(t) attains a scaling form

Fn(t)=t1Φ(μ),μ=nt 26

in the scaling limit n, t → ∞ with the scaling variable μ kept finite. The scaled distribution ф(μ) has been computed in Ref. [9], which in the present notation reads

Φ(μ)=μ22a1 πk=0(2ak)(k+a)eμ2(k+a)2/4 27

The average number of visited sites at time t is

n=0nFn(t)dn0Fn(t)dn=A(a)t 28

with

A(a)=20dμ Φ(μ)=2Γ(a)Γ(a+1/2) 29

where we have integrated (27) term by term, and used the duplication formula for Gamma functions. Note that A(a) is a monotonically decreasing function for a > 0. (This can be proved e.g. by differentiating ln A(a) and using the properties of the di-gamma function.)

In the no-bias case (a = 1), equation (29) reduces to the well known result [15] for the amplitude A(1)=4/π. In the limits of strong forward and backward biases, the amplitude behaves as

A(a)={2aπ+O(1) for a183π+O(2a) for 2a1 30

Note that the naive inversion of the average time (25) would result in the incorrect A=2/a.

III. BIPEDAL SPIDER

Here we compute various characteristics of the simplest bipedal spider with s = 2. We shall see that memory affects the leading behaviors, and we will explain these findings by showing that the slowdown of legs on new sites leads to effective outward bias when the spider is at the boundary of the island of visited sites.

A. Average time to cover N sites

Since the spider configuration is ●○●̂ when a new site is first visited, it is convenient to start from the same type of configuration graphic file with name nihms-49257-f0007.jpg This assumption simplifies the calculation, but does not alter the asymptotic behavior.

Denote by τN the time interval during which the number of visited sites jumps from N + 3 to N + 4; then the number of visited sites first reaches N +3 at the moment TN = Σ0≤j≤N−1 τj. One can alternatively think about TN as the time at which the spider first visits N distinct sites not counting the three sites the spider occupies at that time. We can assume that at TN the visited sites are 1, …, N + 3, the spider's legs are at sites N + 1 and N + 3. Thus the configuration at time TN is

graphic file with name nihms-49257-f0008.jpg 31

Similarly to (5) we write

τN=[1(1+r)dt]τN+rdt3(2N+1)2+dtτ^N+dt 32

for the mean duration. The second term on the righthand side accounts for the right leg of the spider hopping from site N + 3 to N + 2. The site N + 3 turns from the substrate into the product and the configuration becomes graphic file with name nihms-49257-f0009.jpg We term the mean position of the legs as the “center of mass” of the spider. Note that the center of mass of the spider is at position N + 3/2 and it makes ±1/2 hops with unit rates. The visit of sites 1 or N + 3 by the center of mass corresponds to visiting sites 0 or N + 4 by the spider. The mapping to the random walk allows us to use (4) with M = 2N + 4 and x = 2N + 1 to yield 3(2N + 1)/2 for the average time.

The third term on the right-hand side of Eq. (32) accounts for the left leg of the spider hopping from site N + 1 to N + 2. Starting from this position, the mean duration time τ^N before a new site is visited is found by writing an equation similar to Eq. (32)

τ^N=[1(1+r)dt]τ^N+dtτN+dt 33

Simplifying Eqs. (32)–(33) we obtain

(1+r)τN=r3(2N+1)2+τ^N+1 34a
(1+r)τ^N=τN+1 34b

from which we find

τN=1r+3(2N+1)21+r2+r 35

Plugging (35) into the sum (7) we obtain

TN=31+r2+rN22+Nr 36

If there is no difference between the substrate and the product and the hopping rates are equal to r everywhere, the average time is equal to

τNuniform=N(N+1) 37

Hence memory is important for spiders as it affects the leading asymptotic behavior of TN.

B. Effective bias

We now argue that a slowdown at new sites leads to an effective bias for bipedal spiders. Imagine that the spider is in configuration (31). Analyzing various routes, we obtain that the right leg first hops to the left with probability

q=r1+r+11+r11+rq 38

and otherwise it hops to right with probability p = 1 − q. An explanation of (38) is as follows. The first term on the right-hand side describes the probability that the first leg which is going to hop will be the right leg — this happens with probability r1+r and the leg ought to hop to the left. If the first leg to hop is the left leg — this happens with probability 11+r — it then has to hop back (the probability is again 11+r) and then we are back to the initial configuration thereby providing us with factor q. Solving (38) we find

q=1+r2+r,p=12+r 39

The probabilities p and q are somewhat misleading, e.g. if the right leg first hops to the left, the spider configurations are different — in the starting one graphic file with name nihms-49257-f0010.jpg the legs are separated, while in the final configuration graphic file with name nihms-49257-f0011.jpg the legs are in adjacent sites. It is easier to appreciate the probabilities p+ and p that the spider first moves forward and backward, respectively, while the configuration remains the same.

Let us rephrase this definition. First note that knowing the position x of the center of mass fully determines the position of both legs of the bipedal spider with s = 2. We assign x = 0 to the initial configuration (31). The hopping of any leg changes the position of the center of mass by ±1/2, hence x takes half-integer values. We are interested in the probability p+ that the center of mass moves forward by one lattice site to x = 1 first, and in the probability p = 1 − p+ that it moves backward to x = −1 first.

We already know the probabilities of the first hop of the right leg (39). With probability p the right leg hops to the right first, and the spider is already at x = 1. On the other hand, the right leg hops to the left first with probability q, which puts the spider at x = −1/2. The spider then performs a simple random walk on the half integers, and reaches x = 1 first with probability 1/4. Collecting these terms we arrive at

p+=p+q4=12+1r4(2+r) 40

and p = 1 − p+. Thus the spider moves symmetrically when r = 1 (no memory), and as r decreases, the bias forward increases monotonically. The bias is the strongest in the r → 0 limit when p = 3/8 and p+ = 5/8.

C. Growth of the average number of visited sites

We now apply the above results to the bipedal spider. First note that the center of mass of the spider steps from already visited sites at rate one in both directions, but with a step length 1/2. Therefore it visits only half as many sites of the original lattice as a particle whose step length is one. Thus asymptotically N=A2(r)t with

A2(r)=A(r)2 41

and A2(r=1)=2/π exactly. We arrive at the same conclusion by replacing our spider by an effective particle which hops at rate 1/2 in both directions on the original lattice. Here the effective particle corresponds to the center of mass of the spider only when its legs are separated. From new sites, however, the effective particle hops predominantly in the forward direction, with the ratio of the forward and backward rates being the ratio of the forward and backward probabilities of the spider, f/b = p+/p. Using (40) we conclude that parameter a, see (18), becomes

a=3(1+r)2(2+r) 42

Using (29) and (41)–(42) we arrive at the prediction for the average number of visited sites at time t:

N=A2(r)t,A2(r)=Γ(3+3r4+2r)Γ(5+4r4+2r) 43

The amplitude A2(r) depends only weakly on r in the 0 < r ≤ 1 region. As r increases from 0 to 1, A2(r) decreases monotonically from

A2(r=0)=Γ(3/4)Γ(5/4)1.3519

to A2(r=1)=2/π1.1283.

Fig. 2 shows an excellent (perhaps exact) agreement between the above theory and simulation results. Thus both the spider and the exciting random walk whose bias is exactly equal to the bias of the spider apparently exhibit the same asymptotic growth of the average number of distinct visited sites N. This is easy to appreciate heuristically. Recall that for the excited random walk, only the ratio of forward and backward hopping rates appears in the asymptotic of N. This implies a substantial universality — only the magnitude of bias matters. Details of the spider are different from the exciting walk, but they seem irrelevant as long as the bias is exactly computed.

FIG. 2.

FIG. 2

The amplitude A in the growth law N=At as a function of r. For the bipedal L = 2 spider with maximal distance between the legs s = 2, simulations are in excellent agreement with the theoretical prediction (43). The same is also valid for the L = 3 spider with S = 3, where the theoretical prediction is given by (49).

D. Diffusion coefficient

Up to now we have probed the spider's motion through the trace it leaves — the number of visited sites. This is the quantity which is measured in experiments [5]. Interestingly, a direct analysis of the spider motion is much more challenging both experimentally and theoretically.

One can of course proceed numerically. According to simulations, the position of the spider (averaged over many runs) follows a Gaussian distribution in the large time limit. This is somewhat counterintuitive since the spider slows down at the extreme positions. Overall this is a greatly simplifying feature since if the position is (asymptotically) Gaussian, we can characterize the spider's motion solely by its diffusion coefficient D.

Since the Gaussian distribution gives the position of the spider in the large time limit, one might try to approximate its diffusion coefficient through the relation N=4Dt/π which is valid for simple random walks. This leads to

DπA216 44

For r = 1 (no memory), the relation D = πA2/16 is exact. For r < 1, however, it underestimates the spider's diffusion coefficient, see Fig. 3.

FIG. 3.

FIG. 3

Simulation results for the diffusion coefficient as a function of rate r for the bipedal L = 2 spider with maximal distance between the legs S = 2, and for the L = 3 spider with S = 3. The dashed lines are the approximation (44).

IV. MULTI-PEDAL SPIDERS

Spiders with an arbitrary number of legs L are difficult to analyze. Here we consider global spiders which (by definition) satisfy the following constraint — the distance between any two legs is ≤ S. Since the site cannot accommodate more than one leg SL−1, and the spiders with S = L−1 are immobile, so the first interesting case corresponds to S = L. These global spiders include the bipedal spiders studied in Sec. III. We now show that these global spiders can be analyzed generalizing the approach of Sec. III.

First we notice that knowing the center of mass position of a spider with S = L we can read off the position of all its legs. Hence as for the L = 2 case we can replace the spider by an effective particle whose position coincides with the center of mass of the spider. The step length of this effective particle is 1/L. Consider a configuration graphic file with name nihms-49257-f0012.jpg where the right leg has just stepped to a new site. We are interested in the first passage probability p+ for the effective particle to move one whole lattice spacing (L steps) to the right before moving one lattice spacing to the left. Denote again by q the probability that the rightmost leg will first hop to the left. Repeating an argument that led us to (38), we arrive at

q=r1+r+11+rεq,ε=1+(L2)r1+(L1)r 45

Here ε is the probability that after the right hopping of the second leg from the right, the spider returns to the initial configuration without ever moving the rightmost leg. This simple first passage problem can be solved, see e.g. [7], to obtain (45). From (45), the probability that the rightmost leg first hops to the left is

q=1+(L1)r2+(L1)r 46

The probability that the rightmost leg first hops to the right is p = 1 − q. Following the same reasoning as in the L = 2 case, we finally arrive at

p+=12+(L1)(1r)2L[2+r(L1)] 47

and p = 1 − p+. As r increases from 0 to ∞, the bias to the right p+ decreases monotonically, and the motion is unbiased p+ = 1/2 for the neutral r = 1 case. Hence, a slowdown at new sites (r < 1) leads to the outward bias for arbitrary L. It is also clear from (47) that the bias is weaker for larger spiders.

To calculate the number of visited sites we first obtain

a=L+1L1+(L1)r2+(L1)r 48

from (47). Recalling (29) and taking into account the normalization AL = A/L we find the amplitude

AL(r)=2LΓ(a)Γ(a+1/2) 49

in the growth law N=AL(r)t. An agreement between theory and simulations is excellent, see Fig. 2.

Equation (48) shows that a = a(r, L) is an increasing function of both r and L. Since Γ(a)/Γ(a + 1/2) is a decreasing function of a, we conclude that for L = S global spiders the number of visited sites decreases as r or L grows.

We believe that this is generally true for arbitrary spiders, i.e. A decreases as r or L grows. Intuitively it is easy to understand the monotonic L dependence, as an additional leg only blocks the motion of the others. The r dependence of A is more intriguing, since slowing down the spider actually results in more visited sites. One could argue that the slowdown at the boundary keeps the spider around the unvisited sites, hence it visits more of them. This argument, however, is flawed since the slowdown does not effect the behavior of a random walk as we saw it in Sec. II.

Here is an argument that supports the above conjecture. When the rightmost leg steps to a new site it has to stretch out a bit. If legs slow down on new sites, it gives other legs time to catch up, which leads to the outward bias. Another argument is that the when the rightmost leg stays longer at the new site, the second leg has more chance to step next to it, hence forcing the rightmost leg to the right. On the other hand, when the rightmost leg steps to a new site, the spider is a bit stretched out and possibly cannot step further due to some constraint (e.g. local or global). When the rightmost leg stays in place longer, it gives more time to the other legs to step closer, allowing the rightmost leg to step to the right, hence it leads again to a bias to the right.

V. CONCLUSION

In experimental realizations [5], spiders affect the environment which in turn affects their motion. More precisely, the state of any site on a surface changes irreversibly after the first visit of a leg of a spider and then it remains the same. During the subsequent visits legs are bound less firmly to previously visited sites, and we model this feature by prescribing different hopping rates from sites visited in the past and sites visited for the first time. Namely a leg slows down on new sites while the spider remains agile inside a previously visited region. Thus the knowledge of the spider's entire trajectory is required to describe its motion.

First, we considered a random walk (a one-leg spider) that slows down on the new sites. We calculated first two moments, TN and TN2, of the time TN to cover N sites, and also the dual quantity N, that is the average number of sites visited until time T. All these quantities exhibit the same leading asymptotic behavior as in the case of a simple random walk without slowing down. These concrete results clearly reveal the general truth, namely that the non-Markovian nature of the problem affects only sub-leading behaviors of the random walk with slowing down. All exact calculations support this conjecture, and we think that it is valid even in two dimensions. Heuristically, the above hypothesis relies on the crucial feature of a random walk in one and two dimensions — the random walk keeps visiting the same sites again and again (the average number of visits diverges with time), and therefore the slowing down on new sites becomes asymptotically less and less important.

The above argument seemingly applies to spiders as well, so it was surprising that the computation of the average time TN to cover N sites for the simplest bipedal spider with s = 2 gave a result that differs from the corresponding result for the same bipedal spider without slowdown. Thus, the slowdown affects the behavior even in the leading order. The especially striking feature is that the spider is effectively more agile, that is, the number of covered sites grows faster than for the same spider without slowdown. The mechanism leading to this acceleration is an effective bias towards unvisited sites. This bias is not apparent since legs hop symmetrically. Yet once the front leg steps onto a new site, it slows down and the hind leg hops to the neighboring site thereby effectively pushing (due to exclusion) the front leg further into the unvisited region. This argument does not fully explain the bias as the hopping processes are stochastic, but it does make the bias plausible. For the bipedal sider with s = 2, and more generally for the class of multi-pedal spiders with the maximal span equal to the number of legs (S = L), we computed the bias analytically. We believe that the bias exists for all spiders; numerically we confirmed this belief in all cases we looked at, but this is still a conjecture which may be very difficult to prove.

The bias explains the qualitative behavior of both TN and N. The latter quantity is particularly important as it is directly measured in experiments [5]. We computed N by replacing the spider by an excited walk, that is a walk that hops symmetrically from previously visited sites but biased towards the unvisited region when hopping from new sites. For this excited walk we calculated not only the average, but the whole probability distribution of the number of visited sites. We argued that the replacement of the spider by an excited random walk gives asymptotically exact results for the average number of distinct sites N visited by a spider.

Apart from TN and N, we tried to compute the diffusion coefficient. Numerically we observed that the probability distribution of the spider position approaches a Gaussian distribution in the long time limit, and therefore the diffusion coefficient fully characterizes the probability distribution of the spider position. We neither proved that the probability distribution is (asymptotically) Gaussian nor computed the diffusion coefficient. This is an interesting challenge for future work.

Finally we note that models of synthetic molecular spiders are closely related to models of natural molecular motors [1622]. Memory effects can occur when motors change their environment, which then affects the motors motion. (This happens e.g. for the collagenase which moves along collagen fibrils [23].) The non-Markovian nature of these situations makes them quite challenging. Nevertheless, some problems of this type have been analyzed[12, 24, 25]. It should be possible to employ the methods developed here in the context of molecular spiders, and to imply them to the behavior of molecular motors in situations when memory effects are important.

FIG. 1.

FIG. 1

The scaled distribution (27) of the number of visited sites by the exited walk for several values of bias.

Acknowledgments

We are very grateful to Kirone Mallick for numerous helpful suggestions, and to Milan Stojanovic for stimulating questions. We also acknowledge financial support to the Program for Evolutionary Dynamics at Harvard University by Jeffrey Epstein (TA), NIH grant R01GM078986 (TA), and NSF grant CHE0532969 (PLK).

APPENDIX A: ESCAPE OF A SYMMETRIC RANDOM WALK FROM AN INTERVAL

We have a random walk on the integers which hops at rate one to the left, and at rate one to the right. The walk starts at time t = 0 from site m, where 1 ≤ mk − 1, and site 0 and k are absorbing. We are interested in the first passage probability distribution to reach either site 0 or k for the first time.

The position of the walk on the infinite lattice at time t is given by [7]

q(n,t|m)=e2tI|nm|(2t) A1

where Iv is the modified Bessel function of the first kind. We then do a mirror trick. We start infinitely many walkers at time t = 0 from positions n = 2ak + m, and ‘negative’ walkers from n = 2akm, where a runs over all integers. Now their superposition gives the probability distribution of the walker in the interval

p(n,t|m)=a=[q(n,t|2ak+m)q(n,t|2akm)] A2

since, from symmetry reasons, p(0, t) = p(k, t) = 0 for all time.

The Laplace transform of the position distribution q(n, t|m) is

Q(n,s|m)=(s+2s(s+4))|nm|2|nm|s(s+4)=c|nm|d A3

where we have used shorthand notation

c=1+ss(s+4)2,d=s(s+4) A4

Hence we have to sum up a geometric series to obtain the Laplace transform of (A2)

P(n,s|m)=c|nm|cn+md+(cmcm)(cncn)d(1c2k)

The first passage probability to site 0 and site k is simply

L(s|m)=P(1,s|m)=(cc1)(cmkckm)d(ckck)R(s|m)=P(k1,s|m)=(cc1)(cmcm)d(ckck) A5

respectively, and the probability to leave the interval at either end is gk(s|m) = L(s|m) + R(s|m).

We are interested in the s → 0, k → ∞ limit, with ks and m being constants. Noting that in this limit

cm1ms+O(s)cmcm2ms+O(s3/2)d2s+O(s3/2)ckeks+O(s)

we establish the expansions

L(s|m)1ms cotanh ks+O(s)R(s|m)ms cosech ks+O(s) A6

Therefore the exit at either side of the interval becomes

gk(s|m)1ms tanhks2+O(s) A7

This asymptotic has been used in subsection II B.

Finally we notice that the diffusion approximation [14] of this simple random walk

L˜(s|m)=sinh(km)ssinh ks,R˜(s|m)=sinh mssinh ks

leads to the exact asymptotic behavior (A6) in the leading order. This property has been exploited in [9].

References

  • 1.Yan H, Zhang X, Chen Z, Seeman NC. Nature. 2002;415:62–65. doi: 10.1038/415062a. [DOI] [PubMed] [Google Scholar]; Sherman WB, Seeman NC. Nano Lett. 2004;4:1203–1207. [Google Scholar]; Seeman NC. Trends Biochem. Sci. 2005;30:119. doi: 10.1016/j.tibs.2005.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Shin J-S, Pierce NA. J. Am. Chem. Soc. 2004;126:10834. doi: 10.1021/ja047543j. [DOI] [PubMed] [Google Scholar]
  • 3.Shu WM, et al. J. Am. Chem. Soc. 2005;127:17054. doi: 10.1021/ja0554514. [DOI] [PubMed] [Google Scholar]
  • 4.Kay ER, Leigh DA, Zerbetto F. Angew. Chem. Int. Ed. 2007;46:72–191. doi: 10.1002/anie.200504313. [DOI] [PubMed] [Google Scholar]
  • 5.Pei R, Taylor SK, Stefanovic D, Rudchenko S, Mitchell TE, Stojanovic MN. J. Am. Chem. Soc. 2006;128:12693. doi: 10.1021/ja058394n. [DOI] [PubMed] [Google Scholar]
  • 6. For Markov processes, the future is determined by the present [7].
  • 7.Van Kampen NG. Stochastic Processes in Physics and Chemistry. Elsevier; Amsterdam: 2003. [Google Scholar]
  • 8.Perman M, Werner W. Probab. Theory Related Fields. 1997;108:357–383. [Google Scholar]; Davis B. Probab. Theory Related Fields. 1999;113:501–518. [Google Scholar]; Dickman R, ben-Avraham D. Phys. Rev. E. 2001;64:020102(R). doi: 10.1103/PhysRevE.64.020102. [DOI] [PubMed] [Google Scholar]; Benjamini I, Wilson DB. Elect. Commun. Probab. 2003;8:86–92. [Google Scholar]
  • 9.Antal T, Redner S. J. Phys. A. 2005;38:2555. [Google Scholar]
  • 10.Zerner MPW. Elect. Comm. in Probab. 2006;11:118. [Google Scholar]; Kozma G. arXiv:math.PR/0512535.; Basdevant AL, Singh A. arXiv:math.PR/0611580.; ibid. math.PR/0703275.
  • 11.Pemantle R, Volkov S. Ann. Probab. 1999;27:1368–1388. [Google Scholar]
  • 12.Mai J, Sokolov IM, Blumen A. Phys. Rev. E. 2001;64:011102. doi: 10.1103/PhysRevE.64.011102. [DOI] [PubMed] [Google Scholar]; Antal T, Krapivsky PL. Phys. Rev. E. 2005;72:046104. doi: 10.1103/PhysRevE.72.046104. [DOI] [PubMed] [Google Scholar]; Morozov A. Yu., Pronina E, Kolomeisky AB, Artyomov MN. Phys. Rev. E. 2007;75:031910. doi: 10.1103/PhysRevE.75.031910. [DOI] [PubMed] [Google Scholar]; Artyomov MN, Morozov A. Yu., Pronina E, Kolomeisky AB. arXiv:0705.0690.
  • 13.Antal T, Krapivsky PL, Mallick K. arXiv:0705.2594.
  • 14.Redner S. A Guide to First-Passage Processes. Cambridge University Press; New York: 2001. [Google Scholar]
  • 15.Weiss GH. Aspects and Applications of the Random Walk. North-Holland; Amsterdam: 1994. [Google Scholar]
  • 16.Howard J. Mechanics of Motor Proteins and the Cytoskeleton. Sinauer Associates; Sunderland, MA: 2001. [Google Scholar]
  • 17.Parmeggiani A, Franosch T, Frey E. Phys. Rev. Lett. 2003;90:086601. doi: 10.1103/PhysRevLett.90.086601. [DOI] [PubMed] [Google Scholar]
  • 18.Evans MR, Hanney T, Kafrí Y. Phys. Rev. E. 2004;70:066124. doi: 10.1103/PhysRevE.70.066124. [DOI] [PubMed] [Google Scholar]
  • 19.Klumpp S, Müller MJI, Lipowsky R. In: Traffic and Granular Flow ’05. Schadschneider A, et al., editors. Springer; Berlin: 2007. pp. 251–261. [Google Scholar]
  • 20.Greulich P, Garai A, Nishinari K, Schadschneider A, Chowdhury D. arXiv:physics/0612054. [DOI] [PubMed]
  • 21.Mobilia M, Reichenbach T, Hinsch H, Franosch T, Frey E. arXiv:cond-mat/0612516.
  • 22.Klumpp S, Lipowsky R. Proc. Natl. Acad. Sci. USA. 2005;102:17284. doi: 10.1073/pnas.0507363102. [DOI] [PMC free article] [PubMed] [Google Scholar]; Vershinin M, et al. Proc. Natl. Acad. Sci. USA. 2007;104:87. [Google Scholar]
  • 23.Saffarian S, Collier IE, Marmer BL, Elson EL, Goldberg G. Science. 2004;306:108–111. doi: 10.1126/science.1099179. [DOI] [PubMed] [Google Scholar]; Saffarian S, Qian H, Collier I, Elson E, Goldberg G. Phys. Rev. E. 2006;73:041909. doi: 10.1103/PhysRevE.73.041909. [DOI] [PubMed] [Google Scholar]
  • 24.Qian J, Xie P, Dou SX, Wang PY. J. Theor. Biol. 2006;243:322–327. doi: 10.1016/j.jtbi.2006.06.002. [DOI] [PubMed] [Google Scholar]
  • 25.Oshanin G, Klafter J, Urbakh M. Europhys. Lett. 2004;68:26. [Google Scholar]

RESOURCES