Abstract
Stochastic models of reaction-diffusion systems are important for the study of biochemical reaction networks where species are present in low copy numbers or if reactions are highly diffusion limited. In living cells many such systems include reactions and transport on one-dimensional structures, such as DNA and microtubules. The cytoskeleton is a dynamic structure where individual fibers move, grow, and shrink. In this paper we present a simulation algorithm that combines single molecule simulations in three-dimensional space with single molecule simulations on one-dimensional structures of arbitrary shape. Molecules diffuse and react with each other in space, they associate with and dissociate from one-dimensional structures as well as diffuse and react with each other on the one-dimensional structure. A general curve embedded in space can be approximated by a piecewise linear curve to arbitrary accuracy. The resulting algorithm is hence very flexible. Molecules bound to a curve can move by pure diffusion or via active transport, and the curve can move in space as well as grow and shrink. The flexibility and accuracy of the algorithm is demonstrated in five numerical examples.
INTRODUCTION
It has been established that modeling biochemical reaction networks with a macroscopic deterministic model, for instance with ordinary differential equations (ODEs), can be unsuitable when some species are present in low copy numbers or if reactions are highly diffusion limited.1, 2 Such systems may only be accurately modeled by a discrete stochastic model. Usually two levels of stochastic modeling are considered: the mesocopic level and the microscopic level.
The mesoscopic scale is described by the chemical master equation (CME) which is a discrete, stochastic model. Molecules are treated as discrete entities but assumed to be well stirred with a uniform distribution within the simulation volume. The copy number of each species is computed but the position of individual molecules is not tracked. Exact trajectories of such a system can be generated with the stochastic simulation algorithm (SSA) by Gillespie.3
Systems for which the condition of spatial homogeneity does not hold can be modeled at the mesoscopic level by subdividing the domain into smaller compartments, where in each compartment the well stirred condition is assumed to hold. Diffusion can now be modeled as discrete jumps between the compartments. The governing equation of the system is the reaction-diffusion master equation (RDME) and exact trajectories can be computed with the next subvolume method (NSM),2 which is an efficient implementation of the SSA in a spatial setting. Several software packages have been developed for simulating systems at the mesoscopic level, including MesoRD,4 URDME,5 and STEPS.6 The spatial accuracy of the RDME depends on the size of the voxels. If the voxels are too large we might not resolve the spatial dynamics of the system accurately, but on the other hand it has also been shown that when the size of the voxels becomes too small the accuracy deteriorates.7, 8, 9 By allowing the reaction rates in the system to depend on the size of the voxels in a general way, accuracy can be retained, but at most up to a critical lower bound of the voxel size, at which point the model breaks down regardless of how the reaction rates are chosen.8, 9, 10 The mesoscopic model may therefore be unsuitable for systems where a very high spatial accuracy is required to capture important dynamics.
The microscopic level usually refers to models where the positions of single molecules are tracked. Either we compute the continuous position in space or let the individual molecules move on a lattice, as in Ref. 11. Reactions occur with some probability per unit time when molecules collide or when they are within some prescribed reaction radius. There are two models that have been used extensively to model systems at the microscopic scale: the Smoluchowski model12 and the Doi model.13, 14 In this paper we will be only concerned with the Smoluchowski model.
In Ref. 15 they show that fine-grained spatial correlations can affect the macroscopic behaviour of a system. Furthermore, the dynamics of the system cannot be captured on the macroscopic or mesoscopic level but rather require a microscopic level of simulation. This is due to highly diffusion-limited reactions and fast rebindings that the mesoscopic model cannot resolve without hitting the critical lower bound of the voxel size. This motivates development of efficient methods for simulating systems at the microscopic level.
The Green's function reaction dynamics (GFRD) algorithm16 is an efficient method to generate accurate trajectories of the Smoluchowski model. A full N-body problem is divided into many 1- and 2-body problems that can be simulated independently by choosing the time step in the algorithm sufficiently small. For these subproblems the Smoluchowski equation can be solved analytically or numerically. Improvements of GFRD have been developed in Refs. 15 and 17. Sampling random numbers from the analytical solution of the Smoluchowski equation is complicated and expensive, but in Ref. 17 this step is simplified by solving the Smoluchowski equation using first order or second order operator splitting schemes. In Ref. 15 the algorithm is made exact in both space and time by combining GFRD with the first-passage kinetic Monte Carlo (FPKMC) algorithm developed in Ref. 18, and this improved algorithm is called eGFRD. The main idea in eGFRD is to introduce protective spheres around single molecules and pairs of molecules and then sample exact times for the molecules to exit the spheres.
Software that implements eGFRD is E-Cell.19 Other software packages that simulate systems with the Smoluchowski model as the target model are Smoldyn,20 MCell,21 and Spatiocyte.11 Both Smoldyn and MCell discretize time while GFRD is exact in time, thus making them less accurate. On the other hand, Smoldyn and MCell can be significantly more efficient for larger systems. In Spatiocyte molecules move on a lattice of densely packed spheres.
Instead of simulating the system with a purely microscopic method one can combine the microscopic level and the mesoscopic level in a hybrid method.22, 23, 24 By restricting the microscale simulations to the species or the parts of space that are necessary for capturing the important dynamics of the system, computational time can be saved without losing a significant amount of accuracy.
In addition to the observed spatial heterogeneity in systems where molecules move only by pure diffusion in free space, there can be spatial effects in systems due to interactions with internal lower dimensional structures in the cells. This could for instance be reactions between molecules and curved surfaces or one-dimensional structures such as DNA or microtubules, as well as reactions between molecules bound to such structures. In Ref. 22 a method for simulating interactions with curved surfaces at the microscopic level is developed, and in Ref. 25 they develop an FPKMC method for simulating active transport due to chemical potentials on embedded lines of fixed length and with fixed configuration. A mesoscopic method that simulates interactions between molecules in free space and lines is considered in Ref. 26 and a mesoscopic method incorporating active transport is suggested in Ref. 27
In this paper we have developed an algorithm that combines single molecule simulations in three-dimensional space with single molecule simulation on embedded curves. Molecules in some general three-dimensional volume, for instance a sphere or a cylinder, are simulated with the GFRD algorithm. They can react with the embedded curves according to the Smoluchowski model. Molecules attached to a curve can also diffuse and react with each other according to the one-dimensional Smoluchowski equation. In addition to pure diffusion, we simulate active transport along the curves by letting the molecules bound to the curves move deterministically in one direction. The curves can have an arbitrary shape. This flexibility is obtained by approximating a continuous curve with non-constant curvature by straight linear segments, and this approximation can be made arbitrarily accurate by making the discretization of the curve finer and finer. It is also known that many one-dimensional structures in cells are highly dynamic.28 A microtubule may for instance both move in space as well as grow and shrink. We take those dynamics into account by letting the embedded curves move according to some general random process as well as letting them grow and shrink over time.
The following sections are organized as follows: In Sec. 2 we review the GFRD method in one dimension as well as three dimensions. In Sec. 3 we describe the algorithm for coupling the one-dimensional simulations with the three-dimensional simulations, as well as simulation of moving lines embedded in space. In Sec. 4 we demonstrate the flexibility of the algorithm with five examples. In the first example we verify the accuracy of the method by comparing with mesoscopic simulations and theoretical predictions. The second example concerns models that were studied experimentally in Ref. 29. In the third example we consider two spirals embedded in a sphere. In the fourth example we consider several straight lines embedded in a spherical domain. Molecules can bind to the lines and are then moved by active transport along the lines. The lines move in space according to a random process, and we show how the active transport affects the distribution of the molecules in space. In the final example we consider lines that grow and shrink. We show how the number of molecules attached to a line will vary with the total length of the lines.
SIMULATIONS IN THREE AND ONE DIMENSIONS
We consider a system of N molecules, where the molecules can undergo bimolecular or unimolecular reactions. The evolution of the system is modeled by the Smoluchowski equation with a mixed boundary condition at the reaction radius of the molecules. In principle one could write down the Smoluchowski equation for the full system, but due to the high dimension it would not be tractable to solve. In GFRD16 the main idea is to divide the full N-body problem into smaller subproblems consisting of the simulation of single molecules and pairs of molecules. The subdivision is done in two steps. First we find the nearest neighbor of every molecule. Molecules that have each other as nearest neighbors form pairs and the remaining molecules will be updated as single molecules. Second we choose a time step Δt such that no molecule in a pair is likely to react with any other molecule except its nearest neighbor and molecules that are not in a pair should be unlikely to react with any other molecule. The full N-body problem can now be approximated by smaller, independent, subproblems during the time step Δt.
Updating positions of single and pairs of molecules
The positions of single molecules are updated by sampling from a normal distribution with standard deviation in each direction, where D is the diffusion constant of the molecule. A pair of molecules, where the molecules have the positions x1n and x2n at time tn, are updated according to the probability distribution function (PDF) p(x1, x2, t|x1n, x2n, tn) obtained by solving the Smoluchowski equation for a pair of molecules, given by
(1) |
where Di and xi are the diffusion coefficient and position of particle i, and operates on the xi coordinate. By making a change of variables as suggested in Ref. 16 this equation can be split into two equations: one for the relative position of the molecules and one for a weighted mean of the positions
(2) |
Now p can be written as p(x1, x2, t|x1n, x2n, tn) = pY(Y, t|Yn, tn)py(y, t|yn, tn), and the equations become
where D = D1 + D2 is the sum of the diffusion coefficients of the molecules. The motion in the Y direction is in free space and will be described by a normal distribution, but the equation for the y coordinate will have an additional boundary condition at the reaction radius of the molecules to model associations between molecules. By considering the equation in a spherical coordinate system, r = (r, θ, ϕ), the boundary condition becomes
(3) |
where σ is the reaction radius of the molecules, r = ‖r‖ and kr is the intrinsic association rate. The initial condition is given by
and pr(r, t|rn, tn) vanishes as r → ∞.
This equation can be solved analytically,16, 30 but due to the complexity of the analytical solution we use the method described in Ref. 17. There the full equation is solved in two steps using a first order splitting scheme (or if higher accuracy is needed with a second order splitting scheme). This simplifies the sampling of new positions for the molecules, at the price of an additional error. This error is shown to be small in Ref. 17. The first step of the operator splitting scheme is to solve for the radial part of the equation (in a spherical coordinate system rotated such that rn = (rn, 0, 0)), given by
(4) |
with the initial condition pr(r, tn|rn, tn) = δ(r − rn) and with the boundary condition given by Eq. 3. The analytical solution is derived in Ref. 31. The second step is to solve for the angular part:
(5) |
with initial condition given by pθ(θ, tn|θn, ϕn, tn) = δ(θ)/r2sin (θ). The solution to Eq. 5 will be independent of ϕ, and the distribution of ϕ will therefore be uniform on the interval [0, 2π). It is possible to solve Eq. 5 analytically and the solution is given in the form of an infinite series, where the terms are products of Legendre functions and exponentials. Efficient strategies to compute the analytical solutions of Eqs. 4, 5 are discussed in Ref. 32. An efficient method to generate random numbers from the PDF pθ is suggested in Ref. 33.
Note that the probability for two molecules to survive until time tn + Δt is given by
From S we can sample the exact time when the pair of molecules react (cf. Ref. 16). Thus, the first step is to check whether the two molecules react before t + Δt. If they do, we execute the reaction at the sampled time, and if they do not react we sample new positions for each molecule by
Sample rn+1 from the PDF pr.
Sample θn+1 from the PDF pθ with r = rn+1.
Sample ϕn+1 from a uniform distribution on the interval [0, 2π).
Transform rn + 1 back to yn + 1. Sample Yn + 1 from a normal distribution and then solve Eq. 2 to obtain x1(n + 1) and x2(n + 1).
Also molecules in one dimension can react. For pairs of molecules we solve the one-dimensional Smoluchowski equation
(6) |
with the boundary condition at the reaction radius
(7) |
to obtain the PDF p(s, t|sn, tn) for the distance s at time t between the two molecules, given that the distance was sn at time tn. The analytical solution to Eq. 6 with boundary condition given by Eq. 7 is derived in Ref. 30.
Both molecules in one dimension and molecules in three dimensions can dissociate. Consider a reaction . The time until an A-molecule dissociates is sampled from an exponential distribution with mean kd. After a molecule A has dissociated the B- and C-molecules are placed at a distance equal to the reaction radius.
Molecules close to a curved boundary
Most biologically realistic geometries are fairly complex. Cells can be close to spherical or have a shape reminding of a spheroid. Eukaryotes typically have complex internal two-dimensional structures. One flexible and efficient way to deal with such non-regular surfaces is to discretize the surface with an unstructured mesh.
In this paper we do not consider volumes with internal two-dimensional structures, and are consequently only concerned with the outer boundaries of the volume. The outer boundary is approximated by a triangular surface mesh. The vertex of each triangle is also a center vertex in the dual mesh generated from the primal mesh. Consider a center vertex ψi. This center vertex is the vertex of several triangles T1, …, TM in the mesh, and with each dual element we associate one plane defined by the center vertex ψi and with normal given by the average of all the normals n1, …, nM to the triangles T1, …, TM. We have now assigned one plane to each dual element in the mesh, and these planes are an approximation of the surface.
Now, assume that a molecule is close to one of the planes approximating the surface. Choose a time step Δt small enough such that the interaction between the molecule and the plane can be approximated by the interaction between an infinite flat plane and a molecule during Δt. Then consider the change of coordinate system where the molecule diffuses parallel to the plane in two directions and along the normal of the plane in the third direction. The molecule can then be updated according to normal diffusion in the first two directions and in the third direction by sampling from a normal distribution and then, in case the molecule ends up on the wrong side of the plane, simply reflecting that coordinate in the plane.
This approach gives great flexibility to the algorithm and enables us to simulate molecules in different complicated geometries. The details of this approach is described in Ref. 22. Another method to simulate molecules close to an absorbing or reflecting boundary has been suggested in Ref. 34.
Note that by making a locally planar approximation we can also include reactions between molecules and surfaces by solving the one-dimensional Smoluchowski equation for molecules that are close to the surface, as done in Ref. 22. In this paper, however, we only consider reflecting outer boundaries, but the methodology for simulating interactions with reactive surfaces (outer boundaries as well as internal surfaces) developed in Ref. 22 can be combined with the method described in this paper. Furthermore, we could also combine this method with the hybrid method coupling microscopic and mesoscopic simulation algorithms, also developed in Ref. 22.
EMBEDDED ONE-DIMENSIONAL STRUCTURES
We consider a continuous and differentiable curve Γ embedded in space, parametrized by
Γ can be approximated by a piecewise linear curve γ. Denote the linear segments of γ by γ1, …, γN. The segment γi is defined by the points and . The points are chosen such that and thus all segments are connected. The curve Γ can be approximated to arbitrary accuracy by letting N → ∞ in such a way that .
We want to simulate reactions between molecules in space and the curve as well as diffusion and reactions between molecules on the curve. Furthermore, we want to simulate moving curves that grow and shrink.
Interactions with the curve
If a molecule A is close to γ, we will choose a time step Δt such that the molecule is unlikely to interact with more than one of the linear segments during that time step, and such that the linear segment can be approximated, to high accuracy, by an infinitely long straight line during Δt. The problem of simulating a molecule close to an infinitely long straight line can be reduced to a two-dimensional problem, which can be seen by considering the position of the molecule at time tn, xn, in cylindrical coordinates r = (r, θ, z), with the z-coordinate in the direction of the line. The diffusion of the molecule in the z-coordinate is now independent of the diffusion in the r and θ directions and the interaction with the line.
Assume that the closest linear segment to the molecule is γi. Now consider the sphere S with center at xn and radius . By choosing the time step Δt such that the molecule A is unlikely to diffuse outside of the sphere S during the time step, the requirement that the molecule is unlikely to react with more than one linear segment is satisfied. If the diffusion coefficient of A is DA, the molecule will, on average, diffuse the distance during a time step Δt. Thus, we choose the time step to be
where K is a constant chosen sufficiently large in order to ensure that the probability for the molecule to diffuse outside of the sphere is small. Note that if the molecule is really close to or the time step could become very small, making the simulation too slow. To avoid this problem we have to choose a smallest value that Δt can take, thus introducing an error. However, with a carefully chosen smallest value of Δt that error will be small.
The interaction between the molecule and the linear segment can now be approximated by the two-dimensional Smoluchowski equation during Δt. The two-dimensional Smoluchowski equation for the relative position (in cylindrical coordinates) is given by
(8) |
with the boundary condition
(9) |
Again we solve the equation in two steps with a first order operator splitting scheme. The radial part is given by
(10) |
with the boundary condition given by Eq. 9, the initial condition given by pr(r, tn|rn, tn) = δ(r − rn) and with pr vanishing at infinity. The solution of Eq. 10 with the boundary condition given by Eq. 9 is solved analytically in Ref. 30. The analytical solution is
where k = 2πσD and the function C is defined by
Here J0 and J1 are Bessel functions of the first kind and Y0 and Y1 are Bessel functions of the second kind. Despite having the analytical solution available we choose to compute the solution using a finite difference method, as the analytical solution is fairly complex and expensive to evaluate. The solution could also be tabulated as described in Ref. 32. Tabulating the solution is more efficient but has the drawback of introducing an additional error as well as consuming more memory.
The second step is to solve for θ and z
(11) |
with initial condition
Again, p vanishes at infinity. The solution of Eq. 11 is a Gaussian in both the z-direction and the θ-direction.
Molecules can also dissociate from the lines. The time until a dissociation is assumed to be exponentially distributed with mean equal to the intrinsic dissociation rate, and after a dissociation from a line the molecule is placed at a distance equal to the reaction radius between the molecule and the line. As we are using a finite difference scheme to compute the solution of Eq. 10 we are in fact forced to place the dissociating molecule at a slightly larger distance from the line than the reaction radius, in order to have discretization points between the molecule and the line. This introduces a small error, but it can be made arbitrarily small by placing the molecule closer and closer to the boundary while increasing the number of discretization points used in the finite difference scheme.
Diffusion and reactions on moving curves
A new position at time t + Δt, xn + 1, of a single molecule moving by pure diffusion, bound to Γ at position xn at time t, where xn ∈ γi, is computed by sampling a distance d from a one-dimensional normal distribution with standard deviation and then updating
(12) |
It is possible that xn + 1 is no longer on the segment γi. If this is the case we project xn + 1 to the closest point on γ. The time step Δt should thus be chosen such that is small in comparison to , so that the molecules do not diffuse a long distance compared to the distance of the segments. Since we are projecting the molecule onto the line we are introducing a small error and the effective diffusion will be slightly slower than the exact diffusion process. However, if the angles between the linear segments are small, this error will also be small, and it is therefore important to ensure that the discretization of Γ is fine enough to make the angles small enough. Note that the process defined by Eq. 12 can be modified to account for other types of motion on curves, such as active transport. For instance, d could be sampled from another distribution than the normal distribution, or be completely deterministic. Molecules on the same line segment can react according to the PDF obtained by solving Eq. 6 with boundary condition given by Eq. 7.
Many one-dimensional structures in cells are not stationary, but rather move around in space. They can also grow and shrink. We simulate this process with a first order operator splitting scheme. Choose a time step Δtsplit. First we keep the lines constant in space and propagate the molecules according to the algorithm described above for the full time step Δtsplit. The second step is to move the lines according to some transformation in space, for the full time step Δtsplit. T can both move the line in space but also change the length of the line. We apply T to all the points, such that the linear segments at time t + Δtsplit are defined by the pair of points
The position of the molecules are now no longer guaranteed to be on the lines. Each molecule is therefore projected down to the closest point on its respective line. For high accuracy we should therefore choose the time step Δtsplit such that each molecule is projected only a short distance. After each step it is possible that some molecules on the line that were previously separated by some small distance are now overlapping. If this is the case they are moved apart and placed at a distance equal to their reaction radius. Molecules in space that overlap with the lines are also moved such that they are at a distance from the line equal to their reaction radius. We then repeat this process until the final time tf. If higher accuracy is needed it would be possible to consider higher order splitting schemes, such as Strang splitting.35
The algorithm outlined above is summarized in Algorithm 1. Note that this is only a summarized version of the GFRD algorithm. There are some special cases that require extra care, such as pairs of molecules that are also close to a boundary. For a more detailed description of how to handle such cases the reader is referred to Refs. 15, 16, 17.
There are several parameters that have to be chosen, that also affect the accuracy of the algorithm. The parameter N should be chosen large enough to make the modeling error comparable to the approximation error of the line. Exactly how to optimally choose N will therefore depend on the model we are studying.
The parameter Δt is chosen as described in the introduction to Sec. 2, and for most systems that choice will make Δt small enough to keep the error due to the operator splitting small. The parameter Δtsplit should be chosen such that the molecule and the line are likely to be close before they react, ideally within a small multiple of the reaction radius.
NUMERICAL RESULTS
We show the practical use of the algorithm by studying both examples with straight lines through a three-dimensional domain as well as lines with non-constant curvature. We demonstrate how to use the algorithm to simulate active transport by letting molecules attached to a line move with a deterministic speed in a predetermined direction. The lines can also move around in space as well as grow and shrink.
1: | Initialize the system at the time t = 0 and choose a final time tf. Choose a time step Δtsplit for the operator splitting. |
2: | Divide the set of all molecules into subsets of one or two molecules such that pairs are formed by molecules that are each others nearest neighbor. Set tloc = 0. |
3: | whiletloc < Δtsplitdo |
4: | Choose a time step Δt ⩽ Δtsplit − tloc such that molecules are unlikely to interact with more than one other molecule during that time step. Δt should also be chosen such that no molecule is likely to interact with a two-dimensional boundary or a one-dimensional curve and a neighbor during the time step. To fulfill this requirement certain pairs and single molecules may have to be updated in several smaller steps during Δt. |
5: | Pairs of molecules are updated according to the PDF obtained by solving the three-dimensional Smoluchowski equation 1, as described in Sec. 2A. |
6: | Molecules close to a line are updated according to the PDF obtained by solving the two-dimensional Smoluchowski equation 10, as described in Sec. 3A. |
7: | Molecules on a line are updated according to the PDF obtained by solving the one-dimensional Smoluchowski equation 6, as described in Sec. 2A. |
8: | Molecules that dissociate from the cylinder at time τ < Δt are placed at the reaction radius of the molecule and the cylinder. If a molecule in free space dissociates we place the products at a distance equal to the sum of the reaction radii of the two molecules. |
9: | Update tloc = tloc + Δt. |
10: | end while |
11: | Update the linear segments according to the transformation T. |
12: | Update the time t = t + Δtsplit. |
13: | Repeat (2)–(12) until the final time tf. |
Polymer embedded in a cylinder
We consider a cylinder with radius R = 10−6 m, height H = 2 × 10−6 m and aligned along the x-axis. Inside the cylinder we consider a line of length H, for instance modeling a polymer, going through the center of the cylinder. The polymer has a reaction radius rl = 10−9 m. Molecules of species A can associate with the polymer to form molecules Acyl. Molecules Acyl can dissociate from the polymer to become species A. Thus, we consider the reaction
where A is a molecule diffusing in three-dimensional space and Acyl is a molecule bound to the polymer. The diffusion constant of A is DA = 10−12 m2/s and the diffusion constant of Acyl is . The intrinsic reaction rates are chosen to be kr = 10−11 m2/s and kd = 50 s−1. The problem setup is shown in Fig. 1.
Due to the cylindrical symmetry of the problem, the reactions with the polymer can be reduced to a two-dimensional problem. The average time Tbind until a molecule A reacts with the polymer can thus be computed using a formula for association times on disks, derived in Ref. 10. We can therefore compare the microscopic simulation with a corresponding mesoscopic simulation of the reaction
where and, due to the detailed balance condition, . Here Adisk = πR2 is the area of the face of the cylinder. In Fig. 2 we show that for this simple problem, the microscopic simulations agree well with spatially homogeneous mesoscopic simulations, performed with the SSA.
Facilitated diffusion of transcription factors in living cells
That the lac repressor in vitro finds its operator by combining three-dimensional diffusion with one-dimensional sliding on chromosomal DNA was established in 1970.36 That this is also the case in living cells was shown in Ref. 29, through a series of experiments using single molecule imaging to study binding times of transcription factors to their operators. These experiments can also be studied using the numerical method developed in this paper.
The geometrical setup is the same as in the example above, but molecules of species Acyl will be able to react with operator sites on the line, thus considering the line a coarse model of DNA. Operator sites are denoted by B and we consider the reactions
For the purpose of the experiments in this section, we can assume that operator sites are absorbing and of radius 10−9 m, but generally, this is not a necessary restriction. The first example that we will consider is placing two operator sites on the DNA, a distance lop apart. If sliding on the DNA is important to the binding time of the transcription factor, Tactive, we would expect the distance between the operator sites to affect Tactive. If the operator sites are farther apart than the average sliding distance of the transcription factors, they will be essentially independent and Tactive would be expected to be significantly reduced compared to the case of having just one operator site on the DNA. On the other hand, if the operator sites are closer than the average sliding distance, they will appear more as a single operator site to the transcription factor. In Fig. 3 we show the dependence of Tactive on lop.
For the second experiment we place road blocks at a distance lrb from a single operator site. Road blocks are modeled by stationary, non-reactive (but reflective) molecules of radius 10−9 m. Thus, molecules binding to the DNA at a distance larger than lrb from the operator site, will not be able to find it since molecules cannot diffuse through the road blocks. Therefore we expect that the binding time will increase as the distance between the road blocks decrease, as sliding onto the operator site becomes more and more unlikely.
We compute the average time, Tactive, until an Acyl-molecule binds to the operator site, and this time will depend on the length lrb between the road blocks. In Fig. 4 we show that this time decreases with increasing length between the road blocks, as was shown experimentally and theoretically in Refs. 26 and 29. The decrease in the binding time levels off as lrb becomes greater than the average distance that a molecule slides on the DNA before dissociating.
The distance between the road blocks models different levels of crowding on the DNA. Thus, the time Tactive represents how the crowding of road blocks, or DNA binding proteins, affects the search time of an operator. This model predicts that a higher concentration of DNA binding proteins increases the search time.
Spirals embedded in a sphere
One-dimensional structures in living cells are often not well approximated by straight lines due to their curvature, and they may also have a complicated spatial distribution relative to each other. It is therefore important to be able to simulate molecules reacting with and on lines of non-constant curvature. To demonstrate the flexibility of the method we therefore consider the following system of reactions:
where Acyl, Bcyl, and Ccyl are molecules bound to one of the lines and A and B are molecules diffusing in three-dimensional space that can react with the lines. We will consider two spirals Γ1 and Γ2 along the y-axis, parameterized by
with 0 ⩽ s ⩽ 3 and rc = 3 × 10−7. Each spiral is discretized using 30 points. The setup is shown in Fig. 5.
We start with 150 molecules of species A and 150 molecules of species B, with initial positions sampled from a uniform distribution. The reactions rates are chosen to be , , and . The diffusion rate for the molecules in space is 10−12 m2/s and the diffusion rate for the molecules on the spirals is 10−14 m2/s. We simulate the system for 2 s and the evolution of the system is shown in Fig. 6.
Active transport on moving microtubules
Molecules in living cells do not only move by normal diffusion but can also be actively transported. For instance, intracellular transport on the cytoskeleton, a structure in the cell composed of fibers such as actin and microtubules, is an important mechanism for directed transport of cargo in the cell.37 Cargo attach to the fibers and are then transported by molecular motors: myosins along actin and dynein and kinesins along microtubules.
We will consider a much simplified model of active transport from the cytosol towards the center of a cell. Molecules bind to microtubules, modeled by straight lines passing through the origin. We assume that the transport is of constant speed in the direction towards the origin and we do not explicitly model the molecular motors. Other more complex and more biologically accurate models of active transport could be incorporated into the method by modifying Eq. 12 in a suitable way. The model used in this example is not meant to be a very accurate model of such processes, but is rather meant to serve as an example of the flexibility of the algorithm.
We initialize the system by sampling 25 points, p1, …, p25, uniformly on a sphere of radius R = 10−6 m. Twenty five straight line segments through the origin are then defined by the pairs (pi, −pi), i = 1, …, 25. We start with 100 A-molecules in space, with initial positions sampled from a uniform distribution. The A-molecules can then associate and dissociate to the microtubules through the reaction . A-molecules in space diffuse with the diffusion constant D = 10−12 m2/s, but molecules on the microtubules move deterministically towards the center of the sphere with constant speed . The microtubules are rotated Δθ and Δϕ radians around the x- and y-axis per time step Δt, where
(13) |
X1 and X2 are random numbers sampled from a normal distribution with standard deviation 1. Δtsplit is chosen to be 0.01. Thus, the microtubule li defined by the points (pi, −pi) at time t, will be updated at time t + Δt by rotating pi and −pi around the x- and y-axis Δθ and Δϕ degrees, with Δθ and Δϕ given by Eq. 13. The trajectory of one of the microtubules is plotted in Fig. 7 and two snapshots of a simulation, one at t1 = 0.05 s and one at the final time tf = 1.0 s are shown in Fig. 8.
In Fig. 9 the distribution of molecules in the radial direction is plotted. The initial distribution is uniform in space. Due to the transport towards the center, this distribution is shifted and more molecules are close to the center of the sphere at the final time tf = 1.0 s. The reaction rates in the simulation are kr = 10−11 m2/s and kd = 103 s−1.
Growing and shrinking microtubules
Microtubules are highly dynamic and can, in addition to moving around in space, both grow and shrink. Therefore, as a final example, we will consider fibers embedded in a sphere and whose lengths can both increase and decrease.
The sphere has a radius R = 10−6 m, with two straight fibers passing through the origin. The fibers are initially of length l = 2R. The lines rotate in the same way as in Sec. 4D but will also grow and shrink. After each time step, the length of the line is updated, and molecules are projected down to the closest point on the line. Note that the closest point could in fact be the endpoint of the line if the molecule was close to the end of the line before the length was updated. One possibility is to let such molecules be transformed to molecules in free space, but in this example they will stay bound to the line. Now, assume that the length of the line at time tn is given by l(tn). Then the length at tn + Δt is given by
where X is a random number sampled from a normal distribution with mean 0 and standard deviation , with the restrictions that l(tn + Δt) < 2R and l(tn + Δt) > 10−8. Thus, if the line is defined by the points (pn, −pn) at time tn, it will be defined by the points (l(tn + Δt)/l(tn) · (pn, −pn) at time tn+1.
Intuitively one would expect that when the length of the fibers decreases, the number of bound molecules will also decrease, under the condition that the fibers grow and shrink at a time scale that is longer than or similar to the time scale for the system to reach steady state. Thus, if we choose parameters such that this condition is fulfilled, we would expect to see a correlation between the number of molecules of species Acyl and the total length of the two lines.
We start the simulation with 600 molecules of species A uniformly distributed in space. The diffusion constants are DA = 10−12 m2/s and . In Fig. 10 we plot the total length of the two lines and the total number of molecules bound to a line. In Fig. 10 a trajectory of the system, with Dl = 10−12 m2/s, is plotted together with the sum of the length of the lines. As expected the two trajectories are clearly correlated.
CONCLUSIONS
We have described an algorithm to simulate biochemical systems in complex geometries with embedded and dynamic one-dimensional structures. Molecules in space can associate with and dissociate from the embedded curves. Molecules in space and on the lines are simulated using the GFRD algorithm.
The accuracy of the algorithm depends on the discretization of the lines and the time step chosen in the algorithm. The simulation of the molecules and the spatial transformation of the lines are split in time according to a first order operator splitting scheme and the error will therefore depend on the time step chosen for this splitting, as well as the curvature of the lines.
We have demonstrated the flexibility of this method in five numerical examples. We have shown that the length between two operator sites affects the binding time of transcription factors, and we have also shown that the binding time depends on the length between two road blocks on each side of the operator site. As the length increases, the time until a molecule binds decreases. The decrease levels off as the distance gets bigger than the average sliding distance of a molecule on the DNA, in agreement with Refs. 26 and 29. In the second example we consider two spirals embedded in a sphere. Molecules in space undergo a reversible reaction with the spirals and molecules on the spirals also react reversibly. In a third example we have considered a simple model of active transport. Molecules in a sphere react reversibly with embedded lines and move deterministically towards the center, while bound. The lines also move in space and we have shown how the radial distribution of the molecules is shifted due to the active transport. In a final example we have simulated lines that are growing and shrinking in space and shown how the dynamics of the lines correlates to the concentration of the molecules in the system.
ACKNOWLEDGMENTS
Andreas Hellander and Per Lötstedt have read the manuscript carefully and suggested a number of improvements. This work has been supported by the NIH grant for StochSS with Grant No. 1R01EB014877-01 and the Swedish Research Council.
References
- McAdams H. H. and Arkin A., Trends Genet. 15, 65 (1999). 10.1016/S0168-9525(98)01659-X [DOI] [PubMed] [Google Scholar]
- Elf J. and Ehrenberg M., Syst. Biol. 1, 230 (2004). 10.1049/sb:20045021 [DOI] [PubMed] [Google Scholar]
- Gillespie D. T., J. Comput. Phys. 22, 403 (1976). 10.1016/0021-9991(76)90041-3 [DOI] [Google Scholar]
- Hattne J., Fange D., and Elf J., Bioinformatics 21, 2923 (2005). 10.1093/bioinformatics/bti431 [DOI] [PubMed] [Google Scholar]
- Drawert B., Engblom S., and Hellander A., BMC Syst. Biol. 6, 76 (2012). 10.1186/1752-0509-6-76 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wils S. and Schutter E. D., Front. Neuroinform. 3, 15 (2009). 10.3389/neuro.11.015.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Isaacson S. A., SIAM J. Appl. Math. 70, 77 (2009). 10.1137/070705039 [DOI] [Google Scholar]
- Hellander S., Hellander A., and Petzold L., Phys. Rev. E 85, 042901 (2012). 10.1103/PhysRevE.85.042901 [DOI] [PubMed] [Google Scholar]
- Erban R. and Chapman J., Phys. Biol. 6, 046001 (2009). 10.1088/1478-3975/6/4/046001 [DOI] [PubMed] [Google Scholar]
- Fange D., Berg O. G., Sjöberg P., and Elf J., Proc. Natl. Acad. Sci. U.S.A. 107, 19820 (2010). 10.1073/pnas.1006565107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arjunan S. N. V. and Tomita M., Syst. Synth. Biol. 4, 35 (2010). 10.1007/s11693-009-9047-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smoluchowski M. V., Z. Phys. Chem. 92, 129 (1917). [Google Scholar]
- Doi M., J. Phys. A 9, 1465 (1976). 10.1088/0305-4470/9/9/008 [DOI] [Google Scholar]
- Doi M., J. Phys. A 9, 1479 (1976). 10.1088/0305-4470/9/9/009 [DOI] [Google Scholar]
- Takahashi K., Tănase-Nicola S., and ten Wolde P. R., Proc. Natl. Acad. Sci. USA. 107, 2473 (2010). 10.1073/pnas.0906885107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Zon J. S. and ten Wolde P. R., J. Chem. Phys. 123, 234910 (2005). 10.1063/1.2137716 [DOI] [PubMed] [Google Scholar]
- Hellander S. and Lötstedt P., J. Comput. Phys. 230, 3948 (2011). 10.1016/j.jcp.2011.02.020 [DOI] [Google Scholar]
- Donev A., Bulatov V. V., Oppelstrup T., Gilmer G. H., Sadigh B., and Kalos M. H., J. Comput. Phys. 229, 3214 (2010). 10.1016/j.jcp.2009.12.038 [DOI] [Google Scholar]
- Tomita M., Hashimoto K., Takahashi K., Shimizu T. S., Matsuzaki Y., Miyoshi F., Saito K., Tanida S., Yugi K., Venter J. C., and C. A.HutchisonIII, Bioinformatics 15, 72 (1999). 10.1093/bioinformatics/15.1.72 [DOI] [PubMed] [Google Scholar]
- Andrews S. S. and Bray D., Phys. Biol. 1, 137 (2004). 10.1088/1478-3967/1/3/001 [DOI] [PubMed] [Google Scholar]
- Kerr R. A., Bartol T. M., Kaminsky B., Dittrich M., Chang J.-C. J., Baden S. B., Sejnowski T. J., and Stiles J. R., SIAM J. Sci. Comput. 30, 3126 (2008). 10.1137/070692017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hellander A., Hellander S., and Lötstedt P., Multiscale Model. Simul. 10, 585 (2012). 10.1137/110832148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klann M., Ganguly A., and Koeppl H., Bioinformatics 28, i549 (2012). 10.1093/bioinformatics/bts384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flegg M. B., Chapman S. J., and Erban R., J. R. Soc., Interface 9, 859 (2011). 10.1098/rsif.2011.0574 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mauro A. J., Sigurdsson J. K., Shrake J., Atzberger P. J., and Isaacson S. A., ArXiv e-prints (2013), arXiv:1302.0793 [math.NA].
- Wang S., Elf J., Hellander S., and Lötstedt P., “Stochastic reaction-diffusion processes with embedded lower dimensional structures,” Tech. Rep. 2012-034 (Department of Information Technology, Uppsala University, 2012). [DOI] [PMC free article] [PubMed]
- Hellander A. and Lötstedt P., Multiscale Model. Simul. 8, 1691 (2010). 10.1137/100784709 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mitchison T. and Kirschner M., Nature 312, 237 (1984). 10.1038/312237a0 [DOI] [PubMed] [Google Scholar]
- Hammar P., Leroy P., Mahmutovic A., Marklund E. G., Berg O. G., and Elf J., Science 336, 1595 (2012). 10.1126/science.1221648 [DOI] [PubMed] [Google Scholar]
- Carslaw J. C. J. H. S., Conduction of Heat in Solids (Oxford University Press, 1959). [Google Scholar]
- Kim H. and Shin K. J., Phys. Rev. Lett. 82, 1578 (1999). 10.1103/PhysRevLett.82.1578 [DOI] [Google Scholar]
- Bani-Hashemian M. H., Hellander S., and Lötstedt P., Comm. Comp. Phys. 13, 958 (2013). 10.4208/cicp.271011.230312a [DOI] [Google Scholar]
- Carlsson T., Ekholm T., and Elvingson C., J. Phys. A: Math. Theor. 43, 505001 (2010). 10.1088/1751-8113/43/50/505001 [DOI] [Google Scholar]
- Peters E. A. J. F. and Barenbrug T. M. A. O. M., Phys. Rev. E 66, 056701 (2002). 10.1103/PhysRevE.66.056701 [DOI] [PubMed] [Google Scholar]
- Strang G., SIAM (Soc. Ind. Appl. Math.) J. Numer. Anal. 5, 506 (1968). 10.1137/0705041 [DOI] [Google Scholar]
- Riggs A. D., Bourgeois S., and Cohn M., J. Mol. Biol. 53, 401 (1970). 10.1016/0022-2836(70)90074-4 [DOI] [PubMed] [Google Scholar]
- Howard J., Annu. Rev. Physiol. 58, 703 (1996). 10.1146/annurev.ph.58.030196.003415 [DOI] [PubMed] [Google Scholar]