Abstract
Analysis of complex networks has been widely used as a powerful tool for investigating various physical, chemical, and biological processes. To understand the emergent properties of these complex systems, one of the most basic issues is to determine the structure and topology of the underlying networks. Recently, a new theoretical approach based on first-passage analysis has been developed for investigating the relationship between structure and dynamic properties for network systems with exponential dwell time distributions. However, many real phenomena involve transitions with non-exponential waiting times. We extend the first-passage method to uncover the structure of distinct pathways in complex networks with non-exponential dwell time distributions. It is found that the analysis of early time dynamics provides explicit information on the length of the pathways associated to their dynamic properties. It reveals a universal relationship that we have condensed in one general equation, which relates the number of intermediate states on the shortest path to the early time behavior of the first-passage distributions. Our theoretical predictions are confirmed by extensive Monte Carlo simulations.
INTRODUCTION
Complex networks consisting of discrete states connected by dynamic transitions have been successfully applied for investigating many physical, chemical, and biological processes.1, 2, 3, 4, 5 Since the functioning of these complex systems is strongly influenced by their structures, the most important step in theoretical analysis is to determine the topology of underlying networks. Despite recent strong advances in understanding the dynamical and structural properties of complex chemical and biological networks,6, 7, 8, 9, 10, 11, 12, 13, 14 revealing the hidden structures of networks and their relations to dynamics remains a challenging task. One of the main reasons for this is a relatively small number of theoretical methods that can be employed.
Significant experimental progress in measuring dynamic properties of various chemical and biological processes has been reported. For many complex systems specific events between two arbitrary states can be measured with high temporal resolution.15, 16 Although the underlying structural information is contained in those experimental measurements, it is very difficult to extract it and to determine the unique mechanism of the process. It was realized that the analysis of distributions of such events is connected with a first-passage problem,19, 20, 21 which is a powerful method successfully applied to many stochastic chemical and biological problems.17, 18 Other theoretical methods for uncovering network structures, including hidden Markov models22, 23 and the absorbing boundary method,24 have been developed recently. However, their practical application is limited to small networks and systems where one has prior knowledge of some details of the network structure.
Recently, a new theoretical approach to determine the relationship between structural and dynamic properties of complex networks has been developed based on the first-passage idea.25, 26 At early time, the probability density of the first-passage time between two states on a linear network has been found to follow a power law behavior.25 It was proposed that similar relationships hold for general complex networks, and it was supported by Monte Carlo computer simulations. Later, the conjecture was proved to be correct for the network of any topology by deriving the Taylor series of first-passage time probability density using graph theory methods.26 It was found that the smallest number of intermediate states between two arbitrary states of the network is given by the corresponding exponent of the power law function. By applying this theoretical method for several motor protein systems, it was argued that it might be especially useful for analyzing single-molecule experiments in various chemical and biological systems.25 However, this approach explicitly assumed that all dynamic transitions between states are Poissonian, i.e., the corresponding dwell times on each single state are exponential. However, there is a large number of natural and industrial processes that involve non-exponential waiting times.27, 28, 29, 30 For example, coupling of mechanical degrees of freedom with chemical processes in motor proteins might lead to non-exponential waiting time distributions.28 More generally, when a given state has an internal structure, for instance, a smaller sub-network of states, we should expect non-exponential dwell time distributions and violations of the Markov property. Since this is a more general case than systems modeled by Markov chains, it is important to extend the first-passage approach for investigating these more complex systems.
In this article, we study the coupling between structure and dynamic properties of complex networks with non-exponential dwell time distributions for each transition. We have found a universal identity that connects dynamic information, which can be obtained from experimental measurements, with the number of states along the selected path as a first step to unveil the complete structure of the complex network. It is shown that this general result reduces to previously found relations25, 26 for exponential dwell times. Our theoretical predictions are supported by Monte Carlo computer simulations for several networks with different topology.
RESULTS
Laplace transforms of the first-passage distribution functions
We consider a general network composed of a series of discrete states k which form the state space of the network σ. We identify the state i ∈ σ as the initial state and the state j ∈ σ as the final state for which we would like to know the distribution fij(t) of the first-passage times when starting from the state i. We associate to each state k ∈ σ a set of transition probabilities 0 ⩽ Pkℓ ⩽ 1 to reach any of the states ℓ ∈ σ in the state space of the network in one step. The normalization condition ∑ℓ ∈ σPkℓ = 1 must hold for these transition probabilities and Pkk = 0. The matrix P whose elements are the Pkℓ is thus the embedded Markov chain in discrete time.
We now assign to each state k a general dwell time probability density ϕk(t) in the following form:
(1) |
where Ck has a dimension of inverse time (s−1) and is fixed by the normalization. The parameters ak, γk are both larger than zero and can depend on the state k. We initially restrict our analysis to the case where the parameter α is the same for all states of the network and satisfies α > −1. In the following, we assume that t0 = 1 s. For γk = 1 and α = 0 we recover the exponential waiting times from previous studies.25, 26 Equation 10 is, in essence, a mathematical generalization of the Gamma distribution, which can be obtained by setting γk = 1. We have allowed for the possibility to take γk ≠ 1 in order to have the most general dwell time distribution that we are able to treat mathematically with the method described in this work. Apart from the Gamma distribution, which arises naturally in many stochastic natural phenomena or as the convolution of several exponential dwell times, other choices for the value of γk > 0 do not have an obvious physical or chemical origin.
Now consider a first-passage time from the initial state i to the final state j for general networks as shown in Fig. 1. Since we are using the more general dwell time distributions 1, we cannot use the approach based on the chemical Master equation to derive the first passage time. Let τij be the random number that gives the first-passage time from the state i to the state j, and be the time from i to any of its nearest neighbors k1. With the assumptions given above, the time to reach any of the neighbors is independent of the neighbor and is distributed according to 1. We will thus put . We also define as the first-passage time from each of the neighbors of i to the final state j. These times are related by
(2) |
if k1 is the first state visited by the process after leaving the state i. In terms of the Laplace transforms from Eq. 2 we obtain
(3) |
under the condition that k1 is the first visited state. Here we have defined . Each neighbor k1 of the state i can be visited with the corresponding probability after leaving state i. Therefore, the final Laplace transforms of the first-passage time from i to j is given by
(4) |
where Di(s) is the Laplace transform of the dwell time distribution density at state i defined as,
(5) |
Note that the notation means a sum over all k1 that are neighbors of the state i, i.e., they can be reached from the state i in a single transition.
In Eq. 2 we split the random time to go from the state i to the state j as the time to reach one of the first neighbors of the state i plus the first-passage time to reach the state j from this neighboring state. In terms of Laplace transforms it is expressed as the product of the respective functions as given in Eq. 3. Since the first-passage time from any k1 to j can also be split into a dwell time to be in k1 and the first-passage time to one of its neighbors, one can extend the expansion in Eq. 4 into sums of products,
(6) |
where ℓ can be arbitrarily large and means a sum over all kb that are neighbors of the state ka.
The universal identity for the shortest path
We assume that the shortest path from the state i to the state j is unique in the network and it has exactly m intermediate states. Notice that the sums in Eq. 6 stop whenever they reach the final state j. Thus, as ℓ → ∞, Eq. 6 will be split into an infinite sum of generic terms like
(7) |
where k1, k2, …, kn are the state members of an arbitrary random walk from the state i to state j and n is the length of the walk. The dwell time distribution densities ϕk(t) from Eq. 1 have the following asymptotic behavior at early times t → 0:
(8) |
where the second term goes to zero faster than the first term. Recall that early times correspond to large Laplace variables (s → ∞). Therefore, the Laplace transforms Dk(s) of the ϕk(t) for all states k ∈ σ at large s can be written as
(9) |
where Bk is some constant. Then, the Laplace transform Gn(s) in Eq. 7 is simply given by
(10) |
for large s with being another constant. Thus, under the assumption that there exists a unique shortest path with m intermediate states between the state i and the state j, this path gives a contribution similar to Eq. 10. One can see that, apart from a multiplicative constant, the expression of 10 for n = m necessarily dominates the expansion in Eq. 4 at large s since m is the smallest possible value that n can take. It yields the asymptotic result,
(11) |
in the limit of s → ∞. Other paths with larger number of intermediate states have contributions of smaller order of magnitude. Then, by inverting the Laplace transform, the asymptotic behavior of the probability density fij(t) at early times can be obtained as
(12) |
where Am is a constant. It suggests that the first-passage time probability function fij(t) at early times has a power-law dependence (∼tβ), which is similar to results derived in previous studies with exponential waiting times.25, 26 The corresponding exponent β satisfies the equation
(13) |
Note that this exponent does not depend on the parameter γk and it reproduces the known identity β = m for continuous time Markov chains, i.e., if α = 0.25, 26 One important consequence of Eq. 13 is that for any given β, derived for instance after fitting experimental data, the number of intermediate states m and the parameter α cannot be chosen arbitrarily and independently from each other. Therefore, Eq. 13 delivers a strong constraint that couples the structural properties along the pathway and the dynamical properties at the level of the individual dwell times. As a side remark, using the general expansion given in Eq. 6 it is easy to see how Eq. 13 can be generalized when the parameter α does depend on the state k. Using the same method that relates early times to large s we indeed obtain
(14) |
where αi is the parameter α associated to the initial state i and is the parameter α associated to the state kℓ.
Numerical test with Monte Carlo simulations
To test our theoretical prediction, Eq. 13, we performed a series of computer simulations, analyzing complex networks presented in Fig. 1. Many distinct paths with varied numbers of intermediate states exist between the initial state and any of the final states in the system. For the four final states jk with k = 0, 1, 2, 3 considered in the network, the numbers of intermediate states m for the shortest paths starting from the initial state i are given by 0, 1, 2, and 3, respectively. Equation 13 suggests that the relationship between α, β, and m does not depend on other parameters such as ak, γk which appear in Eq. 1. Therefore, we simply set ak = γk = 1. We fix, instead, the parameter α in Eq. 1 to take one of the two possible values, α = 0.5 and α = 1, as examples. Each state in the network shown in Fig. 1 is labeled by a number starting from 1 to 11.
Since we want to compare our results both in the case of exponential (Markov) and non-exponential dwell times, we first proceed by creating a continuous time Markov chain by assigning transition rates between connected states. We assume that the transition rate from the state k to the state ℓ rkℓ = 0.5 s−1 for k < ℓ and rkℓ = 0.1 s−1 if k > ℓ. The transition probabilities from the state k to one of its neighbors ℓ is given by
(15) |
where the notation ∑⟨nk⟩ means a sum over all n that are neighbors of the state k. The probability matrix 15 defines our embedded discrete time chain introduced in Sec. 2A. Notice that the continuous time chain thus created corresponds to the case of α = 0 and γk = 1, but with varying parameters ak depending on the transition rates from the state k to their neighbors.
To proceed to the non-exponential dwell times, we instead consider the embedded chain 15 and define the dwell times according to the distribution 1. We first consider the case when α = 0.5 for the dwell time distribution at each state of the network. The first-passage time probability densities starting from the initial state i to the final state jk with k = 0, 1, 2, 3 are presented in Fig. 2. The early time behavior of the probability densities is shown in the main figure on log-log scale, while the inset gives the full time picture. The solid lines in Fig. 2 correspond to linear fits of the logarithm of the simulation results at early times, i.e., before the distribution reaches its maximum value (hereafter we will refer to the linear fit of the logarithm of the distribution simply as linear fit). The slopes of these solid lines approximate the values of β as expressed in Eq. 13. The number m of the intermediate states for the shortest path between the initial state i and final state jk with k = 0, 1, 2, 3 is also indicated in Fig. 2. From Eq. 13 with α = 0.5 we expect that the corresponding value for β is equal to 0.5, 2, 3.5, 5 with m varied from 0, 1, 2 to 3, respectively. The slopes obtained from the linear fit of the early times in Fig. 2 are close to the expected values calculated from Eq. 13. Similarly as was observed before,25 the slopes are slightly smaller than the expected values of β as given by Eq. 13. However, as discussed below, we notice that the slopes approach the expected values as the time span used for fitting is reduced. One can see that the number of intermediate states for the shortest path between the initial and the final states can be derived from the slope of first-passage time densities at early times. As shown in Fig. 2, the slope increases with the number of intermediate states, in agreement with the prediction of Eq. 13.
To test further predictions from Eq. 13 we calculated the first-passage probability densities using α = 1 for the networks shown in Fig. 1. As presented in Fig. 3, the slope for the solid lines from linear fit increases with the number of intermediate states for the shortest path between the initial and final states, fully supporting our theoretical predictions. Compared with Fig. 2, we can also find that the corresponding slopes with the same number m of intermediate states become larger as the value of α increased from 0.5 to 1, which is again consistent with Eq. 13. The expected values for β are equal to 1, 3, 5, and 7 for α = 1 when the number m of intermediate states on the shortest path between the initial and final states takes the values 0, 1, 2, and 3, respectively. One can see that the slopes obtained from linear fits (Fig. 3) are close to the expected values. For a fixed value of α we can also determine the number of intermediate states on the shortest path between any two states on a complex network from the early time behavior. For α = 1, a slope between 0 and 1 means m = 0, i.e., there are no intermediate states in the shortest path between the two states. A slope between 1 and 3 means there is one intermediate state, i.e., m = 1. These arguments can be extended to any number of intermediate states (see Fig. 3).
For the network in Fig. 1 we analyzed the case with exponential dwell time distributions, as was discussed previously,25, 26 and obtained the corresponding first-passage time probability densities from the initial state i to final states jk. The results are presented in Fig. 4. In this case, where α = 0, Eq. 13 reduces into a simple form, β = m, as was found in previous studies.25, 26 Following the procedure described above, one can obtain the number of intermediate states for the shortest path directly from the early time behavior of the first-passage time probabilities (see Fig. 4). The probability density is an exponentially decaying function of time if the final state is one of the neighbors of the initial state, and the corresponding slope has a value close to zero (Fig. 4). As the number of the intermediate states becomes larger, a rising phase followed by a decaying phase appears in the probability density function, as demonstrated in Fig. 4. We also find that the slope of the fitting curves is very close to the expected value of β for each case as discussed above. The number m of intermediate states can even be obtained directly by rounding to the upper integer of the slope for each linear fitting curve, as indicated in Fig. 4. The same results were found previously.25 However, the number m of the intermediate states between two states of a complex network with general dwell time distributions cannot be read out directly from these fittings, but one can still easily obtain them using Eq. 13 as discussed above.
Analyzing first-passage distribution functions for various sets of parameters, one can notice that slopes obtained by linear fitting of the first-passage distributions at early times are always smaller than suggested by Eq. 13. It is found that predictions from Eq. 13 are getting more accurate with decreasing the time interval utilized for fitting. It is not surprising since our theoretical conclusions were obtained in the limit of t → 0. To illustrate this, let us consider the network shown in Fig. 1 with the initial state i, the final state j1, and the parameter α = 1. Different slopes obtained by linear fittings of first-passage probability functions for varying time intervals are given in Fig. 5. As we reduce the time range the corresponding slope values increase monotonically. In order to see this trend clearly these slopes are also presented in Fig. 6. For this case there is 1 intermediate state (m = 1) and Eq. 13 predicts the slope should be equal to 3. The fitted slope values approach this number fast as the size of the time segment decreases, supporting our theoretical arguments.
DISCUSSION AND CONCLUSIONS
We investigated the relationship between topology and dynamic properties of general networks via theoretical analysis of first-passage probability distribution functions for various transition events. In contrast to previous studies25, 26 where simple exponential dwell time distributions were assumed, more general dwell time probabilities were used in this work. Using analytical calculations based on asymptotic analysis of Laplace transforms it is shown that there is a universal behavior of the first-passage distribution functions at early times. It is specified by a power-law temporal dependence of the probability functions for events starting and finishing at different sites of the network. We found that the exponent of the power-law is directly related to the number of intermediate states on the shortest path connecting the initial and the final states. For purely exponential waiting times this exponent simply reduces to the number of intermediate states, as was already shown before.25, 26 In general case, the exponent depends on details of transition dynamics at each site. Furthermore, our analysis suggests that the universal behavior of first-passage probability function is observed if the power-law component of waiting time distributions is the same for all states on the network.
The physical mechanism of this universal behavior can be easily explained using the following arguments. At early times, only trajectories that follow the shortest path between the specified start and end states can be observed since the system should pause at each intermediate state. This argument is more precise the shorter the time for observations, and the arrival dynamics should reflect the number of intermediate states. Even for very fast transitions, i.e., for very short dwell times at some sites, there is a time when this universal behavior should be observed. In other words, this dynamic picture is independent of the absolute values of the average dwell times at each state.25, 26 The reversibility of individual transitions and therefore the probability that one path is taken instead of another also does not play any role at this level of analysis.
Our theoretical ideas were tested in extensive Monte Carlo simulations for various networks with different parameters to describe the non-exponential waiting times. In all cases it was found that analytical predictions correctly describe first-passage arrival dynamics at early times. The analysis also shows that the agreement with theoretical calculations is asymptotic in time: decreasing the time segment for extracting the universal exponents leads to convergence to predicted values. It supports the universal nature of presented theoretical results, and it also provides a direct method for connecting structural and dynamic properties for any network. The most significant outcome of this study is the development of an explicit theoretical framework for analyzing complex physical, chemical, and biological processes in order to extract relevant microscopic information. It will be critically important to test our theoretical predictions in real experimental systems.
ACKNOWLEDGMENTS
A.B.K. and X.L. were supported by grants from National Institutes of Health (1R01GM094489-01) and the Welch Foundation (C-1559). A.V. thanks C. Sin for a critical reading of the manuscript.
References
- Temkin O. N., Zeigarnik A. V., and Bonchev D. G., Chemical Reaction Networks: A Graph-Theoretical Approach (CRC Press, New York, 1996). [Google Scholar]
- Strogatz S. H., Nature (London) 410, 268 (2001). 10.1038/35065725 [DOI] [PubMed] [Google Scholar]
- Albert R. and Barabasi A.-L., Rev. Mod. Phys. 74, 47 (2002). 10.1103/RevModPhys.74.47 [DOI] [Google Scholar]
- Karalus S. and Porto M., Eur. Phys. Lett. 99, 38002 (2012). 10.1209/0295-5075/99/38002 [DOI] [Google Scholar]
- Goutsias J. and Jenkinson G., Phys. Rep. 529, 199 (2013). 10.1016/j.physrep.2013.03.004 [DOI] [Google Scholar]
- Sowa Y., Rowem A. D., Leake M. C., Yakushi T., Homma M., Ishijima A., and Berry R. M., Nature (London) 437, 916 (2005). 10.1038/nature04003 [DOI] [PubMed] [Google Scholar]
- Kerssemakers J. W. J., Munteanu L., Laan L., Noetzel T. L., Janson M. J., and Dogterom M., Nature (London) 442, 709 (2006). 10.1038/nature04928 [DOI] [PubMed] [Google Scholar]
- Kolomeisky A. B. and Fisher M. E., Ann. Rev. Phys. Chem. 58, 675 (2007). 10.1146/annurev.physchem.58.032806.104532 [DOI] [PubMed] [Google Scholar]
- Dimitrov A., Quesnolt M., Moutel S., Cantaloube I., Pous C., and Perez F., Science 322, 1353 (2008). 10.1126/science.1165401 [DOI] [PubMed] [Google Scholar]
- Dunn A. R., Chuan P., Bryant Z., and Spudich J. A., Proc. Natl. Acad. Sci. U.S.A. 107, 7746 (2010). 10.1073/pnas.1002430107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tyson J. J. and Novak B., Ann. Rev. Phys. Chem. 61, 219 (2010). 10.1146/annurev.physchem.012809.103457 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Straub J. E. and Thirumalai D., Ann. Rev. Phys. Chem. 62, 437 (2011). 10.1146/annurev-physchem-032210-103526 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung H. S., McHale K., Louis J. M., and Eaton W. A., Science 335, 981 (2012). 10.1126/science.1215768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin Q., Fleming A. M., Burrows C. J., and White H. S., J. Am. Chem. Soc. 134, 11006 (2012). 10.1021/ja304169n [DOI] [PMC free article] [PubMed] [Google Scholar]
- Myong S., Bruno M. M., Pyle A. M., and Ha T., Science 317, 513–516 (2007). 10.1126/science.1144130 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yildiz A., Forkey J. N., McKinney S. A., Ha T., Goldman Y. E., and Selvin P. R., Science 300, 2061 (2003). 10.1126/science.1084398 [DOI] [PubMed] [Google Scholar]
- van Kampen N. G., Stochastic Processes in Chemistry and Physics (North Holland, Amsterdam, 1992). [Google Scholar]
- Redner S., A Guide to First-Passage Processes (Cambridge University Press, Cambridge, 2001). [Google Scholar]
- Kolomeisky A. B., Stukalin E. B., and Popov A. A., Phys. Rev. E 71, 031902 (2005). 10.1103/PhysRevE.71.031902 [DOI] [PubMed] [Google Scholar]
- Valleriani A., Liepelt S., and Lipowsky R., Eur. Phys. Lett. 82, 28011 (2008). 10.1209/0295-5075/82/28011 [DOI] [Google Scholar]
- Keller P. and Valleriani A., J. Chem. Phys. 137, 084106 (2012). 10.1063/1.4747337 [DOI] [PubMed] [Google Scholar]
- Colquhoun D. and Hawkes A. G., Proc. R. Soc. London, Ser. B 211, 205 (1981). 10.1098/rspb.1981.0003 [DOI] [PubMed] [Google Scholar]
- Milescu L. S., Yildiz A., Selvin P. R., and Sachs F., Biophys. J. 91, 1156 (2006). 10.1529/biophysj.105.079541 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liao J.-C., Spudich J. A., Parker D., and Delp S. L., Proc. Natl. Acad. Sci. U.S.A. 104, 3171 (2007). 10.1073/pnas.0611519104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X. and Kolomeisky A. B., J. Chem. Phys. 139, 144106 (2013). 10.1063/1.4824392 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valleriani A. Li X., and Kolomeisky A. B., J. Chem. Phys. 140, 064101 (2014). 10.1063/1.4863997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paxson V., and Floyd S., IEEE/ACM Trans. Netw. 3, 226 (1995). 10.1109/90.392383 [DOI] [Google Scholar]
- Kolomeisky A. B. and Fisher M. E., J. Chem. Phys. 113, 10867 (2000). 10.1063/1.1326912 [DOI] [Google Scholar]
- Masoliver J., Montero M., and Weiss G. H., Phys. Rev. E 67, 021112 (2003). 10.1103/PhysRevE.67.021112 [DOI] [PubMed] [Google Scholar]
- Barabasi A.-L., Nature (London) 435, 207 (2005). 10.1038/nature03459 [DOI] [PubMed] [Google Scholar]